Commits · a6e3d7dba92e19acffaa36aad962741a762aa8c5 · Kirill Smelkov / linux

An error occurred fetching the project authors.

12 Oct, 2007 4 commits

ocfs2: Write support for inline data · 1afc32b9

Mark Fasheh authored 17 years ago

This fixes up write, truncate, mmap, and RESVSP/UNRESVP to understand inline
inode data.

For the most part, the changes to the core write code can be relied on to do
the heavy lifting. Any code calling ocfs2_write_begin (including shared
writeable mmap) can count on it doing the right thing with respect to
growing inline data to an extent tree.

Size reducing truncates, including UNRESVP can simply zero that portion of
the inode block being removed. Size increasing truncatesm, including RESVP
have to be a little bit smarter and grow the inode to an extent tree if
necessary.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
Reviewed-by: Joel Becker <joel.becker@oracle.com>

1afc32b9

ocfs2: Read support for inline data · 6798d35a

Mark Fasheh authored 17 years ago

This hooks up ocfs2_readpage() to populate a page with data from an inode
block. Direct IO reads from inline data are modified to fall back to
buffered I/O. Appropriate checks are also placed in the extent map code to
avoid reading an extent list when inline data might be stored.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
Reviewed-by: Joel Becker <joel.becker@oracle.com>

6798d35a

ocfs2: Small refactor of truncate zeroing code · 1d410a6e

Mark Fasheh authored 17 years ago

We'll want to reuse most of this when pushing inline data back out to an
extent. Keeping this part as a seperate patch helps to keep the upcoming
changes for write support uncluttered.

The core portion of ocfs2_zero_cluster_pages() responsible for making sure a
page is mapped and properly dirtied is abstracted out into it's own
function, ocfs2_map_and_dirty_page(). Actual functionality doesn't change,
though zeroing becomes optional.

We also turn part of ocfs2_free_write_ctxt() into  a common function for
unlocking and freeing a page array. This operation is very common (and
uniform) for Ocfs2 cluster sizes greater than page size, so it makes sense
to keep the code in one place.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
Reviewed-by: Joel Becker <joel.becker@oracle.com>

1d410a6e

ocfs2: move nonsparse hole-filling into ocfs2_write_begin() · 65ed39d6

Mark Fasheh authored 17 years ago

By doing this, we can remove any higher level logic which has to have
knowledge of btree functionality - any callers of ocfs2_write_begin() can
now expect it to do anything necessary to prepare the inode for new data.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
Reviewed-by: Joel Becker <joel.becker@oracle.com>

65ed39d6

20 Sep, 2007 2 commits

ocfs2: Don't double set write parameters · 5c26a7b7

Mark Fasheh authored 17 years ago

The target page offsets were being incorrectly set a second time in
ocfs2_prepare_page_for_write(), which was causing problems on a 16k page
size kernel. Additionally, ocfs2_write_failure() was incorrectly using those
parameters instead of the parameters for the individual page being cleaned
up.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

5c26a7b7

ocfs2: Fix pos/len passed to ocfs2_write_cluster · db56246c

Mark Fasheh authored 17 years ago

This was broken for file systems whose cluster size is greater than page
size. Pos needs to be incremented as we loop through the descriptors, and
len needs to be capped to the size of a single cluster.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

db56246c

11 Sep, 2007 1 commit

[PATCH] ocfs2: Fix a wrong cluster calculation. · 30b8548f

tao.ma@oracle.com authored 17 years ago

In ocfs2_alloc_write_write_ctxt, the written clusters length is calculated
by the byte length only. This may cause some problems if we start to write
at some position in the end of one cluster and last to a second cluster
while the "len" is smaller than a cluster size. In that case, we have to
write 2 clusters actually.
So we have to take the start position into consideration also.
Signed-off-by: Tao Ma <tao.ma@oracle.com>
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

30b8548f

19 Jul, 2007 1 commit

mm: merge populate and nopage into fault (fixes nonlinear) · 54cb8821

Nick Piggin authored 17 years ago

Nonlinear mappings are (AFAIKS) simply a virtual memory concept that encodes
the virtual address -> file offset differently from linear mappings.

->populate is a layering violation because the filesystem/pagecache code
should need to know anything about the virtual memory mapping.  The hitch here
is that the ->nopage handler didn't pass down enough information (ie.  pgoff).
 But it is more logical to pass pgoff rather than have the ->nopage function
calculate it itself anyway (because that's a similar layering violation).

Having the populate handler install the pte itself is likewise a nasty thing
to be doing.

This patch introduces a new fault handler that replaces ->nopage and
->populate and (later) ->nopfn.  Most of the old mechanism is still in place
so there is a lot of duplication and nice cleanups that can be removed if
everyone switches over.

The rationale for doing this in the first place is that nonlinear mappings are
subject to the pagefault vs invalidate/truncate race too, and it seemed stupid
to duplicate the synchronisation logic rather than just consolidate the two.

After this patch, MAP_NONBLOCK no longer sets up ptes for pages present in
pagecache.  Seems like a fringe functionality anyway.

NOPAGE_REFAULT is removed.  This should be implemented with ->fault, and no
users have hit mainline yet.

[akpm@linux-foundation.org: cleanup]
[randy.dunlap@oracle.com: doc. fixes for readahead]
[akpm@linux-foundation.org: build fix]
Signed-off-by: Nick Piggin <npiggin@suse.de>
Signed-off-by: Randy Dunlap <randy.dunlap@oracle.com>
Cc: Mark Fasheh <mark.fasheh@oracle.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

54cb8821

11 Jul, 2007 9 commits

[PATCH] ocfs2: zero_user_page conversion · 54c57dc3

Eric Sandeen authored 17 years ago

Signed-off-by: Eric Sandeen <sandeen@redhat.com>
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

54c57dc3

ocfs2: Support creation of unwritten extents · 2ae99a60

Mark Fasheh authored 17 years ago

This can now be trivially supported with re-use of our existing extend code.

ocfs2_allocate_unwritten_extents() takes a start offset and a byte length
and iterates over the inode, adding extents (marked as unwritten) until len
is reached. Existing extents are skipped over.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

2ae99a60

ocfs2: support writing of unwritten extents · b27b7cbc

Mark Fasheh authored 17 years ago

Update the write code to detect when the user is asking to write to an
unwritten extent. Like writing to a hole, we must zero the region between
the write and the cluster boundaries. Most of the existing cluster zeroing
logic can be re-used with some additional checks for the unwritten flag on
extent records.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

b27b7cbc

ocfs2: small cleanup of ocfs2_write_begin_nolock() · 0d172baa

Mark Fasheh authored 17 years ago

We can easily seperate out the write descriptor setup and manipulation
into helper functions.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

0d172baa

ocfs2: plug truncate into cached dealloc routines · 59a5e416
Mark Fasheh authored 17 years ago
```
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
```
59a5e416

ocfs2: harden buffer check during mapping of page blocks · bce99768

Mark Fasheh authored 17 years ago

We don't want to submit buffer_new blocks for read i/o. This actually won't
happen right now because those requests during an allocating write are all nicely
aligned. It's probably a good idea to provide an explicit check though.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

bce99768

ocfs2: shared writeable mmap · 7307de80

Mark Fasheh authored 17 years ago

Implement cluster consistent shared writeable mappings using the
->page_mkwrite() callback.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

7307de80

ocfs2: factor out write aops into nolock variants · 607d44aa

Mark Fasheh authored 17 years ago

ocfs2_mkwrite() will want this so that it can add some mmap specific checks
before asking for a write.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

607d44aa

ocfs2: rework ocfs2_buffered_write_cluster() · 3a307ffc

Mark Fasheh authored 17 years ago

Use some ideas from the new-aops patch series and turn
ocfs2_buffered_write_cluster() into a 2 stage operation with the caller
copying data in between. The code now understands multiple cluster writes as
a result of having to deal with a full page write for greater than 4k pages.

This sets us up to easily call into the write path during ->page_mkwrite().
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

3a307ffc

06 Jun, 2007 1 commit

ocfs2: Fix invalid assertion during write on 64k pages · eeb47d12

Mark Fasheh authored 17 years ago

The write path code intends to bug if a math error (or unhandled case)
results in a write outside of the current cluster boundaries. The actual
BUG_ON() statements however are incorrect, leading to a crash on kernels
with 64k page size. Fix those by checking against the right variables.

Also, move the assertions higher up within the functions so that they trip
*before* the code starts to mark buffers.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

eeb47d12

25 May, 2007 2 commits

[PATCH] ocfs2: use zero_user_page · 5c3c6bb7

Nate Diller authored 17 years ago

Use zero_user_page() instead of open-coding it.
Signed-off-by: Nate Diller <nate.diller@gmail.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

5c3c6bb7

ocfs2: trylock in ocfs2_readpage() · e9dfc0b2

Mark Fasheh authored 17 years ago

Similarly to the page lock / cluster lock inversion in ocfs2_readpage, we
can deadlock on ip_alloc_sem. We can down_read_trylock() instead and just
return AOP_TRUNCATED_PAGE if the operation fails.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

e9dfc0b2

02 May, 2007 3 commits

ocfs2: Force use of GFP_NOFS in ocfs2_write() · 9315f130

Mark Fasheh authored 17 years ago

We can otherwise recurse into the file system.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

9315f130

ocfs2: fix sparse warnings in fs/ocfs2 · 1ca1a111

Mark Fasheh authored 17 years ago

None of these are actually harmful, but the noise makes looking for real
problems difficult.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

1ca1a111

[PATCH] fs/ocfs2/: make 3 functions static · 6cb129f5

Adrian Bunk authored 17 years ago

This patch makes the following needlessly global functions static:
- aops.c: ocfs2_write_data_page()
- dlmglue.c: ocfs2_dump_meta_lvb_info()
- file.c: ocfs2_set_inode_size()
Signed-off-by: Adrian Bunk <bunk@stusta.de>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

6cb129f5

26 Apr, 2007 10 commits

ocfs2: Remember rw lock level during direct io · 7cdfc3a1

Mark Fasheh authored 17 years ago

Cluster locking might have been redone because a direct write won't
complete, so this needs to be reflected in the iocb.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

7cdfc3a1

ocfs2: Fix up i_blocks calculation to know about holes · 8110b073

Mark Fasheh authored 17 years ago

Older file systems which didn't support holes did a dumb calculation of
i_blocks based on i_size. This is no longer accurate, so fix things up to
take actual allocation into account.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

8110b073

ocfs2: Fix extent lookup to return true size of holes · 4f902c37

Mark Fasheh authored 17 years ago

Initially, we had wired things to return a size '1' of holes. Cook up a
small amount of code to find the next extent and calculate the number of
clusters between the virtual offset and the next allocated extent.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

4f902c37

ocfs2: Read from an unwritten extent returns zeros · 49cb8d2d

Mark Fasheh authored 17 years ago

Return an optional extent flags field from our lookup functions and wire up
callers to treat unwritten regions as holes for the purpose of returning
zeros to the user.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

49cb8d2d

ocfs2: Use own splice write actor · 6af67d82

Mark Fasheh authored 17 years ago

We need to fill holes during a splice write. Provide our own splice write
actor which can call ocfs2_file_buffered_write() with a splice-specific
callback.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

6af67d82

ocfs2: zero tail of sparse files on truncate · 60b11392

Mark Fasheh authored 18 years ago

Since we don't zero on extend anymore, truncate needs to be fixed up to zero
the part of a file between i_size and and end of it's cluster. Otherwise a
subsequent extend could expose bad data.

This introduced a new helper, which can be used in ocfs2_write().
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

60b11392

ocfs2: Teach ocfs2_get_block() about holes · 25baf2da

Mark Fasheh authored 18 years ago

ocfs2_get_block() didn't understand sparse files, fix that. Also remove some
code that isn't really useful anymore. We can fix up
ocfs2_direct_IO_get_blocks() at the same time.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

25baf2da

ocfs2: remove ocfs2_prepare_write() and ocfs2_commit_write() · 5069120b

Mark Fasheh authored 18 years ago

These are no longer used, and can't handle file systems with sparse file
allocation.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

5069120b

ocfs2: teach ocfs2_file_aio_write() about sparse files · 9517bac6

Mark Fasheh authored 18 years ago

Unfortunately, ocfs2 can no longer make use of generic_file_aio_write_nlock()
because allocating writes will require zeroing of pages adjacent to the I/O
for cluster sizes greater than page size.

Implement a custom file write here, which can order page locks for zeroing.
This also has the advantage that cluster locks can easily be ordered outside
of the page locks.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

9517bac6

ocfs2: temporarily remove extent map caching · 363041a5

Mark Fasheh authored 18 years ago

The code in extent_map.c is not prepared to deal with a subtree being
rotated between lookups. This can happen when filling holes in sparse files.
Instead of a lengthy patch to update the code (which would likely lose the
benefit of caching subtree roots), we remove most of the algorithms and
implement a simple path based lookup. A less ambitious extent caching scheme
will be added in a later patch.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

363041a5

14 Mar, 2007 1 commit

ocfs2: add some missing address space callbacks · 03f981cf

Joel Becker authored 18 years ago

Under load, OCFS2 would crash in invalidate_inode_pages2_range() because
invalidate_complete_page2() was unable to invalidate a page.  It would
appear that JBD is holding on to the page.  ext3 has a specific
->releasepage() handler to cover this case.

Steal ext3's ->releasepage(), ->invalidatepage(), and ->migratepage(), as
they appear completely appropriate for OCFS2.
Signed-off-by: Joel Becker <joel.becker@oracle.com>
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

03f981cf

29 Dec, 2006 1 commit

ocfs2: Allow direct I/O read past end of file · 564f8a32

Mark Fasheh authored 18 years ago

ocfs2_direct_IO_get_blocks() was incorrectly returning -EIO for a direct I/O
read whose start block was past the end of the file allocation tree. Fix
things so that we return a hole instead. do_direct_IO() will then notice
that the range start is past eof and return a short read.

While there, remove the unused vbo_max variable.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

564f8a32

08 Dec, 2006 1 commit

[PATCH] struct path: convert ocfs2 · d28c9174

Josef Sipek authored 18 years ago

Signed-off-by: Josef Sipek <jsipek@fsl.cs.sunysb.edu>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

d28c9174

02 Dec, 2006 4 commits

ocfs2: Remove struct ocfs2_journal_handle in favor of handle_t · 1fabe148

Mark Fasheh authored 18 years ago

This is mostly a search and replace as ocfs2_journal_handle is now no more
than a container for a handle_t pointer.

ocfs2_commit_trans() becomes very straight forward, and we remove some out
of date comments / code.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

1fabe148

ocfs2: remove handle argument to ocfs2_start_trans() · 65eff9cc

Mark Fasheh authored 18 years ago

All callers either pass in NULL directly, or a local variable that is
already set to NULL.

The internals of ocfs2_start_trans() get a nice cleanup as a result.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

65eff9cc

ocfs2: pass ocfs2_super * into ocfs2_commit_trans() · 02dc1af4
Mark Fasheh authored 18 years ago
```
This sets us up to remove handle->journal.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>
```
02dc1af4

ocfs2: remove unused handle argument from ocfs2_meta_lock_full() · 4bcec184

Mark Fasheh authored 18 years ago

Now that this is unused and all callers pass NULL, we can safely remove it.
Signed-off-by: Mark Fasheh <mark.fasheh@oracle.com>

4bcec184