Commits · 1ae9bd8b7e4912b238a14adc7c559a7ecbb9c062 · Kirill Smelkov / linux

22 May, 2018 26 commits

proc_lookupfd_common(): don't bother with instantiate unless the file is open · 1ae9bd8b

Al Viro authored May 02, 2018

... and take the "check if file is open, pick ->f_mode" into a helper;
tid_fd_revalidate() can use it.

The next patch will get rid of tid_fd_revalidate() calls in instantiate
callbacks.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

1ae9bd8b

procfs: get rid of ancient BS in pid_revalidate() uses · 1bbc5513

Al Viro authored May 02, 2018

First of all, calling pid_revalidate() in the end of <pid>/* lookups
is *not* about closing any kind of races; that used to be true once
upon a time, but these days those comments are actively misleading.
Especially since pid_revalidate() doesn't even do d_drop() on
failure anymore.  It doesn't matter, anyway, since once
pid_revalidate() starts returning false, ->d_delete() of those
dentries starts saying "don't keep"; they won't get stuck in
dcache any longer than they are pinned.

These calls cannot be just removed, though - the side effect of
pid_revalidate() (updating i_uid/i_gid/etc.) is what we are calling
it for here.

Let's separate the "update ownership" into a new helper (pid_update_inode())
and use it, both in lookups and in pid_revalidate() itself.

The comments in pid_revalidate() are also out of date - they refer to
the time when pid_revalidate() used to call d_drop() directly...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

1bbc5513

cifs_lookup(): switch to d_splice_alias() · 11f17c9b
Al Viro authored May 14, 2018
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
11f17c9b
cifs_lookup(): cifs_get_inode_...() never returns 0 with *inode left NULL · a8b75f66
Al Viro authored May 14, 2018
```
not since 2004...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
a8b75f66
9p: unify paths in v9fs_vfs_lookup() · 500e2ab6
Al Viro authored May 14, 2018
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
500e2ab6
ncp_lookup(): use d_splice_alias() · 1c5fedbb
Al Viro authored May 13, 2018
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
1c5fedbb
hfsplus: switch to d_splice_alias() · 293542d8
Al Viro authored May 03, 2018
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
293542d8

hfs: don't allow mounting over .../rsrc · 0e5c56fd

Al Viro authored Apr 30, 2018

That's one case when unlink() destroys a subtree, thanks to "resource
fork" idiocy.  We might forcibly evict that shit on unlink(2), but
for now let's just disallow overmounting; as it is, anything that
plays games with those would leak mounts.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

0e5c56fd

hfs: use d_splice_alias() · 6b9cceea

Al Viro authored Apr 30, 2018

code is simpler that way
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

6b9cceea

omfs_lookup(): report IO errors, use d_splice_alias() · 18fbbfc2
Al Viro authored Apr 30, 2018
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
18fbbfc2

orangefs_lookup: simplify · 04bb1ba1

Al Viro authored Apr 30, 2018

d_splice_alias() can handle NULL and ERR_PTR() for inode just fine...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

04bb1ba1

openpromfs: switch to d_splice_alias() · 0ed883fd
Al Viro authored May 03, 2018
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
0ed883fd

xfs_vn_lookup: simplify a bit · b113a6d3

Al Viro authored Apr 30, 2018

have all post-xfs_lookup() branches converge on d_splice_alias()

Cc: Darrick J. Wong <darrick.wong@oracle.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

b113a6d3

adfs_lookup: do not fail with ENOENT on negatives, use d_splice_alias() · 9a7dddca
Al Viro authored Apr 30, 2018
```
Cc: Russell King <linux@armlinux.org.uk>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
9a7dddca
adfs_lookup_byname: .. *is* taken care of in fs/namei.c · 686bb96d
Al Viro authored Apr 30, 2018
```
Cc: Russell King <linux@armlinux.org.uk>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
686bb96d
romfs_lookup: switch to d_splice_alias() · 8130c151
Al Viro authored Apr 30, 2018
```
... and hash negative lookups
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
8130c151
qnx6_lookup: switch to d_splice_alias() · c1481700
Al Viro authored Apr 30, 2018
```
... and hash negative lookups
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
c1481700

ubifs_lookup: use d_splice_alias() · 191ac107

Al Viro authored Apr 30, 2018

code is simpler that way
Acked-by: Richard Weinberger <richard@nod.at>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

191ac107

sysv_lookup: use d_splice_alias() · 5bf35449

Al Viro authored Apr 30, 2018

code is simpler that way

Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

5bf35449

qnx4_lookup: use d_splice_alias() · b135dcea

Al Viro authored Apr 30, 2018

code is simpler that way
Acked-by: Anders Larsen <al@alarsen.net>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

b135dcea

minix_lookup: use d_splice_alias() · b0149516
Al Viro authored Apr 30, 2018
```
code is simpler that way
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
b0149516

freevxfs_lookup(): use d_splice_alias() · 72ff0b03

Al Viro authored Apr 30, 2018

code is simpler that way
Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

72ff0b03

cramfs_lookup(): use d_splice_alias() · d023b3a1

Al Viro authored Apr 30, 2018

simpler code that way, actually
Acked-by: Nicolas Pitre <nico@linaro.org>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

d023b3a1

bfs_add_entry: pass name/len as qstr pointer · b455ecd4

Al Viro authored Apr 30, 2018

same story as with bfs_find_entry()

Cc: "Tigran A. Aivazian" <aivazian.tigran@gmail.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

b455ecd4

bfs_find_entry: pass name/len as qstr pointer · 33ebdebe

Al Viro authored Apr 30, 2018

all callers feed something->name/something->len anyway

Cc: "Tigran A. Aivazian" <aivazian.tigran@gmail.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

33ebdebe

bfs_lookup(): use d_splice_alias() · a596a23b

Al Viro authored Apr 30, 2018

code is actually simpler that way.
Acked-by: "Tigran A. Aivazian" <aivazian.tigran@gmail.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

a596a23b

21 May, 2018 11 commits

Merge branch 'work.misc' into work.lookup · 837f3ec6
Al Viro authored May 21, 2018

837f3ec6

aio: fix io_destroy(2) vs. lookup_ioctx() race · baf10564

Al Viro authored May 20, 2018

kill_ioctx() used to have an explicit RCU delay between removing the
reference from ->ioctx_table and percpu_ref_kill() dropping the refcount.
At some point that delay had been removed, on the theory that
percpu_ref_kill() itself contained an RCU delay. Unfortunately, that was
the wrong kind of RCU delay and it didn't care about rcu_read_lock() used
by lookup_ioctx(). As the result, we could get ctx freed right under
lookup_ioctx(). Tejun has fixed that in a6d7cff4 ("fs/aio: Add explicit
RCU grace period when freeing kioctx"); however, that fix is not enough.

Suppose io_destroy() from one thread races with e.g. io_setup() from another;
CPU1 removes the reference from current->mm->ioctx_table[...] just as CPU2
has picked it (under rcu_read_lock()). Then CPU1 proceeds to drop the
refcount, getting it to 0 and triggering a call of free_ioctx_users(),
which proceeds to drop the secondary refcount and once that reaches zero
calls free_ioctx_reqs(). That does
INIT_RCU_WORK(&ctx->free_rwork, free_ioctx);
queue_rcu_work(system_wq, &ctx->free_rwork);
and schedules freeing the whole thing after RCU delay.

In the meanwhile CPU2 has gotten around to percpu_ref_get(), bumping the
refcount from 0 to 1 and returned the reference to io_setup().

Tejun's fix (that queue_rcu_work() in there) guarantees that ctx won't get
freed until after percpu_ref_get(). Sure, we'd increment the counter before
ctx can be freed. Now we are out of rcu_read_lock() and there's nothing to
stop freeing of the whole thing. Unfortunately, CPU2 assumes that since it
has grabbed the reference, ctx is *NOT* going away until it gets around to
dropping that reference.

The fix is obvious - use percpu_ref_tryget_live() and treat failure as miss.
It's not costlier than what we currently do in normal case, it's safe to
call since freeing *is* delayed and it closes the race window - either
lookup_ioctx() comes before percpu_ref_kill() (in which case ctx->users
won't reach 0 until the caller of lookup_ioctx() drops it) or lookup_ioctx()
fails, ctx->users is unaffected and caller of lookup_ioctx() doesn't see
the object in question at all.

Cc: stable@kernel.org
Fixes: a6d7cff4 "fs/aio: Add explicit RCU grace period when freeing kioctx"
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

baf10564

ext2: fix a block leak · 5aa1437d

Al Viro authored May 17, 2018

open file, unlink it, then use ioctl(2) to make it immutable or
append only.  Now close it and watch the blocks *not* freed...

Immutable/append-only checks belong in ->setattr().
Note: the bug is old and backport to anything prior to 737f2e93
("ext2: convert to use the new truncate convention") will need
these checks lifted into ext2_setattr().

Cc: stable@kernel.org
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

5aa1437d

nfsd: vfs_mkdir() might succeed leaving dentry negative unhashed · 3819bb0d

Al Viro authored May 11, 2018

That can (and does, on some filesystems) happen - ->mkdir() (and thus
vfs_mkdir()) can legitimately leave its argument negative and just
unhash it, counting upon the lookup to pick the object we'd created
next time we try to look at that name.

Some vfs_mkdir() callers forget about that possibility...
Acked-by: J. Bruce Fields <bfields@redhat.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

3819bb0d

cachefiles: vfs_mkdir() might succeed leaving dentry negative unhashed · 9c3e9025

Al Viro authored May 10, 2018

That can (and does, on some filesystems) happen - ->mkdir() (and thus
vfs_mkdir()) can legitimately leave its argument negative and just
unhash it, counting upon the lookup to pick the object we'd created
next time we try to look at that name.

Some vfs_mkdir() callers forget about that possibility...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

9c3e9025

unfuck sysfs_mount() · 7b745a4e

Al Viro authored May 14, 2018

new_sb is left uninitialized in case of early failures in kernfs_mount_ns(),
and while IS_ERR(root) is true in all such cases, using IS_ERR(root) || !new_sb
is not a solution - IS_ERR(root) is true in some cases when new_sb is true.

Make sure new_sb is initialized (and matches the reality) in all cases and
fix the condition for dropping kobj reference - we want it done precisely
in those situations where the reference has not been transferred into a new
super_block instance.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

7b745a4e

kernfs: deal with kernfs_fill_super() failures · 82382ace

Al Viro authored Apr 03, 2018

make sure that info->node is initialized early, so that kernfs_kill_sb()
can list_del() it safely.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

82382ace

cramfs: Fix IS_ENABLED typo · 08a8f308

Joe Perches authored May 13, 2018

There's an extra C here...

Fixes: 99c18ce5 ("cramfs: direct memory access support")
Acked-by: Nicolas Pitre <nico@linaro.org>
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

08a8f308

befs_lookup(): use d_splice_alias() · f4e4d434

Al Viro authored Apr 30, 2018

RTFS(Documentation/filesystems/nfs/Exporting) if you try to make
something exportable.

Fixes: ac632f5b "befs: add NFS export support"
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

f4e4d434

affs_lookup: switch to d_splice_alias() · 87fbd639

Al Viro authored May 06, 2018

Making something exportable takes more than providing ->s_export_ops.
In particular, ->lookup() *MUST* use d_splice_alias() instead of
d_add().

Reading Documentation/filesystems/nfs/Exporting would've been a good idea;
as it is, exporting AFFS is badly (and exploitably) broken.

Partially-Fixes: ed4433d7 "fs/affs: make affs exportable"
Acked-by: David Sterba <dsterba@suse.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

87fbd639

affs_lookup(): close a race with affs_remove_link() · 30da870c

Al Viro authored May 06, 2018

we unlock the directory hash too early - if we are looking at secondary
link and primary (in another directory) gets removed just as we unlock,
we could have the old primary moved in place of the secondary, leaving
us to look into freed entry (and leaving our dentry with ->d_fsdata
pointing to a freed entry).

Cc: stable@vger.kernel.org # 2.4.4+
Acked-by: David Sterba <dsterba@suse.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

30da870c

18 May, 2018 2 commits

vfs: namei: use path_equal() in follow_dotdot() · 030c7e0b

Danilo Krummrich authored Apr 23, 2018

Use path_equal() to detect whether we're already in root.
Signed-off-by: Danilo Krummrich <danilokrummrich@dk-develop.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

030c7e0b

fs.h: fix outdated comment about file flags · 75abe329

Li Qiang authored May 17, 2018

The __dentry_open function was removed in
commit <2a027e7a>("fold __dentry_open() into its sole caller").
Signed-off-by: Li Qiang <liq3ea@gmail.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

75abe329

14 May, 2018 1 commit
- __inode_security_revalidate() never gets NULL opt_dentry · e9193288
  Al Viro authored Apr 24, 2018
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```
  e9193288