Commits · 3134f37e931d75931bdf6d4eacd82a3fd26eca7c · Kirill Smelkov / linux

29 Jul, 2012 19 commits

vfs: don't let do_last pass negative dentry to audit_inode · 3134f37e

Jeff Layton authored Jul 25, 2012

I can reliably reproduce the following panic by simply setting an audit
rule on a recent 3.5.0+ kernel:

 BUG: unable to handle kernel NULL pointer dereference at 0000000000000040
 IP: [<ffffffff810d1250>] audit_copy_inode+0x10/0x90
 PGD 7acd9067 PUD 7b8fb067 PMD 0
 Oops: 0000 [#86] SMP
 Modules linked in: nfs nfs_acl auth_rpcgss fscache lockd sunrpc tpm_bios btrfs zlib_deflate libcrc32c kvm_amd kvm joydev virtio_net pcspkr i2c_piix4 floppy virtio_balloon microcode virtio_blk cirrus drm_kms_helper ttm drm i2c_core [last unloaded: scsi_wait_scan]
 CPU 0
 Pid: 1286, comm: abrt-dump-oops Tainted: G      D      3.5.0+ #1 Bochs Bochs
 RIP: 0010:[<ffffffff810d1250>]  [<ffffffff810d1250>] audit_copy_inode+0x10/0x90
 RSP: 0018:ffff88007aebfc38  EFLAGS: 00010282
 RAX: 0000000000000000 RBX: ffff88003692d860 RCX: 00000000000038c4
 RDX: 0000000000000000 RSI: ffff88006baf5d80 RDI: ffff88003692d860
 RBP: ffff88007aebfc68 R08: 0000000000000000 R09: 0000000000000000
 R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000000
 R13: ffff880036d30f00 R14: ffff88006baf5d80 R15: ffff88003692d800
 FS:  00007f7562634740(0000) GS:ffff88007fc00000(0000) knlGS:0000000000000000
 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
 CR2: 0000000000000040 CR3: 000000003643d000 CR4: 00000000000006f0
 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
 DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
 Process abrt-dump-oops (pid: 1286, threadinfo ffff88007aebe000, task ffff880079614530)
 Stack:
  ffff88007aebfdf8 ffff88007aebff28 ffff88007aebfc98 ffffffff81211358
  ffff88003692d860 0000000000000000 ffff88007aebfcc8 ffffffff810d4968
  ffff88007aebfcc8 ffff8800000038c4 0000000000000000 0000000000000000
 Call Trace:
  [<ffffffff81211358>] ? ext4_lookup+0xe8/0x160
  [<ffffffff810d4968>] __audit_inode+0x118/0x2d0
  [<ffffffff811955a9>] do_last+0x999/0xe80
  [<ffffffff81191fe8>] ? inode_permission+0x18/0x50
  [<ffffffff81171efa>] ? kmem_cache_alloc_trace+0x11a/0x130
  [<ffffffff81195b4a>] path_openat+0xba/0x420
  [<ffffffff81196111>] do_filp_open+0x41/0xa0
  [<ffffffff811a24bd>] ? alloc_fd+0x4d/0x120
  [<ffffffff811855cd>] do_sys_open+0xed/0x1c0
  [<ffffffff810d40cc>] ? __audit_syscall_entry+0xcc/0x300
  [<ffffffff811856c1>] sys_open+0x21/0x30
  [<ffffffff81611ca9>] system_call_fastpath+0x16/0x1b
  RSP <ffff88007aebfc38>
 CR2: 0000000000000040

The problem is that do_last is passing a negative dentry to audit_inode.
The comments on lookup_open note that it can pass back a negative dentry
if O_CREAT is not set.

This patch fixes the oops, but I'm not clear on whether there's a better
approach.

Cc: Miklos Szeredi <miklos@szeredi.hu>
Signed-off-by: Jeff Layton <jlayton@redhat.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
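
The failure mode described in this message can be illustrated with a small userspace model. This is a sketch, not the actual patch: the struct layouts and all `model_*` names are hypothetical stand-ins for the kernel types. Only the underlying idea comes from the commit message, namely that a negative dentry has no inode attached and so must not reach code that dereferences the inode.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, heavily simplified stand-ins for the kernel structures. */
struct model_inode {
    unsigned long i_ino;
};

struct model_dentry {
    struct model_inode *d_inode;   /* NULL means the dentry is negative */
};

/* A dentry is "negative" when no inode is attached to it. */
static bool model_is_negative(const struct model_dentry *dentry)
{
    return dentry->d_inode == NULL;
}

/* Models a consumer that dereferences the inode unconditionally. */
static unsigned long model_audit_copy_inode(const struct model_inode *inode)
{
    return inode->i_ino;
}

/*
 * Guarded caller: returns the audited inode number, or 0 when the dentry
 * is negative and the audit step is skipped.
 */
unsigned long model_audit_if_positive(const struct model_dentry *dentry)
{
    assert(dentry != NULL);
    if (model_is_negative(dentry))
        return 0;
    return model_audit_copy_inode(dentry->d_inode);
}
```

In this model the guard lives in the caller, so the inode-copying helper never sees a NULL pointer. Whether the real fix belongs in the caller or elsewhere is exactly the open question the message raises.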


brcm80211: pointless current->files passed to filp_close() · 0b5306b3

Al Viro authored Jul 22, 2012

... only needed if it's been in descriptor table
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

sound_firmware: don't pass crap to filp_close() · 58609306

Al Viro authored Jul 22, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

gadgetfs: clean up · 20818a0c

Al Viro authored Jul 22, 2012

sigh...
* opened files have non-NULL dentries and non-NULL inodes
* filp_close() needs current->files only if the file had been
in descriptor table.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


slightly reduce lossage in gdm72xx · 09fada5b

Al Viro authored Jul 22, 2012

* filp_close() needs non-NULL second argument only if it'd been in descriptor
table
* opened files have non-NULL dentries, TYVM
* ... and those dentries are positive - it's kinda hard to open a file that
doesn't exist.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


slightly reduce idiocy in drivers/staging/bcm/Misc.c · 32aecdd3

Al Viro authored Jul 22, 2012

a) vfs_llseek() does *not* access userland pointers of any kind
b) neither does filp_close(), for that matter
c) ... nor filp_open()
d) vfs_read() does, but we do have a wrapper for that (kernel_read()),
so there's no need to reinvent it.
e) passing current->files to filp_close() on something that never
had been in descriptor table is pointless.

ISAGN: voodoo dolls to be used on voodoo programmers...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


consolidate pipe file creation · e4fad8e5

Al Viro authored Jul 21, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

take grabbing f->f_path to do_dentry_open() · b5bcdda3

Al Viro authored Jul 20, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

uninline file_free_rcu() · 5c33b183

Al Viro authored Jul 20, 2012

What inline?  Its only use is passing its address to call_rcu(), for fuck sake!
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


ecryptfs_lookup_interpose(): allocate dentry_info first · 0b1d9011

Al Viro authored Jul 20, 2012

less work on failure that way
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

sanitize ecryptfs_lookup() · bc65a121

Al Viro authored Jul 20, 2012

* ->lookup() never gets hit with . or ..
* dentry it gets is unhashed, so unless we had gone and hashed it ourselves, there's
no need to d_drop() the sucker.
* wrong name printed in one of the printks (NULL, in fact)
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


clean unix_bind() up a bit · faf02010

Al Viro authored Jul 20, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

pull mnt_want_write()/mnt_drop_write() into kern_path_create()/done_path_create() resp. · a8104a9f

Al Viro authored Jul 20, 2012

One side effect - attempt to create a cross-device link on a read-only fs fails
with EROFS instead of EXDEV now. Makes more sense, POSIX allows, etc.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


mknod: take sanity checks on mode into the very beginning · 8e4bfca1

Al Viro authored Jul 20, 2012

Note that applying umask can't affect their results.  While
that affects errno in cases like
	mknod("/no_such_directory/a", 030000)
yielding -EINVAL (due to impossible mode_t) instead of
-ENOENT (due to inexistent directory), IMO that makes a lot
more sense, POSIX allows to return either and any software
that relies on getting -ENOENT instead of -EINVAL in that
case deserves everything it gets.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
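
The errno difference described in this message can be observed from userspace with a small probe. This is a sketch: `probe_mknod_errno` is a hypothetical helper name, and the third argument (`dev`, set to 0 here) is an assumption, since the message shows only the path and mode. Which errno comes back depends on the running kernel (per the message, -EINVAL with this change and -ENOENT before it), so the code reports the value rather than assuming one.

```c
#ifndef _XOPEN_SOURCE
#define _XOPEN_SOURCE 700   /* make mknod() visible under strict -std modes */
#endif

#include <assert.h>
#include <errno.h>
#include <sys/stat.h>
#include <sys/types.h>

/*
 * Attempts the call quoted in the commit message and returns the resulting
 * errno, or 0 if the call unexpectedly succeeds.
 */
int probe_mknod_errno(void)
{
    errno = 0;
    /* 030000 is not a valid file type, and the parent directory is
     * expected not to exist, so two different errors are plausible. */
    if (mknod("/no_such_directory/a", (mode_t)030000, (dev_t)0) == 0)
        return 0;
    assert(errno != 0);   /* a failing call sets errno */
    return errno;
}
```

Comparing the returned value against `EINVAL` and `ENOENT` shows which check the kernel performed first.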


new helper: done_path_create() · 921a1650

Al Viro authored Jul 20, 2012

releases what needs to be released after {kern,user}_path_create()
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


pull unlock+dput() out into do_spu_create() · 25b2692a

Al Viro authored Jul 19, 2012

... and cleaning spufs_create() a bit, while we are at it
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


spufs: pull unlock-and-dput() up into spufs_create() · 1ba44cc9

Al Viro authored Jul 19, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

spufs_create_context(): simplify failure exits · 66ec7b2c

Al Viro authored Jul 19, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

move spu_forget() into spufs_rmdir() · 67cba9fd

Al Viro authored Jul 19, 2012

now that __fput() is *not* done in any callchain containing mmput(),
we can do that...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


22 Jul, 2012 21 commits

ext4: switch EXT4_IOC_RESIZE_FS to mnt_want_write_file() · 8cae6f71

Al Viro authored Jul 19, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

btrfs: switch btrfs_ioctl_balance() to mnt_want_write_file() · 11e62a8f

Al Viro authored Jul 19, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

switch dentry_open() to struct path, make it grab references itself · 765927b2

Al Viro authored Jun 26, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

spufs: shift dget/mntget towards dentry_open() · bf349a44

Al Viro authored Jun 25, 2012

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

zoran: don't bother with struct file * in zoran_map · 3b6456d2

Al Viro authored Jul 14, 2012

all we need it for is file->private_data, which is assign-once, already
assigned by that point and, incidentally, its value is already in use
by zoran ->mmap() anyway. So just store that pointer instead...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


ecryptfs: don't reinvent the wheels, please - use struct completion · 3b8b4871

Al Viro authored Jun 25, 2012

... and keep the sodding requests on stack - they are small enough.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


don't expose I_NEW inodes via dentry->d_inode · 8fc37ec5

Al Viro authored Jul 19, 2012

	d_instantiate(dentry, inode);
	unlock_new_inode(inode);

is a bad idea; do it the other way round...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
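
The ordering recommended in this message can be shown with a small userspace model. This is a sketch under stated assumptions: the `model_*` types and functions and the `MODEL_I_NEW` value are hypothetical stand-ins, not the kernel definitions. The model only captures the invariant the message asks for: an inode should no longer be marked as new by the time it becomes reachable through a dentry.

```c
#include <assert.h>
#include <stddef.h>

#define MODEL_I_NEW 0x1u   /* hypothetical flag value standing in for I_NEW */

struct model_inode {
    unsigned int i_state;
};

struct model_dentry {
    struct model_inode *d_inode;
};

/* Clears the "still being set up" flag, modeling unlock_new_inode(). */
static void model_unlock_new_inode(struct model_inode *inode)
{
    inode->i_state &= ~MODEL_I_NEW;
}

/* Makes the inode visible through the dentry, modeling d_instantiate(). */
static void model_d_instantiate(struct model_dentry *dentry,
                                struct model_inode *inode)
{
    /* Invariant of this model: nothing exposed via a dentry is still new. */
    assert(!(inode->i_state & MODEL_I_NEW));
    dentry->d_inode = inode;
}

/* Recommended order: unlock first, then expose through the dentry. */
void model_publish_inode(struct model_dentry *dentry,
                         struct model_inode *inode)
{
    model_unlock_new_inode(inode);
    model_d_instantiate(dentry, inode);
}
```

Swapping the two calls in `model_publish_inode()` would trip the assertion, which is the model's way of expressing the "bad idea" ordering quoted in the message.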


tidy up namei.c a bit · 32a7991b

Al Viro authored Jul 18, 2012

locking/unlocking for rcu walk taken to a couple of inline helpers
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


unobfuscate follow_up() a bit · 3c0a6163

Al Viro authored Jul 18, 2012

really convoluted test in there has grown up during struct mount
introduction; what it checks is that we'd reached the root of
mount tree.
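
The check described in this message can be modeled in a few lines of userspace C. This is only a sketch under stated assumptions: `struct model_mount` and the `model_*` functions are hypothetical, and the model assumes that the root of a mount tree is the mount whose parent pointer refers to itself.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a mount; the root is its own parent here. */
struct model_mount {
    struct model_mount *mnt_parent;
};

/* True when the mount is the root of its mount tree. */
bool model_is_mount_tree_root(const struct model_mount *mnt)
{
    assert(mnt != NULL && mnt->mnt_parent != NULL);
    return mnt->mnt_parent == mnt;
}

/*
 * Steps one level up the mount tree. Returns false and leaves *mnt
 * unchanged when the root has already been reached.
 */
bool model_follow_up(const struct model_mount **mnt)
{
    if (model_is_mount_tree_root(*mnt))
        return false;
    *mnt = (*mnt)->mnt_parent;
    return true;
}
```

Expressing the root test as a single pointer comparison is what makes the walk easy to read in this model.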


ext3: pass custom EOF to generic_file_llseek_size() · de9b9422

Eric Sandeen authored Apr 30, 2012

Use the new custom EOF argument to generic_file_llseek_size so
that SEEK_END will go to the max hash value for htree dirs
in ext3 rather than to i_size_read()
Signed-off-by: Eric Sandeen <sandeen@redhat.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


ext4: use core vfs llseek code for dir seeks · ec7268ce

Eric Sandeen authored Apr 30, 2012

Use the new functionality in generic_file_llseek_size() to
accept a custom EOF position, and un-cut-and-paste all the
vfs llseek code from ext4.

Also fix up comments on ext4_llseek() to reflect reality.
Signed-off-by: Eric Sandeen <sandeen@redhat.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>


vfs: allow custom EOF in generic_file_llseek code · e8b96eb5

Eric Sandeen authored Apr 30, 2012

For ext3/4 htree directories, using the vfs llseek function with
SEEK_END goes to i_size like for any other file, but in reality
we want the maximum possible hash value.  Recent changes
in ext4 have cut & pasted generic_file_llseek() back into fs/ext4/dir.c,
but replicating this core code seems like a bad idea, especially
since the copy has already diverged from the vfs.

This patch updates generic_file_llseek_size to accept
both a custom maximum offset, and a custom EOF position.  With this
in place, ext4_dir_llseek can pass in the appropriate maximum hash
position for both maxsize and eof, and get what it wants.

As far as I know, this does not fix any bugs - nfs in the kernel
doesn't use SEEK_END, and I don't know of any user who does.  But
some ext4 folks seem keen on doing the right thing here, and I can't
really argue.

(Patch also fixes up some comments slightly)
Signed-off-by: Eric Sandeen <sandeen@redhat.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
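
The idea of separate maximum-offset and EOF arguments can be sketched in userspace C. This is a simplified model, not the kernel's implementation: `model_llseek_size` and its parameter names are hypothetical, only the three classic `whence` values are handled, and arithmetic overflow is ignored for brevity.

```c
#include <assert.h>
#include <errno.h>
#include <stdio.h>   /* SEEK_SET, SEEK_CUR, SEEK_END */

/*
 * Computes a new file position. `maxsize` bounds every result, while `eof`
 * is the position that SEEK_END is measured from; the two may differ.
 * Returns the new position, or a negative errno value on failure.
 */
long long model_llseek_size(long long pos, long long offset, int whence,
                            long long maxsize, long long eof)
{
    long long target;

    assert(maxsize >= 0 && eof >= 0);

    switch (whence) {
    case SEEK_SET:
        target = offset;
        break;
    case SEEK_CUR:
        target = pos + offset;
        break;
    case SEEK_END:
        /* The caller-supplied EOF is used instead of the file size. */
        target = eof + offset;
        break;
    default:
        return -EINVAL;
    }

    if (target < 0 || target > maxsize)
        return -EINVAL;
    return target;
}
```

A directory implementation in this model would pass its maximum hash position for both `maxsize` and `eof`, mirroring what the message describes for ext4_dir_llseek, while a regular file would pass its size as `eof`.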
