• David Howells's avatar
    splice: Add a func to do a splice from an O_DIRECT file without ITER_PIPE · 33b3b041
    David Howells authored
    Implement a function, direct_file_splice(), that deals with this by using
    an ITER_BVEC iterator instead of an ITER_PIPE iterator as the former won't
    free its buffers when reverted.  The function bulk allocates all the
    buffers it thinks it is going to use in advance, does the read
    synchronously and only then trims the buffer down.  The pages we did use
    get pushed into the pipe.
    
    This fixes a problem with the upcoming iov_iter_extract_pages() function,
    whereby pages extracted from a non-user-backed iterator such as ITER_PIPE
    aren't pinned.  __iomap_dio_rw(), however, calls iov_iter_revert() to
    shorten the iterator to just the bufferage it is going to use - which has
    the side-effect of freeing the excess pipe buffers, even though they're
    attached to a bio and may get written to by DMA (thanks to Hillf Danton for
    spotting this[1]).
    
    This then causes memory corruption that is particularly noticeable when the
    syzbot test[2] is run.  The test boils down to:
    
    	out = creat(argv[1], 0666);
    	ftruncate(out, 0x800);
    	lseek(out, 0x200, SEEK_SET);
    	in = open(argv[1], O_RDONLY | O_DIRECT | O_NOFOLLOW);
    	sendfile(out, in, NULL, 0x1dd00);
    
    run repeatedly in parallel.  What I think is happening is that ftruncate()
    occasionally shortens the DIO read that's about to be made by sendfile's
    splice core by reducing i_size.
    
    This should be more efficient for DIO read by virtue of doing a bulk page
    allocation, but slightly less efficient by ignoring any partial page in the
    pipe.
    
    Reported-by: syzbot+a440341a59e3b7142895@syzkaller.appspotmail.com
    Signed-off-by: default avatarDavid Howells <dhowells@redhat.com>
    Reviewed-by: default avatarJens Axboe <axboe@kernel.dk>
    cc: Christoph Hellwig <hch@lst.de>
    cc: Al Viro <viro@zeniv.linux.org.uk>
    cc: David Hildenbrand <david@redhat.com>
    cc: John Hubbard <jhubbard@nvidia.com>
    cc: linux-mm@kvack.org
    cc: linux-block@vger.kernel.org
    cc: linux-fsdevel@vger.kernel.org
    Link: https://lore.kernel.org/r/20230207094731.1390-1-hdanton@sina.com/ [1]
    Link: https://lore.kernel.org/r/000000000000b0b3c005f3a09383@google.com/ [2]
    Signed-off-by: default avatarSteve French <stfrench@microsoft.com>
    33b3b041
splice.c 40.8 KB