1. 11 Jan, 2024 13 commits
    • Niklas Schnelle's avatar
      s390/pci: fix max size calculation in zpci_memcpy_toio() · 80df7d6a
      Niklas Schnelle authored
      The zpci_get_max_write_size() helper is used to determine the maximum
      size a PCI store or load can use at a given __iomem address.
      
      For the PCI block store the following restrictions apply:
      
      1. The dst + len must not cross a 4K boundary in the (pseudo-)MMIO space
      2. len must not exceed ZPCI_MAX_WRITE_SIZE
      3. len must be a multiple of 8 bytes
      4. The src address must be double word (8 byte) aligned
      5. The dst address must be double word (8 byte) aligned
      
      Otherwise only a normal PCI store which takes its src value from
      a register can be used. For these PCI store restriction 1 still applies.
      Similarly 1 also applies to PCI loads.
      
      It turns out zpci_max_write_size() instead implements stricter
      conditions which prevents PCI block stores from being used where they
      can and should be used. In particular instead of conditions 4 and 5 it
      wrongly enforces both dst and src to be size aligned. This indirectly
      covers condition 1 but also prevents many legal PCI block stores.
      
      On top of the functional shortcomings the zpci_get_max_write_size() is
      misnamed as it is used for both read and write size calculations. Rename
      it to zpci_get_max_io_size() and implement the listed conditions
      explicitly.
      Reviewed-by: default avatarMatthew Rosato <mjrosato@linux.ibm.com>
      Fixes: cd248341 ("s390/pci: base support")
      Signed-off-by: default avatarNiklas Schnelle <schnelle@linux.ibm.com>
      [agordeev@linux.ibm.com replaced spaces with tabs]
      Signed-off-by: default avatarAlexander Gordeev <agordeev@linux.ibm.com>
      80df7d6a
    • Alexander Gordeev's avatar
      s390/kexec: do not automatically select KEXEC option · 0130a0d3
      Alexander Gordeev authored
      Following commit dccf78d3 ("kernel/Kconfig.kexec: drop
      select of KEXEC for CRASH_DUMP") also drop automatic KEXEC
      selection for s390 while set CONFIG_KEXEC=y explicitly for
      defconfig and debug_defconfig targets. zfcpdump_defconfig
      target gets CONFIG_KEXEC unset as result, which is right
      and consistent with CONFIG_KEXEC_FILE besides.
      Acked-by: default avatarHeiko Carstens <hca@linux.ibm.com>
      Signed-off-by: default avatarAlexander Gordeev <agordeev@linux.ibm.com>
      0130a0d3
    • Heiko Carstens's avatar
      s390/compat: change default for CONFIG_COMPAT to "n" · 77ed045e
      Heiko Carstens authored
      31 bit support has been removed from the kernel more than eight years
      ago. The last 31 bit distribution is many years older. There shouldn't be
      any 31 bit code around anymore.
      
      Therefore avoid providing an unused and only partially tested user space
      interface and change the default for CONFIG_COMPAT from "yes" to "no".
      Acked-by: default avatarChristian Borntraeger <borntraeger@linux.ibm.com>
      Signed-off-by: default avatarHeiko Carstens <hca@linux.ibm.com>
      Signed-off-by: default avatarAlexander Gordeev <agordeev@linux.ibm.com>
      77ed045e
    • Linus Torvalds's avatar
      Merge tag 's390-6.8-1' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux · de927f6c
      Linus Torvalds authored
      Pull s390 updates from Alexander Gordeev:
      
       - Add machine variable capacity information to /proc/sysinfo.
      
       - Limit the waste of page tables and always align vmalloc area size and
         base address on segment boundary.
      
       - Fix a memory leak when an attempt to register interruption sub class
         (ISC) for the adjunct-processor (AP) guest failed.
      
       - Reset response code AP_RESPONSE_INVALID_GISA to understandable by
         guest AP_RESPONSE_INVALID_ADDRESS in response to a failed
         interruption sub class (ISC) registration attempt.
      
       - Improve reaction to adjunct-processor (AP)
         AP_RESPONSE_OTHERWISE_CHANGED response code when enabling interrupts
         on behalf of a guest.
      
       - Fix incorrect sysfs 'status' attribute of adjunct-processor (AP)
         queue device bound to the vfio_ap device driver when the mediated
         device is attached to a guest, but the queue device is not passed
         through.
      
       - Rework struct ap_card to hold the whole adjunct-processor (AP) card
         hardware information. As result, all the ugly bit checks are replaced
         by simple evaluations of the required bit fields.
      
       - Improve handling of some weird scenarios between service element (SE)
         host and SE guest with adjunct-processor (AP) pass-through support.
      
       - Change local_ctl_set_bit() and local_ctl_clear_bit() so they return
         the previous value of the to be changed control register. This is
         useful if a bit is only changed temporarily and the previous content
         needs to be restored.
      
       - The kernel starts with machine checks disabled and is expected to
         enable it once trap_init() is called. However the implementation
         allows machine checks early. Consistently enable it in trap_init()
         only.
      
       - local_mcck_disable() and local_mcck_enable() assume that machine
         checks are always enabled. Instead implement and use
         local_mcck_save() and local_mcck_restore() to disable machine checks
         and restore the previous state.
      
       - Modification of floating point control (FPC) register of a traced
         process using ptrace interface may lead to corruption of the FPC
         register of the tracing process. Fix this.
      
       - kvm_arch_vcpu_ioctl_set_fpu() allows to set the floating point
         control (FPC) register in vCPU, but may lead to corruption of the FPC
         register of the host process. Fix this.
      
       - Use READ_ONCE() to read a vCPU floating point register value from the
         memory mapped area. This avoids that, depending on code generation, a
         different value is tested for validity than the one that is used.
      
       - Get rid of test_fp_ctl(), since it is quite subtle to use it
         correctly. Instead copy a new floating point control register value
         into its save area and test the validity of the new value when
         loading it.
      
       - Remove superfluous save_fpu_regs() call.
      
       - Remove s390 support for ARCH_WANTS_DYNAMIC_TASK_STRUCT. All machines
         provide the vector facility since many years and the need to make the
         task structure size dependent on the vector facility does not exist.
      
       - Remove the "novx" kernel command line option, as the vector code runs
         without any problems since many years.
      
       - Add the vector facility to the z13 architecture level set (ALS). All
         hypervisors support the vector facility since many years. This allows
         compile time optimizations of the kernel.
      
       - Get rid of MACHINE_HAS_VX and replace it with cpu_has_vx(). As
         result, the compiled code will have less runtime checks and less
         code.
      
       - Convert pgste_get_lock() and pgste_set_unlock() ASM inlines to C.
      
       - Convert the struct subchannel spinlock from pointer to member.
      
      * tag 's390-6.8-1' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux: (24 commits)
        Revert "s390: update defconfigs"
        s390/cio: make sch->lock spinlock pointer a member
        s390: update defconfigs
        s390/mm: convert pgste locking functions to C
        s390/fpu: get rid of MACHINE_HAS_VX
        s390/als: add vector facility to z13 architecture level set
        s390/fpu: remove "novx" option
        s390/fpu: remove ARCH_WANTS_DYNAMIC_TASK_STRUCT support
        KVM: s390: remove superfluous save_fpu_regs() call
        s390/fpu: get rid of test_fp_ctl()
        KVM: s390: use READ_ONCE() to read fpc register value
        KVM: s390: fix setting of fpc register
        s390/ptrace: handle setting of fpc register correctly
        s390/nmi: implement and use local_mcck_save() / local_mcck_restore()
        s390/nmi: consistently enable machine checks in trap_init()
        s390/ctlreg: return old register contents when changing bits
        s390/ap: handle outband SE bind state change
        s390/ap: store TAPQ hwinfo in struct ap_card
        s390/vfio-ap: fix sysfs status attribute for AP queue devices
        s390/vfio-ap: improve reaction to response code 07 from PQAP(AQIC) command
        ...
      de927f6c
    • Linus Torvalds's avatar
      Merge tag 'asm-generic-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/arnd/asm-generic · c2990100
      Linus Torvalds authored
      Pull asm-generic cleanups from Arnd Bergmann:
       "A series from Baoquan He cleans up the asm-generic/io.h to remove the
        ioremap_uc() definition from everything except x86, which still needs
        it for pre-PAT systems. This series notably contains a patch from
        Jiaxun Yang that converts MIPS to use asm-generic/io.h like every
        other architecture does, enabling future cleanups.
      
        Some of my own patches fix -Wmissing-prototype warnings in
        architecture specific code across several architectures. This is now
        needed as the warning is enabled by default. There are still some
        remaining warnings in minor platforms, but the series should catch
        most of the widely used ones make them more consistent with one
        another.
      
        David McKay fixes a bug in __generic_cmpxchg_local() when this is used
        on 64-bit architectures. This could currently only affect parisc64 and
        sparc64.
      
        Additional cleanups address from Linus Walleij, Uwe Kleine-König,
        Thomas Huth, and Kefeng Wang help reduce unnecessary inconsistencies
        between architectures"
      
      * tag 'asm-generic-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/arnd/asm-generic:
        asm-generic: Fix 32 bit __generic_cmpxchg_local
        Hexagon: Make pfn accessors statics inlines
        ARC: mm: Make virt_to_pfn() a static inline
        mips: remove extraneous asm-generic/iomap.h include
        sparc: Use $(kecho) to announce kernel images being ready
        arm64: vdso32: Define BUILD_VDSO32_64 to correct prototypes
        csky: fix arch_jump_label_transform_static override
        arch: add do_page_fault prototypes
        arch: add missing prepare_ftrace_return() prototypes
        arch: vdso: consolidate gettime prototypes
        arch: include linux/cpu.h for trap_init() prototype
        arch: fix asm-offsets.c building with -Wmissing-prototypes
        arch: consolidate arch_irq_work_raise prototypes
        hexagon: Remove CONFIG_HEXAGON_ARCH_VERSION from uapi header
        asm/io: remove unnecessary xlate_dev_mem_ptr() and unxlate_dev_mem_ptr()
        mips: io: remove duplicated codes
        arch/*/io.h: remove ioremap_uc in some architectures
        mips: add <asm-generic/io.h> including
      c2990100
    • Linus Torvalds's avatar
      Merge tag 'modules-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/mcgrof/linux · 4cd083d5
      Linus Torvalds authored
      Pull module updates from Luis Chamberlain:
       "Just one cleanup and one documentation improvement change. No
        functional changes"
      
      * tag 'modules-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/mcgrof/linux:
        kernel/module: improve documentation for try_module_get()
        module: Remove redundant TASK_UNINTERRUPTIBLE
      4cd083d5
    • Linus Torvalds's avatar
      Merge tag 'sysctl-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/mcgrof/linux · a05aea98
      Linus Torvalds authored
      Pull sysctl updates from Luis Chamberlain:
       "To help make the move of sysctls out of kernel/sysctl.c not incur a
        size penalty sysctl has been changed to allow us to not require the
        sentinel, the final empty element on the sysctl array. Joel Granados
        has been doing all this work.
      
        In the v6.6 kernel we got the major infrastructure changes required to
        support this. For v6.7 we had all arch/ and drivers/ modified to
        remove the sentinel. For v6.8-rc1 we get a few more updates for fs/
        directory only.
      
        The kernel/ directory is left but we'll save that for v6.9-rc1 as
        those patches are still being reviewed. After that we then can expect
        also the removal of the no longer needed check for procname == NULL.
      
        Let us recap the purpose of this work:
      
         - this helps reduce the overall build time size of the kernel and run
           time memory consumed by the kernel by about ~64 bytes per array
      
         - the extra 64-byte penalty is no longer inncurred now when we move
           sysctls out from kernel/sysctl.c to their own files
      
        Thomas Weißschuh also sent a few cleanups, for v6.9-rc1 we expect to
        see further work by Thomas Weißschuh with the constificatin of the
        struct ctl_table.
      
        Due to Joel Granados's work, and to help bring in new blood, I have
        suggested for him to become a maintainer and he's accepted. So for
        v6.9-rc1 I look forward to seeing him sent you a pull request for
        further sysctl changes. This also removes Iurii Zaikin as a maintainer
        as he has moved on to other projects and has had no time to help at
        all"
      
      * tag 'sysctl-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/mcgrof/linux:
        sysctl: remove struct ctl_path
        sysctl: delete unused define SYSCTL_PERM_EMPTY_DIR
        coda: Remove the now superfluous sentinel elements from ctl_table array
        sysctl: Remove the now superfluous sentinel elements from ctl_table array
        fs: Remove the now superfluous sentinel elements from ctl_table array
        cachefiles: Remove the now superfluous sentinel element from ctl_table array
        sysclt: Clarify the results of selftest run
        sysctl: Add a selftest for handling empty dirs
        sysctl: Fix out of bounds access for empty sysctl registers
        MAINTAINERS: Add Joel Granados as co-maintainer for proc sysctl
        MAINTAINERS: remove Iurii Zaikin from proc sysctl
      a05aea98
    • Linus Torvalds's avatar
      Merge tag 'header_cleanup-2024-01-10' of https://evilpiepirate.org/git/bcachefs · 78273df7
      Linus Torvalds authored
      Pull header cleanups from Kent Overstreet:
       "The goal is to get sched.h down to a type only header, so the main
        thing happening in this patchset is splitting out various _types.h
        headers and dependency fixups, as well as moving some things out of
        sched.h to better locations.
      
        This is prep work for the memory allocation profiling patchset which
        adds new sched.h interdepencencies"
      
      * tag 'header_cleanup-2024-01-10' of https://evilpiepirate.org/git/bcachefs: (51 commits)
        Kill sched.h dependency on rcupdate.h
        kill unnecessary thread_info.h include
        Kill unnecessary kernel.h include
        preempt.h: Kill dependency on list.h
        rseq: Split out rseq.h from sched.h
        LoongArch: signal.c: add header file to fix build error
        restart_block: Trim includes
        lockdep: move held_lock to lockdep_types.h
        sem: Split out sem_types.h
        uidgid: Split out uidgid_types.h
        seccomp: Split out seccomp_types.h
        refcount: Split out refcount_types.h
        uapi/linux/resource.h: fix include
        x86/signal: kill dependency on time.h
        syscall_user_dispatch.h: split out *_types.h
        mm_types_task.h: Trim dependencies
        Split out irqflags_types.h
        ipc: Kill bogus dependency on spinlock.h
        shm: Slim down dependencies
        workqueue: Split out workqueue_types.h
        ...
      78273df7
    • Linus Torvalds's avatar
      Merge tag 'bcachefs-2024-01-10' of https://evilpiepirate.org/git/bcachefs · 999a36b5
      Linus Torvalds authored
      Pull bcachefs updates from Kent Overstreet:
      
       - btree write buffer rewrite: instead of adding keys to the btree write
         buffer at transaction commit time, we now journal them with a
         different journal entry type and copy them from the journal to the
         write buffer just prior to journal write.
      
         This reduces the number of atomic operations on shared cachelines in
         the transaction commit path and is a signicant performance
         improvement on some workloads: multithreaded 4k random writes went
         from ~650k iops to ~850k iops.
      
       - Bring back optimistic spinning for six locks: the new implementation
         doesn't use osq locks; instead we add to the lock waitlist as normal,
         and then spin on the lock_acquired bit in the waitlist entry, _not_
         the lock itself.
      
       - New ioctls:
      
          - BCH_IOCTL_DEV_USAGE_V2, which allows for new data types
      
          - BCH_IOCTL_OFFLINE_FSCK, which runs the kernel implementation of
            fsck but without mounting: useful for transparently using the
            kernel version of fsck from 'bcachefs fsck' when the kernel
            version is a better match for the on disk filesystem.
      
          - BCH_IOCTL_ONLINE_FSCK: online fsck. Not all passes are supported
            yet, but the passes that are supported are fully featured - errors
            may be corrected as normal.
      
         The new ioctls use the new 'thread_with_file' abstraction for kicking
         off a kthread that's tied to a file descriptor returned to userspace
         via the ioctl.
      
       - btree_paths within a btree_trans are now dynamically growable,
         instead of being limited to 64. This is important for the
         check_directory_structure phase of fsck, and also fixes some issues
         we were having with btree path overflow in the reflink btree.
      
       - Trigger refactoring; prep work for the upcoming disk space accounting
         rewrite
      
       - Numerous bugfixes :)
      
      * tag 'bcachefs-2024-01-10' of https://evilpiepirate.org/git/bcachefs: (226 commits)
        bcachefs: eytzinger0_find() search should be const
        bcachefs: move "ptrs not changing" optimization to bch2_trigger_extent()
        bcachefs: fix simulateously upgrading & downgrading
        bcachefs: Restart recovery passes more reliably
        bcachefs: bch2_dump_bset() doesn't choke on u64s == 0
        bcachefs: improve checksum error messages
        bcachefs: improve validate_bset_keys()
        bcachefs: print sb magic when relevant
        bcachefs: __bch2_sb_field_to_text()
        bcachefs: %pg is banished
        bcachefs: Improve would_deadlock trace event
        bcachefs: fsck_err()s don't need to manually check c->sb.version anymore
        bcachefs: Upgrades now specify errors to fix, like downgrades
        bcachefs: no thread_with_file in userspace
        bcachefs: Don't autofix errors we can't fix
        bcachefs: add missing bch2_latency_acct() call
        bcachefs: increase max_active on io_complete_wq
        bcachefs: add time_stats for btree_node_read_done()
        bcachefs: don't clear accessed bit in btree node fill
        bcachefs: Add an option to control btree node prefetching
        ...
      999a36b5
    • Linus Torvalds's avatar
      Merge tag 'v6.8-rc-part1-smb-client' of git://git.samba.org/sfrench/cifs-2.6 · 84e9a2d5
      Linus Torvalds authored
      Pull smb client fixes from Steve French:
       "Various smb client fixes, most related to better handling special file
        types:
      
         - Improve handling of special file types:
            - performance improvement (better compounding and better caching
              of readdir entries that are reparse points)
            - extend support for creating special files (sockets, fifos,
              block/char devices)
            - fix renaming and hardlinking of reparse points
            - extend support for creating symlinks with IO_REPARSE_TAG_SYMLINK
      
         - Multichannel logging improvement
      
         - Exception handling fix
      
         - Minor cleanups"
      
      * tag 'v6.8-rc-part1-smb-client' of git://git.samba.org/sfrench/cifs-2.6:
        cifs: update internal module version number for cifs.ko
        cifs: remove unneeded return statement
        cifs: make cifs_chan_update_iface() a void function
        cifs: delete unnecessary NULL checks in cifs_chan_update_iface()
        cifs: get rid of dup length check in parse_reparse_point()
        smb: client: stop revalidating reparse points unnecessarily
        cifs: Pass unbyteswapped eof value into SMB2_set_eof()
        smb3: Improve exception handling in allocate_mr_list()
        cifs: fix in logging in cifs_chan_update_iface
        smb: client: handle special files and symlinks in SMB3 POSIX
        smb: client: cleanup smb2_query_reparse_point()
        smb: client: allow creating symlinks via reparse points
        smb: client: fix hardlinking of reparse points
        smb: client: fix renaming of reparse points
        smb: client: optimise reparse point querying
        smb: client: allow creating special files via reparse points
        smb: client: extend smb2_compound_op() to accept more commands
        smb: client: Fix minor whitespace errors and warnings
      84e9a2d5
    • Linus Torvalds's avatar
      Merge tag 'nfs-for-6.8-1' of git://git.linux-nfs.org/projects/anna/linux-nfs · 587217f9
      Linus Torvalds authored
      Pull nfs client updates from Anna Schumaker:
       "New Features:
         - Always ask for type with READDIR
         - Remove nfs_writepage()
      
        Bugfixes:
         - Fix a suspicious RCU usage warning
         - Fix a blocklayoutdriver reference leak
         - Fix the block driver's calculation of layoutget size
         - Fix handling NFS4ERR_RETURNCONFLICT
         - Fix _xprt_switch_find_current_entry()
         - Fix v4.1 backchannel request timeouts
         - Don't add zero-length pnfs block devices
         - Use the parent cred in nfs_access_login_time()
      
        Cleanups:
         - A few improvements when dealing with referring calls from the
           server
         - Clean up various unused variables, struct fields, and function
           calls
         - Various tracepoint improvements"
      
      * tag 'nfs-for-6.8-1' of git://git.linux-nfs.org/projects/anna/linux-nfs: (21 commits)
        NFSv4.1: Use the nfs_client's rpc timeouts for backchannel
        SUNRPC: Fixup v4.1 backchannel request timeouts
        rpc_pipefs: Replace one label in bl_resolve_deviceid()
        nfs: Remove writepage
        NFS: drop unused nfs_direct_req bytes_left
        pNFS: Fix the pnfs block driver's calculation of layoutget size
        nfs: print fileid in lookup tracepoints
        nfs: rename the nfs_async_rename_done tracepoint
        nfs: add new tracepoint at nfs4 revalidate entry point
        SUNRPC: fix _xprt_switch_find_current_entry logic
        NFSv4.1/pnfs: Ensure we handle the error NFS4ERR_RETURNCONFLICT
        NFSv4.1: if referring calls are complete, trust the stateid argument
        NFSv4: Track the number of referring calls in struct cb_process_state
        NFS: Use parent's objective cred in nfs_access_login_time()
        NFSv4: Always ask for type with READDIR
        pnfs/blocklayout: Don't add zero-length pnfs_block_dev
        blocklayoutdriver: Fix reference leak of pnfs_device_node
        SUNRPC: Fix a suspicious RCU usage warning
        SUNRPC: Create a helper function for accessing the rpc_clnt's xprt_switch
        SUNRPC: Remove unused function rpc_clnt_xprt_switch_put()
        ...
      587217f9
    • Linus Torvalds's avatar
      Merge tag 'ext4_for_linus-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/tytso/ext4 · 0d19d9e1
      Linus Torvalds authored
      Pull ext4 updates from Ted Ts'o:
       "Various ext4 bug fixes and cleanups. The fixes are mostly in the
        fstrim and mballoc code paths.
      
        Also enable dioread_nolock in the case where the block size is less
        than the page size (dioread_nolock has been default in the bs == ps
        case for quite some time)"
      
      * tag 'ext4_for_linus-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/tytso/ext4:
        ext4: fix inconsistent between segment fstrim and full fstrim
        ext4: fallback to complex scan if aligned scan doesn't work
        ext4: convert ext4_da_do_write_end() to take a folio
        ext4: allow for the last group to be marked as trimmed
        ext4: move ext4_check_bdev_write_error() into nojournal mode
        jbd2: abort journal when detecting metadata writeback error of fs dev
        jbd2: remove unused 'JBD2_CHECKPOINT_IO_ERROR' and 'j_atomic_flags'
        jbd2: replace journal state flag by checking errseq
        jbd2: add errseq to detect client fs's bdev writeback error
        ext4: improving calculation of 'fe_{len|start}' in mb_find_extent()
        ext4: clarify handling of unwritten bh in __ext4_block_zero_page_range()
        ext4: treat end of range as exclusive in ext4_zero_range()
        ext4: enable dioread_nolock as default for bs < ps case
        ext4: delete redundant calculations in ext4_mb_get_buddy_page_lock()
        ext4: reduce unnecessary memory allocation in alloc_flex_gd()
        ext4: avoid online resizing failures due to oversized flex bg
        ext4: remove unnecessary check from alloc_flex_gd()
        ext4: unify the type of flexbg_size to unsigned int
      0d19d9e1
    • Linus Torvalds's avatar
      Merge tag 'unicode-next-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/krisman/unicode · 6bd593bc
      Linus Torvalds authored
      Pull unicode updates from Gabriel Krisman Bertazi:
       "Other than the update to MAINTAINERS, this PR has only a fix to stop
        ecryptfs from inadvertently mounting case-insensitive filesystems that
        it cannot handle, which would otherwise caused post-mount failures"
      
      * tag 'unicode-next-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/krisman/unicode:
        MAINTAINERS: update unicode maintainer e-mail address
        ecryptfs: Reject casefold directory inodes
      6bd593bc
  2. 10 Jan, 2024 27 commits
    • David Howells's avatar
      keys, dns: Fix size check of V1 server-list header · acc65769
      David Howells authored
      Fix the size check added to dns_resolver_preparse() for the V1 server-list
      header so that it doesn't give EINVAL if the size supplied is the same as
      the size of the header struct (which should be valid).
      
      This can be tested with:
      
              echo -n -e '\0\0\01\xff\0\0' | keyctl padd dns_resolver desc @p
      
      which will give "add_key: Invalid argument" without this fix.
      
      Fixes: 1997b3cb ("keys, dns: Fix missing size check of V1 server-list header")
      Reported-by: default avatarPengfei Xu <pengfei.xu@intel.com>
      Link: https://lore.kernel.org/r/ZZ4fyY4r3rqgZL+4@xpf.sh.intel.com/Signed-off-by: default avatarDavid Howells <dhowells@redhat.com>
      Signed-off-by: default avatarLinus Torvalds <torvalds@linux-foundation.org>
      acc65769
    • Linus Torvalds's avatar
      Merge tag 'v6.8-p1' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6 · 0cb552aa
      Linus Torvalds authored
      Pull crypto updates from Herbert Xu:
       "API:
         - Add incremental lskcipher/skcipher processing
      
        Algorithms:
         - Remove SHA1 from drbg
         - Remove CFB and OFB
      
        Drivers:
         - Add comp high perf mode configuration in hisilicon/zip
         - Add support for 420xx devices in qat
         - Add IAA Compression Accelerator driver"
      
      * tag 'v6.8-p1' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6: (172 commits)
        crypto: iaa - Account for cpu-less numa nodes
        crypto: scomp - fix req->dst buffer overflow
        crypto: sahara - add support for crypto_engine
        crypto: sahara - remove error message for bad aes request size
        crypto: sahara - remove unnecessary NULL assignments
        crypto: sahara - remove 'active' flag from sahara_aes_reqctx struct
        crypto: sahara - use dev_err_probe()
        crypto: sahara - use devm_clk_get_enabled()
        crypto: sahara - use BIT() macro
        crypto: sahara - clean up macro indentation
        crypto: sahara - do not resize req->src when doing hash operations
        crypto: sahara - fix processing hash requests with req->nbytes < sg->length
        crypto: sahara - improve error handling in sahara_sha_process()
        crypto: sahara - fix wait_for_completion_timeout() error handling
        crypto: sahara - fix ahash reqsize
        crypto: sahara - handle zero-length aes requests
        crypto: skcipher - remove excess kerneldoc members
        crypto: shash - remove excess kerneldoc members
        crypto: qat - generate dynamically arbiter mappings
        crypto: qat - add support for ring pair level telemetry
        ...
      0cb552aa
    • Linus Torvalds's avatar
      Merge tag 'tpmdd-v6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/jarkko/linux-tpmdd · 6434eade
      Linus Torvalds authored
      Pull tpm updates from Jarkko Sakkinen:
       "Just a couple fixes and no new features"
      
      * tag 'tpmdd-v6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/jarkko/linux-tpmdd:
        tpm: cr50: fix kernel-doc warning and spelling
        tpm: nuvoton: Use i2c_get_match_data()
      6434eade
    • Linus Torvalds's avatar
      Merge tag 'hardening-v6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux · 120a201b
      Linus Torvalds authored
      Pull hardening updates from Kees Cook:
      
       - Introduce the param_unknown_fn type and other clean ups (Andy
         Shevchenko)
      
       - Various __counted_by annotations (Christophe JAILLET, Gustavo A. R.
         Silva, Kees Cook)
      
       - Add KFENCE test to LKDTM (Stephen Boyd)
      
       - Various strncpy() refactorings (Justin Stitt)
      
       - Fix qnx4 to avoid writing into the smaller of two overlapping buffers
      
       - Various strlcpy() refactorings
      
      * tag 'hardening-v6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux:
        qnx4: Use get_directory_fname() in qnx4_match()
        qnx4: Extract dir entry filename processing into helper
        atags_proc: Add __counted_by for struct buffer and use struct_size()
        tracing/uprobe: Replace strlcpy() with strscpy()
        params: Fix multi-line comment style
        params: Sort headers
        params: Use size_add() for kmalloc()
        params: Do not go over the limit when getting the string length
        params: Introduce the param_unknown_fn type
        lkdtm: Add kfence read after free crash type
        nvme-fc: replace deprecated strncpy with strscpy
        nvdimm/btt: replace deprecated strncpy with strscpy
        nvme-fabrics: replace deprecated strncpy with strscpy
        drm/modes: replace deprecated strncpy with strscpy_pad
        afs: Add __counted_by for struct afs_acl and use struct_size()
        VMCI: Annotate struct vmci_handle_arr with __counted_by
        i40e: Annotate struct i40e_qvlist_info with __counted_by
        HID: uhid: replace deprecated strncpy with strscpy
        samples: Replace strlcpy() with strscpy()
        SUNRPC: Replace strlcpy() with strscpy()
      120a201b
    • Ye Bin's avatar
      ext4: fix inconsistent between segment fstrim and full fstrim · 68da4c44
      Ye Bin authored
      Suppose we issue two FITRIM ioctls for ranges [0,15] and [16,31] with
      mininum length of trimmed range set to 8 blocks. If we have say a range of
      blocks 10-22 free, this range will not be trimmed because it straddles the
      boundary of the two FITRIM ranges and neither part is big enough. This is a
      bit surprising to some users that call FITRIM on smaller ranges of blocks
      to limit impact on the system. Also XFS trims all free space extents that
      overlap with the specified range so we are inconsistent among filesystems.
      Let's change ext4_try_to_trim_range() to consider for trimming the whole
      free space extent that straddles the end of specified range, not just the
      part of it within the range.
      Signed-off-by: default avatarYe Bin <yebin10@huawei.com>
      Reviewed-by: default avatarJan Kara <jack@suse.cz>
      Link: https://lore.kernel.org/r/20231216010919.1995851-1-yebin10@huawei.comSigned-off-by: default avatarTheodore Ts'o <tytso@mit.edu>
      68da4c44
    • Ojaswin Mujoo's avatar
      ext4: fallback to complex scan if aligned scan doesn't work · 1f6bc02f
      Ojaswin Mujoo authored
      Currently in case the goal length is a multiple of stripe size we use
      ext4_mb_scan_aligned() to find the stripe size aligned physical blocks.
      In case we are not able to find any, we again go back to calling
      ext4_mb_choose_next_group() to search for a different suitable block
      group. However, since the linear search always begins from the start,
      most of the times we end up with the same BG and the cycle continues.
      
      With large fliesystems, the CPU can be stuck in this loop for hours
      which can slow down the whole system. Hence, until we figure out a
      better way to continue the search (rather than starting from beginning)
      in ext4_mb_choose_next_group(), lets just fallback to
      ext4_mb_complex_scan_group() in case aligned scan fails, as it is much
      more likely to find the needed blocks.
      Signed-off-by: default avatarOjaswin Mujoo <ojaswin@linux.ibm.com>
      Reviewed-by: default avatarJan Kara <jack@suse.cz>
      Link: https://lore.kernel.org/r/ee033f6dfa0a7f2934437008a909c3788233950f.1702455010.git.ojaswin@linux.ibm.comSigned-off-by: default avatarTheodore Ts'o <tytso@mit.edu>
      1f6bc02f
    • Matthew Wilcox (Oracle)'s avatar
      ext4: convert ext4_da_do_write_end() to take a folio · 4d5cdd75
      Matthew Wilcox (Oracle) authored
      There's nothing page-specific happening in ext4_da_do_write_end();
      it's merely used for its refcount & lock, both of which are folio
      properties.  Saves four calls to compound_head().
      Signed-off-by: default avatarMatthew Wilcox (Oracle) <willy@infradead.org>
      Reviewed-by: default avatarJan Kara <jack@suse.cz>
      Link: https://lore.kernel.org/r/20231214053035.1018876-1-willy@infradead.orgSigned-off-by: default avatarTheodore Ts'o <tytso@mit.edu>
      4d5cdd75
    • Suraj Jitindar Singh's avatar
      ext4: allow for the last group to be marked as trimmed · 7c784d62
      Suraj Jitindar Singh authored
      The ext4 filesystem tracks the trim status of blocks at the group
      level.  When an entire group has been trimmed then it is marked as
      such and subsequent trim invocations with the same minimum trim size
      will not be attempted on that group unless it is marked as able to be
      trimmed again such as when a block is freed.
      
      Currently the last group can't be marked as trimmed due to incorrect
      logic in ext4_last_grp_cluster(). ext4_last_grp_cluster() is supposed
      to return the zero based index of the last cluster in a group. This is
      then used by ext4_try_to_trim_range() to determine if the trim
      operation spans the entire group and as such if the trim status of the
      group should be recorded.
      
      ext4_last_grp_cluster() takes a 0 based group index, thus the valid
      values for grp are 0..(ext4_get_groups_count - 1). Any group index
      less than (ext4_get_groups_count - 1) is not the last group and must
      have EXT4_CLUSTERS_PER_GROUP(sb) clusters. For the last group we need
      to calculate the number of clusters based on the number of blocks in
      the group. Finally subtract 1 from the number of clusters as zero
      based indexing is expected.  Rearrange the function slightly to make
      it clear what we are calculating and returning.
      
      Reproducer:
      // Create file system where the last group has fewer blocks than
      // blocks per group
      $ mkfs.ext4 -b 4096 -g 8192 /dev/nvme0n1 8191
      $ mount /dev/nvme0n1 /mnt
      
      Before Patch:
      $ fstrim -v /mnt
      /mnt: 25.9 MiB (27156480 bytes) trimmed
      // Group not marked as trimmed so second invocation still discards blocks
      $ fstrim -v /mnt
      /mnt: 25.9 MiB (27156480 bytes) trimmed
      
      After Patch:
      fstrim -v /mnt
      /mnt: 25.9 MiB (27156480 bytes) trimmed
      // Group marked as trimmed so second invocation DOESN'T discard any blocks
      fstrim -v /mnt
      /mnt: 0 B (0 bytes) trimmed
      
      Fixes: 45e4ab32 ("ext4: move setting of trimmed bit into ext4_try_to_trim_range()")
      Cc:  <stable@vger.kernel.org> # 4.19+
      Signed-off-by: default avatarSuraj Jitindar Singh <surajjs@amazon.com>
      Reviewed-by: default avatarJan Kara <jack@suse.cz>
      Link: https://lore.kernel.org/r/20231213051635.37731-1-surajjs@amazon.comSigned-off-by: default avatarTheodore Ts'o <tytso@mit.edu>
      7c784d62
    • Linus Torvalds's avatar
      Merge tag 'pstore-v6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux · 72116efd
      Linus Torvalds authored
      Pull pstore updates from Kees Cook:
      
       - Do not allow misconfigured ECC sizes (Sergey Shtylyov)
      
       - Allow for odd number of CPUs (Weichen Chen)
      
       - Refactor error handling to use cleanup.h
      
      * tag 'pstore-v6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux:
        pstore: inode: Use cleanup.h for struct pstore_private
        pstore: inode: Use __free(pstore_iput) for inode allocations
        pstore: inode: Convert mutex usage to guard(mutex)
        pstore: inode: Convert kfree() usage to __free(kfree)
        pstore: ram_core: fix possible overflow in persistent_ram_init_ecc()
        pstore/ram: Fix crash when setting number of cpus to an odd number
      72116efd
    • Linus Torvalds's avatar
      Merge tag 'ovl-update-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/overlayfs/vfs · 4d925f60
      Linus Torvalds authored
      Pull overlayfs updates from Amir Goldstein:
       "This is a very small update with no bug fixes and no new features.
      
        The larger update of overlayfs for this cycle, the re-factoring of
        overlayfs code into generic backing_file helpers, was already merged
        via Christian.
      
        Summary:
      
         - Simplify/clarify some code
      
           No bug fixes here, just some changes following questions from Al
           about overlayfs code that could be a little more simple to follow.
      
         - Overlayfs documentation style fixes
      
           Mainly fixes for ReST formatting suggested by documentation
           developers"
      
      * tag 'ovl-update-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/overlayfs/vfs:
        overlayfs.rst: fix ReST formatting
        overlayfs.rst: use consistent feature names
        ovl: initialize ovl_copy_up_ctx.destname inside ovl_do_copy_up()
        ovl: remove redundant ofs->indexdir member
      4d925f60
    • Linus Torvalds's avatar
      Merge tag 'erofs-for-6.8-rc1-2' of git://git.kernel.org/pub/scm/linux/kernel/git/xiang/erofs · 0507d252
      Linus Torvalds authored
      Pull erofs updates from Gao Xiang:
       "In this cycle, we'd like to enable basic sub-page compressed data
        support for Android ecosystem (for vendors to try out 16k page size
        with 4k-block images in their compatibility mode) as well as container
        images (so that 4k-block images can be parsed on arm64 cloud servers
        using 64k page size.)
      
        In addition, there are several bugfixes and cleanups as usual. All
        commits have been in -next for a while and no potential merge conflict
        is observed.
      
        Summary:
      
         - Add basic sub-page compressed data support
      
         - Fix a memory leak on MicroLZMA and DEFLATE compression
      
         - Fix a rare LZ4 inplace decompression issue on recent x86 CPUs
      
         - Fix a KASAN issue reported by syzbot around crafted images
      
         - Some cleanups"
      
      * tag 'erofs-for-6.8-rc1-2' of git://git.kernel.org/pub/scm/linux/kernel/git/xiang/erofs:
        erofs: make erofs_{err,info}() support NULL sb parameter
        erofs: avoid debugging output for (de)compressed data
        erofs: allow partially filled compressed bvecs
        erofs: enable sub-page compressed block support
        erofs: refine z_erofs_transform_plain() for sub-page block support
        erofs: fix ztailpacking for subpage compressed blocks
        erofs: fix up compacted indexes for block size < 4096
        erofs: record `pclustersize` in bytes instead of pages
        erofs: support I/O submission for sub-page compressed blocks
        erofs: fix lz4 inplace decompression
        erofs: fix memory leak on short-lived bounced pages
      0507d252
    • Linus Torvalds's avatar
      Merge tag 'fscrypt-for-linus' of git://git.kernel.org/pub/scm/fs/fscrypt/linux · 17b9e388
      Linus Torvalds authored
      Pull fscrypt updates from Eric Biggers:
       "Adjust the timing of the fscrypt keyring destruction, to prepare for
        btrfs's fscrypt support.
      
        Also document that CephFS supports fscrypt now"
      
      * tag 'fscrypt-for-linus' of git://git.kernel.org/pub/scm/fs/fscrypt/linux:
        fs: move fscrypt keyring destruction to after ->put_super
        f2fs: move release of block devices to after kill_block_super()
        fscrypt: document that CephFS supports fscrypt now
        fscrypt: update comment for do_remove_key()
        fscrypt.rst: update definition of struct fscrypt_context_v2
      17b9e388
    • Linus Torvalds's avatar
      Merge tag 'nfsd-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/cel/linux · 49f48103
      Linus Torvalds authored
      Pull nfsd updates from Chuck Lever:
       "The bulk of the patches for this release are clean-ups and minor bug
        fixes.
      
        There is one significant revert to mention: support for RDMA Read
        operations in the server's RPC-over-RDMA transport implementation has
        been fixed so it waits for Read completion in a way that avoids tying
        up an nfsd thread. This prevents a possible DoS vector if an
        RPC-over-RDMA client should become unresponsive during RDMA Read
        operations.
      
        As always I am grateful to NFSD contributors, reviewers, and testers"
      
      * tag 'nfsd-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/cel/linux: (56 commits)
        nfsd: rename nfsd_last_thread() to nfsd_destroy_serv()
        SUNRPC: discard sv_refcnt, and svc_get/svc_put
        svc: don't hold reference for poolstats, only mutex.
        SUNRPC: remove printk when back channel request not found
        svcrdma: Implement multi-stage Read completion again
        svcrdma: Copy construction of svc_rqst::rq_arg to rdma_read_complete()
        svcrdma: Add back svcxprt_rdma::sc_read_complete_q
        svcrdma: Add back svc_rdma_recv_ctxt::rc_pages
        svcrdma: Clean up comment in svc_rdma_accept()
        svcrdma: Remove queue-shortening warnings
        svcrdma: Remove pointer addresses shown in dprintk()
        svcrdma: Optimize svc_rdma_cc_init()
        svcrdma: De-duplicate completion ID initialization helpers
        svcrdma: Move the svc_rdma_cc_init() call
        svcrdma: Remove struct svc_rdma_read_info
        svcrdma: Update the synopsis of svc_rdma_read_special()
        svcrdma: Update the synopsis of svc_rdma_read_call_chunk()
        svcrdma: Update synopsis of svc_rdma_read_multiple_chunks()
        svcrdma: Update synopsis of svc_rdma_copy_inline_range()
        svcrdma: Update the synopsis of svc_rdma_read_data_item()
        ...
      49f48103
    • Linus Torvalds's avatar
      Merge tag 'dlm-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/teigland/linux-dlm · d8c8e595
      Linus Torvalds authored
      Pull dlm updates from David Teigland:
       "This set cleans up the interface between nfs lockd and dlm, which is
        handling nfs file locking for gfs2 and ocfs2. Very basic lockd
        functionality is fixed, in which the fl owner was using the lockd pid
        instead of the owner value from nfs"
      
      * tag 'dlm-6.8' of git://git.kernel.org/pub/scm/linux/kernel/git/teigland/linux-dlm:
        dlm: update format header reflect current format
        dlm: fix format seq ops type 4
        dlm: implement EXPORT_OP_ASYNC_LOCK
        dlm: use FL_SLEEP to determine blocking vs non-blocking
        dlm: use fl_owner from lockd
        dlm: use kernel_connect() and kernel_bind()
      d8c8e595
    • Linus Torvalds's avatar
      Merge tag 'afs-fix-rotation-20240105' of... · 0c59ae12
      Linus Torvalds authored
      Merge tag 'afs-fix-rotation-20240105' of git://git.kernel.org/pub/scm/linux/kernel/git/dhowells/linux-fs
      
      Pull afs updates from David Howells:
       "The majority of the patches are aimed at fixing and improving the AFS
        filesystem's rotation over server IP addresses, but there are also
        some fixes from Oleg Nesterov for the use of read_seqbegin_or_lock().
      
         - Fix fileserver probe handling so that the next round of probes
           doesn't break ongoing server/address rotation by clearing all the
           probe result tracking. This could occasionally cause the rotation
           algorithm to drop straight through, give a 'successful' result
           without actually emitting any RPC calls, leaving the reply buffer
           in an undefined state.
      
           Instead, detach the probe results into a separate struct and
           allocate a new one each time we start probing and update the
           pointer to it. Probes are also sent in order of address preference
           to try and improve the chance that the preferred one will complete
           first.
      
         - Fix server rotation so that it uses configurable address
           preferences across on the probes that have completed so far than
           ranking them by RTT as the latter doesn't necessarily give the best
           route. The preference list can be altered by writing into
           /proc/net/afs/addr_prefs.
      
         - Fix the handling of Read-Only (and Backup) volume callbacks as
           there is one per volume, not one per file, so if someone performs a
           command that, say, offlines the volume but doesn't change it, when
           it comes back online we don't spam the server with a status fetch
           for every vnode we're using. Instead, check the Creation timestamp
           in the VolSync record when prompted by a callback break.
      
         - Handle volume regression (ie. a RW volume being restored from a
           backup) by scrubbing all cache data for that volume. This is
           detected from the VolSync creation timestamp.
      
         - Adjust abort handling and abort -> error mapping to match better
           with what other AFS clients do.
      
         - Fix offline and busy volume state handling as they only apply to
           individual server instances and not entire volumes and the rotation
           algorithm should go and look at other servers if available. Also
           make it sleep briefly before each retry if all the volume instances
           are unavailable"
      
      * tag 'afs-fix-rotation-20240105' of git://git.kernel.org/pub/scm/linux/kernel/git/dhowells/linux-fs: (40 commits)
        afs: trace: Log afs_make_call(), including server address
        afs: Fix offline and busy message emission
        afs: Fix fileserver rotation
        afs: Overhaul invalidation handling to better support RO volumes
        afs: Parse the VolSync record in the reply of a number of RPC ops
        afs: Don't leave DONTUSE/NEWREPSITE servers out of server list
        afs: Fix comment in afs_do_lookup()
        afs: Apply server breaks to mmap'd files in the call processor
        afs: Move the vnode/volume validity checking code into its own file
        afs: Defer volume record destruction to a workqueue
        afs: Make it possible to find the volumes that are using a server
        afs: Combine the endpoint state bools into a bitmask
        afs: Keep a record of the current fileserver endpoint state
        afs: Dispatch vlserver probes in priority order
        afs: Dispatch fileserver probes in priority order
        afs: Mark address lists with configured priorities
        afs: Provide a way to configure address priorities
        afs: Remove the unimplemented afs_cmp_addr_list()
        afs: Add some more info to /proc/net/afs/servers
        rxrpc: Create a procfile to display outstanding client conn bundles
        ...
      0c59ae12
    • Linus Torvalds's avatar
      Merge tag 'jfs-6.8' of github.com:kleikamp/linux-shaggy · 032500ab
      Linus Torvalds authored
      Pull jfs updates from David Kleikamp:
       "Stability improvements"
      
      * tag 'jfs-6.8' of github.com:kleikamp/linux-shaggy:
        jfs: Add missing set_freezable() for freezable kthread
        jfs: fix array-index-out-of-bounds in diNewExt
        jfs: fix shift-out-of-bounds in dbJoin
        jfs: fix uaf in jfs_evict_inode
        jfs: fix array-index-out-of-bounds in dbAdjTree
        jfs: fix slab-out-of-bounds Read in dtSearch
        UBSAN: array-index-out-of-bounds in dtSplitRoot
        FS:JFS:UBSAN:array-index-out-of-bounds in dbAdjTree
      032500ab
    • Linus Torvalds's avatar
      Merge tag 'gfs2-v6.7-rc1-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/gfs2/linux-gfs2 · bfed9a92
      Linus Torvalds authored
      Pull gfs2 updates from Andreas Gruenbacher:
      
       - Add support for non-blocking lookup (MAY_NOT_BLOCK / LOOKUP_RCU)
      
       - Various minor fixes and cleanups
      
      * tag 'gfs2-v6.7-rc1-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/gfs2/linux-gfs2:
        gfs2: Fix freeze consistency check in log_write_header
        gfs2: Refcounting fix in gfs2_thaw_super
        gfs2: Minor gfs2_{freeze,thaw}_super cleanup
        gfs2: Use wait_event_freezable_timeout() for freezable kthread
        gfs2: Add missing set_freezable() for freezable kthread
        gfs2: Remove use of error flag in journal reads
        gfs2: Lift withdraw check out of gfs2_ail1_empty
        gfs2: Rename gfs2_withdrawn to gfs2_withdrawing_or_withdrawn
        gfs2: Mark withdraws as unlikely
        gfs2: Minor gfs2_ail1_empty cleanup
        gfs2: use is_subdir()
        gfs2: d_obtain_alias(ERR_PTR(...)) will do the right thing
        gfs2: Use GL_NOBLOCK flag for non-blocking lookups
        gfs2: Add GL_NOBLOCK flag
        gfs2: rgrp: fix kernel-doc warnings
        gfs2: fix kernel BUG in gfs2_quota_cleanup
        gfs2: Fix inode_go_instantiate description
        gfs2: Fix kernel NULL pointer dereference in gfs2_rgrp_dump
      bfed9a92
    • Linus Torvalds's avatar
      Merge tag 'for-6.8-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux · affc5af3
      Linus Torvalds authored
      Pull btrfs updates from David Sterba:
       "There are no exciting changes for users, it's been mostly API
        conversions and some fixes or refactoring.
      
        The mount API conversion is a base for future improvements that would
        come with VFS. Metadata processing has been converted to folios, not
        yet enabling the large folios but it's one patch away once everything
        gets tested enough.
      
        Core changes:
      
         - convert extent buffers to folios:
            - direct API conversion where possible
            - performance can drop by a few percent on metadata heavy
              workloads, the folio sizes are not constant and the calculations
              add up in the item helpers
            - both regular and subpage modes
            - data cannot be converted yet, we need to port that to iomap and
              there are some other generic changes required
      
         - convert mount to the new API, should not be user visible:
            - options deprecated long time ago have been removed: inode_cache,
              recovery
            - the new logic that splits mount to two phases slightly changes
              timing of device scanning for multi-device filesystems
            - LSM options will now work (like for selinux)
      
         - convert delayed nodes radix tree to xarray, preserving the
           preload-like logic that still allows to allocate with GFP_NOFS
      
         - more validation of sysfs value of scrub_speed_max
      
         - refactor chunk map structure, reduce size and improve performance
      
         - extent map refactoring, smaller data structures, improved
           performance
      
         - reduce size of struct extent_io_tree, embedded in several
           structures
      
         - temporary pages used for compression are cached and attached to a
           shrinker, this may slightly improve performance
      
         - in zoned mode, remove redirty extent buffer tracking, zeros are
           written in case an out-of-order is detected and proper data are
           written to the actual write pointer
      
         - cleanups, refactoring, error message improvements, updated tests
      
         - verify and update branch name or tag
      
         - remove unwanted text"
      
      * tag 'for-6.8-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux: (89 commits)
        btrfs: pass btrfs_io_geometry into btrfs_max_io_len
        btrfs: pass struct btrfs_io_geometry to set_io_stripe
        btrfs: open code set_io_stripe for RAID56
        btrfs: change block mapping to switch/case in btrfs_map_block
        btrfs: factor out block mapping for single profiles
        btrfs: factor out block mapping for RAID5/6
        btrfs: reduce scope of data_stripes in btrfs_map_block
        btrfs: factor out block mapping for RAID10
        btrfs: factor out block mapping for DUP profiles
        btrfs: factor out RAID1 block mapping
        btrfs: factor out block-mapping for RAID0
        btrfs: re-introduce struct btrfs_io_geometry
        btrfs: factor out helper for single device IO check
        btrfs: migrate btrfs_repair_io_failure() to folio interfaces
        btrfs: migrate eb_bitmap_offset() to folio interfaces
        btrfs: migrate various end io functions to folios
        btrfs: migrate subpage code to folio interfaces
        btrfs: migrate get_eb_page_index() and get_eb_offset_in_page() to folios
        btrfs: don't double put our subpage reference in alloc_extent_buffer
        btrfs: cleanup metadata page pointer usage
        ...
      affc5af3
    • Linus Torvalds's avatar
      Merge tag 'xfs-6.8-merge-3' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux · 12958e9c
      Linus Torvalds authored
      Pull xfs updates from Chandan Babu:
       "New features/functionality:
         - Online repair:
             - Reserve disk space for online repairs
             - Fix misinteraction between the AIL and btree bulkloader because
               of which the bulk load fails to queue a buffer for writeback if
               it happens to be on the AIL list
             - Prevent transaction reservation overflows when reaping blocks
               during online repair
             - Whenever possible, bulkloader now copies multiple records into
               a block
             - Support repairing of
                 1. Per-AG free space, inode and refcount btrees
                 2. Ondisk inodes
                 3. File data and attribute fork mappings
             - Verify the contents of
                 1. Inode and data fork of realtime bitmap file
                 2. Quota files
         - Introduce MF_MEM_PRE_REMOVE. This will be used to notify tasks
           about a pmem device being removed
      
        Bug fixes:
         - Fix memory leak of recovered attri intent items
         - Fix UAF during log intent recovery
         - Fix realtime geometry integer overflows
         - Prevent scrub from live locking in xchk_iget
         - Prevent fs shutdown when removing files during low free disk space
         - Prevent transaction reservation overflow when extending an RT
           device
         - Prevent incorrect warning from being printed when extending a
           filesystem
         - Fix an off-by-one error in xreap_agextent_binval
         - Serialize access to perag radix tree during deletion operation
         - Fix perag memory leak during growfs
         - Allow allocation of minlen realtime extent when the maximum sized
           realtime free extent is minlen in size
      
        Cleanups:
         - Remove duplicate boilerplate code spread across functionality
           associated with different log items
         - Cleanup resblks interfaces
         - Pass defer ops pointer to defer helpers instead of an enum
         - Initialize di_crc in xfs_log_dinode to prevent KMSAN warnings
         - Use static_assert() instead of BUILD_BUG_ON_MSG() to validate size
           of structures and structure member offsets. This is done in order
           to be able to share the code with userspace
         - Move XFS documentation under a new directory specific to XFS
         - Do not invoke deferred ops' ->create_done callback if the deferred
           operation does not have an intent item associated with it
         - Remove duplicate inclusion of header files from scrub/health.c
         - Refactor Realtime code
         - Cleanup attr code"
      
      * tag 'xfs-6.8-merge-3' of git://git.kernel.org/pub/scm/fs/xfs/xfs-linux: (123 commits)
        xfs: use the op name in trace_xlog_intent_recovery_failed
        xfs: fix a use after free in xfs_defer_finish_recovery
        xfs: turn the XFS_DA_OP_REPLACE checks in xfs_attr_shortform_addname into asserts
        xfs: remove xfs_attr_sf_hdr_t
        xfs: remove struct xfs_attr_shortform
        xfs: use xfs_attr_sf_findname in xfs_attr_shortform_getvalue
        xfs: remove xfs_attr_shortform_lookup
        xfs: simplify xfs_attr_sf_findname
        xfs: move the xfs_attr_sf_lookup tracepoint
        xfs: return if_data from xfs_idata_realloc
        xfs: make if_data a void pointer
        xfs: fold xfs_rtallocate_extent into xfs_bmap_rtalloc
        xfs: simplify and optimize the RT allocation fallback cascade
        xfs: reorder the minlen and prod calculations in xfs_bmap_rtalloc
        xfs: remove XFS_RTMIN/XFS_RTMAX
        xfs: remove rt-wrappers from xfs_format.h
        xfs: factor out a xfs_rtalloc_sumlevel helper
        xfs: tidy up xfs_rtallocate_extent_exact
        xfs: merge the calls to xfs_rtallocate_range in xfs_rtallocate_block
        xfs: reflow the tail end of xfs_rtallocate_extent_block
        ...
      12958e9c
    • Linus Torvalds's avatar
      Merge tag 'fsnotify_for_v6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/jack/linux-fs · 32720aca
      Linus Torvalds authored
      Pull fsnotify updates from Jan Kara:
       "fanotify changes allowing use of fanotify directory events even for
        filesystems such as FUSE which don't report proper fsid"
      
      * tag 'fsnotify_for_v6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/jack/linux-fs:
        fanotify: allow "weak" fsid when watching a single filesystem
        fanotify: store fsid in mark instead of in connector
      32720aca
    • Linus Torvalds's avatar
      Merge tag 'fs_for_v6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/jack/linux-fs · 9963327f
      Linus Torvalds authored
      Pull small quota cleanup from Jan Kara.
      
      * tag 'fs_for_v6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/jack/linux-fs:
        quota: convert dquot_claim_space_nodirty() to return void
      9963327f
    • Chunhai Guo's avatar
      erofs: make erofs_{err,info}() support NULL sb parameter · aa12a790
      Chunhai Guo authored
      Make erofs_err() and erofs_info() support NULL sb parameter for more
      general usage.
      Suggested-by: default avatarGao Xiang <xiang@kernel.org>
      Signed-off-by: default avatarChunhai Guo <guochunhai@vivo.com>
      Link: https://lore.kernel.org/r/20240103123202.3054718-1-guochunhai@vivo.comReviewed-by: default avatarJingbo Xu <jefflexu@linux.alibaba.com>
      Reviewed-by: default avatarGao Xiang <hsiangkao@linux.alibaba.com>
      Signed-off-by: default avatarGao Xiang <hsiangkao@linux.alibaba.com>
      aa12a790
    • Gao Xiang's avatar
      erofs: avoid debugging output for (de)compressed data · 496530c7
      Gao Xiang authored
      Syzbot reported a KMSAN warning,
      erofs: (device loop0): z_erofs_lz4_decompress_mem: failed to decompress -12 in[46, 4050] out[917]
      =====================================================
      BUG: KMSAN: uninit-value in hex_dump_to_buffer+0xae9/0x10f0 lib/hexdump.c:194
        ..
        print_hex_dump+0x13d/0x3e0 lib/hexdump.c:276
        z_erofs_lz4_decompress_mem fs/erofs/decompressor.c:252 [inline]
        z_erofs_lz4_decompress+0x257e/0x2a70 fs/erofs/decompressor.c:311
        z_erofs_decompress_pcluster fs/erofs/zdata.c:1290 [inline]
        z_erofs_decompress_queue+0x338c/0x6460 fs/erofs/zdata.c:1372
        z_erofs_runqueue+0x36cd/0x3830
        z_erofs_read_folio+0x435/0x810 fs/erofs/zdata.c:1843
      
      The root cause is that the printed decompressed buffer may be filled
      incompletely due to decompression failure.  Since they were once only
      used for debugging, get rid of them now.
      
      Reported-and-tested-by: syzbot+6c746eea496f34b3161d@syzkaller.appspotmail.com
      Closes: https://lore.kernel.org/r/000000000000321c24060d7cfa1c@google.comReviewed-by: default avatarYue Hu <huyue2@coolpad.com>
      Signed-off-by: default avatarGao Xiang <hsiangkao@linux.alibaba.com>
      Link: https://lore.kernel.org/r/20231227151903.2900413-1-hsiangkao@linux.alibaba.com
      496530c7
    • Steve French's avatar
      cifs: update internal module version number for cifs.ko · 26ba1bf3
      Steve French authored
      From 2.46 to 2.47
      Signed-off-by: default avatarSteve French <stfrench@microsoft.com>
      26ba1bf3
    • Linus Torvalds's avatar
      Merge tag 'linux_kselftest-next-6.8-rc1' of... · ab27740f
      Linus Torvalds authored
      Merge tag 'linux_kselftest-next-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest
      
      Pull kselftest update from Shuah Khan:
       "Enhancements to reporting test results, fixes to root and user run
        behavior and fixing ksft_print_msg() calls"
      
      * tag 'linux_kselftest-next-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest:
        tracing/selftests: Add ownership modification tests for eventfs
        selftests: sched: Remove initialization to 0 for a static variable
        selftests: capabilities: namespace create varies for root and normal user
        selftests: prctl: Add prctl test for PR_GET_NAME
        kselftest/vDSO: Use ksft_print_msg() rather than printf in vdso_test_abi
        kselftest/vDSO: Fix message formatting for clock_id logging
        kselftest/vDSO: Make test name reporting for vdso_abi_test tooling friendly
        selftests:x86: Fix Format String Warnings in lam.c
        selftests/breakpoints: Fix format specifier in ksft_print_msg in step_after_suspend_test.c
        selftests:breakpoints: Fix Format String Warning in breakpoint_test
      ab27740f
    • Linus Torvalds's avatar
      Merge tag 'linux_kselftest-kunit-6.8-rc1' of... · 41daf06e
      Linus Torvalds authored
      Merge tag 'linux_kselftest-kunit-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest
      
      Pull KUnit updates from Shuah Khan:
      
       - a new feature that adds APIs for managing devices introducing a set
         of helper functions which allow devices (internally a struct
         kunit_device) to be created and managed by KUnit.
      
         These devices will be automatically unregistered on test exit. These
         helpers can either use a user-provided struct device_driver, or have
         one automatically created and managed by KUnit. In both cases, the
         device lives on a new kunit_bus.
      
       - changes to switch drm/tests to use kunit devices
      
       - several fixes and enhancements to attribute feature
      
       - changes to reorganize deferred action function introducing
         KUNIT_DEFINE_ACTION_WRAPPER
      
       - new feature adds ability to run tests after boot using debugfs
      
       - fixes and enhancements to string-stream-test:
           - parse ERR_PTR in string_stream_destroy()
           - unchecked dereference in bug fix in debugfs_print_results()
           - handling errors from alloc_string_stream()
           - NULL-dereference bug fix in kunit_init_suite()
      
      * tag 'linux_kselftest-kunit-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest: (27 commits)
        kunit: Fix some comments which were mistakenly kerneldoc
        kunit: Protect string comparisons against NULL
        kunit: Add example of kunit_activate_static_stub() with pointer-to-function
        kunit: Allow passing function pointer to kunit_activate_static_stub()
        kunit: Fix NULL-dereference in kunit_init_suite() if suite->log is NULL
        kunit: Reset test->priv after each param iteration
        kunit: Add example for using test->priv
        drm/tests: Switch to kunit devices
        ASoC: topology: Replace fake root_device with kunit_device in tests
        overflow: Replace fake root_device with kunit_device
        fortify: test: Use kunit_device
        kunit: Add APIs for managing devices
        Documentation: Add debugfs docs with run after boot
        kunit: add ability to run tests after boot using debugfs
        kunit: add is_init test attribute
        kunit: add example suite to test init suites
        kunit: add KUNIT_INIT_TABLE to init linker section
        kunit: move KUNIT_TABLE out of INIT_DATA
        kunit: tool: add test for parsing attributes
        kunit: tool: fix parsing of test attributes
        ...
      41daf06e
    • Linus Torvalds's avatar
      Merge tag 'linux_kselftest-nolibc-6.8-rc1' of... · 5d09f61e
      Linus Torvalds authored
      Merge tag 'linux_kselftest-nolibc-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest
      
      Pull nolibc updates from Shuah Khan:
      
       - Support for PIC mode on MIPS
      
       - Support for getrlimit()/setrlimit()
      
       - Replace some custom declarations with UAPI includes
      
       - A new script "run-tests.sh" to run the testsuite over different
         architectures and configurations
      
       - A few non-functional code cleanups
      
       - Minor improvements to nolibc-test, primarily to support the test
         script
      
      * tag 'linux_kselftest-nolibc-6.8-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest: (22 commits)
        selftests/nolibc: disable coredump via setrlimit
        tools/nolibc: add support for getrlimit/setrlimit
        tools/nolibc: drop custom definition of struct rusage
        tools/nolibc: drop duplicated testcase ioctl_tiocinq
        tools/nolibc: annotate va_list printf formats
        selftests/nolibc: make result alignment more robust
        tools/nolibc: mips: add support for PIC
        selftests/nolibc: run-tests.sh: enable testing via qemu-user
        selftests/nolibc: introduce QEMU_ARCH_USER
        selftests/nolibc: fix testcase status alignment
        selftests/nolibc: add configuration for mipso32be
        selftests/nolibc: extraconfig support
        selftests/nolibc: explicitly specify ABI for MIPS
        selftests/nolibc: use XARCH for MIPS
        tools/nolibc: move MIPS ABI validation into arch-mips.h
        tools/nolibc: error out on unsupported architecture
        selftests/nolibc: add script to run testsuite
        selftests/nolibc: support out-of-tree builds
        selftests/nolibc: anchor paths in $(srcdir) if possible
        selftests/nolibc: use EFI -bios for LoongArch qemu
        ...
      5d09f61e