1. 15 Jul, 2024 15 commits
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.iomap' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · 4f5e249e
      Linus Torvalds authored
      Pull iomap updates from Christian Brauner:
       "This contains some minor work for the iomap subsystem:
      
         - Add documentation on the design of iomap and how to port to it
      
         - Optimize iomap_read_folio()
      
         - Bring back the change to iomap_write_end() to no increase i_size.
      
           This is accompanied by a change to xfs to reserve blocks for
           truncating large realtime inodes to avoid exposing stale data when
           iomap_write_end() stops increasing i_size"
      
      * tag 'vfs-6.11.iomap' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs:
        iomap: don't increase i_size in iomap_write_end()
        xfs: reserve blocks for truncating large realtime inode
        Documentation: the design of iomap and how to port
        iomap: Optimize iomap_read_folio
      4f5e249e
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.pidfs' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · 98f3a9a4
      Linus Torvalds authored
      Pull pidfs updates from Christian Brauner:
       "This contains work to make it possible to derive namespace file
        descriptors from pidfd file descriptors.
      
        Right now it is already possible to use a pidfd with setns() to
        atomically change multiple namespaces at the same time. In other
        words, it is possible to switch to the namespace context of a process
        using a pidfd. There is no need to first open namespace file
        descriptors via procfs.
      
        The work included here is an extension of these abilities by allowing
        to open namespace file descriptors using a pidfd. This means it is now
        possible to interact with namespaces without ever touching procfs.
      
        To this end a new set of ioctls() on pidfds is introduced covering all
        supported namespace types"
      
      * tag 'vfs-6.11.pidfs' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs:
        pidfs: allow retrieval of namespace file descriptors
        nsfs: add open_namespace()
        nsproxy: add helper to go from arbitrary namespace to ns_common
        nsproxy: add a cleanup helper for nsproxy
        file: add take_fd() cleanup helper
      98f3a9a4
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.nsfs' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · 1b074abe
      Linus Torvalds authored
      Pull namespace-fs updates from Christian Brauner:
       "This adds ioctls allowing to translate PIDs between PID namespaces.
      
        The motivating use-case comes from LXCFS which is a tiny fuse
        filesystem used to virtualize various aspects of procfs. LXCFS is run
        on the host. The files and directories it creates can be bind-mounted
        by e.g. a container at startup and mounted over the various procfs
        files the container wishes to have virtualized.
      
        When e.g. a read request for uptime is received, LXCFS will receive
        the pid of the reader. In order to virtualize the corresponding read,
        LXCFS needs to know the pid of the init process of the reader's pid
        namespace.
      
        In order to do this, LXCFS first needs to fork() two helper processes.
        The first helper process setns() to the readers pid namespace. The
        second helper process is needed to create a process that is a proper
        member of the pid namespace.
      
        The second helper process then creates a ucred message with ucred.pid
        set to 1 and sends it back to LXCFS. The kernel will translate the
        ucred.pid field to the corresponding pid number in LXCFS's pid
        namespace. This way LXCFS can learn the init pid number of the
        reader's pid namespace and can go on to virtualize.
      
        Since these two forks() are costly LXCFS maintains an init pid cache
        that caches a given pid for a fixed amount of time. The cache is
        pruned during new read requests. However, even with the cache the hit
        of the two forks() is singificant when a very large number of
        containers are running.
      
        So this adds a simple set of ioctls that let's a caller translate PIDs
        from and into a given PID namespace. This significantly improves
        performance with a very simple change.
      
        To protect against races pidfds can be used to check whether the
        process is still valid"
      
      * tag 'vfs-6.11.nsfs' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs:
        nsfs: add pid translation ioctls
      1b074abe
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.mount' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · f608caba
      Linus Torvalds authored
      Pull vfs mount query updates from Christian Brauner:
       "This contains work to extend the abilities of listmount() and
        statmount() and various fixes and cleanups.
      
        Features:
      
         - Allow iterating through mounts via listmount() from newest to
           oldest. This makes it possible for mount(8) to keep iterating the
           mount table in reverse order so it gets newest mounts first.
      
         - Relax permissions on listmount() and statmount().
      
           It's not necessary to have capabilities in the initial namespace:
           it is sufficient to have capabilities in the owning namespace of
           the mount namespace we're located in to list unreachable mounts in
           that namespace.
      
         - Extend both listmount() and statmount() to list and stat mounts in
           foreign mount namespaces.
      
           Currently the only way to iterate over mount entries in mount
           namespaces that aren't in the caller's mount namespace is by
           crawling through /proc in order to find /proc/<pid>/mountinfo for
           the relevant mount namespace.
      
           This is both very clumsy and hugely inefficient. So extend struct
           mnt_id_req with a new member that allows to specify the mount
           namespace id of the mount namespace we want to look at.
      
           Luckily internally we already have most of the infrastructure for
           this so we just need to expose it to userspace. Give userspace a
           way to retrieve the id of a mount namespace via statmount() and
           through a new nsfs ioctl() on mount namespace file descriptor.
      
           This comes with appropriate selftests.
      
         - Expose mount options through statmount().
      
           Currently if userspace wants to get mount options for a mount and
           with statmount(), they still have to open /proc/<pid>/mountinfo to
           parse mount options. Simply the information through statmount()
           directly.
      
           Afterwards it's possible to only rely on statmount() and
           listmount() to retrieve all and more information than
           /proc/<pid>/mountinfo provides.
      
           This comes with appropriate selftests.
      
        Fixes:
      
         - Avoid copying to userspace under the namespace semaphore in
           listmount.
      
        Cleanups:
      
         - Simplify the error handling in listmount by relying on our newly
           added cleanup infrastructure.
      
         - Refuse invalid mount ids early for both listmount and statmount"
      
      * tag 'vfs-6.11.mount' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs:
        fs: reject invalid last mount id early
        fs: refuse mnt id requests with invalid ids early
        fs: find rootfs mount of the mount namespace
        fs: only copy to userspace on success in listmount()
        sefltests: extend the statmount test for mount options
        fs: use guard for namespace_sem in statmount()
        fs: export mount options via statmount()
        fs: rename show_mnt_opts -> show_vfsmnt_opts
        selftests: add a test for the foreign mnt ns extensions
        fs: add an ioctl to get the mnt ns id from nsfs
        fs: Allow statmount() in foreign mount namespace
        fs: Allow listmount() in foreign mount namespace
        fs: export the mount ns id via statmount
        fs: keep an index of current mount namespaces
        fs: relax permissions for statmount()
        listmount: allow listing in reverse order
        fs: relax permissions for listmount()
        fs: simplify error handling
        fs: don't copy to userspace under namespace semaphore
        path: add cleanup helper
      f608caba
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.inode' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · 2aae1d67
      Linus Torvalds authored
      Pull vfs inode / dentry updates from Christian Brauner:
       "This contains smaller performance improvements to inodes and dentries:
      
        inode:
      
         - Add rcu based inode lookup variants.
      
           They avoid one inode hash lock acquire in the common case thereby
           significantly reducing contention. We already support RCU-based
           operations but didn't take advantage of them during inode
           insertion.
      
           Callers of iget_locked() get the improvement without any code
           changes. Callers that need a custom callback can switch to
           iget5_locked_rcu() as e.g., did btrfs.
      
           With 20 threads each walking a dedicated 1000 dirs * 1000 files
           directory tree to stat(2) on a 32 core + 24GB ram vm:
      
              before: 3.54s user 892.30s system 1966% cpu 45.549 total
              after:  3.28s user 738.66s system 1955% cpu 37.932 total (-16.7%)
      
           Long-term we should pick up the effort to introduce more
           fine-grained locking and possibly improve on the currently used
           hash implementation.
      
         - Start zeroing i_state in inode_init_always() instead of doing it in
           individual filesystems.
      
           This allows us to remove an unneeded lock acquire in new_inode()
           and not burden individual filesystems with this.
      
        dcache:
      
         - Move d_lockref out of the area used by RCU lookup to avoid
           cacheline ping poing because the embedded name is sharing a
           cacheline with d_lockref.
      
         - Fix dentry size on 32bit with CONFIG_SMP=y so it does actually end
           up with 128 bytes in total"
      
      * tag 'vfs-6.11.inode' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs:
        fs: fix dentry size
        vfs: move d_lockref out of the area used by RCU lookup
        bcachefs: remove now spurious i_state initialization
        xfs: remove now spurious i_state initialization in xfs_inode_alloc
        vfs: partially sanitize i_state zeroing on inode creation
        xfs: preserve i_state around inode_init_always in xfs_reinit_inode
        btrfs: use iget5_locked_rcu
        vfs: add rcu-based find_inode variants for iget ops
      2aae1d67
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.mount.api' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · b8fc1bd7
      Linus Torvalds authored
      Pull vfs mount API updates from Christian Brauner:
      
       - Add a generic helper to parse uid and gid mount options.
      
         Currently we open-code the same logic in various filesystems which is
         error prone, especially since the verification of uid and gid mount
         options is a sensitive operation in the face of idmappings.
      
         Add a generic helper and convert all filesystems over to it. Make
         sure that filesystems that are mountable in unprivileged containers
         verify that the specified uid and gid can be represented in the
         owning namespace of the filesystem.
      
       - Convert hostfs to the new mount api.
      
      * tag 'vfs-6.11.mount.api' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs:
        fuse: Convert to new uid/gid option parsing helpers
        fuse: verify {g,u}id mount options correctly
        fat: Convert to new uid/gid option parsing helpers
        fat: Convert to new mount api
        fat: move debug into fat_mount_options
        vboxsf: Convert to new uid/gid option parsing helpers
        tracefs: Convert to new uid/gid option parsing helpers
        smb: client: Convert to new uid/gid option parsing helpers
        tmpfs: Convert to new uid/gid option parsing helpers
        ntfs3: Convert to new uid/gid option parsing helpers
        isofs: Convert to new uid/gid option parsing helpers
        hugetlbfs: Convert to new uid/gid option parsing helpers
        ext4: Convert to new uid/gid option parsing helpers
        exfat: Convert to new uid/gid option parsing helpers
        efivarfs: Convert to new uid/gid option parsing helpers
        debugfs: Convert to new uid/gid option parsing helpers
        autofs: Convert to new uid/gid option parsing helpers
        fs_parse: add uid & gid option option parsing helpers
        hostfs: Add const qualifier to host_root in hostfs_fill_super()
        hostfs: convert hostfs to use the new mount API
      b8fc1bd7
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.casefold' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · 4a051e4c
      Linus Torvalds authored
      Pull vfs casefolding updates from Christian Brauner:
       "This contains some work to simplify the handling of casefolded names:
      
         - Simplify the handling of casefolded names in f2fs and ext4 by
           keeping the names as a qstr to avoiding unnecessary conversions
      
         - Introduce a new generic_ci_match() libfs case-insensitive lookup
           helper and use it in both f2fs and ext4 allowing to remove the
           filesystem specific implementations
      
         - Remove a bunch of ifdefs by making the unicode build checks part of
           the code flow"
      
      * tag 'vfs-6.11.casefold' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs:
        f2fs: Move CONFIG_UNICODE defguards into the code flow
        ext4: Move CONFIG_UNICODE defguards into the code flow
        f2fs: Reuse generic_ci_match for ci comparisons
        ext4: Reuse generic_ci_match for ci comparisons
        libfs: Introduce case-insensitive string comparison helper
        f2fs: Simplify the handling of cached casefolded names
        ext4: Simplify the handling of cached casefolded names
      4a051e4c
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.module.description' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · 7d156879
      Linus Torvalds authored
      Pull vfs module description updates from Christian Brauner:
       "This contains patches to add module descriptions to all modules under
        fs/ currently lacking them"
      
      * tag 'vfs-6.11.module.description' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs:
        openpromfs: add missing MODULE_DESCRIPTION() macro
        fs: nls: add missing MODULE_DESCRIPTION() macros
        fs: autofs: add MODULE_DESCRIPTION()
        fs: fat: add missing MODULE_DESCRIPTION() macros
        fs: binfmt: add missing MODULE_DESCRIPTION() macros
        fs: cramfs: add MODULE_DESCRIPTION()
        fs: hfs: add MODULE_DESCRIPTION()
        fs: hpfs: add MODULE_DESCRIPTION()
        qnx4: add MODULE_DESCRIPTION()
        qnx6: add MODULE_DESCRIPTION()
        fs: sysv: add MODULE_DESCRIPTION()
        fs: efs: add MODULE_DESCRIPTION()
        fs: minix: add MODULE_DESCRIPTION()
      7d156879
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.pg_error' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · aff31330
      Linus Torvalds authored
      Pull PG_error removal updates from Christian Brauner:
       "This contains work to remove almost all remaining users of PG_error
        from filesystems and filesystem helper libraries. An additional patch
        will be coming in via the jfs tree which tests the PG_error bit.
      
        Afterwards nothing will be testing it anymore and it's safe to remove
        all places which set or clear the PG_error bit.
      
        The goal is to fully remove PG_error by the next merge window"
      
      * tag 'vfs-6.11.pg_error' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs:
        buffer: Remove calls to set and clear the folio error flag
        iomap: Remove calls to set and clear folio error flag
        vboxsf: Convert vboxsf_read_folio() to use a folio
        ufs: Remove call to set the folio error flag
        romfs: Convert romfs_read_folio() to use a folio
        reiserfs: Remove call to folio_set_error()
        orangefs: Remove calls to set/clear the error flag
        nfs: Remove calls to folio_set_error
        jffs2: Remove calls to set/clear the folio error flag
        hostfs: Convert hostfs_read_folio() to use a folio
        isofs: Convert rock_ridge_symlink_read_folio to use a folio
        hpfs: Convert hpfs_symlink_read_folio to use a folio
        efs: Convert efs_symlink_read_folio to use a folio
        cramfs: Convert cramfs_read_folio to use a folio
        coda: Convert coda_symlink_filler() to use folio_end_read()
        befs: Convert befs_symlink_read_folio() to use folio_end_read()
      aff31330
    • Linus Torvalds's avatar
      Merge tag 'vfs-6.11.misc' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs · b051320d
      Linus Torvalds authored
      Pull misc vfs updates from Christian Brauner:
       "Features:
      
         - Support passing NULL along AT_EMPTY_PATH for statx().
      
           NULL paths with any flag value other than AT_EMPTY_PATH go the
           usual route and end up with -EFAULT to retain compatibility (Rust
           is abusing calls of the sort to detect availability of statx)
      
           This avoids path lookup code, lockref management, memory allocation
           and in case of NULL path userspace memory access (which can be
           quite expensive with SMAP on x86_64)
      
         - Don't block i_writecount during exec. Remove the
           deny_write_access() mechanism for executables
      
         - Relax open_by_handle_at() permissions in specific cases where we
           can prove that the caller had sufficient privileges to open a file
      
         - Switch timespec64 fields in struct inode to discrete integers
           freeing up 4 bytes
      
        Fixes:
      
         - Fix false positive circular locking warning in hfsplus
      
         - Initialize hfs_inode_info after hfs_alloc_inode() in hfs
      
         - Avoid accidental overflows in vfs_fallocate()
      
         - Don't interrupt fallocate with EINTR in tmpfs to avoid constantly
           restarting shmem_fallocate()
      
         - Add missing quote in comment in fs/readdir
      
        Cleanups:
      
         - Don't assign and test in an if statement in mqueue. Move the
           assignment out of the if statement
      
         - Reflow the logic in may_create_in_sticky()
      
         - Remove the usage of the deprecated ida_simple_xx() API from procfs
      
         - Reject FSCONFIG_CMD_CREATE_EXCL requets that depend on the new
           mount api early
      
         - Rename variables in copy_tree() to make it easier to understand
      
         - Replace WARN(down_read_trylock, ...) abuse with proper asserts in
           various places in the VFS
      
         - Get rid of user_path_at_empty() and drop the empty argument from
           getname_flags()
      
         - Check for error while copying and no path in one branch in
           getname_flags()
      
         - Avoid redundant smp_mb() for THP handling in do_dentry_open()
      
         - Rename parent_ino to d_parent_ino and make it use RCU
      
         - Remove unused header include in fs/readdir
      
         - Export in_group_capable() helper and switch f2fs and fuse over to
           it instead of open-coding the logic in both places"
      
      * tag 'vfs-6.11.misc' of git://git.kernel.org/pub/scm/linux/kernel/git/vfs/vfs: (27 commits)
        ipc: mqueue: remove assignment from IS_ERR argument
        vfs: rename parent_ino to d_parent_ino and make it use RCU
        vfs: support statx(..., NULL, AT_EMPTY_PATH, ...)
        stat: use vfs_empty_path() helper
        fs: new helper vfs_empty_path()
        fs: reflow may_create_in_sticky()
        vfs: remove redundant smp_mb for thp handling in do_dentry_open
        fuse: Use in_group_or_capable() helper
        f2fs: Use in_group_or_capable() helper
        fs: Export in_group_or_capable()
        vfs: reorder checks in may_create_in_sticky
        hfs: fix to initialize fields of hfs_inode_info after hfs_alloc_inode()
        proc: Remove usage of the deprecated ida_simple_xx() API
        hfsplus: fix to avoid false alarm of circular locking
        Improve readability of copy_tree
        vfs: shave a branch in getname_flags
        vfs: retire user_path_at_empty and drop empty arg from getname_flags
        vfs: stop using user_path_at_empty in do_readlinkat
        tmpfs: don't interrupt fallocate with EINTR
        fs: don't block i_writecount during exec
        ...
      b051320d
    • Linus Torvalds's avatar
      Merge tag 'drm-fixes-2024-07-12' of https://gitlab.freedesktop.org/drm/kernel · 2ffd45da
      Linus Torvalds authored
      Pull drm fixes from Dave Airlie:
       "Oh I screwed up last week's fixes pull, and forgot to send..
      
        Back to work, thanks to Sima for last week, not too many fixes as
        expected getting close to release [ sic - Linus ], amdgpu and xe have
        a couple each, and then some other misc ones.
      
        amdgpu:
         - PSR-SU fix
         - Reseved VMID fix
      
        xe:
         - Use write-back caching mode for system memory on DGFX
         - Do not leak object when finalizing hdcp gsc
      
        bridge:
         - adv7511 EDID irq fix
      
        gma500:
         - NULL mode fixes.
      
        meson:
         - fix resource leak"
      
      * tag 'drm-fixes-2024-07-12' of https://gitlab.freedesktop.org/drm/kernel:
        Revert "drm/amd/display: Reset freesync config before update new state"
        drm/xe/display/xe_hdcp_gsc: Free arbiter on driver removal
        drm/xe: Use write-back caching mode for system memory on DGFX
        drm/amdgpu: reject gang submit on reserved VMIDs
        drm/gma500: fix null pointer dereference in cdv_intel_lvds_get_modes
        drm/gma500: fix null pointer dereference in psb_intel_lvds_get_modes
        drm/meson: fix canvas release in bind function
        drm/bridge: adv7511: Fix Intermittent EDID failures
      2ffd45da
    • Linus Torvalds's avatar
      Merge branch 'link_path_walk' · 5e049755
      Linus Torvalds authored
      This is the last - for now - of the "look, we generated some
      questionable code for basic pathname lookup operations" set of
      branches.
      
      This is mainly just re-organizing the name hashing code in
      link_path_walk(), mostly by improving the calling conventions to
      the inlined helper functions and moving some of the code around
      to allow for more straightforward code generation.
      
      The profiles - and the generated code - look much more palatable
      to me now.
      
      * link_path_walk:
        vfs: link_path_walk: move more of the name hashing into hash_name()
        vfs: link_path_walk: improve may_lookup() code generation
        vfs: link_path_walk: do '.' and '..' detection while hashing
        vfs: link_path_walk: clarify and improve name hashing interface
        vfs: link_path_walk: simplify name hash flow
      5e049755
    • Linus Torvalds's avatar
      Merge branch 'arm64-uaccess' (early part) · 1654c37d
      Linus Torvalds authored
      Merge arm64 support for proper 'unsafe' user accessor functionality,
      with 'asm goto' for handling exceptions.
      
      The arm64 user access code used the slow fallback code for the user
      access code, which generates horrendous code for things like
      strncpy_from_user(), because it causes us to generate code for SW PAN
      and for range checking for every individual word.
      
      Teach arm64 about 'user_access_begin()' and the so-called 'unsafe' user
      access functions that take an error label and use 'asm goto' to make all
      the exception handling be entirely out of line.
      
      [ These user access functions are called 'unsafe' not because the
        concept is unsafe, but because the low-level accessor functions
        absolutely have to be protected by the 'user_access_begin()' code,
        because that's what does the range checking.
      
        So the accessor functions have that scary name to make sure people
        don't think they are usable on their own, and cannot be mis-used the
        way our old "double underscore" versions of __get_user() and friends
        were ]
      
      The "(early part)" of the branch is because the full branch also
      improved on the "access_ok()" function, but the exact semantics of TBI
      (top byte ignore) have to be discussed before doing that part.  So this
      just does the low-level accessor update to use "asm goto".
      
      * 'arm64-uaccess' (early part):
        arm64: start using 'asm goto' for put_user()
        arm64: start using 'asm goto' for get_user() when available
      1654c37d
    • Linus Torvalds's avatar
      Merge branch 'word-at-a-time' · 6a31ffdf
      Linus Torvalds authored
      Merge minor word-at-a-time instruction choice improvements for x86 and
      arm64.
      
      This is the second of four branches that came out of me looking at the
      code generation for path lookup on arm64.
      
      The word-at-a-time infrastructure is used to do string operations in
      chunks of one word both when copying the pathname from user space (in
      strncpy_from_user()), and when parsing and hashing the individual path
      components (in link_path_walk()).
      
      In particular, the "find the first zero byte" uses various bit tricks to
      figure out the end of the string or path component, and get the length
      without having to do things one byte at a time.  Both x86-64 and arm64
      had less than optimal code choices for that.
      
      The commit message for the arm64 change in particular tries to explain
      the exact code flow for the zero byte finding for people who care.  It's
      made a bit more complicated by the fact that we support big-endian
      hardware too, and so we have some extra abstraction layers to allow
      different models for finding the zero byte, quite apart from the issue
      of picking specialized instructions.
      
      * word-at-a-time:
        arm64: word-at-a-time: improve byte count calculations for LE
        x86-64: word-at-a-time: improve byte count calculations
      6a31ffdf
    • Linus Torvalds's avatar
      Merge branch 'runtime-constants' · a5819099
      Linus Torvalds authored
      Merge runtime constants infrastructure with implementations for x86 and
      arm64.
      
      This is one of four branches that came out of me looking at profiles of
      my kernel build filesystem load on my 128-core Altra arm64 system, where
      pathname walking and the user copies (particularly strncpy_from_user()
      for fetching the pathname from user space) is very hot.
      
      This is a very specialized "instruction alternatives" model where the
      dentry hash pointer and hash count will be constants for the lifetime of
      the kernel, but the allocation are not static but done early during the
      kernel boot.  In order to avoid the pointer load and dynamic shift, we
      just rewrite the constants in the instructions in place.
      
      We can't use the "generic" alternative instructions infrastructure,
      because different architectures do it very differently, and it's
      actually simpler to just have very specific helpers, with a fallback to
      the generic ("old") model of just using variables for architectures that
      do not implement the runtime constant patching infrastructure.
      
      Link: https://lore.kernel.org/all/CAHk-=widPe38fUNjUOmX11ByDckaeEo9tN4Eiyke9u1SAtu9sA@mail.gmail.com/
      
      * runtime-constants:
        arm64: add 'runtime constant' support
        runtime constants: add x86 architecture support
        runtime constants: add default dummy infrastructure
        vfs: dcache: move hashlen_hash() from callers into d_hash()
      a5819099
  2. 14 Jul, 2024 7 commits
  3. 13 Jul, 2024 6 commits
    • Linus Torvalds's avatar
      Merge tag 'i2c-for-6.10-rc8' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux · 4d145e3f
      Linus Torvalds authored
      Pull i2c fixes from Wolfram Sang:
       "Fixes for the I2C testunit, the Renesas R-Car driver and some
        MAINTAINERS corrections"
      
      * tag 'i2c-for-6.10-rc8' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux:
        i2c: testunit: avoid re-issued work after read message
        i2c: rcar: ensure Gen3+ reset does not disturb local targets
        i2c: mark HostNotify target address as used
        i2c: testunit: correct Kconfig description
        MAINTAINERS: VIRTIO I2C loses a maintainer, gains a reviewer
        MAINTAINERS: delete entries for Thor Thayer
        i2c: rcar: clear NO_RXDMA flag after resetting
        i2c: rcar: bring hardware to known state when probing
      4d145e3f
    • Linus Torvalds's avatar
      Merge tag '6.10-rc7-smb3-client-fix' of git://git.samba.org/sfrench/cifs-2.6 · d0d0cd38
      Linus Torvalds authored
      Pull smb client fix from Steve French:
       "Small fix, also for stable"
      
      * tag '6.10-rc7-smb3-client-fix' of git://git.samba.org/sfrench/cifs-2.6:
        cifs: fix setting SecurityFlags to true
      d0d0cd38
    • Steve French's avatar
      cifs: fix setting SecurityFlags to true · d2346e28
      Steve French authored
      If you try to set /proc/fs/cifs/SecurityFlags to 1 it
      will set them to CIFSSEC_MUST_NTLMV2 which no longer is
      relevant (the less secure ones like lanman have been removed
      from cifs.ko) and is also missing some flags (like for
      signing and encryption) and can even cause mount to fail,
      so change this to set it to Kerberos in this case.
      
      Also change the description of the SecurityFlags to remove mention
      of flags which are no longer supported.
      
      Cc: stable@vger.kernel.org
      Reviewed-by: default avatarShyam Prasad N <sprasad@microsoft.com>
      Signed-off-by: default avatarSteve French <stfrench@microsoft.com>
      d2346e28
    • Wolfram Sang's avatar
      Merge tag 'i2c-host-fixes-6.10-rc8' of... · 3fdd2d21
      Wolfram Sang authored
      Merge tag 'i2c-host-fixes-6.10-rc8' of git://git.kernel.org/pub/scm/linux/kernel/git/andi.shyti/linux into i2c/for-current
      
      This tag includes three fixes for the Renesas R-Car driver:
      
       1. Ensures the device is in a known state after probing.
       2. Allows clearing the NO_RXDMA flag after a reset.
       3. Forces a reset before any transfer on Gen3+ platforms to
          prevent disruption of the configuration during parallel
          transfers.
      3fdd2d21
    • Linus Torvalds's avatar
      Merge tag 'net-6.10-rc8-2' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net · 528dd46d
      Linus Torvalds authored
      Pull more networking fixes from Jakub Kicinski:
       "A quick follow up to yesterday's pull. We got a regressions report for
        the bnxt patch as soon as it got to your tree. The ethtool fix is also
        good to have, although it's an older regression.
      
        Current release - regressions:
      
         - eth: bnxt_en: fix crash in bnxt_get_max_rss_ctx_ring() on older HW
           when user tries to decrease the ring count
      
        Previous releases - regressions:
      
         - ethtool: fix RSS setting, accept "no change" setting if the driver
           doesn't support the new features
      
         - eth: i40e: remove needless retries of NVM update, don't wait 20min
           when we know the firmware update won't succeed"
      
      * tag 'net-6.10-rc8-2' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net:
        bnxt_en: Fix crash in bnxt_get_max_rss_ctx_ring()
        octeontx2-af: fix issue with IPv4 match for RSS
        octeontx2-af: fix issue with IPv6 ext match for RSS
        octeontx2-af: fix detection of IP layer
        octeontx2-af: fix a issue with cpt_lf_alloc mailbox
        octeontx2-af: replace cpt slot with lf id on reg write
        i40e: fix: remove needless retries of NVM update
        net: ethtool: Fix RSS setting
      528dd46d
    • Michael Chan's avatar
      bnxt_en: Fix crash in bnxt_get_max_rss_ctx_ring() · f7ce5eb2
      Michael Chan authored
      On older chips not supporting multiple RSS contexts, reducing
      ethtool channels will crash:
      
      BUG: kernel NULL pointer dereference, address: 00000000000000b8
      PGD 0 P4D 0
      Oops: Oops: 0000 [#1] PREEMPT SMP PTI
      CPU: 1 PID: 7032 Comm: ethtool Tainted: G S                 6.10.0-rc4 #1
      Hardware name: Dell Inc. PowerEdge R730/072T6D, BIOS 2.4.3 01/17/2017
      RIP: 0010:bnxt_get_max_rss_ctx_ring+0x4c/0x90 [bnxt_en]
      Code: c3 d3 eb 4c 8b 83 38 01 00 00 48 8d bb 38 01 00 00 4c 39 c7 74 42 41 8d 54 24 ff 31 c0 0f b7 d2 4c 8d 4c 12 02 66 85 ed 74 1d <49> 8b 90 b8 00 00 00 49 8d 34 11 0f b7 0a 66 39 c8 0f 42 c1 48 83
      RSP: 0018:ffffaaa501d23ba8 EFLAGS: 00010202
      RAX: 0000000000000000 RBX: ffff8efdf600c940 RCX: 0000000000000000
      RDX: 000000000000007f RSI: ffffffffacf429c4 RDI: ffff8efdf600ca78
      RBP: 0000000000000080 R08: 0000000000000000 R09: 0000000000000100
      R10: 0000000000000001 R11: ffffaaa501d238c0 R12: 0000000000000080
      R13: 0000000000000000 R14: ffff8efdf600c000 R15: 0000000000000006
      FS:  00007f977a7d2740(0000) GS:ffff8f041f840000(0000) knlGS:0000000000000000
      CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      CR2: 00000000000000b8 CR3: 00000002320aa004 CR4: 00000000003706f0
      DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
      Call Trace:
      <TASK>
      ? __die_body+0x15/0x60
      ? page_fault_oops+0x157/0x440
      ? do_user_addr_fault+0x60/0x770
      ? _raw_spin_lock_irqsave+0x12/0x40
      ? exc_page_fault+0x61/0x120
      ? asm_exc_page_fault+0x22/0x30
      ? bnxt_get_max_rss_ctx_ring+0x4c/0x90 [bnxt_en]
      ? bnxt_get_max_rss_ctx_ring+0x25/0x90 [bnxt_en]
      bnxt_set_channels+0x9d/0x340 [bnxt_en]
      ethtool_set_channels+0x14b/0x210
      __dev_ethtool+0xdf8/0x2890
      ? preempt_count_add+0x6a/0xa0
      ? percpu_counter_add_batch+0x23/0x90
      ? filemap_map_pages+0x417/0x4a0
      ? avc_has_extended_perms+0x185/0x420
      ? __pfx_udp_ioctl+0x10/0x10
      ? sk_ioctl+0x55/0xf0
      ? kmalloc_trace_noprof+0xe0/0x210
      ? dev_ethtool+0x54/0x170
      dev_ethtool+0xa2/0x170
      dev_ioctl+0xbe/0x530
      sock_do_ioctl+0xa3/0xf0
      sock_ioctl+0x20d/0x2e0
      
      bp->rss_ctx_list is not initialized if the chip or firmware does not
      support multiple RSS contexts.  Fix it by adding a check in
      bnxt_get_max_rss_ctx_ring() before proceeding to reference
      bp->rss_ctx_list.
      
      Fixes: 0d1b7d6c ("bnxt: fix crashes when reducing ring count with active RSS contexts")
      Reported-by: default avatarBreno Leitao <leitao@debian.org>
      Link: https://lore.kernel.org/netdev/ZpFEJeNpwxW1aW9k@gmail.com/Reviewed-by: default avatarAndy Gospodarek <andrew.gospodarek@broadcom.com>
      Signed-off-by: default avatarMichael Chan <michael.chan@broadcom.com>
      Link: https://patch.msgid.link/20240712175318.166811-1-michael.chan@broadcom.comSigned-off-by: default avatarJakub Kicinski <kuba@kernel.org>
      f7ce5eb2
  4. 12 Jul, 2024 12 commits
    • Linus Torvalds's avatar
      Merge tag 'for-6.10-rc7-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux · 975f3b6d
      Linus Torvalds authored
      Pull btrfs fixes from David Sterba:
       "Fix a regression in extent map shrinker behaviour.
      
        In the past weeks we got reports from users that there are huge
        latency spikes or freezes. This was bisected to newly added shrinker
        of extent maps (it was added to fix a build up of the structures in
        memory).
      
        I'm assuming that the freezes would happen to many users after release
        so I'd like to get it merged now so it's in 6.10. Although the diff
        size is not small the changes are relatively straightforward, the
        reporters verified the fixes and we did testing on our side.
      
        The fixes:
      
         - adjust behaviour under memory pressure and check lock or scheduling
           conditions, bail out if needed
      
         - synchronize tracking of the scanning progress so inode ranges are
           not skipped or work duplicated
      
         - do a delayed iput when scanning a root so evicting an inode does
           not slow things down in case of lots of dirty data, also fix
           lockdep warning, a deadlock could happen when writing the dirty
           data would need to start a transaction"
      
      * tag 'for-6.10-rc7-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux:
        btrfs: avoid races when tracking progress for extent map shrinking
        btrfs: stop extent map shrinker if reschedule is needed
        btrfs: use delayed iput during extent map shrinking
      975f3b6d
    • Linus Torvalds's avatar
      Merge tag 'ceph-for-6.10-rc8' of https://github.com/ceph/ceph-client · a52ff901
      Linus Torvalds authored
      Pull ceph fixes from Ilya Dryomov:
       "A fix for a possible use-after-free following "rbd unmap" or "umount"
        marked for stable and two kernel-doc fixups"
      
      * tag 'ceph-for-6.10-rc8' of https://github.com/ceph/ceph-client:
        libceph: fix crush_choose_firstn() kernel-doc warnings
        libceph: suppress crush_choose_indep() kernel-doc warnings
        libceph: fix race between delayed_work() and ceph_monc_stop()
      a52ff901
    • Linus Torvalds's avatar
      Merge tag 'pmdomain-v6.10-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/linux-pm · ac6a9e07
      Linus Torvalds authored
      Pull pmdomain fix from Ulf Hansson:
      
       - qcom: Skip retention level for rpmhpd's
      
      * tag 'pmdomain-v6.10-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/linux-pm:
        pmdomain: qcom: rpmhpd: Skip retention level for Power Domains
      ac6a9e07
    • Linus Torvalds's avatar
      Merge tag 'mmc-v6.10-rc4-2' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/mmc · 01ec3bb6
      Linus Torvalds authored
      Pull MMC host fixes from Ulf Hansson:
      
       - davinci_mmc: Prevent transmitted data size from exceeding sgm's
         length
      
       - sdhci: Fix max_seg_size for 64KiB PAGE_SIZE
      
      * tag 'mmc-v6.10-rc4-2' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/mmc:
        mmc: davinci_mmc: Prevent transmitted data size from exceeding sgm's length
        mmc: sdhci: Fix max_seg_size for 64KiB PAGE_SIZE
      01ec3bb6
    • Linus Torvalds's avatar
      Merge tag 'arm-fixes-6.10-3' of git://git.kernel.org/pub/scm/linux/kernel/git/soc/soc · e091caf9
      Linus Torvalds authored
      Pull ARM SoC fixes from Arnd Bergmann:
       "Most of these changes are Qualcomm SoC specific and came in just after
        I sent out the last set of fixes. This includes two regression fixes
        for SoC drivers, a defconfig change to ensure the Lenovo X13s is
        usable and 11 changes to DT files to fix regressions and minor
        platform specific issues.
      
        Tony and Chunyan step back from their respective maintainership roles
        on the omap and unisoc platforms, and Christophe in turn takes over
        maintaining some of the Freescale SoC drivers that he has been taking
        care of in practice already.
      
        Lastly, there are two trivial fixes for the davinci and sunxi
        platforms"
      
      * tag 'arm-fixes-6.10-3' of git://git.kernel.org/pub/scm/linux/kernel/git/soc/soc:
        MAINTAINERS: Update FREESCALE SOC DRIVERS and QUICC ENGINE LIBRARY
        MAINTAINERS: Add more maintainers for omaps
        ARM: davinci: Convert comma to semicolon
        MAINTAINERS: Move myself from SPRD Maintainer to Reviewer
        Revert "dt-bindings: cache: qcom,llcc: correct QDU1000 reg entries"
        arm64: dts: qcom: qdu1000: Fix LLCC reg property
        arm64: dts: qcom: sm6115: add iommu for sdhc_1
        arm64: dts: qcom: x1e80100-crd: fix DAI used for headset recording
        arm64: dts: qcom: x1e80100-crd: fix WCD audio codec TX port mapping
        soc: qcom: pmic_glink: disable UCSI on sc8280xp
        arm64: defconfig: enable Elan i2c-hid driver
        arm64: dts: qcom: sc8280xp-crd: use external pull up for touch reset
        arm64: dts: qcom: sc8280xp-x13s: fix touchscreen power on
        arm64: dts: qcom: x1e80100: Fix PCIe 6a reg offsets and add MHI
        arm64: dts: qcom: sa8775p: Correct IRQ number of EL2 non-secure physical timer
        arm64: dts: allwinner: Fix PMIC interrupt number
        arm64: dts: qcom: sc8280xp: Set status = "reserved" on PSHOLD
        arm64: dts: qcom: x1e80100-*: Allocate some CMA buffers
        arm64: dts: qcom: sc8180x: Fix LLCC reg property again
      e091caf9
    • Linus Torvalds's avatar
      Merge tag 'char-misc-6.10-final' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/char-misc · f469cf96
      Linus Torvalds authored
      Pull char / misc driver fixes from Greg KH:
       "Here are some small remaining driver fixes for 6.10-final that have
        all been in linux-next for a while and resolve reported issues.
        Included in here are:
      
         - mei driver fixes (and a spelling fix at the end just to be clean)
      
         - iio driver fixes for reported problems
      
         - fastrpc bugfixes
      
         - nvmem small fixes"
      
      * tag 'char-misc-6.10-final' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/char-misc:
        mei: vsc: Fix spelling error
        mei: vsc: Enhance SPI transfer of IVSC ROM
        mei: vsc: Utilize the appropriate byte order swap function
        mei: vsc: Prevent timeout error with added delay post-firmware download
        mei: vsc: Enhance IVSC chipset stability during warm reboot
        nvmem: core: limit cell sysfs permissions to main attribute ones
        nvmem: core: only change name to fram for current attribute
        nvmem: meson-efuse: Fix return value of nvmem callbacks
        nvmem: rmem: Fix return value of rmem_read()
        misc: microchip: pci1xxxx: Fix return value of nvmem callbacks
        hpet: Support 32-bit userspace
        misc: fastrpc: Restrict untrusted app to attach to privileged PD
        misc: fastrpc: Fix ownership reassignment of remote heap
        misc: fastrpc: Fix memory leak in audio daemon attach operation
        misc: fastrpc: Avoid updating PD type for capability request
        misc: fastrpc: Copy the complete capability structure to user
        misc: fastrpc: Fix DSP capabilities request
        iio: light: apds9306: Fix error handing
        iio: trigger: Fix condition for own trigger
      f469cf96
    • Linus Torvalds's avatar
      Merge tag 'tty-6.10-final' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/tty · 1cb67bcc
      Linus Torvalds authored
      Pull tty / serial fixes from Greg KH:
       "Here are some small serial driver fixes for 6.10-final. Included in
        here are:
      
         - qcom-geni fixes for a much much much discussed issue and everyone
           now seems to be agreed that this is the proper way forward to
           resolve the reported lockups
      
         - imx serial driver bugfixes
      
         - 8250_omap errata fix
      
         - ma35d1 serial driver bugfix
      
        All of these have been in linux-next for over a week with no reported
        issues"
      
      * tag 'tty-6.10-final' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/tty:
        serial: qcom-geni: do not kill the machine on fifo underrun
        serial: qcom-geni: fix hard lockup on buffer flush
        serial: qcom-geni: fix soft lockup on sw flow control and suspend
        serial: imx: ensure RTS signal is not left active after shutdown
        tty: serial: ma35d1: Add a NULL check for of_node
        serial: 8250_omap: Fix Errata i2310 with RX FIFO level check
        serial: imx: only set receiver level if it is zero
      1cb67bcc
    • Linus Torvalds's avatar
      Merge tag 'usb-6.10-final' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb · 1293147a
      Linus Torvalds authored
      Pull USB fixes from Greg KH:
       "Here are some small USB driver fixes and new device ids for
        6.10-final. Included in here are:
      
         - new usb-serial device ids for reported devices
      
         - syzbot-triggered duplicate endpoint bugfix
      
         - gadget bugfix for configfs memory overwrite
      
         - xhci resume bugfix
      
         - new device quirk added
      
         - usb core error path bugfix
      
        All of these have been in linux-next (most for a while) with no
        reported issues"
      
      * tag 'usb-6.10-final' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb:
        USB: serial: mos7840: fix crash on resume
        USB: serial: option: add Rolling RW350-GL variants
        USB: serial: option: add support for Foxconn T99W651
        USB: serial: option: add Netprisma LCUK54 series modules
        usb: gadget: configfs: Prevent OOB read/write in usb_string_copy()
        usb: dwc3: pci: add support for the Intel Panther Lake
        usb: core: add missing of_node_put() in usb_of_has_devices_or_graph
        USB: Add USB_QUIRK_NO_SET_INTF quirk for START BP-850k
        USB: core: Fix duplicate endpoint bug by clearing reserved bits in the descriptor
        xhci: always resume roothubs if xHC was reset during resume
        USB: serial: option: add Telit generic core-dump composition
        USB: serial: option: add Fibocom FM350-GL
        USB: serial: option: add Telit FN912 rmnet compositions
      1293147a
    • Linus Torvalds's avatar
      Merge tag 'sound-6.10' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound · 9b48104b
      Linus Torvalds authored
      Pull sound fixes from Takashi Iwai:
       "The majority of changes here are small device-specific fixes for ASoC
        SOF / Intel and usual HD-audio quirks.
      
        The only significant high LOC is found in the Cirrus firmware driver,
        but all those are for hardening against malicious firmware blobs, and
        they look fine for taking as a last minute fix, too"
      
      * tag 'sound-6.10' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound:
        ALSA: hda/realtek: Enable Mute LED on HP 250 G7
        firmware: cs_dsp: Use strnlen() on name fields in V1 wmfw files
        ALSA: hda/realtek: Limit mic boost on VAIO PRO PX
        ALSA: hda: cs35l41: Fix swapped l/r audio channels for Lenovo ThinBook 13x Gen4
        ASoC: SOF: Intel: hda-pcm: Limit the maximum number of periods by MAX_BDL_ENTRIES
        ASoC: rt711-sdw: add missing readable registers
        ASoC: SOF: Intel: hda: fix null deref on system suspend entry
        ALSA: hda/realtek: add quirk for Clevo V5[46]0TU
        firmware: cs_dsp: Prevent buffer overrun when processing V2 alg headers
        firmware: cs_dsp: Validate payload length before processing block
        firmware: cs_dsp: Return error if block header overflows file
        firmware: cs_dsp: Fix overflow checking of wmfw header
      9b48104b
    • Linus Torvalds's avatar
      Merge tag 'bcachefs-2024-07-12' of https://evilpiepirate.org/git/bcachefs · 5d4c8513
      Linus Torvalds authored
      Pull more bcachefs fixes from Kent Overstreet:
      
       - revert the SLAB_ACCOUNT patch, something crazy is going on in memcg
         and someone forgot to test
      
       - minor fixes: missing rcu_read_lock(), scheduling while atomic (in an
         emergency shutdown path)
      
       - two lockdep fixes; these could have gone earlier, but were left to
         bake awhile
      
      * tag 'bcachefs-2024-07-12' of https://evilpiepirate.org/git/bcachefs:
        bcachefs: bch2_gc_btree() should not use btree_root_lock
        bcachefs: Set PF_MEMALLOC_NOFS when trans->locked
        bcachefs; Use trans_unlock_long() when waiting on allocator
        Revert "bcachefs: Mark bch_inode_info as SLAB_ACCOUNT"
        bcachefs: fix scheduling while atomic in break_cycle()
        bcachefs: Fix RCU splat
      5d4c8513
    • David S. Miller's avatar
      Merge branch 'octeontx2-cpt-rss-cfg-fixes' into main · 425652d4
      David S. Miller authored
      Srujana Challa says:
      
      ====================
      Fixes for CPT and RSS configuration
      
      This series of patches fixes various issues related to CPT
      configuration and RSS configuration.
      
      v1->v2:
      - Excluded the patch "octeontx2-af: reduce cpt flt interrupt vectors for
        cn10kb" to submit it to net-next.
      - Addressed the review comments.
      
      Kiran Kumar K (1):
        octeontx2-af: Fix issue with IPv6 ext match for RSS
      ====================
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      425652d4
    • Satheesh Paul's avatar
      octeontx2-af: fix issue with IPv4 match for RSS · 60795bbf
      Satheesh Paul authored
      While performing RSS based on IPv4, packets with
      IPv4 options are not being considered. Adding changes
      to match both plain IPv4 and IPv4 with option header.
      
      Fixes: 41a7aa7b ("octeontx2-af: NIX Rx flowkey configuration for RSS")
      Signed-off-by: default avatarSatheesh Paul <psatheesh@marvell.com>
      Reviewed-by: default avatarKalesh AP <kalesh-anakkur.purayil@broadcom.com>
      Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
      60795bbf