1. 07 Dec, 2022 2 commits
  2. 02 Dec, 2022 1 commit
  3. 23 Nov, 2022 1 commit
  4. 28 Oct, 2022 2 commits
    • Ard Biesheuvel's avatar
      apei/ghes: Use xchg_release() for updating new cache slot instead of cmpxchg() · dd3fa54b
      Ard Biesheuvel authored
      Some documentation first, about how this machinery works:
      
      It seems, the intent of the GHES error records cache is to collect
      already reported errors - see the ghes_estatus_cached() checks. There's
      even a sentence trying to say what this does:
      
        /*
         * GHES error status reporting throttle, to report more kinds of
         * errors, instead of just most frequently occurred errors.
         */
      
      New elements are added to the cache this way:
      
        if (!ghes_estatus_cached(estatus)) {
                if (ghes_print_estatus(NULL, ghes->generic, estatus))
                        ghes_estatus_cache_add(ghes->generic, estatus);
      
      The intent being, once this new error record is reported, it gets cached
      so that it doesn't get reported for a while due to too many, same-type
      error records getting reported in burst-like scenarios. I.e., new,
      unreported error types can have a higher chance of getting reported.
      
      Now, the loop in ghes_estatus_cache_add() is trying to pick out the
      oldest element in there. Meaning, something which got reported already
      but a long while ago, i.e., a LRU-type scheme.
      
      And the cmpxchg() is there presumably to make sure when that selected
      element slot_cache is removed, it really *is* that element that gets
      removed and not one which replaced it in the meantime.
      
      Now, ghes_estatus_cache_add() selects a slot, and either succeeds in
      replacing its contents with a pointer to a newly cached item, or it just
      gives up and frees the new item again, without attempting to select
      another slot even if one might be available.
      
      Since only inserting new items is being done here, the race can only
      cause a failure if the selected slot was updated with another new item
      concurrently, which means that it is arbitrary which of those two items
      gets dropped.
      
      And "dropped" here means, the item doesn't get added to the cache so
      the next time it is seen, it'll get reported again and an insertion
      attempt will be done again. Eventually, it'll get inserted and all those
      times when the insertion fails, the item will get reported although the
      cache is supposed to prevent that and "ratelimit" those repeated error
      records. Not a big deal in any case.
      
      This means the cmpxchg() and the special case are not necessary.
      Therefore, just drop the existing item unconditionally.
      
      Move the xchg_release() and call_rcu() out of rcu_read_lock/unlock
      section since there is no actually dereferencing the pointer at all.
      
        [ bp:
          - Flesh out and summarize what was discussed on the thread now
            that that cache contraption is understood;
          - Touch up code style. ]
      Co-developed-by: default avatarJia He <justin.he@arm.com>
      Signed-off-by: default avatarJia He <justin.he@arm.com>
      Signed-off-by: default avatarArd Biesheuvel <ardb@kernel.org>
      Signed-off-by: default avatarBorislav Petkov <bp@suse.de>
      Link: https://lore.kernel.org/r/20221010023559.69655-7-justin.he@arm.comSigned-off-by: default avatarRafael J. Wysocki <rafael.j.wysocki@intel.com>
      dd3fa54b
    • Uwe Kleine-König's avatar
      ACPI: APEI: Drop unsetting driver data on remove · 36006ccb
      Uwe Kleine-König authored
      Since commit 0998d063 ("device-core: Ensure drvdata = NULL when no
      driver is bound") the driver core cares for cleaning driver data, so
      don't do it in the driver, too.
      Signed-off-by: default avatarUwe Kleine-König <u.kleine-koenig@pengutronix.de>
      Signed-off-by: default avatarRafael J. Wysocki <rafael.j.wysocki@intel.com>
      36006ccb
  5. 13 Oct, 2022 1 commit
    • Ashish Kalra's avatar
      ACPI: APEI: Fix integer overflow in ghes_estatus_pool_init() · 43d27483
      Ashish Kalra authored
      Change num_ghes from int to unsigned int, preventing an overflow
      and causing subsequent vmalloc() to fail.
      
      The overflow happens in ghes_estatus_pool_init() when calculating
      len during execution of the statement below as both multiplication
      operands here are signed int:
      
      len += (num_ghes * GHES_ESOURCE_PREALLOC_MAX_SIZE);
      
      The following call trace is observed because of this bug:
      
      [    9.317108] swapper/0: vmalloc error: size 18446744071562596352, exceeds total pages, mode:0xcc0(GFP_KERNEL), nodemask=(null),cpuset=/,mems_allowed=0-1
      [    9.317131] Call Trace:
      [    9.317134]  <TASK>
      [    9.317137]  dump_stack_lvl+0x49/0x5f
      [    9.317145]  dump_stack+0x10/0x12
      [    9.317146]  warn_alloc.cold+0x7b/0xdf
      [    9.317150]  ? __device_attach+0x16a/0x1b0
      [    9.317155]  __vmalloc_node_range+0x702/0x740
      [    9.317160]  ? device_add+0x17f/0x920
      [    9.317164]  ? dev_set_name+0x53/0x70
      [    9.317166]  ? platform_device_add+0xf9/0x240
      [    9.317168]  __vmalloc_node+0x49/0x50
      [    9.317170]  ? ghes_estatus_pool_init+0x43/0xa0
      [    9.317176]  vmalloc+0x21/0x30
      [    9.317177]  ghes_estatus_pool_init+0x43/0xa0
      [    9.317179]  acpi_hest_init+0x129/0x19c
      [    9.317185]  acpi_init+0x434/0x4a4
      [    9.317188]  ? acpi_sleep_proc_init+0x2a/0x2a
      [    9.317190]  do_one_initcall+0x48/0x200
      [    9.317195]  kernel_init_freeable+0x221/0x284
      [    9.317200]  ? rest_init+0xe0/0xe0
      [    9.317204]  kernel_init+0x1a/0x130
      [    9.317205]  ret_from_fork+0x22/0x30
      [    9.317208]  </TASK>
      Signed-off-by: default avatarAshish Kalra <ashish.kalra@amd.com>
      [ rjw: Subject and changelog edits ]
      Signed-off-by: default avatarRafael J. Wysocki <rafael.j.wysocki@intel.com>
      43d27483
  6. 10 Oct, 2022 24 commits
    • Linus Torvalds's avatar
      Merge tag 'acpi-6.1-rc1-2' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm · 3a1e24fa
      Linus Torvalds authored
      Pull more ACPI updates from Rafael Wysocki:
       "These fix two issues, in APEI and in the int3472 driver, clean up the
        ACPI thermal driver, add ACPI support for non-GPE system wakeup events
        and make the system reboot code use the S5 (system off) state by
        default.
      
        Specifics:
      
         - Fix ACPI device object reference counting in (recently updated)
           skl_int3472_fill_clk_pdata() (Andy Shevchenko).
      
         - Fix a memory leak in APEI by avoiding to add a task_work to kernel
           threads running when an asynchronous error is detected (Shuai Xue).
      
         - Add ACPI support for handling system wakeups via GPIO wake capable
           IRQs in addition to GPEs (Raul E Rangel).
      
         - Make the system reboot code put ACPI-enabled systems into the S5
           (system off) state which is necessary for some platforms to work as
           expected (Kai-Heng Feng).
      
         - Make the white space usage in the ACPI thermal driver more
           consistent and drop redundant code from it (Rafael Wysocki)"
      
      * tag 'acpi-6.1-rc1-2' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm:
        ACPI: thermal: Drop some redundant code
        ACPI: thermal: Drop redundant parens from expressions
        ACPI: thermal: Use white space more consistently
        platform/x86: int3472: Don't leak reference on error
        ACPI: APEI: do not add task_work to kernel thread to avoid memory leak
        PM: ACPI: reboot: Reinstate S5 for reboot
        kernel/reboot: Add SYS_OFF_MODE_RESTART_PREPARE mode
        ACPI: PM: Take wake IRQ into consideration when entering suspend-to-idle
        i2c: acpi: Use ACPI wake capability bit to set wake_irq
        ACPI: resources: Add wake_capable parameter to acpi_dev_irq_flags
        gpiolib: acpi: Add wake_capable variants of acpi_dev_gpio_irq_get
      3a1e24fa
    • Linus Torvalds's avatar
      Merge tag 'dma-mapping-6.1-2022-10-10' of git://git.infradead.org/users/hch/dma-mapping · 10b22b53
      Linus Torvalds authored
      Pull dma-mapping updates from Christoph Hellwig:
      
       - fix a regression in the ARM dma-direct conversion (Christoph Hellwig)
      
       - use memcpy_{from,to}_page (Fabio M. De Francesco)
      
       - cleanup the swiotlb MAINTAINERS entry (Lukas Bulwahn)
      
       - make SG table pool allocation less fragile (Masahiro Yamada)
      
       - don't panic on swiotlb initialization failure (Robin Murphy)
      
      * tag 'dma-mapping-6.1-2022-10-10' of git://git.infradead.org/users/hch/dma-mapping:
        ARM/dma-mapping: remove the dma_coherent member of struct dev_archdata
        ARM/dma-mappіng: don't override ->dma_coherent when set from a bus notifier
        lib/sg_pool: change module_init(sg_pool_init) to subsys_initcall
        MAINTAINERS: merge SWIOTLB SUBSYSTEM into DMA MAPPING HELPERS
        swiotlb: don't panic!
        swiotlb: replace kmap_atomic() with memcpy_{from,to}_page()
      10b22b53
    • Linus Torvalds's avatar
      Merge tag 'iommu-updates-v6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/joro/iommu · f23cdfcd
      Linus Torvalds authored
      Pull iommu updates from Joerg Roedel:
      
       - remove the bus_set_iommu() interface which became unnecesary because
         of IOMMU per-device probing
      
       - make the dma-iommu.h header private
      
       - Intel VT-d changes from Lu Baolu:
      	  - Decouple PASID and PRI from SVA
      	  - Add ESRTPS & ESIRTPS capability check
      	  - Cleanups
      
       - Apple DART support for the M1 Pro/MAX SOCs
      
       - support for AMD IOMMUv2 page-tables for the DMA-API layer.
      
         The v2 page-tables are compatible with the x86 CPU page-tables. Using
         them for DMA-API prepares support for hardware-assisted IOMMU
         virtualization
      
       - support for MT6795 Helio X10 M4Us in the Mediatek IOMMU driver
      
       - some smaller fixes and cleanups
      
      * tag 'iommu-updates-v6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/joro/iommu: (59 commits)
        iommu/vt-d: Avoid unnecessary global DMA cache invalidation
        iommu/vt-d: Avoid unnecessary global IRTE cache invalidation
        iommu/vt-d: Rename cap_5lp_support to cap_fl5lp_support
        iommu/vt-d: Remove pasid_set_eafe()
        iommu/vt-d: Decouple PASID & PRI enabling from SVA
        iommu/vt-d: Remove unnecessary SVA data accesses in page fault path
        dt-bindings: iommu: arm,smmu-v3: Relax order of interrupt names
        iommu: dart: Support t6000 variant
        iommu/io-pgtable-dart: Add DART PTE support for t6000
        iommu/io-pgtable: Add DART subpage protection support
        iommu/io-pgtable: Move Apple DART support to its own file
        iommu/mediatek: Add support for MT6795 Helio X10 M4Us
        iommu/mediatek: Introduce new flag TF_PORT_TO_ADDR_MT8173
        dt-bindings: mediatek: Add bindings for MT6795 M4U
        iommu/iova: Fix module config properly
        iommu/amd: Fix sparse warning
        iommu/amd: Remove outdated comment
        iommu/amd: Free domain ID after domain_flush_pages
        iommu/amd: Free domain id in error path
        iommu/virtio: Fix compile error with viommu_capable()
        ...
      f23cdfcd
    • Linus Torvalds's avatar
      Merge tag 'devicetree-for-6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/robh/linux · 706eacad
      Linus Torvalds authored
      Pull devicetree updates from Rob Herring:
       "DT core:
      
         - Fix node refcounting in of_find_last_cache_level()
      
         - Constify device_node in of_device_compatible_match()
      
         - Fix 'dma-ranges' handling in bus controller nodes
      
         - Fix handling of initrd start > end
      
         - Improve error reporting in of_irq_init()
      
         - Taint kernel on DT unittest running
      
         - Use strscpy instead of strlcpy
      
         - Add a build target, dt_compatible_check, to check for compatible
           strings used in kernel sources against compatible strings in DT
           schemas.
      
         - Handle DT_SCHEMA_FILES changes when rebuilding
      
        DT bindings:
      
         - LED bindings for MT6370 PMIC
      
         - Convert Mediatek mtk-gce mailbox, MIPS CPU interrupt controller,
           mt7621 I2C, virtio,pci-iommu, nxp,tda998x, QCom fastrpc, qcom,pdc,
           and arm,versatile-sysreg to DT schema format
      
         - Add nvmem cells to u-boot,env schema
      
         - Add more LED_COLOR_ID definitions
      
         - Require 'opp-table' uses to be a node
      
         - Various schema fixes to match QEMU 'virt' DT usage
      
         - Tree wide dropping of redundant 'Device Tree Binding' in schema
           titles
      
         - More (unevaluated|additional)Properties fixes in schema child nodes
      
         - Drop various redundant minItems equal to maxItems"
      
      * tag 'devicetree-for-6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/robh/linux: (62 commits)
        of: base: Shift refcount decrement in of_find_last_cache_level()
        dt-bindings: leds: Add MediaTek MT6370 flashlight
        dt-bindings: leds: mt6370: Add MediaTek MT6370 current sink type LED indicator
        dt-bindings: mailbox: Convert mtk-gce to DT schema
        of: base: make of_device_compatible_match() accept const device node
        of: Fix "dma-ranges" handling for bus controllers
        of: fdt: Remove unused struct fdt_scan_status
        dt-bindings: display: st,stm32-dsi: Handle data-lanes in DSI port node
        dt-bindings: timer: Add power-domains for TI timer-dm on K3
        dt: Add a check for undocumented compatible strings in kernel
        kbuild: take into account DT_SCHEMA_FILES changes while checking dtbs
        dt-bindings: interrupt-controller: migrate MIPS CPU interrupt controller text bindings to YAML
        dt-bindings: i2c: migrate mt7621 text bindings to YAML
        dt-bindings: power: gpcv2: correct patternProperties
        dt-bindings: virtio: Convert virtio,pci-iommu to DT schema
        dt-bindings: timer: arm,arch_timer: Allow dual compatible string
        dt-bindings: arm: cpus: Add kryo240 compatible
        dt-bindings: display: bridge: nxp,tda998x: Convert to json-schema
        dt-bindings: nvmem: u-boot,env: add basic NVMEM cells
        dt-bindings: remoteproc: qcom,adsp: enforce smd-edge schema
        ...
      706eacad
    • Linus Torvalds's avatar
      Merge tag 'tpmdd-next-v6.1-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/jarkko/linux-tpmdd · ada3bfb6
      Linus Torvalds authored
      Pull tpm updates from Jarkko Sakkinen:
       "Just a few bug fixes this time"
      
      * tag 'tpmdd-next-v6.1-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/jarkko/linux-tpmdd:
        selftest: tpm2: Add Client.__del__() to close /dev/tpm* handle
        security/keys: Remove inconsistent __user annotation
        char: move from strlcpy with unused retval to strscpy
      ada3bfb6
    • Linus Torvalds's avatar
      Merge tag 'v6.1-p1' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6 · 3604a7f5
      Linus Torvalds authored
      Pull crypto updates from Herbert Xu:
       "API:
         - Feed untrusted RNGs into /dev/random
         - Allow HWRNG sleeping to be more interruptible
         - Create lib/utils module
         - Setting private keys no longer required for akcipher
         - Remove tcrypt mode=1000
         - Reorganised Kconfig entries
      
        Algorithms:
         - Load x86/sha512 based on CPU features
         - Add AES-NI/AVX/x86_64/GFNI assembler implementation of aria cipher
      
        Drivers:
         - Add HACE crypto driver aspeed"
      
      * tag 'v6.1-p1' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6: (124 commits)
        crypto: aspeed - Remove redundant dev_err call
        crypto: scatterwalk - Remove unused inline function scatterwalk_aligned()
        crypto: aead - Remove unused inline functions from aead
        crypto: bcm - Simplify obtain the name for cipher
        crypto: marvell/octeontx - use sysfs_emit() to instead of scnprintf()
        hwrng: core - start hwrng kthread also for untrusted sources
        crypto: zip - remove the unneeded result variable
        crypto: qat - add limit to linked list parsing
        crypto: octeontx2 - Remove the unneeded result variable
        crypto: ccp - Remove the unneeded result variable
        crypto: aspeed - Fix check for platform_get_irq() errors
        crypto: virtio - fix memory-leak
        crypto: cavium - prevent integer overflow loading firmware
        crypto: marvell/octeontx - prevent integer overflows
        crypto: aspeed - fix build error when only CRYPTO_DEV_ASPEED is enabled
        crypto: hisilicon/qm - fix the qos value initialization
        crypto: sun4i-ss - use DEFINE_SHOW_ATTRIBUTE to simplify sun4i_ss_debugfs
        crypto: tcrypt - add async speed test for aria cipher
        crypto: aria-avx - add AES-NI/AVX/x86_64/GFNI assembler implementation of aria cipher
        crypto: aria - prepare generic module for optimized implementations
        ...
      3604a7f5
    • Linus Torvalds's avatar
      Merge tag 'bitmap-6.1-rc1' of https://github.com/norov/linux · d4013bc4
      Linus Torvalds authored
      Pull bitmap updates from Yury Norov:
      
       - Fix unsigned comparison to -1 in CPUMAP_FILE_MAX_BYTES (Phil Auld)
      
       - cleanup nr_cpu_ids vs nr_cpumask_bits mess (me)
      
         This series cleans that mess and adds new config FORCE_NR_CPUS that
         allows to optimize cpumask subsystem if the number of CPUs is known
         at compile-time.
      
       - optimize find_bit() functions (me)
      
         Reworks find_bit() functions based on new FIND_{FIRST,NEXT}_BIT()
         macros.
      
       - add find_nth_bit() (me)
      
         Adds find_nth_bit(), which is ~70 times faster than bitcounting with
         for_each() loop:
      
      	for_each_set_bit(bit, mask, size)
      		if (n-- == 0)
      			return bit;
      
         Also adds bitmap_weight_and() to let people replace this pattern:
      
      	tmp = bitmap_alloc(nbits);
      	bitmap_and(tmp, map1, map2, nbits);
      	weight = bitmap_weight(tmp, nbits);
      	bitmap_free(tmp);
      
         with a single bitmap_weight_and() call.
      
       - repair cpumask_check() (me)
      
         After switching cpumask to use nr_cpu_ids, cpumask_check() started
         generating many false-positive warnings. This series fixes it.
      
       - Add for_each_cpu_andnot() and for_each_cpu_andnot() (Valentin
         Schneider)
      
         Extends the API with one more function and applies it in sched/core.
      
      * tag 'bitmap-6.1-rc1' of https://github.com/norov/linux: (28 commits)
        sched/core: Merge cpumask_andnot()+for_each_cpu() into for_each_cpu_andnot()
        lib/test_cpumask: Add for_each_cpu_and(not) tests
        cpumask: Introduce for_each_cpu_andnot()
        lib/find_bit: Introduce find_next_andnot_bit()
        cpumask: fix checking valid cpu range
        lib/bitmap: add tests for for_each() loops
        lib/find: optimize for_each() macros
        lib/bitmap: introduce for_each_set_bit_wrap() macro
        lib/find_bit: add find_next{,_and}_bit_wrap
        cpumask: switch for_each_cpu{,_not} to use for_each_bit()
        net: fix cpu_max_bits_warn() usage in netif_attrmask_next{,_and}
        cpumask: add cpumask_nth_{,and,andnot}
        lib/bitmap: remove bitmap_ord_to_pos
        lib/bitmap: add tests for find_nth_bit()
        lib: add find_nth{,_and,_andnot}_bit()
        lib/bitmap: add bitmap_weight_and()
        lib/bitmap: don't call __bitmap_weight() in kernel code
        tools: sync find_bit() implementation
        lib/find_bit: optimize find_next_bit() functions
        lib/find_bit: create find_first_zero_bit_le()
        ...
      d4013bc4
    • Linus Torvalds's avatar
      Merge tag 'trace-v6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace · cdf072ac
      Linus Torvalds authored
      Pull tracing updates from Steven Rostedt:
       "Major changes:
      
         - Changed location of tracing repo from personal git repo to:
           git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace.git
      
         - Added Masami Hiramatsu as co-maintainer
      
         - Updated MAINTAINERS file to separate out FTRACE as it is more than
           just TRACING.
      
        Minor changes:
      
         - Added Mark Rutland as FTRACE reviewer
      
         - Updated user_events to make it on its way to remove the BROKEN tag.
           The changes should now be acceptable but will run it through a
           cycle and hopefully we can remove the BROKEN tag next release.
      
         - Added filtering to eprobes
      
         - Added a delta time to the benchmark trace event
      
         - Have the histogram and filter callbacks called via a switch
           statement instead of indirect functions. This speeds it up to avoid
           retpolines.
      
         - Add a way to wake up ring buffer waiters waiting for the ring
           buffer to fill up to its watermark.
      
         - New ioctl() on the trace_pipe_raw file to wake up ring buffer
           waiters.
      
         - Wake up waiters when the ring buffer is disabled. A reader may
           block when the ring buffer is disabled, but if it was blocked when
           the ring buffer is disabled it should then wake up.
      
        Fixes:
      
         - Allow splice to read partially read ring buffer pages. This fixes
           splice never moving forward.
      
         - Fix inverted compare that made the "shortest" ring buffer wait
           queue actually the longest.
      
         - Fix a race in the ring buffer between resetting a page when a
           writer goes to another page, and the reader.
      
         - Fix ftrace accounting bug when function hooks are added at boot up
           before the weak functions are set to "disabled".
      
         - Fix bug that freed a user allocated snapshot buffer when enabling a
           tracer.
      
         - Fix possible recursive locks in osnoise tracer
      
         - Fix recursive locking direct functions
      
         - Other minor clean ups and fixes"
      
      * tag 'trace-v6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace: (44 commits)
        ftrace: Create separate entry in MAINTAINERS for function hooks
        tracing: Update MAINTAINERS to reflect new tracing git repo
        tracing: Do not free snapshot if tracer is on cmdline
        ftrace: Still disable enabled records marked as disabled
        tracing/user_events: Move pages/locks into groups to prepare for namespaces
        tracing: Add Masami Hiramatsu as co-maintainer
        tracing: Remove unused variable 'dups'
        MAINTAINERS: add myself as a tracing reviewer
        ring-buffer: Fix race between reset page and reading page
        tracing/user_events: Update ABI documentation to align to bits vs bytes
        tracing/user_events: Use bits vs bytes for enabled status page data
        tracing/user_events: Use refcount instead of atomic for ref tracking
        tracing/user_events: Ensure user provided strings are safely formatted
        tracing/user_events: Use WRITE instead of READ for io vector import
        tracing/user_events: Use NULL for strstr checks
        tracing: Fix spelling mistake "preapre" -> "prepare"
        tracing: Wake up waiters when tracing is disabled
        tracing: Add ioctl() to force ring buffer waiters to wake up
        tracing: Wake up ring buffer waiters on closing of the file
        ring-buffer: Add ring_buffer_wake_waiters()
        ...
      cdf072ac
    • Linus Torvalds's avatar
      Merge tag 'sysctl-6.1-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/mcgrof/linux · dc553428
      Linus Torvalds authored
      Pull sysctl updates from Luis Chamberlain:
       "Just some boring cleanups on the sysctl front for this release"
      
      * tag 'sysctl-6.1-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/mcgrof/linux:
        kernel/sysctl-test: use SYSCTL_{ZERO/ONE_HUNDRED} instead of i_{zero/one_hundred}
        kernel/sysctl.c: move sysctl_vals and sysctl_long_vals to sysctl.c
        sysctl: remove max_extfrag_threshold
        kernel/sysctl.c: remove unnecessary (void*) conversions
        proc: remove initialization assignment
      dc553428
    • Aaron Tomlin's avatar
      module: tracking: Keep a record of tainted unloaded modules only · 47cc75aa
      Aaron Tomlin authored
      This ensures that no module record/or entry is added to the
      unloaded_tainted_modules list if it does not carry a taint.
      Reported-by: default avatarAlexey Dobriyan <adobriyan@gmail.com>
      Fixes: 99bd9956 ("module: Introduce module unload taint tracking")
      Signed-off-by: default avatarAaron Tomlin <atomlin@redhat.com>
      Acked-by: default avatarLuis Chamberlain <mcgrof@kernel.org>
      Signed-off-by: default avatarLinus Torvalds <torvalds@linux-foundation.org>
      47cc75aa
    • Linus Torvalds's avatar
      Merge tag 'modules-6.1-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/mcgrof/linux · 385f4a10
      Linus Torvalds authored
      Pull module updates from Luis Chamberlain:
      
       - minor enhancement for sysfs compression string (David Disseldorp)
      
       - debugfs interface to view unloaded tainted modules (Aaron Tomlin)
      
      * tag 'modules-6.1-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/mcgrof/linux:
        module/decompress: generate sysfs string at compile time
        module: Add debugfs interface to view unloaded tainted modules
      385f4a10
    • Linus Torvalds's avatar
      Merge tag 'kbuild-v6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild · 8afc66e8
      Linus Torvalds authored
      Pull Kbuild updates from Masahiro Yamada:
      
       - Remove potentially incomplete targets when Kbuid is interrupted by
         SIGINT etc in case GNU Make may miss to do that when stderr is piped
         to another program.
      
       - Rewrite the single target build so it works more correctly.
      
       - Fix rpm-pkg builds with V=1.
      
       - List top-level subdirectories in ./Kbuild.
      
       - Ignore auto-generated __kstrtab_* and __kstrtabns_* symbols in
         kallsyms.
      
       - Avoid two different modules in lib/zstd/ having shared code, which
         potentially causes building the common code as build-in and modular
         back-and-forth.
      
       - Unify two modpost invocations to optimize the build process.
      
       - Remove head-y syntax in favor of linker scripts for placing
         particular sections in the head of vmlinux.
      
       - Bump the minimal GNU Make version to 3.82.
      
       - Clean up misc Makefiles and scripts.
      
      * tag 'kbuild-v6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild: (41 commits)
        docs: bump minimal GNU Make version to 3.82
        ia64: simplify esi object addition in Makefile
        Revert "kbuild: Check if linker supports the -X option"
        kbuild: rebuild .vmlinux.export.o when its prerequisite is updated
        kbuild: move modules.builtin(.modinfo) rules to Makefile.vmlinux_o
        zstd: Fixing mixed module-builtin objects
        kallsyms: ignore __kstrtab_* and __kstrtabns_* symbols
        kallsyms: take the input file instead of reading stdin
        kallsyms: drop duplicated ignore patterns from kallsyms.c
        kbuild: reuse mksysmap output for kallsyms
        mksysmap: update comment about __crc_*
        kbuild: remove head-y syntax
        kbuild: use obj-y instead extra-y for objects placed at the head
        kbuild: hide error checker logs for V=1 builds
        kbuild: re-run modpost when it is updated
        kbuild: unify two modpost invocations
        kbuild: move vmlinux.o rule to the top Makefile
        kbuild: move .vmlinux.objs rule to Makefile.modpost
        kbuild: list sub-directories in ./Kbuild
        Makefile.compiler: replace cc-ifversion with compiler-specific macros
        ...
      8afc66e8
    • Linus Torvalds's avatar
      Merge tag 'livepatching-for-6.1' of... · 4de65c58
      Linus Torvalds authored
      Merge tag 'livepatching-for-6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/livepatching/livepatching
      
      Pull livepatching updates from Petr Mladek:
      
       - Fix race between fork and livepatch transition revert
      
       - Add sysfs entry that shows "patched" state for each object (module)
         that can be livepatched by the given livepatch
      
       - Some clean up
      
      * tag 'livepatching-for-6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/livepatching/livepatching:
        selftests/livepatch: add sysfs test
        livepatch: add sysfs entry "patched" for each klp_object
        selftests/livepatch: normalize sysctl error message
        livepatch: Add a missing newline character in klp_module_coming()
        livepatch: fix race between fork and KLP transition
      4de65c58
    • Linus Torvalds's avatar
      Merge tag 'printk-for-6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/printk/linux · b5204106
      Linus Torvalds authored
      Pull printk updates from Petr Mladek:
      
       - Initialize pointer hashing using the system workqueue. It avoids
         taking locks in printk()/vsprintf() code path
      
       - Misc code clean up
      
      * tag 'printk-for-6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/printk/linux:
        printk: Mark __printk percpu data ready __ro_after_init
        printk: Remove bogus comment vs. boot consoles
        printk: Remove write only variable nr_ext_console_drivers
        printk: Declare log_wait properly
        printk: Make pr_flush() static
        lib/vsprintf: Initialize vsprintf's pointer hash once the random core is ready.
        lib/vsprintf: Remove static_branch_likely() from __ptr_to_hashval().
        lib/vnsprintf: add const modifier for param 'bitmap'
      b5204106
    • Linus Torvalds's avatar
      Merge tag 'cgroup-for-6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/cgroup · adf4bfc4
      Linus Torvalds authored
      Pull cgroup updates from Tejun Heo:
      
       - cpuset now support isolated cpus.partition type, which will enable
         dynamic CPU isolation
      
       - pids.peak added to remember the max number of pids used
      
       - holes in cgroup namespace plugged
      
       - internal cleanups
      
      * tag 'cgroup-for-6.1' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/cgroup: (25 commits)
        cgroup: use strscpy() is more robust and safer
        iocost_monitor: reorder BlkgIterator
        cgroup: simplify code in cgroup_apply_control
        cgroup: Make cgroup_get_from_id() prettier
        cgroup/cpuset: remove unreachable code
        cgroup: Remove CFTYPE_PRESSURE
        cgroup: Improve cftype add/rm error handling
        kselftest/cgroup: Add cpuset v2 partition root state test
        cgroup/cpuset: Update description of cpuset.cpus.partition in cgroup-v2.rst
        cgroup/cpuset: Make partition invalid if cpumask change violates exclusivity rule
        cgroup/cpuset: Relocate a code block in validate_change()
        cgroup/cpuset: Show invalid partition reason string
        cgroup/cpuset: Add a new isolated cpus.partition type
        cgroup/cpuset: Relax constraints to partition & cpus changes
        cgroup/cpuset: Allow no-task partition to have empty cpuset.cpus.effective
        cgroup/cpuset: Miscellaneous cleanups & add helper functions
        cgroup/cpuset: Enable update_tasks_cpumask() on top_cpuset
        cgroup: add pids.peak interface for pids controller
        cgroup: Remove data-race around cgrp_dfl_visible
        cgroup: Fix build failure when CONFIG_SHRINKER_DEBUG
        ...
      adf4bfc4
    • Linus Torvalds's avatar
      Merge tag 'random-6.1-rc1-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/crng/random · 8adc0486
      Linus Torvalds authored
      Pull random number generator updates from Jason Donenfeld:
      
       - Huawei reported that when they updated their kernel from 4.4 to
         something much newer, some userspace code they had broke, the culprit
         being the accidental removal of O_NONBLOCK from /dev/random way back
         in 5.6. It's been gone for over 2 years now and this is the first
         we've heard of it, but userspace breakage is userspace breakage, so
         O_NONBLOCK is now back.
      
       - Use randomness from hardware RNGs much more often during early boot,
         at the same interval that crng reseeds are done, from Dominik.
      
       - A semantic change in hardware RNG throttling, so that the hwrng
         framework can properly feed random.c with randomness from hardware
         RNGs that aren't specifically marked as creditable.
      
         A related patch coming to you via Herbert's hwrng tree depends on
         this one, not to compile, but just to function properly, so you may
         want to merge this PULL before that one.
      
       - A fix to clamp credited bits from the interrupts pool to the size of
         the pool sample. This is mainly just a theoretical fix, as it'd be
         pretty hard to exceed it in practice.
      
       - Oracle reported that InfiniBand TCP latency regressed by around
         10-15% after a change a few cycles ago made at the request of the RT
         folks, in which we hoisted a somewhat rare operation (1 in 1024
         times) out of the hard IRQ handler and into a workqueue, a pretty
         common and boring pattern.
      
         It turns out, though, that scheduling a worker from there has
         overhead of its own, whereas scheduling a timer on that same CPU for
         the next jiffy amortizes better and doesn't incur the same overhead.
      
         I also eliminated a cache miss by moving the work_struct (and
         subsequently, the timer_list) to below a critical cache line, so that
         the more critical members that are accessed on every hard IRQ aren't
         split between two cache lines.
      
       - The boot-time initialization of the RNG has been split into two
         approximate phases: what we can accomplish before timekeeping is
         possible and what we can accomplish after.
      
         This winds up being useful so that we can use RDRAND to seed the RNG
         before CONFIG_SLAB_FREELIST_RANDOM=y systems initialize slabs, in
         addition to other early uses of randomness. The effect is that
         systems with RDRAND (or a bootloader seed) will never see any
         warnings at all when setting CONFIG_WARN_ALL_UNSEEDED_RANDOM=y. And
         kfence benefits from getting a better seed of its own.
      
       - Small systems without much entropy sometimes wind up putting some
         truncated serial number read from flash into hostname, so contribute
         utsname changes to the RNG, without crediting.
      
       - Add smaller batches to serve requests for smaller integers, and make
         use of them when people ask for random numbers bounded by a given
         compile-time constant. This has positive effects all over the tree,
         most notably in networking and kfence.
      
       - The original jitter algorithm intended (I believe) to schedule the
         timer for the next jiffy, not the next-next jiffy, yet it used
         mod_timer(jiffies + 1), which will fire on the next-next jiffy,
         instead of what I believe was intended, mod_timer(jiffies), which
         will fire on the next jiffy. So fix that.
      
       - Fix a comment typo, from William.
      
      * tag 'random-6.1-rc1-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/crng/random:
        random: clear new batches when bringing new CPUs online
        random: fix typos in get_random_bytes() comment
        random: schedule jitter credit for next jiffy, not in two jiffies
        prandom: make use of smaller types in prandom_u32_max
        random: add 8-bit and 16-bit batches
        utsname: contribute changes to RNG
        random: use init_utsname() instead of utsname()
        kfence: use better stack hash seed
        random: split initialization into early step and later step
        random: use expired timer rather than wq for mixing fast pool
        random: avoid reading two cache lines on irq randomness
        random: clamp credited irq bits to maximum mixed
        random: throttle hwrng writes if no entropy is credited
        random: use hwgenerator randomness more frequently at early boot
        random: restore O_NONBLOCK support
      8adc0486
    • Linus Torvalds's avatar
      Merge tag 'slab-for-6.1-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/vbabka/slab · 52abb27a
      Linus Torvalds authored
      Pull slab fixes from Vlastimil Babka:
      
       - The "common kmalloc v4" series [1] by Hyeonggon Yoo.
      
         While the plan after LPC is to try again if it's possible to get rid
         of SLOB and SLAB (and if any critical aspect of those is not possible
         to achieve with SLUB today, modify it accordingly), it will take a
         while even in case there are no objections.
      
         Meanwhile this is a nice cleanup and some parts (e.g. to the
         tracepoints) will be useful even if we end up with a single slab
         implementation in the future:
      
            - Improves the mm/slab_common.c wrappers to allow deleting
              duplicated code between SLAB and SLUB.
      
            - Large kmalloc() allocations in SLAB are passed to page allocator
              like in SLUB, reducing number of kmalloc caches.
      
            - Removes the {kmem_cache_alloc,kmalloc}_node variants of
              tracepoints, node id parameter added to non-_node variants.
      
       - Addition of kmalloc_size_roundup()
      
         The first two patches from a series by Kees Cook [2] that introduce
         kmalloc_size_roundup(). This will allow merging of per-subsystem
         patches using the new function and ultimately stop (ab)using ksize()
         in a way that causes ongoing trouble for debugging functionality and
         static checkers.
      
       - Wasted kmalloc() memory tracking in debugfs alloc_traces
      
         A patch from Feng Tang that enhances the existing debugfs
         alloc_traces file for kmalloc caches with information about how much
         space is wasted by allocations that needs less space than the
         particular kmalloc cache provides.
      
       - My series [3] to fix validation races for caches with enabled
         debugging:
      
            - By decoupling the debug cache operation more from non-debug
              fastpaths, extra locking simplifications were possible and thus
              done afterwards.
      
            - Additional cleanup of PREEMPT_RT specific code on top, by Thomas
              Gleixner.
      
            - A late fix for slab page leaks caused by the series, by Feng
              Tang.
      
       - Smaller fixes and cleanups:
      
            - Unneeded variable removals, by ye xingchen
      
            - A cleanup removing a BUG_ON() in create_unique_id(), by Chao Yu
      
      Link: https://lore.kernel.org/all/20220817101826.236819-1-42.hyeyoo@gmail.com/ [1]
      Link: https://lore.kernel.org/all/20220923202822.2667581-1-keescook@chromium.org/ [2]
      Link: https://lore.kernel.org/all/20220823170400.26546-1-vbabka@suse.cz/ [3]
      
      * tag 'slab-for-6.1-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/vbabka/slab: (30 commits)
        mm/slub: fix a slab missed to be freed problem
        slab: Introduce kmalloc_size_roundup()
        slab: Remove __malloc attribute from realloc functions
        mm/slub: clean up create_unique_id()
        mm/slub: enable debugging memory wasting of kmalloc
        slub: Make PREEMPT_RT support less convoluted
        mm/slub: simplify __cmpxchg_double_slab() and slab_[un]lock()
        mm/slub: convert object_map_lock to non-raw spinlock
        mm/slub: remove slab_lock() usage for debug operations
        mm/slub: restrict sysfs validation to debug caches and make it safe
        mm/sl[au]b: check if large object is valid in __ksize()
        mm/slab_common: move declaration of __ksize() to mm/slab.h
        mm/slab_common: drop kmem_alloc & avoid dereferencing fields when not using
        mm/slab_common: unify NUMA and UMA version of tracepoints
        mm/sl[au]b: cleanup kmem_cache_alloc[_node]_trace()
        mm/sl[au]b: generalize kmalloc subsystem
        mm/slub: move free_debug_processing() further
        mm/sl[au]b: introduce common alloc/free functions without tracepoint
        mm/slab: kmalloc: pass requests larger than order-1 page to page allocator
        mm/slab_common: cleanup kmalloc_large()
        ...
      52abb27a
    • Linus Torvalds's avatar
      Merge tag 'timers-core-2022-10-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 55be6084
      Linus Torvalds authored
      Pull timer updates from Thomas Gleixner:
       "A boring time, timekeeping, timers update:
      
         - No core code changes
      
         - No new clocksource/event driver
      
         - Cleanup of the TI DM clocksource/event driver
      
         - The usual set of device tree binding updates
      
         - Small improvement, fixes and cleanups all over the place"
      
      * tag 'timers-core-2022-10-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (22 commits)
        clocksource/drivers/arm_arch_timer: Fix CNTPCT_LO and CNTVCT_LO value
        clocksource/drivers/imx-sysctr: handle nxp,no-divider property
        dt-bindings: timer: nxp,sysctr-timer: add nxp,no-divider property
        clocksource/drivers/timer-ti-dm: Get clock in probe with devm_clk_get()
        clocksource/drivers/timer-ti-dm: Add flag to detect omap1
        clocksource/drivers/timer-ti-dm: Move struct omap_dm_timer fields to driver
        clocksource/drivers/timer-ti-dm: Use runtime PM directly and check errors
        clocksource/drivers/timer-ti-dm: Move private defines to the driver
        clocksource/drivers/timer-ti-dm: Simplify register access further
        clocksource/drivers/timer-ti-dm: Simplify register writes with dmtimer_write()
        clocksource/drivers/timer-ti-dm: Simplify register reads with dmtimer_read()
        clocksource/drivers/timer-ti-dm: Drop unused functions
        clocksource/drivers/timer-gxp: Add missing error handling in gxp_timer_probe
        clocksource/drivers/arm_arch_timer: Fix handling of ARM erratum 858921
        clocksource/drivers/exynos_mct: Enable building on ARTPEC
        clocksource/drivers/exynos_mct: Support local-timers property
        clocksource/drivers/exynos_mct: Support frc-shared property
        dt-bindings: timer: exynos4210-mct: Add ARTPEC-8 MCT support
        clocksource/drivers/sun4i: Add definition of clear interrupt
        clocksource/drivers/renesas-ostm: Add support for RZ/V2L SoC
        ...
      55be6084
    • Linus Torvalds's avatar
      Merge tag 'sched-rt-2022-10-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 7f6dcffb
      Linus Torvalds authored
      Pull preempt RT updates from Thomas Gleixner:
       "Introduce preempt_[dis|enable_nested() and use it to clean up various
        places which have open coded PREEMPT_RT conditionals.
      
        On PREEMPT_RT enabled kernels, spinlocks and rwlocks are neither
        disabling preemption nor interrupts. Though there are a few places
        which depend on the implicit preemption/interrupt disable of those
        locks, e.g. seqcount write sections, per CPU statistics updates etc.
      
        PREEMPT_RT added open coded CONFIG_PREEMPT_RT conditionals to
        disable/enable preemption in the related code parts all over the
        place. That's hard to read and does not really explain why this is
        necessary.
      
        Linus suggested to use helper functions (preempt_disable_nested() and
        preempt_enable_nested()) and use those in the affected places. On !RT
        enabled kernels these functions are NOPs, but contain a lockdep assert
        to validate that preemption is actually disabled to catch call sites
        which do not have preemption disabled.
      
        Clean up the affected code paths in mm, dentry and lib"
      
      * tag 'sched-rt-2022-10-05' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        u64_stats: Streamline the implementation
        flex_proportions: Disable preemption entering the write section.
        mm/compaction: Get rid of RT ifdeffery
        mm/memcontrol: Replace the PREEMPT_RT conditionals
        mm/debug: Provide VM_WARN_ON_IRQS_ENABLED()
        mm/vmstat: Use preempt_[dis|en]able_nested()
        dentry: Use preempt_[dis|en]able_nested()
        preempt: Provide preempt_[dis|en]able_nested()
      7f6dcffb
    • Linus Torvalds's avatar
      Merge tag 'objtool-core-2022-10-07' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 65f109e1
      Linus Torvalds authored
      Pull objtool updates from Ingo Molnar:
      
       - Remove the "ANNOTATE_NOENDBR on ENDBR" warning: it's not really
         useful and only found a non-bug false positive so far.
      
       - Properly decode LOOP/LOOPE/LOOPNE, which were missing from the x86
         decoder. Because these instructions are rather ineffective, they
         never showed up in compiler output, but they are simple enough to
         support, so add them for completeness.
      
       - A bit more cross-arch preparatory work.
      
      * tag 'objtool-core-2022-10-07' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        objtool,x86: Teach decode about LOOP* instructions
        objtool: Remove "ANNOTATE_NOENDBR on ENDBR" warning
        objtool: Use arch_jump_destination() in read_intra_function_calls()
      65f109e1
    • Linus Torvalds's avatar
      Merge tag 'locking-core-2022-10-07' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 3e71f016
      Linus Torvalds authored
      Pull locking updates from Ingo Molnar:
      
       - Disable preemption in rwsem_write_trylock()'s attempt to take the
         rwsem, to avoid RT tasks hogging the CPU, which managed to preempt
         this function after the owner has been cleared but before a new owner
         is set. Also add debug checks to enforce this.
      
       - Add __lockfunc to more slow path functions and add __sched to
         semaphore functions.
      
       - Mark spinlock APIs noinline when the respective CONFIG_INLINE_SPIN_*
         toggles are disabled, to reduce LTO text size.
      
       - Print more debug information when lockdep gets confused in
         look_up_lock_class().
      
       - Improve header file abuse checks.
      
       - Misc cleanups
      
      * tag 'locking-core-2022-10-07' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        locking/lockdep: Print more debug information - report name and key when look_up_lock_class() got confused
        locking: Add __sched to semaphore functions
        locking/rwsem: Disable preemption while trying for rwsem lock
        locking: Detect includes rwlock.h outside of spinlock.h
        locking: Add __lockfunc to slow path functions
        locking/spinlocks: Mark spinlocks noinline when inline spinlocks are disabled
        selftests: futex: Fix 'the the' typo in comment
      3e71f016
    • Linus Torvalds's avatar
      Merge tag 'perf-core-2022-10-07' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 3871d93b
      Linus Torvalds authored
      Pull perf events updates from Ingo Molnar:
       "PMU driver updates:
      
         - Add AMD Last Branch Record Extension Version 2 (LbrExtV2) feature
           support for Zen 4 processors.
      
         - Extend the perf ABI to provide branch speculation information, if
           available, and use this on CPUs that have it (eg. LbrExtV2).
      
         - Improve Intel PEBS TSC timestamp handling & integration.
      
         - Add Intel Raptor Lake S CPU support.
      
         - Add 'perf mem' and 'perf c2c' memory profiling support on AMD CPUs
           by utilizing IBS tagged load/store samples.
      
         - Clean up & optimize various x86 PMU details.
      
        HW breakpoints:
      
         - Big rework to optimize the code for systems with hundreds of CPUs
           and thousands of breakpoints:
      
            - Replace the nr_bp_mutex global mutex with the bp_cpuinfo_sem
              per-CPU rwsem that is read-locked during most of the key
              operations.
      
            - Improve the O(#cpus * #tasks) logic in toggle_bp_slot() and
              fetch_bp_busy_slots().
      
            - Apply micro-optimizations & cleanups.
      
        - Misc cleanups & enhancements"
      
      * tag 'perf-core-2022-10-07' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (75 commits)
        perf/hw_breakpoint: Annotate tsk->perf_event_mutex vs ctx->mutex
        perf: Fix pmu_filter_match()
        perf: Fix lockdep_assert_event_ctx()
        perf/x86/amd/lbr: Adjust LBR regardless of filtering
        perf/x86/utils: Fix uninitialized var in get_branch_type()
        perf/uapi: Define PERF_MEM_SNOOPX_PEER in kernel header file
        perf/x86/amd: Support PERF_SAMPLE_PHY_ADDR
        perf/x86/amd: Support PERF_SAMPLE_ADDR
        perf/x86/amd: Support PERF_SAMPLE_{WEIGHT|WEIGHT_STRUCT}
        perf/x86/amd: Support PERF_SAMPLE_DATA_SRC
        perf/x86/amd: Add IBS OP_DATA2 DataSrc bit definitions
        perf/mem: Introduce PERF_MEM_LVLNUM_{EXTN_MEM|IO}
        perf/x86/uncore: Add new Raptor Lake S support
        perf/x86/cstate: Add new Raptor Lake S support
        perf/x86/msr: Add new Raptor Lake S support
        perf/x86: Add new Raptor Lake S support
        bpf: Check flags for branch stack in bpf_read_branch_records helper
        perf, hw_breakpoint: Fix use-after-free if perf_event_open() fails
        perf: Use sample_flags for raw_data
        perf: Use sample_flags for addr
        ...
      3871d93b
    • Rafael J. Wysocki's avatar
      Merge branches 'acpi-apei', 'acpi-wakeup', 'acpi-reboot' and 'acpi-thermal' · 056a8154
      Rafael J. Wysocki authored
      Merge additional APEI changes, ACPI updates related to device wakeup and
      system restart and ACPI thermal driver cleanups for 6.1-rc1:
      
       - Fix a memory leak in APEI by avoiding to add do not add task_work to
         kernel threads running when an asynchronous error is detected (Shuai
         Xue).
      
       - Add ACPI support for handling system wakeups via GPIO wake capable
         IRQs in addition to GPEs (Raul E Rangel).
      
       - Make the system reboot code put ACPI-enabled systems into the S5
         (system off) state which is necessary for some platforms to work as
         expected (Kai-Heng Feng).
      
       - Make the white space usage in the ACPI thermal driver more consistent
         and drop redundant code from it (Rafael Wysocki).
      
      * acpi-apei:
        ACPI: APEI: do not add task_work to kernel thread to avoid memory leak
      
      * acpi-wakeup:
        ACPI: PM: Take wake IRQ into consideration when entering suspend-to-idle
        i2c: acpi: Use ACPI wake capability bit to set wake_irq
        ACPI: resources: Add wake_capable parameter to acpi_dev_irq_flags
        gpiolib: acpi: Add wake_capable variants of acpi_dev_gpio_irq_get
      
      * acpi-reboot:
        PM: ACPI: reboot: Reinstate S5 for reboot
        kernel/reboot: Add SYS_OFF_MODE_RESTART_PREPARE mode
      
      * acpi-thermal:
        ACPI: thermal: Drop some redundant code
        ACPI: thermal: Drop redundant parens from expressions
        ACPI: thermal: Use white space more consistently
      056a8154
    • Linus Torvalds's avatar
      Merge tag 'sched-core-2022-10-07' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 30c99993
      Linus Torvalds authored
      Pull scheduler updates from Ingo Molnar:
       "Debuggability:
      
         - Change most occurances of BUG_ON() to WARN_ON_ONCE()
      
         - Reorganize & fix TASK_ state comparisons, turn it into a bitmap
      
         - Update/fix misc scheduler debugging facilities
      
        Load-balancing & regular scheduling:
      
         - Improve the behavior of the scheduler in presence of lot of
           SCHED_IDLE tasks - in particular they should not impact other
           scheduling classes.
      
         - Optimize task load tracking, cleanups & fixes
      
         - Clean up & simplify misc load-balancing code
      
        Freezer:
      
         - Rewrite the core freezer to behave better wrt thawing and be
           simpler in general, by replacing PF_FROZEN with TASK_FROZEN &
           fixing/adjusting all the fallout.
      
        Deadline scheduler:
      
         - Fix the DL capacity-aware code
      
         - Factor out dl_task_is_earliest_deadline() &
           replenish_dl_new_period()
      
         - Relax/optimize locking in task_non_contending()
      
        Cleanups:
      
         - Factor out the update_current_exec_runtime() helper
      
         - Various cleanups, simplifications"
      
      * tag 'sched-core-2022-10-07' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (41 commits)
        sched: Fix more TASK_state comparisons
        sched: Fix TASK_state comparisons
        sched/fair: Move call to list_last_entry() in detach_tasks
        sched/fair: Cleanup loop_max and loop_break
        sched/fair: Make sure to try to detach at least one movable task
        sched: Show PF_flag holes
        freezer,sched: Rewrite core freezer logic
        sched: Widen TAKS_state literals
        sched/wait: Add wait_event_state()
        sched/completion: Add wait_for_completion_state()
        sched: Add TASK_ANY for wait_task_inactive()
        sched: Change wait_task_inactive()s match_state
        freezer,umh: Clean up freezer/initrd interaction
        freezer: Have {,un}lock_system_sleep() save/restore flags
        sched: Rename task_running() to task_on_cpu()
        sched/fair: Cleanup for SIS_PROP
        sched/fair: Default to false in test_idle_cores()
        sched/fair: Remove useless check in select_idle_core()
        sched/fair: Avoid double search on same cpu
        sched/fair: Remove redundant check in select_idle_smt()
        ...
      30c99993
  7. 09 Oct, 2022 9 commits
    • Linus Torvalds's avatar
      Merge tag 'ucount-rlimits-cleanups-for-v5.19' of... · 493ffd66
      Linus Torvalds authored
      Merge tag 'ucount-rlimits-cleanups-for-v5.19' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace
      
      Pull ucounts update from Eric Biederman:
       "Split rlimit and ucount values and max values
      
        After the ucount rlimit code was merged a bunch of small but
        siginificant bugs were found and fixed. At the time it was realized
        that part of the problem was that while the ucount rlimits were very
        similar to the oridinary ucounts (in being nested counts with limits)
        the semantics were slightly different and the code would be less error
        prone if there was less sharing.
      
        This is the long awaited cleanup that should hopefully keep things
        more comprehensible and less error prone for whoever needs to touch
        that code next"
      
      * tag 'ucount-rlimits-cleanups-for-v5.19' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace:
        ucounts: Split rlimit and ucount values and max values
      493ffd66
    • Linus Torvalds's avatar
      Merge tag 'signal-for-v5.20' of... · e572410e
      Linus Torvalds authored
      Merge tag 'signal-for-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace
      
      Pull ptrace update from Eric Biederman:
       "ptrace: Stop supporting SIGKILL for PTRACE_EVENT_EXIT
      
        Recently I had a conversation where it was pointed out to me that
        SIGKILL sent to a tracee stropped in PTRACE_EVENT_EXIT is quite
        difficult for a tracer to handle.
      
        Keeping SIGKILL working after the process has been killed is pain from
        an implementation point of view.
      
        So since the debuggers don't want this behavior let's see if we can
        remove this wart for the userspace API
      
        If a regression is detected it should only need to be the last change
        that is the reverted. The other two are just general cleanups that
        make the last patch simpler"
      
      * tag 'signal-for-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace:
        signal: Drop signals received after a fatal signal has been processed
        signal: Guarantee that SIGNAL_GROUP_EXIT is set on process exit
        signal: Ensure SIGNAL_GROUP_EXIT gets set in do_group_exit
      e572410e
    • Linus Torvalds's avatar
      Merge tag 'retire_mq_sysctls-for-v5.19' of... · 86fb9c53
      Linus Torvalds authored
      Merge tag 'retire_mq_sysctls-for-v5.19' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace
      
      Pull mqueue fix from Eric Biederman:
       "A fix for an unlikely but possible memory leak"
      
      * tag 'retire_mq_sysctls-for-v5.19' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace:
        ipc: mqueue: fix possible memory leak in init_mqueue_fs()
      86fb9c53
    • Linus Torvalds's avatar
      Merge tag 'interrupting_kthread_stop-for-v5.20' of... · c71370bd
      Linus Torvalds authored
      Merge tag 'interrupting_kthread_stop-for-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace
      
      Pull kthread update from Eric Biederman:
       "Break out of wait loops on kthread_stop()
      
        This is a small tweak to kthread_stop so it breaks out of
        interruptible waits, that don't explicitly test for kthread_stop.
      
        These interruptible waits occassionaly occur in kernel threads do to
        code sharing"
      
      * tag 'interrupting_kthread_stop-for-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/ebiederm/user-namespace:
        signal: break out of wait loops on kthread_stop()
      c71370bd
    • Linus Torvalds's avatar
      Merge tag 'powerpc-6.1-1' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux · 4899a36f
      Linus Torvalds authored
      Pull powerpc updates from Michael Ellerman:
      
       - Remove our now never-true definitions for pgd_huge() and p4d_leaf().
      
       - Add pte_needs_flush() and huge_pmd_needs_flush() for 64-bit.
      
       - Add support for syscall wrappers.
      
       - Add support for KFENCE on 64-bit.
      
       - Update 64-bit HV KVM to use the new guest state entry/exit accounting
         API.
      
       - Support execute-only memory when using the Radix MMU (P9 or later).
      
       - Implement CONFIG_PARAVIRT_TIME_ACCOUNTING for pseries guests.
      
       - Updates to our linker script to move more data into read-only
         sections.
      
       - Allow the VDSO to be randomised on 32-bit.
      
       - Many other small features and fixes.
      
      Thanks to Andrew Donnellan, Aneesh Kumar K.V, Arnd Bergmann, Athira
      Rajeev, Christophe Leroy, David Hildenbrand, Disha Goel, Fabiano Rosas,
      Gaosheng Cui, Gustavo A. R. Silva, Haren Myneni, Hari Bathini, Jilin
      Yuan, Joel Stanley, Kajol Jain, Kees Cook, Krzysztof Kozlowski, Laurent
      Dufour, Liang He, Li Huafei, Lukas Bulwahn, Madhavan Srinivasan, Nathan
      Chancellor, Nathan Lynch, Nicholas Miehlbradt, Nicholas Piggin, Pali
      Rohár, Rohan McLure, Russell Currey, Sachin Sant, Segher Boessenkool,
      Shrikanth Hegde, Tyrel Datwyler, Wolfram Sang, ye xingchen, and Zheng
      Yongjun.
      
      * tag 'powerpc-6.1-1' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux: (214 commits)
        KVM: PPC: Book3S HV: Fix stack frame regs marker
        powerpc: Don't add __powerpc_ prefix to syscall entry points
        powerpc/64s/interrupt: Fix stack frame regs marker
        powerpc/64: Fix msr_check_and_set/clear MSR[EE] race
        powerpc/64s/interrupt: Change must-hard-mask interrupt check from BUG to WARN
        powerpc/pseries: Add firmware details to the hardware description
        powerpc/powernv: Add opal details to the hardware description
        powerpc: Add device-tree model to the hardware description
        powerpc/64: Add logical PVR to the hardware description
        powerpc: Add PVR & CPU name to hardware description
        powerpc: Add hardware description string
        powerpc/configs: Enable PPC_UV in powernv_defconfig
        powerpc/configs: Update config files for removed/renamed symbols
        powerpc/mm: Fix UBSAN warning reported on hugetlb
        powerpc/mm: Always update max/min_low_pfn in mem_topology_setup()
        powerpc/mm/book3s/hash: Rename flush_tlb_pmd_range
        powerpc: Drops STABS_DEBUG from linker scripts
        powerpc/64s: Remove lost/old comment
        powerpc/64s: Remove old STAB comment
        powerpc: remove orphan systbl_chk.sh
        ...
      4899a36f
    • Linus Torvalds's avatar
      Merge tag 's390-6.1-1' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux · 03785a69
      Linus Torvalds authored
      Pull s390 updates from Vasily Gorbik:
      
       - Make use of the IBM z16 processor activity instrumentation facility
         extension to count neural network processor assist operations: add a
         new PMU device driver so that perf can make use of this.
      
       - Rework memcpy_real() to avoid DAT-off mode.
      
       - Rework absolute lowcore access code.
      
       - Various small fixes and improvements all over the code.
      
      * tag 's390-6.1-1' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux:
        s390/pci: remove unused bus_next field from struct zpci_dev
        s390/cio: remove unused ccw_device_force_console() declaration
        s390/pai: Add support for PAI Extension 1 NNPA counters
        s390/mm: fix no previous prototype warnings in maccess.c
        s390/mm: uninline copy_oldmem_kernel() function
        s390/mm,ptdump: add real memory copy page markers
        s390/mm: rework memcpy_real() to avoid DAT-off mode
        s390/dump: save IPL CPU registers once DAT is available
        s390/pci: convert high_memory to physical address
        s390/smp,ptdump: add absolute lowcore markers
        s390/smp: rework absolute lowcore access
        s390/smp: call smp_reinit_ipl_cpu() before scheduler is available
        s390/ptdump: add missing amode31 markers
        s390/mm: split lowcore pages with set_memory_4k()
        s390/mm: remove unused access parameter from do_fault_error()
        s390/delay: sync comment within __delay() with reality
        s390: move from strlcpy with unused retval to strscpy
      03785a69
    • Linus Torvalds's avatar
      Merge tag 'riscv-for-linus-6.1-mw1' of git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux · 2e64066d
      Linus Torvalds authored
      Pull RISC-V updates from Palmer Dabbelt:
      
       - Improvements to the CPU topology subsystem, which fix some issues
         where RISC-V would report bad topology information.
      
       - The default NR_CPUS has increased to XLEN, and the maximum
         configurable value is 512.
      
       - The CD-ROM filesystems have been enabled in the defconfig.
      
       - Support for THP_SWAP has been added for rv64 systems.
      
      There are also a handful of cleanups and fixes throughout the tree.
      
      * tag 'riscv-for-linus-6.1-mw1' of git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux:
        riscv: enable THP_SWAP for RV64
        RISC-V: Print SSTC in canonical order
        riscv: compat: s/failed/unsupported if compat mode isn't supported
        RISC-V: Increase range and default value of NR_CPUS
        cpuidle: riscv-sbi: Fix CPU_PM_CPU_IDLE_ENTER_xyz() macro usage
        perf: RISC-V: throttle perf events
        perf: RISC-V: exclude invalid pmu counters from SBI calls
        riscv: enable CD-ROM file systems in defconfig
        riscv: topology: fix default topology reporting
        arm64: topology: move store_cpu_topology() to shared code
      2e64066d
    • Linus Torvalds's avatar
      Merge tag 'microblaze-v6.1' of git://git.monstr.eu/linux-2.6-microblaze · 57c92724
      Linus Torvalds authored
      Pull microblaze updates from Michal Simek:
       "This adds architecture support for error injection which can be done
        only via local memory (BRAM) with enabling path for recovery after
        reset.
      
        These patches targets Triple Modular Redundacy (TMR) configuration
        where 3 Microblazes are running in parallel with monitoring logic.
      
        When an error happens (or is injected) system goes to break handler
        with full CPU reset and system recovery back to origin context. More
        information can be found at [1]"
      
      Link: https://www.xilinx.com/content/dam/xilinx/support/documents/ip_documentation/tmr/v1_0/pg268-tmr.pdf [1]
      
      * tag 'microblaze-v6.1' of git://git.monstr.eu/linux-2.6-microblaze:
        microblaze: Add support for error injection
        microblaze: Add custom break vector handler for mb manager
        microblaze: Add xmb_manager_register function
      57c92724
    • Linus Torvalds's avatar
      Merge tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm · ef688f8b
      Linus Torvalds authored
      Pull kvm updates from Paolo Bonzini:
       "The first batch of KVM patches, mostly covering x86.
      
        ARM:
      
         - Account stage2 page table allocations in memory stats
      
        x86:
      
         - Account EPT/NPT arm64 page table allocations in memory stats
      
         - Tracepoint cleanups/fixes for nested VM-Enter and emulated MSR
           accesses
      
         - Drop eVMCS controls filtering for KVM on Hyper-V, all known
           versions of Hyper-V now support eVMCS fields associated with
           features that are enumerated to the guest
      
         - Use KVM's sanitized VMCS config as the basis for the values of
           nested VMX capabilities MSRs
      
         - A myriad event/exception fixes and cleanups. Most notably, pending
           exceptions morph into VM-Exits earlier, as soon as the exception is
           queued, instead of waiting until the next vmentry. This fixed a
           longstanding issue where the exceptions would incorrecly become
           double-faults instead of triggering a vmexit; the common case of
           page-fault vmexits had a special workaround, but now it's fixed for
           good
      
         - A handful of fixes for memory leaks in error paths
      
         - Cleanups for VMREAD trampoline and VMX's VM-Exit assembly flow
      
         - Never write to memory from non-sleepable kvm_vcpu_check_block()
      
         - Selftests refinements and cleanups
      
         - Misc typo cleanups
      
        Generic:
      
         - remove KVM_REQ_UNHALT"
      
      * tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm: (94 commits)
        KVM: remove KVM_REQ_UNHALT
        KVM: mips, x86: do not rely on KVM_REQ_UNHALT
        KVM: x86: never write to memory from kvm_vcpu_check_block()
        KVM: x86: Don't snapshot pending INIT/SIPI prior to checking nested events
        KVM: nVMX: Make event request on VMXOFF iff INIT/SIPI is pending
        KVM: nVMX: Make an event request if INIT or SIPI is pending on VM-Enter
        KVM: SVM: Make an event request if INIT or SIPI is pending when GIF is set
        KVM: x86: lapic does not have to process INIT if it is blocked
        KVM: x86: Rename kvm_apic_has_events() to make it INIT/SIPI specific
        KVM: x86: Rename and expose helper to detect if INIT/SIPI are allowed
        KVM: nVMX: Make an event request when pending an MTF nested VM-Exit
        KVM: x86: make vendor code check for all nested events
        mailmap: Update Oliver's email address
        KVM: x86: Allow force_emulation_prefix to be written without a reload
        KVM: selftests: Add an x86-only test to verify nested exception queueing
        KVM: selftests: Use uapi header to get VMX and SVM exit reasons/codes
        KVM: x86: Rename inject_pending_events() to kvm_check_and_inject_events()
        KVM: VMX: Update MTF and ICEBP comments to document KVM's subtle behavior
        KVM: x86: Treat pending TRIPLE_FAULT requests as pending exceptions
        KVM: x86: Morph pending exceptions to pending VM-Exits at queue time
        ...
      ef688f8b