1. 24 May, 2023 15 commits
    • powercap: intel_rapl: Introduce RAPL TPMI interface driver · 9eef7f9d
      Zhang Rui authored
      The TPMI (Topology Aware Register and PM Capsule Interface) provides a
      flexible, extendable and PCIe enumerable MMIO interface for PM features.
      
      Intel RAPL (Running Average Power Limit) is one of the features that
      benefit from this. The TPMI interface has an advantage over the
      traditional MSR (Model Specific Register) interface, where a thread
      needs to be scheduled on the target CPU to read or write a register.
      Also, the RAPL features vary between CPU models, which requires a lot of
      model-specific code. TPMI instead provides an architectural interface
      built on hierarchical tables and fields, which does not need any
      model-specific implementation.
      
      The TPMI interface uses a PCI VSEC structure to expose the location of
      the MMIO interface for PM feature enumeration and control.
      
      The Intel VSEC driver parses VSEC structures present in the PCI
      configuration space of the given device and creates an auxiliary device
      object for each of them. In particular, it creates an auxiliary device
      object representing TPMI that can be bound to by an auxiliary driver.
      
      Then the TPMI enumeration driver binds to the TPMI auxiliary device
      object created by the Intel VSEC driver, parses the PM Feature Structure
      (PFS) present in the TPMI MMIO region and creates device nodes for PM
      features described in the PFS.
      
      This RAPL TPMI Interface driver binds to the RAPL auxiliary device
      created by the TPMI enumeration driver and exposes the RAPL controls to
      userspace via the powercap sysfs class.
      
      RAPL TPMI details are published in the following document:
      https://github.com/intel/tpmi_power_management/blob/main/RAPL_TPMI_public_disclosure_FINAL.docx
      
      Note that, for now, the RAPL TPMI Interface and the RAPL MSR Interface
      cannot coexist on the same platform (the RAPL TPMI Interface is not
      supported on any platform in the CPU model list for the RAPL MSR
      Interface). Thus, register the RAPL TPMI powercap control type with the
      name "intel-rapl", the same as the RAPL MSR Interface, so that it is
      transparent to userspace.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Introduce core support for TPMI interface · e12dee18
      Zhang Rui authored
      Compared with the existing RAPL MSR/MMIO Interface, the RAPL TPMI Interface
      1. has one register per Power Limit, and thus a Lock bit and an Enable
         bit per Power Limit.
      2. doesn't have a Power Limit Clamp bit.
      3. uses different bit offsets for the Power Limit Lock and Enable bits.
      These differences mean the RAPL TPMI Interface needs its own primitive
      information.
      
      RAPL TPMI Interface also has per domain unit register but with a
      different register layout. This requires a TPMI specific rapl_defaults
      call to decode the unit register.
      
      Introduce the RAPL core support for TPMI Interface.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Introduce RAPL I/F type · b4288ce7
      Zhang Rui authored
      Different RAPL Interfaces may have different primitive information and
      rapl_defaults calls.
      
      To better distinguish this difference in the RAPL framework code,
      introduce a new enum to represent different types of RAPL Interfaces.
      
      No functional change.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Make cpu optional for rapl_package · bf44b901
      Zhang Rui authored
      MSR RAPL Interface always removes a rapl_package when all the CPUs in
      that rapl_package are offlined. This is because it relies on an online
      CPU to access the MSR.
      
      But for a RAPL Interface using MMIO registers, when all the CPUs within
      the rapl_package are offlined,
      1. the registers can still be accessed
      2. monitoring and setting the Power Limits for the rapl_package is still
         meaningful because of uncore power.
      
      This means that a valid rapl_package doesn't rely on one or more CPUs
      being online.
      
      For this reason, make cpu optional for rapl_package. A rapl_package can
      be registered either using a CPU id to represent the physical
      package/die, or using the physical package id directly.
      
      Note that the thermal throttling interrupt is not disabled via
      MSR_IA32_PACKAGE_THERM_INTERRUPT for such a rapl_package at the moment.
      If it is still needed in the future, this can be achieved by selecting
      an online CPU using the physical package id.
      
      Note that processor_thermal_rapl, the current MMIO RAPL Interface
      driver, can also be converted to register using a package id instead.
      But this is not done right now because processor_thermal_rapl driver
      works on single-package systems only, and offlining the only package
      will not happen. So keep the previous logic.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Remove redundant cpu parameter · 693c1d78
      Zhang Rui authored
      For rapl_packages that rely on online CPUs to work, rp->lead_cpu always
      has a valid CPU id.
      
      Remove the redundant cpu parameter in rapl_check_domain(),
      rapl_detect_domains() and .check_unit() callbacks.
      
      No functional change.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Add support for lock bit per Power Limit · f442bd27
      Zhang Rui authored
      With RAPL MSR/MMIO Interface, each RAPL domain has one Power Limit
      register. Each Power Limit register has one lock bit which tells the OS
      if the power limit register can be used or not.
      Depending on the number of power limits supported by the power limit
      register, the lock bit may apply to one or more power limits.
      
      With RAPL TPMI Interface, each RAPL domain has multiple Power Limits,
      and each Power Limit has its own register, with a lock bit.
      
      To handle this, introduce support for lock bit per Power Limit.
      
      For existing RAPL MSR/MMIO Interfaces, the lock bit in the Power Limit
      register applies to all the Power Limits controlled by this register.
      
      Remove the per domain DOMAIN_STATE_BIOS_LOCKED flag at the same time
      because it can be replaced by the per Power Limit lock.
      
      No functional change intended.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Cleanup Power Limits support · 9050a9cd
      Zhang Rui authored
      The same set of operations is shared by different Power Limits,
      including Power Limit get/set, Power Limit enable/disable, clamping
      enable/disable, time window get/set, and max power get/set, etc.
      
      But the same operation for different Power Limits has different
      primitives because they use different registers/register bits.
      
      A lot of dirty/duplicate code was introduced to handle this difference.
      
      Introduce a universal way to issue Power Limit operations.
      Instead of using hardcoded primitive name directly, use Power Limit id
      + operation type, and hide all the Power Limit difference details in a
      central place, get_pl_prim(). Two helpers, rapl_read_pl_data() and
      rapl_write_pl_data(), are introduced at the same time to simplify the
      code for issuing Power Limit operations.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
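The central lookup described in commit 9050a9cd, which turns a Power Limit id plus an operation type into a primitive, might look roughly like the sketch below. Only get_pl_prim() is named in the commit message; the enum names, the function name pl_prim() and the choice of which combinations are unsupported are hypothetical and chosen for illustration, not taken from the driver.

```c
#include <assert.h>

/* Hypothetical, trimmed-down ids; the kernel's enum rapl_primitives is larger. */
enum pl_id { PL1, PL2, PL4, NR_PLS };
enum pl_op { OP_LIMIT, OP_ENABLE, OP_CLAMP, OP_TIME_WINDOW, NR_OPS };

enum prim {
	PRIM_NONE = -1,	/* operation not available for this Power Limit */
	PRIM_PL1_LIMIT, PRIM_PL1_ENABLE, PRIM_PL1_CLAMP, PRIM_PL1_TIME_WINDOW,
	PRIM_PL2_LIMIT, PRIM_PL2_ENABLE, PRIM_PL2_CLAMP, PRIM_PL2_TIME_WINDOW,
	PRIM_PL4_LIMIT, PRIM_PL4_ENABLE,
};

/* One central place that hides the per Power Limit differences. */
static enum prim pl_prim(enum pl_id pl, enum pl_op op)
{
	assert(pl < NR_PLS && op < NR_OPS);

	switch (pl) {
	case PL1:
		switch (op) {
		case OP_LIMIT:       return PRIM_PL1_LIMIT;
		case OP_ENABLE:      return PRIM_PL1_ENABLE;
		case OP_CLAMP:       return PRIM_PL1_CLAMP;
		case OP_TIME_WINDOW: return PRIM_PL1_TIME_WINDOW;
		default:             return PRIM_NONE;
		}
	case PL2:
		switch (op) {
		case OP_LIMIT:       return PRIM_PL2_LIMIT;
		case OP_ENABLE:      return PRIM_PL2_ENABLE;
		case OP_CLAMP:       return PRIM_PL2_CLAMP;
		case OP_TIME_WINDOW: return PRIM_PL2_TIME_WINDOW;
		default:             return PRIM_NONE;
		}
	case PL4:
		/* Assumption for this sketch: PL4 has no clamp or time window. */
		switch (op) {
		case OP_LIMIT:       return PRIM_PL4_LIMIT;
		case OP_ENABLE:      return PRIM_PL4_ENABLE;
		default:             return PRIM_NONE;
		}
	default:
		return PRIM_NONE;
	}
}
```

Callers then ask for "PL2, time window" instead of naming a primitive directly, and treat PRIM_NONE as "not supported" before touching any register.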
    • powercap: intel_rapl: Use bitmap for Power Limits · a38f300b
      Zhang Rui authored
      Currently, a RAPL package is registered with the number of Power Limits
      supported in each RAPL domain. But this doesn't tell which Power Limits
      are available. Using the number of Power Limits supported to guess the
      availability of each Power Limit is fragile.
      
      Use a bitmap to represent the availability of each Power Limit.
      
      Note that PL1 is mandatory, so it does not need to be set explicitly by
      the RAPL Interface drivers.
      
      No functional change intended.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
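As a rough illustration of the bitmap idea in commit a38f300b, the sketch below records availability as one bit per Power Limit and always treats PL1 as present. The helper names effective_limits() and pl_available() and the local BIT() macro are assumptions made for this example; they are not the driver's API.

```c
#include <assert.h>
#include <stdbool.h>

/* Power Limit ids double as bit positions (ordering assumed for the sketch). */
enum { POWER_LIMIT1, POWER_LIMIT2, POWER_LIMIT4, NR_POWER_LIMITS };

#define BIT(n) (1u << (n))

/* PL1 is mandatory, so it is ORed in regardless of what the driver passed. */
static unsigned int effective_limits(unsigned int driver_limits)
{
	return driver_limits | BIT(POWER_LIMIT1);
}

/* Ask about one specific Power Limit instead of guessing from a count. */
static bool pl_available(unsigned int limits, int pl)
{
	assert(pl >= 0 && pl < NR_POWER_LIMITS);
	return (limits & BIT(pl)) != 0;
}
```

With a plain count, "2 limits" cannot distinguish PL1+PL2 from PL1+PL4; with the bitmap, a driver that passes only BIT(POWER_LIMIT4) ends up with PL1 and PL4 available and PL2 absent.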
    • powercap: intel_rapl: Change primitive order · 045610c3
      Zhang Rui authored
      The same set of operations is shared by different Power Limits,
      including Power Limit get/set, Power Limit enable/disable, clamping
      enable/disable, time window get/set, and max power get/set, etc.
      
      But the same operation for different Power Limits has different
      primitives because they use different registers/register bits.
      
      A lot of dirty/duplicate code was introduced to handle this difference.
      
      Instead of using a hardcoded primitive name directly, using a Power Limit
      id + operation type is much cleaner.
      
      For this reason, move POWER_LIMIT1/POWER_LIMIT2/POWER_LIMIT4 to the
      beginning of enum rapl_primitives so that they can be reused as
      Power Limit ids.
      
      No functional change.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Use index to initialize primitive information · 11edbe5c
      Zhang Rui authored
      Currently, the RAPL primitive information array is required to be
      initialized in the order of enum rapl_primitives.
      This can break easily, especially when different RAPL Interfaces may
      support different sets of primitives.
      
      Convert the code to initialize the primitive information using array
      index explicitly.
      
      No functional change.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
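The index-based initialization described in commit 11edbe5c relies on C designated initializers. A minimal sketch, assuming a hypothetical three-entry enum, a simplified struct prim_info and illustrative mask/shift values (none of which are copied from the driver):

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical subset of primitives. */
enum prim { POWER_LIMIT1, POWER_LIMIT2, ENERGY_COUNTER, NR_PRIMS };

struct prim_info {
	const char *name;
	uint64_t mask;
	int shift;
};

/* Each entry names its own slot, so the order of the lines no longer matters. */
#define PRIM_INFO(p, m, s) [p] = { .name = #p, .mask = (m), .shift = (s) }

static const struct prim_info prim_table[NR_PRIMS] = {
	/* Deliberately out of enum order, with POWER_LIMIT2 left out. */
	PRIM_INFO(ENERGY_COUNTER, 0xffffffffULL, 0),
	PRIM_INFO(POWER_LIMIT1, 0x7fffULL, 0),
};

/* Slots that were never initialized are zero-filled, so name is NULL. */
static const struct prim_info *prim_lookup(enum prim p)
{
	assert(p < NR_PRIMS);
	return prim_table[p].name ? &prim_table[p] : NULL;
}
```

An interface that supports only some primitives can simply omit the others, and a lookup for a missing entry returns NULL instead of silently reading a neighbour's data.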
    • powercap: intel_rapl: Support per domain energy/power/time unit · cb532e72
      Zhang Rui authored
      The RAPL MSR/MMIO Interface has a package-scope unit register, but some
      RAPL domains like DRAM/Psys may use a fixed energy unit value instead of
      the default unit value on certain platforms.
      The RAPL TPMI Interface supports a per-domain unit register.
      
      For the above reasons, add support for per domain unit register and per
      domain energy/power/time unit.
      
      When per domain unit register is not available, use the package scope
      unit register as the per domain unit register for each RAPL domain so
      that this change is transparent to MSR/MMIO Interface.
      
      No functional change intended.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
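The fallback described in commit cb532e72, where a domain without its own unit register reuses the package-scope one, can be sketched as follows. The struct unit_reg type and both function names are hypothetical. The bit position used for decoding follows the MSR_RAPL_POWER_UNIT layout (energy status units in bits 12:8); the commit notes that the TPMI unit register uses a different layout, which this sketch does not cover.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical view of a unit register: whether it exists and its raw value. */
struct unit_reg {
	bool present;
	uint64_t raw;
};

/* Prefer the per-domain register; otherwise fall back to the package one. */
static struct unit_reg effective_unit(struct unit_reg domain,
				      struct unit_reg package)
{
	return domain.present ? domain : package;
}

/*
 * MSR layout only: bits 12:8 hold the energy status unit (ESU), and one
 * energy count corresponds to 1 / 2^ESU joules.
 */
static unsigned int msr_energy_unit_shift(uint64_t raw)
{
	return (unsigned int)((raw >> 8) & 0x1f);
}
```

For example, a raw package value of 0xa0e03 decodes to an ESU of 14, that is 1/16384 J (about 61 microjoules) per count, and a domain with no register of its own inherits exactly that value.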
    • powercap: intel_rapl: Support per Interface primitive information · 98ff639a
      Zhang Rui authored
      RAPL primitive information is Interface specific.
      
      Although the current MSR and MMIO Interfaces share the same RAPL
      primitives, a new Interface like TPMI has its own RAPL primitive
      information.
      
      Save the primitive information in the Interface private structure.
      
      Plus, using the variable name "rp" for struct rapl_primitive_info is
      confusing because "rp" is also used for struct rapl_package.
      Use "rpi" as the variable name for struct rapl_primitive_info, and rename
      the previous rpi[] array to avoid conflict.
      
      No functional change.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Support per Interface rapl_defaults · e8e28c2a
      Zhang Rui authored
      rapl_defaults is Interface specific.
      
      Although the current MSR and MMIO Interfaces share the same rapl_defaults,
      a new Interface like TPMI needs its own rapl_defaults callbacks.
      
      Save the rapl_defaults information in the Interface private structure.
      
      No functional change.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Allow probing without CPUID match · 1488ac99
      Zhang Rui authored
      Currently, the CPU model check is used to
      1. get proper rapl_defaults callbacks for RAPL MSR/MMIO Interface.
      2. create a platform device node for the intel_rapl_msr driver to probe.
      
      Both of these are only mandatory for the RAPL MSR/MMIO Interface.
      
      Make the CPUID match optional.
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Tested-by: Wang Wendy <wendy.wang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    • powercap: intel_rapl: Remove unused field in struct rapl_if_priv · 31cb1304
      Zhang Rui authored
      After commit f1e8d756 ("powercap/intel_rapl: enumerate Psys RAPL
      domain together with package RAPL domain"), the platform_rapl_domain field
      is not used anymore. Remove it from rapl_if_priv structure.
      
      Fixes: f1e8d756 ("powercap/intel_rapl: enumerate Psys RAPL domain together with package RAPL domain")
      Signed-off-by: Zhang Rui <rui.zhang@intel.com>
      Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
  2. 21 May, 2023 12 commits
    • Linux 6.4-rc3 · 44c026a7
      Linus Torvalds authored
    • Merge tag 'uml-for-linus-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/uml/linux · fa4fe8ce
      Linus Torvalds authored
      Pull UML fix from Richard Weinberger:
      
       - Fix modular build for UML watchdog
      
      * tag 'uml-for-linus-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/uml/linux:
        um: harddog: fix modular build
    • Merge tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm · a35747c3
      Linus Torvalds authored
      Pull kvm fixes from Paolo Bonzini:
       "ARM:
      
         - Plug a race in the stage-2 mapping code where the IPA and the PA
           would end up being out of sync
      
         - Make better use of the bitmap API (bitmap_zero, bitmap_zalloc...)
      
         - FP/SVE/SME documentation update, in the hope that this field
           becomes clearer...
      
         - Add workaround for Apple SEIS brokenness to a new SoC
      
         - Random comment fixes
      
        x86:
      
         - add MSR_IA32_TSX_CTRL into msrs_to_save
      
         - fixes for XCR0 handling in SGX enclaves
      
        Generic:
      
         - Fix vcpu_array[0] races
      
         - Fix race between starting a VM and 'reboot -f'"
      
      * tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm:
        KVM: VMX: add MSR_IA32_TSX_CTRL into msrs_to_save
        KVM: x86: Don't adjust guest's CPUID.0x12.1 (allowed SGX enclave XFRM)
        KVM: VMX: Don't rely _only_ on CPUID to enforce XCR0 restrictions for ECREATE
        KVM: Fix vcpu_array[0] races
        KVM: VMX: Fix header file dependency of asm/vmx.h
        KVM: Don't enable hardware after a restart/shutdown is initiated
        KVM: Use syscore_ops instead of reboot_notifier to hook restart/shutdown
        KVM: arm64: vgic: Add Apple M2 PRO/MAX cpus to the list of broken SEIS implementations
        KVM: arm64: Clarify host SME state management
        KVM: arm64: Restructure check for SVE support in FP trap handler
        KVM: arm64: Document check for TIF_FOREIGN_FPSTATE
        KVM: arm64: Fix repeated words in comments
        KVM: arm64: Constify start/end/phys fields of the pgtable walker data
        KVM: arm64: Infer PA offset from VA in hyp map walker
        KVM: arm64: Infer the PA offset from IPA in stage-2 map walker
        KVM: arm64: Use the bitmap API to allocate bitmaps
        KVM: arm64: Slightly optimize flush_context()
    • Merge tag 'perf-tools-fixes-for-v6.4-1-2023-05-20' of... · c47d122c
      Linus Torvalds authored
      Merge tag 'perf-tools-fixes-for-v6.4-1-2023-05-20' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux
      
      Pull perf tools fixes from Arnaldo Carvalho de Melo:
      
       - Fail gracefully if BUILD_BPF_SKEL=1 is specified and clang isn't
         available
      
       - Add empty 'struct rq' to 'perf lock contention' to satisfy libbpf
         'runqueue' type verification. This feature is built only with
         BUILD_BPF_SKEL=1
      
       - Make vmlinux.h use bpf.h and perf_event.h in source directory, not
         system ones that may be old and not have things like 'union
         perf_sample_weight'
      
       - Add system include paths to BPF builds to pick things missing in the
         headers included by clang -target bpf
      
       - Update various header copies with the kernel sources
      
       - Change divide by zero and not supported events behavior to show
         'nan'/'not counted' in 'perf stat' output.
      
         This happens when using things like 'perf stat -M TopdownL2 true',
         involving JSON metrics
      
       - Update no event/metric expectations affected by using JSON metrics in
         'perf stat -ddd' perf test
      
       - Avoid segv with 'perf stat --topdown' for metrics without a group
      
       - Do not assume which events may have a PMU name, allowing the logic to
         keep an AUX event group together. Makes this usecase work again:
      
           $ perf record --no-bpf-event -c 10 -e '{intel_pt//,tlb_flush.stlb_any/aux-sample-size=8192/pp}:u' -- sleep 0.1
           [ perf record: Woken up 1 times to write data ]
           [ perf record: Captured and wrote 0.078 MB perf.data ]
           $ perf script -F-dso,+addr | grep -C5 tlb_flush.stlb_any | head -11
           sleep 20444 [003]  7939.510243:  1  branches:uH:  7f5350cc82a2 dl_main+0x9a2 => 7f5350cb38f0 _dl_add_to_namespace_list+0x0
           sleep 20444 [003]  7939.510243:  1  branches:uH:  7f5350cb3908 _dl_add_to_namespace_list+0x18 => 7f5350cbb080 rtld_mutex_dummy+0x0
           sleep 20444 [003]  7939.510243:  1  branches:uH:  7f5350cc8350 dl_main+0xa50 => 0 [unknown]
           sleep 20444 [003]  7939.510244:  1  branches:uH:  7f5350cc83ca dl_main+0xaca => 7f5350caeb60 _dl_process_pt_gnu_property+0x0
           sleep 20444 [003]  7939.510245:  1  branches:uH:  7f5350caeb60 _dl_process_pt_gnu_property+0x0 => 0 [unknown]
           sleep 20444  7939.510245:       10 tlb_flush.stlb_any/aux-sample-size=8192/pp: 0 7f5350caeb60 _dl_process_pt_gnu_property+0x0
           sleep 20444 [003]  7939.510254:  1  branches:uH:  7f5350cc87fe dl_main+0xefe => 7f5350ccd240 strcmp+0x0
           sleep 20444 [003]  7939.510254:  1  branches:uH:  7f5350cc8862 dl_main+0xf62 => 0 [unknown]
      
       - Add a check for the above use case in 'perf test test_intel_pt'
      
       - Fix build with refcount checking on arm64, it was still accessing
         fields that need to be wrapped so that the refcounted struct gets
         checked
      
       - Fix contextid validation in ARM's CS-ETM, so that older kernels
         without that field can still be supported
      
       - Skip unsupported aggregation for stat events found in perf.data files
         in 'perf script'
      
       - Add stat test for record and script to check the previous problem
      
       - Remove needless debuginfod queries from 'perf test java symbol', this
         was just making the test take a long time to complete
      
       - Address python SafeConfigParser() deprecation warning in 'perf test
         attr'
      
       - Fix __NR_execve undeclared on i386 'perf bench syscall' build error
      
      * tag 'perf-tools-fixes-for-v6.4-1-2023-05-20' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux: (33 commits)
        perf bench syscall: Fix __NR_execve undeclared build error
        perf test attr: Fix python SafeConfigParser() deprecation warning
        perf test attr: Update no event/metric expectations
        tools headers disabled-features: Sync with the kernel sources
        tools headers UAPI: Sync arch prctl headers with the kernel sources
        tools headers: Update the copy of x86's mem{cpy,set}_64.S used in 'perf bench'
        tools headers x86 cpufeatures: Sync with the kernel sources
        tools headers UAPI: Sync s390 syscall table file that wires up the memfd_secret syscall
        tools headers UAPI: Sync linux/prctl.h with the kernel sources
        perf metrics: Avoid segv with --topdown for metrics without a group
        perf lock contention: Add empty 'struct rq' to satisfy libbpf 'runqueue' type verification
        perf cs-etm: Fix contextid validation
        perf arm64: Fix build with refcount checking
        perf test: Add stat test for record and script
        perf script: Skip aggregation for stat events
        perf build: Add system include paths to BPF builds
        perf bpf skels: Make vmlinux.h use bpf.h and perf_event.h in source directory
        perf parse-events: Do not break up AUX event group
        perf test test_intel_pt.sh: Test sample mode with event with PMU name
        perf evsel: Modify group pmu name for software events
        ...
    • Merge tag 'powerpc-6.4-2' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux · 4927cb98
      Linus Torvalds authored
      Pull powerpc fixes from Michael Ellerman:
      
       - Fix broken soft dirty tracking when using the Radix MMU (>= P9)
      
       - Fix ISA mapping when "ranges" property is not present, for PASemi
         Nemo boards
      
       - Fix a possible WARN_ON_ONCE hitting in BPF extable handling
      
       - Fix incorrect DMA address handling when using 2MB TCEs
      
       - Fix a bug in IOMMU table handling for SR-IOV devices
      
       - Fix the recent rework of IOMMU handling which left arch code calling
         clean up routines that are handled by the IOMMU core
      
       - A few assorted build fixes
      
      Thanks to Christian Zigotzky, Dan Horák, Gaurav Batra, Hari Bathini,
      Jason Gunthorpe, Nathan Chancellor, Naveen N. Rao, Nicholas Piggin, Pali
      Rohár, Randy Dunlap, and Rob Herring.
      
      * tag 'powerpc-6.4-2' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux:
        powerpc/iommu: Incorrect DDW Table is referenced for SR-IOV device
        powerpc/iommu: DMA address offset is incorrectly calculated with 2MB TCEs
        powerpc/iommu: Remove iommu_del_device()
        powerpc/crypto: Fix aes-gcm-p10 build when VSX=n
        powerpc/bpf: populate extable entries only during the last pass
        powerpc/boot: Disable power10 features after BOOTAFLAGS assignment
        powerpc/64s/radix: Fix soft dirty tracking
        powerpc/fsl_uli1575: fix kconfig warnings and build errors
        powerpc/isa-bridge: Fix ISA mapping when "ranges" is not present
    • Merge tag 'ata-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/dlemoal/libata · 90af47ed
      Linus Torvalds authored
      Pull ata fix from Damien Le Moal:
      
       - Fix DT binding for the ahci-ceva driver to fully describe all iommus,
         from Michal
      
      * tag 'ata-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/dlemoal/libata:
        dt-bindings: ata: ahci-ceva: Cover all 4 iommus entries
    • Merge tag 'fbdev-for-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/deller/linux-fbdev · 70e137e3
      Linus Torvalds authored
      Pull fbdev fixes from Helge Deller:
       "A few small unspectacular fbdev fixes:
      
         - Fix for USB endpoint check in udlfb (found by syzbot fuzzer)
      
         - Small fix in error code path in omapfb
      
         - compiler warning fixes in fbmem & i810
      
         - code removal and whitespace cleanups in stifb and atyfb"
      
      * tag 'fbdev-for-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/deller/linux-fbdev:
        fbdev: stifb: Whitespace cleanups
        fbdev: udlfb: Use usb_control_msg_send()
        fbdev: udlfb: Fix endpoint check
        fbdev: atyfb: Remove unused clock determination
        fbdev: i810: include i810_main.h in i810_dvt.c
        fbdev: fbmem: mark get_fb_unmapped_area() static
        fbdev: omapfb: panel-tpo-td043mtea1: fix error code in probe()
    • Merge tag '6.4-rc2-ksmbd-server-fixes' of git://git.samba.org/ksmbd · e2065b8c
      Linus Torvalds authored
      Pull ksmbd server fixes from Steve French:
      
       - two fixes for incorrect SMB3 message validation (one for client which
         uses 8 byte padding, and one for empty bcc)
      
       - two fixes for out of bounds bugs: one for username offset checks (in
         session setup) and the other for create context name length checks in
         open requests
      
      * tag '6.4-rc2-ksmbd-server-fixes' of git://git.samba.org/ksmbd:
        ksmbd: smb2: Allow messages padded to 8byte boundary
        ksmbd: allocate one more byte for implied bcc[0]
        ksmbd: fix wrong UserName check in session_user
        ksmbd: fix global-out-of-bounds in smb2_find_context_vals
    • Merge tag '6.4-rc2-smb3-client-fixes' of git://git.samba.org/sfrench/cifs-2.6 · 0c9dcf12
      Linus Torvalds authored
      Pull cifs client fixes from Steve French:
       "Two smb3 client fixes, both related to deferred close, and also for
        stable:
      
         - send close for deferred handles before not after lease break
           response to avoid possible sharing violations
      
         - check all opens on an inode (looking for deferred handles) when
           lease break is returned not just the handle the lease break came in
           on"
      
      * tag '6.4-rc2-smb3-client-fixes' of git://git.samba.org/sfrench/cifs-2.6:
        SMB3: drop reference to cfile before sending oplock break
        SMB3: Close all deferred handles of inode in case of handle lease break
    • KVM: VMX: add MSR_IA32_TSX_CTRL into msrs_to_save · b9846a69
      Mingwei Zhang authored
      Add MSR_IA32_TSX_CTRL into msrs_to_save[] to explicitly tell userspace to
      save/restore the register value during migration. Omitting it may cause
      userspace that relies on the KVM_GET_MSR_INDEX_LIST ioctl to fail to port
      the value to the target VM.
      
      In addition, there is no need to add MSR_IA32_TSX_CTRL when
      ARCH_CAP_TSX_CTRL_MSR is not supported in kvm_get_arch_capabilities(). So
      add the check in kvm_probe_msr_to_save().
      
      Fixes: c11f83e0 ("KVM: vmx: implement MSR_IA32_TSX_CTRL disable RTM functionality")
      Reported-by: Jim Mattson <jmattson@google.com>
      Signed-off-by: Mingwei Zhang <mizhang@google.com>
      Reviewed-by: Xiaoyao Li <xiaoyao.li@intel.com>
      Reviewed-by: Jim Mattson <jmattson@google.com>
      Message-Id: <20230509032348.1153070-1-mizhang@google.com>
      Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
    • KVM: x86: Don't adjust guest's CPUID.0x12.1 (allowed SGX enclave XFRM) · 275a8724
      Sean Christopherson authored
      Drop KVM's manipulation of guest's CPUID.0x12.1 ECX and EDX, i.e. the
      allowed XFRM of SGX enclaves, now that KVM explicitly checks the guest's
      allowed XCR0 when emulating ECREATE.
      
      Note, this could theoretically break a setup where userspace advertises
      a "bad" XFRM and relies on KVM to provide a sane CPUID model, but QEMU
      is the only known user of KVM SGX, and QEMU explicitly sets the SGX CPUID
      XFRM subleaf based on the guest's XCR0.
      Reviewed-by: Kai Huang <kai.huang@intel.com>
      Tested-by: Kai Huang <kai.huang@intel.com>
      Signed-off-by: Sean Christopherson <seanjc@google.com>
      Message-Id: <20230503160838.3412617-3-seanjc@google.com>
      Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
    • KVM: VMX: Don't rely _only_ on CPUID to enforce XCR0 restrictions for ECREATE · ad45413d
      Sean Christopherson authored
      Explicitly check the vCPU's supported XCR0 when determining whether or not
      the XFRM for ECREATE is valid.  Checking CPUID works because KVM updates
      guest CPUID.0x12.1 to restrict the leaf to a subset of the guest's allowed
      XCR0, but that is rather subtle and KVM should not modify guest CPUID
      except for modeling true runtime behavior (allowed XFRM is most definitely
      not "runtime" behavior).
      Reviewed-by: Kai Huang <kai.huang@intel.com>
      Tested-by: Kai Huang <kai.huang@intel.com>
      Signed-off-by: Sean Christopherson <seanjc@google.com>
      Message-Id: <20230503160838.3412617-2-seanjc@google.com>
      Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
  3. 20 May, 2023 13 commits
    • fbdev: stifb: Whitespace cleanups · d9a45969
      Helge Deller authored
      Missed whitespace cleanups in stifb.
      
      Fixes: 80004257 ("fbdev: stifb: Remove trailing whitespaces")
      Signed-off-by: Helge Deller <deller@gmx.de>
      d9a45969
    • Helge Deller's avatar
      fbdev: udlfb: Use usb_control_msg_send() · 537adba4
      Helge Deller authored
      Use the newly introduced usb_control_msg_send() instead of usb_control_msg()
      when selecting the channel.
      Reviewed-by: Alan Stern <stern@rowland.harvard.edu>
      Signed-off-by: Helge Deller <deller@gmx.de>
      537adba4
    • Linus Torvalds's avatar
      Merge tag 'tty-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/tty · 0dd2a6fb
      Linus Torvalds authored
      Pull tty / serial fixes from Greg KH:
       "Here are some small tty and serial driver fixes for 6.4-rc3 to resolve
        some reported problems, and add some new device ids. These include:
      
         - termios documentation updates
      
         - vc_screen use-after-free fix
      
         - memory leak fix in arc_uart driver
      
         - new 8250 driver ids
      
         - other small serial driver fixes
      
        All of these have been in linux-next for a while with no reported
        problems"
      
      * tag 'tty-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/tty:
        vc_screen: reload load of struct vc_data pointer in vcs_write() to avoid UAF
        serial: qcom-geni: fix enabling deactivated interrupt
        serial: 8250_bcm7271: fix leak in `brcmuart_probe`
        serial: 8250_bcm7271: balance clk_enable calls
        serial: arc_uart: fix of_iomap leak in `arc_serial_probe`
        serial: 8250: Document termios parameter of serial8250_em485_config()
        serial: Add support for Advantech PCI-1611U card
        serial: 8250_exar: Add support for USR298x PCI Modems
      0dd2a6fb
    • Linus Torvalds's avatar
      Merge tag 'usb-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb · 2dd0d98d
      Linus Torvalds authored
      Pull USB / Thunderbolt fixes from Greg KH:
       "Here are some USB fixes for 6.4-rc3, as well as a driver core fix that
        resolves a memory leak that shows up in USB devices easier than other
        subsystems.
      
        Included in here are:
      
         - driver core memory leak as reported and tested by syzbot and
           developers
      
         - dwc3 driver fixes for reported problems
      
         - xhci driver fixes for reported problems
      
         - USB gadget driver reverts to resolve regressions
      
         - usbtmc driver fix for syzbot reported problem
      
         - thunderbolt driver fixes for reported issues
      
         - other small USB fixes
      
        All of these, except for the driver core fix, have been in linux-next
        with no reported problems. The driver core fix was tested and verified
        to solve the issue by syzbot and the original reporter"
      
      * tag 'usb-6.4-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb:
        driver core: class: properly reference count class_dev_iter()
        xhci: Fix incorrect tracking of free space on transfer rings
        xhci-pci: Only run d3cold avoidance quirk for s2idle
        usb-storage: fix deadlock when a scsi command timeouts more than once
        usb: dwc3: fix a test for error in dwc3_core_init()
        usb: typec: tps6598x: Fix fault at module removal
        usb: gadget: u_ether: Fix host MAC address case
        usb: typec: altmodes/displayport: fix pin_assignment_show
        Revert "usb: gadget: udc: core: Invoke usb_gadget_connect only when started"
        Revert "usb: gadget: udc: core: Prevent redundant calls to pullup"
        usb: gadget: drop superfluous ':' in doc string
        usb: dwc3: debugfs: Resume dwc3 before accessing registers
        USB: UHCI: adjust zhaoxin UHCI controllers OverCurrent bit value
        usb: dwc3: fix gadget mode suspend interrupt handler issue
        usb: dwc3: gadget: Improve dwc3_gadget_suspend() and dwc3_gadget_resume()
        USB: usbtmc: Fix direction for 0-length ioctl control messages
        thunderbolt: Clear registers properly when auto clear isn't in use
      2dd0d98d
    • Linus Torvalds's avatar
      Merge tag 'block-6.4-2023-05-20' of git://git.kernel.dk/linux · 98be58a6
      Linus Torvalds authored
      Pull block fixes from Jens Axboe:
      
       - NVMe pull request via Keith:
           - More device quirks (Sagi, Hristo, Adrian, Daniel)
           - Controller delete race (Maurizio)
           - Multipath cleanup fix (Christoph)
      
       - Deny writeable mmap mapping on a readonly block device (Loic)
      
       - Kill unused define that got introduced by accident (Christoph)
      
       - Error handling fix for s390 dasd (Stefan)
      
       - ublk locking fix (Ming)
      
      * tag 'block-6.4-2023-05-20' of git://git.kernel.dk/linux:
        block: remove NFL4_UFLG_MASK
        block: Deny writable memory mapping if block is read-only
        s390/dasd: fix command reject error on ESE devices
        nvme-pci: Add quirk for Teamgroup MP33 SSD
        ublk: fix AB-BA lockdep warning
        nvme: do not let the user delete a ctrl before a complete initialization
        nvme-multipath: don't call blk_mark_disk_dead in nvme_mpath_remove_disk
        nvme-pci: clamp max_hw_sectors based on DMA optimized limitation
        nvme-pci: add quirk for missing secondary temperature thresholds
        nvme-pci: add NVME_QUIRK_BOGUS_NID for HS-SSD-FUTURE 2048G
      98be58a6
    • Christoph Hellwig's avatar
      block: remove NFL4_UFLG_MASK · e3afec91
      Christoph Hellwig authored
      The NFL4_UFLG_MASK define slipped in in commit 9208d414
      ("block: add a ->get_unique_id method") and should never have been
      added, as NFSD, the only user of it, already has its own copy.
      Signed-off-by: Christoph Hellwig <hch@lst.de>
      Link: https://lore.kernel.org/r/20230520090010.527046-1-hch@lst.de
      Signed-off-by: Jens Axboe <axboe@kernel.dk>
      e3afec91
    • Alan Stern's avatar
      fbdev: udlfb: Fix endpoint check · ed9de4ed
      Alan Stern authored
      The syzbot fuzzer detected a problem in the udlfb driver, caused by an
      endpoint not having the expected type:
      
      usb 1-1: Read EDID byte 0 failed: -71
      usb 1-1: Unable to get valid EDID from device/display
      ------------[ cut here ]------------
      usb 1-1: BOGUS urb xfer, pipe 3 != type 1
      WARNING: CPU: 0 PID: 9 at drivers/usb/core/urb.c:504 usb_submit_urb+0xed6/0x1880
      drivers/usb/core/urb.c:504
      Modules linked in:
      CPU: 0 PID: 9 Comm: kworker/0:1 Not tainted
      6.4.0-rc1-syzkaller-00016-ga4422ff2 #0
      Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google
      04/28/2023
      Workqueue: usb_hub_wq hub_event
      RIP: 0010:usb_submit_urb+0xed6/0x1880 drivers/usb/core/urb.c:504
      ...
      Call Trace:
       <TASK>
       dlfb_submit_urb+0x92/0x180 drivers/video/fbdev/udlfb.c:1980
       dlfb_set_video_mode+0x21f0/0x2950 drivers/video/fbdev/udlfb.c:315
       dlfb_ops_set_par+0x2a7/0x8d0 drivers/video/fbdev/udlfb.c:1111
       dlfb_usb_probe+0x149a/0x2710 drivers/video/fbdev/udlfb.c:1743
      
      The current approach for this issue failed to catch the problem
      because it only checks for the existence of a bulk-OUT endpoint; it
      doesn't check whether this endpoint is the one that the driver will
      actually use.
      
      We can fix the problem by instead checking that the endpoint used by
      the driver does exist and is bulk-OUT.
      
      Reported-and-tested-by: syzbot+0e22d63dcebb802b9bc8@syzkaller.appspotmail.com
      Signed-off-by: Alan Stern <stern@rowland.harvard.edu>
      CC: Pavel Skripkin <paskripkin@gmail.com>
      Fixes: aaf7dbe0 ("video: fbdev: udlfb: properly check endpoint type")
      Signed-off-by: Helge Deller <deller@gmx.de>
      ed9de4ed
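      The difference between the two endpoint checks in the udlfb fix above
      can be sketched as follows. This is a simplified userspace model, not
      the driver's code: struct ep_desc, has_any_bulk_out() and
      used_ep_is_bulk_out() are hypothetical names, and only the two
      descriptor fields needed for the comparison are modelled.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define EP_DIR_IN    0x80 /* bit 7 of bEndpointAddress: IN direction */
#define EP_NUM_MASK  0x0f /* bits 3:0 of bEndpointAddress: endpoint number */
#define EP_XFER_MASK 0x03 /* bits 1:0 of bmAttributes: transfer type */
#define EP_XFER_BULK 0x02

/* Hypothetical, trimmed-down endpoint descriptor. */
struct ep_desc {
    uint8_t bEndpointAddress;
    uint8_t bmAttributes;
};

static bool ep_is_bulk_out(const struct ep_desc *ep)
{
    return !(ep->bEndpointAddress & EP_DIR_IN) &&
           (ep->bmAttributes & EP_XFER_MASK) == EP_XFER_BULK;
}

/* Insufficient check: passes if *some* bulk-OUT endpoint exists. */
static bool has_any_bulk_out(const struct ep_desc *eps, size_t n)
{
    for (size_t i = 0; i < n; i++)
        if (ep_is_bulk_out(&eps[i]))
            return true;
    return false;
}

/* Stricter check: the OUT endpoint the driver will use must be bulk. */
static bool used_ep_is_bulk_out(const struct ep_desc *eps, size_t n,
                                uint8_t ep_num)
{
    for (size_t i = 0; i < n; i++) {
        uint8_t addr = eps[i].bEndpointAddress;

        if ((addr & EP_NUM_MASK) == ep_num && !(addr & EP_DIR_IN))
            return ep_is_bulk_out(&eps[i]);
    }
    return false; /* the endpoint does not exist at all */
}
```

      A device that exposes a bulk-OUT endpoint at one number but an
      interrupt endpoint at the number the driver actually uses passes the
      first check and fails the second.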
    • Niklas Schnelle's avatar
      fbdev: atyfb: Remove unused clock determination · 93f57c7a
      Niklas Schnelle authored
      Just below the removed lines par->clk_wr_offset is hard coded to 3 so
      there is no use in determining a different clock just to then ignore it
      anyway. This also removes the only I/O port use remaining in the driver
      allowing it to be built without CONFIG_HAS_IOPORT.
      
      Link: https://lore.kernel.org/all/ZBx5aLo5h546BzBt@intel.com/
      Suggested-by: Ville Syrjälä <ville.syrjala@linux.intel.com>
      Signed-off-by: Niklas Schnelle <schnelle@linux.ibm.com>
      Reviewed-by: Ville Syrjälä <ville.syrjala@linux.intel.com>
      Signed-off-by: Helge Deller <deller@gmx.de>
      93f57c7a
    • Arnd Bergmann's avatar
      fbdev: i810: include i810_main.h in i810_dvt.c · f6cd4c96
      Arnd Bergmann authored
      Building with W=1 shows that a header needs to be included to
      make the prototypes visible:
      
      drivers/video/fbdev/i810/i810_dvt.c:194:6: error: no previous prototype for 'round_off_xres' [-Werror=missing-prototypes]
      drivers/video/fbdev/i810/i810_dvt.c:233:6: error: no previous prototype for 'i810fb_encode_registers' [-Werror=missing-prototypes]
      drivers/video/fbdev/i810/i810_dvt.c:245:6: error: no previous prototype for 'i810fb_fill_var_timings' [-Werror=missing-prototypes]
      drivers/video/fbdev/i810/i810_dvt.c:279:5: error: no previous prototype for 'i810_get_watermark' [-Werror=missing-prototypes]
      
      Adding the header leads to another warning from a mismatched
      prototype, so fix this as well:
      
      drivers/video/fbdev/i810/i810_dvt.c:280:5: error: conflicting types for 'i810_get_watermark'; have 'u32(struct fb_var_screeninfo *,
      Signed-off-by: Arnd Bergmann <arnd@arndb.de>
      Reviewed-by: Jani Nikula <jani.nikula@intel.com>
      Signed-off-by: Helge Deller <deller@gmx.de>
      f6cd4c96
    • Arnd Bergmann's avatar
      fbdev: fbmem: mark get_fb_unmapped_area() static · 19abb458
      Arnd Bergmann authored
      There is a global function with this name on sparc, but no
      global declaration:
      
      drivers/video/fbdev/core/fbmem.c:1469:15: error: no previous prototype for 'get_fb_unmapped_area'
      
      Make the generic definition static to avoid this warning. On
      sparc, this is never seen.
      
      Edit by Helge:
      Update Kconfig text as suggested by Geert Uytterhoeven.
      Signed-off-by: Arnd Bergmann <arnd@arndb.de>
      Signed-off-by: Helge Deller <deller@gmx.de>
      19abb458
    • Loic Poulain's avatar
      block: Deny writable memory mapping if block is read-only · 69baa3a6
      Loic Poulain authored
      A user should not be able to write to a block device that is read-only
      at the block level (e.g. via the force_ro attribute). This is enforced
      in the regular fops write operation (blkdev_write_iter) but not when
      writing through a user mapping (mmap), which lets a user write to a
      read-only block device via a PROT_WRITE mapping.
      
      For example, this can compromise the integrity of an eMMC boot
      partition (e.g. mmcblk0boot0), which is read-only by default.
      
      To fix this issue, simply deny shared writable mappings if the block
      device is read-only.
      
      Note: The block device remains writable if the switch to read-only is
      performed after the initial mapping, but this is expected behavior
      according to commit a32e236e ("Partially revert "block: fail
      op_is_write() requests to read-only partitions"").
      Signed-off-by: Loic Poulain <loic.poulain@linaro.org>
      Reviewed-by: Christoph Hellwig <hch@lst.de>
      Link: https://lore.kernel.org/r/20230510074223.991297-1-loic.poulain@linaro.org
      Signed-off-by: Jens Axboe <axboe@kernel.dk>
      69baa3a6
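      The rule introduced by the block mmap commit above can be expressed as
      a small decision function. This is a userspace sketch under stated
      assumptions, not the kernel implementation: check_bdev_mmap() and the
      MAP_FLAG_* constants are hypothetical stand-ins for the kernel's
      mapping flags, and the error code returned is this sketch's choice.

```c
#include <errno.h>
#include <stdbool.h>

/* Hypothetical stand-ins for "mapping is shared" and "may become writable". */
#define MAP_FLAG_SHARED   0x1u
#define MAP_FLAG_MAYWRITE 0x2u

/*
 * Returns 0 if the mapping may proceed, or a negative errno if it must be
 * denied. Only a mapping that is both shared and potentially writable can
 * modify the underlying device, so that is the only combination refused
 * on a read-only device. Private writable mappings are copy-on-write and
 * never reach the device.
 */
static int check_bdev_mmap(bool bdev_read_only, unsigned int map_flags)
{
    bool shared_writable = (map_flags & MAP_FLAG_SHARED) &&
                           (map_flags & MAP_FLAG_MAYWRITE);

    if (bdev_read_only && shared_writable)
        return -EACCES; /* error code chosen for illustration only */

    return 0;
}
```

      The check runs once, when the mapping is created, which is why a
      device switched to read-only after the initial mapping remains
      writable through that mapping.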
    • Linus Torvalds's avatar
      Merge tag 'drm-fixes-2023-05-20' of git://anongit.freedesktop.org/drm/drm · d635f6cc
      Linus Torvalds authored
      Pull drm fixes from Dave Airlie:
       "Regular fixes pull, amdgpu and msm make up most of these, nothing too
        serious, also one i915 and one exynos.
      
        I didn't get a misc fixes pull this week (one of the maintainers is
        off, so have to engage the backup) so I think there are a few
        outstanding patches that will show up next week,
      
        amdgpu:
         - update gfx11 clock counter logic
         - Fix a race when disabling gfxoff on gfx10/11 for profiling
         - Raven/Raven2/PCO clock counter fix
         - Add missing get_vbios_fb_size for GMC 11
         - Fix a spurious irq warning in the device remove case
         - Fix possible power mode mismatch between driver and PMFW
         - USB4 fix
      
        exynos:
         - fix build warning
      
        i915:
         - fix missing NULL check in HDCP code
      
        msm:
         - display:
            - msm8998: fix fetch and qos to align with downstream
            - msm8998: fix LM pairs to align with downstream
            - remove unused INTF0 interrupt mask on some chipsets
            - remove TE2 block from relevant chipsets
            - relocate non-MDP_TOP offset to different header
            - fix some indentation
            - fix register offsets/masks for dither blocks
            - make ping-pong block length 0
            - remove duplicated defines
            - fix log mask for writeback block
            - unregister the hdmi codec for dp during unbind
            - fix yaml warnings
         - gpu:
            - fix submit error path leak
            - arm-smmu-qcom fix for regression that broke per-process page
              tables
            - fix no-iommu crash"
      
      * tag 'drm-fixes-2023-05-20' of git://anongit.freedesktop.org/drm/drm: (29 commits)
        drm/amd/display: enable dpia validate
        drm/amd/pm: fix possible power mode mismatch between driver and PMFW
        drm/amdgpu: skip disabling fence driver src_irqs when device is unplugged
        drm/amdgpu/gmc11: implement get_vbios_fb_size()
        drm/amdgpu: Differentiate between Raven2 and Raven/Picasso according to revision id
        drm/amdgpu/gfx11: Adjust gfxoff before powergating on gfx11 as well
        drm/amdgpu/gfx10: Disable gfxoff before disabling powergating.
        drm/amdgpu/gfx11: update gpu_clock_counter logic
        drm/msm: Be more shouty if per-process pgtables aren't working
        iommu/arm-smmu-qcom: Fix missing adreno_smmu's
        drm/i915/hdcp: Check if media_gt exists
        drm/exynos: fix g2d_open/close helper function definitions
        drm/msm: Fix submit error-path leaks
        drm/msm/iommu: Fix null pointer dereference in no-IOMMU case
        dt-bindings: display/msm: dsi-controller-main: Document qcom, master-dsi and qcom, sync-dual-dsi
        drm/msm/dpu: Remove duplicate register defines from INTF
        drm/msm/dpu: Set PINGPONG block length to zero for DPU >= 7.0.0
        drm/msm/dpu: Use V2 DITHER PINGPONG sub-block in SM8[34]50/SC8280XP
        drm/msm/dpu: Fix PP_BLK_DIPHER -> DITHER typo
        drm/msm/dpu: Reindent REV_7xxx interrupt masks with tabs
        ...
      d635f6cc
    • Stefan Haberland's avatar
      s390/dasd: fix command reject error on ESE devices · c99bff34
      Stefan Haberland authored
      Formatting a thin-provisioned (ESE) device that is part of a PPRC copy
      relation might fail with the following error:
      
      dasd-eckd 0.0.f500: An error occurred in the DASD device driver, reason=09
      [...]
      24 Byte: 0 MSG 4, no MSGb to SYSOP
      
      During format of an ESE disk the Release Allocated Space command is used.
      A bit in the payload of the command is set that is not allowed to be set
      for devices in a copy relation. This bit is set to allow the partial
      release of an extent.
      
      Check for the existence of a copy relation before setting the respective
      bit.
      
      Fixes: 91dc4a19 ("s390/dasd: Add new ioctl to release space")
      Cc: stable@kernel.org # 5.3+
      Signed-off-by: Stefan Haberland <sth@linux.ibm.com>
      Reviewed-by: Jan Hoeppner <hoeppner@linux.ibm.com>
      Link: https://lore.kernel.org/r/20230519102340.3854819-2-sth@linux.ibm.com
      Signed-off-by: Jens Axboe <axboe@kernel.dk>
      c99bff34