1. 19 Jul, 2024 14 commits
    • Merge branch 'pci/controller/artpec6' · 43f25adf
      Bjorn Helgaas authored
      - Rename the artpec6_pcie_cpu_addr_fixup() parameter, which holds a CPU
        address but was called "pci_addr" (Niklas Cassel)
      
      * pci/controller/artpec6:
        PCI: artpec6: Fix artpec6_pcie_cpu_addr_fixup() parameter name
    • Merge branch 'pci/controller/al' · e38de94e
      Bjorn Helgaas authored
      - Check IORESOURCE_BUS existence to avoid NULL pointer dereference
        (Aleksandr Mishin)
      
      * pci/controller/al:
        PCI: al: Check IORESOURCE_BUS existence during probe
    • Merge branch 'pci/controller/dwc' · 37853932
      Bjorn Helgaas authored
      - Use msleep() in DWC core instead of usleep_range() for ~100 ms sleep
        (Konrad Dybcio)
      
      - Fix iATU slot management to avoid using the wrong slot after PERST#
        assert/deassert, which could cause DMA to go to the wrong place
        (Frank Li)
      
      - Consolidate dw_pcie_prog_outbound_atu() arguments into a struct to ease
        adding new functionality like initiating Message TLPs (Yoshihiro Shimoda)
      
      - Add support for endpoints to initiate PCIe messages (Yoshihiro Shimoda)
      
      - Add #defines for PCIe INTx messages (Yoshihiro Shimoda)
      
      - Add support for endpoints to initiate PCIe PME_Turn_Off messages for
        system suspend (Frank Li)
      
      - Add dw_pcie_ep_linkdown() to reinitialize registers that are lost when
        the link goes down (Manivannan Sadhasivam)
      
      - Use dw_pcie_ep_linkdown() to reinitialize qcom non-sticky registers that
        are lost when the link goes down (Manivannan Sadhasivam)
      
      - Enforce the DWC limitation that a 64-bit BAR must start at an
        even-numbered BAR (Niklas Cassel)
      
      * pci/controller/dwc:
        PCI: dwc: ep: Enforce DWC specific 64-bit BAR limitation
        PCI: layerscape-ep: Use the generic dw_pcie_ep_linkdown() API to handle Link Down event
        PCI: qcom-ep: Use the generic dw_pcie_ep_linkdown() API to handle Link Down event
        PCI: dwc: ep: Remove dw_pcie_ep_init_notify() wrapper
        PCI: dwc: ep: Add a generic dw_pcie_ep_linkdown() API to handle Link Down event
        PCI: dwc: Add generic MSG TLP support for sending PME_Turn_Off when system suspend
        PCI: Add PCIE_MSG_CODE_PME_TURN_OFF message macro
        PCI: Add PCIE_MSG_CODE_ASSERT_INTx message macros
        PCI: dwc: Add outbound MSG TLPs support
        PCI: dwc: Consolidate args of dw_pcie_prog_outbound_atu() into a structure
        PCI: dwc: Fix index 0 incorrectly being interpreted as a free ATU slot
        PCI: dwc: Use msleep() in dw_pcie_wait_for_link()
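
The 64-bit BAR restriction listed in the summary above can be illustrated with a small standalone check. This is only a sketch: `toy_bar64_start_ok()` and `TOY_NUM_BARS` are hypothetical names, not the DWC driver's code. It relies on two general PCI facts: a type 0 header has six BARs, and a 64-bit BAR occupies two consecutive BAR slots.

```c
#include <stdbool.h>

/* A PCI type 0 configuration header exposes six BARs, numbered 0..5. */
#define TOY_NUM_BARS 6

/*
 * Hypothetical helper, not the driver's code: a 64-bit BAR occupies
 * slots n and n+1, and the restriction described above additionally
 * requires n to be even, so only BARs 0, 2 and 4 qualify.
 */
static bool toy_bar64_start_ok(unsigned int bar)
{
	return bar < TOY_NUM_BARS - 1 && (bar % 2) == 0;
}
```

With this model, a request to place a 64-bit BAR at BAR 1, 3 or 5 is rejected up front instead of silently misprogramming the hardware.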
    • Merge branch 'pci/controller/gpio' · 35f0c94a
      Bjorn Helgaas authored
      - Include <linux/irqchip/chained_irq.h> in dra7xx to avoid implicitly
        including it elsewhere (Andy Shevchenko)
      
      - Remove unused <linux/of_gpio.h> from aardvark and dwc drivers (dra7xx,
        meson, qcom, tegra194) (Andy Shevchenko)
      
      - Convert kirin to use scoped for_each_available_child_of_node() to ease
        future error exits (Javier Carrasco)
      
      - Convert imx6 and kirin to use the agnostic GPIO API to simplify GPIO
        setup and remove usage of the deprecated of_gpio.h API (Andy Shevchenko)
      
      * pci/controller/gpio:
        PCI: kirin: Convert to use agnostic GPIO API
        PCI: kirin: Convert kirin_pcie_parse_port() to scoped iterator
        PCI: imx6: Convert to use agnostic GPIO API
        PCI: dwc: Remove unused of_gpio.h inclusion
        PCI: aardvark: Remove unused of_gpio.h inclusion
        PCI: dra7xx: Add missing chained IRQ header inclusion
    • Merge branch 'pci/endpoint' · 0f74d898
      Bjorn Helgaas authored
      - Remove unused struct pci_epf_group.type_group (Christophe JAILLET)
      
      - Use cached epc_features instead of pci_epc_get_features() to avoid having
        to check for failure (potential NULL pointer dereference) (Manivannan
        Sadhasivam)
      
      - Drop pointless local msix_capable variable in pci_epf_test_alloc_space()
        (Manivannan Sadhasivam)
      
      - Rename struct pci_epc_event_ops.core_init to .epc_init, since "core" is
        no longer meaningful here (Manivannan Sadhasivam)
      
      - Rename pci_epc_bme_notify() and pci_epf_mhi_bme() to spell out
        "bus_master_enable" instead of "bme" (Manivannan Sadhasivam)
      
      - Factor pci_epf_test_clear_bar() and pci_epf_test_free_space() out of
        pci_epf_test_unbind() so they can be reused elsewhere (Manivannan
        Sadhasivam)
      
      - Move DMA initialization to the pci_epf_mhi_epc_init() callback so
        endpoint drivers do this uniformly (Manivannan Sadhasivam)
      
      - Add endpoint testing for Link Down events (Manivannan Sadhasivam)
      
      - Add 'epc_deinit' event so endpoints that can be reset via PERST# (qcom,
        tegra194) can notify EPF drivers when this happens (Manivannan
        Sadhasivam)
      
      - Make pci_epc_class constant (Greg Kroah-Hartman)
      
      - Fix vpci_scan_bus() error checking to print error for failure (not
        success) and clean up after failure (Dan Carpenter)
      
      - Fix epf_ntb_epc_cleanup() error handling to clean up scratchpad BARs and
        clean up in mirror order of allocation (Dan Carpenter)
      
      - Add rk3588, which requires 64KB BAR alignment, to pci_endpoint_test
        (Niklas Cassel)
      
      - Use memcpy_toio()/memcpy_fromio() for endpoint BAR tests to improve
        performance (Niklas Cassel)
      
      - Set DMA mask to 48 bits always to simplify endpoint test, since there's
        no need to check for error or to fall back to 32 bits (Frank Li)
      
      - Suggest using programmable Vendor/Device ID (when supported) to use
        pci_endpoint_test without having to add new entries (Yoshihiro Shimoda)
      
      - Remove unused pci_endpoint_test_bar_{readl,writel}() (Jiapeng Chong)
      
      - Remove 'linkup' and add 'add_cfs' to the endpoint function driver 'ops'
        documentation to match the code (Alexander Stein)
      
      * pci/endpoint:
        Documentation: PCI: pci-endpoint: Fix EPF ops list
        misc: pci_endpoint_test: Remove unused pci_endpoint_test_bar_{readl,writel} functions
        misc: pci_endpoint_test: Document policy about adding pci_device_id
        misc: pci_endpoint_test: Refactor dma_set_mask_and_coherent() logic
        misc: pci_endpoint_test: Use memcpy_toio()/memcpy_fromio() for BAR tests
        misc: pci_endpoint_test: Add support for Rockchip rk3588
        PCI: endpoint: Fix error handling in epf_ntb_epc_cleanup()
        PCI: endpoint: Clean up error handling in vpci_scan_bus()
        PCI: endpoint: Make pci_epc_class struct constant
        PCI: endpoint: Introduce 'epc_deinit' event and notify the EPF drivers
        PCI: endpoint: pci-epf-test: Handle Link Down event
        PCI: endpoint: pci-epf-{mhi/test}: Move DMA initialization to EPC init callback
        PCI: endpoint: pci-epf-test: Refactor pci_epf_test_unbind() function
        PCI: endpoint: Rename BME to Bus Master Enable
        PCI: endpoint: Rename core_init() callback in 'struct pci_epc_event_ops' to epc_init()
        PCI: endpoint: pci-epf-test: Use 'msix_capable' flag directly in pci_epf_test_alloc_space()
        PCI: endpoint: pci-epf-test: Make use of cached 'epc_features' in pci_epf_test_core_init()
        PCI: endpoint: Remove unused field in struct pci_epf_group
    • Merge branch 'pci/dt-bindings' · 7095d21e
      Bjorn Helgaas authored
      - Add "apb", "sys", "pmc", "msg", "err" for Endpoint descriptions as well
        as for Root Complexes (Niklas Cassel)
      
      - Add "tx_inta", "tx_intb", "tx_intc", "tx_intd" for interrupt signals
        triggered in response to PCIe Assert_INTx messages (Niklas Cassel)
      
      - Refactor rockchip-dw-pcie binding to move generic properties to a new
        rockchip-dw-pcie-common binding that can be shared by both RC and EP mode
        (Niklas Cassel)
      
      - Fix rockchip-dw-pcie description of INTx signals (Niklas Cassel)
      
      - Add rockchip-dw-pcie description of Endpoint controller (Niklas Cassel)
      
      - Avoid xilinx-versal-cpm overlapping of bridge registers and 32-bit BAR
        addresses (Thippeswamy Havalige)
      
      - Add qcom Operating Performance Points (OPP) table (Krishna chaitanya
        chundru)
      
      - Add a picture of mediatek,mt7621-pcie topology (Sergio Paracuellos)
      
      - Add a generic "ats-supported" property so the OS can discover whether a
        Root Complex supports ATS (Jean-Philippe Brucker)
      
      - Make the qcom,pcie-x1e80100 MHI register region mandatory (Abel Vesa)
      
      * pci/dt-bindings:
        dt-bindings: PCI: qcom: x1e80100: Make the MHI reg region mandatory
        dt-bindings: PCI: generic: Add ats-supported property
        dt-bindings: PCI: mediatek,mt7621-pcie: Add PCIe host topology ASCII graph
        dt-bindings: PCI: qcom: Add OPP table
        dt-bindings: PCI: xilinx-cpm: Fix overlapping of bridge register and 32-bit BAR addresses
        dt-bindings: PCI: rockchip: Add DesignWare based PCIe Endpoint controller
        dt-bindings: PCI: rockchip-dw-pcie: Fix description of legacy IRQ
        dt-bindings: PCI: rockchip-dw-pcie: Prepare for Endpoint mode support
        dt-bindings: PCI: snps,dw-pcie-ep: Add tx_int{a,b,c,d} legacy IRQs
        dt-bindings: PCI: snps,dw-pcie-ep: Add vendor specific interrupt-names
        dt-bindings: PCI: snps,dw-pcie-ep: Add vendor specific reg-name
    • Merge branch 'pci/resource' · 65d8f684
      Bjorn Helgaas authored
      - Rename find_resource() to find_resource_space() to make it more
        descriptive for exporting outside resource.c (Ilpo Järvinen)
      
      - Document find_resource_space() and the resource_constraint struct it uses
        (Ilpo Järvinen)
      
      - Add typedef resource_alignf to make it simpler to declare allocation
        constraint alignf callbacks (Ilpo Järvinen)
      
      - Open-code the no-constraint simple alignment case to make the
        simple_align_resource() default callback unnecessary (Ilpo Järvinen)
      
      - Export find_resource_space() because PCI bridge window allocation needs
        to learn whether there's space for a window (Ilpo Järvinen)
      
      - Fix a double-counting problem in PCI calculate_memsize() that led to
        allocating larger windows each time a bus was removed and rescanned (Ilpo
        Järvinen)
      
      - When we don't have space to allocate larger bridge windows, allocate
        windows only large enough for the downstream devices to prevent cases
        where a device worked originally, but not after being removed and
        re-added (Ilpo Järvinen)
      
      * pci/resource:
        PCI: Relax bridge window tail sizing rules
        PCI: Make minimum bridge window alignment reference more obvious
        PCI: Fix resource double counting on remove & rescan
        resource: Export find_resource_space()
        resource: Handle simple alignment inside __find_resource_space()
        resource: Use typedef for alignf callback
        resource: Document find_resource_space() and resource_constraint
        resource: Rename find_resource() to find_resource_space()
    • Merge branch 'pci/reset' · 62281339
      Bjorn Helgaas authored
      - Warn about doing a Secondary Bus Reset without holding the device lock
        (Dan Williams)
      
      - Lock bridge in addition to downstream hierarchy before doing a Secondary
        Bus Reset (Dan Williams)
      
      * pci/reset:
        PCI: Add missing bridge lock to pci_bus_lock()
        PCI: Warn on missing cfg_access_lock during secondary bus reset
    • Merge branch 'pci/hotplug' · 675ba773
      Bjorn Helgaas authored
      - Detect if a device was removed or replaced during system sleep so we
        don't assume a new device is the one that used to be there.  This uses
        Vendor/Device/Subsystem/Class/Revision and Device Serial Number (if
        implemented), so it's not fool-proof and drivers may know how to detect
        more cases (Lukas Wunner)
      
      - Add missing MODULE_DESCRIPTION() macro (Jeff Johnson)
      
      * pci/hotplug:
        PCI: acpiphp: Add missing MODULE_DESCRIPTION() macro
        PCI: pciehp: Detect device replacement during system sleep
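
The replacement check described in the summary above compares the identity a device had before sleep with what is found after resume. A minimal sketch of that comparison follows, assuming a hypothetical `struct toy_identity` that holds the fields named in the summary. It is not the pciehp implementation, and treating a change in Device Serial Number presence as a replacement is an assumption of this sketch.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical snapshot of the identity fields named in the summary. */
struct toy_identity {
	uint16_t vendor;
	uint16_t device;
	uint16_t subsystem_vendor;
	uint16_t subsystem_device;
	uint32_t class_rev;   /* class code and revision ID */
	bool has_dsn;         /* Device Serial Number capability present */
	uint64_t dsn;         /* only meaningful when has_dsn is true */
};

/*
 * Return true if the device seen after resume cannot be the one seen
 * before suspend.  A false result is not proof of identity: two units
 * of the same model without a serial number look the same here.
 */
static bool toy_device_replaced(const struct toy_identity *before,
				const struct toy_identity *after)
{
	if (before->vendor != after->vendor ||
	    before->device != after->device ||
	    before->subsystem_vendor != after->subsystem_vendor ||
	    before->subsystem_device != after->subsystem_device ||
	    before->class_rev != after->class_rev)
		return true;

	/* Assumption of this sketch: a change in DSN presence counts. */
	if (before->has_dsn != after->has_dsn)
		return true;

	return before->has_dsn && before->dsn != after->dsn;
}
```

The false-negative case in the comment is why the summary calls the check "not fool-proof": drivers with device-specific knowledge may be able to catch swaps that these generic fields miss.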
    • Merge branch 'pci/err' · 52490480
      Bjorn Helgaas authored
      - Disable AER and DPC during suspend so that if they share an interrupt
        with PME and errors occur during suspend, the AER or DPC interrupt
        doesn't cause spurious wakeups (Kai-Heng Feng)
      
      * pci/err:
        PCI/DPC: Disable DPC service on suspend
        PCI/AER: Disable AER service on suspend
    • Merge branch 'pci/enumeration' · 903a3b1e
      Bjorn Helgaas authored
      - Move the PRESERVE_BOOT_CONFIG ACPI _DSM evaluation from drivers/acpi to
        drivers/pci so we can unify with similar DT functionality (Vidya Sagar)
      
      - Add of_pci_preserve_config() to check for a DT "linux,pci-probe-only"
        property on a per-host bridge basis in addition to a global basis (Vidya
        Sagar)
      
      - Unify ACPI PRESERVE_BOOT_CONFIG _DSM and DT "linux,pci-probe-only" in a
        generic pci_preserve_config() path (Vidya Sagar)
      
      * pci/enumeration:
        PCI: Use preserve_config in place of pci_flags
        PCI: Unify ACPI and DT 'preserve config' support
        PCI: of: Add of_pci_preserve_config() for per-host bridge support
        PCI: Move PRESERVE_BOOT_CONFIG _DSM evaluation to pci_register_host_bridge()
    • Merge branch 'pci/dpc' · 147ea50e
      Bjorn Helgaas authored
      - If there's a device below a bridge, prevent a use-after-free by holding a
        reference to the device while waiting for the secondary bus to be ready
        in case the device is concurrently removed, e.g., by DPC (Lukas Wunner)
      
      * pci/dpc:
        PCI/DPC: Fix use-after-free on concurrent DPC and hot-removal
    • Merge branch 'pci/devres' · 06bbe25c
      Bjorn Helgaas authored
      - Add pcim_add_mapping_to_legacy_table() and
        pcim_remove_mapping_from_legacy_table() helper functions to simplify
        devres iomap table (Philipp Stanner)
      
      - Reimplement devres that take a bit mask of BARs in a way that can be used
        to map partial BARs as well as entire BARs (Philipp Stanner)
      
      - Deprecate pcim_iomap_table() and pcim_iomap_regions_request_all() in
        favor of pcim_* request plus pcim_* mapping (Philipp Stanner)
      
      - Add pcim_request_region(), a managed interface to request a single BAR
        (Philipp Stanner)
      
      - Use the existing pci_is_enabled() interface to replace the struct
        devres.enabled bit (Philipp Stanner)
      
      - Move the struct pci_devres.pinned bit to struct pci_dev (Philipp Stanner)
      
      - Reimplement pcim_set_mwi() so it uses its own devres cleanup callback
        instead of a special-purpose bit in struct pci_devres (Philipp Stanner)
      
      - Add pcim_intx(), which is unambiguously managed, unlike pci_intx(), which
        is managed if pcim_enable_device() has been called but unmanaged
        otherwise (Philipp Stanner)
      
      - Remove pcim_release(), which is no longer needed after previous cleanups
        of pcim_set_mwi() and pci_intx() (Philipp Stanner)
      
      - Add pcim_iomap_range(), a managed interface to map part of a BAR (Philipp
        Stanner)
      
      - Fix vboxvideo leak by using the new pcim_iomap_range() instead of the
        unmanaged pci_iomap_range() (Philipp Stanner)
      
      * pci/devres:
        drm/vboxvideo: fix mapping leaks
        PCI: Add managed pcim_iomap_range()
        PCI: Remove legacy pcim_release()
        PCI: Add managed pcim_intx()
        PCI: Give pcim_set_mwi() its own devres cleanup callback
        PCI: Move struct pci_devres.pinned bit to struct pci_dev
        PCI: Remove struct pci_devres.enabled status bit
        PCI: Document hybrid devres hazards
        PCI: Add managed pcim_request_region()
        PCI: Deprecate pcim_iomap_table(), pcim_iomap_regions_request_all()
        PCI: Add managed partial-BAR request and map infrastructure
        PCI: Add devres helpers for iomap table
        PCI: Add and use devres helper for bit masks
    • Merge branch 'pci/acs' · cb43487e
      Bjorn Helgaas authored
      - Add ACS quirk for Broadcom BCM5760X NIC, which doesn't allow peer-to-peer
        transactions between functions, but doesn't advertise ACS support (Ajit
        Khaparde)
      
      - Add "pci=config_acs=" kernel command-line parameter to relax default ACS
        settings to enable peer-to-peer configurations.  Requires expert
        knowledge of topology and ACS operation (Vidya Sagar)
      
      * pci/acs:
        PCI: Extend ACS configurability
        PCI: Add ACS quirk for Broadcom BCM5760X NIC
  2. 12 Jul, 2024 2 commits
    • PCI: Extend ACS configurability · 47c8846a
      Vidya Sagar authored
      PCIe ACS settings control the level of isolation and the possible P2P paths
      between devices. With greater isolation the kernel will create smaller
      iommu_groups and with less isolation there is more HW that can achieve P2P
      transfers. From a virtualization perspective all devices in the same
      iommu_group must be assigned to the same VM as they lack security
      isolation.
      
      There is no way for the kernel to automatically know the correct ACS
      settings for any given system and workload. Existing command line options
      (e.g., disable_acs_redir) allow only for large scale change, disabling all
      isolation, but this is not sufficient for more complex cases.
      
      Add a kernel command-line option 'config_acs' to directly control all the
      ACS bits for specific devices, which allows the operator to set up the right
      level of isolation to achieve the desired P2P configuration.  The
      definition is future proof; when new ACS bits are added to the spec the
      open syntax can be extended.
      
      ACS needs to be set up early in the kernel boot as the ACS settings affect
      how iommu_groups are formed. iommu_group formation is a one time event
      during initial device discovery, so changing ACS bits after kernel boot can
      result in an inaccurate view of the iommu_groups compared to the current
      isolation configuration.
      
      ACS applies to PCIe Downstream Ports and multi-function devices.  The
      default ACS settings are strict and deny any direct traffic between two
      functions. This results in the smallest iommu_group the HW can support.
      Frequently these values result in slow or non-working P2PDMA.
      
      ACS offers a range of security choices controlling how traffic is
      allowed to go directly between two devices. Some popular choices:
      
        - Full prevention
      
        - Translated requests can be direct, with various options
      
        - Asymmetric direct traffic, A can reach B but not the reverse
      
        - All traffic can be direct
      
      Along with some other less common ones for special topologies.
      
      The intention is that this option would be used with expert knowledge of
      the HW capability and workload to achieve the desired configuration.
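
To make "directly control all the ACS bits" concrete, here is a minimal standalone sketch of one way a per-bit request could be encoded and applied. The commit message above does not spell out the option's syntax, so everything here is an assumption for illustration: the tri-state string ('1' force on, '0' force off, 'x' leave unchanged, rightmost character is bit 0) and the names `struct acs_request`, `parse_acs_flags()` and `apply_acs_request()` are hypothetical and are not the kernel's parser.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical representation of an operator request: which ACS control
 * bits to force, and the state to force them to.
 */
struct acs_request {
	uint16_t mask;   /* bits the operator wants to force */
	uint16_t value;  /* forced state of those bits */
};

/*
 * Parse an assumed tri-state string such as "10x".  The rightmost
 * character describes bit 0.  Returns 0 on success, -1 if the string
 * is empty, longer than 16 characters, or contains other characters.
 */
static int parse_acs_flags(const char *s, struct acs_request *req)
{
	size_t len = 0;

	while (s[len] != '\0')
		len++;
	if (len == 0 || len > 16)
		return -1;

	req->mask = 0;
	req->value = 0;
	for (size_t i = 0; i < len; i++) {
		unsigned int bit = (unsigned int)(len - 1 - i);

		switch (s[i]) {
		case '1':
			req->mask |= (uint16_t)(1u << bit);
			req->value |= (uint16_t)(1u << bit);
			break;
		case '0':
			req->mask |= (uint16_t)(1u << bit);
			break;
		case 'x':
			break;  /* leave this bit as firmware set it */
		default:
			return -1;
		}
	}
	return 0;
}

/* Apply a request to a current control value; 'x' bits are preserved. */
static uint16_t apply_acs_request(uint16_t ctrl, const struct acs_request *req)
{
	return (uint16_t)((ctrl & ~req->mask) | req->value);
}
```

Keeping a separate mask and value is what lets unspecified bits keep whatever power-up or firmware state they had, which matters when only one or two isolation properties need to change for a given device.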
      
      Link: https://lore.kernel.org/r/20240625153150.159310-1-vidyas@nvidia.com
      Signed-off-by: Vidya Sagar <vidyas@nvidia.com>
      [bhelgaas: add example, tidy printk formats]
      Signed-off-by: Bjorn Helgaas <bhelgaas@google.com>
    • PCI: Add missing bridge lock to pci_bus_lock() · a4e77289
      Dan Williams authored
      One of the true positives that the cfg_access_lock lockdep effort
      identified is this sequence:
      
        WARNING: CPU: 14 PID: 1 at drivers/pci/pci.c:4886 pci_bridge_secondary_bus_reset+0x5d/0x70
        RIP: 0010:pci_bridge_secondary_bus_reset+0x5d/0x70
        Call Trace:
         <TASK>
         ? __warn+0x8c/0x190
         ? pci_bridge_secondary_bus_reset+0x5d/0x70
         ? report_bug+0x1f8/0x200
         ? handle_bug+0x3c/0x70
         ? exc_invalid_op+0x18/0x70
         ? asm_exc_invalid_op+0x1a/0x20
         ? pci_bridge_secondary_bus_reset+0x5d/0x70
         pci_reset_bus+0x1d8/0x270
         vmd_probe+0x778/0xa10
         pci_device_probe+0x95/0x120
      
      Where pci_reset_bus() users are triggering unlocked secondary bus resets.
      Ironically pci_bus_reset(), several calls down from pci_reset_bus(), uses
      pci_bus_lock() before issuing the reset which locks everything *but* the
      bridge itself.
      
      For the same motivation as adding:
      
        bridge = pci_upstream_bridge(dev);
        if (bridge)
          pci_dev_lock(bridge);
      
      to pci_reset_function() for the "bus" and "cxl_bus" reset cases, add
      pci_dev_lock() for @bus->self to pci_bus_lock().
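
The resulting locking order can be illustrated with a toy model. This is a sketch under stated assumptions, not the kernel code: `toy_dev`, `toy_bus`, `toy_dev_lock()` and `toy_bus_lock()` are hypothetical stand-ins, a boolean replaces the real device lock, and each bridge's subordinate bus is assumed to point back to that bridge through `self`.

```c
#include <stdbool.h>
#include <stddef.h>

/* Toy stand-ins for struct pci_dev / struct pci_bus; not kernel types. */
struct toy_bus;

struct toy_dev {
	bool locked;
	struct toy_bus *subordinate; /* non-NULL if this device is a bridge */
};

struct toy_bus {
	struct toy_dev *self;        /* bridge leading to this bus, or NULL */
	struct toy_dev **devices;    /* devices sitting on this bus */
	size_t ndevices;
};

static void toy_dev_lock(struct toy_dev *dev)
{
	dev->locked = true;
}

static void toy_bus_lock(struct toy_bus *bus)
{
	/* The step this commit adds: lock the bridge itself first. */
	if (bus->self)
		toy_dev_lock(bus->self);

	for (size_t i = 0; i < bus->ndevices; i++) {
		struct toy_dev *dev = bus->devices[i];

		if (dev->subordinate)
			/* The nested call locks dev as that bus's bridge. */
			toy_bus_lock(dev->subordinate);
		else
			toy_dev_lock(dev);
	}
}
```

The if/else in the loop means a bridge is locked exactly once, by the nested call for its subordinate bus. Locking it in the loop as well would take the same lock twice, which is the kind of recursive locking the squashed fix mentioned in the trailers addresses.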
      
      Link: https://lore.kernel.org/r/171711747501.1628941.15217746952476635316.stgit@dwillia2-xfh.jf.intel.com
      Reported-by: Imre Deak <imre.deak@intel.com>
      Closes: http://lore.kernel.org/r/6657833b3b5ae_14984b29437@dwillia2-xfh.jf.intel.com.notmuch
      Signed-off-by: Dan Williams <dan.j.williams@intel.com>
      Signed-off-by: Keith Busch <kbusch@kernel.org>
      [bhelgaas: squash in recursive locking deadlock fix from Keith Busch:
      https://lore.kernel.org/r/20240711193650.701834-1-kbusch@meta.com]
      Signed-off-by: Bjorn Helgaas <bhelgaas@google.com>
      Tested-by: Hans de Goede <hdegoede@redhat.com>
      Tested-by: Kalle Valo <kvalo@kernel.org>
      Reviewed-by: Dave Jiang <dave.jiang@intel.com>
  3. 11 Jul, 2024 4 commits
  4. 10 Jul, 2024 9 commits
  5. 09 Jul, 2024 11 commits