1. 28 Jun, 2024 16 commits
  2. 17 Jun, 2024 3 commits
  3. 11 Jun, 2024 1 commit
  4. 04 Jun, 2024 4 commits
    • Nathan Lynch's avatar
      powerpc/mm/drmem: Silence drmem_init() early return · 11e6e6d8
      Nathan Lynch authored
      It's not an error or noteworthy condition if the
      "ibm,dynamic-reconfiguration-memory" node isn't present.
      
      Drop the needless message.
      Signed-off-by: default avatarNathan Lynch <nathanl@linux.ibm.com>
      Signed-off-by: default avatarMichael Ellerman <mpe@ellerman.id.au>
      Link: https://msgid.link/20240603-silence-drmem_init-v1-1-e9d71646bc3d@linux.ibm.com
      11e6e6d8
    • Haren Myneni's avatar
      powerpc/pseries/vas: Use usleep_range() to support HCALL delay · 43ac9f5c
      Haren Myneni authored
      VAS allocate, modify and deallocate HCALLs returns
      H_LONG_BUSY_ORDER_1_MSEC or H_LONG_BUSY_ORDER_10_MSEC for busy
      delay and expects OS to reissue HCALL after that delay. But using
      msleep() will often sleep at least 20 msecs even though the
      hypervisor suggests OS reissue these HCALLs after 1 or 10msecs.
      
      The open and close VAS window functions hold mutex and then issue
      these HCALLs. So these operations can take longer than the
      necessary when multiple threads issue open or close window APIs
      simultaneously, especially might affect the performance in the
      case of repeat open/close APIs for each compression request.
      
      Multiple tasks can open / close VAS windows at the same time
      which depends on the available VAS credits. For example, 240
      cores system provides 4800 VAS credits. It means 4800 tasks can
      execute open VAS windows HCALLs with the mutex. Since each
      msleep() will often sleep more than 20 msecs, some tasks are
      waiting more than 120 secs to acquire mutex. It can cause hung
      traces for these tasks in dmesg due to mutex contention around
      open/close HCALLs.
      
      Instead of msleep(), use usleep_range() to ensure sleep with
      the expected value before issuing HCALL again. So since each
      task sleep 10 msecs maximum, this patch allow more tasks can
      issue open/close VAS calls without any hung traces in the
      dmesg.
      Signed-off-by: default avatarHaren Myneni <haren@linux.ibm.com>
      Suggested-by: default avatarNathan Lynch <nathanl@linux.ibm.com>
      Reviewed-by: default avatarNathan Lynch <nathanl@linux.ibm.com>
      Signed-off-by: default avatarMichael Ellerman <mpe@ellerman.id.au>
      Link: https://msgid.link/20240116055910.421605-1-haren@linux.ibm.com
      43ac9f5c
    • Nilay Shroff's avatar
      powerpc/numa: Online a node if PHB is attached. · 11981816
      Nilay Shroff authored
      In the current design, a numa-node is made online only if that node is
      attached to cpu/memory. With this design, if any PCI/IO device is found
      to be attached to a numa-node which is not online then the numa-node
      id of the corresponding PCI/IO device is set to NUMA_NO_NODE(-1). This
      design may negatively impact the performance of PCIe device if the
      numa-node assigned to PCIe device is -1 because in such case we may not
      be able to accurately calculate the distance between two nodes.
      
      The multi-controller NVMe PCIe disk has an issue with calculating the
      node distance if the PCIe NVMe controller is attached to a PCI host
      bridge which has numa-node id value set to NUMA_NO_NODE. This patch
      helps fix this ensuring that a cpu/memory less numa node is made online
      if it's attached to PCI host bridge.
      Signed-off-by: default avatarNilay Shroff <nilay@linux.ibm.com>
      Reviewed-by: default avatarSrikar Dronamraju <srikar@linux.ibm.com>
      Signed-off-by: default avatarMichael Ellerman <mpe@ellerman.id.au>
      Link: https://msgid.link/20240517142531.3273464-3-nilay@linux.ibm.com
      11981816
    • Gaurav Batra's avatar
      powerpc/pseries/iommu: Split Dynamic DMA Window to be used in Hybrid mode · ff5163bb
      Gaurav Batra authored
      Dynamic DMA Window (DDW) supports TCEs that are backed by 2MB page
      size. In most configurations, DDW is big enough to pre-map all of LPAR
      memory for IO. Pre-mapping of memory for DMA results in improvements in
      IO performance.
      
      Persistent memory, vPMEM, can be assigned to an LPAR as well. vPMEM is
      not contiguous with LPAR memory and usually is assigned at high memory
      addresses.  This makes is not possible to pre-map both vPMEM and LPAR
      memory in the same DDW.
      
      For a dedicated adapter this limitation is not an issue. Dedicated
      adapters can have both Default DMA window, which is backed by 4K page
      size and a DDW backed by 2MB page size TCEs. In this scenario, LPAR
      memory is pre-mapped in the DDW.  Any DMA going to the vPMEM is routed
      via dynamically allocated TCEs in the default window.
      
      The issue arises with SR-IOV adapters. There is only one DMA window -
      either Default or DDW. If an LPAR has vPMEM assigned, memory is not
      pre-mapped in the DDW since TCEs needs to be allocated for vPMEM as well.
      In this case, DDW is created and TCEs are dynamically allocated for both
      vPMEM and LPAR memory.
      
      Today, DDW is only used in single mode - direct mapped TCEs or
      dynamically mapped TCEs. This enhancement breaks a single DDW in 2
      regions -
      
      	1. First region to pre-map LPAR memory
      	2. Second region to dynamically allocate TCEs for IO to vPMEM
      
      The DDW is split only if it is big enough to pre-map complete LPAR
      memory and still have some space left to dynamically map vPMEM. Maximum
      size possible DDW is created as permitted by the Hypervisor.
      Signed-off-by: default avatarGaurav Batra <gbatra@linux.ibm.com>
      Reviewed-by: default avatarBrian King <brking@linux.vnet.ibm.com>
      Signed-off-by: default avatarMichael Ellerman <mpe@ellerman.id.au>
      Link: https://msgid.link/20240514014608.35537-1-gbatra@linux.ibm.com
      ff5163bb
  5. 03 Jun, 2024 1 commit
  6. 02 Jun, 2024 8 commits
  7. 01 Jun, 2024 6 commits
  8. 31 May, 2024 1 commit
    • Linus Torvalds's avatar
      Merge tag 'drm-fixes-2024-06-01' of https://gitlab.freedesktop.org/drm/kernel · cc8ed4d0
      Linus Torvalds authored
      Pull drm fixes from Dave Airlie:
       "This is the weekly fixes. Lots of small fixes across the board, one
        BUG_ON fix in shmem seems most important, otherwise amdgpu, i915, xe
        mostly with small fixes to all the other drivers.
      
        shmem:
         - fix BUG_ON in COW handling
         - warn when trying to pin imported objects
      
        buddy:
         - fix page size handling
      
        dma-buf:
         - sw-sync: Don't interfere with IRQ handling
         - fix kthreads-handling error path
      
        i915:
         - fix a race in audio component by registering it later
         - make DPT object unshrinkable to avoid shrinking when framebuffer
           has not shrunk
         - fix CCS id calculation to fix a perf regression
         - fix selftest caching mode
         - fix FIELD_PREP compiler warnings
         - fix indefinite wait for GT wakeref release
         - revert overeager multi-gt pm reference removal
      
        xe:
         - pcode polling timeout change
         - fix for deadlocks for faulting VMs
         - error-path lock imbalance fix
      
        amdgpu:
         - RAS fix
         - fix colorspace property for MST connectors
         - fix for PCIe DPM
         - silence UBSAN warning
         - GPUVM robustness fix
         - partition fix
         - drop deprecated I2C_CLASS_SPD
      
        amdkfd:
         - revert unused changes for certain 11.0.3 devices
         - simplify APU VRAM handling
      
        lima:
         - fix dma_resv-related deadlock in object pin
      
        msm:
         - remove build-time dependency on Python 3.9
      
        nouveau:
         - nvif: Fix possible integer overflow
      
        panel:
         - lg-sw43408: Select DP helpers; Declare backlight ops as static
         - sitronix-st7789v: Various fixes for jt240mhqs_hwt_ek_e3 panel
      
        panfrost:
         - fix dma_resv-related deadlock in object pin"
      
      * tag 'drm-fixes-2024-06-01' of https://gitlab.freedesktop.org/drm/kernel: (35 commits)
        drm/msm: remove python 3.9 dependency for compiling msm
        drm/panel: sitronix-st7789v: fix display size for jt240mhqs_hwt_ek_e3 panel
        drm/panel: sitronix-st7789v: tweak timing for jt240mhqs_hwt_ek_e3 panel
        drm/panel: sitronix-st7789v: fix timing for jt240mhqs_hwt_ek_e3 panel
        drm/amd/pm: remove deprecated I2C_CLASS_SPD support from newly added SMU_14_0_2
        drm/amdgpu: Make CPX mode auto default in NPS4
        drm/amdkfd: simplify APU VRAM handling
        Revert "drm/amdkfd: fix gfx_target_version for certain 11.0.3 devices"
        drm/amdgpu: fix dereference null return value for the function amdgpu_vm_pt_parent
        drm/amdgpu: silence UBSAN warning
        drm/amdgpu: Adjust logic in amdgpu_device_partner_bandwidth()
        drm/i915: Fix audio component initialization
        drm/i915/dpt: Make DPT object unshrinkable
        drm/i915/gt: Fix CCS id's calculation for CCS mode setting
        drm/panel/lg-sw43408: mark sw43408_backlight_ops as static
        drm/i915/selftests: Set always_coherent to false when reading from CPU
        drm/panel/lg-sw43408: select CONFIG_DRM_DISPLAY_DP_HELPER
        drm/i915/guc: avoid FIELD_PREP warning
        drm/i915/gt: Disarm breadcrumbs if engines are already idle
        Revert "drm/i915: Remove extra multi-gt pm-references"
        ...
      cc8ed4d0