1. 08 Nov, 2023 6 commits
    • Hengqi Chen's avatar
      LoongArch: Add more instruction opcodes and emit_* helpers · add28024
      Hengqi Chen authored
      This patch adds more instruction opcodes and their corresponding emit_*
      helpers which will be used in later patches.
      Signed-off-by: default avatarHengqi Chen <hengqi.chen@gmail.com>
      Signed-off-by: default avatarHuacai Chen <chenhuacai@loongson.cn>
      add28024
    • Huacai Chen's avatar
      LoongArch/smp: Call rcutree_report_cpu_starting() earlier · a2ccf463
      Huacai Chen authored
      rcutree_report_cpu_starting() must be called before cpu_probe() to avoid
      the following lockdep splat that triggered by calling __alloc_pages() when
      CONFIG_PROVE_RCU_LIST=y:
      
       =============================
       WARNING: suspicious RCU usage
       6.6.0+ #980 Not tainted
       -----------------------------
       kernel/locking/lockdep.c:3761 RCU-list traversed in non-reader section!!
       other info that might help us debug this:
       RCU used illegally from offline CPU!
       rcu_scheduler_active = 1, debug_locks = 1
       1 lock held by swapper/1/0:
        #0: 900000000c82ef98 (&pcp->lock){+.+.}-{2:2}, at: get_page_from_freelist+0x894/0x1790
       CPU: 1 PID: 0 Comm: swapper/1 Not tainted 6.6.0+ #980
       Stack : 0000000000000001 9000000004f79508 9000000004893670 9000000100310000
               90000001003137d0 0000000000000000 90000001003137d8 9000000004f79508
               0000000000000000 0000000000000001 0000000000000000 90000000048a3384
               203a656d616e2065 ca43677b3687e616 90000001002c3480 0000000000000008
               000000000000009d 0000000000000000 0000000000000001 80000000ffffe0b8
               000000000000000d 0000000000000033 0000000007ec0000 13bbf50562dad831
               9000000005140748 0000000000000000 9000000004f79508 0000000000000004
               0000000000000000 9000000005140748 90000001002bad40 0000000000000000
               90000001002ba400 0000000000000000 9000000003573ec8 0000000000000000
               00000000000000b0 0000000000000004 0000000000000000 0000000000070000
               ...
       Call Trace:
       [<9000000003573ec8>] show_stack+0x38/0x150
       [<9000000004893670>] dump_stack_lvl+0x74/0xa8
       [<900000000360d2bc>] lockdep_rcu_suspicious+0x14c/0x190
       [<900000000361235c>] __lock_acquire+0xd0c/0x2740
       [<90000000036146f4>] lock_acquire+0x104/0x2c0
       [<90000000048a955c>] _raw_spin_lock_irqsave+0x5c/0x90
       [<900000000381cd5c>] rmqueue_bulk+0x6c/0x950
       [<900000000381fc0c>] get_page_from_freelist+0xd4c/0x1790
       [<9000000003821c6c>] __alloc_pages+0x1bc/0x3e0
       [<9000000003583b40>] tlb_init+0x150/0x2a0
       [<90000000035742a0>] per_cpu_trap_init+0xf0/0x110
       [<90000000035712fc>] cpu_probe+0x3dc/0x7a0
       [<900000000357ed20>] start_secondary+0x40/0xb0
       [<9000000004897138>] smpboot_entry+0x54/0x58
      
      raw_smp_processor_id() is required in order to avoid calling into lockdep
      before RCU has declared the CPU to be watched for readers.
      
      See also commit 29368e09 ("x86/smpboot: Move rcu_cpu_starting() earlier"),
      commit de5d9dae ("s390/smp: move rcu_cpu_starting() earlier") and commit
      99f070b6 ("powerpc/smp: Call rcu_cpu_starting() earlier").
      Signed-off-by: default avatarHuacai Chen <chenhuacai@loongson.cn>
      a2ccf463
    • WANG Rui's avatar
      LoongArch: Relax memory ordering for atomic operations · affef66b
      WANG Rui authored
      This patch relaxes the implementation while satisfying the memory ordering
      requirements for atomic operations, which will help improve performance on
      LA664+.
      
      Unixbench with full threads (8)
                                                 before       after
        Dhrystone 2 using register variables   203910714.2  203909539.8   0.00%
        Double-Precision Whetstone                 37930.9        37931   0.00%
        Execl Throughput                           29431.5      29545.8   0.39%
        File Copy 1024 bufsize 2000 maxblocks    6645759.5      6676320   0.46%
        File Copy 256 bufsize 500 maxblocks      2138772.4    2144182.4   0.25%
        File Copy 4096 bufsize 8000 maxblocks   11640698.4     11602703  -0.33%
        Pipe Throughput                          8849077.7    8917009.4   0.77%
        Pipe-based Context Switching             1255108.5    1287277.3   2.56%
        Process Creation                           50825.9      50442.1  -0.76%
        Shell Scripts (1 concurrent)               25795.8      25942.3   0.57%
        Shell Scripts (8 concurrent)                3812.6       3835.2   0.59%
        System Call Overhead                     9248212.6    9353348.6   1.14%
                                                                        =======
        System Benchmarks Index Score               8076.6       8114.4   0.47%
      Signed-off-by: default avatarWANG Rui <wangrui@loongson.cn>
      Signed-off-by: default avatarHuacai Chen <chenhuacai@loongson.cn>
      affef66b
    • Nathan Chancellor's avatar
      LoongArch: Mark __percpu functions as always inline · 71945968
      Nathan Chancellor authored
      A recent change to the optimization pipeline in LLVM reveals some
      fragility around the inlining of LoongArch's __percpu functions, which
      manifests as a BUILD_BUG() failure:
      
        In file included from kernel/sched/build_policy.c:17:
        In file included from include/linux/sched/cputime.h:5:
        In file included from include/linux/sched/signal.h:5:
        In file included from include/linux/rculist.h:11:
        In file included from include/linux/rcupdate.h:26:
        In file included from include/linux/irqflags.h:18:
        arch/loongarch/include/asm/percpu.h:97:3: error: call to '__compiletime_assert_51' declared with 'error' attribute: BUILD_BUG failed
           97 |                 BUILD_BUG();
              |                 ^
        include/linux/build_bug.h:59:21: note: expanded from macro 'BUILD_BUG'
           59 | #define BUILD_BUG() BUILD_BUG_ON_MSG(1, "BUILD_BUG failed")
              |                     ^
        include/linux/build_bug.h:39:37: note: expanded from macro 'BUILD_BUG_ON_MSG'
           39 | #define BUILD_BUG_ON_MSG(cond, msg) compiletime_assert(!(cond), msg)
              |                                     ^
        include/linux/compiler_types.h:425:2: note: expanded from macro 'compiletime_assert'
          425 |         _compiletime_assert(condition, msg, __compiletime_assert_, __COUNTER__)
              |         ^
        include/linux/compiler_types.h:413:2: note: expanded from macro '_compiletime_assert'
          413 |         __compiletime_assert(condition, msg, prefix, suffix)
              |         ^
        include/linux/compiler_types.h:406:4: note: expanded from macro '__compiletime_assert'
          406 |                         prefix ## suffix();                             \
              |                         ^
        <scratch space>:86:1: note: expanded from here
           86 | __compiletime_assert_51
              | ^
        1 error generated.
      
      If these functions are not inlined (which the compiler is free to do
      even with functions marked with the standard 'inline' keyword), the
      BUILD_BUG() in the default case cannot be eliminated since the compiler
      cannot prove it is never used, resulting in a build failure due to the
      error attribute.
      
      Mark these functions as __always_inline to guarantee inlining so that
      the BUILD_BUG() only triggers when the default case genuinely cannot be
      eliminated due to an unexpected size.
      
      Cc:  <stable@vger.kernel.org>
      Closes: https://github.com/ClangBuiltLinux/linux/issues/1955
      Fixes: 46859ac8 ("LoongArch: Add multi-processor (SMP) support")
      Link: https://github.com/llvm/llvm-project/commit/1a2e77cf9e11dbf56b5720c607313a566eebb16eSuggested-by: default avatarNick Desaulniers <ndesaulniers@google.com>
      Signed-off-by: default avatarNathan Chancellor <nathan@kernel.org>
      Signed-off-by: default avatarHuacai Chen <chenhuacai@loongson.cn>
      71945968
    • WANG Rui's avatar
      LoongArch: Disable module from accessing external data directly · 21eb2bfe
      WANG Rui authored
      The distance between vmlinux and the module is too far so that PC-REL
      cannot be accessed directly, only GOT.
      
      When compiling module with GCC, the option `-mdirect-extern-access` is
      disabled by default. The Clang option `-fdirect-access-external-data` is
      enabled by default, so it needs to be explicitly disabled.
      Signed-off-by: default avatarWANG Rui <wangrui@loongson.cn>
      Signed-off-by: default avatarHuacai Chen <chenhuacai@loongson.cn>
      21eb2bfe
    • Huacai Chen's avatar
      LoongArch: Support PREEMPT_DYNAMIC with static keys · 80c7889d
      Huacai Chen authored
      Since commit 4e90d052 ("riscv: support PREEMPT_DYNAMIC with
      static keys"), the infrastructure is complete and we can simply select
      HAVE_PREEMPT_DYNAMIC_KEY to enable PREEMPT_DYNAMIC on LoongArch because
      we already support static keys.
      Signed-off-by: default avatarHuacai Chen <chenhuacai@loongson.cn>
      80c7889d
  2. 01 Nov, 2023 1 commit
  3. 30 Oct, 2023 1 commit
  4. 28 Oct, 2023 15 commits
  5. 27 Oct, 2023 15 commits
  6. 26 Oct, 2023 2 commits
    • Linus Torvalds's avatar
      Merge tag 'soc-fixes-6.7-3' of git://git.kernel.org/pub/scm/linux/kernel/git/soc/soc · 3a568e3a
      Linus Torvalds authored
      Pull ARM SoC fixes from Arnd Bergmann:
       "A couple of platforms have some last-minute fixes, in particular:
      
         - riscv gets some fixes for noncoherent DMA on the renesas and thead
           platforms and dts fix for SPI on the visionfive 2 board
      
         - Qualcomm Snapdragon gets three dts fixes to address board specific
           regressions on the pmic and gpio nodes
      
         - Rockchip platforms get multiple dts fixes to address issues on the
           recent rk3399 platform as well as the older rk3128 platform that
           apparently regressed a while ago.
      
         - TI OMAP gets some trivial code and dts fixes and a regression fix
           for the omap1 ams-delta modem
      
         - NXP i.MX firmware has one fix for a use-after-free but in its error
           handling"
      
      * tag 'soc-fixes-6.7-3' of git://git.kernel.org/pub/scm/linux/kernel/git/soc/soc: (25 commits)
        soc: renesas: ARCH_R9A07G043 depends on !RISCV_ISA_ZICBOM
        riscv: only select DMA_DIRECT_REMAP from RISCV_ISA_ZICBOM and ERRATA_THEAD_PBMT
        riscv: RISCV_NONSTANDARD_CACHE_OPS shouldn't depend on RISCV_DMA_NONCOHERENT
        riscv: dts: thead: set dma-noncoherent to soc bus
        arm64: dts: rockchip: Fix i2s0 pin conflict on ROCK Pi 4 boards
        arm64: dts: rockchip: Add i2s0-2ch-bus-bclk-off pins to RK3399
        clk: ti: Fix missing omap5 mcbsp functional clock and aliases
        clk: ti: Fix missing omap4 mcbsp functional clock and aliases
        ARM: OMAP1: ams-delta: Fix MODEM initialization failure
        soc: renesas: Make ARCH_R9A07G043 depend on required options
        riscv: dts: starfive: visionfive 2: correct spi's ss pin
        firmware/imx-dsp: Fix use_after_free in imx_dsp_setup_channels()
        ARM: OMAP: timer32K: fix all kernel-doc warnings
        ARM: omap2: fix a debug printk
        ARM: dts: rockchip: Fix timer clocks for RK3128
        ARM: dts: rockchip: Add missing quirk for RK3128's dma engine
        ARM: dts: rockchip: Add missing arm timer interrupt for RK3128
        ARM: dts: rockchip: Fix i2c0 register address for RK3128
        arm64: dts: rockchip: set codec system-clock-fixed on px30-ringneck-haikou
        arm64: dts: rockchip: use codec as clock master on px30-ringneck-haikou
        ...
      3a568e3a
    • Linus Torvalds's avatar
      Merge tag 'net-6.6-rc8' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net · c17cda15
      Linus Torvalds authored
      Pull networking fixes from Paolo Abeni:
       "Including fixes from WiFi and netfilter.
      
        Most regressions addressed here come from quite old versions, with the
        exceptions of the iavf one and the WiFi fixes. No known outstanding
        reports or investigation.
      
        Fixes to fixes:
      
         - eth: iavf: in iavf_down, disable queues when removing the driver
      
        Previous releases - regressions:
      
         - sched: act_ct: additional checks for outdated flows
      
         - tcp: do not leave an empty skb in write queue
      
         - tcp: fix wrong RTO timeout when received SACK reneging
      
         - wifi: cfg80211: pass correct pointer to rdev_inform_bss()
      
         - eth: i40e: sync next_to_clean and next_to_process for programming
           status desc
      
         - eth: iavf: initialize waitqueues before starting watchdog_task
      
        Previous releases - always broken:
      
         - eth: r8169: fix data-races
      
         - eth: igb: fix potential memory leak in igb_add_ethtool_nfc_entry
      
         - eth: r8152: avoid writing garbage to the adapter's registers
      
         - eth: gtp: fix fragmentation needed check with gso"
      
      * tag 'net-6.6-rc8' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net: (43 commits)
        iavf: in iavf_down, disable queues when removing the driver
        vsock/virtio: initialize the_virtio_vsock before using VQs
        net: ipv6: fix typo in comments
        net: ipv4: fix typo in comments
        net/sched: act_ct: additional checks for outdated flows
        netfilter: flowtable: GC pushes back packets to classic path
        i40e: Fix wrong check for I40E_TXR_FLAGS_WB_ON_ITR
        gtp: fix fragmentation needed check with gso
        gtp: uapi: fix GTPA_MAX
        Fix NULL pointer dereference in cn_filter()
        sfc: cleanup and reduce netlink error messages
        net/handshake: fix file ref count in handshake_nl_accept_doit()
        wifi: mac80211: don't drop all unprotected public action frames
        wifi: cfg80211: fix assoc response warning on failed links
        wifi: cfg80211: pass correct pointer to rdev_inform_bss()
        isdn: mISDN: hfcsusb: Spelling fix in comment
        tcp: fix wrong RTO timeout when received SACK reneging
        r8152: Block future register access if register access fails
        r8152: Rename RTL8152_UNPLUG to RTL8152_INACCESSIBLE
        r8152: Check for unplug in r8153b_ups_en() / r8153c_ups_en()
        ...
      c17cda15