1. 14 May, 2024 1 commit
    • Xi Ruoyao's avatar
      LoongArch: Select ARCH_HAS_FAST_MULTIPLIER · 2cce9059
      Xi Ruoyao authored
      LA464 and LA664 can do 32-bit/64-bit integer multiplication with a
      latency of 4 cycles and a throughput of 2 ops per cycle.  It is
      comparable to the mainstream x86 and arm64 cores, so we can select
      ARCH_HAS_FAST_MULTIPLIER like them.
      
      It speeds up __sw_hweight32() in lib/hweight.c for about 14% on LA464
      and 11% on LA664, while __sw_hweight64() for about 30% on LA464 and 33%
      on LA664.
      Signed-off-by: default avatarXi Ruoyao <xry111@xry111.site>
      Signed-off-by: default avatarHuacai Chen <chenhuacai@loongson.cn>
      2cce9059
  2. 12 May, 2024 5 commits
  3. 11 May, 2024 10 commits
  4. 10 May, 2024 20 commits
  5. 09 May, 2024 4 commits