• Linus Torvalds's avatar
    Merge tag 'smp-core-2023-06-26' of ssh://gitolite.kernel.org/pub/scm/linux/kernel/git/tip/tip · 9244724f
    Linus Torvalds authored
    Pull SMP updates from Thomas Gleixner:
     "A large update for SMP management:
    
       - Parallel CPU bringup
    
         The reason why people are interested in parallel bringup is to
         shorten the (kexec) reboot time of cloud servers to reduce the
         downtime of the VM tenants.
    
         The current fully serialized bringup does the following per AP:
    
           1) Prepare callbacks (allocate, intialize, create threads)
           2) Kick the AP alive (e.g. INIT/SIPI on x86)
           3) Wait for the AP to report alive state
           4) Let the AP continue through the atomic bringup
           5) Let the AP run the threaded bringup to full online state
    
         There are two significant delays:
    
           #3 The time for an AP to report alive state in start_secondary()
              on x86 has been measured in the range between 350us and 3.5ms
              depending on vendor and CPU type, BIOS microcode size etc.
    
           #4 The atomic bringup does the microcode update. This has been
              measured to take up to ~8ms on the primary threads depending
              on the microcode patch size to apply.
    
         On a two socket SKL server with 56 cores (112 threads) the boot CPU
         spends on current mainline about 800ms busy waiting for the APs to
         come up and apply microcode. That's more than 80% of the actual
         onlining procedure.
    
         This can be reduced significantly by splitting the bringup
         mechanism into two parts:
    
           1) Run the prepare callbacks and kick the AP alive for each AP
              which needs to be brought up.
    
              The APs wake up, do their firmware initialization and run the
              low level kernel startup code including microcode loading in
              parallel up to the first synchronization point. (#1 and #2
              above)
    
           2) Run the rest of the bringup code strictly serialized per CPU
              (#3 - #5 above) as it's done today.
    
              Parallelizing that stage of the CPU bringup might be possible
              in theory, but it's questionable whether required surgery
              would be justified for a pretty small gain.
    
         If the system is large enough the first AP is already waiting at
         the first synchronization point when the boot CPU finished the
         wake-up of the last AP. That reduces the AP bringup time on that
         SKL from ~800ms to ~80ms, i.e. by a factor ~10x.
    
         The actual gain varies wildly depending on the system, CPU,
         microcode patch size and other factors. There are some
         opportunities to reduce the overhead further, but that needs some
         deep surgery in the x86 CPU bringup code.
    
         For now this is only enabled on x86, but the core functionality
         obviously works for all SMP capable architectures.
    
       - Enhancements for SMP function call tracing so it is possible to
         locate the scheduling and the actual execution points. That allows
         to measure IPI delivery time precisely"
    
    * tag 'smp-core-2023-06-26' of ssh://gitolite.kernel.org/pub/scm/linux/kernel/git/tip/tip: (45 commits)
      trace,smp: Add tracepoints for scheduling remotelly called functions
      trace,smp: Add tracepoints around remotelly called functions
      MAINTAINERS: Add CPU HOTPLUG entry
      x86/smpboot: Fix the parallel bringup decision
      x86/realmode: Make stack lock work in trampoline_compat()
      x86/smp: Initialize cpu_primary_thread_mask late
      cpu/hotplug: Fix off by one in cpuhp_bringup_mask()
      x86/apic: Fix use of X{,2}APIC_ENABLE in asm with older binutils
      x86/smpboot/64: Implement arch_cpuhp_init_parallel_bringup() and enable it
      x86/smpboot: Support parallel startup of secondary CPUs
      x86/smpboot: Implement a bit spinlock to protect the realmode stack
      x86/apic: Save the APIC virtual base address
      cpu/hotplug: Allow "parallel" bringup up to CPUHP_BP_KICK_AP_STATE
      x86/apic: Provide cpu_primary_thread mask
      x86/smpboot: Enable split CPU startup
      cpu/hotplug: Provide a split up CPUHP_BRINGUP mechanism
      cpu/hotplug: Reset task stack state in _cpu_up()
      cpu/hotplug: Remove unused state functions
      riscv: Switch to hotplug core state synchronization
      parisc: Switch to hotplug core state synchronization
      ...
    9244724f
Kconfig 45.1 KB