1. 07 Apr, 2021 5 commits
    • Jianyong Wu's avatar
      ptp: arm/arm64: Enable ptp_kvm for arm/arm64 · 300bb1fe
      Jianyong Wu authored
      Currently, there is no mechanism to keep time sync between guest and host
      in arm/arm64 virtualization environment. Time in guest will drift compared
      with host after boot up as they may both use third party time sources
      to correct their time respectively. The time deviation will be in order
      of milliseconds. But in some scenarios,like in cloud environment, we ask
      for higher time precision.
      
      kvm ptp clock, which chooses the host clock source as a reference
      clock to sync time between guest and host, has been adopted by x86
      which takes the time sync order from milliseconds to nanoseconds.
      
      This patch enables kvm ptp clock for arm/arm64 and improves clock sync precision
      significantly.
      
      Test result comparisons between with kvm ptp clock and without it in arm/arm64
      are as follows. This test derived from the result of command 'chronyc
      sources'. we should take more care of the last sample column which shows
      the offset between the local clock and the source at the last measurement.
      
      no kvm ptp in guest:
      MS Name/IP address   Stratum Poll Reach LastRx Last sample
      ========================================================================
      ^* dns1.synet.edu.cn      2   6   377    13  +1040us[+1581us] +/-   21ms
      ^* dns1.synet.edu.cn      2   6   377    21  +1040us[+1581us] +/-   21ms
      ^* dns1.synet.edu.cn      2   6   377    29  +1040us[+1581us] +/-   21ms
      ^* dns1.synet.edu.cn      2   6   377    37  +1040us[+1581us] +/-   21ms
      ^* dns1.synet.edu.cn      2   6   377    45  +1040us[+1581us] +/-   21ms
      ^* dns1.synet.edu.cn      2   6   377    53  +1040us[+1581us] +/-   21ms
      ^* dns1.synet.edu.cn      2   6   377    61  +1040us[+1581us] +/-   21ms
      ^* dns1.synet.edu.cn      2   6   377     4   -130us[ +796us] +/-   21ms
      ^* dns1.synet.edu.cn      2   6   377    12   -130us[ +796us] +/-   21ms
      ^* dns1.synet.edu.cn      2   6   377    20   -130us[ +796us] +/-   21ms
      
      in host:
      MS Name/IP address   Stratum Poll Reach LastRx Last sample
      ========================================================================
      ^* 120.25.115.20          2   7   377    72   -470us[ -603us] +/-   18ms
      ^* 120.25.115.20          2   7   377    92   -470us[ -603us] +/-   18ms
      ^* 120.25.115.20          2   7   377   112   -470us[ -603us] +/-   18ms
      ^* 120.25.115.20          2   7   377     2   +872ns[-6808ns] +/-   17ms
      ^* 120.25.115.20          2   7   377    22   +872ns[-6808ns] +/-   17ms
      ^* 120.25.115.20          2   7   377    43   +872ns[-6808ns] +/-   17ms
      ^* 120.25.115.20          2   7   377    63   +872ns[-6808ns] +/-   17ms
      ^* 120.25.115.20          2   7   377    83   +872ns[-6808ns] +/-   17ms
      ^* 120.25.115.20          2   7   377   103   +872ns[-6808ns] +/-   17ms
      ^* 120.25.115.20          2   7   377   123   +872ns[-6808ns] +/-   17ms
      
      The dns1.synet.edu.cn is the network reference clock for guest and
      120.25.115.20 is the network reference clock for host. we can't get the
      clock error between guest and host directly, but a roughly estimated value
      will be in order of hundreds of us to ms.
      
      with kvm ptp in guest:
      chrony has been disabled in host to remove the disturb by network clock.
      
      MS Name/IP address         Stratum Poll Reach LastRx Last sample
      ========================================================================
      * PHC0                    0   3   377     8     -7ns[   +1ns] +/-    3ns
      * PHC0                    0   3   377     8     +1ns[  +16ns] +/-    3ns
      * PHC0                    0   3   377     6     -4ns[   -0ns] +/-    6ns
      * PHC0                    0   3   377     6     -8ns[  -12ns] +/-    5ns
      * PHC0                    0   3   377     5     +2ns[   +4ns] +/-    4ns
      * PHC0                    0   3   377    13     +2ns[   +4ns] +/-    4ns
      * PHC0                    0   3   377    12     -4ns[   -6ns] +/-    4ns
      * PHC0                    0   3   377    11     -8ns[  -11ns] +/-    6ns
      * PHC0                    0   3   377    10    -14ns[  -20ns] +/-    4ns
      * PHC0                    0   3   377     8     +4ns[   +5ns] +/-    4ns
      
      The PHC0 is the ptp clock which choose the host clock as its source
      clock. So we can see that the clock difference between host and guest
      is in order of ns.
      
      Cc: Mark Rutland <mark.rutland@arm.com>
      Acked-by: default avatarRichard Cochran <richardcochran@gmail.com>
      Signed-off-by: default avatarJianyong Wu <jianyong.wu@arm.com>
      Signed-off-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20201209060932.212364-8-jianyong.wu@arm.com
      300bb1fe
    • Jianyong Wu's avatar
      KVM: arm64: Add support for the KVM PTP service · 3bf72569
      Jianyong Wu authored
      Implement the hypervisor side of the KVM PTP interface.
      
      The service offers wall time and cycle count from host to guest.
      The caller must specify whether they want the host's view of
      either the virtual or physical counter.
      Signed-off-by: default avatarJianyong Wu <jianyong.wu@arm.com>
      Signed-off-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20201209060932.212364-7-jianyong.wu@arm.com
      3bf72569
    • Jianyong Wu's avatar
      clocksource: Add clocksource id for arm arch counter · 100148d0
      Jianyong Wu authored
      Add clocksource id to the ARM generic counter so that it can be easily
      identified from callers such as ptp_kvm.
      
      Cc: Mark Rutland <mark.rutland@arm.com>
      Reviewed-by: default avatarAndre Przywara <andre.przywara@arm.com>
      Signed-off-by: default avatarJianyong Wu <jianyong.wu@arm.com>
      Signed-off-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20201209060932.212364-6-jianyong.wu@arm.com
      100148d0
    • Thomas Gleixner's avatar
      time: Add mechanism to recognize clocksource in time_get_snapshot · b2c67cbe
      Thomas Gleixner authored
      System time snapshots are not conveying information about the current
      clocksource which was used, but callers like the PTP KVM guest
      implementation have the requirement to evaluate the clocksource type to
      select the appropriate mechanism.
      
      Introduce a clocksource id field in struct clocksource which is by default
      set to CSID_GENERIC (0). Clocksource implementations can set that field to
      a value which allows to identify the clocksource.
      
      Store the clocksource id of the current clocksource in the
      system_time_snapshot so callers can evaluate which clocksource was used to
      take the snapshot and act accordingly.
      Signed-off-by: default avatarThomas Gleixner <tglx@linutronix.de>
      Signed-off-by: default avatarJianyong Wu <jianyong.wu@arm.com>
      Signed-off-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20201209060932.212364-5-jianyong.wu@arm.com
      b2c67cbe
    • Jianyong Wu's avatar
      ptp: Reorganize ptp_kvm.c to make it arch-independent · a8cf291b
      Jianyong Wu authored
      Currently, the ptp_kvm module contains a lot of x86-specific code.
      Let's move this code into a new arch-specific file in the same directory,
      and rename the arch-independent file to ptp_kvm_common.c.
      Acked-by: default avatarRichard Cochran <richardcochran@gmail.com>
      Reviewed-by: default avatarAndre Przywara <andre.przywara@arm.com>
      Signed-off-by: default avatarJianyong Wu <jianyong.wu@arm.com>
      Signed-off-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20201209060932.212364-4-jianyong.wu@arm.com
      a8cf291b
  2. 31 Mar, 2021 2 commits
  3. 14 Mar, 2021 14 commits
    • Linus Torvalds's avatar
      Linux 5.12-rc3 · 1e28eed1
      Linus Torvalds authored
      1e28eed1
    • Alexey Dobriyan's avatar
      prctl: fix PR_SET_MM_AUXV kernel stack leak · c995f12a
      Alexey Dobriyan authored
      Doing a
      
      	prctl(PR_SET_MM, PR_SET_MM_AUXV, addr, 1);
      
      will copy 1 byte from userspace to (quite big) on-stack array
      and then stash everything to mm->saved_auxv.
      AT_NULL terminator will be inserted at the very end.
      
      /proc/*/auxv handler will find that AT_NULL terminator
      and copy original stack contents to userspace.
      
      This devious scheme requires CAP_SYS_RESOURCE.
      Signed-off-by: default avatarAlexey Dobriyan <adobriyan@gmail.com>
      Signed-off-by: default avatarLinus Torvalds <torvalds@linux-foundation.org>
      c995f12a
    • Linus Torvalds's avatar
      Merge tag 'irq-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 70404fe3
      Linus Torvalds authored
      Pull irq fixes from Thomas Gleixner:
       "A set of irqchip updates:
      
         - Make the GENERIC_IRQ_MULTI_HANDLER configuration correct
      
         - Add a missing DT compatible string for the Ingenic driver
      
         - Remove the pointless debugfs_file pointer from struct irqdomain"
      
      * tag 'irq-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        irqchip/ingenic: Add support for the JZ4760
        dt-bindings/irq: Add compatible string for the JZ4760B
        irqchip: Do not blindly select CONFIG_GENERIC_IRQ_MULTI_HANDLER
        ARM: ep93xx: Select GENERIC_IRQ_MULTI_HANDLER directly
        irqdomain: Remove debugfs_file from struct irq_domain
      70404fe3
    • Linus Torvalds's avatar
      Merge tag 'timers-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 802b31c0
      Linus Torvalds authored
      Pull timer fix from Thomas Gleixner:
       "A single fix in for hrtimers to prevent an interrupt storm caused by
        the lack of reevaluation of the timers which expire in softirq context
        under certain circumstances, e.g. when the clock was set"
      
      * tag 'timers-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        hrtimer: Update softirq_expires_next correctly after __hrtimer_get_next_event()
      802b31c0
    • Linus Torvalds's avatar
      Merge tag 'sched-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · c72cbc93
      Linus Torvalds authored
      Pull scheduler fixes from Thomas Gleixner:
       "A set of scheduler updates:
      
         - Prevent a NULL pointer dereference in the migration_stop_cpu()
           mechanims
      
         - Prevent self concurrency of affine_move_task()
      
         - Small fixes and cleanups related to task migration/affinity setting
      
         - Ensure that sync_runqueues_membarrier_state() is invoked on the
           current CPU when it is in the cpu mask"
      
      * tag 'sched-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        sched/membarrier: fix missing local execution of ipi_sync_rq_state()
        sched: Simplify set_affinity_pending refcounts
        sched: Fix affine_move_task() self-concurrency
        sched: Optimize migration_cpu_stop()
        sched: Collate affine_move_task() stoppers
        sched: Simplify migration_cpu_stop()
        sched: Fix migration_cpu_stop() requeueing
      c72cbc93
    • Linus Torvalds's avatar
      Merge tag 'objtool-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 19469d2a
      Linus Torvalds authored
      Pull objtool fix from Thomas Gleixner:
       "A single objtool fix to handle the PUSHF/POPF validation correctly for
        the paravirt changes which modified arch_local_irq_restore not to use
        popf"
      
      * tag 'objtool-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        objtool,x86: Fix uaccess PUSHF/POPF validation
      19469d2a
    • Linus Torvalds's avatar
      Merge tag 'locking-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · fa509ff8
      Linus Torvalds authored
      Pull locking fixes from Thomas Gleixner:
       "A couple of locking fixes:
      
         - A fix for the static_call mechanism so it handles unaligned
           addresses correctly.
      
         - Make u64_stats_init() a macro so every instance gets a seperate
           lockdep key.
      
         - Make seqcount_latch_init() a macro as well to preserve the static
           variable which is used for the lockdep key"
      
      * tag 'locking-urgent-2021-03-14' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        seqlock,lockdep: Fix seqcount_latch_init()
        u64_stats,lockdep: Fix u64_stats_init() vs lockdep
        static_call: Fix the module key fixup
      fa509ff8
    • Linus Torvalds's avatar
      Merge tag 'perf_urgent_for_v5.12-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 75013c6c
      Linus Torvalds authored
      Pull perf fixes from Borislav Petkov:
      
       - Make sure PMU internal buffers are flushed for per-CPU events too and
         properly handle PID/TID for large PEBS.
      
       - Handle the case properly when there's no PMU and therefore return an
         empty list of perf MSRs for VMX to switch instead of reading random
         garbage from the stack.
      
      * tag 'perf_urgent_for_v5.12-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/perf: Use RET0 as default for guest_get_msrs to handle "no PMU" case
        perf/x86/intel: Set PERF_ATTACH_SCHED_CB for large PEBS and LBR
        perf/core: Flush PMU internal buffers for per-CPU events
      75013c6c
    • Linus Torvalds's avatar
      Merge tag 'efi-urgent-for-v5.12-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 836d7f05
      Linus Torvalds authored
      Pull EFI fix from Ard Biesheuvel via Borislav Petkov:
       "Fix an oversight in the handling of EFI_RT_PROPERTIES_TABLE, which was
        added v5.10, but failed to take the SetVirtualAddressMap() RT service
        into account"
      
      * tag 'efi-urgent-for-v5.12-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        efi: stub: omit SetVirtualAddressMap() if marked unsupported in RT_PROP table
      836d7f05
    • Linus Torvalds's avatar
      Merge tag 'x86_urgent_for_v5.12_rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 0a7c10df
      Linus Torvalds authored
      Pull x86 fixes from Borislav Petkov:
      
       - A couple of SEV-ES fixes and robustifications: verify usermode stack
         pointer in NMI is not coming from the syscall gap, correctly track
         IRQ states in the #VC handler and access user insn bytes atomically
         in same handler as latter cannot sleep.
      
       - Balance 32-bit fast syscall exit path to do the proper work on exit
         and thus not confuse audit and ptrace frameworks.
      
       - Two fixes for the ORC unwinder going "off the rails" into KASAN
         redzones and when ORC data is missing.
      
      * tag 'x86_urgent_for_v5.12_rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/sev-es: Use __copy_from_user_inatomic()
        x86/sev-es: Correctly track IRQ states in runtime #VC handler
        x86/sev-es: Check regs->sp is trusted before adjusting #VC IST stack
        x86/sev-es: Introduce ip_within_syscall_gap() helper
        x86/entry: Fix entry/exit mismatch on failed fast 32-bit syscalls
        x86/unwind/orc: Silence warnings caused by missing ORC data
        x86/unwind/orc: Disable KASAN checking in the ORC unwinder, part 2
      0a7c10df
    • Linus Torvalds's avatar
      Merge tag 'powerpc-5.12-3' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux · c3c7579f
      Linus Torvalds authored
      Pull powerpc fixes from Michael Ellerman:
       "Some more powerpc fixes for 5.12:
      
         - Fix wrong instruction encoding for lis in ppc_function_entry(),
           which could potentially lead to missed kprobes.
      
         - Fix SET_FULL_REGS on 32-bit and 64e, which prevented ptrace of
           non-volatile GPRs immediately after exec.
      
         - Clean up a missed SRR specifier in the recent interrupt rework.
      
         - Don't treat unrecoverable_exception() as an interrupt handler, it's
           called from other handlers so shouldn't do the interrupt entry/exit
           accounting itself.
      
         - Fix build errors caused by missing declarations for
           [en/dis]able_kernel_vsx().
      
        Thanks to Christophe Leroy, Daniel Axtens, Geert Uytterhoeven, Jiri
        Olsa, Naveen N. Rao, and Nicholas Piggin"
      
      * tag 'powerpc-5.12-3' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux:
        powerpc/traps: unrecoverable_exception() is not an interrupt handler
        powerpc: Fix missing declaration of [en/dis]able_kernel_vsx()
        powerpc/64s/exception: Clean up a missed SRR specifier
        powerpc: Fix inverted SET_FULL_REGS bitop
        powerpc/64s: Use symbolic macros for function entry encoding
        powerpc/64s: Fix instruction encoding for lis in ppc_function_entry()
      c3c7579f
    • Linus Torvalds's avatar
      Merge tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm · 9d0c8e79
      Linus Torvalds authored
      Pull KVM fixes from Paolo Bonzini:
       "More fixes for ARM and x86"
      
      * tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm:
        KVM: LAPIC: Advancing the timer expiration on guest initiated write
        KVM: x86/mmu: Skip !MMU-present SPTEs when removing SP in exclusive mode
        KVM: kvmclock: Fix vCPUs > 64 can't be online/hotpluged
        kvm: x86: annotate RCU pointers
        KVM: arm64: Fix exclusive limit for IPA size
        KVM: arm64: Reject VM creation when the default IPA size is unsupported
        KVM: arm64: Ensure I-cache isolation between vcpus of a same VM
        KVM: arm64: Don't use cbz/adr with external symbols
        KVM: arm64: Fix range alignment when walking page tables
        KVM: arm64: Workaround firmware wrongly advertising GICv2-on-v3 compatibility
        KVM: arm64: Rename __vgic_v3_get_ich_vtr_el2() to __vgic_v3_get_gic_config()
        KVM: arm64: Don't access PMSELR_EL0/PMUSERENR_EL0 when no PMU is available
        KVM: arm64: Turn kvm_arm_support_pmu_v3() into a static key
        KVM: arm64: Fix nVHE hyp panic host context restore
        KVM: arm64: Avoid corrupting vCPU context register in guest exit
        KVM: arm64: nvhe: Save the SPE context early
        kvm: x86: use NULL instead of using plain integer as pointer
        KVM: SVM: Connect 'npt' module param to KVM's internal 'npt_enabled'
        KVM: x86: Ensure deadline timer has truly expired before posting its IRQ
      9d0c8e79
    • Linus Torvalds's avatar
      Merge branch 'akpm' (patches from Andrew) · 50eb842f
      Linus Torvalds authored
      Merge misc fixes from Andrew Morton:
       "28 patches.
      
        Subsystems affected by this series: mm (memblock, pagealloc, hugetlb,
        highmem, kfence, oom-kill, madvise, kasan, userfaultfd, memcg, and
        zram), core-kernel, kconfig, fork, binfmt, MAINTAINERS, kbuild, and
        ia64"
      
      * emailed patches from Andrew Morton <akpm@linux-foundation.org>: (28 commits)
        zram: fix broken page writeback
        zram: fix return value on writeback_store
        mm/memcg: set memcg when splitting page
        mm/memcg: rename mem_cgroup_split_huge_fixup to split_page_memcg and add nr_pages argument
        ia64: fix ptrace(PTRACE_SYSCALL_INFO_EXIT) sign
        ia64: fix ia64_syscall_get_set_arguments() for break-based syscalls
        mm/userfaultfd: fix memory corruption due to writeprotect
        kasan: fix KASAN_STACK dependency for HW_TAGS
        kasan, mm: fix crash with HW_TAGS and DEBUG_PAGEALLOC
        mm/madvise: replace ptrace attach requirement for process_madvise
        include/linux/sched/mm.h: use rcu_dereference in in_vfork()
        kfence: fix reports if constant function prefixes exist
        kfence, slab: fix cache_alloc_debugcheck_after() for bulk allocations
        kfence: fix printk format for ptrdiff_t
        linux/compiler-clang.h: define HAVE_BUILTIN_BSWAP*
        MAINTAINERS: exclude uapi directories in API/ABI section
        binfmt_misc: fix possible deadlock in bm_register_write
        mm/highmem.c: fix zero_user_segments() with start > end
        hugetlb: do early cow when page pinned on src mm
        mm: use is_cow_mapping() across tree where proper
        ...
      50eb842f
    • Thomas Gleixner's avatar
      Merge tag 'irqchip-fixes-5.12-1' of... · b470ebc9
      Thomas Gleixner authored
      Merge tag 'irqchip-fixes-5.12-1' of git://git.kernel.org/pub/scm/linux/kernel/git/maz/arm-platforms into irq/urgent
      
      Pull irqchip fixes from Marc Zyngier:
      
        - More compatible strings for the Ingenic irqchip (introducing the
          JZ4760B SoC)
        - Select GENERIC_IRQ_MULTI_HANDLER on the ARM ep93xx platform
        - Drop all GENERIC_IRQ_MULTI_HANDLER selections from the irqchip
          Kconfig, now relying on the architecture to get it right
        - Drop the debugfs_file field from struct irq_domain, now that
          debugfs can track things on its own
      b470ebc9
  4. 13 Mar, 2021 19 commits