• Laurentiu Tudor's avatar
    powerpc/booke64: wrap tlb lock and search in htw miss with FTR_SMT · 7c480050
    Laurentiu Tudor authored
    Virtualized environments may expose a e6500 dual-threaded core
    as two single-threaded e6500 cores. Take advantage of this
    and get rid of the tlb lock and the trap-causing tlbsx in
    the htw miss handler by guarding with CPU_FTR_SMT, as it's
    already being done in the bolted tlb1 miss handler.
    
    As seen in the results below, measurements done with lmbench
    random memory access latency test running under Freescale's
    Embedded Hypervisor, there is a ~34% improvement.
    
    Memory latencies in nanoseconds - smaller is better
        (WARNING - may not be correct, check graphs)
    ----------------------------------------------------
    Host       Mhz   L1 $   L2 $    Main mem    Rand mem
    ---------  ---   ----   ----    --------    --------
    smt       1665 1.8020   13.2    83.0         1149.7
    nosmt     1665 1.8020   13.2    83.0          758.1
    Signed-off-by: default avatarLaurentiu Tudor <Laurentiu.Tudor@freescale.com>
    Cc: Scott Wood <scottwood@freescale.com>
    [scottwood@freescale.com: commit message tweak]
    Signed-off-by: default avatarScott Wood <scottwood@freescale.com>
    7c480050
tlb_low_64e.S 34.3 KB