• Eric Gouriou's avatar
    ext4: optimize memmmove lengths in extent/index insertions · 80e675f9
    Eric Gouriou authored
    ext4_ext_insert_extent() (respectively ext4_ext_insert_index())
    was using EXT_MAX_EXTENT() (resp. EXT_MAX_INDEX()) to determine
    how many entries needed to be moved beyond the insertion point.
    In practice this means that (320 - I) * 24 bytes were memmove()'d
    when I is the insertion point, rather than (#entries - I) * 24 bytes.
    
    This patch uses EXT_LAST_EXTENT() (resp. EXT_LAST_INDEX()) instead
    to only move existing entries. The code flow is also simplified
    slightly to highlight similarities and reduce code duplication in
    the insertion logic.
    
    This patch reduces system CPU consumption by over 25% on a 4kB
    synchronous append DIO write workload when used with the
    pre-2.6.39 x86_64 memmove() implementation. With the much faster
    2.6.39 memmove() implementation we still see a decrease in
    system CPU usage between 2% and 7%.
    
    Note that the ext_debug() output changes with this patch, splitting
    some log information between entries. Users of the ext_debug() output
    should note that the "move %d" units changed from reporting the number
    of bytes moved to reporting the number of entries moved.
    Signed-off-by: default avatarEric Gouriou <egouriou@google.com>
    Signed-off-by: default avatar"Theodore Ts'o" <tytso@mit.edu>
    80e675f9
extents.c 133 KB