• Chris Metcalf's avatar
    tile: optimize and clean up string functions · c53c70a9
    Chris Metcalf authored
    This change cleans up the string code in a number of ways:
    
    - For memcpy(), fix bug in prefetch and increase distance to 3 lines;
      optimize for unaligned data; do all loads before wh64 to make memcpy
      safe for forward-overlapping calls; etc.  Performance is improved.
    
    - Use new copy_byte() function on tilegx to spread a single byte value
      out into a full word using the shufflebytes instruction.
    
    - Clean up header include ordering to be more canonical, and remove
      spurious #undefs of function names.
    Signed-off-by: default avatarChris Metcalf <cmetcalf@tilera.com>
    c53c70a9
memcpy_64.c 8.35 KB