arch/arm64/crypto/aes-modes.S · ed6ed11830a9ded520db31a6e2b69b6b0a1eb0e2 · nexedi / linux

crypto: arm64/aes-modes - get rid of literal load of addend vector · ed6ed118

Ard Biesheuvel authored Aug 23, 2018

Replace the literal load of the addend vector with a sequence that
performs each add individually. This sequence is only 2 instructions
longer than the original, and 2% faster on Cortex-A53.
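
The trade-off described above can be modelled in C. This is only an
illustrative sketch, not the assembly from this commit: the function names
are hypothetical, and the addend values {1, 2, 3, 0} are an assumption made
for the example. It shows that adding a constant table across all lanes and
adding each immediate individually give the same results, so the constant
table (the "literal") is not needed.

```c
#include <stdint.h>

/*
 * Hypothetical model of the "literal load" style: the addends live in a
 * constant table (standing in for the literal pool entry) and are applied
 * to every lane in one pass. The values {1, 2, 3, 0} are assumed here.
 */
void add_with_table(uint32_t base, uint32_t out[3])
{
    static const uint32_t addend[4] = { 1, 2, 3, 0 };
    uint32_t lanes[4];
    int i;

    for (i = 0; i < 4; i++)
        lanes[i] = base + addend[i];    /* wraps modulo 2^32 */

    /* Only the first three lanes are consumed in this model. */
    for (i = 0; i < 3; i++)
        out[i] = lanes[i];
}

/*
 * Hypothetical model of the replacement style: each add is performed
 * individually with an immediate operand, so no constant table is loaded.
 */
void add_individually(uint32_t base, uint32_t out[3])
{
    out[0] = base + 1;
    out[1] = base + 2;
    out[2] = base + 3;
}
```

Both functions fill out[] with the same three values for any base,
including values close to UINT32_MAX where the unsigned additions wrap.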

This is an improvement by itself, but it also works around an issue in
Clang, whose integrated assembler does not fully implement the GNU ARM
asm syntax and does not support the =literal notation for FP registers
(more info at https://bugs.llvm.org/show_bug.cgi?id=38642).

Cc: Nick Desaulniers <ndesaulniers@google.com>
Signed-off-by: Ard Biesheuvel <ard.biesheuvel@linaro.org>
Reviewed-by: Nick Desaulniers <ndesaulniers@google.com>
Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>

aes-modes.S 11.1 KB
