• Andrii Nakryiko's avatar
    libbpf: Generate more efficient BPF_CORE_READ code · 40901784
    Andrii Nakryiko authored
    Existing BPF_CORE_READ() macro generates slightly suboptimal code. If
    there are intermediate pointers to be read, initial source pointer is
    going to be assigned into a temporary variable and then temporary
    variable is going to be uniformly used as a "source" pointer for all
    intermediate pointer reads. Schematically (ignoring all the type casts),
    BPF_CORE_READ(s, a, b, c) is expanded into:
    ({
    	const void *__t = src;
    	bpf_probe_read(&__t, sizeof(*__t), &__t->a);
    	bpf_probe_read(&__t, sizeof(*__t), &__t->b);
    
    	typeof(s->a->b->c) __r;
    	bpf_probe_read(&__r, sizeof(*__r), &__t->c);
    })
    
    This initial `__t = src` makes calls more uniform, but causes slightly
    less optimal register usage sometimes when compiled with Clang. This can
    cascase into, e.g., more register spills.
    
    This patch fixes this issue by generating more optimal sequence:
    ({
    	const void *__t;
    	bpf_probe_read(&__t, sizeof(*__t), &src->a); /* <-- src here */
    	bpf_probe_read(&__t, sizeof(*__t), &__t->b);
    
    	typeof(s->a->b->c) __r;
    	bpf_probe_read(&__r, sizeof(*__r), &__t->c);
    })
    
    Fixes: 7db3822a ("libbpf: Add BPF_CORE_READ/BPF_CORE_READ_INTO helpers")
    Signed-off-by: default avatarAndrii Nakryiko <andriin@fb.com>
    Signed-off-by: default avatarDaniel Borkmann <daniel@iogearbox.net>
    Link: https://lore.kernel.org/bpf/20191011023847.275936-1-andriin@fb.com
    40901784
bpf_core_read.h 6.99 KB