* [RFC PATCH v2 0/5] Significantly Improve BTF Type Lookup Performance
@ 2025-10-20 9:39 Donglin Peng
2025-10-20 9:39 ` [RFC PATCH v2 1/5] btf: search local BTF before base BTF Donglin Peng
` (4 more replies)
0 siblings, 5 replies; 32+ messages in thread
From: Donglin Peng @ 2025-10-20 9:39 UTC (permalink / raw)
To: ast; +Cc: linux-kernel, bpf, Donglin Peng
The funcgraph-args feature currently invokes btf_find_by_name_kind()
frequently, which uses a linear search algorithm. With large BTF data
sets like vmlinux (containing over 80,000 named types), this results
in significant performance overhead.
This patch series optimizes btf_find_by_name_kind() by:
1. Sorting BTF types by kind and name to enable binary search
2. Changing the search order to check local BTF before base BTF
Performance testing shows dramatic improvement:
# echo 1 > options/funcgraph-args
# echo function_graph > current_tracer
Before:
# time cat trace | wc -l
124176
real 0m16.154s
user 0m0.000s
sys 0m15.962s
After:
# time cat trace | wc -l
124176
real 0m0.948s
user 0m0.000s
sys 0m0.973s
This represents over 20x performance improvement for BTF type lookups.
Donglin Peng (5):
btf: search local BTF before base BTF
btf: sort BTF types by kind and name to enable binary search
libbpf: check if BTF is sorted to enable binary search
selftests/bpf: add tests for BTF deduplication and sorting
btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME
include/linux/btf.h | 21 ++-
kernel/bpf/Kconfig | 8 +
kernel/bpf/Makefile | 1 +
kernel/bpf/btf.c | 36 ++--
kernel/bpf/btf_sort.c | 2 +
scripts/Makefile.btf | 5 +
tools/lib/bpf/Build | 2 +-
tools/lib/bpf/btf.c | 169 +++++++++++++++---
tools/lib/bpf/btf.h | 2 +
tools/lib/bpf/btf_sort.c | 159 +++++++++++++++++
tools/lib/bpf/libbpf_internal.h | 6 +
tools/testing/selftests/bpf/prog_tests/btf.c | 171 +++++++++++++++++++
12 files changed, 541 insertions(+), 41 deletions(-)
create mode 100644 kernel/bpf/btf_sort.c
create mode 100644 tools/lib/bpf/btf_sort.c
--
2.34.1
^ permalink raw reply [flat|nested] 32+ messages in thread* [RFC PATCH v2 1/5] btf: search local BTF before base BTF 2025-10-20 9:39 [RFC PATCH v2 0/5] Significantly Improve BTF Type Lookup Performance Donglin Peng @ 2025-10-20 9:39 ` Donglin Peng 2025-10-21 1:06 ` Eduard Zingerman 2025-10-20 9:39 ` [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search Donglin Peng ` (3 subsequent siblings) 4 siblings, 1 reply; 32+ messages in thread From: Donglin Peng @ 2025-10-20 9:39 UTC (permalink / raw) To: ast Cc: linux-kernel, bpf, Donglin Peng, Eduard Zingerman, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin Change btf_find_by_name_kind() to search the local BTF first, then fall back to the base BTF. This can skip traversing the large vmlinux BTF when the target type resides in a kernel module's BTF, thereby significantly improving lookup performance. In a test searching for the btf_type of function ext2_new_inode located in the ext2 kernel module: Before: 408631 ns After: 499 ns Performance improvement: ~819x faster Cc: Eduard Zingerman <eddyz87@gmail.com> Cc: Alexei Starovoitov <ast@kernel.org> Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com> Cc: Alan Maguire <alan.maguire@oracle.com> Cc: Song Liu <song@kernel.org> Signed-off-by: pengdonglin <pengdonglin@xiaomi.com> Signed-off-by: Donglin Peng <dolinux.peng@gmail.com> --- include/linux/btf.h | 1 + kernel/bpf/btf.c | 27 ++++++++++++++++++--------- 2 files changed, 19 insertions(+), 9 deletions(-) diff --git a/include/linux/btf.h b/include/linux/btf.h index f06976ffb63f..ddc53a7ac7cd 100644 --- a/include/linux/btf.h +++ b/include/linux/btf.h @@ -220,6 +220,7 @@ bool btf_is_module(const struct btf *btf); bool btf_is_vmlinux(const struct btf *btf); struct module *btf_try_get_module(const struct btf *btf); u32 btf_nr_types(const struct btf *btf); +u32 btf_type_cnt(const struct btf *btf); struct btf *btf_base_btf(const struct btf *btf); bool btf_type_is_i32(const struct btf_type *t); bool btf_type_is_i64(const struct btf_type *t); diff --git a/kernel/bpf/btf.c b/kernel/bpf/btf.c index 0de8fc8a0e0b..c414cf37e1bd 100644 --- a/kernel/bpf/btf.c +++ b/kernel/bpf/btf.c @@ -544,22 +544,31 @@ u32 btf_nr_types(const struct btf *btf) return total; } +u32 btf_type_cnt(const struct btf *btf) +{ + return btf->start_id + btf->nr_types; +} + s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) { const struct btf_type *t; const char *tname; u32 i, total; - total = btf_nr_types(btf); - for (i = 1; i < total; i++) { - t = btf_type_by_id(btf, i); - if (BTF_INFO_KIND(t->info) != kind) - continue; + do { + total = btf_type_cnt(btf); + for (i = btf->start_id; i < total; i++) { + t = btf_type_by_id(btf, i); + if (BTF_INFO_KIND(t->info) != kind) + continue; - tname = btf_name_by_offset(btf, t->name_off); - if (!strcmp(tname, name)) - return i; - } + tname = btf_name_by_offset(btf, t->name_off); + if (!strcmp(tname, name)) + return i; + } + + btf = btf->base_btf; + } while (btf); return -ENOENT; } -- 2.34.1 ^ permalink raw reply related [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 1/5] btf: search local BTF before base BTF 2025-10-20 9:39 ` [RFC PATCH v2 1/5] btf: search local BTF before base BTF Donglin Peng @ 2025-10-21 1:06 ` Eduard Zingerman 2025-10-21 8:31 ` Donglin Peng 0 siblings, 1 reply; 32+ messages in thread From: Eduard Zingerman @ 2025-10-21 1:06 UTC (permalink / raw) To: Donglin Peng, ast Cc: linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > Change btf_find_by_name_kind() to search the local BTF first, > then fall back to the base BTF. This can skip traversing the large > vmlinux BTF when the target type resides in a kernel module's BTF, > thereby significantly improving lookup performance. > > In a test searching for the btf_type of function ext2_new_inode > located in the ext2 kernel module: > > Before: 408631 ns > After: 499 ns > > Performance improvement: ~819x faster [...] > --- The flip makes sense, but are we sure that there are no implicit expectations to return base type in case of a name conflict? E.g. kernel/bpf/btf.c:btf_parse_struct_metas() takes a pointer to `btf` instance and looks for types in alloc_obj_fields array by name (e.g. "bpf_spin_lock"). This will get confused if module declares a type with the same name. Probably not a problem in this particular case, but did you inspect other uses? ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 1/5] btf: search local BTF before base BTF 2025-10-21 1:06 ` Eduard Zingerman @ 2025-10-21 8:31 ` Donglin Peng 2025-10-21 15:56 ` Eduard Zingerman 0 siblings, 1 reply; 32+ messages in thread From: Donglin Peng @ 2025-10-21 8:31 UTC (permalink / raw) To: Eduard Zingerman Cc: ast, linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Tue, Oct 21, 2025 at 9:06 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > Change btf_find_by_name_kind() to search the local BTF first, > > then fall back to the base BTF. This can skip traversing the large > > vmlinux BTF when the target type resides in a kernel module's BTF, > > thereby significantly improving lookup performance. > > > > In a test searching for the btf_type of function ext2_new_inode > > located in the ext2 kernel module: > > > > Before: 408631 ns > > After: 499 ns > > > > Performance improvement: ~819x faster > > [...] > > > --- > > The flip makes sense, but are we sure that there are no implicit > expectations to return base type in case of a name conflict? > > E.g. kernel/bpf/btf.c:btf_parse_struct_metas() takes a pointer to > `btf` instance and looks for types in alloc_obj_fields array by name > (e.g. "bpf_spin_lock"). This will get confused if module declares a > type with the same name. Probably not a problem in this particular > case, but did you inspect other uses? Thank you for pointing this out. I haven't checked other use cases yet, and you're right that this could indeed become a real issue if there are name conflicts between local and base types. It seems difficult to prevent this behavior entirely. Do you have any suggestions on how we should handle such potential conflicts? ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 1/5] btf: search local BTF before base BTF 2025-10-21 8:31 ` Donglin Peng @ 2025-10-21 15:56 ` Eduard Zingerman 2025-10-22 3:08 ` Donglin Peng 0 siblings, 1 reply; 32+ messages in thread From: Eduard Zingerman @ 2025-10-21 15:56 UTC (permalink / raw) To: Donglin Peng Cc: ast, linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Tue, 2025-10-21 at 16:31 +0800, Donglin Peng wrote: > On Tue, Oct 21, 2025 at 9:06 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > > Change btf_find_by_name_kind() to search the local BTF first, > > > then fall back to the base BTF. This can skip traversing the large > > > vmlinux BTF when the target type resides in a kernel module's BTF, > > > thereby significantly improving lookup performance. > > > > > > In a test searching for the btf_type of function ext2_new_inode > > > located in the ext2 kernel module: > > > > > > Before: 408631 ns > > > After: 499 ns > > > > > > Performance improvement: ~819x faster > > > > [...] > > > > > --- > > > > The flip makes sense, but are we sure that there are no implicit > > expectations to return base type in case of a name conflict? > > > > E.g. kernel/bpf/btf.c:btf_parse_struct_metas() takes a pointer to > > `btf` instance and looks for types in alloc_obj_fields array by name > > (e.g. "bpf_spin_lock"). This will get confused if module declares a > > type with the same name. Probably not a problem in this particular > > case, but did you inspect other uses? > > Thank you for pointing this out. I haven't checked other use cases yet, > and you're right that this could indeed become a real issue if there are > name conflicts between local and base types. It seems difficult to > prevent this behavior entirely. Do you have any suggestions on how we > should handle such potential conflicts? What are the results of the above benchmark after sorting? If things are fast enough we might not need to do this change. Otherwise, each call to btf_find_by_name_kind() should be inspected. If necessary new APIs can be added to search only in vmlinux, or only in program, or only in module BTF. ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 1/5] btf: search local BTF before base BTF 2025-10-21 15:56 ` Eduard Zingerman @ 2025-10-22 3:08 ` Donglin Peng 0 siblings, 0 replies; 32+ messages in thread From: Donglin Peng @ 2025-10-22 3:08 UTC (permalink / raw) To: Eduard Zingerman Cc: ast, linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Tue, Oct 21, 2025 at 11:56 PM Eduard Zingerman <eddyz87@gmail.com> wrote: > > On Tue, 2025-10-21 at 16:31 +0800, Donglin Peng wrote: > > On Tue, Oct 21, 2025 at 9:06 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > > > Change btf_find_by_name_kind() to search the local BTF first, > > > > then fall back to the base BTF. This can skip traversing the large > > > > vmlinux BTF when the target type resides in a kernel module's BTF, > > > > thereby significantly improving lookup performance. > > > > > > > > In a test searching for the btf_type of function ext2_new_inode > > > > located in the ext2 kernel module: > > > > > > > > Before: 408631 ns > > > > After: 499 ns > > > > > > > > Performance improvement: ~819x faster > > > > > > [...] > > > > > > > --- > > > > > > The flip makes sense, but are we sure that there are no implicit > > > expectations to return base type in case of a name conflict? > > > > > > E.g. kernel/bpf/btf.c:btf_parse_struct_metas() takes a pointer to > > > `btf` instance and looks for types in alloc_obj_fields array by name > > > (e.g. "bpf_spin_lock"). This will get confused if module declares a > > > type with the same name. Probably not a problem in this particular > > > case, but did you inspect other uses? > > > > Thank you for pointing this out. I haven't checked other use cases yet, > > and you're right that this could indeed become a real issue if there are > > name conflicts between local and base types. It seems difficult to > > prevent this behavior entirely. Do you have any suggestions on how we > > should handle such potential conflicts? > > What are the results of the above benchmark after sorting? > If things are fast enough we might not need to do this change. > Otherwise, each call to btf_find_by_name_kind() should be > inspected. If necessary new APIs can be added to search only in > vmlinux, or only in program, or only in module BTF. Thanks for the suggestion. I'll run some benchmarks and share my findings. ^ permalink raw reply [flat|nested] 32+ messages in thread
* [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-20 9:39 [RFC PATCH v2 0/5] Significantly Improve BTF Type Lookup Performance Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 1/5] btf: search local BTF before base BTF Donglin Peng @ 2025-10-20 9:39 ` Donglin Peng 2025-10-21 17:24 ` Alan Maguire 2025-10-21 18:59 ` Eduard Zingerman 2025-10-20 9:39 ` [RFC PATCH v2 3/5] libbpf: check if BTF is sorted " Donglin Peng ` (2 subsequent siblings) 4 siblings, 2 replies; 32+ messages in thread From: Donglin Peng @ 2025-10-20 9:39 UTC (permalink / raw) To: ast Cc: linux-kernel, bpf, Donglin Peng, Eduard Zingerman, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin This patch implements sorting of BTF types by their kind and name, enabling the use of binary search for type lookups. To share logic between kernel and libbpf, a new btf_sort.c file is introduced containing common sorting functionality. The sorting is performed during btf__dedup() when the new sort_by_kind_name option in btf_dedup_opts is enabled. For vmlinux and kernel module BTF, btf_check_sorted() verifies whether the types are sorted and binary search can be used. Cc: Eduard Zingerman <eddyz87@gmail.com> Cc: Alexei Starovoitov <ast@kernel.org> Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com> Cc: Alan Maguire <alan.maguire@oracle.com> Cc: Song Liu <song@kernel.org> Co-developed-by: Eduard Zingerman <eddyz87@gmail.com> Signed-off-by: pengdonglin <pengdonglin@xiaomi.com> Signed-off-by: Donglin Peng <dolinux.peng@gmail.com> --- include/linux/btf.h | 20 +++- kernel/bpf/Makefile | 1 + kernel/bpf/btf.c | 39 ++++---- kernel/bpf/btf_sort.c | 2 + tools/lib/bpf/Build | 2 +- tools/lib/bpf/btf.c | 163 +++++++++++++++++++++++++++----- tools/lib/bpf/btf.h | 2 + tools/lib/bpf/btf_sort.c | 159 +++++++++++++++++++++++++++++++ tools/lib/bpf/libbpf_internal.h | 6 ++ 9 files changed, 347 insertions(+), 47 deletions(-) create mode 100644 kernel/bpf/btf_sort.c create mode 100644 tools/lib/bpf/btf_sort.c diff --git a/include/linux/btf.h b/include/linux/btf.h index ddc53a7ac7cd..c6fe5e689ab9 100644 --- a/include/linux/btf.h +++ b/include/linux/btf.h @@ -221,7 +221,10 @@ bool btf_is_vmlinux(const struct btf *btf); struct module *btf_try_get_module(const struct btf *btf); u32 btf_nr_types(const struct btf *btf); u32 btf_type_cnt(const struct btf *btf); -struct btf *btf_base_btf(const struct btf *btf); +u32 btf_start_id(const struct btf *btf); +u32 btf_nr_sorted_types(const struct btf *btf); +void btf_set_nr_sorted_types(struct btf *btf, u32 nr); +struct btf* btf_base_btf(const struct btf *btf); bool btf_type_is_i32(const struct btf_type *t); bool btf_type_is_i64(const struct btf_type *t); bool btf_type_is_primitive(const struct btf_type *t); @@ -595,6 +598,10 @@ int get_kern_ctx_btf_id(struct bpf_verifier_log *log, enum bpf_prog_type prog_ty bool btf_types_are_same(const struct btf *btf1, u32 id1, const struct btf *btf2, u32 id2); int btf_check_iter_arg(struct btf *btf, const struct btf_type *func, int arg_idx); +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv); +s32 find_btf_by_name_kind(const struct btf *btf, int start_id, const char *type_name, + u32 kind); +void btf_check_sorted(struct btf *btf, int start_id); static inline bool btf_type_is_struct_ptr(struct btf *btf, const struct btf_type *t) { @@ -683,5 +690,16 @@ static inline int btf_check_iter_arg(struct btf *btf, const struct btf_type *fun { return -EOPNOTSUPP; } +static inline int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) +{ + return -EOPNOTSUPP; +} +static inline s32 find_btf_by_name_kind(const struct btf *btf, int start_id, const char *type_name, + u32 kind) +{ + return -EOPNOTSUPP; +} +static inline void btf_check_sorted(struct btf *btf, int start_id); +{} #endif #endif diff --git a/kernel/bpf/Makefile b/kernel/bpf/Makefile index 7fd0badfacb1..c9d8f986c7e1 100644 --- a/kernel/bpf/Makefile +++ b/kernel/bpf/Makefile @@ -56,6 +56,7 @@ obj-$(CONFIG_BPF_SYSCALL) += kmem_cache_iter.o ifeq ($(CONFIG_DMA_SHARED_BUFFER),y) obj-$(CONFIG_BPF_SYSCALL) += dmabuf_iter.o endif +obj-$(CONFIG_BPF_SYSCALL) += btf_sort.o CFLAGS_REMOVE_percpu_freelist.o = $(CC_FLAGS_FTRACE) CFLAGS_REMOVE_bpf_lru_list.o = $(CC_FLAGS_FTRACE) diff --git a/kernel/bpf/btf.c b/kernel/bpf/btf.c index c414cf37e1bd..11b05f4eb07d 100644 --- a/kernel/bpf/btf.c +++ b/kernel/bpf/btf.c @@ -259,6 +259,7 @@ struct btf { void *nohdr_data; struct btf_header hdr; u32 nr_types; /* includes VOID for base BTF */ + u32 nr_sorted_types; u32 types_size; u32 data_size; refcount_t refcnt; @@ -544,33 +545,29 @@ u32 btf_nr_types(const struct btf *btf) return total; } -u32 btf_type_cnt(const struct btf *btf) +u32 btf_start_id(const struct btf *btf) { - return btf->start_id + btf->nr_types; + return btf->start_id; } -s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) +u32 btf_nr_sorted_types(const struct btf *btf) { - const struct btf_type *t; - const char *tname; - u32 i, total; - - do { - total = btf_type_cnt(btf); - for (i = btf->start_id; i < total; i++) { - t = btf_type_by_id(btf, i); - if (BTF_INFO_KIND(t->info) != kind) - continue; + return btf->nr_sorted_types; +} - tname = btf_name_by_offset(btf, t->name_off); - if (!strcmp(tname, name)) - return i; - } +void btf_set_nr_sorted_types(struct btf *btf, u32 nr) +{ + btf->nr_sorted_types = nr; +} - btf = btf->base_btf; - } while (btf); +u32 btf_type_cnt(const struct btf *btf) +{ + return btf->start_id + btf->nr_types; +} - return -ENOENT; +s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) +{ + return find_btf_by_name_kind(btf, 1, name, kind); } s32 bpf_find_btf_id(const char *name, u32 kind, struct btf **btf_p) @@ -6239,6 +6236,7 @@ static struct btf *btf_parse_base(struct btf_verifier_env *env, const char *name if (err) goto errout; + btf_check_sorted(btf, 1); refcount_set(&btf->refcnt, 1); return btf; @@ -6371,6 +6369,7 @@ static struct btf *btf_parse_module(const char *module_name, const void *data, base_btf = vmlinux_btf; } + btf_check_sorted(btf, btf_nr_types(base_btf)); btf_verifier_env_free(env); refcount_set(&btf->refcnt, 1); return btf; diff --git a/kernel/bpf/btf_sort.c b/kernel/bpf/btf_sort.c new file mode 100644 index 000000000000..898f9189952c --- /dev/null +++ b/kernel/bpf/btf_sort.c @@ -0,0 +1,2 @@ +// SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause) +#include "../../tools/lib/bpf/btf_sort.c" diff --git a/tools/lib/bpf/Build b/tools/lib/bpf/Build index c80204bb72a2..ed7c2506e22d 100644 --- a/tools/lib/bpf/Build +++ b/tools/lib/bpf/Build @@ -1,4 +1,4 @@ libbpf-y := libbpf.o bpf.o nlattr.o btf.o libbpf_utils.o \ netlink.o bpf_prog_linfo.o libbpf_probes.o hashmap.o \ btf_dump.o ringbuf.o strset.o linker.o gen_loader.o relo_core.o \ - usdt.o zip.o elf.o features.o btf_iter.o btf_relocate.o + usdt.o zip.o elf.o features.o btf_iter.o btf_relocate.o btf_sort.o diff --git a/tools/lib/bpf/btf.c b/tools/lib/bpf/btf.c index 18907f0fcf9f..87e47f0b78ba 100644 --- a/tools/lib/bpf/btf.c +++ b/tools/lib/bpf/btf.c @@ -1,6 +1,9 @@ // SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause) /* Copyright (c) 2018 Facebook */ +#ifndef _GNU_SOURCE +#define _GNU_SOURCE +#endif #include <byteswap.h> #include <endian.h> #include <stdio.h> @@ -92,6 +95,9 @@ struct btf { * - for split BTF counts number of types added on top of base BTF. */ __u32 nr_types; + /* number of named types in this BTF instance for binary search + */ + __u32 nr_sorted_types; /* if not NULL, points to the base BTF on top of which the current * split BTF is based */ @@ -619,6 +625,21 @@ __u32 btf__type_cnt(const struct btf *btf) return btf->start_id + btf->nr_types; } +__u32 btf__start_id(const struct btf *btf) +{ + return btf->start_id; +} + +__u32 btf__nr_sorted_types(const struct btf *btf) +{ + return btf->nr_sorted_types; +} + +void btf__set_nr_sorted_types(struct btf *btf, __u32 nr) +{ + btf->nr_sorted_types = nr; +} + const struct btf *btf__base_btf(const struct btf *btf) { return btf->base_btf; @@ -915,38 +936,16 @@ __s32 btf__find_by_name(const struct btf *btf, const char *type_name) return libbpf_err(-ENOENT); } -static __s32 btf_find_by_name_kind(const struct btf *btf, int start_id, - const char *type_name, __u32 kind) -{ - __u32 i, nr_types = btf__type_cnt(btf); - - if (kind == BTF_KIND_UNKN || !strcmp(type_name, "void")) - return 0; - - for (i = start_id; i < nr_types; i++) { - const struct btf_type *t = btf__type_by_id(btf, i); - const char *name; - - if (btf_kind(t) != kind) - continue; - name = btf__name_by_offset(btf, t->name_off); - if (name && !strcmp(type_name, name)) - return i; - } - - return libbpf_err(-ENOENT); -} - __s32 btf__find_by_name_kind_own(const struct btf *btf, const char *type_name, __u32 kind) { - return btf_find_by_name_kind(btf, btf->start_id, type_name, kind); + return find_btf_by_name_kind(btf, btf->start_id, type_name, kind); } __s32 btf__find_by_name_kind(const struct btf *btf, const char *type_name, __u32 kind) { - return btf_find_by_name_kind(btf, 1, type_name, kind); + return find_btf_by_name_kind(btf, 1, type_name, kind); } static bool btf_is_modifiable(const struct btf *btf) @@ -3411,6 +3410,7 @@ static int btf_dedup_struct_types(struct btf_dedup *d); static int btf_dedup_ref_types(struct btf_dedup *d); static int btf_dedup_resolve_fwds(struct btf_dedup *d); static int btf_dedup_compact_types(struct btf_dedup *d); +static int btf_dedup_compact_and_sort_types(struct btf_dedup *d); static int btf_dedup_remap_types(struct btf_dedup *d); /* @@ -3600,7 +3600,7 @@ int btf__dedup(struct btf *btf, const struct btf_dedup_opts *opts) pr_debug("btf_dedup_ref_types failed: %s\n", errstr(err)); goto done; } - err = btf_dedup_compact_types(d); + err = btf_dedup_compact_and_sort_types(d); if (err < 0) { pr_debug("btf_dedup_compact_types failed: %s\n", errstr(err)); goto done; @@ -3649,6 +3649,8 @@ struct btf_dedup { * BTF is considered to be immutable. */ bool hypot_adjust_canon; + /* Sort btf_types by kind and time */ + bool sort_by_kind_name; /* Various option modifying behavior of algorithm */ struct btf_dedup_opts opts; /* temporary strings deduplication state */ @@ -3741,6 +3743,7 @@ static struct btf_dedup *btf_dedup_new(struct btf *btf, const struct btf_dedup_o d->btf = btf; d->btf_ext = OPTS_GET(opts, btf_ext, NULL); + d->sort_by_kind_name = OPTS_GET(opts, sort_by_kind_name, false); d->dedup_table = hashmap__new(hash_fn, btf_dedup_equal_fn, NULL); if (IS_ERR(d->dedup_table)) { @@ -5288,6 +5291,116 @@ static int btf_dedup_compact_types(struct btf_dedup *d) return 0; } +static __u32 *get_sorted_canon_types(struct btf_dedup *d, __u32 *cnt) +{ + int i, j, id, types_cnt = 0; + __u32 *sorted_ids; + + for (i = 0, id = d->btf->start_id; i < d->btf->nr_types; i++, id++) + if (d->map[id] == id) + ++types_cnt; + + sorted_ids = calloc(types_cnt, sizeof(*sorted_ids)); + if (!sorted_ids) + return NULL; + + for (j = 0, i = 0, id = d->btf->start_id; i < d->btf->nr_types; i++, id++) + if (d->map[id] == id) + sorted_ids[j++] = id; + + qsort_r(sorted_ids, types_cnt, sizeof(*sorted_ids), + btf_compare_type_kinds_names, d->btf); + + *cnt = types_cnt; + + return sorted_ids; +} + +/* + * Compact and sort BTF types. + * + * Similar to btf_dedup_compact_types, but additionally sorts the btf_types. + */ +static int btf__dedup_compact_and_sort_types(struct btf_dedup *d) +{ + __u32 canon_types_cnt = 0, canon_types_len = 0; + __u32 *new_offs = NULL, *canon_types = NULL; + const struct btf_type *t; + void *p, *new_types = NULL; + int i, id, len, err; + + /* we are going to reuse hypot_map to store compaction remapping */ + d->hypot_map[0] = 0; + /* base BTF types are not renumbered */ + for (id = 1; id < d->btf->start_id; id++) + d->hypot_map[id] = id; + for (i = 0, id = d->btf->start_id; i < d->btf->nr_types; i++, id++) + d->hypot_map[id] = BTF_UNPROCESSED_ID; + + canon_types = get_sorted_canon_types(d, &canon_types_cnt); + if (!canon_types) { + err = -ENOMEM; + goto out_err; + } + + for (i = 0; i < canon_types_cnt; i++) { + id = canon_types[i]; + t = btf__type_by_id(d->btf, id); + len = btf_type_size(t); + if (len < 0) { + err = len; + goto out_err; + } + canon_types_len += len; + } + + new_offs = calloc(canon_types_cnt, sizeof(*new_offs)); + new_types = calloc(canon_types_len, 1); + if (!new_types || !new_offs) { + err = -ENOMEM; + goto out_err; + } + + p = new_types; + + for (i = 0; i < canon_types_cnt; i++) { + id = canon_types[i]; + t = btf__type_by_id(d->btf, id); + len = btf_type_size(t); + memcpy(p, t, len); + d->hypot_map[id] = d->btf->start_id + i; + new_offs[i] = p - new_types; + p += len; + } + + /* shrink struct btf's internal types index and update btf_header */ + free(d->btf->types_data); + free(d->btf->type_offs); + d->btf->types_data = new_types; + d->btf->type_offs = new_offs; + d->btf->types_data_cap = canon_types_len; + d->btf->type_offs_cap = canon_types_cnt; + d->btf->nr_types = canon_types_cnt; + d->btf->hdr->type_len = canon_types_len; + d->btf->hdr->str_off = d->btf->hdr->type_len; + d->btf->raw_size = d->btf->hdr->hdr_len + d->btf->hdr->type_len + d->btf->hdr->str_len; + free(canon_types); + return 0; + +out_err: + free(canon_types); + free(new_types); + free(new_offs); + return err; +} + +static int btf_dedup_compact_and_sort_types(struct btf_dedup *d) +{ + if (d->sort_by_kind_name) + return btf__dedup_compact_and_sort_types(d); + return btf_dedup_compact_types(d); +} + /* * Figure out final (deduplicated and compacted) type ID for provided original * `type_id` by first resolving it into corresponding canonical type ID and diff --git a/tools/lib/bpf/btf.h b/tools/lib/bpf/btf.h index ccfd905f03df..9a7cfe6b4bb3 100644 --- a/tools/lib/bpf/btf.h +++ b/tools/lib/bpf/btf.h @@ -251,6 +251,8 @@ struct btf_dedup_opts { size_t sz; /* optional .BTF.ext info to dedup along the main BTF info */ struct btf_ext *btf_ext; + /* Sort btf_types by kind and name */ + bool sort_by_kind_name; /* force hash collisions (used for testing) */ bool force_collisions; size_t :0; diff --git a/tools/lib/bpf/btf_sort.c b/tools/lib/bpf/btf_sort.c new file mode 100644 index 000000000000..2ad4a56f1c08 --- /dev/null +++ b/tools/lib/bpf/btf_sort.c @@ -0,0 +1,159 @@ +// SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause) + +#ifndef _GNU_SOURCE +#define _GNU_SOURCE +#endif + +#ifdef __KERNEL__ +#include <linux/bpf.h> +#include <linux/btf.h> +#include <linux/string.h> + +#define btf_type_by_id (struct btf_type *)btf_type_by_id +#define btf__str_by_offset btf_str_by_offset +#define btf__name_by_offset btf_name_by_offset +#define btf__type_cnt btf_nr_types +#define btf__start_id btf_start_id +#define btf__nr_sorted_types btf_nr_sorted_types +#define btf__set_nr_sorted_types btf_set_nr_sorted_types +#define btf__base_btf btf_base_btf +#define libbpf_err(x) x + +#else + +#include "btf.h" +#include "bpf.h" +#include "libbpf.h" +#include "libbpf_internal.h" + +#endif /* __KERNEL__ */ + +/* Skip the sorted check if number of btf_types is below threshold + */ +#define BTF_CHECK_SORT_THRESHOLD 8 + +struct btf; + +static int cmp_btf_kind_name(int ka, const char *na, int kb, const char *nb) +{ + return (ka - kb) ?: strcmp(na, nb); +} + +/* + * Sort BTF types by kind and name in ascending order, placing named types + * before anonymous ones. + */ +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) +{ + struct btf *btf = (struct btf *)priv; + struct btf_type *ta = btf_type_by_id(btf, *(__u32 *)a); + struct btf_type *tb = btf_type_by_id(btf, *(__u32 *)b); + const char *na, *nb; + int ka, kb; + + /* ta w/o name is greater than tb */ + if (!ta->name_off && tb->name_off) + return 1; + /* tb w/o name is smaller than ta */ + if (ta->name_off && !tb->name_off) + return -1; + + ka = btf_kind(ta); + kb = btf_kind(tb); + na = btf__str_by_offset(btf, ta->name_off); + nb = btf__str_by_offset(btf, tb->name_off); + + return cmp_btf_kind_name(ka, na, kb, nb); +} + +__s32 find_btf_by_name_kind(const struct btf *btf, int start_id, + const char *type_name, __u32 kind) +{ + const struct btf_type *t; + const char *tname; + __u32 i, total; + + if (kind == BTF_KIND_UNKN || !strcmp(type_name, "void")) + return 0; + + do { + if (btf__nr_sorted_types(btf)) { + /* binary search */ + __s32 start, end, mid, found = -1; + int ret; + + start = btf__start_id(btf); + end = start + btf__nr_sorted_types(btf) - 1; + /* found the leftmost btf_type that matches */ + while(start <= end) { + mid = start + (end - start) / 2; + t = btf_type_by_id(btf, mid); + tname = btf__name_by_offset(btf, t->name_off); + ret = cmp_btf_kind_name(BTF_INFO_KIND(t->info), tname, + kind, type_name); + if (ret == 0) + found = mid; + if (ret < 0) + start = mid + 1; + else if (ret >= 0) + end = mid - 1; + } + + if (found != -1) + return found; + } else { + /* linear search */ + total = btf__type_cnt(btf); + for (i = btf__start_id(btf); i < total; i++) { + t = btf_type_by_id(btf, i); + if (btf_kind(t) != kind) + continue; + + tname = btf__name_by_offset(btf, t->name_off); + if (tname && !strcmp(tname, type_name)) + return i; + } + } + + btf = btf__base_btf(btf); + } while (btf && btf__start_id(btf) >= start_id); + + return libbpf_err(-ENOENT); +} + +void btf_check_sorted(struct btf *btf, int start_id) +{ + const struct btf_type *t; + int i, n, nr_sorted_types; + + n = btf__type_cnt(btf); + if ((n - start_id) < BTF_CHECK_SORT_THRESHOLD) + return; + + n--; + nr_sorted_types = 0; + for (i = start_id; i < n; i++) { + int k = i + 1; + + t = btf_type_by_id(btf, i); + if (!btf__str_by_offset(btf, t->name_off)) + return; + + t = btf_type_by_id(btf, k); + if (!btf__str_by_offset(btf, t->name_off)) + return; + + if (btf_compare_type_kinds_names(&i, &k, btf) > 0) + return; + + if (t->name_off) + nr_sorted_types++; + } + + t = btf_type_by_id(btf, start_id); + if (t->name_off) + nr_sorted_types++; + if (nr_sorted_types >= BTF_CHECK_SORT_THRESHOLD) + btf__set_nr_sorted_types(btf, nr_sorted_types); +} + diff --git a/tools/lib/bpf/libbpf_internal.h b/tools/lib/bpf/libbpf_internal.h index 35b2527bedec..f71f3e70c51c 100644 --- a/tools/lib/bpf/libbpf_internal.h +++ b/tools/lib/bpf/libbpf_internal.h @@ -248,6 +248,12 @@ const struct btf_type *skip_mods_and_typedefs(const struct btf *btf, __u32 id, _ const struct btf_header *btf_header(const struct btf *btf); void btf_set_base_btf(struct btf *btf, const struct btf *base_btf); int btf_relocate(struct btf *btf, const struct btf *base_btf, __u32 **id_map); +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv); +__s32 find_btf_by_name_kind(const struct btf *btf, int start_id, const char *type_name, __u32 kind); +void btf_check_sorted(struct btf *btf, int start_id); +__u32 btf__start_id(const struct btf *btf); +__u32 btf__nr_sorted_types(const struct btf *btf); +void btf__set_nr_sorted_types(struct btf *btf, __u32 nr); static inline enum btf_func_linkage btf_func_linkage(const struct btf_type *t) { -- 2.34.1 ^ permalink raw reply related [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-20 9:39 ` [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search Donglin Peng @ 2025-10-21 17:24 ` Alan Maguire 2025-10-22 4:47 ` Donglin Peng 2025-10-21 18:59 ` Eduard Zingerman 1 sibling, 1 reply; 32+ messages in thread From: Alan Maguire @ 2025-10-21 17:24 UTC (permalink / raw) To: Donglin Peng, ast Cc: linux-kernel, bpf, Eduard Zingerman, Andrii Nakryiko, Song Liu, pengdonglin On 20/10/2025 10:39, Donglin Peng wrote: > This patch implements sorting of BTF types by their kind and name, > enabling the use of binary search for type lookups. > > To share logic between kernel and libbpf, a new btf_sort.c file is > introduced containing common sorting functionality. > > The sorting is performed during btf__dedup() when the new > sort_by_kind_name option in btf_dedup_opts is enabled. > > For vmlinux and kernel module BTF, btf_check_sorted() verifies > whether the types are sorted and binary search can be used. > this looks great! one thing that might make libbpf integration easier though (and Andrii can probably best answer if it's actually a problem) - would it be possible to separate the libbpf and non-libbpf parts here; i.e have a patch that adds the libbpf stuff first, then re-use it in a separate patch for kernel sorting? > Cc: Eduard Zingerman <eddyz87@gmail.com> > Cc: Alexei Starovoitov <ast@kernel.org> > Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com> > Cc: Alan Maguire <alan.maguire@oracle.com> > Cc: Song Liu <song@kernel.org> > Co-developed-by: Eduard Zingerman <eddyz87@gmail.com> > Signed-off-by: pengdonglin <pengdonglin@xiaomi.com> > Signed-off-by: Donglin Peng <dolinux.peng@gmail.com> > --- > include/linux/btf.h | 20 +++- > kernel/bpf/Makefile | 1 + > kernel/bpf/btf.c | 39 ++++---- > kernel/bpf/btf_sort.c | 2 + > tools/lib/bpf/Build | 2 +- > tools/lib/bpf/btf.c | 163 +++++++++++++++++++++++++++----- > tools/lib/bpf/btf.h | 2 + > tools/lib/bpf/btf_sort.c | 159 +++++++++++++++++++++++++++++++ > tools/lib/bpf/libbpf_internal.h | 6 ++ > 9 files changed, 347 insertions(+), 47 deletions(-) > create mode 100644 kernel/bpf/btf_sort.c > create mode 100644 tools/lib/bpf/btf_sort.c > > diff --git a/include/linux/btf.h b/include/linux/btf.h > index ddc53a7ac7cd..c6fe5e689ab9 100644 > --- a/include/linux/btf.h > +++ b/include/linux/btf.h > @@ -221,7 +221,10 @@ bool btf_is_vmlinux(const struct btf *btf); > struct module *btf_try_get_module(const struct btf *btf); > u32 btf_nr_types(const struct btf *btf); > u32 btf_type_cnt(const struct btf *btf); > -struct btf *btf_base_btf(const struct btf *btf); > +u32 btf_start_id(const struct btf *btf); > +u32 btf_nr_sorted_types(const struct btf *btf); > +void btf_set_nr_sorted_types(struct btf *btf, u32 nr); > +struct btf* btf_base_btf(const struct btf *btf); > bool btf_type_is_i32(const struct btf_type *t); > bool btf_type_is_i64(const struct btf_type *t); > bool btf_type_is_primitive(const struct btf_type *t); > @@ -595,6 +598,10 @@ int get_kern_ctx_btf_id(struct bpf_verifier_log *log, enum bpf_prog_type prog_ty > bool btf_types_are_same(const struct btf *btf1, u32 id1, > const struct btf *btf2, u32 id2); > int btf_check_iter_arg(struct btf *btf, const struct btf_type *func, int arg_idx); > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv); > +s32 find_btf_by_name_kind(const struct btf *btf, int start_id, const char *type_name, > + u32 kind); > +void btf_check_sorted(struct btf *btf, int start_id); > > static inline bool btf_type_is_struct_ptr(struct btf *btf, const struct btf_type *t) > { > @@ -683,5 +690,16 @@ static inline int btf_check_iter_arg(struct btf *btf, const struct btf_type *fun > { > return -EOPNOTSUPP; > } > +static inline int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) > +{ > + return -EOPNOTSUPP; > +} > +static inline s32 find_btf_by_name_kind(const struct btf *btf, int start_id, const char *type_name, > + u32 kind) > +{ > + return -EOPNOTSUPP; > +} > +static inline void btf_check_sorted(struct btf *btf, int start_id); > +{} > #endif > #endif > diff --git a/kernel/bpf/Makefile b/kernel/bpf/Makefile > index 7fd0badfacb1..c9d8f986c7e1 100644 > --- a/kernel/bpf/Makefile > +++ b/kernel/bpf/Makefile > @@ -56,6 +56,7 @@ obj-$(CONFIG_BPF_SYSCALL) += kmem_cache_iter.o > ifeq ($(CONFIG_DMA_SHARED_BUFFER),y) > obj-$(CONFIG_BPF_SYSCALL) += dmabuf_iter.o > endif > +obj-$(CONFIG_BPF_SYSCALL) += btf_sort.o > > CFLAGS_REMOVE_percpu_freelist.o = $(CC_FLAGS_FTRACE) > CFLAGS_REMOVE_bpf_lru_list.o = $(CC_FLAGS_FTRACE) > diff --git a/kernel/bpf/btf.c b/kernel/bpf/btf.c > index c414cf37e1bd..11b05f4eb07d 100644 > --- a/kernel/bpf/btf.c > +++ b/kernel/bpf/btf.c > @@ -259,6 +259,7 @@ struct btf { > void *nohdr_data; > struct btf_header hdr; > u32 nr_types; /* includes VOID for base BTF */ > + u32 nr_sorted_types; > u32 types_size; > u32 data_size; > refcount_t refcnt; > @@ -544,33 +545,29 @@ u32 btf_nr_types(const struct btf *btf) > return total; > } > > -u32 btf_type_cnt(const struct btf *btf) > +u32 btf_start_id(const struct btf *btf) > { > - return btf->start_id + btf->nr_types; > + return btf->start_id; > } > > -s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > +u32 btf_nr_sorted_types(const struct btf *btf) > { > - const struct btf_type *t; > - const char *tname; > - u32 i, total; > - > - do { > - total = btf_type_cnt(btf); > - for (i = btf->start_id; i < total; i++) { > - t = btf_type_by_id(btf, i); > - if (BTF_INFO_KIND(t->info) != kind) > - continue; > + return btf->nr_sorted_types; > +} > > - tname = btf_name_by_offset(btf, t->name_off); > - if (!strcmp(tname, name)) > - return i; > - } > +void btf_set_nr_sorted_types(struct btf *btf, u32 nr) > +{ > + btf->nr_sorted_types = nr; > +} > > - btf = btf->base_btf; > - } while (btf); > +u32 btf_type_cnt(const struct btf *btf) > +{ > + return btf->start_id + btf->nr_types; > +} > > - return -ENOENT; > +s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > +{ > + return find_btf_by_name_kind(btf, 1, name, kind); > } > > s32 bpf_find_btf_id(const char *name, u32 kind, struct btf **btf_p) > @@ -6239,6 +6236,7 @@ static struct btf *btf_parse_base(struct btf_verifier_env *env, const char *name > if (err) > goto errout; > > + btf_check_sorted(btf, 1); > refcount_set(&btf->refcnt, 1); > > return btf; > @@ -6371,6 +6369,7 @@ static struct btf *btf_parse_module(const char *module_name, const void *data, > base_btf = vmlinux_btf; > } > > + btf_check_sorted(btf, btf_nr_types(base_btf)); > btf_verifier_env_free(env); > refcount_set(&btf->refcnt, 1); > return btf; > diff --git a/kernel/bpf/btf_sort.c b/kernel/bpf/btf_sort.c > new file mode 100644 > index 000000000000..898f9189952c > --- /dev/null > +++ b/kernel/bpf/btf_sort.c > @@ -0,0 +1,2 @@ > +// SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause) > +#include "../../tools/lib/bpf/btf_sort.c" > diff --git a/tools/lib/bpf/Build b/tools/lib/bpf/Build > index c80204bb72a2..ed7c2506e22d 100644 > --- a/tools/lib/bpf/Build > +++ b/tools/lib/bpf/Build > @@ -1,4 +1,4 @@ > libbpf-y := libbpf.o bpf.o nlattr.o btf.o libbpf_utils.o \ > netlink.o bpf_prog_linfo.o libbpf_probes.o hashmap.o \ > btf_dump.o ringbuf.o strset.o linker.o gen_loader.o relo_core.o \ > - usdt.o zip.o elf.o features.o btf_iter.o btf_relocate.o > + usdt.o zip.o elf.o features.o btf_iter.o btf_relocate.o btf_sort.o > diff --git a/tools/lib/bpf/btf.c b/tools/lib/bpf/btf.c > index 18907f0fcf9f..87e47f0b78ba 100644 > --- a/tools/lib/bpf/btf.c > +++ b/tools/lib/bpf/btf.c > @@ -1,6 +1,9 @@ > // SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause) > /* Copyright (c) 2018 Facebook */ > > +#ifndef _GNU_SOURCE > +#define _GNU_SOURCE > +#endif > #include <byteswap.h> > #include <endian.h> > #include <stdio.h> > @@ -92,6 +95,9 @@ struct btf { > * - for split BTF counts number of types added on top of base BTF. > */ > __u32 nr_types; > + /* number of named types in this BTF instance for binary search > + */ > + __u32 nr_sorted_types; > /* if not NULL, points to the base BTF on top of which the current > * split BTF is based > */ > @@ -619,6 +625,21 @@ __u32 btf__type_cnt(const struct btf *btf) > return btf->start_id + btf->nr_types; > } > > +__u32 btf__start_id(const struct btf *btf) > +{ > + return btf->start_id; > +} > + > +__u32 btf__nr_sorted_types(const struct btf *btf) > +{ > + return btf->nr_sorted_types; > +} > + > +void btf__set_nr_sorted_types(struct btf *btf, __u32 nr) > +{ > + btf->nr_sorted_types = nr; > +} > + > const struct btf *btf__base_btf(const struct btf *btf) > { > return btf->base_btf; > @@ -915,38 +936,16 @@ __s32 btf__find_by_name(const struct btf *btf, const char *type_name) > return libbpf_err(-ENOENT); > } > > -static __s32 btf_find_by_name_kind(const struct btf *btf, int start_id, > - const char *type_name, __u32 kind) > -{ > - __u32 i, nr_types = btf__type_cnt(btf); > - > - if (kind == BTF_KIND_UNKN || !strcmp(type_name, "void")) > - return 0; > - > - for (i = start_id; i < nr_types; i++) { > - const struct btf_type *t = btf__type_by_id(btf, i); > - const char *name; > - > - if (btf_kind(t) != kind) > - continue; > - name = btf__name_by_offset(btf, t->name_off); > - if (name && !strcmp(type_name, name)) > - return i; > - } > - > - return libbpf_err(-ENOENT); > -} > - > __s32 btf__find_by_name_kind_own(const struct btf *btf, const char *type_name, > __u32 kind) > { > - return btf_find_by_name_kind(btf, btf->start_id, type_name, kind); > + return find_btf_by_name_kind(btf, btf->start_id, type_name, kind); > } > > __s32 btf__find_by_name_kind(const struct btf *btf, const char *type_name, > __u32 kind) > { > - return btf_find_by_name_kind(btf, 1, type_name, kind); > + return find_btf_by_name_kind(btf, 1, type_name, kind); > } > > static bool btf_is_modifiable(const struct btf *btf) > @@ -3411,6 +3410,7 @@ static int btf_dedup_struct_types(struct btf_dedup *d); > static int btf_dedup_ref_types(struct btf_dedup *d); > static int btf_dedup_resolve_fwds(struct btf_dedup *d); > static int btf_dedup_compact_types(struct btf_dedup *d); > +static int btf_dedup_compact_and_sort_types(struct btf_dedup *d); > static int btf_dedup_remap_types(struct btf_dedup *d); > > /* > @@ -3600,7 +3600,7 @@ int btf__dedup(struct btf *btf, const struct btf_dedup_opts *opts) > pr_debug("btf_dedup_ref_types failed: %s\n", errstr(err)); > goto done; > } > - err = btf_dedup_compact_types(d); > + err = btf_dedup_compact_and_sort_types(d); > if (err < 0) { > pr_debug("btf_dedup_compact_types failed: %s\n", errstr(err)); > goto done; > @@ -3649,6 +3649,8 @@ struct btf_dedup { > * BTF is considered to be immutable. > */ > bool hypot_adjust_canon; > + /* Sort btf_types by kind and time */ > + bool sort_by_kind_name; > /* Various option modifying behavior of algorithm */ > struct btf_dedup_opts opts; > /* temporary strings deduplication state */ > @@ -3741,6 +3743,7 @@ static struct btf_dedup *btf_dedup_new(struct btf *btf, const struct btf_dedup_o > > d->btf = btf; > d->btf_ext = OPTS_GET(opts, btf_ext, NULL); > + d->sort_by_kind_name = OPTS_GET(opts, sort_by_kind_name, false); > > d->dedup_table = hashmap__new(hash_fn, btf_dedup_equal_fn, NULL); > if (IS_ERR(d->dedup_table)) { > @@ -5288,6 +5291,116 @@ static int btf_dedup_compact_types(struct btf_dedup *d) > return 0; > } > > +static __u32 *get_sorted_canon_types(struct btf_dedup *d, __u32 *cnt) > +{ > + int i, j, id, types_cnt = 0; > + __u32 *sorted_ids; > + > + for (i = 0, id = d->btf->start_id; i < d->btf->nr_types; i++, id++) > + if (d->map[id] == id) > + ++types_cnt; > + > + sorted_ids = calloc(types_cnt, sizeof(*sorted_ids)); > + if (!sorted_ids) > + return NULL; > + > + for (j = 0, i = 0, id = d->btf->start_id; i < d->btf->nr_types; i++, id++) > + if (d->map[id] == id) > + sorted_ids[j++] = id; > + > + qsort_r(sorted_ids, types_cnt, sizeof(*sorted_ids), > + btf_compare_type_kinds_names, d->btf); > + > + *cnt = types_cnt; > + > + return sorted_ids; > +} > + > +/* > + * Compact and sort BTF types. > + * > + * Similar to btf_dedup_compact_types, but additionally sorts the btf_types. > + */ > +static int btf__dedup_compact_and_sort_types(struct btf_dedup *d) > +{ > + __u32 canon_types_cnt = 0, canon_types_len = 0; > + __u32 *new_offs = NULL, *canon_types = NULL; > + const struct btf_type *t; > + void *p, *new_types = NULL; > + int i, id, len, err; > + > + /* we are going to reuse hypot_map to store compaction remapping */ > + d->hypot_map[0] = 0; > + /* base BTF types are not renumbered */ > + for (id = 1; id < d->btf->start_id; id++) > + d->hypot_map[id] = id; > + for (i = 0, id = d->btf->start_id; i < d->btf->nr_types; i++, id++) > + d->hypot_map[id] = BTF_UNPROCESSED_ID; > + > + canon_types = get_sorted_canon_types(d, &canon_types_cnt); > + if (!canon_types) { > + err = -ENOMEM; > + goto out_err; > + } > + > + for (i = 0; i < canon_types_cnt; i++) { > + id = canon_types[i]; > + t = btf__type_by_id(d->btf, id); > + len = btf_type_size(t); > + if (len < 0) { > + err = len; > + goto out_err; > + } > + canon_types_len += len; > + } > + > + new_offs = calloc(canon_types_cnt, sizeof(*new_offs)); > + new_types = calloc(canon_types_len, 1); > + if (!new_types || !new_offs) { > + err = -ENOMEM; > + goto out_err; > + } > + > + p = new_types; > + > + for (i = 0; i < canon_types_cnt; i++) { > + id = canon_types[i]; > + t = btf__type_by_id(d->btf, id); > + len = btf_type_size(t); > + memcpy(p, t, len); > + d->hypot_map[id] = d->btf->start_id + i; > + new_offs[i] = p - new_types; > + p += len; > + } > + > + /* shrink struct btf's internal types index and update btf_header */ > + free(d->btf->types_data); > + free(d->btf->type_offs); > + d->btf->types_data = new_types; > + d->btf->type_offs = new_offs; > + d->btf->types_data_cap = canon_types_len; > + d->btf->type_offs_cap = canon_types_cnt; > + d->btf->nr_types = canon_types_cnt; > + d->btf->hdr->type_len = canon_types_len; > + d->btf->hdr->str_off = d->btf->hdr->type_len; > + d->btf->raw_size = d->btf->hdr->hdr_len + d->btf->hdr->type_len + d->btf->hdr->str_len; > + free(canon_types); > + return 0; > + > +out_err: > + free(canon_types); > + free(new_types); > + free(new_offs); > + return err; > +} > + > +static int btf_dedup_compact_and_sort_types(struct btf_dedup *d) > +{ > + if (d->sort_by_kind_name) > + return btf__dedup_compact_and_sort_types(d); > + return btf_dedup_compact_types(d); > +} > + > /* > * Figure out final (deduplicated and compacted) type ID for provided original > * `type_id` by first resolving it into corresponding canonical type ID and > diff --git a/tools/lib/bpf/btf.h b/tools/lib/bpf/btf.h > index ccfd905f03df..9a7cfe6b4bb3 100644 > --- a/tools/lib/bpf/btf.h > +++ b/tools/lib/bpf/btf.h > @@ -251,6 +251,8 @@ struct btf_dedup_opts { > size_t sz; > /* optional .BTF.ext info to dedup along the main BTF info */ > struct btf_ext *btf_ext; > + /* Sort btf_types by kind and name */ > + bool sort_by_kind_name; > /* force hash collisions (used for testing) */ > bool force_collisions; > size_t :0; > diff --git a/tools/lib/bpf/btf_sort.c b/tools/lib/bpf/btf_sort.c > new file mode 100644 > index 000000000000..2ad4a56f1c08 > --- /dev/null > +++ b/tools/lib/bpf/btf_sort.c > @@ -0,0 +1,159 @@ > +// SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause) > + > +#ifndef _GNU_SOURCE > +#define _GNU_SOURCE > +#endif > + > +#ifdef __KERNEL__ > +#include <linux/bpf.h> > +#include <linux/btf.h> > +#include <linux/string.h> > + > +#define btf_type_by_id (struct btf_type *)btf_type_by_id > +#define btf__str_by_offset btf_str_by_offset > +#define btf__name_by_offset btf_name_by_offset > +#define btf__type_cnt btf_nr_types > +#define btf__start_id btf_start_id > +#define btf__nr_sorted_types btf_nr_sorted_types > +#define btf__set_nr_sorted_types btf_set_nr_sorted_types > +#define btf__base_btf btf_base_btf > +#define libbpf_err(x) x > + > +#else > + > +#include "btf.h" > +#include "bpf.h" > +#include "libbpf.h" > +#include "libbpf_internal.h" > + > +#endif /* __KERNEL__ */ > + > +/* Skip the sorted check if number of btf_types is below threshold > + */ > +#define BTF_CHECK_SORT_THRESHOLD 8 > + > +struct btf; > + > +static int cmp_btf_kind_name(int ka, const char *na, int kb, const char *nb) > +{ > + return (ka - kb) ?: strcmp(na, nb); > +} > + > +/* > + * Sort BTF types by kind and name in ascending order, placing named types > + * before anonymous ones. > + */ > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) > +{ > + struct btf *btf = (struct btf *)priv; > + struct btf_type *ta = btf_type_by_id(btf, *(__u32 *)a); > + struct btf_type *tb = btf_type_by_id(btf, *(__u32 *)b); > + const char *na, *nb; > + int ka, kb; > + > + /* ta w/o name is greater than tb */ > + if (!ta->name_off && tb->name_off) > + return 1; > + /* tb w/o name is smaller than ta */ > + if (ta->name_off && !tb->name_off) > + return -1; > + > + ka = btf_kind(ta); > + kb = btf_kind(tb); > + na = btf__str_by_offset(btf, ta->name_off); > + nb = btf__str_by_offset(btf, tb->name_off); > + > + return cmp_btf_kind_name(ka, na, kb, nb); > +} > + > +__s32 find_btf_by_name_kind(const struct btf *btf, int start_id, > + const char *type_name, __u32 kind) > +{ > + const struct btf_type *t; > + const char *tname; > + __u32 i, total; > + > + if (kind == BTF_KIND_UNKN || !strcmp(type_name, "void")) > + return 0; > + > + do { > + if (btf__nr_sorted_types(btf)) { > + /* binary search */ > + __s32 start, end, mid, found = -1; > + int ret; > + > + start = btf__start_id(btf); > + end = start + btf__nr_sorted_types(btf) - 1; > + /* found the leftmost btf_type that matches */ > + while(start <= end) { > + mid = start + (end - start) / 2; > + t = btf_type_by_id(btf, mid); > + tname = btf__name_by_offset(btf, t->name_off); > + ret = cmp_btf_kind_name(BTF_INFO_KIND(t->info), tname, > + kind, type_name); > + if (ret == 0) > + found = mid; > + if (ret < 0) > + start = mid + 1; > + else if (ret >= 0) > + end = mid - 1; > + } > + > + if (found != -1) > + return found; > + } else { > + /* linear search */ > + total = btf__type_cnt(btf); > + for (i = btf__start_id(btf); i < total; i++) { > + t = btf_type_by_id(btf, i); > + if (btf_kind(t) != kind) > + continue; > + > + tname = btf__name_by_offset(btf, t->name_off); > + if (tname && !strcmp(tname, type_name)) > + return i; > + } > + } > + > + btf = btf__base_btf(btf); > + } while (btf && btf__start_id(btf) >= start_id); > + > + return libbpf_err(-ENOENT); > +} > + > +void btf_check_sorted(struct btf *btf, int start_id) > +{ > + const struct btf_type *t; > + int i, n, nr_sorted_types; > + > + n = btf__type_cnt(btf); > + if ((n - start_id) < BTF_CHECK_SORT_THRESHOLD) > + return; > + > + n--; > + nr_sorted_types = 0; > + for (i = start_id; i < n; i++) { > + int k = i + 1; > + > + t = btf_type_by_id(btf, i); > + if (!btf__str_by_offset(btf, t->name_off)) > + return; > + > + t = btf_type_by_id(btf, k); > + if (!btf__str_by_offset(btf, t->name_off)) > + return; > + > + if (btf_compare_type_kinds_names(&i, &k, btf) > 0) > + return; > + > + if (t->name_off) > + nr_sorted_types++; > + } > + > + t = btf_type_by_id(btf, start_id); > + if (t->name_off) > + nr_sorted_types++; > + if (nr_sorted_types >= BTF_CHECK_SORT_THRESHOLD) > + btf__set_nr_sorted_types(btf, nr_sorted_types); > +} > + > diff --git a/tools/lib/bpf/libbpf_internal.h b/tools/lib/bpf/libbpf_internal.h > index 35b2527bedec..f71f3e70c51c 100644 > --- a/tools/lib/bpf/libbpf_internal.h > +++ b/tools/lib/bpf/libbpf_internal.h > @@ -248,6 +248,12 @@ const struct btf_type *skip_mods_and_typedefs(const struct btf *btf, __u32 id, _ > const struct btf_header *btf_header(const struct btf *btf); > void btf_set_base_btf(struct btf *btf, const struct btf *base_btf); > int btf_relocate(struct btf *btf, const struct btf *base_btf, __u32 **id_map); > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv); > +__s32 find_btf_by_name_kind(const struct btf *btf, int start_id, const char *type_name, __u32 kind); > +void btf_check_sorted(struct btf *btf, int start_id); > +__u32 btf__start_id(const struct btf *btf); > +__u32 btf__nr_sorted_types(const struct btf *btf); > +void btf__set_nr_sorted_types(struct btf *btf, __u32 nr); > > static inline enum btf_func_linkage btf_func_linkage(const struct btf_type *t) > { ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-21 17:24 ` Alan Maguire @ 2025-10-22 4:47 ` Donglin Peng 0 siblings, 0 replies; 32+ messages in thread From: Donglin Peng @ 2025-10-22 4:47 UTC (permalink / raw) To: Alan Maguire Cc: ast, linux-kernel, bpf, Eduard Zingerman, Andrii Nakryiko, Song Liu, pengdonglin On Wed, Oct 22, 2025 at 1:25 AM Alan Maguire <alan.maguire@oracle.com> wrote: > > On 20/10/2025 10:39, Donglin Peng wrote: > > This patch implements sorting of BTF types by their kind and name, > > enabling the use of binary search for type lookups. > > > > To share logic between kernel and libbpf, a new btf_sort.c file is > > introduced containing common sorting functionality. > > > > The sorting is performed during btf__dedup() when the new > > sort_by_kind_name option in btf_dedup_opts is enabled. > > > > For vmlinux and kernel module BTF, btf_check_sorted() verifies > > whether the types are sorted and binary search can be used. > > > > this looks great! one thing that might make libbpf integration easier > though (and Andrii can probably best answer if it's actually a problem) > - would it be possible to separate the libbpf and non-libbpf parts here; > i.e have a patch that adds the libbpf stuff first, then re-use it in a > separate patch for kernel sorting? Thanks for the suggestion, I will split it in the next version. > > > Cc: Eduard Zingerman <eddyz87@gmail.com> > > Cc: Alexei Starovoitov <ast@kernel.org> > > Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com> > > Cc: Alan Maguire <alan.maguire@oracle.com> > > Cc: Song Liu <song@kernel.org> > > Co-developed-by: Eduard Zingerman <eddyz87@gmail.com> > > Signed-off-by: pengdonglin <pengdonglin@xiaomi.com> > > Signed-off-by: Donglin Peng <dolinux.peng@gmail.com> > > --- > > include/linux/btf.h | 20 +++- > > kernel/bpf/Makefile | 1 + > > kernel/bpf/btf.c | 39 ++++---- > > kernel/bpf/btf_sort.c | 2 + > > tools/lib/bpf/Build | 2 +- > > tools/lib/bpf/btf.c | 163 +++++++++++++++++++++++++++----- > > tools/lib/bpf/btf.h | 2 + > > tools/lib/bpf/btf_sort.c | 159 +++++++++++++++++++++++++++++++ > > tools/lib/bpf/libbpf_internal.h | 6 ++ > > 9 files changed, 347 insertions(+), 47 deletions(-) > > create mode 100644 kernel/bpf/btf_sort.c > > create mode 100644 tools/lib/bpf/btf_sort.c > > > > diff --git a/include/linux/btf.h b/include/linux/btf.h > > index ddc53a7ac7cd..c6fe5e689ab9 100644 > > --- a/include/linux/btf.h > > +++ b/include/linux/btf.h > > @@ -221,7 +221,10 @@ bool btf_is_vmlinux(const struct btf *btf); > > struct module *btf_try_get_module(const struct btf *btf); > > u32 btf_nr_types(const struct btf *btf); > > u32 btf_type_cnt(const struct btf *btf); > > -struct btf *btf_base_btf(const struct btf *btf); > > +u32 btf_start_id(const struct btf *btf); > > +u32 btf_nr_sorted_types(const struct btf *btf); > > +void btf_set_nr_sorted_types(struct btf *btf, u32 nr); > > +struct btf* btf_base_btf(const struct btf *btf); > > bool btf_type_is_i32(const struct btf_type *t); > > bool btf_type_is_i64(const struct btf_type *t); > > bool btf_type_is_primitive(const struct btf_type *t); > > @@ -595,6 +598,10 @@ int get_kern_ctx_btf_id(struct bpf_verifier_log *log, enum bpf_prog_type prog_ty > > bool btf_types_are_same(const struct btf *btf1, u32 id1, > > const struct btf *btf2, u32 id2); > > int btf_check_iter_arg(struct btf *btf, const struct btf_type *func, int arg_idx); > > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv); > > +s32 find_btf_by_name_kind(const struct btf *btf, int start_id, const char *type_name, > > + u32 kind); > > +void btf_check_sorted(struct btf *btf, int start_id); > > > > static inline bool btf_type_is_struct_ptr(struct btf *btf, const struct btf_type *t) > > { > > @@ -683,5 +690,16 @@ static inline int btf_check_iter_arg(struct btf *btf, const struct btf_type *fun > > { > > return -EOPNOTSUPP; > > } > > +static inline int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) > > +{ > > + return -EOPNOTSUPP; > > +} > > +static inline s32 find_btf_by_name_kind(const struct btf *btf, int start_id, const char *type_name, > > + u32 kind) > > +{ > > + return -EOPNOTSUPP; > > +} > > +static inline void btf_check_sorted(struct btf *btf, int start_id); > > +{} > > #endif > > #endif > > diff --git a/kernel/bpf/Makefile b/kernel/bpf/Makefile > > index 7fd0badfacb1..c9d8f986c7e1 100644 > > --- a/kernel/bpf/Makefile > > +++ b/kernel/bpf/Makefile > > @@ -56,6 +56,7 @@ obj-$(CONFIG_BPF_SYSCALL) += kmem_cache_iter.o > > ifeq ($(CONFIG_DMA_SHARED_BUFFER),y) > > obj-$(CONFIG_BPF_SYSCALL) += dmabuf_iter.o > > endif > > +obj-$(CONFIG_BPF_SYSCALL) += btf_sort.o > > > > CFLAGS_REMOVE_percpu_freelist.o = $(CC_FLAGS_FTRACE) > > CFLAGS_REMOVE_bpf_lru_list.o = $(CC_FLAGS_FTRACE) > > diff --git a/kernel/bpf/btf.c b/kernel/bpf/btf.c > > index c414cf37e1bd..11b05f4eb07d 100644 > > --- a/kernel/bpf/btf.c > > +++ b/kernel/bpf/btf.c > > @@ -259,6 +259,7 @@ struct btf { > > void *nohdr_data; > > struct btf_header hdr; > > u32 nr_types; /* includes VOID for base BTF */ > > + u32 nr_sorted_types; > > u32 types_size; > > u32 data_size; > > refcount_t refcnt; > > @@ -544,33 +545,29 @@ u32 btf_nr_types(const struct btf *btf) > > return total; > > } > > > > -u32 btf_type_cnt(const struct btf *btf) > > +u32 btf_start_id(const struct btf *btf) > > { > > - return btf->start_id + btf->nr_types; > > + return btf->start_id; > > } > > > > -s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > > +u32 btf_nr_sorted_types(const struct btf *btf) > > { > > - const struct btf_type *t; > > - const char *tname; > > - u32 i, total; > > - > > - do { > > - total = btf_type_cnt(btf); > > - for (i = btf->start_id; i < total; i++) { > > - t = btf_type_by_id(btf, i); > > - if (BTF_INFO_KIND(t->info) != kind) > > - continue; > > + return btf->nr_sorted_types; > > +} > > > > - tname = btf_name_by_offset(btf, t->name_off); > > - if (!strcmp(tname, name)) > > - return i; > > - } > > +void btf_set_nr_sorted_types(struct btf *btf, u32 nr) > > +{ > > + btf->nr_sorted_types = nr; > > +} > > > > - btf = btf->base_btf; > > - } while (btf); > > +u32 btf_type_cnt(const struct btf *btf) > > +{ > > + return btf->start_id + btf->nr_types; > > +} > > > > - return -ENOENT; > > +s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > > +{ > > + return find_btf_by_name_kind(btf, 1, name, kind); > > } > > > > s32 bpf_find_btf_id(const char *name, u32 kind, struct btf **btf_p) > > @@ -6239,6 +6236,7 @@ static struct btf *btf_parse_base(struct btf_verifier_env *env, const char *name > > if (err) > > goto errout; > > > > + btf_check_sorted(btf, 1); > > refcount_set(&btf->refcnt, 1); > > > > return btf; > > @@ -6371,6 +6369,7 @@ static struct btf *btf_parse_module(const char *module_name, const void *data, > > base_btf = vmlinux_btf; > > } > > > > + btf_check_sorted(btf, btf_nr_types(base_btf)); > > btf_verifier_env_free(env); > > refcount_set(&btf->refcnt, 1); > > return btf; > > diff --git a/kernel/bpf/btf_sort.c b/kernel/bpf/btf_sort.c > > new file mode 100644 > > index 000000000000..898f9189952c > > --- /dev/null > > +++ b/kernel/bpf/btf_sort.c > > @@ -0,0 +1,2 @@ > > +// SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause) > > +#include "../../tools/lib/bpf/btf_sort.c" > > diff --git a/tools/lib/bpf/Build b/tools/lib/bpf/Build > > index c80204bb72a2..ed7c2506e22d 100644 > > --- a/tools/lib/bpf/Build > > +++ b/tools/lib/bpf/Build > > @@ -1,4 +1,4 @@ > > libbpf-y := libbpf.o bpf.o nlattr.o btf.o libbpf_utils.o \ > > netlink.o bpf_prog_linfo.o libbpf_probes.o hashmap.o \ > > btf_dump.o ringbuf.o strset.o linker.o gen_loader.o relo_core.o \ > > - usdt.o zip.o elf.o features.o btf_iter.o btf_relocate.o > > + usdt.o zip.o elf.o features.o btf_iter.o btf_relocate.o btf_sort.o > > diff --git a/tools/lib/bpf/btf.c b/tools/lib/bpf/btf.c > > index 18907f0fcf9f..87e47f0b78ba 100644 > > --- a/tools/lib/bpf/btf.c > > +++ b/tools/lib/bpf/btf.c > > @@ -1,6 +1,9 @@ > > // SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause) > > /* Copyright (c) 2018 Facebook */ > > > > +#ifndef _GNU_SOURCE > > +#define _GNU_SOURCE > > +#endif > > #include <byteswap.h> > > #include <endian.h> > > #include <stdio.h> > > @@ -92,6 +95,9 @@ struct btf { > > * - for split BTF counts number of types added on top of base BTF. > > */ > > __u32 nr_types; > > + /* number of named types in this BTF instance for binary search > > + */ > > + __u32 nr_sorted_types; > > /* if not NULL, points to the base BTF on top of which the current > > * split BTF is based > > */ > > @@ -619,6 +625,21 @@ __u32 btf__type_cnt(const struct btf *btf) > > return btf->start_id + btf->nr_types; > > } > > > > +__u32 btf__start_id(const struct btf *btf) > > +{ > > + return btf->start_id; > > +} > > + > > +__u32 btf__nr_sorted_types(const struct btf *btf) > > +{ > > + return btf->nr_sorted_types; > > +} > > + > > +void btf__set_nr_sorted_types(struct btf *btf, __u32 nr) > > +{ > > + btf->nr_sorted_types = nr; > > +} > > + > > const struct btf *btf__base_btf(const struct btf *btf) > > { > > return btf->base_btf; > > @@ -915,38 +936,16 @@ __s32 btf__find_by_name(const struct btf *btf, const char *type_name) > > return libbpf_err(-ENOENT); > > } > > > > -static __s32 btf_find_by_name_kind(const struct btf *btf, int start_id, > > - const char *type_name, __u32 kind) > > -{ > > - __u32 i, nr_types = btf__type_cnt(btf); > > - > > - if (kind == BTF_KIND_UNKN || !strcmp(type_name, "void")) > > - return 0; > > - > > - for (i = start_id; i < nr_types; i++) { > > - const struct btf_type *t = btf__type_by_id(btf, i); > > - const char *name; > > - > > - if (btf_kind(t) != kind) > > - continue; > > - name = btf__name_by_offset(btf, t->name_off); > > - if (name && !strcmp(type_name, name)) > > - return i; > > - } > > - > > - return libbpf_err(-ENOENT); > > -} > > - > > __s32 btf__find_by_name_kind_own(const struct btf *btf, const char *type_name, > > __u32 kind) > > { > > - return btf_find_by_name_kind(btf, btf->start_id, type_name, kind); > > + return find_btf_by_name_kind(btf, btf->start_id, type_name, kind); > > } > > > > __s32 btf__find_by_name_kind(const struct btf *btf, const char *type_name, > > __u32 kind) > > { > > - return btf_find_by_name_kind(btf, 1, type_name, kind); > > + return find_btf_by_name_kind(btf, 1, type_name, kind); > > } > > > > static bool btf_is_modifiable(const struct btf *btf) > > @@ -3411,6 +3410,7 @@ static int btf_dedup_struct_types(struct btf_dedup *d); > > static int btf_dedup_ref_types(struct btf_dedup *d); > > static int btf_dedup_resolve_fwds(struct btf_dedup *d); > > static int btf_dedup_compact_types(struct btf_dedup *d); > > +static int btf_dedup_compact_and_sort_types(struct btf_dedup *d); > > static int btf_dedup_remap_types(struct btf_dedup *d); > > > > /* > > @@ -3600,7 +3600,7 @@ int btf__dedup(struct btf *btf, const struct btf_dedup_opts *opts) > > pr_debug("btf_dedup_ref_types failed: %s\n", errstr(err)); > > goto done; > > } > > - err = btf_dedup_compact_types(d); > > + err = btf_dedup_compact_and_sort_types(d); > > if (err < 0) { > > pr_debug("btf_dedup_compact_types failed: %s\n", errstr(err)); > > goto done; > > @@ -3649,6 +3649,8 @@ struct btf_dedup { > > * BTF is considered to be immutable. > > */ > > bool hypot_adjust_canon; > > + /* Sort btf_types by kind and time */ > > + bool sort_by_kind_name; > > /* Various option modifying behavior of algorithm */ > > struct btf_dedup_opts opts; > > /* temporary strings deduplication state */ > > @@ -3741,6 +3743,7 @@ static struct btf_dedup *btf_dedup_new(struct btf *btf, const struct btf_dedup_o > > > > d->btf = btf; > > d->btf_ext = OPTS_GET(opts, btf_ext, NULL); > > + d->sort_by_kind_name = OPTS_GET(opts, sort_by_kind_name, false); > > > > d->dedup_table = hashmap__new(hash_fn, btf_dedup_equal_fn, NULL); > > if (IS_ERR(d->dedup_table)) { > > @@ -5288,6 +5291,116 @@ static int btf_dedup_compact_types(struct btf_dedup *d) > > return 0; > > } > > > > +static __u32 *get_sorted_canon_types(struct btf_dedup *d, __u32 *cnt) > > +{ > > + int i, j, id, types_cnt = 0; > > + __u32 *sorted_ids; > > + > > + for (i = 0, id = d->btf->start_id; i < d->btf->nr_types; i++, id++) > > + if (d->map[id] == id) > > + ++types_cnt; > > + > > + sorted_ids = calloc(types_cnt, sizeof(*sorted_ids)); > > + if (!sorted_ids) > > + return NULL; > > + > > + for (j = 0, i = 0, id = d->btf->start_id; i < d->btf->nr_types; i++, id++) > > + if (d->map[id] == id) > > + sorted_ids[j++] = id; > > + > > + qsort_r(sorted_ids, types_cnt, sizeof(*sorted_ids), > > + btf_compare_type_kinds_names, d->btf); > > + > > + *cnt = types_cnt; > > + > > + return sorted_ids; > > +} > > + > > +/* > > + * Compact and sort BTF types. > > + * > > + * Similar to btf_dedup_compact_types, but additionally sorts the btf_types. > > + */ > > +static int btf__dedup_compact_and_sort_types(struct btf_dedup *d) > > +{ > > + __u32 canon_types_cnt = 0, canon_types_len = 0; > > + __u32 *new_offs = NULL, *canon_types = NULL; > > + const struct btf_type *t; > > + void *p, *new_types = NULL; > > + int i, id, len, err; > > + > > + /* we are going to reuse hypot_map to store compaction remapping */ > > + d->hypot_map[0] = 0; > > + /* base BTF types are not renumbered */ > > + for (id = 1; id < d->btf->start_id; id++) > > + d->hypot_map[id] = id; > > + for (i = 0, id = d->btf->start_id; i < d->btf->nr_types; i++, id++) > > + d->hypot_map[id] = BTF_UNPROCESSED_ID; > > + > > + canon_types = get_sorted_canon_types(d, &canon_types_cnt); > > + if (!canon_types) { > > + err = -ENOMEM; > > + goto out_err; > > + } > > + > > + for (i = 0; i < canon_types_cnt; i++) { > > + id = canon_types[i]; > > + t = btf__type_by_id(d->btf, id); > > + len = btf_type_size(t); > > + if (len < 0) { > > + err = len; > > + goto out_err; > > + } > > + canon_types_len += len; > > + } > > + > > + new_offs = calloc(canon_types_cnt, sizeof(*new_offs)); > > + new_types = calloc(canon_types_len, 1); > > + if (!new_types || !new_offs) { > > + err = -ENOMEM; > > + goto out_err; > > + } > > + > > + p = new_types; > > + > > + for (i = 0; i < canon_types_cnt; i++) { > > + id = canon_types[i]; > > + t = btf__type_by_id(d->btf, id); > > + len = btf_type_size(t); > > + memcpy(p, t, len); > > + d->hypot_map[id] = d->btf->start_id + i; > > + new_offs[i] = p - new_types; > > + p += len; > > + } > > + > > + /* shrink struct btf's internal types index and update btf_header */ > > + free(d->btf->types_data); > > + free(d->btf->type_offs); > > + d->btf->types_data = new_types; > > + d->btf->type_offs = new_offs; > > + d->btf->types_data_cap = canon_types_len; > > + d->btf->type_offs_cap = canon_types_cnt; > > + d->btf->nr_types = canon_types_cnt; > > + d->btf->hdr->type_len = canon_types_len; > > + d->btf->hdr->str_off = d->btf->hdr->type_len; > > + d->btf->raw_size = d->btf->hdr->hdr_len + d->btf->hdr->type_len + d->btf->hdr->str_len; > > + free(canon_types); > > + return 0; > > + > > +out_err: > > + free(canon_types); > > + free(new_types); > > + free(new_offs); > > + return err; > > +} > > + > > +static int btf_dedup_compact_and_sort_types(struct btf_dedup *d) > > +{ > > + if (d->sort_by_kind_name) > > + return btf__dedup_compact_and_sort_types(d); > > + return btf_dedup_compact_types(d); > > +} > > + > > /* > > * Figure out final (deduplicated and compacted) type ID for provided original > > * `type_id` by first resolving it into corresponding canonical type ID and > > diff --git a/tools/lib/bpf/btf.h b/tools/lib/bpf/btf.h > > index ccfd905f03df..9a7cfe6b4bb3 100644 > > --- a/tools/lib/bpf/btf.h > > +++ b/tools/lib/bpf/btf.h > > @@ -251,6 +251,8 @@ struct btf_dedup_opts { > > size_t sz; > > /* optional .BTF.ext info to dedup along the main BTF info */ > > struct btf_ext *btf_ext; > > + /* Sort btf_types by kind and name */ > > + bool sort_by_kind_name; > > /* force hash collisions (used for testing) */ > > bool force_collisions; > > size_t :0; > > diff --git a/tools/lib/bpf/btf_sort.c b/tools/lib/bpf/btf_sort.c > > new file mode 100644 > > index 000000000000..2ad4a56f1c08 > > --- /dev/null > > +++ b/tools/lib/bpf/btf_sort.c > > @@ -0,0 +1,159 @@ > > +// SPDX-License-Identifier: (LGPL-2.1 OR BSD-2-Clause) > > + > > +#ifndef _GNU_SOURCE > > +#define _GNU_SOURCE > > +#endif > > + > > +#ifdef __KERNEL__ > > +#include <linux/bpf.h> > > +#include <linux/btf.h> > > +#include <linux/string.h> > > + > > +#define btf_type_by_id (struct btf_type *)btf_type_by_id > > +#define btf__str_by_offset btf_str_by_offset > > +#define btf__name_by_offset btf_name_by_offset > > +#define btf__type_cnt btf_nr_types > > +#define btf__start_id btf_start_id > > +#define btf__nr_sorted_types btf_nr_sorted_types > > +#define btf__set_nr_sorted_types btf_set_nr_sorted_types > > +#define btf__base_btf btf_base_btf > > +#define libbpf_err(x) x > > + > > +#else > > + > > +#include "btf.h" > > +#include "bpf.h" > > +#include "libbpf.h" > > +#include "libbpf_internal.h" > > + > > +#endif /* __KERNEL__ */ > > + > > +/* Skip the sorted check if number of btf_types is below threshold > > + */ > > +#define BTF_CHECK_SORT_THRESHOLD 8 > > + > > +struct btf; > > + > > +static int cmp_btf_kind_name(int ka, const char *na, int kb, const char *nb) > > +{ > > + return (ka - kb) ?: strcmp(na, nb); > > +} > > + > > +/* > > + * Sort BTF types by kind and name in ascending order, placing named types > > + * before anonymous ones. > > + */ > > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) > > +{ > > + struct btf *btf = (struct btf *)priv; > > + struct btf_type *ta = btf_type_by_id(btf, *(__u32 *)a); > > + struct btf_type *tb = btf_type_by_id(btf, *(__u32 *)b); > > + const char *na, *nb; > > + int ka, kb; > > + > > + /* ta w/o name is greater than tb */ > > + if (!ta->name_off && tb->name_off) > > + return 1; > > + /* tb w/o name is smaller than ta */ > > + if (ta->name_off && !tb->name_off) > > + return -1; > > + > > + ka = btf_kind(ta); > > + kb = btf_kind(tb); > > + na = btf__str_by_offset(btf, ta->name_off); > > + nb = btf__str_by_offset(btf, tb->name_off); > > + > > + return cmp_btf_kind_name(ka, na, kb, nb); > > +} > > + > > +__s32 find_btf_by_name_kind(const struct btf *btf, int start_id, > > + const char *type_name, __u32 kind) > > +{ > > + const struct btf_type *t; > > + const char *tname; > > + __u32 i, total; > > + > > + if (kind == BTF_KIND_UNKN || !strcmp(type_name, "void")) > > + return 0; > > + > > + do { > > + if (btf__nr_sorted_types(btf)) { > > + /* binary search */ > > + __s32 start, end, mid, found = -1; > > + int ret; > > + > > + start = btf__start_id(btf); > > + end = start + btf__nr_sorted_types(btf) - 1; > > + /* found the leftmost btf_type that matches */ > > + while(start <= end) { > > + mid = start + (end - start) / 2; > > + t = btf_type_by_id(btf, mid); > > + tname = btf__name_by_offset(btf, t->name_off); > > + ret = cmp_btf_kind_name(BTF_INFO_KIND(t->info), tname, > > + kind, type_name); > > + if (ret == 0) > > + found = mid; > > + if (ret < 0) > > + start = mid + 1; > > + else if (ret >= 0) > > + end = mid - 1; > > + } > > + > > + if (found != -1) > > + return found; > > + } else { > > + /* linear search */ > > + total = btf__type_cnt(btf); > > + for (i = btf__start_id(btf); i < total; i++) { > > + t = btf_type_by_id(btf, i); > > + if (btf_kind(t) != kind) > > + continue; > > + > > + tname = btf__name_by_offset(btf, t->name_off); > > + if (tname && !strcmp(tname, type_name)) > > + return i; > > + } > > + } > > + > > + btf = btf__base_btf(btf); > > + } while (btf && btf__start_id(btf) >= start_id); > > + > > + return libbpf_err(-ENOENT); > > +} > > + > > +void btf_check_sorted(struct btf *btf, int start_id) > > +{ > > + const struct btf_type *t; > > + int i, n, nr_sorted_types; > > + > > + n = btf__type_cnt(btf); > > + if ((n - start_id) < BTF_CHECK_SORT_THRESHOLD) > > + return; > > + > > + n--; > > + nr_sorted_types = 0; > > + for (i = start_id; i < n; i++) { > > + int k = i + 1; > > + > > + t = btf_type_by_id(btf, i); > > + if (!btf__str_by_offset(btf, t->name_off)) > > + return; > > + > > + t = btf_type_by_id(btf, k); > > + if (!btf__str_by_offset(btf, t->name_off)) > > + return; > > + > > + if (btf_compare_type_kinds_names(&i, &k, btf) > 0) > > + return; > > + > > + if (t->name_off) > > + nr_sorted_types++; > > + } > > + > > + t = btf_type_by_id(btf, start_id); > > + if (t->name_off) > > + nr_sorted_types++; > > + if (nr_sorted_types >= BTF_CHECK_SORT_THRESHOLD) > > + btf__set_nr_sorted_types(btf, nr_sorted_types); > > +} > > + > > diff --git a/tools/lib/bpf/libbpf_internal.h b/tools/lib/bpf/libbpf_internal.h > > index 35b2527bedec..f71f3e70c51c 100644 > > --- a/tools/lib/bpf/libbpf_internal.h > > +++ b/tools/lib/bpf/libbpf_internal.h > > @@ -248,6 +248,12 @@ const struct btf_type *skip_mods_and_typedefs(const struct btf *btf, __u32 id, _ > > const struct btf_header *btf_header(const struct btf *btf); > > void btf_set_base_btf(struct btf *btf, const struct btf *base_btf); > > int btf_relocate(struct btf *btf, const struct btf *base_btf, __u32 **id_map); > > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv); > > +__s32 find_btf_by_name_kind(const struct btf *btf, int start_id, const char *type_name, __u32 kind); > > +void btf_check_sorted(struct btf *btf, int start_id); > > +__u32 btf__start_id(const struct btf *btf); > > +__u32 btf__nr_sorted_types(const struct btf *btf); > > +void btf__set_nr_sorted_types(struct btf *btf, __u32 nr); > > > > static inline enum btf_func_linkage btf_func_linkage(const struct btf_type *t) > > { > ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-20 9:39 ` [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search Donglin Peng 2025-10-21 17:24 ` Alan Maguire @ 2025-10-21 18:59 ` Eduard Zingerman 2025-10-22 3:02 ` Donglin Peng 1 sibling, 1 reply; 32+ messages in thread From: Eduard Zingerman @ 2025-10-21 18:59 UTC (permalink / raw) To: Donglin Peng, ast Cc: linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > This patch implements sorting of BTF types by their kind and name, > enabling the use of binary search for type lookups. > > To share logic between kernel and libbpf, a new btf_sort.c file is > introduced containing common sorting functionality. > > The sorting is performed during btf__dedup() when the new > sort_by_kind_name option in btf_dedup_opts is enabled. Do we really need this option? Dedup is free to rearrange btf types anyway, so why not sort always? Is execution time a concern? > For vmlinux and kernel module BTF, btf_check_sorted() verifies > whether the types are sorted and binary search can be used. [...] > diff --git a/kernel/bpf/btf.c b/kernel/bpf/btf.c > index c414cf37e1bd..11b05f4eb07d 100644 > --- a/kernel/bpf/btf.c > +++ b/kernel/bpf/btf.c > @@ -259,6 +259,7 @@ struct btf { > void *nohdr_data; > struct btf_header hdr; > u32 nr_types; /* includes VOID for base BTF */ > + u32 nr_sorted_types; > u32 types_size; > u32 data_size; > refcount_t refcnt; > @@ -544,33 +545,29 @@ u32 btf_nr_types(const struct btf *btf) > return total; > } > > -u32 btf_type_cnt(const struct btf *btf) > +u32 btf_start_id(const struct btf *btf) > { > - return btf->start_id + btf->nr_types; > + return btf->start_id; > } > > -s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > +u32 btf_nr_sorted_types(const struct btf *btf) > { > - const struct btf_type *t; > - const char *tname; > - u32 i, total; > - > - do { > - total = btf_type_cnt(btf); > - for (i = btf->start_id; i < total; i++) { > - t = btf_type_by_id(btf, i); > - if (BTF_INFO_KIND(t->info) != kind) > - continue; > + return btf->nr_sorted_types; > +} > > - tname = btf_name_by_offset(btf, t->name_off); > - if (!strcmp(tname, name)) > - return i; > - } > +void btf_set_nr_sorted_types(struct btf *btf, u32 nr) > +{ > + btf->nr_sorted_types = nr; > +} > > - btf = btf->base_btf; > - } while (btf); > +u32 btf_type_cnt(const struct btf *btf) > +{ > + return btf->start_id + btf->nr_types; > +} > > - return -ENOENT; > +s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > +{ > + return find_btf_by_name_kind(btf, 1, name, kind); ^^^ nit: this will make it impossible to find "void" w/o a special case in the find_btf_by_name_kind(), why not start from 0? > } > > s32 bpf_find_btf_id(const char *name, u32 kind, struct btf **btf_p) [...] > diff --git a/tools/lib/bpf/btf.c b/tools/lib/bpf/btf.c > index 18907f0fcf9f..87e47f0b78ba 100644 > --- a/tools/lib/bpf/btf.c > +++ b/tools/lib/bpf/btf.c [...] > +/* > + * Compact and sort BTF types. > + * > + * Similar to btf_dedup_compact_types, but additionally sorts the btf_types. > + */ > +static int btf__dedup_compact_and_sort_types(struct btf_dedup *d) > +{ And this function will become btf__dedup_compact_types(), if BTF will be always sorted. Thus removing some code duplication. [...] > diff --git a/tools/lib/bpf/btf_sort.c b/tools/lib/bpf/btf_sort.c > new file mode 100644 > index 000000000000..2ad4a56f1c08 > --- /dev/null > +++ b/tools/lib/bpf/btf_sort.c [...] > +/* > + * Sort BTF types by kind and name in ascending order, placing named types > + * before anonymous ones. > + */ > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) > +{ > + struct btf *btf = (struct btf *)priv; > + struct btf_type *ta = btf_type_by_id(btf, *(__u32 *)a); > + struct btf_type *tb = btf_type_by_id(btf, *(__u32 *)b); > + const char *na, *nb; > + int ka, kb; > + > + /* ta w/o name is greater than tb */ > + if (!ta->name_off && tb->name_off) > + return 1; > + /* tb w/o name is smaller than ta */ > + if (ta->name_off && !tb->name_off) > + return -1; > + > + ka = btf_kind(ta); > + kb = btf_kind(tb); > + na = btf__str_by_offset(btf, ta->name_off); > + nb = btf__str_by_offset(btf, tb->name_off); > + > + return cmp_btf_kind_name(ka, na, kb, nb); If both types are anonymous and have the same kind, this will lead to strcmp(NULL, NULL). On kernel side that would lead to null pointer dereference. > +} > + > +__s32 find_btf_by_name_kind(const struct btf *btf, int start_id, > + const char *type_name, __u32 kind) Nit: having functions with names btf_find_by_name_kind and find_btf_by_name_kind is very confusing. Usually we use names like __<func> for auxiliary functions like this. > +{ > + const struct btf_type *t; > + const char *tname; > + __u32 i, total; > + > + if (kind == BTF_KIND_UNKN || !strcmp(type_name, "void")) > + return 0; > + > + do { > + if (btf__nr_sorted_types(btf)) { > + /* binary search */ > + __s32 start, end, mid, found = -1; > + int ret; > + > + start = btf__start_id(btf); > + end = start + btf__nr_sorted_types(btf) - 1; > + /* found the leftmost btf_type that matches */ > + while(start <= end) { > + mid = start + (end - start) / 2; > + t = btf_type_by_id(btf, mid); > + tname = btf__name_by_offset(btf, t->name_off); > + ret = cmp_btf_kind_name(BTF_INFO_KIND(t->info), tname, > + kind, type_name); > + if (ret == 0) > + found = mid; > + if (ret < 0) > + start = mid + 1; > + else if (ret >= 0) > + end = mid - 1; > + } > + > + if (found != -1) > + return found; > + } else { > + /* linear search */ > + total = btf__type_cnt(btf); > + for (i = btf__start_id(btf); i < total; i++) { > + t = btf_type_by_id(btf, i); > + if (btf_kind(t) != kind) > + continue; > + > + tname = btf__name_by_offset(btf, t->name_off); > + if (tname && !strcmp(tname, type_name)) > + return i; > + } > + } > + > + btf = btf__base_btf(btf); > + } while (btf && btf__start_id(btf) >= start_id); > + > + return libbpf_err(-ENOENT); > +} > + > +void btf_check_sorted(struct btf *btf, int start_id) > +{ > + const struct btf_type *t; > + int i, n, nr_sorted_types; > + > + n = btf__type_cnt(btf); > + if ((n - start_id) < BTF_CHECK_SORT_THRESHOLD) > + return; Are there any measurable performance benefits from having this special case? > + > + n--; > + nr_sorted_types = 0; > + for (i = start_id; i < n; i++) { > + int k = i + 1; > + > + t = btf_type_by_id(btf, i); > + if (!btf__str_by_offset(btf, t->name_off)) > + return; I am confused. This effectively bans BTFs with anonymous types, as btf__set_nr_sorted_types() wont be called if such types are found. Anonymous types are very common, e.g. all FUNC_PROTO are anonymous. > + > + t = btf_type_by_id(btf, k); > + if (!btf__str_by_offset(btf, t->name_off)) > + return; > + > + if (btf_compare_type_kinds_names(&i, &k, btf) > 0) > + return; > + > + if (t->name_off) > + nr_sorted_types++; > + } > + > + t = btf_type_by_id(btf, start_id); > + if (t->name_off) > + nr_sorted_types++; > + if (nr_sorted_types >= BTF_CHECK_SORT_THRESHOLD) > + btf__set_nr_sorted_types(btf, nr_sorted_types); > +} > + [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-21 18:59 ` Eduard Zingerman @ 2025-10-22 3:02 ` Donglin Peng 2025-10-22 20:50 ` Eduard Zingerman 0 siblings, 1 reply; 32+ messages in thread From: Donglin Peng @ 2025-10-22 3:02 UTC (permalink / raw) To: Eduard Zingerman Cc: ast, linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Wed, Oct 22, 2025 at 2:59 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > This patch implements sorting of BTF types by their kind and name, > > enabling the use of binary search for type lookups. > > > > To share logic between kernel and libbpf, a new btf_sort.c file is > > introduced containing common sorting functionality. > > > > The sorting is performed during btf__dedup() when the new > > sort_by_kind_name option in btf_dedup_opts is enabled. > > Do we really need this option? Dedup is free to rearrange btf types > anyway, so why not sort always? Is execution time a concern? The issue is that sorting changes the layout of BTF. Many existing selftests rely on the current, non-sorted order for their validation checks. Introducing this as an optional feature first allows us to run it without immediately breaking the tests, giving us time to fix them incrementally. > > > For vmlinux and kernel module BTF, btf_check_sorted() verifies > > whether the types are sorted and binary search can be used. > > [...] > > > diff --git a/kernel/bpf/btf.c b/kernel/bpf/btf.c > > index c414cf37e1bd..11b05f4eb07d 100644 > > --- a/kernel/bpf/btf.c > > +++ b/kernel/bpf/btf.c > > @@ -259,6 +259,7 @@ struct btf { > > void *nohdr_data; > > struct btf_header hdr; > > u32 nr_types; /* includes VOID for base BTF */ > > + u32 nr_sorted_types; > > u32 types_size; > > u32 data_size; > > refcount_t refcnt; > > @@ -544,33 +545,29 @@ u32 btf_nr_types(const struct btf *btf) > > return total; > > } > > > > -u32 btf_type_cnt(const struct btf *btf) > > +u32 btf_start_id(const struct btf *btf) > > { > > - return btf->start_id + btf->nr_types; > > + return btf->start_id; > > } > > > > -s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > > +u32 btf_nr_sorted_types(const struct btf *btf) > > { > > - const struct btf_type *t; > > - const char *tname; > > - u32 i, total; > > - > > - do { > > - total = btf_type_cnt(btf); > > - for (i = btf->start_id; i < total; i++) { > > - t = btf_type_by_id(btf, i); > > - if (BTF_INFO_KIND(t->info) != kind) > > - continue; > > + return btf->nr_sorted_types; > > +} > > > > - tname = btf_name_by_offset(btf, t->name_off); > > - if (!strcmp(tname, name)) > > - return i; > > - } > > +void btf_set_nr_sorted_types(struct btf *btf, u32 nr) > > +{ > > + btf->nr_sorted_types = nr; > > +} > > > > - btf = btf->base_btf; > > - } while (btf); > > +u32 btf_type_cnt(const struct btf *btf) > > +{ > > + return btf->start_id + btf->nr_types; > > +} > > > > - return -ENOENT; > > +s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > > +{ > > + return find_btf_by_name_kind(btf, 1, name, kind); > ^^^ > nit: this will make it impossible to find "void" w/o a special case > in the find_btf_by_name_kind(), why not start from 0? Thanks. I referred to btf__find_by_name_kind in libbpf. In btf_find_by_name_kind, there is a special check for "void". Consequently, I've added a similar special check for "void" in find_btf_by_name_kind as well. > > } > > > > s32 bpf_find_btf_id(const char *name, u32 kind, struct btf **btf_p) > > [...] > > > diff --git a/tools/lib/bpf/btf.c b/tools/lib/bpf/btf.c > > index 18907f0fcf9f..87e47f0b78ba 100644 > > --- a/tools/lib/bpf/btf.c > > +++ b/tools/lib/bpf/btf.c > > [...] > > > +/* > > + * Compact and sort BTF types. > > + * > > + * Similar to btf_dedup_compact_types, but additionally sorts the btf_types. > > + */ > > +static int btf__dedup_compact_and_sort_types(struct btf_dedup *d) > > +{ > > And this function will become btf__dedup_compact_types(), > if BTF will be always sorted. Thus removing some code duplication. Thanks for the suggestion. I think we can keep just btf__dedup_compact_and_sort_types and add a feature check before sorting, like this: if (d->sort_by_kind_name) qsort_r(sorted_ids, types_cnt, sizeof(*sorted_ids), btf_compare_type_kinds_names, d->btf); > > [...] > > > diff --git a/tools/lib/bpf/btf_sort.c b/tools/lib/bpf/btf_sort.c > > new file mode 100644 > > index 000000000000..2ad4a56f1c08 > > --- /dev/null > > +++ b/tools/lib/bpf/btf_sort.c > > [...] > > > +/* > > + * Sort BTF types by kind and name in ascending order, placing named types > > + * before anonymous ones. > > + */ > > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) > > +{ > > + struct btf *btf = (struct btf *)priv; > > + struct btf_type *ta = btf_type_by_id(btf, *(__u32 *)a); > > + struct btf_type *tb = btf_type_by_id(btf, *(__u32 *)b); > > + const char *na, *nb; > > + int ka, kb; > > + > > + /* ta w/o name is greater than tb */ > > + if (!ta->name_off && tb->name_off) > > + return 1; > > + /* tb w/o name is smaller than ta */ > > + if (ta->name_off && !tb->name_off) > > + return -1; > > + > > + ka = btf_kind(ta); > > + kb = btf_kind(tb); > > + na = btf__str_by_offset(btf, ta->name_off); > > + nb = btf__str_by_offset(btf, tb->name_off); > > + > > + return cmp_btf_kind_name(ka, na, kb, nb); > > If both types are anonymous and have the same kind, this will lead to > strcmp(NULL, NULL). On kernel side that would lead to null pointer > dereference. Thanks, I've confirmed that for anonymous types, name_off is 0, so btf__str_by_offset returns a pointer to btf->strs_data (which contains a '\0' at index 0) rather than NULL. However, when name_off is invalid, btf__str_by_offset does return NULL. Using str_is_empty will correctly handle both scenarios. Unnamed types of the same kind shall be considered equal. I will fix it in the next version. > > > +} > > + > > +__s32 find_btf_by_name_kind(const struct btf *btf, int start_id, > > + const char *type_name, __u32 kind) > > Nit: having functions with names btf_find_by_name_kind and > find_btf_by_name_kind > is very confusing. > Usually we use names like __<func> for auxiliary functions > like this. Agreed. The function will be updated to __btf_find_by_name_kind in the next version. > > > +{ > > + const struct btf_type *t; > > + const char *tname; > > + __u32 i, total; > > + > > + if (kind == BTF_KIND_UNKN || !strcmp(type_name, "void")) > > + return 0; > > + > > + do { > > + if (btf__nr_sorted_types(btf)) { > > + /* binary search */ > > + __s32 start, end, mid, found = -1; > > + int ret; > > + > > + start = btf__start_id(btf); > > + end = start + btf__nr_sorted_types(btf) - 1; > > + /* found the leftmost btf_type that matches */ > > + while(start <= end) { > > + mid = start + (end - start) / 2; > > + t = btf_type_by_id(btf, mid); > > + tname = btf__name_by_offset(btf, t->name_off); > > + ret = cmp_btf_kind_name(BTF_INFO_KIND(t->info), tname, > > + kind, type_name); > > + if (ret == 0) > > + found = mid; > > + if (ret < 0) > > + start = mid + 1; > > + else if (ret >= 0) > > + end = mid - 1; > > + } > > + > > + if (found != -1) > > + return found; > > + } else { > > + /* linear search */ > > + total = btf__type_cnt(btf); > > + for (i = btf__start_id(btf); i < total; i++) { > > + t = btf_type_by_id(btf, i); > > + if (btf_kind(t) != kind) > > + continue; > > + > > + tname = btf__name_by_offset(btf, t->name_off); > > + if (tname && !strcmp(tname, type_name)) > > + return i; > > + } > > + } > > + > > + btf = btf__base_btf(btf); > > + } while (btf && btf__start_id(btf) >= start_id); > > + > > + return libbpf_err(-ENOENT); > > +} > > + > > +void btf_check_sorted(struct btf *btf, int start_id) > > +{ > > + const struct btf_type *t; > > + int i, n, nr_sorted_types; > > + > > + n = btf__type_cnt(btf); > > + if ((n - start_id) < BTF_CHECK_SORT_THRESHOLD) > > + return; > > Are there any measurable performance benefits from having this special case? Sorry, I haven't run performance tests. The number 8 comes from the theoretical equivalence point where n/2 ≈ log2(n). > > > + > > + n--; > > + nr_sorted_types = 0; > > + for (i = start_id; i < n; i++) { > > + int k = i + 1; > > + > > + t = btf_type_by_id(btf, i); > > + if (!btf__str_by_offset(btf, t->name_off)) > > + return; > > I am confused. > This effectively bans BTFs with anonymous types, > as btf__set_nr_sorted_types() wont be called if such types are found. > Anonymous types are very common, e.g. all FUNC_PROTO are anonymous. Thanks, I thought that for anonymous types, name_off would be 0, and btf__str_by_offset would not return NULL. However if the name_off is invalid, it will return NULL. So I plan to modify btf_compare_type_kinds_names to cover both scenarios. > > > + > > + t = btf_type_by_id(btf, k); > > + if (!btf__str_by_offset(btf, t->name_off)) > > + return; > > + > > + if (btf_compare_type_kinds_names(&i, &k, btf) > 0) > > + return; > > + > > + if (t->name_off) > > + nr_sorted_types++; > > + } > > + > > + t = btf_type_by_id(btf, start_id); > > + if (t->name_off) > > + nr_sorted_types++; > > + if (nr_sorted_types >= BTF_CHECK_SORT_THRESHOLD) > > + btf__set_nr_sorted_types(btf, nr_sorted_types); > > +} > > + > > [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-22 3:02 ` Donglin Peng @ 2025-10-22 20:50 ` Eduard Zingerman 2025-10-23 10:35 ` Donglin Peng 0 siblings, 1 reply; 32+ messages in thread From: Eduard Zingerman @ 2025-10-22 20:50 UTC (permalink / raw) To: Donglin Peng Cc: ast, linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Wed, 2025-10-22 at 11:02 +0800, Donglin Peng wrote: > On Wed, Oct 22, 2025 at 2:59 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > > This patch implements sorting of BTF types by their kind and name, > > > enabling the use of binary search for type lookups. > > > > > > To share logic between kernel and libbpf, a new btf_sort.c file is > > > introduced containing common sorting functionality. > > > > > > The sorting is performed during btf__dedup() when the new > > > sort_by_kind_name option in btf_dedup_opts is enabled. > > > > Do we really need this option? Dedup is free to rearrange btf types > > anyway, so why not sort always? Is execution time a concern? > > The issue is that sorting changes the layout of BTF. Many existing selftests > rely on the current, non-sorted order for their validation checks. Introducing > this as an optional feature first allows us to run it without immediately > breaking the tests, giving us time to fix them incrementally. How many tests are we talking about? The option is an API and it stays with us forever. If the only justification for its existence is to avoid tests modification, I don't think that's enough. > > > > > For vmlinux and kernel module BTF, btf_check_sorted() verifies > > > whether the types are sorted and binary search can be used. > > > > [...] > > > > > diff --git a/kernel/bpf/btf.c b/kernel/bpf/btf.c > > > index c414cf37e1bd..11b05f4eb07d 100644 > > > --- a/kernel/bpf/btf.c > > > +++ b/kernel/bpf/btf.c [...] > > > +s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > > > +{ > > > + return find_btf_by_name_kind(btf, 1, name, kind); > > ^^^ > > nit: this will make it impossible to find "void" w/o a special case > > in the find_btf_by_name_kind(), why not start from 0? > > Thanks. I referred to btf__find_by_name_kind in libbpf. In > btf_find_by_name_kind, > there is a special check for "void". Consequently, I've added a > similar special check > for "void" in find_btf_by_name_kind as well. Yes, I see the special case in the find_btf_by_name_kind. But wouldn't starting from 0 here avoid the need for special case? [...] > > > diff --git a/tools/lib/bpf/btf_sort.c b/tools/lib/bpf/btf_sort.c > > > new file mode 100644 > > > index 000000000000..2ad4a56f1c08 > > > --- /dev/null > > > +++ b/tools/lib/bpf/btf_sort.c > > > > [...] > > > > > +/* > > > + * Sort BTF types by kind and name in ascending order, placing named types > > > + * before anonymous ones. > > > + */ > > > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) > > > +{ > > > + struct btf *btf = (struct btf *)priv; > > > + struct btf_type *ta = btf_type_by_id(btf, *(__u32 *)a); > > > + struct btf_type *tb = btf_type_by_id(btf, *(__u32 *)b); > > > + const char *na, *nb; > > > + int ka, kb; > > > + > > > + /* ta w/o name is greater than tb */ > > > + if (!ta->name_off && tb->name_off) > > > + return 1; > > > + /* tb w/o name is smaller than ta */ > > > + if (ta->name_off && !tb->name_off) > > > + return -1; > > > + > > > + ka = btf_kind(ta); > > > + kb = btf_kind(tb); > > > + na = btf__str_by_offset(btf, ta->name_off); > > > + nb = btf__str_by_offset(btf, tb->name_off); > > > + > > > + return cmp_btf_kind_name(ka, na, kb, nb); > > > > If both types are anonymous and have the same kind, this will lead to > > strcmp(NULL, NULL). On kernel side that would lead to null pointer > > dereference. > > Thanks, I've confirmed that for anonymous types, name_off is 0, > so btf__str_by_offset returns a pointer to btf->strs_data (which > contains a '\0' at index 0) rather than NULL. However, when name_off > is invalid, btf__str_by_offset does return NULL. Using str_is_empty > will correctly handle both scenarios. Unnamed types of the same kind > shall be considered equal. I will fix it in the next version. I see, thank you for explaining. Checking the usage of kernel/bpf/btf.c:btf_name_valid_identifier(), it looks like kernel validates name_off for all types. So, your implementation should be fine. [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-22 20:50 ` Eduard Zingerman @ 2025-10-23 10:35 ` Donglin Peng 2025-10-23 15:52 ` Alexei Starovoitov 0 siblings, 1 reply; 32+ messages in thread From: Donglin Peng @ 2025-10-23 10:35 UTC (permalink / raw) To: Eduard Zingerman, ast, Andrii Nakryiko, Alan Maguire Cc: linux-kernel, bpf, Song Liu, pengdonglin On Thu, Oct 23, 2025 at 4:50 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > On Wed, 2025-10-22 at 11:02 +0800, Donglin Peng wrote: > > On Wed, Oct 22, 2025 at 2:59 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > > > This patch implements sorting of BTF types by their kind and name, > > > > enabling the use of binary search for type lookups. > > > > > > > > To share logic between kernel and libbpf, a new btf_sort.c file is > > > > introduced containing common sorting functionality. > > > > > > > > The sorting is performed during btf__dedup() when the new > > > > sort_by_kind_name option in btf_dedup_opts is enabled. > > > > > > Do we really need this option? Dedup is free to rearrange btf types > > > anyway, so why not sort always? Is execution time a concern? > > > > The issue is that sorting changes the layout of BTF. Many existing selftests > > rely on the current, non-sorted order for their validation checks. Introducing > > this as an optional feature first allows us to run it without immediately > > breaking the tests, giving us time to fix them incrementally. > > How many tests are we talking about? > The option is an API and it stays with us forever. > If the only justification for its existence is to avoid tests > modification, I don't think that's enough. I get your point, thanks. I wonder what others think? > > > > > > > > For vmlinux and kernel module BTF, btf_check_sorted() verifies > > > > whether the types are sorted and binary search can be used. > > > > > > [...] > > > > > > > diff --git a/kernel/bpf/btf.c b/kernel/bpf/btf.c > > > > index c414cf37e1bd..11b05f4eb07d 100644 > > > > --- a/kernel/bpf/btf.c > > > > +++ b/kernel/bpf/btf.c > > [...] > > > > > +s32 btf_find_by_name_kind(const struct btf *btf, const char *name, u8 kind) > > > > +{ > > > > + return find_btf_by_name_kind(btf, 1, name, kind); > > > ^^^ > > > nit: this will make it impossible to find "void" w/o a special case > > > in the find_btf_by_name_kind(), why not start from 0? > > > > Thanks. I referred to btf__find_by_name_kind in libbpf. In > > btf_find_by_name_kind, > > there is a special check for "void". Consequently, I've added a > > similar special check > > for "void" in find_btf_by_name_kind as well. > > Yes, I see the special case in the find_btf_by_name_kind. > But wouldn't starting from 0 here avoid the need for special case? The start_id parameter here serves the same purpose as the one in libbpf's btf_find_by_name_kind. However, its implementation in find_btf_by_name_kind was incorrect. I will fix this in the next version. __s32 btf__find_by_name_kind_own(const struct btf *btf, const char *type_name, __u32 kind) { return btf_find_by_name_kind(btf, btf->start_id, type_name, kind); } __s32 btf__find_by_name_kind(const struct btf *btf, const char *type_name, __u32 kind) { return btf_find_by_name_kind(btf, 1, type_name, kind); } static __s32 btf_find_by_name_kind(const struct btf *btf, int start_id, const char *type_name, __u32 kind) { __u32 i, nr_types = btf__type_cnt(btf); if (kind == BTF_KIND_UNKN || !strcmp(type_name, "void")) return 0; for (i = start_id; i < nr_types; i++) { const struct btf_type *t = btf__type_by_id(btf, i); const char *name; if (btf_kind(t) != kind) continue; name = btf__name_by_offset(btf, t->name_off); if (name && !strcmp(type_name, name)) return i; } return libbpf_err(-ENOENT); } > > [...] > > > > > diff --git a/tools/lib/bpf/btf_sort.c b/tools/lib/bpf/btf_sort.c > > > > new file mode 100644 > > > > index 000000000000..2ad4a56f1c08 > > > > --- /dev/null > > > > +++ b/tools/lib/bpf/btf_sort.c > > > > > > [...] > > > > > > > +/* > > > > + * Sort BTF types by kind and name in ascending order, placing named types > > > > + * before anonymous ones. > > > > + */ > > > > +int btf_compare_type_kinds_names(const void *a, const void *b, void *priv) > > > > +{ > > > > + struct btf *btf = (struct btf *)priv; > > > > + struct btf_type *ta = btf_type_by_id(btf, *(__u32 *)a); > > > > + struct btf_type *tb = btf_type_by_id(btf, *(__u32 *)b); > > > > + const char *na, *nb; > > > > + int ka, kb; > > > > + > > > > + /* ta w/o name is greater than tb */ > > > > + if (!ta->name_off && tb->name_off) > > > > + return 1; > > > > + /* tb w/o name is smaller than ta */ > > > > + if (ta->name_off && !tb->name_off) > > > > + return -1; > > > > + > > > > + ka = btf_kind(ta); > > > > + kb = btf_kind(tb); > > > > + na = btf__str_by_offset(btf, ta->name_off); > > > > + nb = btf__str_by_offset(btf, tb->name_off); > > > > + > > > > + return cmp_btf_kind_name(ka, na, kb, nb); > > > > > > If both types are anonymous and have the same kind, this will lead to > > > strcmp(NULL, NULL). On kernel side that would lead to null pointer > > > dereference. > > > > Thanks, I've confirmed that for anonymous types, name_off is 0, > > so btf__str_by_offset returns a pointer to btf->strs_data (which > > contains a '\0' at index 0) rather than NULL. However, when name_off > > is invalid, btf__str_by_offset does return NULL. Using str_is_empty > > will correctly handle both scenarios. Unnamed types of the same kind > > shall be considered equal. I will fix it in the next version. > > I see, thank you for explaining. > Checking the usage of kernel/bpf/btf.c:btf_name_valid_identifier(), > it looks like kernel validates name_off for all types. > So, your implementation should be fine. > > [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-23 10:35 ` Donglin Peng @ 2025-10-23 15:52 ` Alexei Starovoitov 2025-10-23 16:28 ` Andrii Nakryiko 0 siblings, 1 reply; 32+ messages in thread From: Alexei Starovoitov @ 2025-10-23 15:52 UTC (permalink / raw) To: Donglin Peng Cc: Eduard Zingerman, Alexei Starovoitov, Andrii Nakryiko, Alan Maguire, LKML, bpf, Song Liu, pengdonglin On Thu, Oct 23, 2025 at 3:35 AM Donglin Peng <dolinux.peng@gmail.com> wrote: > > On Thu, Oct 23, 2025 at 4:50 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > On Wed, 2025-10-22 at 11:02 +0800, Donglin Peng wrote: > > > On Wed, Oct 22, 2025 at 2:59 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > > > > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > > > > This patch implements sorting of BTF types by their kind and name, > > > > > enabling the use of binary search for type lookups. > > > > > > > > > > To share logic between kernel and libbpf, a new btf_sort.c file is > > > > > introduced containing common sorting functionality. > > > > > > > > > > The sorting is performed during btf__dedup() when the new > > > > > sort_by_kind_name option in btf_dedup_opts is enabled. > > > > > > > > Do we really need this option? Dedup is free to rearrange btf types > > > > anyway, so why not sort always? Is execution time a concern? > > > > > > The issue is that sorting changes the layout of BTF. Many existing selftests > > > rely on the current, non-sorted order for their validation checks. Introducing > > > this as an optional feature first allows us to run it without immediately > > > breaking the tests, giving us time to fix them incrementally. > > > > How many tests are we talking about? > > The option is an API and it stays with us forever. > > If the only justification for its existence is to avoid tests > > modification, I don't think that's enough. > > I get your point, thanks. I wonder what others think? +1 to Eduard's point. No new flags please. Always sort. Also I don't feel that sorting logic belongs in libbpf. pahole needs to dedup first, apply extra filters, add extra BTFs. At that point going back to libbpf and asking to sort seems odd. ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-23 15:52 ` Alexei Starovoitov @ 2025-10-23 16:28 ` Andrii Nakryiko 2025-10-23 18:37 ` Alexei Starovoitov 0 siblings, 1 reply; 32+ messages in thread From: Andrii Nakryiko @ 2025-10-23 16:28 UTC (permalink / raw) To: Alexei Starovoitov Cc: Donglin Peng, Eduard Zingerman, Alexei Starovoitov, Alan Maguire, LKML, bpf, Song Liu, pengdonglin On Thu, Oct 23, 2025 at 8:53 AM Alexei Starovoitov <alexei.starovoitov@gmail.com> wrote: > > On Thu, Oct 23, 2025 at 3:35 AM Donglin Peng <dolinux.peng@gmail.com> wrote: > > > > On Thu, Oct 23, 2025 at 4:50 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > > > On Wed, 2025-10-22 at 11:02 +0800, Donglin Peng wrote: > > > > On Wed, Oct 22, 2025 at 2:59 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > > > > > > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > > > > > This patch implements sorting of BTF types by their kind and name, > > > > > > enabling the use of binary search for type lookups. > > > > > > > > > > > > To share logic between kernel and libbpf, a new btf_sort.c file is > > > > > > introduced containing common sorting functionality. > > > > > > > > > > > > The sorting is performed during btf__dedup() when the new > > > > > > sort_by_kind_name option in btf_dedup_opts is enabled. > > > > > > > > > > Do we really need this option? Dedup is free to rearrange btf types > > > > > anyway, so why not sort always? Is execution time a concern? > > > > > > > > The issue is that sorting changes the layout of BTF. Many existing selftests > > > > rely on the current, non-sorted order for their validation checks. Introducing > > > > this as an optional feature first allows us to run it without immediately > > > > breaking the tests, giving us time to fix them incrementally. > > > > > > How many tests are we talking about? > > > The option is an API and it stays with us forever. > > > If the only justification for its existence is to avoid tests > > > modification, I don't think that's enough. > > > > I get your point, thanks. I wonder what others think? > > +1 to Eduard's point. > No new flags please. Always sort. > > Also I don't feel that sorting logic belongs in libbpf. > pahole needs to dedup first, apply extra filters, add extra BTFs. > At that point going back to libbpf and asking to sort seems odd. Correct, I'd also not bake sorting into libbpf. Sorting order should be determined by pahole (or whatever other producer of BTF) after all the information is collected. So btf_dedup shouldn't sort. But I think we should add a new API to libbpf to help with sorting. I'd add this: int btf__permute(struct btf *btf, int permutation, int permutation_sz); With the idea that libbpf will renumber and reshuffle BTF data according to permutation provided by the caller. Caller should make sure that permutation is a valid one, of course. (we can also extend this to allow specifying that some types should be dropped by mapping them to zero, I think I wanted something like that for BTF linker at some point, don't remember details anymore; but that's definitely not in v1) This is useful for sorting use case (pahole build an index from old btf type ID to new btf type ID -> libbpf shuffles bytes and renumbers everything), but can be applied more generally (I don't remember now, but I did have this idea earlier in response to some need for BTF reshuffling). Speaking of flags, though. I think adding BTF_F_SORTED flag to btf_header->flags seems useful, as that would allow libbpf (and user space apps working with BTF in general) to use more optimal find_by_name implementation. The only gotcha is that old kernels enforce this btf_header->flags to be zero, so pahole would need to know not to emit this when building BTF for old kernels (or, rather, we'll just teach pahole_flags in kernel build scripts to add this going forward). This is not very important for kernel, because kernel has to validate all this anyways, but would allow saving time for user space. ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-23 16:28 ` Andrii Nakryiko @ 2025-10-23 18:37 ` Alexei Starovoitov 2025-10-23 19:39 ` Andrii Nakryiko 0 siblings, 1 reply; 32+ messages in thread From: Alexei Starovoitov @ 2025-10-23 18:37 UTC (permalink / raw) To: Andrii Nakryiko Cc: Donglin Peng, Eduard Zingerman, Alexei Starovoitov, Alan Maguire, LKML, bpf, Song Liu, pengdonglin On Thu, Oct 23, 2025 at 9:28 AM Andrii Nakryiko <andrii.nakryiko@gmail.com> wrote: > > > Speaking of flags, though. I think adding BTF_F_SORTED flag to > btf_header->flags seems useful, as that would allow libbpf (and user > space apps working with BTF in general) to use more optimal > find_by_name implementation. The only gotcha is that old kernels > enforce this btf_header->flags to be zero, so pahole would need to > know not to emit this when building BTF for old kernels (or, rather, > we'll just teach pahole_flags in kernel build scripts to add this > going forward). This is not very important for kernel, because kernel > has to validate all this anyways, but would allow saving time for user > space. Thinking more about it... I don't think it's worth it. It's an operational headache. I'd rather have newer pahole sort it without on/off flags and detection, so that people can upgrade pahole and build older kernels. Also BTF_F_SORTED doesn't spell out the way it's sorted. Things may change and we will need a new flag and so on. I think it's easier to check in the kernel and libbpf whether BTF is sorted the way they want it. The check is simple, fast and done once. Then both (kernel and libbpf) can set an internal flag and use different functions to search within a given BTF. ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-23 18:37 ` Alexei Starovoitov @ 2025-10-23 19:39 ` Andrii Nakryiko 2025-10-24 1:59 ` Donglin Peng 0 siblings, 1 reply; 32+ messages in thread From: Andrii Nakryiko @ 2025-10-23 19:39 UTC (permalink / raw) To: Alexei Starovoitov Cc: Donglin Peng, Eduard Zingerman, Alexei Starovoitov, Alan Maguire, LKML, bpf, Song Liu, pengdonglin On Thu, Oct 23, 2025 at 11:37 AM Alexei Starovoitov <alexei.starovoitov@gmail.com> wrote: > > On Thu, Oct 23, 2025 at 9:28 AM Andrii Nakryiko > <andrii.nakryiko@gmail.com> wrote: > > > > > > Speaking of flags, though. I think adding BTF_F_SORTED flag to > > btf_header->flags seems useful, as that would allow libbpf (and user > > space apps working with BTF in general) to use more optimal > > find_by_name implementation. The only gotcha is that old kernels > > enforce this btf_header->flags to be zero, so pahole would need to > > know not to emit this when building BTF for old kernels (or, rather, > > we'll just teach pahole_flags in kernel build scripts to add this > > going forward). This is not very important for kernel, because kernel > > has to validate all this anyways, but would allow saving time for user > > space. > > Thinking more about it... I don't think it's worth it. > It's an operational headache. I'd rather have newer pahole sort it > without on/off flags and detection, so that people can upgrade > pahole and build older kernels. > Also BTF_F_SORTED doesn't spell out the way it's sorted. > Things may change and we will need a new flag and so on. > I think it's easier to check in the kernel and libbpf whether > BTF is sorted the way they want it. > The check is simple, fast and done once. Then both (kernel and libbpf) can > set an internal flag and use different functions to search > within a given BTF. I guess that's fine. libbpf can do this check lazily on the first btf__find_by_name() to avoid unnecessary overhead. Agreed. ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-23 19:39 ` Andrii Nakryiko @ 2025-10-24 1:59 ` Donglin Peng 2025-10-24 2:23 ` Donglin Peng 0 siblings, 1 reply; 32+ messages in thread From: Donglin Peng @ 2025-10-24 1:59 UTC (permalink / raw) To: Andrii Nakryiko, Alexei Starovoitov, Eduard Zingerman, Alan Maguire Cc: Alexei Starovoitov, LKML, bpf, Song Liu, pengdonglin On Fri, Oct 24, 2025 at 3:40 AM Andrii Nakryiko <andrii.nakryiko@gmail.com> wrote: > > On Thu, Oct 23, 2025 at 11:37 AM Alexei Starovoitov > <alexei.starovoitov@gmail.com> wrote: > > > > On Thu, Oct 23, 2025 at 9:28 AM Andrii Nakryiko > > <andrii.nakryiko@gmail.com> wrote: > > > > > > > > > Speaking of flags, though. I think adding BTF_F_SORTED flag to > > > btf_header->flags seems useful, as that would allow libbpf (and user > > > space apps working with BTF in general) to use more optimal > > > find_by_name implementation. The only gotcha is that old kernels > > > enforce this btf_header->flags to be zero, so pahole would need to > > > know not to emit this when building BTF for old kernels (or, rather, > > > we'll just teach pahole_flags in kernel build scripts to add this > > > going forward). This is not very important for kernel, because kernel > > > has to validate all this anyways, but would allow saving time for user > > > space. > > > > Thinking more about it... I don't think it's worth it. > > It's an operational headache. I'd rather have newer pahole sort it > > without on/off flags and detection, so that people can upgrade > > pahole and build older kernels. > > Also BTF_F_SORTED doesn't spell out the way it's sorted. > > Things may change and we will need a new flag and so on. > > I think it's easier to check in the kernel and libbpf whether > > BTF is sorted the way they want it. > > The check is simple, fast and done once. Then both (kernel and libbpf) can > > set an internal flag and use different functions to search > > within a given BTF. > > I guess that's fine. libbpf can do this check lazily on the first > btf__find_by_name() to avoid unnecessary overhead. Agreed. Thank you for all the feedback. Based on the suggestions above, the sorting implementation will be redesigned in the next version as follows: 1. The sorting operation will be fully handled by pahole, with no dependency on libbpf. This means users can benefit from sorting simply by upgrading their pahole version. 2. The kernel and libbpf will only be responsible for: 2.1. Checking whether the BTF data is sorted 2.2. Implementing binary search for sorted BTF Regarding the sorting check overhead: if the runtime cost is sufficiently small, it can be performed during BTF parsing. Based on my local testing with vmlinux BTF (containing 143,484 btf_types), this check takes at most 1.5 milliseconds during boot. Is this 1.5ms overhead acceptable? Are there any other suggestions? ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-24 1:59 ` Donglin Peng @ 2025-10-24 2:23 ` Donglin Peng 2025-10-24 2:32 ` Eduard Zingerman 0 siblings, 1 reply; 32+ messages in thread From: Donglin Peng @ 2025-10-24 2:23 UTC (permalink / raw) To: Andrii Nakryiko, Alexei Starovoitov, Eduard Zingerman, Alan Maguire Cc: Alexei Starovoitov, LKML, bpf, Song Liu, pengdonglin On Fri, Oct 24, 2025 at 9:59 AM Donglin Peng <dolinux.peng@gmail.com> wrote: > > On Fri, Oct 24, 2025 at 3:40 AM Andrii Nakryiko > <andrii.nakryiko@gmail.com> wrote: > > > > On Thu, Oct 23, 2025 at 11:37 AM Alexei Starovoitov > > <alexei.starovoitov@gmail.com> wrote: > > > > > > On Thu, Oct 23, 2025 at 9:28 AM Andrii Nakryiko > > > <andrii.nakryiko@gmail.com> wrote: > > > > > > > > > > > > Speaking of flags, though. I think adding BTF_F_SORTED flag to > > > > btf_header->flags seems useful, as that would allow libbpf (and user > > > > space apps working with BTF in general) to use more optimal > > > > find_by_name implementation. The only gotcha is that old kernels > > > > enforce this btf_header->flags to be zero, so pahole would need to > > > > know not to emit this when building BTF for old kernels (or, rather, > > > > we'll just teach pahole_flags in kernel build scripts to add this > > > > going forward). This is not very important for kernel, because kernel > > > > has to validate all this anyways, but would allow saving time for user > > > > space. > > > > > > Thinking more about it... I don't think it's worth it. > > > It's an operational headache. I'd rather have newer pahole sort it > > > without on/off flags and detection, so that people can upgrade > > > pahole and build older kernels. > > > Also BTF_F_SORTED doesn't spell out the way it's sorted. > > > Things may change and we will need a new flag and so on. > > > I think it's easier to check in the kernel and libbpf whether > > > BTF is sorted the way they want it. > > > The check is simple, fast and done once. Then both (kernel and libbpf) can > > > set an internal flag and use different functions to search > > > within a given BTF. > > > > I guess that's fine. libbpf can do this check lazily on the first > > btf__find_by_name() to avoid unnecessary overhead. Agreed. > > Thank you for all the feedback. Based on the suggestions above, the sorting > implementation will be redesigned in the next version as follows: > > 1. The sorting operation will be fully handled by pahole, with no dependency on > libbpf. This means users can benefit from sorting simply by upgrading their > pahole version. I suggest that libbpf provides a sorting function, such as the btf__permute suggested by Andrii, for pahole to call. This approach allows pahole to leverage libbpf's existing helper functions and avoids code duplication. > > 2. The kernel and libbpf will only be responsible for: > 2.1. Checking whether the BTF data is sorted > 2.2. Implementing binary search for sorted BTF > > Regarding the sorting check overhead: if the runtime cost is sufficiently small, > it can be performed during BTF parsing. Based on my local testing with vmlinux > BTF (containing 143,484 btf_types), this check takes at most 1.5 milliseconds > during boot. Is this 1.5ms overhead acceptable? > > Are there any other suggestions? ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-24 2:23 ` Donglin Peng @ 2025-10-24 2:32 ` Eduard Zingerman 2025-10-24 3:04 ` Donglin Peng 0 siblings, 1 reply; 32+ messages in thread From: Eduard Zingerman @ 2025-10-24 2:32 UTC (permalink / raw) To: Donglin Peng, Andrii Nakryiko, Alexei Starovoitov, Alan Maguire Cc: Alexei Starovoitov, LKML, bpf, Song Liu, pengdonglin On Fri, 2025-10-24 at 10:23 +0800, Donglin Peng wrote: > On Fri, Oct 24, 2025 at 9:59 AM Donglin Peng <dolinux.peng@gmail.com> wrote: > > > > On Fri, Oct 24, 2025 at 3:40 AM Andrii Nakryiko > > <andrii.nakryiko@gmail.com> wrote: > > > > > > On Thu, Oct 23, 2025 at 11:37 AM Alexei Starovoitov > > > <alexei.starovoitov@gmail.com> wrote: > > > > > > > > On Thu, Oct 23, 2025 at 9:28 AM Andrii Nakryiko > > > > <andrii.nakryiko@gmail.com> wrote: > > > > > > > > > > > > > > > Speaking of flags, though. I think adding BTF_F_SORTED flag to > > > > > btf_header->flags seems useful, as that would allow libbpf (and user > > > > > space apps working with BTF in general) to use more optimal > > > > > find_by_name implementation. The only gotcha is that old kernels > > > > > enforce this btf_header->flags to be zero, so pahole would need to > > > > > know not to emit this when building BTF for old kernels (or, rather, > > > > > we'll just teach pahole_flags in kernel build scripts to add this > > > > > going forward). This is not very important for kernel, because kernel > > > > > has to validate all this anyways, but would allow saving time for user > > > > > space. > > > > > > > > Thinking more about it... I don't think it's worth it. > > > > It's an operational headache. I'd rather have newer pahole sort it > > > > without on/off flags and detection, so that people can upgrade > > > > pahole and build older kernels. > > > > Also BTF_F_SORTED doesn't spell out the way it's sorted. > > > > Things may change and we will need a new flag and so on. > > > > I think it's easier to check in the kernel and libbpf whether > > > > BTF is sorted the way they want it. > > > > The check is simple, fast and done once. Then both (kernel and libbpf) can > > > > set an internal flag and use different functions to search > > > > within a given BTF. > > > > > > I guess that's fine. libbpf can do this check lazily on the first > > > btf__find_by_name() to avoid unnecessary overhead. Agreed. > > > > Thank you for all the feedback. Based on the suggestions above, the sorting > > implementation will be redesigned in the next version as follows: > > > > 1. The sorting operation will be fully handled by pahole, with no dependency on > > libbpf. This means users can benefit from sorting simply by upgrading their > > pahole version. > > I suggest that libbpf provides a sorting function, such as the > btf__permute suggested > by Andrii, for pahole to call. This approach allows pahole to leverage > libbpf's existing > helper functions and avoids code duplication. Could you please enumerate the functions you'd have to reimplement in pahole? > > > > 2. The kernel and libbpf will only be responsible for: > > 2.1. Checking whether the BTF data is sorted > > 2.2. Implementing binary search for sorted BTF > > > > Regarding the sorting check overhead: if the runtime cost is sufficiently small, > > it can be performed during BTF parsing. Based on my local testing with vmlinux > > BTF (containing 143,484 btf_types), this check takes at most 1.5 milliseconds > > during boot. Is this 1.5ms overhead acceptable? > > > > Are there any other suggestions? ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-24 2:32 ` Eduard Zingerman @ 2025-10-24 3:04 ` Donglin Peng 2025-10-24 3:15 ` Eduard Zingerman 0 siblings, 1 reply; 32+ messages in thread From: Donglin Peng @ 2025-10-24 3:04 UTC (permalink / raw) To: Eduard Zingerman Cc: Andrii Nakryiko, Alexei Starovoitov, Alan Maguire, Alexei Starovoitov, LKML, bpf, Song Liu, pengdonglin On Fri, Oct 24, 2025 at 10:32 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > On Fri, 2025-10-24 at 10:23 +0800, Donglin Peng wrote: > > On Fri, Oct 24, 2025 at 9:59 AM Donglin Peng <dolinux.peng@gmail.com> wrote: > > > > > > On Fri, Oct 24, 2025 at 3:40 AM Andrii Nakryiko > > > <andrii.nakryiko@gmail.com> wrote: > > > > > > > > On Thu, Oct 23, 2025 at 11:37 AM Alexei Starovoitov > > > > <alexei.starovoitov@gmail.com> wrote: > > > > > > > > > > On Thu, Oct 23, 2025 at 9:28 AM Andrii Nakryiko > > > > > <andrii.nakryiko@gmail.com> wrote: > > > > > > > > > > > > > > > > > > Speaking of flags, though. I think adding BTF_F_SORTED flag to > > > > > > btf_header->flags seems useful, as that would allow libbpf (and user > > > > > > space apps working with BTF in general) to use more optimal > > > > > > find_by_name implementation. The only gotcha is that old kernels > > > > > > enforce this btf_header->flags to be zero, so pahole would need to > > > > > > know not to emit this when building BTF for old kernels (or, rather, > > > > > > we'll just teach pahole_flags in kernel build scripts to add this > > > > > > going forward). This is not very important for kernel, because kernel > > > > > > has to validate all this anyways, but would allow saving time for user > > > > > > space. > > > > > > > > > > Thinking more about it... I don't think it's worth it. > > > > > It's an operational headache. I'd rather have newer pahole sort it > > > > > without on/off flags and detection, so that people can upgrade > > > > > pahole and build older kernels. > > > > > Also BTF_F_SORTED doesn't spell out the way it's sorted. > > > > > Things may change and we will need a new flag and so on. > > > > > I think it's easier to check in the kernel and libbpf whether > > > > > BTF is sorted the way they want it. > > > > > The check is simple, fast and done once. Then both (kernel and libbpf) can > > > > > set an internal flag and use different functions to search > > > > > within a given BTF. > > > > > > > > I guess that's fine. libbpf can do this check lazily on the first > > > > btf__find_by_name() to avoid unnecessary overhead. Agreed. > > > > > > Thank you for all the feedback. Based on the suggestions above, the sorting > > > implementation will be redesigned in the next version as follows: > > > > > > 1. The sorting operation will be fully handled by pahole, with no dependency on > > > libbpf. This means users can benefit from sorting simply by upgrading their > > > pahole version. > > > > I suggest that libbpf provides a sorting function, such as the > > btf__permute suggested > > by Andrii, for pahole to call. This approach allows pahole to leverage > > libbpf's existing > > helper functions and avoids code duplication. > > Could you please enumerate the functions you'd have to reimplement in > pahole? Yes. Once the BTF types are sorted, the type IDs in both the BTF and BTF ext sections must be remapped. Libbpf provides helper functions like btf_field_iter_init, btf_field_iter_next, btf_ext_visit_type_ids to iterate through the btf_field and btf_ext_info_sec entries that require updating. We will likely need to reimplement these three functions for this purpose. > > > > > > > 2. The kernel and libbpf will only be responsible for: > > > 2.1. Checking whether the BTF data is sorted > > > 2.2. Implementing binary search for sorted BTF > > > > > > Regarding the sorting check overhead: if the runtime cost is sufficiently small, > > > it can be performed during BTF parsing. Based on my local testing with vmlinux > > > BTF (containing 143,484 btf_types), this check takes at most 1.5 milliseconds > > > during boot. Is this 1.5ms overhead acceptable? > > > > > > Are there any other suggestions? ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-24 3:04 ` Donglin Peng @ 2025-10-24 3:15 ` Eduard Zingerman 2025-10-24 3:19 ` Donglin Peng 0 siblings, 1 reply; 32+ messages in thread From: Eduard Zingerman @ 2025-10-24 3:15 UTC (permalink / raw) To: Donglin Peng Cc: Andrii Nakryiko, Alexei Starovoitov, Alan Maguire, Alexei Starovoitov, LKML, bpf, Song Liu, pengdonglin On Fri, 2025-10-24 at 11:04 +0800, Donglin Peng wrote: > On Fri, Oct 24, 2025 at 10:32 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > On Fri, 2025-10-24 at 10:23 +0800, Donglin Peng wrote: > > > On Fri, Oct 24, 2025 at 9:59 AM Donglin Peng <dolinux.peng@gmail.com> wrote: > > > > > > > > On Fri, Oct 24, 2025 at 3:40 AM Andrii Nakryiko > > > > <andrii.nakryiko@gmail.com> wrote: > > > > > > > > > > On Thu, Oct 23, 2025 at 11:37 AM Alexei Starovoitov > > > > > <alexei.starovoitov@gmail.com> wrote: > > > > > > > > > > > > On Thu, Oct 23, 2025 at 9:28 AM Andrii Nakryiko > > > > > > <andrii.nakryiko@gmail.com> wrote: > > > > > > > > > > > > > > > > > > > > > Speaking of flags, though. I think adding BTF_F_SORTED flag to > > > > > > > btf_header->flags seems useful, as that would allow libbpf (and user > > > > > > > space apps working with BTF in general) to use more optimal > > > > > > > find_by_name implementation. The only gotcha is that old kernels > > > > > > > enforce this btf_header->flags to be zero, so pahole would need to > > > > > > > know not to emit this when building BTF for old kernels (or, rather, > > > > > > > we'll just teach pahole_flags in kernel build scripts to add this > > > > > > > going forward). This is not very important for kernel, because kernel > > > > > > > has to validate all this anyways, but would allow saving time for user > > > > > > > space. > > > > > > > > > > > > Thinking more about it... I don't think it's worth it. > > > > > > It's an operational headache. I'd rather have newer pahole sort it > > > > > > without on/off flags and detection, so that people can upgrade > > > > > > pahole and build older kernels. > > > > > > Also BTF_F_SORTED doesn't spell out the way it's sorted. > > > > > > Things may change and we will need a new flag and so on. > > > > > > I think it's easier to check in the kernel and libbpf whether > > > > > > BTF is sorted the way they want it. > > > > > > The check is simple, fast and done once. Then both (kernel and libbpf) can > > > > > > set an internal flag and use different functions to search > > > > > > within a given BTF. > > > > > > > > > > I guess that's fine. libbpf can do this check lazily on the first > > > > > btf__find_by_name() to avoid unnecessary overhead. Agreed. > > > > > > > > Thank you for all the feedback. Based on the suggestions above, the sorting > > > > implementation will be redesigned in the next version as follows: > > > > > > > > 1. The sorting operation will be fully handled by pahole, with no dependency on > > > > libbpf. This means users can benefit from sorting simply by upgrading their > > > > pahole version. > > > > > > I suggest that libbpf provides a sorting function, such as the > > > btf__permute suggested > > > by Andrii, for pahole to call. This approach allows pahole to leverage > > > libbpf's existing > > > helper functions and avoids code duplication. > > > > Could you please enumerate the functions you'd have to reimplement in > > pahole? > > Yes. Once the BTF types are sorted, the type IDs in both the BTF and BTF ext > sections must be remapped. Libbpf provides helper functions like > btf_field_iter_init, > btf_field_iter_next, > btf_ext_visit_type_ids > to iterate through the btf_field and btf_ext_info_sec entries that > require updating. > We will likely need to reimplement these three functions for this purpose. I think Andrii's suggestion is to have btf__permute in libbpf, as it needs all the functions you mention. But actual sorting can happen in pahole, then: - allocate array of length num-types, initialize it 0..num-types; - reorder it as one sees fit; - call btf__permute() from libbpf and get all the renamings handled by it. [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search 2025-10-24 3:15 ` Eduard Zingerman @ 2025-10-24 3:19 ` Donglin Peng 0 siblings, 0 replies; 32+ messages in thread From: Donglin Peng @ 2025-10-24 3:19 UTC (permalink / raw) To: Eduard Zingerman Cc: Andrii Nakryiko, Alexei Starovoitov, Alan Maguire, Alexei Starovoitov, LKML, bpf, Song Liu, pengdonglin On Fri, Oct 24, 2025 at 11:15 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > On Fri, 2025-10-24 at 11:04 +0800, Donglin Peng wrote: > > On Fri, Oct 24, 2025 at 10:32 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > > > > > On Fri, 2025-10-24 at 10:23 +0800, Donglin Peng wrote: > > > > On Fri, Oct 24, 2025 at 9:59 AM Donglin Peng <dolinux.peng@gmail.com> wrote: > > > > > > > > > > On Fri, Oct 24, 2025 at 3:40 AM Andrii Nakryiko > > > > > <andrii.nakryiko@gmail.com> wrote: > > > > > > > > > > > > On Thu, Oct 23, 2025 at 11:37 AM Alexei Starovoitov > > > > > > <alexei.starovoitov@gmail.com> wrote: > > > > > > > > > > > > > > On Thu, Oct 23, 2025 at 9:28 AM Andrii Nakryiko > > > > > > > <andrii.nakryiko@gmail.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > > Speaking of flags, though. I think adding BTF_F_SORTED flag to > > > > > > > > btf_header->flags seems useful, as that would allow libbpf (and user > > > > > > > > space apps working with BTF in general) to use more optimal > > > > > > > > find_by_name implementation. The only gotcha is that old kernels > > > > > > > > enforce this btf_header->flags to be zero, so pahole would need to > > > > > > > > know not to emit this when building BTF for old kernels (or, rather, > > > > > > > > we'll just teach pahole_flags in kernel build scripts to add this > > > > > > > > going forward). This is not very important for kernel, because kernel > > > > > > > > has to validate all this anyways, but would allow saving time for user > > > > > > > > space. > > > > > > > > > > > > > > Thinking more about it... I don't think it's worth it. > > > > > > > It's an operational headache. I'd rather have newer pahole sort it > > > > > > > without on/off flags and detection, so that people can upgrade > > > > > > > pahole and build older kernels. > > > > > > > Also BTF_F_SORTED doesn't spell out the way it's sorted. > > > > > > > Things may change and we will need a new flag and so on. > > > > > > > I think it's easier to check in the kernel and libbpf whether > > > > > > > BTF is sorted the way they want it. > > > > > > > The check is simple, fast and done once. Then both (kernel and libbpf) can > > > > > > > set an internal flag and use different functions to search > > > > > > > within a given BTF. > > > > > > > > > > > > I guess that's fine. libbpf can do this check lazily on the first > > > > > > btf__find_by_name() to avoid unnecessary overhead. Agreed. > > > > > > > > > > Thank you for all the feedback. Based on the suggestions above, the sorting > > > > > implementation will be redesigned in the next version as follows: > > > > > > > > > > 1. The sorting operation will be fully handled by pahole, with no dependency on > > > > > libbpf. This means users can benefit from sorting simply by upgrading their > > > > > pahole version. > > > > > > > > I suggest that libbpf provides a sorting function, such as the > > > > btf__permute suggested > > > > by Andrii, for pahole to call. This approach allows pahole to leverage > > > > libbpf's existing > > > > helper functions and avoids code duplication. > > > > > > Could you please enumerate the functions you'd have to reimplement in > > > pahole? > > > > Yes. Once the BTF types are sorted, the type IDs in both the BTF and BTF ext > > sections must be remapped. Libbpf provides helper functions like > > btf_field_iter_init, > > btf_field_iter_next, > > btf_ext_visit_type_ids > > to iterate through the btf_field and btf_ext_info_sec entries that > > require updating. > > We will likely need to reimplement these three functions for this purpose. > > I think Andrii's suggestion is to have btf__permute in libbpf, > as it needs all the functions you mention. > But actual sorting can happen in pahole, then: > - allocate array of length num-types, initialize it 0..num-types; > - reorder it as one sees fit; > - call btf__permute() from libbpf and get all the renamings handled by it. Yes, the first two can be implemented in pahole, while the last one belongs in libbpf. > > [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* [RFC PATCH v2 3/5] libbpf: check if BTF is sorted to enable binary search 2025-10-20 9:39 [RFC PATCH v2 0/5] Significantly Improve BTF Type Lookup Performance Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 1/5] btf: search local BTF before base BTF Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search Donglin Peng @ 2025-10-20 9:39 ` Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 4/5] selftests/bpf: add tests for BTF deduplication and sorting Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME Donglin Peng 4 siblings, 0 replies; 32+ messages in thread From: Donglin Peng @ 2025-10-20 9:39 UTC (permalink / raw) To: ast Cc: linux-kernel, bpf, Donglin Peng, Eduard Zingerman, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin Now that libbpf supports sorted BTF types, add a check to determine whether binary search can be used for type lookups. Cc: Eduard Zingerman <eddyz87@gmail.com> Cc: Alexei Starovoitov <ast@kernel.org> Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com> Cc: Alan Maguire <alan.maguire@oracle.com> Cc: Song Liu <song@kernel.org> Signed-off-by: pengdonglin <pengdonglin@xiaomi.com> Signed-off-by: Donglin Peng <dolinux.peng@gmail.com> --- tools/lib/bpf/btf.c | 6 ++++++ 1 file changed, 6 insertions(+) diff --git a/tools/lib/bpf/btf.c b/tools/lib/bpf/btf.c index 87e47f0b78ba..92c370fe9b79 100644 --- a/tools/lib/bpf/btf.c +++ b/tools/lib/bpf/btf.c @@ -1091,6 +1091,8 @@ static struct btf *btf_new(const void *data, __u32 size, struct btf *base_btf, b if (err) goto done; + btf_check_sorted(btf, btf->start_id); + done: if (err) { btf__free(btf); @@ -1714,6 +1716,7 @@ static void btf_invalidate_raw_data(struct btf *btf) free(btf->raw_data_swapped); btf->raw_data_swapped = NULL; } + btf->nr_sorted_types = 0; } /* Ensure BTF is ready to be modified (by splitting into a three memory @@ -5456,6 +5459,9 @@ static int btf_dedup_remap_types(struct btf_dedup *d) } } + if (d->sort_by_kind_name) + btf_check_sorted(d->btf, d->btf->start_id); + if (!d->btf_ext) return 0; -- 2.34.1 ^ permalink raw reply related [flat|nested] 32+ messages in thread
* [RFC PATCH v2 4/5] selftests/bpf: add tests for BTF deduplication and sorting 2025-10-20 9:39 [RFC PATCH v2 0/5] Significantly Improve BTF Type Lookup Performance Donglin Peng ` (2 preceding siblings ...) 2025-10-20 9:39 ` [RFC PATCH v2 3/5] libbpf: check if BTF is sorted " Donglin Peng @ 2025-10-20 9:39 ` Donglin Peng 2025-10-21 19:07 ` Eduard Zingerman 2025-10-20 9:39 ` [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME Donglin Peng 4 siblings, 1 reply; 32+ messages in thread From: Donglin Peng @ 2025-10-20 9:39 UTC (permalink / raw) To: ast Cc: linux-kernel, bpf, Donglin Peng, Eduard Zingerman, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin Verify that BTF deduplication and sorting functionality works correctly Cc: Eduard Zingerman <eddyz87@gmail.com> Cc: Alexei Starovoitov <ast@kernel.org> Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com> Cc: Alan Maguire <alan.maguire@oracle.com> Cc: Song Liu <song@kernel.org> Signed-off-by: pengdonglin <pengdonglin@xiaomi.com> Signed-off-by: Donglin Peng <dolinux.peng@gmail.com> --- tools/testing/selftests/bpf/prog_tests/btf.c | 171 +++++++++++++++++++ 1 file changed, 171 insertions(+) diff --git a/tools/testing/selftests/bpf/prog_tests/btf.c b/tools/testing/selftests/bpf/prog_tests/btf.c index 8a9ba4292109..244f9d535bc2 100644 --- a/tools/testing/selftests/bpf/prog_tests/btf.c +++ b/tools/testing/selftests/bpf/prog_tests/btf.c @@ -8022,6 +8022,177 @@ static struct btf_dedup_test dedup_tests[] = { BTF_STR_SEC("\0foo\0x\0y\0foo_ptr"), }, }, +{ + .descr = "dedup_sort: strings deduplication", + .input = { + .raw_types = { + BTF_TYPE_INT_ENC(NAME_NTH(1), BTF_INT_SIGNED, 0, 32, 4), + BTF_TYPE_INT_ENC(NAME_NTH(2), BTF_INT_SIGNED, 0, 64, 8), + BTF_TYPE_INT_ENC(NAME_NTH(3), BTF_INT_SIGNED, 0, 32, 4), + BTF_TYPE_INT_ENC(NAME_NTH(4), BTF_INT_SIGNED, 0, 64, 8), + BTF_TYPE_INT_ENC(NAME_NTH(5), BTF_INT_SIGNED, 0, 32, 4), + BTF_END_RAW, + }, + BTF_STR_SEC("\0int\0long int\0int\0long int\0int"), + }, + .expect = { + .raw_types = { + BTF_TYPE_INT_ENC(NAME_NTH(1), BTF_INT_SIGNED, 0, 32, 4), + BTF_TYPE_INT_ENC(NAME_NTH(2), BTF_INT_SIGNED, 0, 64, 8), + BTF_END_RAW, + }, + BTF_STR_SEC("\0int\0long int"), + }, + .opts = { + .sort_by_kind_name = true, + }, +}, +{ + .descr = "dedup_sort: func/func_arg/var tags", + .input = { + .raw_types = { + /* int */ + BTF_TYPE_INT_ENC(0, BTF_INT_SIGNED, 0, 32, 4), /* [1] */ + /* static int t */ + BTF_VAR_ENC(NAME_NTH(1), 1, 0), /* [2] */ + /* void f(int a1, int a2) */ + BTF_FUNC_PROTO_ENC(0, 2), /* [3] */ + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(2), 1), + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(3), 1), + BTF_FUNC_ENC(NAME_NTH(4), 3), /* [4] */ + /* tag -> t */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 2, -1), /* [5] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 2, -1), /* [6] */ + /* tag -> func */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 4, -1), /* [7] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 4, -1), /* [8] */ + /* tag -> func arg a1 */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 4, 1), /* [9] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 4, 1), /* [10] */ + BTF_END_RAW, + }, + BTF_STR_SEC("\0t\0a1\0a2\0f\0tag"), + }, + .expect = { + .raw_types = { + BTF_FUNC_ENC(NAME_NTH(4), 7), /* [1] */ + BTF_VAR_ENC(NAME_NTH(1), 6, 0), /* [2] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 2, -1), /* [3] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 1, -1), /* [4] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 1, 1), /* [5] */ + BTF_TYPE_INT_ENC(0, BTF_INT_SIGNED, 0, 32, 4), /* [6] */ + BTF_FUNC_PROTO_ENC(0, 2), /* [7] */ + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(2), 6), + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(3), 6), + BTF_END_RAW, + }, + BTF_STR_SEC("\0t\0a1\0a2\0f\0tag"), + }, + .opts = { + .sort_by_kind_name = true, + }, +}, +{ + .descr = "dedup_sort: func/func_param tags", + .input = { + .raw_types = { + /* int */ + BTF_TYPE_INT_ENC(0, BTF_INT_SIGNED, 0, 32, 4), /* [1] */ + /* void f(int a1, int a2) */ + BTF_FUNC_PROTO_ENC(0, 2), /* [2] */ + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(1), 1), + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(2), 1), + BTF_FUNC_ENC(NAME_NTH(3), 2), /* [3] */ + /* void f(int a1, int a2) */ + BTF_FUNC_PROTO_ENC(0, 2), /* [4] */ + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(1), 1), + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(2), 1), + BTF_FUNC_ENC(NAME_NTH(3), 4), /* [5] */ + /* tag -> f: tag1, tag2 */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 3, -1), /* [6] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 3, -1), /* [7] */ + /* tag -> f/a2: tag1, tag2 */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 3, 1), /* [8] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 3, 1), /* [9] */ + /* tag -> f: tag1, tag3 */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 5, -1), /* [10] */ + BTF_DECL_TAG_ENC(NAME_NTH(6), 5, -1), /* [11] */ + /* tag -> f/a2: tag1, tag3 */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 5, 1), /* [12] */ + BTF_DECL_TAG_ENC(NAME_NTH(6), 5, 1), /* [13] */ + BTF_END_RAW, + }, + BTF_STR_SEC("\0a1\0a2\0f\0tag1\0tag2\0tag3"), + }, + .expect = { + .raw_types = { + BTF_FUNC_ENC(NAME_NTH(3), 9), /* [1] */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 1, -1), /* [2] */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 1, 1), /* [3] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 1, -1), /* [4] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 1, 1), /* [5] */ + BTF_DECL_TAG_ENC(NAME_NTH(6), 1, -1), /* [6] */ + BTF_DECL_TAG_ENC(NAME_NTH(6), 1, 1), /* [7] */ + BTF_TYPE_INT_ENC(0, BTF_INT_SIGNED, 0, 32, 4), /* [8] */ + BTF_FUNC_PROTO_ENC(0, 2), /* [9] */ + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(1), 8), + BTF_FUNC_PROTO_ARG_ENC(NAME_NTH(2), 8), + BTF_END_RAW, + }, + BTF_STR_SEC("\0a1\0a2\0f\0tag1\0tag2\0tag3"), + }, + .opts = { + .sort_by_kind_name = true, + }, +}, +{ + .descr = "dedup_sort: struct/struct_member tags", + .input = { + .raw_types = { + /* int */ + BTF_TYPE_INT_ENC(0, BTF_INT_SIGNED, 0, 32, 4), /* [1] */ + BTF_STRUCT_ENC(NAME_NTH(1), 2, 8), /* [2] */ + BTF_MEMBER_ENC(NAME_NTH(2), 1, 0), + BTF_MEMBER_ENC(NAME_NTH(3), 1, 32), + BTF_STRUCT_ENC(NAME_NTH(1), 2, 8), /* [3] */ + BTF_MEMBER_ENC(NAME_NTH(2), 1, 0), + BTF_MEMBER_ENC(NAME_NTH(3), 1, 32), + /* tag -> t: tag1, tag2 */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 2, -1), /* [4] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 2, -1), /* [5] */ + /* tag -> t/m2: tag1, tag2 */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 2, 1), /* [6] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 2, 1), /* [7] */ + /* tag -> t: tag1, tag3 */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 3, -1), /* [8] */ + BTF_DECL_TAG_ENC(NAME_NTH(6), 3, -1), /* [9] */ + /* tag -> t/m2: tag1, tag3 */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 3, 1), /* [10] */ + BTF_DECL_TAG_ENC(NAME_NTH(6), 3, 1), /* [11] */ + BTF_END_RAW, + }, + BTF_STR_SEC("\0t\0m1\0m2\0tag1\0tag2\0tag3"), + }, + .expect = { + .raw_types = { + BTF_STRUCT_ENC(NAME_NTH(1), 2, 8), /* [1] */ + BTF_MEMBER_ENC(NAME_NTH(2), 8, 0), + BTF_MEMBER_ENC(NAME_NTH(3), 8, 32), + BTF_DECL_TAG_ENC(NAME_NTH(4), 1, -1), /* [2] */ + BTF_DECL_TAG_ENC(NAME_NTH(4), 1, 1), /* [3] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 1, -1), /* [4] */ + BTF_DECL_TAG_ENC(NAME_NTH(5), 1, 1), /* [5] */ + BTF_DECL_TAG_ENC(NAME_NTH(6), 1, -1), /* [6] */ + BTF_DECL_TAG_ENC(NAME_NTH(6), 1, 1), /* [7] */ + BTF_TYPE_INT_ENC(0, BTF_INT_SIGNED, 0, 32, 4), /* [8] */ + BTF_END_RAW, + }, + BTF_STR_SEC("\0t\0m1\0m2\0tag1\0tag2\0tag3"), + }, + .opts = { + .sort_by_kind_name = true, + }, +}, }; static int btf_type_size(const struct btf_type *t) -- 2.34.1 ^ permalink raw reply related [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 4/5] selftests/bpf: add tests for BTF deduplication and sorting 2025-10-20 9:39 ` [RFC PATCH v2 4/5] selftests/bpf: add tests for BTF deduplication and sorting Donglin Peng @ 2025-10-21 19:07 ` Eduard Zingerman 2025-10-23 11:20 ` Donglin Peng 0 siblings, 1 reply; 32+ messages in thread From: Eduard Zingerman @ 2025-10-21 19:07 UTC (permalink / raw) To: Donglin Peng, ast Cc: linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: [...] > +{ > + .descr = "dedup_sort: strings deduplication", > + .input = { > + .raw_types = { > + BTF_TYPE_INT_ENC(NAME_NTH(1), BTF_INT_SIGNED, 0, 32, 4), > + BTF_TYPE_INT_ENC(NAME_NTH(2), BTF_INT_SIGNED, 0, 64, 8), > + BTF_TYPE_INT_ENC(NAME_NTH(3), BTF_INT_SIGNED, 0, 32, 4), > + BTF_TYPE_INT_ENC(NAME_NTH(4), BTF_INT_SIGNED, 0, 64, 8), > + BTF_TYPE_INT_ENC(NAME_NTH(5), BTF_INT_SIGNED, 0, 32, 4), > + BTF_END_RAW, > + }, > + BTF_STR_SEC("\0int\0long int\0int\0long int\0int"), > + }, > + .expect = { > + .raw_types = { > + BTF_TYPE_INT_ENC(NAME_NTH(1), BTF_INT_SIGNED, 0, 32, 4), > + BTF_TYPE_INT_ENC(NAME_NTH(2), BTF_INT_SIGNED, 0, 64, 8), > + BTF_END_RAW, > + }, > + BTF_STR_SEC("\0int\0long int"), > + }, > + .opts = { > + .sort_by_kind_name = true, > + }, > +}, I think that having so many tests for this feature is redundant. E.g. above strings handling test does not seem necessary, as btf__dedup_compact_and_sort_types() does not really change anything with regards to strings handling. I'd say that a single test including elements with and without names, and elements of different kind should suffice. [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 4/5] selftests/bpf: add tests for BTF deduplication and sorting 2025-10-21 19:07 ` Eduard Zingerman @ 2025-10-23 11:20 ` Donglin Peng 0 siblings, 0 replies; 32+ messages in thread From: Donglin Peng @ 2025-10-23 11:20 UTC (permalink / raw) To: Eduard Zingerman Cc: ast, linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Wed, Oct 22, 2025 at 3:07 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > [...] > > > +{ > > + .descr = "dedup_sort: strings deduplication", > > + .input = { > > + .raw_types = { > > + BTF_TYPE_INT_ENC(NAME_NTH(1), BTF_INT_SIGNED, 0, 32, 4), > > + BTF_TYPE_INT_ENC(NAME_NTH(2), BTF_INT_SIGNED, 0, 64, 8), > > + BTF_TYPE_INT_ENC(NAME_NTH(3), BTF_INT_SIGNED, 0, 32, 4), > > + BTF_TYPE_INT_ENC(NAME_NTH(4), BTF_INT_SIGNED, 0, 64, 8), > > + BTF_TYPE_INT_ENC(NAME_NTH(5), BTF_INT_SIGNED, 0, 32, 4), > > + BTF_END_RAW, > > + }, > > + BTF_STR_SEC("\0int\0long int\0int\0long int\0int"), > > + }, > > + .expect = { > > + .raw_types = { > > + BTF_TYPE_INT_ENC(NAME_NTH(1), BTF_INT_SIGNED, 0, 32, 4), > > + BTF_TYPE_INT_ENC(NAME_NTH(2), BTF_INT_SIGNED, 0, 64, 8), > > + BTF_END_RAW, > > + }, > > + BTF_STR_SEC("\0int\0long int"), > > + }, > > + .opts = { > > + .sort_by_kind_name = true, > > + }, > > +}, > > I think that having so many tests for this feature is redundant. > E.g. above strings handling test does not seem necessary, > as btf__dedup_compact_and_sort_types() does not really change anything > with regards to strings handling. > I'd say that a single test including elements with and without names, > and elements of different kind should suffice. Thanks for the suggestion. > > [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME 2025-10-20 9:39 [RFC PATCH v2 0/5] Significantly Improve BTF Type Lookup Performance Donglin Peng ` (3 preceding siblings ...) 2025-10-20 9:39 ` [RFC PATCH v2 4/5] selftests/bpf: add tests for BTF deduplication and sorting Donglin Peng @ 2025-10-20 9:39 ` Donglin Peng 2025-10-21 0:50 ` Eduard Zingerman 2025-10-21 17:27 ` Alan Maguire 4 siblings, 2 replies; 32+ messages in thread From: Donglin Peng @ 2025-10-20 9:39 UTC (permalink / raw) To: ast Cc: linux-kernel, bpf, Donglin Peng, Eduard Zingerman, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin Pahole v1.32 and later supports BTF sorting. Add a new configuration option to control whether to enable this feature for vmlinux and kernel modules. Cc: Eduard Zingerman <eddyz87@gmail.com> Cc: Alexei Starovoitov <ast@kernel.org> Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com> Cc: Alan Maguire <alan.maguire@oracle.com> Cc: Song Liu <song@kernel.org> Signed-off-by: pengdonglin <pengdonglin@xiaomi.com> Signed-off-by: Donglin Peng <dolinux.peng@gmail.com> --- kernel/bpf/Kconfig | 8 ++++++++ scripts/Makefile.btf | 5 +++++ 2 files changed, 13 insertions(+) diff --git a/kernel/bpf/Kconfig b/kernel/bpf/Kconfig index eb3de35734f0..08251a250f06 100644 --- a/kernel/bpf/Kconfig +++ b/kernel/bpf/Kconfig @@ -101,4 +101,12 @@ config BPF_LSM If you are unsure how to answer this question, answer N. +config BPF_SORT_BTF_BY_KIND_NAME + bool "Sort BTF types by kind and name" + depends on BPF_SYSCALL + help + This option sorts BTF types in vmlinux and kernel modules by their + kind and name, enabling binary search for btf_find_by_name_kind() + and significantly improving its lookup performance. + endmenu # "BPF subsystem" diff --git a/scripts/Makefile.btf b/scripts/Makefile.btf index db76335dd917..3f1a0b3c3f3f 100644 --- a/scripts/Makefile.btf +++ b/scripts/Makefile.btf @@ -29,6 +29,11 @@ ifneq ($(KBUILD_EXTMOD),) module-pahole-flags-$(call test-ge, $(pahole-ver), 128) += --btf_features=distilled_base endif +ifeq ($(call test-ge, $(pahole-ver), 132),y) +pahole-flags-$(CONFIG_BPF_SORT_BTF_BY_KIND_NAME) += --btf_features=sort +module-pahole-flags-$(CONFIG_BPF_SORT_BTF_BY_KIND_NAME) += --btf_features=sort +endif + endif pahole-flags-$(CONFIG_PAHOLE_HAS_LANG_EXCLUDE) += --lang_exclude=rust -- 2.34.1 ^ permalink raw reply related [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME 2025-10-20 9:39 ` [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME Donglin Peng @ 2025-10-21 0:50 ` Eduard Zingerman 2025-10-21 8:33 ` Donglin Peng 2025-10-21 17:27 ` Alan Maguire 1 sibling, 1 reply; 32+ messages in thread From: Eduard Zingerman @ 2025-10-21 0:50 UTC (permalink / raw) To: Donglin Peng, ast Cc: linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: [...] > diff --git a/kernel/bpf/Kconfig b/kernel/bpf/Kconfig > index eb3de35734f0..08251a250f06 100644 > --- a/kernel/bpf/Kconfig > +++ b/kernel/bpf/Kconfig > @@ -101,4 +101,12 @@ config BPF_LSM > > If you are unsure how to answer this question, answer N. > > +config BPF_SORT_BTF_BY_KIND_NAME > + bool "Sort BTF types by kind and name" > + depends on BPF_SYSCALL > + help > + This option sorts BTF types in vmlinux and kernel modules by their > + kind and name, enabling binary search for btf_find_by_name_kind() > + and significantly improving its lookup performance. > + Why having this as an option? There are no downsides to always enabling, right? The cost of sorting btf at build time should be negligible. [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME 2025-10-21 0:50 ` Eduard Zingerman @ 2025-10-21 8:33 ` Donglin Peng 0 siblings, 0 replies; 32+ messages in thread From: Donglin Peng @ 2025-10-21 8:33 UTC (permalink / raw) To: Eduard Zingerman Cc: ast, linux-kernel, bpf, Andrii Nakryiko, Alan Maguire, Song Liu, pengdonglin On Tue, Oct 21, 2025 at 8:50 AM Eduard Zingerman <eddyz87@gmail.com> wrote: > > On Mon, 2025-10-20 at 17:39 +0800, Donglin Peng wrote: > > [...] > > > diff --git a/kernel/bpf/Kconfig b/kernel/bpf/Kconfig > > index eb3de35734f0..08251a250f06 100644 > > --- a/kernel/bpf/Kconfig > > +++ b/kernel/bpf/Kconfig > > @@ -101,4 +101,12 @@ config BPF_LSM > > > > If you are unsure how to answer this question, answer N. > > > > +config BPF_SORT_BTF_BY_KIND_NAME > > + bool "Sort BTF types by kind and name" > > + depends on BPF_SYSCALL > > + help > > + This option sorts BTF types in vmlinux and kernel modules by their > > + kind and name, enabling binary search for btf_find_by_name_kind() > > + and significantly improving its lookup performance. > > + > > Why having this as an option? > There are no downsides to always enabling, right? > The cost of sorting btf at build time should be negligible. Thanks, I'll remove this config option in the next version as suggested. > > [...] ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME 2025-10-20 9:39 ` [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME Donglin Peng 2025-10-21 0:50 ` Eduard Zingerman @ 2025-10-21 17:27 ` Alan Maguire 2025-10-22 1:15 ` Donglin Peng 1 sibling, 1 reply; 32+ messages in thread From: Alan Maguire @ 2025-10-21 17:27 UTC (permalink / raw) To: Donglin Peng, ast Cc: linux-kernel, bpf, Eduard Zingerman, Andrii Nakryiko, Song Liu, pengdonglin On 20/10/2025 10:39, Donglin Peng wrote: > Pahole v1.32 and later supports BTF sorting. Add a new configuration > option to control whether to enable this feature for vmlinux and > kernel modules. > > Cc: Eduard Zingerman <eddyz87@gmail.com> > Cc: Alexei Starovoitov <ast@kernel.org> > Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com> > Cc: Alan Maguire <alan.maguire@oracle.com> > Cc: Song Liu <song@kernel.org> > Signed-off-by: pengdonglin <pengdonglin@xiaomi.com> > Signed-off-by: Donglin Peng <dolinux.peng@gmail.com> > --- > kernel/bpf/Kconfig | 8 ++++++++ > scripts/Makefile.btf | 5 +++++ > 2 files changed, 13 insertions(+) > > diff --git a/kernel/bpf/Kconfig b/kernel/bpf/Kconfig > index eb3de35734f0..08251a250f06 100644 > --- a/kernel/bpf/Kconfig > +++ b/kernel/bpf/Kconfig > @@ -101,4 +101,12 @@ config BPF_LSM > > If you are unsure how to answer this question, answer N. > > +config BPF_SORT_BTF_BY_KIND_NAME > + bool "Sort BTF types by kind and name" > + depends on BPF_SYSCALL > + help > + This option sorts BTF types in vmlinux and kernel modules by their > + kind and name, enabling binary search for btf_find_by_name_kind() > + and significantly improving its lookup performance. > + > endmenu # "BPF subsystem" > diff --git a/scripts/Makefile.btf b/scripts/Makefile.btf > index db76335dd917..3f1a0b3c3f3f 100644 > --- a/scripts/Makefile.btf > +++ b/scripts/Makefile.btf > @@ -29,6 +29,11 @@ ifneq ($(KBUILD_EXTMOD),) > module-pahole-flags-$(call test-ge, $(pahole-ver), 128) += --btf_features=distilled_base > endif > > +ifeq ($(call test-ge, $(pahole-ver), 132),y) > +pahole-flags-$(CONFIG_BPF_SORT_BTF_BY_KIND_NAME) += --btf_features=sort > +module-pahole-flags-$(CONFIG_BPF_SORT_BTF_BY_KIND_NAME) += --btf_features=sort > +endif > + perhaps it's useful informationally, but you don't need to wrap the addition of the sort flag in a pahole version check; unsupported btf_features are just ignored. Also we're at v1.30 in pahole now (we'll be releasing 1.31 shortly hopefully), so any version check should be v1.30/v1.31. I'd say just leave out the version check though. Alan ^ permalink raw reply [flat|nested] 32+ messages in thread
* Re: [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME 2025-10-21 17:27 ` Alan Maguire @ 2025-10-22 1:15 ` Donglin Peng 0 siblings, 0 replies; 32+ messages in thread From: Donglin Peng @ 2025-10-22 1:15 UTC (permalink / raw) To: Alan Maguire Cc: ast, linux-kernel, bpf, Eduard Zingerman, Andrii Nakryiko, Song Liu, pengdonglin On Wed, Oct 22, 2025 at 1:28 AM Alan Maguire <alan.maguire@oracle.com> wrote: > > On 20/10/2025 10:39, Donglin Peng wrote: > > Pahole v1.32 and later supports BTF sorting. Add a new configuration > > option to control whether to enable this feature for vmlinux and > > kernel modules. > > > > Cc: Eduard Zingerman <eddyz87@gmail.com> > > Cc: Alexei Starovoitov <ast@kernel.org> > > Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com> > > Cc: Alan Maguire <alan.maguire@oracle.com> > > Cc: Song Liu <song@kernel.org> > > Signed-off-by: pengdonglin <pengdonglin@xiaomi.com> > > Signed-off-by: Donglin Peng <dolinux.peng@gmail.com> > > --- > > kernel/bpf/Kconfig | 8 ++++++++ > > scripts/Makefile.btf | 5 +++++ > > 2 files changed, 13 insertions(+) > > > > diff --git a/kernel/bpf/Kconfig b/kernel/bpf/Kconfig > > index eb3de35734f0..08251a250f06 100644 > > --- a/kernel/bpf/Kconfig > > +++ b/kernel/bpf/Kconfig > > @@ -101,4 +101,12 @@ config BPF_LSM > > > > If you are unsure how to answer this question, answer N. > > > > +config BPF_SORT_BTF_BY_KIND_NAME > > + bool "Sort BTF types by kind and name" > > + depends on BPF_SYSCALL > > + help > > + This option sorts BTF types in vmlinux and kernel modules by their > > + kind and name, enabling binary search for btf_find_by_name_kind() > > + and significantly improving its lookup performance. > > + > > endmenu # "BPF subsystem" > > diff --git a/scripts/Makefile.btf b/scripts/Makefile.btf > > index db76335dd917..3f1a0b3c3f3f 100644 > > --- a/scripts/Makefile.btf > > +++ b/scripts/Makefile.btf > > @@ -29,6 +29,11 @@ ifneq ($(KBUILD_EXTMOD),) > > module-pahole-flags-$(call test-ge, $(pahole-ver), 128) += --btf_features=distilled_base > > endif > > > > +ifeq ($(call test-ge, $(pahole-ver), 132),y) > > +pahole-flags-$(CONFIG_BPF_SORT_BTF_BY_KIND_NAME) += --btf_features=sort > > +module-pahole-flags-$(CONFIG_BPF_SORT_BTF_BY_KIND_NAME) += --btf_features=sort > > +endif > > + > > perhaps it's useful informationally, but you don't need to wrap the > addition of the sort flag in a pahole version check; unsupported > btf_features are just ignored. Also we're at v1.30 in pahole now (we'll > be releasing 1.31 shortly hopefully), so any version check should be > v1.30/v1.31. I'd say just leave out the version check though. Understood, thanks. Will do. > > Alan ^ permalink raw reply [flat|nested] 32+ messages in thread
end of thread, other threads:[~2025-10-24 3:20 UTC | newest] Thread overview: 32+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2025-10-20 9:39 [RFC PATCH v2 0/5] Significantly Improve BTF Type Lookup Performance Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 1/5] btf: search local BTF before base BTF Donglin Peng 2025-10-21 1:06 ` Eduard Zingerman 2025-10-21 8:31 ` Donglin Peng 2025-10-21 15:56 ` Eduard Zingerman 2025-10-22 3:08 ` Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 2/5] btf: sort BTF types by kind and name to enable binary search Donglin Peng 2025-10-21 17:24 ` Alan Maguire 2025-10-22 4:47 ` Donglin Peng 2025-10-21 18:59 ` Eduard Zingerman 2025-10-22 3:02 ` Donglin Peng 2025-10-22 20:50 ` Eduard Zingerman 2025-10-23 10:35 ` Donglin Peng 2025-10-23 15:52 ` Alexei Starovoitov 2025-10-23 16:28 ` Andrii Nakryiko 2025-10-23 18:37 ` Alexei Starovoitov 2025-10-23 19:39 ` Andrii Nakryiko 2025-10-24 1:59 ` Donglin Peng 2025-10-24 2:23 ` Donglin Peng 2025-10-24 2:32 ` Eduard Zingerman 2025-10-24 3:04 ` Donglin Peng 2025-10-24 3:15 ` Eduard Zingerman 2025-10-24 3:19 ` Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 3/5] libbpf: check if BTF is sorted " Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 4/5] selftests/bpf: add tests for BTF deduplication and sorting Donglin Peng 2025-10-21 19:07 ` Eduard Zingerman 2025-10-23 11:20 ` Donglin Peng 2025-10-20 9:39 ` [RFC PATCH v2 5/5] btf: add CONFIG_BPF_SORT_BTF_BY_KIND_NAME Donglin Peng 2025-10-21 0:50 ` Eduard Zingerman 2025-10-21 8:33 ` Donglin Peng 2025-10-21 17:27 ` Alan Maguire 2025-10-22 1:15 ` Donglin Peng
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).