From: Ihor Solodrai <ihor.solodrai@linux.dev>
To: Andrii Nakryiko <andrii.nakryiko@gmail.com>
Cc: Alexei Starovoitov <ast@kernel.org>,
Andrii Nakryiko <andrii@kernel.org>,
Daniel Borkmann <daniel@iogearbox.net>,
Eduard Zingerman <eddyz87@gmail.com>,
Kumar Kartikeya Dwivedi <memxor@gmail.com>,
Puranjay Mohan <puranjay@kernel.org>,
Shakeel Butt <shakeel.butt@linux.dev>,
Mykyta Yatsenko <mykyta.yatsenko5@gmail.com>,
bpf@vger.kernel.org, linux-kernel@vger.kernel.org,
kernel-team@meta.com
Subject: Re: [PATCH bpf-next v5 3/3] bpf: Cache build IDs in sleepable stackmap path
Date: Fri, 15 May 2026 16:59:21 -0700 [thread overview]
Message-ID: <c99ca87d-2835-4e49-815f-663452c9c156@linux.dev> (raw)
In-Reply-To: <CAEf4BzYYmA7=pSe--McfQNP=SEkPnZyhBLb-_-eB8XS4Gtw2QQ@mail.gmail.com>
On 5/15/26 3:40 PM, Andrii Nakryiko wrote:
> On Thu, May 14, 2026 at 5:53 PM Ihor Solodrai <ihor.solodrai@linux.dev> wrote:
>>
>> Stack traces often contain adjacent IPs from the same VMA or from
>> different VMAs backed by the same ELF file. Cache the last successfully
>> parsed build ID together with the resolved VMA range and backing file
>> so the sleepable build-ID path can avoid repeated VMA locking and file
>> parsing in common cases.
>>
>> Suggested-by: Mykyta Yatsenko <yatsenko@meta.com>
>> Acked-by: Mykyta Yatsenko <yatsenko@meta.com>
>> Signed-off-by: Ihor Solodrai <ihor.solodrai@linux.dev>
>> ---
>> kernel/bpf/stackmap.c | 56 ++++++++++++++++++++++++++++++++++++++++---
>> 1 file changed, 53 insertions(+), 3 deletions(-)
>>
>> diff --git a/kernel/bpf/stackmap.c b/kernel/bpf/stackmap.c
>> index 08f7659505d1..7336fd55c856 100644
>> --- a/kernel/bpf/stackmap.c
>> +++ b/kernel/bpf/stackmap.c
>> @@ -230,13 +230,33 @@ static void stack_map_get_build_id_offset_sleepable(struct bpf_stack_build_id *i
>> .vma = NULL,
>> .mm = mm,
>> };
>> - unsigned long vm_pgoff, vm_start;
>> + struct {
>> + struct file *file;
>> + const unsigned char *build_id;
>> + unsigned long vm_start;
>> + unsigned long vm_end;
>> + unsigned long vm_pgoff;
>> + } cache = {};
>> + unsigned long vm_pgoff, vm_start, vm_end;
>> struct vm_area_struct *vma;
>> struct file *file;
>> u64 ip;
>>
>> for (u32 i = 0; i < trace_nr; i++) {
>> ip = READ_ONCE(id_offs[i].ip);
>> +
>> + /*
>> + * Range cache fast path: if ip falls within the previously
>> + * resolved VMA range, reuse the cache build_id without
>> + * re-acquiring the VMA lock.
>> + */
>> + if (cache.build_id && ip >= cache.vm_start && ip < cache.vm_end) {
>> + vm_start = cache.vm_start;
>> + vm_end = cache.vm_end;
>> + vm_pgoff = cache.vm_pgoff;
>
> isn't this unnecessarily convoluted way of doing things? If we are
> here, we know for sure a) how to calculate offset and b) we need to
> memcpy. So just do that here and just then maybe jump all the way to
> id_offs[i].offset setting. Or just fill offset, status and memcpy
> build id here and continue.
>
>> + goto build_id_valid;
>> + }
>> +
>> vma = stack_map_lock_vma(&lock, ip);
>> if (!vma || !vma->vm_file) {
>> stack_map_build_id_set_ip(&id_offs[i]);
>> @@ -244,9 +264,21 @@ static void stack_map_get_build_id_offset_sleepable(struct bpf_stack_build_id *i
>> continue;
>> }
>>
>> - file = get_file(vma->vm_file);
>> + file = vma->vm_file;
>> vm_pgoff = vma->vm_pgoff;
>> vm_start = vma->vm_start;
>> + vm_end = vma->vm_end;
>> +
>> + if (file == cache.file) {
>> + /*
>> + * Same backing file as previous (e.g. different VMAs
>> + * of the same ELF binary). Reuse the cache build_id.
>> + */
>> + stack_map_unlock_vma(&lock);
>> + goto build_id_valid;
>> + }
>
> same thing here, file matched, so memcpy(build_id), calculate correct
> offset, set offset and status, unlock and continue. This logic is
> pretty straightforward, IMO it's cleaner to copy-paste these 4 lines
> than dealing with the goto maze...
Eeh... it's a style preference I guess. I had two memcpy() points
originally, and then Mykyta suggested to consolidate them, so I did.
Having a single "build_id_valid" point with a common "ok" sequence
makes sense to me, but at the same time I am not allergic to 4-line
copy pastes.
AFAIU with your suggestion we'll still have a label at
id_offs[i].offset = ... so it's not like we'll get rid of goto right?
>
>
>> +
>> + file = get_file(file);
>> stack_map_unlock_vma(&lock);
>>
>> /* build_id_parse_file() may block on filesystem reads */
>> @@ -255,11 +287,29 @@ static void stack_map_get_build_id_offset_sleepable(struct bpf_stack_build_id *i
>> fput(file);
>> continue;
>> }
>> - fput(file);
>>
>> + if (cache.file)
>> + fput(cache.file);
>> + cache.file = file;
>> + cache.build_id = id_offs[i].build_id;
>> +
>> +build_id_valid:
>> + /*
>> + * In the slow path cache.build_id points to id_offs[i].build_id.
>> + * Cache hits leave cache.build_id pointing at a prior slot,
>> + * triggering the memcpy here.
>> + */
>> + if (cache.build_id != id_offs[i].build_id)
>> + memcpy(id_offs[i].build_id, cache.build_id, BUILD_ID_SIZE_MAX);
>> + cache.vm_start = vm_start;
>> + cache.vm_end = vm_end;
>> + cache.vm_pgoff = vm_pgoff;
>> id_offs[i].offset = (vm_pgoff << PAGE_SHIFT) + ip - vm_start;
>> id_offs[i].status = BPF_STACK_BUILD_ID_VALID;
>> }
>> +
>> + if (cache.file)
>> + fput(cache.file);
>> }
>>
>> /*
>> --
>> 2.54.0
>>
next prev parent reply other threads:[~2026-05-15 23:59 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-05-15 0:52 [PATCH bpf-next v5 0/3] bpf: Implement stack_map_get_build_id_offset_sleepable() Ihor Solodrai
2026-05-15 0:52 ` [PATCH bpf-next v5 1/3] bpf: Factor out stack_map_build_id_set_ip() in stackmap.c Ihor Solodrai
2026-05-15 0:52 ` [PATCH bpf-next v5 2/3] bpf: Avoid faultable build ID reads under mm locks Ihor Solodrai
2026-05-15 1:25 ` sashiko-bot
2026-05-15 1:33 ` bot+bpf-ci
2026-05-15 22:40 ` Andrii Nakryiko
2026-05-15 23:44 ` Ihor Solodrai
2026-05-16 2:15 ` Andrii Nakryiko
2026-05-15 0:52 ` [PATCH bpf-next v5 3/3] bpf: Cache build IDs in sleepable stackmap path Ihor Solodrai
2026-05-15 1:21 ` bot+bpf-ci
2026-05-15 22:40 ` Andrii Nakryiko
2026-05-15 23:59 ` Ihor Solodrai [this message]
2026-05-16 2:20 ` Andrii Nakryiko
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=c99ca87d-2835-4e49-815f-663452c9c156@linux.dev \
--to=ihor.solodrai@linux.dev \
--cc=andrii.nakryiko@gmail.com \
--cc=andrii@kernel.org \
--cc=ast@kernel.org \
--cc=bpf@vger.kernel.org \
--cc=daniel@iogearbox.net \
--cc=eddyz87@gmail.com \
--cc=kernel-team@meta.com \
--cc=linux-kernel@vger.kernel.org \
--cc=memxor@gmail.com \
--cc=mykyta.yatsenko5@gmail.com \
--cc=puranjay@kernel.org \
--cc=shakeel.butt@linux.dev \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox