Re: [PATCH bpf-next v5 3/3] bpf: Cache build IDs in sleepable stackmap path

BPF List
 help / color / mirror / Atom feed

From: Ihor Solodrai <ihor.solodrai@linux.dev>
To: Andrii Nakryiko <andrii.nakryiko@gmail.com>
Cc: Alexei Starovoitov <ast@kernel.org>,
	Andrii Nakryiko <andrii@kernel.org>,
	Daniel Borkmann <daniel@iogearbox.net>,
	Eduard Zingerman <eddyz87@gmail.com>,
	Kumar Kartikeya Dwivedi <memxor@gmail.com>,
	Puranjay Mohan <puranjay@kernel.org>,
	Shakeel Butt <shakeel.butt@linux.dev>,
	Mykyta Yatsenko <mykyta.yatsenko5@gmail.com>,
	bpf@vger.kernel.org, linux-kernel@vger.kernel.org,
	kernel-team@meta.com
Subject: Re: [PATCH bpf-next v5 3/3] bpf: Cache build IDs in sleepable stackmap path
Date: Fri, 15 May 2026 16:59:21 -0700	[thread overview]
Message-ID: <c99ca87d-2835-4e49-815f-663452c9c156@linux.dev> (raw)
In-Reply-To: <CAEf4BzYYmA7=pSe--McfQNP=SEkPnZyhBLb-_-eB8XS4Gtw2QQ@mail.gmail.com>

On 5/15/26 3:40 PM, Andrii Nakryiko wrote:
> On Thu, May 14, 2026 at 5:53 PM Ihor Solodrai <ihor.solodrai@linux.dev> wrote:
>>
>> Stack traces often contain adjacent IPs from the same VMA or from
>> different VMAs backed by the same ELF file. Cache the last successfully
>> parsed build ID together with the resolved VMA range and backing file
>> so the sleepable build-ID path can avoid repeated VMA locking and file
>> parsing in common cases.
>>
>> Suggested-by: Mykyta Yatsenko <yatsenko@meta.com>
>> Acked-by: Mykyta Yatsenko <yatsenko@meta.com>
>> Signed-off-by: Ihor Solodrai <ihor.solodrai@linux.dev>
>> ---
>>  kernel/bpf/stackmap.c | 56 ++++++++++++++++++++++++++++++++++++++++---
>>  1 file changed, 53 insertions(+), 3 deletions(-)
>>
>> diff --git a/kernel/bpf/stackmap.c b/kernel/bpf/stackmap.c
>> index 08f7659505d1..7336fd55c856 100644
>> --- a/kernel/bpf/stackmap.c
>> +++ b/kernel/bpf/stackmap.c
>> @@ -230,13 +230,33 @@ static void stack_map_get_build_id_offset_sleepable(struct bpf_stack_build_id *i
>>                 .vma = NULL,
>>                 .mm = mm,
>>         };
>> -       unsigned long vm_pgoff, vm_start;
>> +       struct {
>> +               struct file *file;
>> +               const unsigned char *build_id;
>> +               unsigned long vm_start;
>> +               unsigned long vm_end;
>> +               unsigned long vm_pgoff;
>> +       } cache = {};
>> +       unsigned long vm_pgoff, vm_start, vm_end;
>>         struct vm_area_struct *vma;
>>         struct file *file;
>>         u64 ip;
>>
>>         for (u32 i = 0; i < trace_nr; i++) {
>>                 ip = READ_ONCE(id_offs[i].ip);
>> +
>> +               /*
>> +                * Range cache fast path: if ip falls within the previously
>> +                * resolved VMA range, reuse the cache build_id without
>> +                * re-acquiring the VMA lock.
>> +                */
>> +               if (cache.build_id && ip >= cache.vm_start && ip < cache.vm_end) {
>> +                       vm_start = cache.vm_start;
>> +                       vm_end = cache.vm_end;
>> +                       vm_pgoff = cache.vm_pgoff;
> 
> isn't this unnecessarily convoluted way of doing things? If we are
> here, we know for sure a) how to calculate offset and b) we need to
> memcpy. So just do that here and just then maybe jump all the way to
> id_offs[i].offset setting. Or just fill offset, status and memcpy
> build id here and continue.
> 
>> +                       goto build_id_valid;
>> +               }
>> +
>>                 vma = stack_map_lock_vma(&lock, ip);
>>                 if (!vma || !vma->vm_file) {
>>                         stack_map_build_id_set_ip(&id_offs[i]);
>> @@ -244,9 +264,21 @@ static void stack_map_get_build_id_offset_sleepable(struct bpf_stack_build_id *i
>>                         continue;
>>                 }
>>
>> -               file = get_file(vma->vm_file);
>> +               file = vma->vm_file;
>>                 vm_pgoff = vma->vm_pgoff;
>>                 vm_start = vma->vm_start;
>> +               vm_end = vma->vm_end;
>> +
>> +               if (file == cache.file) {
>> +                       /*
>> +                        * Same backing file as previous (e.g. different VMAs
>> +                        * of the same ELF binary). Reuse the cache build_id.
>> +                        */
>> +                       stack_map_unlock_vma(&lock);
>> +                       goto build_id_valid;
>> +               }
> 
> same thing here, file matched, so memcpy(build_id), calculate correct
> offset, set offset and status, unlock and continue. This logic is
> pretty straightforward, IMO it's cleaner to copy-paste these 4 lines
> than dealing with the goto maze...


Eeh... it's a style preference I guess. I had two memcpy() points 
originally, and then Mykyta suggested to consolidate them, so I did.

Having a single "build_id_valid" point with a common "ok" sequence
makes sense to me, but at the same time I am not allergic to 4-line 
copy pastes.

AFAIU with your suggestion we'll still have a label at
id_offs[i].offset = ... so it's not like we'll get rid of goto right?

> 
> 
>> +
>> +               file = get_file(file);
>>                 stack_map_unlock_vma(&lock);
>>
>>                 /* build_id_parse_file() may block on filesystem reads */
>> @@ -255,11 +287,29 @@ static void stack_map_get_build_id_offset_sleepable(struct bpf_stack_build_id *i
>>                         fput(file);
>>                         continue;
>>                 }
>> -               fput(file);
>>
>> +               if (cache.file)
>> +                       fput(cache.file);
>> +               cache.file = file;
>> +               cache.build_id = id_offs[i].build_id;
>> +
>> +build_id_valid:
>> +               /*
>> +                * In the slow path cache.build_id points to id_offs[i].build_id.
>> +                * Cache hits leave cache.build_id pointing at a prior slot,
>> +                * triggering the memcpy here.
>> +                */
>> +               if (cache.build_id != id_offs[i].build_id)
>> +                       memcpy(id_offs[i].build_id, cache.build_id, BUILD_ID_SIZE_MAX);
>> +               cache.vm_start = vm_start;
>> +               cache.vm_end = vm_end;
>> +               cache.vm_pgoff = vm_pgoff;
>>                 id_offs[i].offset = (vm_pgoff << PAGE_SHIFT) + ip - vm_start;
>>                 id_offs[i].status = BPF_STACK_BUILD_ID_VALID;
>>         }
>> +
>> +       if (cache.file)
>> +               fput(cache.file);
>>  }
>>
>>  /*
>> --
>> 2.54.0
>>

next prev parent reply	other threads:[~2026-05-15 23:59 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-05-15  0:52 [PATCH bpf-next v5 0/3] bpf: Implement stack_map_get_build_id_offset_sleepable() Ihor Solodrai
2026-05-15  0:52 ` [PATCH bpf-next v5 1/3] bpf: Factor out stack_map_build_id_set_ip() in stackmap.c Ihor Solodrai
2026-05-15  0:52 ` [PATCH bpf-next v5 2/3] bpf: Avoid faultable build ID reads under mm locks Ihor Solodrai
2026-05-15  1:25   ` sashiko-bot
2026-05-15  1:33   ` bot+bpf-ci
2026-05-15 22:40   ` Andrii Nakryiko
2026-05-15 23:44     ` Ihor Solodrai
2026-05-16  2:15       ` Andrii Nakryiko
2026-05-15  0:52 ` [PATCH bpf-next v5 3/3] bpf: Cache build IDs in sleepable stackmap path Ihor Solodrai
2026-05-15  1:21   ` bot+bpf-ci
2026-05-15 22:40   ` Andrii Nakryiko
2026-05-15 23:59     ` Ihor Solodrai [this message]
2026-05-16  2:20       ` Andrii Nakryiko

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=c99ca87d-2835-4e49-815f-663452c9c156@linux.dev \
    --to=ihor.solodrai@linux.dev \
    --cc=andrii.nakryiko@gmail.com \
    --cc=andrii@kernel.org \
    --cc=ast@kernel.org \
    --cc=bpf@vger.kernel.org \
    --cc=daniel@iogearbox.net \
    --cc=eddyz87@gmail.com \
    --cc=kernel-team@meta.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=memxor@gmail.com \
    --cc=mykyta.yatsenko5@gmail.com \
    --cc=puranjay@kernel.org \
    --cc=shakeel.butt@linux.dev \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox