* [PATCH bpf-next v2] selftests/bpf: Fix flakiness of task_local_storage/sys_enter_exit
@ 2026-02-24 21:12 Ihor Solodrai
2026-02-24 21:14 ` Emil Tsalapatis
` (2 more replies)
0 siblings, 3 replies; 5+ messages in thread
From: Ihor Solodrai @ 2026-02-24 21:12 UTC
To: Alexei Starovoitov, Andrii Nakryiko, Daniel Borkmann,
Eduard Zingerman, Amery Hung
Cc: Mykyta Yatsenko, bpf, kernel-team
The test_sys_enter_exit test was setting target_pid before attaching
the BPF programs, which causes syscalls made during the attach phase
to be counted. This is flaky because, apparently, there is no
guarantee that both on_enter and on_exit will trigger during the
attachment.

Move the target_pid assignment to after task_local_storage__attach()
so that only explicit sys_gettid() calls are counted.

Reported-by: BPF CI Bot (Claude Opus 4.6) <bot+bpf-ci@kernel.org>
Closes: https://github.com/kernel-patches/vmtest/issues/448
Signed-off-by: Ihor Solodrai <ihor.solodrai@linux.dev>

---

v1->v2: reset skel->bss->target_pid to 0 before asserts (Amery)
v1: https://lore.kernel.org/bpf/20260224015855.1481707-1-ihor.solodrai@linux.dev/

---
 .../bpf/prog_tests/task_local_storage.c | 16 +++++++++++-----
 1 file changed, 11 insertions(+), 5 deletions(-)

diff --git a/tools/testing/selftests/bpf/prog_tests/task_local_storage.c b/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
index 7bee33797c71..1b26c12f255a 100644
--- a/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
+++ b/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
@@ -25,24 +25,30 @@
 static void test_sys_enter_exit(void)
 {
 	struct task_local_storage *skel;
+	pid_t pid = sys_gettid();
 	int err;
 
 	skel = task_local_storage__open_and_load();
 	if (!ASSERT_OK_PTR(skel, "skel_open_and_load"))
 		return;
 
-	skel->bss->target_pid = sys_gettid();
-
 	err = task_local_storage__attach(skel);
 	if (!ASSERT_OK(err, "skel_attach"))
 		goto out;
 
+	/* Set target_pid after attach so that syscalls made during
+	 * attach are not counted.
+	 */
+	skel->bss->target_pid = pid;
+
 	sys_gettid();
 	sys_gettid();
 
-	/* 3x syscalls: 1x attach and 2x gettid */
-	ASSERT_EQ(skel->bss->enter_cnt, 3, "enter_cnt");
-	ASSERT_EQ(skel->bss->exit_cnt, 3, "exit_cnt");
+	skel->bss->target_pid = 0;
+
+	/* 2x gettid syscalls */
+	ASSERT_EQ(skel->bss->enter_cnt, 2, "enter_cnt");
+	ASSERT_EQ(skel->bss->exit_cnt, 2, "exit_cnt");
 	ASSERT_EQ(skel->bss->mismatch_cnt, 0, "mismatch_cnt");
 out:
 	task_local_storage__destroy(skel);
--
2.53.0
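
The ordering problem described in the patch can be illustrated with a small
userspace simulation. This is only a sketch under stated assumptions, not the
selftest or the BPF program: every sim_* and run_* name is hypothetical, the
hooks are assumed to count a syscall only when the caller's pid equals
target_pid, and whether each hook observes the attach-phase syscall is passed
in as a flag because the thread says that is not guaranteed.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for the counters the test reads via skel->bss. */
struct sim_state {
	int target_pid;	/* hooks ignore callers whose pid differs */
	int enter_cnt;
	int exit_cnt;
};

/* Assumed gate: count only when the caller matches target_pid. */
static void sim_on_enter(struct sim_state *s, int pid)
{
	if (pid == s->target_pid)
		s->enter_cnt++;
}

static void sim_on_exit(struct sim_state *s, int pid)
{
	if (pid == s->target_pid)
		s->exit_cnt++;
}

/* One complete simulated syscall: entry hook, then exit hook. */
static void sim_syscall(struct sim_state *s, int pid)
{
	sim_on_enter(s, pid);
	sim_on_exit(s, pid);
}

/*
 * Simulated attach. The flags model the nondeterminism: each hook may or
 * may not observe the syscall issued while attaching.
 */
static void sim_attach(struct sim_state *s, int pid,
		       bool enter_seen, bool exit_seen)
{
	if (enter_seen)
		sim_on_enter(s, pid);
	if (exit_seen)
		sim_on_exit(s, pid);
}

/* Old ordering: target_pid is set before attach. */
struct sim_state run_old_order(bool enter_seen, bool exit_seen)
{
	struct sim_state s = {0};
	int pid = 1234;	/* arbitrary nonzero thread id */

	s.target_pid = pid;
	sim_attach(&s, pid, enter_seen, exit_seen);
	sim_syscall(&s, pid);
	sim_syscall(&s, pid);
	return s;
}

/* New ordering: target_pid is set after attach and cleared before reading. */
struct sim_state run_new_order(bool enter_seen, bool exit_seen)
{
	struct sim_state s = {0};
	int pid = 1234;

	sim_attach(&s, pid, enter_seen, exit_seen);	/* target_pid is 0 here */
	s.target_pid = pid;
	sim_syscall(&s, pid);
	sim_syscall(&s, pid);
	s.target_pid = 0;	/* stop counting before the counters are read */
	return s;
}
```

In this model, run_old_order(true, false) ends with enter_cnt at 3 and
exit_cnt at 2, so an assertion expecting 3 for both would fail, while
run_new_order() ends with 2 and 2 for every combination of the two flags.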
* Re: [PATCH bpf-next v2] selftests/bpf: Fix flakiness of task_local_storage/sys_enter_exit
2026-02-24 21:12 [PATCH bpf-next v2] selftests/bpf: Fix flakiness of task_local_storage/sys_enter_exit Ihor Solodrai
@ 2026-02-24 21:14 ` Emil Tsalapatis
2026-02-24 22:04 ` Amery Hung
2026-02-24 23:00 ` patchwork-bot+netdevbpf
2 siblings, 0 replies; 5+ messages in thread
From: Emil Tsalapatis @ 2026-02-24 21:14 UTC
To: Ihor Solodrai, Alexei Starovoitov, Andrii Nakryiko,
Daniel Borkmann, Eduard Zingerman, Amery Hung
Cc: Mykyta Yatsenko, bpf, kernel-team
On Tue Feb 24, 2026 at 4:12 PM EST, Ihor Solodrai wrote:
> The test_sys_enter_exit test was setting target_pid before attaching
> the BPF programs, which causes syscalls made during the attach phase
> to be counted. This is flaky because, apparently, there is no
> guarantee that both on_enter and on_exit will trigger during the
> attachment.
>
> Move the target_pid assignment to after task_local_storage__attach()
> so that only explicit sys_gettid() calls are counted.
>
> Reported-by: BPF CI Bot (Claude Opus 4.6) <bot+bpf-ci@kernel.org>
> Closes: https://github.com/kernel-patches/vmtest/issues/448
> Signed-off-by: Ihor Solodrai <ihor.solodrai@linux.dev>
>
Reviewed-by: Emil Tsalapatis <emil@etsalapatis.com>
> ---
>
> v1->v2: reset skel->bss->target_pid to 0 before asserts (Amery)
> v1: https://lore.kernel.org/bpf/20260224015855.1481707-1-ihor.solodrai@linux.dev/
>
> ---
> .../bpf/prog_tests/task_local_storage.c | 16 +++++++++++-----
> 1 file changed, 11 insertions(+), 5 deletions(-)
>
> diff --git a/tools/testing/selftests/bpf/prog_tests/task_local_storage.c b/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
> index 7bee33797c71..1b26c12f255a 100644
> --- a/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
> +++ b/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
> @@ -25,24 +25,30 @@
> static void test_sys_enter_exit(void)
> {
> struct task_local_storage *skel;
> + pid_t pid = sys_gettid();
> int err;
>
> skel = task_local_storage__open_and_load();
> if (!ASSERT_OK_PTR(skel, "skel_open_and_load"))
> return;
>
> - skel->bss->target_pid = sys_gettid();
> -
> err = task_local_storage__attach(skel);
> if (!ASSERT_OK(err, "skel_attach"))
> goto out;
>
> + /* Set target_pid after attach so that syscalls made during
> + * attach are not counted.
> + */
> + skel->bss->target_pid = pid;
> +
> sys_gettid();
> sys_gettid();
>
> - /* 3x syscalls: 1x attach and 2x gettid */
> - ASSERT_EQ(skel->bss->enter_cnt, 3, "enter_cnt");
> - ASSERT_EQ(skel->bss->exit_cnt, 3, "exit_cnt");
> + skel->bss->target_pid = 0;
> +
> + /* 2x gettid syscalls */
> + ASSERT_EQ(skel->bss->enter_cnt, 2, "enter_cnt");
> + ASSERT_EQ(skel->bss->exit_cnt, 2, "exit_cnt");
> ASSERT_EQ(skel->bss->mismatch_cnt, 0, "mismatch_cnt");
> out:
> task_local_storage__destroy(skel);
* Re: [PATCH bpf-next v2] selftests/bpf: Fix flakiness of task_local_storage/sys_enter_exit
2026-02-24 21:12 [PATCH bpf-next v2] selftests/bpf: Fix flakiness of task_local_storage/sys_enter_exit Ihor Solodrai
2026-02-24 21:14 ` Emil Tsalapatis
@ 2026-02-24 22:04 ` Amery Hung
2026-02-24 22:49 ` Ihor Solodrai
2026-02-24 23:00 ` patchwork-bot+netdevbpf
2 siblings, 1 reply; 5+ messages in thread
From: Amery Hung @ 2026-02-24 22:04 UTC
To: Ihor Solodrai
Cc: Alexei Starovoitov, Andrii Nakryiko, Daniel Borkmann,
Eduard Zingerman, Mykyta Yatsenko, bpf, kernel-team
On Tue, Feb 24, 2026 at 1:12 PM Ihor Solodrai <ihor.solodrai@linux.dev> wrote:
>
> The test_sys_enter_exit test was setting target_pid before attaching
> the BPF programs, which causes syscalls made during the attach phase
> to be counted. This is flaky because, apparently, there is no
> guarantee that both on_enter and on_exit will trigger during the
> attachment.
>
> Move the target_pid assignment to after task_local_storage__attach()
> so that only explicit sys_gettid() calls are counted.
>
> Reported-by: BPF CI Bot (Claude Opus 4.6) <bot+bpf-ci@kernel.org>
> Closes: https://github.com/kernel-patches/vmtest/issues/448
> Signed-off-by: Ihor Solodrai <ihor.solodrai@linux.dev>
>
> ---
>
> v1->v2: reset skel->bss->target_pid to 0 before asserts (Amery)
> v1: https://lore.kernel.org/bpf/20260224015855.1481707-1-ihor.solodrai@linux.dev/
>
> ---
> .../bpf/prog_tests/task_local_storage.c | 16 +++++++++++-----
> 1 file changed, 11 insertions(+), 5 deletions(-)
>
> diff --git a/tools/testing/selftests/bpf/prog_tests/task_local_storage.c b/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
> index 7bee33797c71..1b26c12f255a 100644
> --- a/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
> +++ b/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
> @@ -25,24 +25,30 @@
> static void test_sys_enter_exit(void)
> {
> struct task_local_storage *skel;
> + pid_t pid = sys_gettid();
> int err;
>
> skel = task_local_storage__open_and_load();
> if (!ASSERT_OK_PTR(skel, "skel_open_and_load"))
> return;
>
> - skel->bss->target_pid = sys_gettid();
> -
> err = task_local_storage__attach(skel);
> if (!ASSERT_OK(err, "skel_attach"))
> goto out;
>
> + /* Set target_pid after attach so that syscalls made during
> + * attach are not counted.
> + */
> + skel->bss->target_pid = pid;
> +
> sys_gettid();
> sys_gettid();
>
> - /* 3x syscalls: 1x attach and 2x gettid */
> - ASSERT_EQ(skel->bss->enter_cnt, 3, "enter_cnt");
> - ASSERT_EQ(skel->bss->exit_cnt, 3, "exit_cnt");
> + skel->bss->target_pid = 0;
> +
> + /* 2x gettid syscalls */
> + ASSERT_EQ(skel->bss->enter_cnt, 2, "enter_cnt");
> + ASSERT_EQ(skel->bss->exit_cnt, 2, "exit_cnt");

Does tools/testing/selftests/bpf/prog_tests/cgrp_local_storage.c have
the same flakiness issue, and if so, should it be updated in the same
way?

> ASSERT_EQ(skel->bss->mismatch_cnt, 0, "mismatch_cnt");
> out:
> task_local_storage__destroy(skel);
> --
> 2.53.0
>
* Re: [PATCH bpf-next v2] selftests/bpf: Fix flakiness of task_local_storage/sys_enter_exit
2026-02-24 22:04 ` Amery Hung
@ 2026-02-24 22:49 ` Ihor Solodrai
0 siblings, 0 replies; 5+ messages in thread
From: Ihor Solodrai @ 2026-02-24 22:49 UTC
To: Amery Hung
Cc: Alexei Starovoitov, Andrii Nakryiko, Daniel Borkmann,
Eduard Zingerman, Mykyta Yatsenko, bpf, kernel-team
On 2/24/26 2:04 PM, Amery Hung wrote:
> On Tue, Feb 24, 2026 at 1:12 PM Ihor Solodrai <ihor.solodrai@linux.dev> wrote:
>>
>> The test_sys_enter_exit test was setting target_pid before attaching
>> the BPF programs, which causes syscalls made during the attach phase
>> to be counted. This is flaky because, apparently, there is no
>> guarantee that both on_enter and on_exit will trigger during the
>> attachment.
>>
>> Move the target_pid assignment to after task_local_storage__attach()
>> so that only explicit sys_gettid() calls are counted.
>>
>> Reported-by: BPF CI Bot (Claude Opus 4.6) <bot+bpf-ci@kernel.org>
>> Closes: https://github.com/kernel-patches/vmtest/issues/448
>> Signed-off-by: Ihor Solodrai <ihor.solodrai@linux.dev>
>>
>> ---
>>
>> v1->v2: reset skel->bss->target_pid to 0 before asserts (Amery)
>> v1: https://lore.kernel.org/bpf/20260224015855.1481707-1-ihor.solodrai@linux.dev/
>>
>> ---
>> .../bpf/prog_tests/task_local_storage.c | 16 +++++++++++-----
>> 1 file changed, 11 insertions(+), 5 deletions(-)
>>
>> diff --git a/tools/testing/selftests/bpf/prog_tests/task_local_storage.c b/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
>> index 7bee33797c71..1b26c12f255a 100644
>> --- a/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
>> +++ b/tools/testing/selftests/bpf/prog_tests/task_local_storage.c
>> @@ -25,24 +25,30 @@
>> static void test_sys_enter_exit(void)
>> {
>> struct task_local_storage *skel;
>> + pid_t pid = sys_gettid();
>> int err;
>>
>> skel = task_local_storage__open_and_load();
>> if (!ASSERT_OK_PTR(skel, "skel_open_and_load"))
>> return;
>>
>> - skel->bss->target_pid = sys_gettid();
>> -
>> err = task_local_storage__attach(skel);
>> if (!ASSERT_OK(err, "skel_attach"))
>> goto out;
>>
>> + /* Set target_pid after attach so that syscalls made during
>> + * attach are not counted.
>> + */
>> + skel->bss->target_pid = pid;
>> +
>> sys_gettid();
>> sys_gettid();
>>
>> - /* 3x syscalls: 1x attach and 2x gettid */
>> - ASSERT_EQ(skel->bss->enter_cnt, 3, "enter_cnt");
>> - ASSERT_EQ(skel->bss->exit_cnt, 3, "exit_cnt");
>> + skel->bss->target_pid = 0;
>> +
>> + /* 2x gettid syscalls */
>> + ASSERT_EQ(skel->bss->enter_cnt, 2, "enter_cnt");
>> + ASSERT_EQ(skel->bss->exit_cnt, 2, "exit_cnt");
>
> Does tools/testing/selftests/bpf/prog_tests/cgrp_local_storage.c have
> the same flakiness issue, and if so, should it be updated in the same
> way?

I haven't seen similar failures on CI, but that may just mean we were
lucky.

Do you know of other tests that may have the same issue?

I checked these, but only task_local_storage and cgrp_local_storage
assert counts:

$ grep -r 'skel->bss->target_pid = .*gettid' tools/testing/selftests/bpf/
tools/testing/selftests/bpf/prog_tests/cgroup_xattr.c: skel->bss->target_pid = sys_gettid();
tools/testing/selftests/bpf/prog_tests/rcu_read_lock.c: skel->bss->target_pid = sys_gettid();
tools/testing/selftests/bpf/prog_tests/rcu_read_lock.c: skel->bss->target_pid = sys_gettid();
tools/testing/selftests/bpf/prog_tests/task_local_storage.c: skel->bss->target_pid = sys_gettid();
tools/testing/selftests/bpf/prog_tests/cgrp_local_storage.c: skel->bss->target_pid = sys_gettid();
tools/testing/selftests/bpf/prog_tests/cgrp_local_storage.c: skel->bss->target_pid = sys_gettid();

>
>> ASSERT_EQ(skel->bss->mismatch_cnt, 0, "mismatch_cnt");
>> out:
>> task_local_storage__destroy(skel);
>> --
>> 2.53.0
>>
* Re: [PATCH bpf-next v2] selftests/bpf: Fix flakiness of task_local_storage/sys_enter_exit
2026-02-24 21:12 [PATCH bpf-next v2] selftests/bpf: Fix flakiness of task_local_storage/sys_enter_exit Ihor Solodrai
2026-02-24 21:14 ` Emil Tsalapatis
2026-02-24 22:04 ` Amery Hung
@ 2026-02-24 23:00 ` patchwork-bot+netdevbpf
2 siblings, 0 replies; 5+ messages in thread
From: patchwork-bot+netdevbpf @ 2026-02-24 23:00 UTC
To: Ihor Solodrai
Cc: ast, andrii, daniel, eddyz87, ameryhung, yatsenko, bpf,
kernel-team

Hello:

This patch was applied to bpf/bpf-next.git (master)
by Alexei Starovoitov <ast@kernel.org>:

On Tue, 24 Feb 2026 13:12:02 -0800 you wrote:
> The test_sys_enter_exit test was setting target_pid before attaching
> the BPF programs, which causes syscalls made during the attach phase
> to be counted. This is flaky because, apparently, there is no
> guarantee that both on_enter and on_exit will trigger during the
> attachment.
>
> Move the target_pid assignment to after task_local_storage__attach()
> so that only explicit sys_gettid() calls are counted.
>
> [...]

Here is the summary with links:
  - [bpf-next,v2] selftests/bpf: Fix flakiness of task_local_storage/sys_enter_exit
    https://git.kernel.org/bpf/bpf-next/c/c89b50cc6b9f

You are awesome, thank you!
--
Deet-doot-dot, I am a bot.
https://korg.docs.kernel.org/patchwork/pwbot.html