From: Leon Hwang <leon.hwang@linux.dev>
To: Xu Kuohai <xukuohai@huaweicloud.com>,
bpf@vger.kernel.org, linux-kernel@vger.kernel.org
Cc: Alexei Starovoitov <ast@kernel.org>,
Daniel Borkmann <daniel@iogearbox.net>,
Andrii Nakryiko <andrii@kernel.org>,
Eduard Zingerman <eddyz87@gmail.com>,
Kumar Kartikeya Dwivedi <memxor@gmail.com>,
Yonghong Song <yonghong.song@linux.dev>,
Jiri Olsa <jolsa@kernel.org>, KP Singh <kpsingh@kernel.org>,
Anton Protopopov <a.s.protopopov@gmail.com>,
Amery Hung <ameryhung@gmail.com>,
Eyal Birger <eyal.birger@gmail.com>, Rong Tao <rongtao@cestc.cn>
Subject: Re: [RFC PATCH bpf-next 00/12] bpf: Introduce static-defined tracing probe for BPF
Date: Mon, 29 Jun 2026 10:14:28 +0800 [thread overview]
Message-ID: <855e44f3-d22c-47e5-8bd1-c5d9f7014a75@linux.dev> (raw)
In-Reply-To: <cover.1782571533.git.xukuohai@huawei.com>
Hi Kuohai,
On 28/6/26 06:51, Xu Kuohai wrote:
> From: Xu Kuohai <xukuohai@huawei.com>
>
> This series introduces static-defined tracing probes for BPF programs.
> BPF SDT (static-defined tracing) works similarly to USDT. User defines
At first glance, the SDT idea looks cool to me.
However, what's your purpose of introducing SDT?
If to provide points in bpf progs to be traced, like tracepoints in
kernel functions, I think subprog+fentry is an alternative approach.
Comparing with SDT, subprog+fentry requires a function call at run time,
instead of a NOP like SDT.
For example,
#define __sink(expr) asm volatile("" : "+g"(expr))
static __noinline void my_trace(int len, int ret)
{
__sink(len);
__sink(ret);
}
SEC("xdp")
int xdp_prog(struct xdp_md *ctx)
{
int len = ctx->data_end - ctx->data;
int ret = XDP_DROP;
...
my_trace(len, ret);
...
}
'my_trace' can be traced using fentry to inspect the details of 'xdp_prog'.
Furthermore, if users don't want a function call at run time, e.g. they
don't want to call 'my_trace' at run time in production, they can patch
the callsite of 'my_trace' with NOP before loading 'xdp_prog', and drop
the subprog 'my_trace' in their user space application. This elimination
is approachable, since it is used heavily in bpfsnoop [1]. However, this
elimination is not easy to understand. Want me to show more details
about this elimination?
Link:
[1] https://github.com/bpfsnoop/bpfsnoop
Thanks,
Leon
> probes in the BPF source code. The probes are built into NOP instructions
> in the ELF. At runtime, when an observer is attached, the NOP instruction
> is patched to a CALL instruction to the observer prog trampoline.
>
> Unlike USDT, BPF SDT requires explicit macros to generate the function
> prototype BTF for each probe. This allows the verifier to validate the
> probe sites against the declared types, and observer programs can be
> attached similarly to normal tracing programs using the function prototype
> information.
>
> A probe with two arguments in the target program can be declared and defined
> like:
>
> BPF_SDT_DECLARE2(my_trace, int, int);
>
> SEC("xdp")
> int xdp_prog(struct xdp_md *ctx)
> {
> int len = ctx->data_end - ctx->data;
> int ret = XDP_DROP;
> ...
> BPF_SDT_PROBE2(my_trace, len, ret);
> ...
> }
>
> An observer would be like:
>
> SEC("bpf_sdt")
> int BPF_PROG(observer_prog, int len, int ret)
> {
> bpf_printk("len=%d ret=%d\n", len, ret);
> return 0;
> }
>
> The target program and probe site for the observer program should be set
> at load time via bpf_program__set_attach_target() manually, since program
> names are not unique and program IDs are allocated at runtime - there is
> no static way to identify the target program.
>
> For BPF_SDT_DECLARE2(my_trace, int, int) and BPF_SDT_PROBE2(my_trace, len, ret)
> macros, the compiler produces:
>
> [code section, e.g. xdp]
> goto +0 // NOP, patched to CALL at attach time
>
> [.bpf_sdt_notes section]
> ___sdt_jt_my_trace: // symbol marking this entry's boundary
> .quad 0b // 8 bytes: offset of the NOP in the code
> // section (resolved by the linker via
> // R_BPF_64_ABS64 relocation)
> r1 = %[arg1_reg] // 8 bytes per argument: BPF move insn
> // whose src_reg field encodes the BPF
> // register holding each probe argument
> r2 = %[arg2_reg]
>
> ....
>
> [.BTF section]
> FUNC_PROTO (int, int) -> void // from BPF_SDT_DECLARE2 stub
> DECL_TAG "bpf_sdt:my_trace:2" // keyed by name + nargs
>
> The .BTF and .bpf_sdt_notes sections are parsed by libbpf and ultimately
> converted to an insn_array map per bpf program. Each map slot records the
> instruction offset, jited address, FUNC_PROTO BTF ID, and argument register
> for each probe site.
>
> When an observer prog is loaded, the probe is resolved against the target
> program's SDT map using the target fd and probe name passed by the user.
>
> For attach, the existing bpf trampoline is reused with per-argument register
> mappings declared by the SDT macro.
>
> Xu Kuohai (12):
> libbpf: Prepare bpf SDT probe section for the linker
> libbpf: Introduce bpf SDT probe macros
> libbpf: Add bpf_sdt_notes section parser
> bpf: Create insn_array map for bpf SDT probe
> bpf: Collect SDT probe BTF IDs from BTF decl tags
> bpf: Add type check for SDT probe site
> bpf: Record probe name in SDT map
> libbpf: Add libbpf support to load SDT observer program
> bpf: Add kernel support to load SDT observer program
> bpf: Support attach and detach for SDT observer program
> bpf, x86: Add JIT support SDT for probe
> selftests/bpf: Add tests for bpf SDT probe
>
> arch/x86/net/bpf_jit_comp.c | 38 +-
> include/linux/bpf.h | 26 +-
> include/linux/bpf_types.h | 1 +
> include/linux/bpf_verifier.h | 3 +
> include/linux/filter.h | 1 +
> include/uapi/linux/bpf.h | 21 +-
> kernel/bpf/bpf_insn_array.c | 64 ++-
> kernel/bpf/cfg.c | 3 +
> kernel/bpf/core.c | 5 +
> kernel/bpf/fixups.c | 4 +
> kernel/bpf/liveness.c | 24 +-
> kernel/bpf/syscall.c | 163 +++++-
> kernel/bpf/trampoline.c | 4 +
> kernel/bpf/verifier.c | 174 +++++-
> tools/include/uapi/linux/bpf.h | 21 +-
> tools/lib/bpf/Makefile | 2 +-
> tools/lib/bpf/bpf.c | 10 +-
> tools/lib/bpf/bpf.h | 9 +-
> tools/lib/bpf/bpf_sdt.h | 346 ++++++++++++
> tools/lib/bpf/libbpf.c | 513 +++++++++++++++++-
> tools/lib/bpf/libbpf_internal.h | 1 +
> tools/lib/bpf/linker.c | 9 +
> .../selftests/bpf/prog_tests/test_bpf_sdt.c | 151 ++++++
> .../selftests/bpf/progs/bpf_sdt_observer.c | 30 +
> .../selftests/bpf/progs/bpf_sdt_target.c | 34 ++
> 25 files changed, 1622 insertions(+), 35 deletions(-)
> create mode 100644 tools/lib/bpf/bpf_sdt.h
> create mode 100644 tools/testing/selftests/bpf/prog_tests/test_bpf_sdt.c
> create mode 100644 tools/testing/selftests/bpf/progs/bpf_sdt_observer.c
> create mode 100644 tools/testing/selftests/bpf/progs/bpf_sdt_target.c
>
next prev parent reply other threads:[~2026-06-29 2:14 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-06-27 22:51 [RFC PATCH bpf-next 00/12] bpf: Introduce static-defined tracing probe for BPF Xu Kuohai
2026-06-27 20:51 ` [syzbot ci] " syzbot ci
2026-06-27 22:51 ` [RFC PATCH bpf-next 01/12] libbpf: Prepare bpf SDT probe section for the linker Xu Kuohai
2026-06-27 22:51 ` [RFC PATCH bpf-next 02/12] libbpf: Introduce bpf SDT probe macros Xu Kuohai
2026-06-27 22:51 ` [RFC PATCH bpf-next 03/12] libbpf: Add bpf_sdt_notes section parser Xu Kuohai
2026-06-27 22:51 ` [RFC PATCH bpf-next 04/12] bpf: Create insn_array map for bpf SDT probe Xu Kuohai
2026-06-27 15:34 ` bot+bpf-ci
2026-06-27 22:51 ` [RFC PATCH bpf-next 05/12] bpf: Collect SDT probe BTF IDs from BTF decl tags Xu Kuohai
2026-06-27 15:34 ` bot+bpf-ci
2026-06-27 22:51 ` [RFC PATCH bpf-next 06/12] bpf: Add type check for SDT probe site Xu Kuohai
2026-06-27 15:22 ` bot+bpf-ci
2026-06-27 22:51 ` [RFC PATCH bpf-next 07/12] bpf: Record probe name in SDT map Xu Kuohai
2026-06-27 22:51 ` [RFC PATCH bpf-next 08/12] libbpf: Add libbpf support to load SDT observer program Xu Kuohai
2026-06-27 22:51 ` [RFC PATCH bpf-next 09/12] bpf: Add kernel " Xu Kuohai
2026-06-27 15:22 ` bot+bpf-ci
2026-06-27 22:51 ` [RFC PATCH bpf-next 10/12] bpf: Support attach and detach for " Xu Kuohai
2026-06-27 22:51 ` [RFC PATCH bpf-next 11/12] bpf, x86: Add JIT support SDT for probe Xu Kuohai
2026-06-27 15:22 ` bot+bpf-ci
2026-06-27 22:51 ` [RFC PATCH bpf-next 12/12] selftests/bpf: Add tests for bpf SDT probe Xu Kuohai
2026-06-29 2:14 ` Leon Hwang [this message]
2026-06-29 3:27 ` [RFC PATCH bpf-next 00/12] bpf: Introduce static-defined tracing probe for BPF Xu Kuohai
2026-06-29 5:26 ` Leon Hwang
2026-06-29 7:51 ` Xu Kuohai
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=855e44f3-d22c-47e5-8bd1-c5d9f7014a75@linux.dev \
--to=leon.hwang@linux.dev \
--cc=a.s.protopopov@gmail.com \
--cc=ameryhung@gmail.com \
--cc=andrii@kernel.org \
--cc=ast@kernel.org \
--cc=bpf@vger.kernel.org \
--cc=daniel@iogearbox.net \
--cc=eddyz87@gmail.com \
--cc=eyal.birger@gmail.com \
--cc=jolsa@kernel.org \
--cc=kpsingh@kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=memxor@gmail.com \
--cc=rongtao@cestc.cn \
--cc=xukuohai@huaweicloud.com \
--cc=yonghong.song@linux.dev \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox