From: Namhyung Kim <namhyung@kernel.org>
To: Tengda Wu <wutengda@huaweicloud.com>
Cc: Peter Zijlstra <peterz@infradead.org>,
leo.yan@linux.dev, Li Huafei <lihuafei1@huawei.com>,
Ian Rogers <irogers@google.com>,
Kim Phillips <kim.phillips@arm.com>,
Mark Rutland <mark.rutland@arm.com>,
Arnaldo Carvalho de Melo <acme@kernel.org>,
Ingo Molnar <mingo@redhat.com>, Bill Wendling <morbo@google.com>,
Nick Desaulniers <nick.desaulniers+lkml@gmail.com>,
Alexander Shishkin <alexander.shishkin@linux.intel.com>,
Adrian Hunter <adrian.hunter@intel.com>,
Zecheng Li <zli94@ncsu.edu>,
linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org,
llvm@lists.linux.dev
Subject: Re: [PATCH v2 09/16] perf annotate-arm64: Support load instruction tracking
Date: Thu, 9 Apr 2026 23:23:28 -0700 [thread overview]
Message-ID: <adiXYBhkEpbBR0ZA@google.com> (raw)
In-Reply-To: <20260403094800.1418825-10-wutengda@huaweicloud.com>
On Fri, Apr 03, 2026 at 09:47:53AM +0000, Tengda Wu wrote:
> Implement update_insn_state() for arm64 to track register state changes
> during load (LDR) instructions. This is essential for maintaining accurate
> type information when data is moved from memory to registers.
>
> The implementation handles the three primary arm64 addressing modes:
> 1. Signed offset: [base, #imm]
> 2. Pre-index: [base, #imm]!
> 3. Post-index: [base], #imm
>
> Introduce adjust_reg_index_state() to handle the side effects of pre-index
> and post-index addressing, where the base register is updated with the
> offset after or before the memory access. This ensures that the register's
> offset within a structure is correctly tracked across sequential
> instructions.
>
> A real-world example is shown below:
>
> ffff80008011f5b0 <pick_task_stop>:
> ffff80008011f5b8: ldr x0, [x0, #2712] // x0: struct rq* -> task_struct*
> ffff80008011f5c0: ldr w1, [x0, #104] // PMU sample at offset 0x68
>
> Before this commit, the type of x0 was incorrectly inferred as 'struct rq':
>
> find data type for 0x68(reg0) at pick_task_stop+0x10
> var [8] reg0 offset 0 type='struct rq*'
> chk [10] reg0 offset=0x68 ok=1 kind=1 (struct rq*) : Good!
> final result: type='struct rq'
>
> After this commit, the type of x0 is correctly inferred as 'struct task_struct':
>
> find data type for 0x68(reg0) at pick_task_stop+0x10
> var [8] reg0 offset 0 type='struct rq*'
> ldr [8] 0xa98(reg0) -> reg0 type='struct task_struct*'
> chk [10] reg0 offset=0x68 ok=1 kind=1 (struct task_struct*) : Good!
> final result: type='struct task_struct'
>
> Signed-off-by: Li Huafei <lihuafei1@huawei.com>
> Signed-off-by: Tengda Wu <wutengda@huaweicloud.com>
> ---
> .../perf/util/annotate-arch/annotate-arm64.c | 87 +++++++++++++++++++
> 1 file changed, 87 insertions(+)
>
> diff --git a/tools/perf/util/annotate-arch/annotate-arm64.c b/tools/perf/util/annotate-arch/annotate-arm64.c
> index 1fe4c503431b..cac2bf0021c9 100644
> --- a/tools/perf/util/annotate-arch/annotate-arm64.c
> +++ b/tools/perf/util/annotate-arch/annotate-arm64.c
> @@ -9,6 +9,8 @@
> #include <regex.h>
> #include "../annotate.h"
> #include "../disasm.h"
> +#include "../annotate-data.h"
> +#include "../debug.h"
>
> struct arch_arm64 {
> struct arch arch;
> @@ -254,6 +256,88 @@ static int extract_op_location_arm64(const struct arch *arch,
> return 0;
> }
>
> +#ifdef HAVE_LIBDW_SUPPORT
> +static int get_mem_offset(struct annotated_op_loc *op_loc, int type_offset)
> +{
> + if (op_loc->addr_mode == INSN_ADDR_POST_INDEX)
> + return type_offset;
> +
> + return op_loc->offset + type_offset;
> +}
> +
> +static void adjust_reg_index_state(struct type_state *state, int reg,
> + struct annotated_op_loc *op_loc,
> + const char *insn_name, u32 insn_offset)
> +{
> + struct type_state_reg *tsr;
> +
> + if (!has_reg_type(state, reg) ||
> + (op_loc->addr_mode != INSN_ADDR_PRE_INDEX &&
> + op_loc->addr_mode != INSN_ADDR_POST_INDEX))
> + return;
> +
> + tsr = &state->regs[reg];
> + tsr->offset = op_loc->offset + tsr->offset;
> + tsr->ok = true;
Maybe we can skip setting 'ok' here?
> +
> + pr_debug_dtp("%s [%x] %s-index %#x(reg%d) -> reg%d", insn_name,
> + insn_offset, op_loc->addr_mode == INSN_ADDR_PRE_INDEX ?
> + "pre" : "post", op_loc->offset, reg, reg);
> + pr_debug_type_name(&tsr->type, tsr->kind);
> +}
> +
> +static void update_insn_state_arm64(struct type_state *state,
> + struct data_loc_info *dloc, Dwarf_Die * cu_die __maybe_unused,
> + struct disasm_line *dl)
> +{
> + struct annotated_insn_loc loc;
> + struct annotated_op_loc *src = &loc.ops[INSN_OP_SOURCE];
> + struct annotated_op_loc *dst = &loc.ops[INSN_OP_TARGET];
> + struct type_state_reg *tsr;
> + Dwarf_Die type_die;
> + u32 insn_offset = dl->al.offset;
> + int sreg, dreg;
> +
> + if (annotate_get_insn_location(dloc->arch, dl, &loc) < 0)
> + return;
> +
> + sreg = src->reg1;
> + dreg = dst->reg1;
> +
> + /* Memory to register transfers */
> + if (!strncmp(dl->ins.name, "ld", 2)) {
> + struct type_state_reg dst_tsr;
> +
> + if (!has_reg_type(state, sreg) ||
> + !has_reg_type(state, dreg) ||
> + !state->regs[dreg].ok)
> + return;
I think we should set state->regs[sreg].ok to false if sreg is valid.
> +
> + tsr = &state->regs[sreg];
> + tsr->copied_from = -1;
> + dst_tsr = state->regs[dreg];
> +
> + /* Dereference the pointer if it has one */
> + if (dst_tsr.kind == TSR_KIND_TYPE &&
> + die_deref_ptr_type(&dst_tsr.type,
> + get_mem_offset(dst, dst_tsr.offset),
> + &type_die)) {
> + tsr->type = type_die;
> + tsr->kind = TSR_KIND_TYPE;
> + tsr->offset = 0;
> + tsr->ok = true;
> +
> + pr_debug_dtp("ldr [%x] %#x(reg%d) -> reg%d",
> + insn_offset, dst->offset, dreg, sreg);
It could confuse people since src and dst are opposite. Maybe you can
change the parse function to set the source and target (destination)
properly for each instruction.
> + pr_debug_type_name(&tsr->type, tsr->kind);
> +
> + adjust_reg_index_state(state, dreg, dst, "ldr", insn_offset);
> + }
Also you will need to update index offset and set the 'ok' state
properly.
Thanks,
Namhyung
> + return;
> + }
> +}
> +#endif
> +
> const struct arch *arch__new_arm64(const struct e_machine_and_e_flags *id,
> const char *cpuid __maybe_unused)
> {
> @@ -273,6 +357,9 @@ const struct arch *arch__new_arm64(const struct e_machine_and_e_flags *id,
> arch->objdump.imm_char = '#';
> arch->associate_instruction_ops = arm64__associate_instruction_ops;
> arch->extract_op_location = extract_op_location_arm64;
> +#ifdef HAVE_LIBDW_SUPPORT
> + arch->update_insn_state = update_insn_state_arm64;
> +#endif
>
> /* bl, blr */
> err = regcomp(&arm->call_insn, "^blr?$", REG_EXTENDED);
> --
> 2.34.1
>
next prev parent reply other threads:[~2026-04-10 6:23 UTC|newest]
Thread overview: 50+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-04-03 9:47 [PATCH v2 00/16] perf arm64: Support data type profiling Tengda Wu
2026-04-03 9:47 ` [PATCH v2 01/16] perf llvm: Fix arm64 adrp instruction disassembly mismatch with objdump Tengda Wu
2026-04-03 9:47 ` [PATCH v2 02/16] perf capstone: Fix arm64 jump/adrp " Tengda Wu
2026-04-07 6:43 ` Namhyung Kim
2026-04-10 9:08 ` Tengda Wu
2026-04-14 13:51 ` James Clark
2026-04-03 9:47 ` [PATCH v2 03/16] perf annotate-arm64: Generalize arm64_mov__parse to support standard operands Tengda Wu
2026-04-07 6:58 ` Namhyung Kim
2026-04-10 10:06 ` Tengda Wu
2026-04-14 14:13 ` James Clark
2026-04-03 9:47 ` [PATCH v2 04/16] perf annotate-arm64: Handle load and store instructions Tengda Wu
2026-04-07 7:09 ` Namhyung Kim
2026-04-10 10:16 ` Tengda Wu
2026-04-03 9:47 ` [PATCH v2 05/16] perf annotate: Introduce extract_op_location callback for arch-specific parsing Tengda Wu
2026-04-03 9:47 ` [PATCH v2 06/16] perf dwarf-regs: Adapt get_dwarf_regnum() for arm64 Tengda Wu
2026-04-03 9:47 ` [PATCH v2 07/16] perf annotate-arm64: Implement extract_op_location() callback Tengda Wu
2026-04-07 7:26 ` Namhyung Kim
2026-04-10 10:27 ` Tengda Wu
2026-04-03 9:47 ` [PATCH v2 08/16] perf annotate-arm64: Enable instruction tracking support Tengda Wu
2026-04-10 6:09 ` Namhyung Kim
2026-04-10 10:29 ` Tengda Wu
2026-04-03 9:47 ` [PATCH v2 09/16] perf annotate-arm64: Support load instruction tracking Tengda Wu
2026-04-10 6:23 ` Namhyung Kim [this message]
2026-04-10 10:37 ` Tengda Wu
2026-04-03 9:47 ` [PATCH v2 10/16] perf annotate-arm64: Support store " Tengda Wu
2026-04-03 9:47 ` [PATCH v2 11/16] perf annotate-arm64: Support stack variable tracking Tengda Wu
2026-04-10 6:29 ` Namhyung Kim
2026-04-10 10:41 ` Tengda Wu
2026-04-03 9:47 ` [PATCH v2 12/16] perf annotate-arm64: Support 'mov' instruction tracking Tengda Wu
2026-04-10 6:39 ` Namhyung Kim
2026-04-10 10:53 ` Tengda Wu
2026-04-03 9:47 ` [PATCH v2 13/16] perf annotate-arm64: Support 'add' " Tengda Wu
2026-04-10 6:42 ` Namhyung Kim
2026-04-10 10:49 ` Tengda Wu
2026-04-03 9:47 ` [PATCH v2 14/16] perf annotate-arm64: Support 'adrp' instruction to track global variables Tengda Wu
2026-04-03 9:47 ` [PATCH v2 15/16] perf annotate-arm64: Support per-cpu variable access tracking Tengda Wu
2026-04-03 9:48 ` [PATCH v2 16/16] perf annotate-arm64: Support 'mrs' instruction to track 'current' pointer Tengda Wu
2026-04-10 6:52 ` Namhyung Kim
2026-04-10 10:44 ` Tengda Wu
2026-04-07 6:31 ` [PATCH v2 00/16] perf arm64: Support data type profiling Namhyung Kim
2026-04-08 11:35 ` Tengda Wu
2026-04-10 7:00 ` Namhyung Kim
2026-04-10 8:17 ` Tengda Wu
2026-04-14 15:10 ` James Clark
2026-04-15 1:34 ` Tengda Wu
2026-04-16 15:31 ` James Clark
2026-04-17 1:53 ` Tengda Wu
2026-04-20 9:31 ` James Clark
2026-04-22 9:50 ` James Clark
2026-04-27 8:43 ` Tengda Wu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=adiXYBhkEpbBR0ZA@google.com \
--to=namhyung@kernel.org \
--cc=acme@kernel.org \
--cc=adrian.hunter@intel.com \
--cc=alexander.shishkin@linux.intel.com \
--cc=irogers@google.com \
--cc=kim.phillips@arm.com \
--cc=leo.yan@linux.dev \
--cc=lihuafei1@huawei.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-perf-users@vger.kernel.org \
--cc=llvm@lists.linux.dev \
--cc=mark.rutland@arm.com \
--cc=mingo@redhat.com \
--cc=morbo@google.com \
--cc=nick.desaulniers+lkml@gmail.com \
--cc=peterz@infradead.org \
--cc=wutengda@huaweicloud.com \
--cc=zli94@ncsu.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.