From: James Clark <james.clark@linaro.org>
To: Ian Rogers <irogers@google.com>
Cc: John Garry <john.g.garry@oracle.com>,
Will Deacon <will@kernel.org>, Mike Leach <mike.leach@linaro.org>,
Leo Yan <leo.yan@linux.dev>,
Peter Zijlstra <peterz@infradead.org>,
Ingo Molnar <mingo@redhat.com>,
Arnaldo Carvalho de Melo <acme@kernel.org>,
Namhyung Kim <namhyung@kernel.org>,
Mark Rutland <mark.rutland@arm.com>,
Alexander Shishkin <alexander.shishkin@linux.intel.com>,
Jiri Olsa <jolsa@kernel.org>,
Adrian Hunter <adrian.hunter@intel.com>,
Al Grant <al.grant@arm.com>,
linux-arm-kernel@lists.infradead.org,
linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH 3/4] perf arm_spe: Decode Arm N1 IMPDEF events
Date: Tue, 7 Apr 2026 13:35:16 +0100 [thread overview]
Message-ID: <ee012f48-5d08-4004-b9d0-bcac825cb50c@linaro.org> (raw)
In-Reply-To: <CAP-5=fVcYOJ_3TWd6won5FVaH2MH6QwLhCoCRnNzzeE-9PgO1Q@mail.gmail.com>
On 02/04/2026 4:26 pm, Ian Rogers wrote:
> On Wed, Apr 1, 2026 at 7:26 AM James Clark <james.clark@linaro.org> wrote:
>>
>> From the TRM [1], N1 has one IMPDEF event which isn't covered by the
>> common list. Add a framework so that more cores can be added in the
>> future and that the N1 IMPDEF event can be decoded. Also increase the
>> size of the buffer because we're adding more strings and if it gets
>> truncated it falls back to a hex dump only.
>>
>> [1]: https://developer.arm.com/documentation/100616/0401/Statistical-Profiling-Extension/implementation-defined-features-of-SPE
>> Suggested-by: Al Grant <al.grant@arm.com>
>> Signed-off-by: James Clark <james.clark@linaro.org>
>> ---
>> tools/perf/util/arm-spe-decoder/Build | 2 +
>> .../util/arm-spe-decoder/arm-spe-pkt-decoder.c | 45 ++++++++++++++++++++--
>> .../util/arm-spe-decoder/arm-spe-pkt-decoder.h | 5 ++-
>> tools/perf/util/arm-spe.c | 13 ++++---
>> 4 files changed, 54 insertions(+), 11 deletions(-)
>>
>> diff --git a/tools/perf/util/arm-spe-decoder/Build b/tools/perf/util/arm-spe-decoder/Build
>> index ab500e0efe24..97a298d1e279 100644
>> --- a/tools/perf/util/arm-spe-decoder/Build
>> +++ b/tools/perf/util/arm-spe-decoder/Build
>> @@ -1 +1,3 @@
>> perf-util-y += arm-spe-pkt-decoder.o arm-spe-decoder.o
>> +
>> +CFLAGS_arm-spe-pkt-decoder.o += -I$(srctree)/tools/arch/arm64/include/ -I$(OUTPUT)arch/arm64/include/generated/
>> diff --git a/tools/perf/util/arm-spe-decoder/arm-spe-pkt-decoder.c b/tools/perf/util/arm-spe-decoder/arm-spe-pkt-decoder.c
>> index c880b0dec3a1..42a7501d4dfe 100644
>> --- a/tools/perf/util/arm-spe-decoder/arm-spe-pkt-decoder.c
>> +++ b/tools/perf/util/arm-spe-decoder/arm-spe-pkt-decoder.c
>> @@ -15,6 +15,8 @@
>>
>> #include "arm-spe-pkt-decoder.h"
>>
>> +#include "../../arm64/include/asm/cputype.h"
>
> Sashiko spotted:
> https://sashiko.dev/#/patchset/20260401-james-spe-impdef-decode-v1-0-ad0d372c220c%40linaro.org
> """
> This isn't a bug, but does this include directive rely on accidental
> path normalization?
>
> The relative path ../../arm64/include/asm/cputype.h does not exist relative
> to arm-spe-pkt-decoder.c. It only compiles because the Build file adds
> -I$(srctree)/tools/arch/arm64/include/ to CFLAGS.
>
> Would it be cleaner to use #include <asm/cputype.h> to explicitly rely on
> the include path?
> [ ... ]
> """
> I wouldn't use <asm/cputype.h> due to cross-compilation and the like,
> instead just add the extra "../" into the include path.
>
Do you mean change the #include to this?
#include "../../../arm64/include/asm/cputype.h"
I still need to add:
CFLAGS_arm-spe-pkt-decoder.o += -I$(srctree)/tools/arch/arm64/include/
To make the this include in cputype.h work:
#include <asm/sysreg.h>
Which probably only works because there isn't a sysreg.h on other
architectures. But I'm not sure what the significance of ../../ vs
../../../ is if either compile? arm-spe.c already does it with ../../
which is what I copied.
>> +
>> static const char * const arm_spe_packet_name[] = {
>> [ARM_SPE_PAD] = "PAD",
>> [ARM_SPE_END] = "END",
>> @@ -307,6 +309,11 @@ static const struct ev_string common_ev_strings[] = {
>> { .event = 0, .desc = NULL },
>> };
>>
>> +static const struct ev_string n1_event_strings[] = {
>> + { .event = 12, .desc = "LATE-PREFETCH" },
>> + { .event = 0, .desc = NULL },
>> +};
>> +
>> static u64 print_event_list(int *err, char **buf, size_t *buf_len,
>> const struct ev_string *ev_strings, u64 payload)
>> {
>> @@ -318,14 +325,44 @@ static u64 print_event_list(int *err, char **buf, size_t *buf_len,
>> return payload;
>> }
>>
>> +struct event_print_handle {
>> + const struct midr_range *midr_ranges;
>> + const struct ev_string *ev_strings;
>> +};
>> +
>> +#define EV_PRINT(range, strings) \
>> + { \
>> + .midr_ranges = range, \
>> + .ev_strings = strings, \
>> + }
>> +
>> +static const struct midr_range n1_event_encoding_cpus[] = {
>> + MIDR_ALL_VERSIONS(MIDR_NEOVERSE_N1),
>> + {},
>> +};
>> +
>> +static const struct event_print_handle event_print_handles[] = {
>> + EV_PRINT(n1_event_encoding_cpus, n1_event_strings),
>> +};
>> +
>> static int arm_spe_pkt_desc_event(const struct arm_spe_pkt *packet,
>> - char *buf, size_t buf_len)
>> + char *buf, size_t buf_len, u64 midr)
>> {
>> u64 payload = packet->payload;
>> int err = 0;
>>
>> arm_spe_pkt_out_string(&err, &buf, &buf_len, "EV");
>> - print_event_list(&err, &buf, &buf_len, common_ev_strings, payload);
>> + payload = print_event_list(&err, &buf, &buf_len, common_ev_strings,
>> + payload);
>> +
>> + /* Try to decode IMPDEF bits for known CPUs */
>> + for (unsigned int i = 0; i < ARRAY_SIZE(event_print_handles); i++) {
>> + if (is_midr_in_range_list(midr,
>> + event_print_handles[i].midr_ranges))
>> + payload = print_event_list(&err, &buf, &buf_len,
>> + event_print_handles[i].ev_strings,
>> + payload);
>> + }
>>
>> return err;
>> }
>> @@ -506,7 +543,7 @@ static int arm_spe_pkt_desc_counter(const struct arm_spe_pkt *packet,
>> }
>>
>> int arm_spe_pkt_desc(const struct arm_spe_pkt *packet, char *buf,
>> - size_t buf_len)
>> + size_t buf_len, u64 midr)
>> {
>> int idx = packet->index;
>> unsigned long long payload = packet->payload;
>> @@ -522,7 +559,7 @@ int arm_spe_pkt_desc(const struct arm_spe_pkt *packet, char *buf,
>> arm_spe_pkt_out_string(&err, &buf, &blen, "%s", name);
>> break;
>> case ARM_SPE_EVENTS:
>> - err = arm_spe_pkt_desc_event(packet, buf, buf_len);
>> + err = arm_spe_pkt_desc_event(packet, buf, buf_len, midr);
>> break;
>> case ARM_SPE_OP_TYPE:
>> err = arm_spe_pkt_desc_op_type(packet, buf, buf_len);
>> diff --git a/tools/perf/util/arm-spe-decoder/arm-spe-pkt-decoder.h b/tools/perf/util/arm-spe-decoder/arm-spe-pkt-decoder.h
>> index adf4cde320aa..17b067fe3c87 100644
>> --- a/tools/perf/util/arm-spe-decoder/arm-spe-pkt-decoder.h
>> +++ b/tools/perf/util/arm-spe-decoder/arm-spe-pkt-decoder.h
>> @@ -11,7 +11,7 @@
>> #include <stddef.h>
>> #include <stdint.h>
>>
>> -#define ARM_SPE_PKT_DESC_MAX 256
>> +#define ARM_SPE_PKT_DESC_MAX 512
>>
>> #define ARM_SPE_NEED_MORE_BYTES -1
>> #define ARM_SPE_BAD_PACKET -2
>> @@ -186,5 +186,6 @@ const char *arm_spe_pkt_name(enum arm_spe_pkt_type);
>> int arm_spe_get_packet(const unsigned char *buf, size_t len,
>> struct arm_spe_pkt *packet);
>>
>> -int arm_spe_pkt_desc(const struct arm_spe_pkt *packet, char *buf, size_t len);
>> +int arm_spe_pkt_desc(const struct arm_spe_pkt *packet, char *buf, size_t len,
>> + u64 midr);
>> #endif
>> diff --git a/tools/perf/util/arm-spe.c b/tools/perf/util/arm-spe.c
>> index 7447b000f9cd..46f0309c092b 100644
>> --- a/tools/perf/util/arm-spe.c
>> +++ b/tools/perf/util/arm-spe.c
>> @@ -135,7 +135,7 @@ struct data_source_handle {
>> }
>>
>> static void arm_spe_dump(struct arm_spe *spe __maybe_unused,
>> - unsigned char *buf, size_t len)
>> + unsigned char *buf, size_t len, u64 midr)
>> {
>> struct arm_spe_pkt packet;
>> size_t pos = 0;
>> @@ -161,7 +161,7 @@ static void arm_spe_dump(struct arm_spe *spe __maybe_unused,
>> color_fprintf(stdout, color, " ");
>> if (ret > 0) {
>> ret = arm_spe_pkt_desc(&packet, desc,
>> - ARM_SPE_PKT_DESC_MAX);
>> + ARM_SPE_PKT_DESC_MAX, midr);
>> if (!ret)
>> color_fprintf(stdout, color, " %s\n", desc);
>> } else {
>> @@ -174,10 +174,10 @@ static void arm_spe_dump(struct arm_spe *spe __maybe_unused,
>> }
>>
>> static void arm_spe_dump_event(struct arm_spe *spe, unsigned char *buf,
>> - size_t len)
>> + size_t len, u64 midr)
>> {
>> printf(".\n");
>> - arm_spe_dump(spe, buf, len);
>> + arm_spe_dump(spe, buf, len, midr);
>> }
>>
>> static int arm_spe_get_trace(struct arm_spe_buffer *b, void *data)
>> @@ -1469,8 +1469,11 @@ static int arm_spe_process_auxtrace_event(struct perf_session *session,
>> /* Dump here now we have copied a piped trace out of the pipe */
>> if (dump_trace) {
>> if (auxtrace_buffer__get_data(buffer, fd)) {
>> + u64 midr = 0;
>> +
>> + arm_spe__get_midr(spe, buffer->cpu.cpu, &midr);
>
> Sashiko claims to have spotted an issue here:
> """
> Is it possible for arm_spe__get_midr() to cause a segmentation fault here?
>
> If the trace is from an older recording (metadata version 1) and the
> environment lacks a CPUID string (such as during cross-architecture
> analysis), perf_env__cpuid() returns NULL.
>
> It appears arm_spe__get_midr() then passes this NULL pointer to
> strtol(cpuid, NULL, 16), which leads to undefined behavior.
> """
>
> But this feels like, if this happens you're already having a bad time
> and these changes aren't necessarily making things worse.
>
> Thanks,
> Ian
>
Yeah I think it might be possible so I can add an error instead of a
segfault. I'll check the rest of the Sashiko comments too.
>> arm_spe_dump_event(spe, buffer->data,
>> - buffer->size);
>> + buffer->size, midr);
>> auxtrace_buffer__put_data(buffer);
>> }
>> }
>>
>> --
>> 2.34.1
>>
next prev parent reply other threads:[~2026-04-07 12:35 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-04-01 14:25 [PATCH 0/4] perf arm_spe: Dump IMPDEF events James Clark
2026-04-01 14:25 ` [PATCH 1/4] perf arm_spe: Make a function to get the MIDR James Clark
2026-04-02 8:55 ` Leo Yan
2026-04-01 14:25 ` [PATCH 2/4] perf arm_spe: Turn event name mappings into an array James Clark
2026-04-02 9:13 ` Leo Yan
2026-04-02 9:48 ` James Clark
2026-04-02 15:30 ` Ian Rogers
2026-04-01 14:25 ` [PATCH 3/4] perf arm_spe: Decode Arm N1 IMPDEF events James Clark
2026-04-02 9:32 ` Leo Yan
2026-04-02 9:49 ` James Clark
2026-04-02 15:26 ` Ian Rogers
2026-04-07 12:35 ` James Clark [this message]
2026-04-07 18:26 ` Ian Rogers
2026-04-08 8:47 ` James Clark
2026-04-10 4:51 ` Namhyung Kim
2026-04-01 14:25 ` [PATCH 4/4] perf arm_spe: Print remaining IMPDEF event numbers James Clark
2026-04-02 9:46 ` Leo Yan
2026-04-06 18:18 ` [PATCH 0/4] perf arm_spe: Dump IMPDEF events Namhyung Kim
2026-04-07 8:18 ` James Clark
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ee012f48-5d08-4004-b9d0-bcac825cb50c@linaro.org \
--to=james.clark@linaro.org \
--cc=acme@kernel.org \
--cc=adrian.hunter@intel.com \
--cc=al.grant@arm.com \
--cc=alexander.shishkin@linux.intel.com \
--cc=irogers@google.com \
--cc=john.g.garry@oracle.com \
--cc=jolsa@kernel.org \
--cc=leo.yan@linux.dev \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-perf-users@vger.kernel.org \
--cc=mark.rutland@arm.com \
--cc=mike.leach@linaro.org \
--cc=mingo@redhat.com \
--cc=namhyung@kernel.org \
--cc=peterz@infradead.org \
--cc=will@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox