From: Jiri Olsa <jolsa@redhat.com>
To: Arnaldo Carvalho de Melo <acme@kernel.org>
Cc: German Gomez <german.gomez@arm.com>,
linux-kernel@vger.kernel.org, linux-perf-users@vger.kernel.org,
Alexandre Truong <alexandre.truong@arm.com>,
John Garry <john.garry@huawei.com>, Will Deacon <will@kernel.org>,
Mathieu Poirier <mathieu.poirier@linaro.org>,
Leo Yan <leo.yan@linaro.org>, Mark Rutland <mark.rutland@arm.com>,
Alexander Shishkin <alexander.shishkin@linux.intel.com>,
Namhyung Kim <namhyung@kernel.org>,
linux-arm-kernel@lists.infradead.org
Subject: Re: [PATCH v5 6/6] perf arm64: inject missing frames if perf-record used "--call-graph=fp"
Date: Tue, 21 Dec 2021 16:06:13 +0100
Message-ID: <YcHtZbTQ7B64Py+7@krava>
In-Reply-To: <YcHh37Iw2GXBkXm9@kernel.org>

On Tue, Dec 21, 2021 at 11:17:03AM -0300, Arnaldo Carvalho de Melo wrote:
> On Tue, Dec 21, 2021 at 10:32:50AM +0100, Jiri Olsa wrote:
> > On Fri, Dec 17, 2021 at 03:45:20PM +0000, German Gomez wrote:
> >
> > SNIP
> >
> > > +}
> > > +
> > > +u64 get_leaf_frame_caller_aarch64(struct perf_sample *sample, struct thread *thread, int usr_idx)
> > > +{
> > > + int ret;
> > > + struct entries entries = {};
> > > + struct regs_dump old_regs = sample->user_regs;
> > > +
> > > + if (!get_leaf_frame_caller_enabled(sample))
> > > + return 0;
> > > +
> > > + /*
> > > + * If PC and SP are not recorded, get the value of PC from the stack
> > > + * and set its mask. SP is not used when doing the unwinding but it
> > > + * still needs to be set to prevent failures.
> > > + */
> > > +
> > > + if (!(sample->user_regs.mask & SMPL_REG_MASK(PERF_REG_ARM64_PC))) {
> > > + sample->user_regs.cache_mask |= SMPL_REG_MASK(PERF_REG_ARM64_PC);
> > > + sample->user_regs.cache_regs[PERF_REG_ARM64_PC] = sample->callchain->ips[usr_idx+1];
> > > + }
> > > +
> > > + if (!(sample->user_regs.mask & SMPL_REG_MASK(PERF_REG_ARM64_SP))) {
> > > + sample->user_regs.cache_mask |= SMPL_REG_MASK(PERF_REG_ARM64_SP);
> > > + sample->user_regs.cache_regs[PERF_REG_ARM64_SP] = 0;
> > > + }
> > > +
> > > + ret = unwind__get_entries(add_entry, &entries, thread, sample, 2);
> >
> > just curious, did you try this with both unwinders libunwind/libdw?
> >
> > any chance you could add arm specific test for this?
> >
> > otherwise it looks good to me
>
> Whole patchkit?

yes, it's for the patchset

jirka

>
> > Acked-by: Jiri Olsa <jolsa@kernel.org>
> >
> > thanks,
> > jirka
> >
> >
> > > + sample->user_regs = old_regs;
> > > +
> > > + if (ret || entries.length != 2)
> > > + return ret;
> > > +
> > > + return callchain_param.order == ORDER_CALLER ? entries.stack[0] : entries.stack[1];
> > > +}
> > > diff --git a/tools/perf/util/arm64-frame-pointer-unwind-support.h b/tools/perf/util/arm64-frame-pointer-unwind-support.h
> > > new file mode 100644
> > > index 000000000000..32af9ce94398
> > > --- /dev/null
> > > +++ b/tools/perf/util/arm64-frame-pointer-unwind-support.h
> > > @@ -0,0 +1,10 @@
> > > +/* SPDX-License-Identifier: GPL-2.0 */
> > > +#ifndef __PERF_ARM_FRAME_POINTER_UNWIND_SUPPORT_H
> > > +#define __PERF_ARM_FRAME_POINTER_UNWIND_SUPPORT_H
> > > +
> > > +#include "event.h"
> > > +#include "thread.h"
> > > +
> > > +u64 get_leaf_frame_caller_aarch64(struct perf_sample *sample, struct thread *thread, int user_idx);
> > > +
> > > +#endif /* __PERF_ARM_FRAME_POINTER_UNWIND_SUPPORT_H */
> > > diff --git a/tools/perf/util/machine.c b/tools/perf/util/machine.c
> > > index 3eddad009f78..a00fd6796b35 100644
> > > --- a/tools/perf/util/machine.c
> > > +++ b/tools/perf/util/machine.c
> > > @@ -34,6 +34,7 @@
> > > #include "bpf-event.h"
> > > #include <internal/lib.h> // page_size
> > > #include "cgroup.h"
> > > +#include "arm64-frame-pointer-unwind-support.h"
> > >
> > > #include <linux/ctype.h>
> > > #include <symbol/kallsyms.h>
> > > @@ -2710,10 +2711,13 @@ static int find_prev_cpumode(struct ip_callchain *chain, struct thread *thread,
> > > return err;
> > > }
> > >
> > > -static u64 get_leaf_frame_caller(struct perf_sample *sample __maybe_unused,
> > > - struct thread *thread __maybe_unused, int usr_idx __maybe_unused)
> > > +static u64 get_leaf_frame_caller(struct perf_sample *sample,
> > > + struct thread *thread, int usr_idx)
> > > {
> > > - return 0;
> > > + if (machine__normalize_is(thread->maps->machine, "arm64"))
> > > + return get_leaf_frame_caller_aarch64(sample, thread, usr_idx);
> > > + else
> > > + return 0;
> > > }
> > >
> > > static int thread__resolve_callchain_sample(struct thread *thread,
> > > @@ -3114,14 +3118,19 @@ int machine__set_current_tid(struct machine *machine, int cpu, pid_t pid,
> > > }
> > >
> > > /*
> > > - * Compares the raw arch string. N.B. see instead perf_env__arch() if a
> > > - * normalized arch is needed.
> > > + * Compares the raw arch string. N.B. see instead perf_env__arch() or
> > > + * machine__normalize_is() if a normalized arch is needed.
> > > */
> > > bool machine__is(struct machine *machine, const char *arch)
> > > {
> > > return machine && !strcmp(perf_env__raw_arch(machine->env), arch);
> > > }
> > >
> > > +bool machine__normalize_is(struct machine *machine, const char *arch)
> > > +{
> > > + return machine && !strcmp(perf_env__arch(machine->env), arch);
> > > +}
> > > +
> > > int machine__nr_cpus_avail(struct machine *machine)
> > > {
> > > return machine ? perf_env__nr_cpus_avail(machine->env) : 0;
> > > diff --git a/tools/perf/util/machine.h b/tools/perf/util/machine.h
> > > index a143087eeb47..665535153411 100644
> > > --- a/tools/perf/util/machine.h
> > > +++ b/tools/perf/util/machine.h
> > > @@ -208,6 +208,7 @@ static inline bool machine__is_host(struct machine *machine)
> > > }
> > >
> > > bool machine__is(struct machine *machine, const char *arch);
> > > +bool machine__normalize_is(struct machine *machine, const char *arch);
> > > int machine__nr_cpus_avail(struct machine *machine);
> > >
> > > struct thread *__machine__findnew_thread(struct machine *machine, pid_t pid, pid_t tid);
> > > --
> > > 2.25.1
> > >
>
> --
>
> - Arnaldo
>
Thread overview:
2021-12-17 15:45 [PATCH v5 0/6] perf tools/arm64: Fix missing leaf-function callers in ARM64 when using "--call-graph=fp" German Gomez
2021-12-17 15:45 ` [PATCH v5 1/6] perf tools: record ARM64 LR register automatically German Gomez
2021-12-17 15:45 ` [PATCH v5 2/6] perf tools: add a mechanism to inject stack frames German Gomez
2021-12-17 15:45 ` [PATCH v5 3/6] perf tools: Refactor script__setup_sample_type() German Gomez
2021-12-17 15:45 ` [PATCH v5 4/6] perf tools: enable dwarf_callchain_users on arm64 German Gomez
2021-12-17 15:45 ` [PATCH v5 5/6] perf tools: Refactor SMPL_REG macro in perf_regs.h German Gomez
2021-12-17 15:45 ` [PATCH v5 6/6] perf arm64: inject missing frames if perf-record used "--call-graph=fp" German Gomez
2021-12-17 16:01 ` James Clark
2021-12-18 11:35 ` Arnaldo Carvalho de Melo
2021-12-21 9:32 ` Jiri Olsa
2021-12-21 14:17 ` Arnaldo Carvalho de Melo
2021-12-21 15:06 ` Jiri Olsa [this message]
2022-01-10 10:48 ` German Gomez