From: Namhyung Kim <namhyung@kernel.org>
To: "Liang, Kan" <kan.liang@linux.intel.com>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>,
Ian Rogers <irogers@google.com>, Jiri Olsa <jolsa@kernel.org>,
Adrian Hunter <adrian.hunter@intel.com>,
Peter Zijlstra <peterz@infradead.org>,
Ingo Molnar <mingo@kernel.org>,
LKML <linux-kernel@vger.kernel.org>,
linux-perf-users@vger.kernel.org,
Josh Poimboeuf <jpoimboe@kernel.org>,
Steven Rostedt <rostedt@goodmis.org>,
Mathieu Desnoyers <mathieu.desnoyers@efficios.com>,
Indu Bhagat <indu.bhagat@oracle.com>,
linux-toolchains@vger.kernel.org
Subject: Re: [RFC/PATCHSET 0/5] perf tools: Support deferred user callchains (v2)
Date: Mon, 23 Sep 2024 16:08:42 -0700 [thread overview]
Message-ID: <ZvH0-ny9gUzh_Jc7@google.com> (raw)
In-Reply-To: <f554bf58-a1df-4ef8-9045-2d1396f30ef3@linux.intel.com>
Hi Kan,
On Wed, Sep 18, 2024 at 04:26:56PM -0400, Liang, Kan wrote:
>
>
> On 2024-09-17 6:28 p.m., Namhyung Kim wrote:
> > Hello,
> >
> > This is a counterpart for Josh's kernel change v2 [1] to support deferred
> > user callchains. The change is transparent and users should not notice
> > anything with the deferred callchains.
> >
> > $ perf record -g sleep 1
> >
> > I added --[no-]merge-callchains option to control output of perf script.
> > You can verify it has the deferred callchains like this:
> >
> > $ perf script --no-merge-callchains
> > perf 801 [000] 18.031793: 1 cycles:P:
> > ffffffff91a14c36 __intel_pmu_enable_all.isra.0+0x56 ([kernel.kallsyms])
> > ffffffff91d373e9 perf_ctx_enable+0x39 ([kernel.kallsyms])
> > ffffffff91d36af7 event_function+0xd7 ([kernel.kallsyms])
> > ffffffff91d34222 remote_function+0x42 ([kernel.kallsyms])
> > ffffffff91c1ebe1 generic_exec_single+0x61 ([kernel.kallsyms])
> > ffffffff91c1edac smp_call_function_single+0xec ([kernel.kallsyms])
> > ffffffff91d37a9d event_function_call+0x10d ([kernel.kallsyms])
> > ffffffff91d33557 perf_event_for_each_child+0x37 ([kernel.kallsyms])
> > ffffffff91d47324 _perf_ioctl+0x204 ([kernel.kallsyms])
> > ffffffff91d47c43 perf_ioctl+0x33 ([kernel.kallsyms])
> > ffffffff91e2f216 __x64_sys_ioctl+0x96 ([kernel.kallsyms])
> > ffffffff9265f1ae do_syscall_64+0x9e ([kernel.kallsyms])
> > ffffffff92800130 entry_SYSCALL_64+0xb0 ([kernel.kallsyms])
> >
> > perf 801 [000] 18.031814: DEFERRED CALLCHAIN
> > 7fb5fc22034b __GI___ioctl+0x3b (/usr/lib/x86_64-linux-gnu/libc.so.6)
> >
> > ...
> >
> > When the callchain is merged (it's the default) it'd look like below:
> >
> > $ perf script
> > perf 801 [000] 18.031793: 1 cycles:P:
> > ffffffff91a14c36 __intel_pmu_enable_all.isra.0+0x56 ([kernel.kallsyms])
> > ffffffff91d373e9 perf_ctx_enable+0x39 ([kernel.kallsyms])
> > ffffffff91d36af7 event_function+0xd7 ([kernel.kallsyms])
> > ffffffff91d34222 remote_function+0x42 ([kernel.kallsyms])
> > ffffffff91c1ebe1 generic_exec_single+0x61 ([kernel.kallsyms])
> > ffffffff91c1edac smp_call_function_single+0xec ([kernel.kallsyms])
> > ffffffff91d37a9d event_function_call+0x10d ([kernel.kallsyms])
> > ffffffff91d33557 perf_event_for_each_child+0x37 ([kernel.kallsyms])
> > ffffffff91d47324 _perf_ioctl+0x204 ([kernel.kallsyms])
> > ffffffff91d47c43 perf_ioctl+0x33 ([kernel.kallsyms])
> > ffffffff91e2f216 __x64_sys_ioctl+0x96 ([kernel.kallsyms])
> > ffffffff9265f1ae do_syscall_64+0x9e ([kernel.kallsyms])
> > ffffffff92800130 entry_SYSCALL_64+0xb0 ([kernel.kallsyms])
> > 7fb5fc22034b __GI___ioctl+0x3b (/usr/lib/x86_64-linux-gnu/libc.so.6)
> >
> > ...
> >
> > Notice that the last line and it has the __GI___ioctl in the same
> > callchain. It should work with other tools like perf report.
>
>
> It seems it only works with perf report -D, when I test it on a
> non-hybrid machine.
> $perf record -e branches -g -c 3000000 ~/tchain_edit
> [ perf record: Woken up 1 times to write data ]
> [ perf record: Captured and wrote 0.397 MB perf.data ]
> $ perf report -D | tail -n 17
>
> Aggregated stats:
> TOTAL events: 8235
> MMAP events: 78 ( 0.9%)
> COMM events: 2 ( 0.0%)
> EXIT events: 1 ( 0.0%)
> SAMPLE events: 4060 (49.3%)
> MMAP2 events: 2 ( 0.0%)
> KSYMBOL events: 12 ( 0.1%)
> BPF_EVENT events: 12 ( 0.1%)
> CALLCHAIN_DEFERRED events: 4060 (49.3%)
> FINISHED_ROUND events: 3 ( 0.0%)
> ID_INDEX events: 1 ( 0.0%)
> THREAD_MAP events: 1 ( 0.0%)
> CPU_MAP events: 1 ( 0.0%)
> TIME_CONV events: 1 ( 0.0%)
> FINISHED_INIT events: 1 ( 0.0%)
> $ perf report
> Error:
> The perf.data data has no samples!
> # To display the perf.data header info, please use
> --header/--header-only options.
> #
>
>
> On a hybrid machine, perf record errors out.
>
> $perf record -g true
> [ perf record: Woken up 1 times to write data ]
> 0x58a8 [0x38]: failed to process type: 22 [Bad address]
> [ perf record: Captured and wrote 0.022 MB perf.data ]
Thanks for the test, I'll take a look what I missed.
Thanks,
Namhyung
prev parent reply other threads:[~2024-09-23 23:08 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-09-17 22:28 [RFC/PATCHSET 0/5] perf tools: Support deferred user callchains (v2) Namhyung Kim
2024-09-17 22:28 ` [PATCH 1/5] perf tools: Sync UAPI perf_event.h header Namhyung Kim
2024-09-17 22:28 ` [PATCH 2/5] perf tools: Minimal DEFERRED_CALLCHAIN support Namhyung Kim
2024-09-17 22:28 ` [PATCH 3/5] perf record: Enable defer_callchain for user callchains Namhyung Kim
2024-09-17 22:28 ` [PATCH 4/5] perf script: Display PERF_RECORD_CALLCHAIN_DEFERRED Namhyung Kim
2024-09-17 22:28 ` [PATCH 5/5] perf tools: Merge deferred user callchains Namhyung Kim
2024-09-18 6:38 ` [RFC/PATCHSET 0/5] perf tools: Support deferred user callchains (v2) Ian Rogers
2024-09-18 9:38 ` Namhyung Kim
2024-09-18 13:39 ` Ian Rogers
2024-09-23 23:07 ` Namhyung Kim
2024-09-18 20:26 ` Liang, Kan
2024-09-23 23:08 ` Namhyung Kim [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ZvH0-ny9gUzh_Jc7@google.com \
--to=namhyung@kernel.org \
--cc=acme@kernel.org \
--cc=adrian.hunter@intel.com \
--cc=indu.bhagat@oracle.com \
--cc=irogers@google.com \
--cc=jolsa@kernel.org \
--cc=jpoimboe@kernel.org \
--cc=kan.liang@linux.intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-perf-users@vger.kernel.org \
--cc=linux-toolchains@vger.kernel.org \
--cc=mathieu.desnoyers@efficios.com \
--cc=mingo@kernel.org \
--cc=peterz@infradead.org \
--cc=rostedt@goodmis.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.