From: Jonathan Cameron <Jonathan.Cameron@Huawei.com>
To: Jie Zhan <zhanjie9@hisilicon.com>
Cc: Yicong Yang <yangyicong@huawei.com>, <acme@kernel.org>,
<mark.rutland@arm.com>, <peterz@infradead.org>,
<mingo@redhat.com>, <james.clark@arm.com>,
<alexander.shishkin@linux.intel.com>,
<linux-perf-users@vger.kernel.org>,
<linux-kernel@vger.kernel.org>, <21cnbao@gmail.com>,
<tim.c.chen@intel.com>, <prime.zeng@hisilicon.com>,
<shenyang39@huawei.com>, <linuxarm@huawei.com>,
<yangyicong@hisilicon.com>
Subject: Re: [PATCH] perf stat: Support per-cluster aggregation
Date: Fri, 24 Mar 2023 12:30:31 +0000 [thread overview]
Message-ID: <20230324123031.0000013c@Huawei.com> (raw)
In-Reply-To: <20230324122422.00006a2b@Huawei.com>
On Fri, 24 Mar 2023 12:24:22 +0000
Jonathan Cameron <Jonathan.Cameron@Huawei.com> wrote:
> On Fri, 24 Mar 2023 10:34:33 +0800
> Jie Zhan <zhanjie9@hisilicon.com> wrote:
>
> > On 13/03/2023 16:59, Yicong Yang wrote:
> > > From: Yicong Yang <yangyicong@hisilicon.com>
> > >
> > > Some platforms have 'cluster' topology and CPUs in the cluster will
> > > share resources like L3 Cache Tag (for HiSilicon Kunpeng SoC) or L2
> > > cache (for Intel Jacobsville). Currently parsing and building cluster
> > > topology have been supported since [1].
> > >
> > > perf stat has already supported aggregation for other topologies like
> > > die or socket, etc. It'll be useful to aggregate per-cluster to find
> > > problems like L3T bandwidth contention or imbalance.
> > >
> > > This patch adds support for "--per-cluster" option for per-cluster
> > > aggregation. Also update the docs and related test. The output will
> > > be like:
> > >
> > > [root@localhost tmp]# perf stat -a -e LLC-load --per-cluster -- sleep 5
> > >
> > > Performance counter stats for 'system wide':
> > >
> > > S56-D0-CLS158 4 1,321,521,570 LLC-load
> > > S56-D0-CLS594 4 794,211,453 LLC-load
> > > S56-D0-CLS1030 4 41,623 LLC-load
> > > S56-D0-CLS1466 4 41,646 LLC-load
> > > S56-D0-CLS1902 4 16,863 LLC-load
> > > S56-D0-CLS2338 4 15,721 LLC-load
> > > S56-D0-CLS2774 4 22,671 LLC-load
> > > [...]
> > >
> > > [1] commit c5e22feffdd7 ("topology: Represent clusters of CPUs within a die")
> > >
> > > Signed-off-by: Yicong Yang <yangyicong@hisilicon.com>
> >
> > An end user may have to check sysfs to figure out what CPUs those
> > cluster IDs account for.
> >
> > Any better method to show the mapping between CPUs and cluster IDs?
>
> The cluster code is capable of using the ACPI_PPTT_ACPI_PROCESSOR_ID field
> if valid for the cluster level of PPTT.
>
> The numbers in the example above look like offsets into the PPTT table
> so I think the PPTT table is missing that information.
>
> Whilst not a great description anyway (it's just an index), the UUID
> that would be in there can convey more info on which cluster this is.
>
>
> >
> > Perhaps adding a conditional cluster id (when there are clusters) in the
> > "--per-core" output may help.
>
> That's an interesting idea. You'd want to include the other levels
> if doing that. So whenever you do a --per-xxx it also provides the
> cluster / die / node / socket etc as relevant 'above' the level of xxx
> Fun is that node and die can flip which would make this tricky to do.
Ignore me on this. I'd not looked at the patch closely when I wrote
this. Clearly a lot of this information is already provided - the
suggestion was to consider adding cluster to that mix which makes
sense to me.
Jonathan
>
> Jonathan
>
> >
> > Apart form that, this works well on my aarch64.
> >
> > Tested-by: Jie Zhan <zhanjie9@hisilicon.com>
>
> >
> >
>
next prev parent reply other threads:[~2023-03-24 12:31 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-03-13 8:59 [PATCH] perf stat: Support per-cluster aggregation Yicong Yang
2023-03-23 13:03 ` Yicong Yang
2023-03-24 2:34 ` Jie Zhan
2023-03-24 12:24 ` Jonathan Cameron
2023-03-24 12:30 ` Jonathan Cameron [this message]
2023-03-27 6:20 ` Yicong Yang
2023-03-24 18:05 ` Chen, Tim C
2023-03-27 4:03 ` Yicong Yang
2023-03-29 6:47 ` Namhyung Kim
2023-03-29 12:46 ` Yicong Yang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20230324123031.0000013c@Huawei.com \
--to=jonathan.cameron@huawei.com \
--cc=21cnbao@gmail.com \
--cc=acme@kernel.org \
--cc=alexander.shishkin@linux.intel.com \
--cc=james.clark@arm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-perf-users@vger.kernel.org \
--cc=linuxarm@huawei.com \
--cc=mark.rutland@arm.com \
--cc=mingo@redhat.com \
--cc=peterz@infradead.org \
--cc=prime.zeng@hisilicon.com \
--cc=shenyang39@huawei.com \
--cc=tim.c.chen@intel.com \
--cc=yangyicong@hisilicon.com \
--cc=yangyicong@huawei.com \
--cc=zhanjie9@hisilicon.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.