public inbox for linux-perf-users@vger.kernel.org
 help / color / mirror / Atom feed
From: Namhyung Kim <namhyung@kernel.org>
To: Qinxin Xia <xiaqinxin@huawei.com>
Cc: irogers@google.com, peterz@infradead.org, mingo@redhat.com,
	acme@kernel.org, wangyushan12@huawei.com,
	hejunhao3@h-partners.com, jonathan.cameron@huawei.com,
	linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org,
	mark.rutland@arm.com, alexander.shishkin@linux.intel.com,
	jolsa@kernel.org, adrian.hunter@intel.com
Subject: Re: [PATCH 1/1] perf: Add --pmu-filter option for filtering PMUs
Date: Tue, 3 Mar 2026 17:19:48 -0800	[thread overview]
Message-ID: <aaeItD2RYk0DZU0x@google.com> (raw)
In-Reply-To: <20260225034125.2157622-2-xiaqinxin@huawei.com>

On Wed, Feb 25, 2026 at 11:41:25AM +0800, Qinxin Xia wrote:
> This patch adds a new --pmu-filter option to perf-stat command to allow
> filtering events on specific PMUs. This is useful when there are
> multiple PMUs with same type (e.g. hisi_sicl2_cpa0 and hisi_sicl0_cpa0).
> 
> perf event:
> [root@localhost tmp]# perf stat -e cpa_cycles
>  Performance counter stats for 'system wide':
> 
>      3,005,509,659      hisi_sicl0_cpa0/cpa_cycles/
>      3,005,502,319      hisi_sicl2_cpa0/cpa_cycles/
> 
>        3.005460970 seconds time elapsed
> 
> [root@localhost tmp]# perf stat --pmu-filter hisi_sicl0_cpa0 -e cpa_cycles
>  Performance counter stats for 'system wide':
> 
>      3,003,899,571      cpa_cycles
> 
>        3.003864470 seconds time elapsed

In this case, you can just use -e /hisi_sicl0_cpa0/cpa_cycles/.

> 
> perf metric:
> [root@localhost tmp]# perf stat -M cpa_p0_avg_bw
>  Performance counter stats for 'system wide':
> 
>     19,417,779,115      hisi_sicl0_cpa0/cpa_cycles/      #     0.00 cpa_p0_avg_bw
>                  0      hisi_sicl0_cpa0/cpa_p0_wr_dat/
>                  0      hisi_sicl0_cpa0/cpa_p0_rd_dat_64b/
>                  0      hisi_sicl0_cpa0/cpa_p0_rd_dat_32b/
>     19,417,751,103      hisi_sicl10_cpa0/cpa_cycles/     #     0.00 cpa_p0_avg_bw
>                  0      hisi_sicl10_cpa0/cpa_p0_wr_dat/
>                  0      hisi_sicl10_cpa0/cpa_p0_rd_dat_64b/
>                  0      hisi_sicl10_cpa0/cpa_p0_rd_dat_32b/
>     19,417,730,679      hisi_sicl2_cpa0/cpa_cycles/      #     0.31 cpa_p0_avg_bw
>         75,635,749      hisi_sicl2_cpa0/cpa_p0_wr_dat/
>         18,520,640      hisi_sicl2_cpa0/cpa_p0_rd_dat_64b/
>                  0      hisi_sicl2_cpa0/cpa_p0_rd_dat_32b/
>     19,417,674,227      hisi_sicl8_cpa0/cpa_cycles/      #     0.00 cpa_p0_avg_bw
>                  0      hisi_sicl8_cpa0/cpa_p0_wr_dat/
>                  0      hisi_sicl8_cpa0/cpa_p0_rd_dat_64b/
>                  0      hisi_sicl8_cpa0/cpa_p0_rd_dat_32b/
> 
>       19.417734480 seconds time elapsed
> 
> [root@localhost tmp]# perf stat --pmu-filter hisi_sicl2_cpa0 -M cpa_p0_avg_bw
>  Performance counter stats for 'system wide':
> 
>      6,234,093,559      cpa_cycles                       #     0.60 cpa_p0_avg_bw
>         50,548,465      cpa_p0_wr_dat
>          7,552,182      cpa_p0_rd_dat_64b
>                  0      cpa_p0_rd_dat_32b
> 
>        6.234139320 seconds time elapsed

But right, metrics seem to need this.  Does this support wildcard match
for multiple PMUs?

Thanks,
Namhyung

> 
> Signed-off-by: Qinxin Xia <xiaqinxin@huawei.com>
> ---
>  tools/perf/Documentation/perf-stat.txt |  4 ++++
>  tools/perf/builtin-stat.c              | 26 ++++++++++++++++++++++++++
>  tools/perf/util/metricgroup.c          | 18 +++++++++++++-----
>  3 files changed, 43 insertions(+), 5 deletions(-)
> 
> diff --git a/tools/perf/Documentation/perf-stat.txt b/tools/perf/Documentation/perf-stat.txt
> index 7cccc3a847d1..b72a29c9223c 100644
> --- a/tools/perf/Documentation/perf-stat.txt
> +++ b/tools/perf/Documentation/perf-stat.txt
> @@ -578,6 +578,10 @@ $ perf config stat.no-csv-summary=true
>  Only enable events on applying cpu with this type for hybrid platform
>  (e.g. core or atom)"
>  
> +--pmu-filter::
> +Only enable events on applying pmu with specified for multiple
> +pmus with same type (e.g. hisi_sicl2_cpa0 or hisi_sicl0_cpa0)
> +
>  EXAMPLES
>  --------
>  
> diff --git a/tools/perf/builtin-stat.c b/tools/perf/builtin-stat.c
> index 73c2ba7e3076..67fb91816c0b 100644
> --- a/tools/perf/builtin-stat.c
> +++ b/tools/perf/builtin-stat.c
> @@ -1214,6 +1214,28 @@ static int parse_cputype(const struct option *opt,
>  	return 0;
>  }
>  
> +static int parse_pmu_filter(const struct option *opt,
> +			   const char *str,
> +			   int unset __maybe_unused)
> +{
> +	const struct perf_pmu *pmu;
> +	struct evlist *evlist = *(struct evlist **)opt->value;
> +
> +	if (!list_empty(&evlist->core.entries)) {
> +		fprintf(stderr, "Must define pmu-filter before events/metrics\n");
> +		return -1;
> +	}
> +
> +	pmu = perf_pmus__pmu_for_pmu_filter(str);
> +	if (!pmu) {
> +		fprintf(stderr, "--pmu-filter %s is not supported!\n", str);
> +		return -1;
> +	}
> +	parse_events_option_args.pmu_filter = pmu->name;
> +
> +	return 0;
> +}
> +
>  static int parse_cache_level(const struct option *opt,
>  			     const char *str,
>  			     int unset __maybe_unused)
> @@ -2561,6 +2583,10 @@ int cmd_stat(int argc, const char **argv)
>  			"Only enable events on applying cpu with this type "
>  			"for hybrid platform (e.g. core or atom)",
>  			parse_cputype),
> +		OPT_CALLBACK(0, "pmu-filter", &evsel_list, "pmu",
> +			"Only enable events on applying pmu with specified "
> +			"for multiple pmus with same type(e.g. hisi_sicl2_cpa0 or hisi_sicl0_cpa0)",
> +			parse_pmu_filter),
>  #ifdef HAVE_LIBPFM
>  		OPT_CALLBACK(0, "pfm-events", &evsel_list, "event",
>  			"libpfm4 event selector. use 'perf list' to list available events",
> diff --git a/tools/perf/util/metricgroup.c b/tools/perf/util/metricgroup.c
> index 46bf4dfeebc8..89b83ca38483 100644
> --- a/tools/perf/util/metricgroup.c
> +++ b/tools/perf/util/metricgroup.c
> @@ -387,8 +387,13 @@ static bool match_pm_metric_or_groups(const struct pmu_metric *pm, const char *p
>  				      const char *metric_or_groups)
>  {
>  	const char *pm_pmu = pm->pmu ?: "cpu";
> +	struct perf_pmu *perf_pmu = NULL;
>  
> -	if (strcmp(pmu, "all") && strcmp(pm_pmu, pmu))
> +	if (pm->pmu)
> +		perf_pmu = perf_pmus__find(pm->pmu);
> +
> +	if (strcmp(pmu, "all") && strcmp(pm_pmu, pmu) &&
> +	   (perf_pmu && !perf_pmu__name_wildcard_match(perf_pmu, pmu)))
>  		return false;
>  
>  	return match_metric_or_groups(pm->metric_group, metric_or_groups) ||
> @@ -1259,7 +1264,8 @@ static int build_combined_expr_ctx(const struct list_head *metric_list,
>  static int parse_ids(bool metric_no_merge, bool fake_pmu,
>  		     struct expr_parse_ctx *ids, const char *modifier,
>  		     bool group_events, const bool tool_events[TOOL_PMU__EVENT_MAX],
> -		     struct evlist **out_evlist)
> +		     struct evlist **out_evlist,
> +		     const char *filter_pmu)
>  {
>  	struct parse_events_error parse_error;
>  	struct evlist *parsed_evlist;
> @@ -1313,7 +1319,7 @@ static int parse_ids(bool metric_no_merge, bool fake_pmu,
>  	}
>  	pr_debug("Parsing metric events '%s'\n", events.buf);
>  	parse_events_error__init(&parse_error);
> -	ret = __parse_events(parsed_evlist, events.buf, /*pmu_filter=*/NULL,
> +	ret = __parse_events(parsed_evlist, events.buf, filter_pmu,
>  			     &parse_error, fake_pmu, /*warn_if_reordered=*/false,
>  			     /*fake_tp=*/false);
>  	if (ret) {
> @@ -1416,7 +1422,8 @@ static int parse_groups(struct evlist *perf_evlist,
>  					/*modifier=*/NULL,
>  					/*group_events=*/false,
>  					tool_events,
> -					&combined_evlist);
> +					&combined_evlist,
> +					(pmu && strcmp(pmu, "all") == 0) ? NULL : pmu);
>  		}
>  		if (combined)
>  			expr__ctx_free(combined);
> @@ -1471,7 +1478,8 @@ static int parse_groups(struct evlist *perf_evlist,
>  		}
>  		if (!metric_evlist) {
>  			ret = parse_ids(metric_no_merge, fake_pmu, m->pctx, m->modifier,
> -					m->group_events, tool_events, &m->evlist);
> +					m->group_events, tool_events, &m->evlist,
> +					(pmu && strcmp(pmu, "all") == 0) ? NULL : pmu);
>  			if (ret)
>  				goto out;
>  
> -- 
> 2.33.0
> 

  reply	other threads:[~2026-03-04  1:19 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-02-25  3:41 [PATCH 0/1] perf: Add --pmu-filter option for filtering PMUs Qinxin Xia
2026-02-25  3:41 ` [PATCH 1/1] " Qinxin Xia
2026-03-04  1:19   ` Namhyung Kim [this message]
2026-03-05  6:14     ` Qinxin Xia

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aaeItD2RYk0DZU0x@google.com \
    --to=namhyung@kernel.org \
    --cc=acme@kernel.org \
    --cc=adrian.hunter@intel.com \
    --cc=alexander.shishkin@linux.intel.com \
    --cc=hejunhao3@h-partners.com \
    --cc=irogers@google.com \
    --cc=jolsa@kernel.org \
    --cc=jonathan.cameron@huawei.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-perf-users@vger.kernel.org \
    --cc=mark.rutland@arm.com \
    --cc=mingo@redhat.com \
    --cc=peterz@infradead.org \
    --cc=wangyushan12@huawei.com \
    --cc=xiaqinxin@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox