From: Namhyung Kim <namhyung@kernel.org>
To: Ian Rogers <irogers@google.com>
Cc: Peter Zijlstra <peterz@infradead.org>,
Ingo Molnar <mingo@redhat.com>,
Arnaldo Carvalho de Melo <acme@kernel.org>,
Alexander Shishkin <alexander.shishkin@linux.intel.com>,
Jiri Olsa <jolsa@kernel.org>,
Adrian Hunter <adrian.hunter@intel.com>,
"Dr. David Alan Gilbert" <linux@treblig.org>,
Yang Li <yang.lee@linux.alibaba.com>,
James Clark <james.clark@linaro.org>,
Thomas Falcon <thomas.falcon@intel.com>,
Thomas Richter <tmricht@linux.ibm.com>,
linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org,
Andi Kleen <ak@linux.intel.com>,
Dapeng Mi <dapeng1.mi@linux.intel.com>
Subject: Re: [PATCH v4 04/10] perf stat-shadow: Read tool events directly
Date: Mon, 17 Nov 2025 22:45:44 -0800 [thread overview]
Message-ID: <aRwWGPs0IQoQesoo@google.com> (raw)
In-Reply-To: <CAP-5=fUJFFOQzbPyef0F0UixFFD4Mtfh2iYc6aMXsT0mr_DXqA@mail.gmail.com>
On Mon, Nov 17, 2025 at 08:36:34PM -0800, Ian Rogers wrote:
> On Mon, Nov 17, 2025 at 6:30 PM Namhyung Kim <namhyung@kernel.org> wrote:
> >
> > On Thu, Nov 13, 2025 at 10:05:10AM -0800, Ian Rogers wrote:
> > > When reading time values for metrics don't use the globals updated in
> > > builtin-stat, just read the events as regular events. The only
> > > exception is for time events where nanoseconds need converting to
> > > seconds as metrics assume time metrics are in seconds.
> > >
> > > Signed-off-by: Ian Rogers <irogers@google.com>
> > > ---
[SNIP]
> > > - case TOOL_PMU__EVENT_SYSTEM_TIME:
> > > - stats = &ru_stats.ru_stime_usec_stat;
> > > - scale = 1e-6;
> > > - break;
> >
> > I think this was broken as it seems to be converted to nsec in the
> > update_rusage_stats().
>
> The global utime/stime is in usecs as in the name. In the tool_pmu it
> is converted to nanoseconds, for consistency with duration_time. In
> the new code the counters are all in nanoseconds and so the same
> scaling factor can be applied.
What do you mean by global utime/stime?
>
> > > - case TOOL_PMU__EVENT_NONE:
> > > - pr_err("Invalid tool event 'none'");
> > > - abort();
> > > - case TOOL_PMU__EVENT_MAX:
> > > - pr_err("Invalid tool event 'max'");
> > > - abort();
> > > - case TOOL_PMU__EVENT_HAS_PMEM:
> > > - case TOOL_PMU__EVENT_NUM_CORES:
> > > - case TOOL_PMU__EVENT_NUM_CPUS:
> > > - case TOOL_PMU__EVENT_NUM_CPUS_ONLINE:
> > > - case TOOL_PMU__EVENT_NUM_DIES:
> > > - case TOOL_PMU__EVENT_NUM_PACKAGES:
> > > - case TOOL_PMU__EVENT_SLOTS:
> > > - case TOOL_PMU__EVENT_SMT_ON:
> > > - case TOOL_PMU__EVENT_SYSTEM_TSC_FREQ:
> > > - case TOOL_PMU__EVENT_CORE_WIDE:
> > > - case TOOL_PMU__EVENT_TARGET_CPU:
> > > - default:
> > > - pr_err("Unexpected tool event '%s'", evsel__name(metric_events[i]));
> > > - abort();
> > > }
> > > - val = avg_stats(stats) * scale;
> > > - source_count = 1;
> > > - } else {
> > > - struct perf_stat_evsel *ps = metric_events[i]->stats;
> > > - struct perf_stat_aggr *aggr;
> > > -
> > > + }
> > > + /* Time events are always on CPU0, the first aggregation index. */
> > > + aggr = &ps->aggr[is_tool_time ? tool_aggr_idx : aggr_idx];
> > > + if (!aggr || !metric_events[i]->supported) {
> > > /*
> > > - * If there are multiple uncore PMUs and we're not
> > > - * reading the leader's stats, determine the stats for
> > > - * the appropriate uncore PMU.
> > > + * Not supported events will have a count of 0, which
> > > + * can be confusing in a metric. Explicitly set the
> > > + * value to NAN. Not counted events (enable time of 0)
> > > + * are read as 0.
> > > */
> > > - if (evsel && evsel->metric_leader &&
> > > - evsel->pmu != evsel->metric_leader->pmu &&
> > > - mexp->metric_events[i]->pmu == evsel->metric_leader->pmu) {
> > > - struct evsel *pos;
> > > -
> > > - evlist__for_each_entry(evsel->evlist, pos) {
> > > - if (pos->pmu != evsel->pmu)
> > > - continue;
> > > - if (pos->metric_leader != mexp->metric_events[i])
> > > - continue;
> > > - ps = pos->stats;
> > > - source_count = 1;
> > > - break;
> > > - }
> > > - }
> > > - aggr = &ps->aggr[aggr_idx];
> > > - if (!aggr)
> > > - break;
> > > -
> > > - if (!metric_events[i]->supported) {
> > > - /*
> > > - * Not supported events will have a count of 0,
> > > - * which can be confusing in a
> > > - * metric. Explicitly set the value to NAN. Not
> > > - * counted events (enable time of 0) are read as
> > > - * 0.
> > > - */
> > > - val = NAN;
> > > - source_count = 0;
> > > - } else {
> > > - val = aggr->counts.val;
> > > - if (!source_count)
> > > - source_count = evsel__source_count(metric_events[i]);
> > > - }
> > > + val = NAN;
> > > + source_count = 0;
> > > + } else {
> > > + val = aggr->counts.val;
> > > + if (is_tool_time)
> > > + val *= 1e-9; /* Convert time event nanoseconds to seconds. */
> >
> > And this code treats all time events are in nsec now.
>
> That's correct: user_time, system_time and duration_time are all
> counters that give a count in nanoseconds. However, in the metrics, I
> guess as the counts are doubles, the time has always been in whole
> seconds.
Yep, I think it's ok.
Thanks,
Namhyung
next prev parent reply other threads:[~2025-11-18 6:45 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-11-13 18:05 [PATCH v4 00/10] perf stat fixes and improvements Ian Rogers
2025-11-13 18:05 ` [PATCH v4 01/10] libperf cpumap: Reduce allocations and sorting in intersect Ian Rogers
2025-11-13 18:05 ` [PATCH v4 02/10] perf pmu: perf_cpu_map__new_int to avoid parsing a string Ian Rogers
2025-11-13 18:05 ` [PATCH v4 03/10] perf tool_pmu: Use old_count when computing count values for time events Ian Rogers
2025-11-13 18:05 ` [PATCH v4 04/10] perf stat-shadow: Read tool events directly Ian Rogers
2025-11-18 2:30 ` Namhyung Kim
2025-11-18 4:36 ` Ian Rogers
2025-11-18 6:45 ` Namhyung Kim [this message]
2025-11-13 18:05 ` [PATCH v4 05/10] perf stat: Reduce scope of ru_stats Ian Rogers
2025-11-18 2:31 ` Namhyung Kim
2025-11-13 18:05 ` [PATCH v4 06/10] perf stat: Reduce scope of walltime_nsecs_stats Ian Rogers
2025-11-13 18:05 ` [PATCH v4 07/10] perf tool_pmu: More accurately set the cpus for tool events Ian Rogers
2026-02-03 17:37 ` Andres Freund
2026-02-03 23:05 ` Ian Rogers
2026-02-03 23:27 ` Andres Freund
2026-02-03 23:31 ` Ian Rogers
2025-11-13 18:05 ` [PATCH v4 08/10] perf evlist: Reduce affinity use and move into iterator, fix no affinity Ian Rogers
2025-11-13 18:05 ` [PATCH v4 09/10] perf stat: Read tool events last Ian Rogers
2025-11-18 2:35 ` Namhyung Kim
2025-11-18 4:38 ` Ian Rogers
2025-11-18 6:46 ` Namhyung Kim
2025-11-13 18:05 ` [PATCH v4 10/10] perf stat: Add no-affinity flag Ian Rogers
2025-11-18 2:40 ` Namhyung Kim
2025-11-18 4:32 ` Ian Rogers
2025-11-18 6:50 ` Namhyung Kim
2025-11-18 18:00 ` [PATCH v4 00/10] perf stat fixes and improvements Namhyung Kim
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aRwWGPs0IQoQesoo@google.com \
--to=namhyung@kernel.org \
--cc=acme@kernel.org \
--cc=adrian.hunter@intel.com \
--cc=ak@linux.intel.com \
--cc=alexander.shishkin@linux.intel.com \
--cc=dapeng1.mi@linux.intel.com \
--cc=irogers@google.com \
--cc=james.clark@linaro.org \
--cc=jolsa@kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-perf-users@vger.kernel.org \
--cc=linux@treblig.org \
--cc=mingo@redhat.com \
--cc=peterz@infradead.org \
--cc=thomas.falcon@intel.com \
--cc=tmricht@linux.ibm.com \
--cc=yang.lee@linux.alibaba.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.