* [PATCH 1/2] perf/x86/cstate: Allow reading the package statistics from local CPU
2023-08-23 5:56 [PATCH 0/2] perf/x86: Package residency counter improvements Tero Kristo
@ 2023-08-23 5:56 ` Tero Kristo
2023-08-23 5:56 ` [PATCH 2/2] perf/core: Allow reading package events from perf_event_read_local Tero Kristo
1 sibling, 0 replies; 3+ messages in thread
From: Tero Kristo @ 2023-08-23 5:56 UTC (permalink / raw)
To: dave.hansen, tglx, x86, bp
Cc: artem.bityutskiy, acme, bpf, namhyung, mingo, linux-kernel,
linux-perf-users, irogers, hpa, mark.rutland, jolsa,
adrian.hunter, alexander.shishkin, peterz, Kan Liang
The MSR registers for reading the package residency counters are
available on every CPU of the package. To avoid doing unnecessary SMP
calls to read the values for these from the various CPUs inside a
package, allow reading them from any CPU of the package.
Signed-off-by: Tero Kristo <tero.kristo@linux.intel.com>
Suggested-by: Kan Liang <kan.liang@intel.com>
---
arch/x86/events/intel/cstate.c | 3 +++
1 file changed, 3 insertions(+)
diff --git a/arch/x86/events/intel/cstate.c b/arch/x86/events/intel/cstate.c
index 835862c548cc..5cfe021edc65 100644
--- a/arch/x86/events/intel/cstate.c
+++ b/arch/x86/events/intel/cstate.c
@@ -336,6 +336,9 @@ static int cstate_pmu_event_init(struct perf_event *event)
cfg = array_index_nospec((unsigned long)cfg, PERF_CSTATE_PKG_EVENT_MAX);
if (!(pkg_msr_mask & (1 << cfg)))
return -EINVAL;
+
+ event->event_caps |= PERF_EV_CAP_READ_ACTIVE_PKG;
+
event->hw.event_base = pkg_msr[cfg].msr;
cpu = cpumask_any_and(&cstate_pkg_cpu_mask,
topology_die_cpumask(event->cpu));
--
2.25.1
^ permalink raw reply related [flat|nested] 3+ messages in thread
* [PATCH 2/2] perf/core: Allow reading package events from perf_event_read_local
2023-08-23 5:56 [PATCH 0/2] perf/x86: Package residency counter improvements Tero Kristo
2023-08-23 5:56 ` [PATCH 1/2] perf/x86/cstate: Allow reading the package statistics from local CPU Tero Kristo
@ 2023-08-23 5:56 ` Tero Kristo
1 sibling, 0 replies; 3+ messages in thread
From: Tero Kristo @ 2023-08-23 5:56 UTC (permalink / raw)
To: dave.hansen, tglx, x86, bp
Cc: artem.bityutskiy, acme, bpf, namhyung, mingo, linux-kernel,
linux-perf-users, irogers, hpa, mark.rutland, jolsa,
adrian.hunter, alexander.shishkin, peterz
Per-package perf events are typically registered with a single CPU only,
however they can be read across all the CPUs within the package.
Currently perf_event_read maps the event CPU according to the topology
information to avoid an unnecessary SMP call, however
perf_event_read_local deals with hard values and rejects a read with a
failure if the CPU is not the one exactly registered. Allow similar
mapping within the perf_event_read_local if the perf event in question
can support this.
This allows users like BPF code to read the package perf events properly
across different CPUs within a package.
Signed-off-by: Tero Kristo <tero.kristo@linux.intel.com>
---
kernel/events/core.c | 11 ++++++++---
1 file changed, 8 insertions(+), 3 deletions(-)
diff --git a/kernel/events/core.c b/kernel/events/core.c
index 78ae7b6f90fd..37db7c003b79 100644
--- a/kernel/events/core.c
+++ b/kernel/events/core.c
@@ -4528,6 +4528,7 @@ int perf_event_read_local(struct perf_event *event, u64 *value,
{
unsigned long flags;
int ret = 0;
+ int event_cpu;
/*
* Disabling interrupts avoids all counter scheduling (context
@@ -4551,15 +4552,19 @@ int perf_event_read_local(struct perf_event *event, u64 *value,
goto out;
}
+ /* Allow reading a per-package perf-event from local CPU also */
+ event_cpu = READ_ONCE(event->oncpu);
+ event_cpu = __perf_event_read_cpu(event, event_cpu);
+
/* If this is a per-CPU event, it must be for this CPU */
if (!(event->attach_state & PERF_ATTACH_TASK) &&
- event->cpu != smp_processor_id()) {
+ event_cpu != smp_processor_id()) {
ret = -EINVAL;
goto out;
}
/* If this is a pinned event it must be running on this CPU */
- if (event->attr.pinned && event->oncpu != smp_processor_id()) {
+ if (event->attr.pinned && event_cpu != smp_processor_id()) {
ret = -EBUSY;
goto out;
}
@@ -4569,7 +4574,7 @@ int perf_event_read_local(struct perf_event *event, u64 *value,
* or local to this CPU. Furthermore it means its ACTIVE (otherwise
* oncpu == -1).
*/
- if (event->oncpu == smp_processor_id())
+ if (event_cpu == smp_processor_id())
event->pmu->read(event);
*value = local64_read(&event->count);
--
2.25.1
^ permalink raw reply related [flat|nested] 3+ messages in thread