* [PATCH] arc: perf: Enable generic "cache-references" and "cache-misses" events
@ 2016-08-25 11:47 Alexey Brodkin
2016-08-26 17:30 ` Vineet Gupta
0 siblings, 1 reply; 3+ messages in thread
From: Alexey Brodkin @ 2016-08-25 11:47 UTC (permalink / raw)
To: linux-kernel
Cc: linux-snps-arc, Alexey Brodkin, Vineet Gupta, Thomas Gleixner,
Arnaldo Carvalho de Melo, Peter Zijlstra, stable
We used to live with PERF_COUNT_HW_CACHE_REFERENCES and
PERF_COUNT_HW_CACHE_REFERENCES not specified on ARC.
Those events are actually aliases to 2 cache events that we do support
and so this change sets "cache-reference" and "cache-misses" events
in the same way as "L1-dcache-loads" and L1-dcache-load-misses.
And while at it adding debug info for cache events as well as doing a
subtle fix in HW events debug info - config value is much better
represented by hex so we may see not only event index but as well other
control bits set (if they exist).
Signed-off-by: Alexey Brodkin <abrodkin@synopsys.com>
Cc: Vineet Gupta <vgupta@synopsys.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: linux-snps-arc@lists.infradead.org
Cc: linux-kernel@vger.kernel.org
Cc: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: stable@vger.kernel.org
---
arch/arc/include/asm/perf_event.h | 3 +++
arch/arc/kernel/perf_event.c | 6 ++++--
2 files changed, 7 insertions(+), 2 deletions(-)
diff --git a/arch/arc/include/asm/perf_event.h b/arch/arc/include/asm/perf_event.h
index 5f07176..9185541 100644
--- a/arch/arc/include/asm/perf_event.h
+++ b/arch/arc/include/asm/perf_event.h
@@ -118,6 +118,9 @@ static const char * const arc_pmu_ev_hw_map[] = {
[PERF_COUNT_ARC_ICM] = "icm", /* I-cache Miss */
[PERF_COUNT_ARC_EDTLB] = "edtlb", /* D-TLB Miss */
[PERF_COUNT_ARC_EITLB] = "eitlb", /* I-TLB Miss */
+
+ [PERF_COUNT_HW_CACHE_REFERENCES] = "imemrdc", /* Instr: mem read cached */
+ [PERF_COUNT_HW_CACHE_MISSES] = "dclm", /* D-cache Load Miss */
};
#define C(_x) PERF_COUNT_HW_CACHE_##_x
diff --git a/arch/arc/kernel/perf_event.c b/arch/arc/kernel/perf_event.c
index 08f03d9..2ce24e7 100644
--- a/arch/arc/kernel/perf_event.c
+++ b/arch/arc/kernel/perf_event.c
@@ -179,8 +179,8 @@ static int arc_pmu_event_init(struct perf_event *event)
if (arc_pmu->ev_hw_idx[event->attr.config] < 0)
return -ENOENT;
hwc->config |= arc_pmu->ev_hw_idx[event->attr.config];
- pr_debug("init event %d with h/w %d \'%s\'\n",
- (int) event->attr.config, (int) hwc->config,
+ pr_debug("init event %d with h/w %08x \'%s\'\n",
+ (int)event->attr.config, (int)hwc->config,
arc_pmu_ev_hw_map[event->attr.config]);
return 0;
@@ -189,6 +189,8 @@ static int arc_pmu_event_init(struct perf_event *event)
if (ret < 0)
return ret;
hwc->config |= arc_pmu->ev_hw_idx[ret];
+ pr_debug("init cache event with h/w %08x \'%s\'\n",
+ (int)hwc->config, arc_pmu_ev_hw_map[ret]);
return 0;
default:
return -ENOENT;
--
2.7.4
^ permalink raw reply related [flat|nested] 3+ messages in thread
* Re: [PATCH] arc: perf: Enable generic "cache-references" and "cache-misses" events
2016-08-25 11:47 [PATCH] arc: perf: Enable generic "cache-references" and "cache-misses" events Alexey Brodkin
@ 2016-08-26 17:30 ` Vineet Gupta
2016-09-14 17:53 ` Vineet Gupta
0 siblings, 1 reply; 3+ messages in thread
From: Vineet Gupta @ 2016-08-26 17:30 UTC (permalink / raw)
To: Alexey Brodkin, linux-kernel@vger.kernel.org, Peter Zijlstra
Cc: linux-snps-arc@lists.infradead.org, Thomas Gleixner,
Arnaldo Carvalho de Melo, stable@vger.kernel.org
On 08/25/2016 04:49 AM, Alexey Brodkin wrote:
> ...
> [PERF_COUNT_ARC_EDTLB] = "edtlb", /* D-TLB Miss */
> [PERF_COUNT_ARC_EITLB] = "eitlb", /* I-TLB Miss */
> +
> + [PERF_COUNT_HW_CACHE_REFERENCES] = "imemrdc", /* Instr: mem read cached */
> + [PERF_COUNT_HW_CACHE_MISSES] = "dclm", /* D-cache Load Miss */
I think this is duplicating a mistake we already have. I vaguely remember when
doing some hackbench profiling last year with range based profiling confined to
memset routine and saw that L1-dcache-misses was counting zero. This is because it
only counts LD misses while memset only does ST.
Performance counter stats for '/sbin/hackbench':
0 L1-dcache-misses
0 L1-dcache-load-misses
1846082 L1-dcache-store-misses
@PeterZ do you concur that is wrong and we ought to setup 2 counters to do this
correctly ?
-Vineet
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [PATCH] arc: perf: Enable generic "cache-references" and "cache-misses" events
2016-08-26 17:30 ` Vineet Gupta
@ 2016-09-14 17:53 ` Vineet Gupta
0 siblings, 0 replies; 3+ messages in thread
From: Vineet Gupta @ 2016-09-14 17:53 UTC (permalink / raw)
To: Alexey Brodkin, linux-kernel@vger.kernel.org, Peter Zijlstra
Cc: Arnaldo Carvalho de Melo, Thomas Gleixner,
linux-snps-arc@lists.infradead.org, stable@vger.kernel.org
On 08/26/2016 10:30 AM, Vineet Gupta wrote:
> On 08/25/2016 04:49 AM, Alexey Brodkin wrote:
>> ...
>> [PERF_COUNT_ARC_EDTLB] = "edtlb", /* D-TLB Miss */
>> [PERF_COUNT_ARC_EITLB] = "eitlb", /* I-TLB Miss */
>> +
>> + [PERF_COUNT_HW_CACHE_REFERENCES] = "imemrdc", /* Instr: mem read cached */
>> + [PERF_COUNT_HW_CACHE_MISSES] = "dclm", /* D-cache Load Miss */
>
> I think this is duplicating a mistake we already have. I vaguely remember when
> doing some hackbench profiling last year with range based profiling confined to
> memset routine and saw that L1-dcache-misses was counting zero. This is because it
> only counts LD misses while memset only does ST.
So given that this is the best we got, I'm going to merge this anyways.
-Vineet
>
> Performance counter stats for '/sbin/hackbench':
>
> 0 L1-dcache-misses
> 0 L1-dcache-load-misses
> 1846082 L1-dcache-store-misses
>
>
> @PeterZ do you concur that is wrong and we ought to setup 2 counters to do this
> correctly ?
>
> -Vineet
>
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2016-09-14 17:53 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-08-25 11:47 [PATCH] arc: perf: Enable generic "cache-references" and "cache-misses" events Alexey Brodkin
2016-08-26 17:30 ` Vineet Gupta
2016-09-14 17:53 ` Vineet Gupta
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).