* [PATCH] arm64: perf: Allow more than one cycle counter to be used
@ 2017-07-01 6:33 Pratyush Anand
2017-07-17 3:26 ` Pratyush Anand
0 siblings, 1 reply; 4+ messages in thread
From: Pratyush Anand @ 2017-07-01 6:33 UTC (permalink / raw)
To: linux-arm-kernel
Currently:
$ perf stat -e cycles:u -e cycles:k true
Performance counter stats for 'true':
2,24,699 cycles:u
<not counted> cycles:k (0.00%)
0.000788087 seconds time elapsed
We can not count more than one cycle counter in one instance,because we
allow to map cycle counter into PMCCNTR_EL0 only. However, if I did not
miss anything then specification do not prohibit to use PMEVCNTR<n>_EL0
for cycle count as well.
Modify the code so that it still prefers to use PMCCNTR_EL0 for cycle
counter, however allow to use PMEVCNTR<n>_EL0 if PMCCNTR_EL0 is already
in use.
After this patch:
$ perf stat -e cycles:u -e cycles:k true
Performance counter stats for 'true':
2,17,310 cycles:u
7,40,009 cycles:k
0.000764149 seconds time elapsed
Signed-off-by: Pratyush Anand <panand@redhat.com>
---
arch/arm64/kernel/perf_event.c | 11 ++++-------
1 file changed, 4 insertions(+), 7 deletions(-)
diff --git a/arch/arm64/kernel/perf_event.c b/arch/arm64/kernel/perf_event.c
index 83a1b1ad189f..43c77314f345 100644
--- a/arch/arm64/kernel/perf_event.c
+++ b/arch/arm64/kernel/perf_event.c
@@ -846,17 +846,14 @@ static int armv8pmu_get_event_idx(struct pmu_hw_events *cpuc,
struct hw_perf_event *hwc = &event->hw;
unsigned long evtype = hwc->config_base & ARMV8_PMU_EVTYPE_EVENT;
- /* Always place a cycle counter into the cycle counter. */
+ /* Always prefer to place a cycle counter into the cycle counter. */
if (evtype == ARMV8_PMUV3_PERFCTR_CPU_CYCLES) {
- if (test_and_set_bit(ARMV8_IDX_CYCLE_COUNTER, cpuc->used_mask))
- return -EAGAIN;
-
- return ARMV8_IDX_CYCLE_COUNTER;
+ if (!test_and_set_bit(ARMV8_IDX_CYCLE_COUNTER, cpuc->used_mask))
+ return ARMV8_IDX_CYCLE_COUNTER;
}
/*
- * For anything other than a cycle counter, try and use
- * the events counters
+ * Otherwise use events counters
*/
for (idx = ARMV8_IDX_COUNTER0; idx < cpu_pmu->num_events; ++idx) {
if (!test_and_set_bit(idx, cpuc->used_mask))
--
2.9.3
^ permalink raw reply related [flat|nested] 4+ messages in thread
* [PATCH] arm64: perf: Allow more than one cycle counter to be used
2017-07-01 6:33 [PATCH] arm64: perf: Allow more than one cycle counter to be used Pratyush Anand
@ 2017-07-17 3:26 ` Pratyush Anand
2017-07-17 13:16 ` Will Deacon
0 siblings, 1 reply; 4+ messages in thread
From: Pratyush Anand @ 2017-07-17 3:26 UTC (permalink / raw)
To: linux-arm-kernel
+Mark
On Saturday 01 July 2017 12:03 PM, Pratyush Anand wrote:
> Currently:
> $ perf stat -e cycles:u -e cycles:k true
>
> Performance counter stats for 'true':
>
> 2,24,699 cycles:u
> <not counted> cycles:k (0.00%)
>
> 0.000788087 seconds time elapsed
>
> We can not count more than one cycle counter in one instance,because we
> allow to map cycle counter into PMCCNTR_EL0 only. However, if I did not
> miss anything then specification do not prohibit to use PMEVCNTR<n>_EL0
> for cycle count as well.
>
> Modify the code so that it still prefers to use PMCCNTR_EL0 for cycle
> counter, however allow to use PMEVCNTR<n>_EL0 if PMCCNTR_EL0 is already
> in use.
>
> After this patch:
>
> $ perf stat -e cycles:u -e cycles:k true
>
> Performance counter stats for 'true':
>
> 2,17,310 cycles:u
> 7,40,009 cycles:k
>
> 0.000764149 seconds time elapsed
>
Any comment/feedback?
> Signed-off-by: Pratyush Anand <panand@redhat.com>
> ---
> arch/arm64/kernel/perf_event.c | 11 ++++-------
> 1 file changed, 4 insertions(+), 7 deletions(-)
>
> diff --git a/arch/arm64/kernel/perf_event.c b/arch/arm64/kernel/perf_event.c
> index 83a1b1ad189f..43c77314f345 100644
> --- a/arch/arm64/kernel/perf_event.c
> +++ b/arch/arm64/kernel/perf_event.c
> @@ -846,17 +846,14 @@ static int armv8pmu_get_event_idx(struct pmu_hw_events *cpuc,
> struct hw_perf_event *hwc = &event->hw;
> unsigned long evtype = hwc->config_base & ARMV8_PMU_EVTYPE_EVENT;
>
> - /* Always place a cycle counter into the cycle counter. */
> + /* Always prefer to place a cycle counter into the cycle counter. */
> if (evtype == ARMV8_PMUV3_PERFCTR_CPU_CYCLES) {
> - if (test_and_set_bit(ARMV8_IDX_CYCLE_COUNTER, cpuc->used_mask))
> - return -EAGAIN;
> -
> - return ARMV8_IDX_CYCLE_COUNTER;
> + if (!test_and_set_bit(ARMV8_IDX_CYCLE_COUNTER, cpuc->used_mask))
> + return ARMV8_IDX_CYCLE_COUNTER;
> }
>
> /*
> - * For anything other than a cycle counter, try and use
> - * the events counters
> + * Otherwise use events counters
> */
> for (idx = ARMV8_IDX_COUNTER0; idx < cpu_pmu->num_events; ++idx) {
> if (!test_and_set_bit(idx, cpuc->used_mask))
>
--
Pratyush
^ permalink raw reply [flat|nested] 4+ messages in thread
* [PATCH] arm64: perf: Allow more than one cycle counter to be used
2017-07-17 3:26 ` Pratyush Anand
@ 2017-07-17 13:16 ` Will Deacon
2017-07-17 14:00 ` Pratyush Anand
0 siblings, 1 reply; 4+ messages in thread
From: Will Deacon @ 2017-07-17 13:16 UTC (permalink / raw)
To: linux-arm-kernel
On Mon, Jul 17, 2017 at 08:56:10AM +0530, Pratyush Anand wrote:
> On Saturday 01 July 2017 12:03 PM, Pratyush Anand wrote:
> >Currently:
> >$ perf stat -e cycles:u -e cycles:k true
> >
> > Performance counter stats for 'true':
> >
> > 2,24,699 cycles:u
> > <not counted> cycles:k (0.00%)
> >
> > 0.000788087 seconds time elapsed
> >
> >We can not count more than one cycle counter in one instance,because we
> >allow to map cycle counter into PMCCNTR_EL0 only. However, if I did not
> >miss anything then specification do not prohibit to use PMEVCNTR<n>_EL0
> >for cycle count as well.
> >
> >Modify the code so that it still prefers to use PMCCNTR_EL0 for cycle
> >counter, however allow to use PMEVCNTR<n>_EL0 if PMCCNTR_EL0 is already
> >in use.
> >
> >After this patch:
> >
> >$ perf stat -e cycles:u -e cycles:k true
> >
> > Performance counter stats for 'true':
> >
> > 2,17,310 cycles:u
> > 7,40,009 cycles:k
> >
> > 0.000764149 seconds time elapsed
> >
>
> Any comment/feedback?
I'll pick this one up.
Thanks,
Will
^ permalink raw reply [flat|nested] 4+ messages in thread
* [PATCH] arm64: perf: Allow more than one cycle counter to be used
2017-07-17 13:16 ` Will Deacon
@ 2017-07-17 14:00 ` Pratyush Anand
0 siblings, 0 replies; 4+ messages in thread
From: Pratyush Anand @ 2017-07-17 14:00 UTC (permalink / raw)
To: linux-arm-kernel
On Monday 17 July 2017 06:46 PM, Will Deacon wrote:
> On Mon, Jul 17, 2017 at 08:56:10AM +0530, Pratyush Anand wrote:
>> On Saturday 01 July 2017 12:03 PM, Pratyush Anand wrote:
>>> Currently:
>>> $ perf stat -e cycles:u -e cycles:k true
>>>
>>> Performance counter stats for 'true':
>>>
>>> 2,24,699 cycles:u
>>> <not counted> cycles:k (0.00%)
>>>
>>> 0.000788087 seconds time elapsed
>>>
>>> We can not count more than one cycle counter in one instance,because we
>>> allow to map cycle counter into PMCCNTR_EL0 only. However, if I did not
>>> miss anything then specification do not prohibit to use PMEVCNTR<n>_EL0
>>> for cycle count as well.
>>>
>>> Modify the code so that it still prefers to use PMCCNTR_EL0 for cycle
>>> counter, however allow to use PMEVCNTR<n>_EL0 if PMCCNTR_EL0 is already
>>> in use.
>>>
>>> After this patch:
>>>
>>> $ perf stat -e cycles:u -e cycles:k true
>>>
>>> Performance counter stats for 'true':
>>>
>>> 2,17,310 cycles:u
>>> 7,40,009 cycles:k
>>>
>>> 0.000764149 seconds time elapsed
>>>
>>
>> Any comment/feedback?
>
> I'll pick this one up.
>
Thanks Will.
--
Pratyush
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2017-07-17 14:00 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-07-01 6:33 [PATCH] arm64: perf: Allow more than one cycle counter to be used Pratyush Anand
2017-07-17 3:26 ` Pratyush Anand
2017-07-17 13:16 ` Will Deacon
2017-07-17 14:00 ` Pratyush Anand
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).