From: Robert Richter <robert.richter@amd.com>
To: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Vince Weaver <vweaver1@eecs.utk.edu>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
Paul Mackerras <paulus@samba.org>, Ingo Molnar <mingo@elte.hu>,
Arnaldo Carvalho de Melo <acme@ghostprotocols.net>,
Stephane Eranian <eranian@google.com>
Subject: Re: [patch] perf_events: more wrong events for AMD fam10h
Date: Mon, 27 Jun 2011 15:38:09 +0200 [thread overview]
Message-ID: <20110627133809.GC4590@erda.amd.com> (raw)
In-Reply-To: <1309173740.6701.103.camel@twins>
On 27.06.11 07:22:20, Peter Zijlstra wrote:
> On Tue, 2011-06-07 at 15:39 -0400, Vince Weaver wrote:
> > Hello
> >
> > I'm in the process of auditing perf_event's awesome "generalized events".
> >
> > On AMD fam10h for some we have the following definitions:
> > cache-references = INSTRUCTION_CACHE_FETCHES 0x530080
> > cache-misses = INSTRUCTION_CACHE_MISSES 0x530081
> >
> > on Intel at least I'm pretty sure these events match to Last Level Cache
> > accesses/misses, not icache. Is there a reason for this?
> >
> > Attached is a patch that removes these until better events can be found.
> > (LLC is tricky on AMD as it's a shared resource).
> >
> > Note, l1-dcache-stores is broken too, I'm looking into it.
> >
> > Thanks,
> >
> > Vince
> > vweaver1@eecs.utk.edu
> >
> > diff --git a/arch/x86/kernel/cpu/perf_event_amd.c b/arch/x86/kernel/cpu/perf_event_amd.c
> > index fe29c1d..a46b987 100644
> > --- a/arch/x86/kernel/cpu/perf_event_amd.c
> > +++ b/arch/x86/kernel/cpu/perf_event_amd.c
> > @@ -98,8 +98,6 @@ static const u64 amd_perfmon_event_map[] =
> > {
> > [PERF_COUNT_HW_CPU_CYCLES] = 0x0076,
> > [PERF_COUNT_HW_INSTRUCTIONS] = 0x00c0,
I am not sure if Intel's LLC definition includes uncore events which
are equivalent to AMD's northbridge counter (L3) events (it could be
meant the LLC of the core only?).
Following definition taken form the Intel spec (SDM 3B, 30.2.3 and
Appendix A):
> > - [PERF_COUNT_HW_CACHE_REFERENCES] = 0x0080,
"This event counts requests originating from the core that reference a
cache line in the last level cache." (Intel event 2EH/4FH)
> > - [PERF_COUNT_HW_CACHE_MISSES] = 0x0081,
"This event counts each cache miss condition for references to the
last level cache." (Intel event 2EH/41H)
>
> Would 0x40000F7E0 and 0x40000F7E1 be better?
Taking this is a bit tricky, we would measure all misses of the node
then. We actually would have to select the current core. But for this
we need to schedule one northbridge counter for each core. On a six
core (family 10h ref D) we don't have enough counters then. There is
also an erratum, we can not do per-core L3 measurements.
On family 15h northbridge counters are not yet implemented.
On k7/k8 we must select L2 events (0x0080/0x0081).
So even if LLC includes per-definition the L3, it is not easy to
implement. But using 0x0080/0x0081 on all models would give similar
results for every cpu family.
-Robert
>
> > [PERF_COUNT_HW_BRANCH_INSTRUCTIONS] = 0x00c2,
> > [PERF_COUNT_HW_BRANCH_MISSES] = 0x00c3,
> > [PERF_COUNT_HW_STALLED_CYCLES_FRONTEND] = 0x00d0, /* "Decoder empty" event */
>
>
--
Advanced Micro Devices, Inc.
Operating System Research Center
prev parent reply other threads:[~2011-06-27 13:53 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-06-07 19:39 [patch] perf_events: more wrong events for AMD fam10h Vince Weaver
2011-06-07 21:07 ` [patch] perf_events: even " Vince Weaver
2011-06-27 11:22 ` Peter Zijlstra
2011-06-27 15:51 ` Robert Richter
2011-06-28 16:32 ` Vince Weaver
2011-06-28 16:20 ` Vince Weaver
2011-06-27 11:22 ` [patch] perf_events: " Peter Zijlstra
2011-06-27 13:38 ` Robert Richter [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110627133809.GC4590@erda.amd.com \
--to=robert.richter@amd.com \
--cc=a.p.zijlstra@chello.nl \
--cc=acme@ghostprotocols.net \
--cc=eranian@google.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
--cc=paulus@samba.org \
--cc=vweaver1@eecs.utk.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox