From: Ingo Molnar <mingo@elte.hu>
To: Paul Mackerras <paulus@samba.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>,
Corey Ashford <cjashfor@linux.vnet.ibm.com>,
linux-kernel@vger.kernel.org
Subject: Re: [PATCH 13/15] perf_counter: provide generic callchain bits
Date: Wed, 1 Apr 2009 12:00:15 +0200 [thread overview]
Message-ID: <20090401100015.GD27865@elte.hu> (raw)
In-Reply-To: <18899.10669.917693.67352@cargo.ozlabs.ibm.com>
* Paul Mackerras <paulus@samba.org> wrote:
> Peter Zijlstra writes:
>
> > Ah, yes, I see how that can confuse. PERF_EVENT_COUNTER_OVERFLOW then?
>
> Sounds reasonable.
>
> > > Also, let's add PERF_RECORD/PERF_EVENT bits for:
> > >
> > > * EVENT_INSTR_ADDR
> >
> > I'm failing to come up with what this could be..
>
> So, you have lots of instructions in flight in the processor, and
> one of them causes an event that increments a counter and causes
> it to overflow, so an interrupt request is generated. Even if the
> interrupt is taken "immediately", it can still happen that the set
> of instructions the processor decides to complete before taking
> the interrupt includes some instructions after the instruction
> that caused the counter to overflow, and of course if interrupts
> are (hard-) disabled at the time of the overflow, the interrupt
> will happen later. That means that the IP from the pt_regs is not
> generally a reliable indication of which instruction made the
> counter overflow.
>
> On POWER processors we have a register which gives us a much more
> reliable indication of which instruction caused the counter
> overflow, at least in those cases where the event can be
> attributed to a specific instruction. This EVENT_INSTR_ADDR bit
> would ask for that register to be sampled and recorded.
So it's a bit like PEBS and IBS on the x86, right?
In theory one could simply override the sampled ptregs->ip with this
more precise register value. The instruction where the IRQ hit is
probably meaningless, if more precise information is available. But
we can have both too i guess.
The data address extension definitely makes sense - it can be used
to for a profile view along the data symbol dimension, instead of
the usual function symbol dimension.
CPU flags makes sense too - irqs-off can help the annotation of
source code sections where the profiler sees that irqs were
disabled.
It seems here we gradually descend into arch-specific CPU state
technicalities and it's not immediately obvious where to draw the
line.
Call-chain and data address abstractions are clear. CPU flags is
less clear: we could perhaps split off the irq state and the
privilege level information - that is present on all CPUs.
The rest should probably be opaque and not generalized.
_Perhaps_, to stem the inevitable list of such small details, it
might make sense to have a record type with signal frame qualities -
which would include most of this info. That would mix well with the
planned feature of signal generation anyway, right?
I.e. we could extend the lowlevel sigcontext signal frame generation
code in arch/x86/kernel/signal.c (and its powerpc equivalent) to
generate a signal frame but output it into the mmap buffer, not into
the userspace stack - and we would not actually execute a signal in
that context.
[ of course, when the counter is configured to generate a signal
that is done too. The code would be dual purpose. ]
So user-space would get a fully signal frame compatible record - and
we'd not have to create a per arch ABI for this because we'd piggy
back to the signal frame format.
We could add SA_NOFPU support for fast-track integer-registers-only
frames, etc.
Hm?
Ingo
next prev parent reply other threads:[~2009-04-01 10:00 UTC|newest]
Thread overview: 55+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-03-30 17:07 [PATCH 00/15] pending perf_counter bits Peter Zijlstra
2009-03-30 17:07 ` [PATCH 01/15] perf_counter: unify and fix delayed counter wakeup Peter Zijlstra
2009-03-31 5:45 ` Paul Mackerras
2009-03-31 6:37 ` Peter Zijlstra
2009-04-01 9:04 ` Ingo Molnar
2009-04-01 10:12 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 02/15] perf_counter: fix update_userpage() Peter Zijlstra
2009-04-01 10:12 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 03/15] perf_counter: kerneltop: simplify data_head read Peter Zijlstra
2009-04-01 10:13 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 04/15] perf_counter: executable mmap() information Peter Zijlstra
2009-04-01 10:13 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 05/15] perf_counter: kerneltop: parse the mmap data stream Peter Zijlstra
2009-04-01 10:13 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 06/15] perf_counter: powerpc: only reserve PMU hardware when we need it Peter Zijlstra
2009-04-01 10:13 ` [tip:perfcounters/core] " Paul Mackerras
2009-03-30 17:07 ` [PATCH 07/15] perf_counter: make it possible for hw_perf_counter_init to return error codes Peter Zijlstra
2009-04-01 10:13 ` [tip:perfcounters/core] " Paul Mackerras
2009-03-30 17:07 ` [PATCH 08/15] perf_counter: x86: proper error propagation for the x86 hw_perf_counter_init() Peter Zijlstra
2009-04-01 10:13 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 09/15] perf_counter tools: optionally scale counter values in perfstat mode Peter Zijlstra
2009-04-01 9:39 ` Ingo Molnar
2009-04-01 10:50 ` Paul Mackerras
2009-04-01 10:14 ` [tip:perfcounters/core] " Paul Mackerras
2009-03-30 17:07 ` [PATCH 10/15] perf_counter: small cleanup of the output routines Peter Zijlstra
2009-04-01 10:14 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 11/15] perf_counter: re-arrange the perf_event_type Peter Zijlstra
2009-04-01 10:14 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 12/15] pref_counter: kerneltop: update event_types Peter Zijlstra
2009-04-01 10:14 ` [tip:perfcounters/core] perf_counter tools: " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 13/15] perf_counter: provide generic callchain bits Peter Zijlstra
2009-03-31 6:12 ` Paul Mackerras
2009-03-31 6:39 ` Peter Zijlstra
2009-03-31 7:24 ` Corey Ashford
2009-03-31 8:43 ` Peter Zijlstra
2009-03-31 9:09 ` Paul Mackerras
2009-03-31 9:12 ` Paul Mackerras
2009-03-31 14:00 ` Peter Zijlstra
2009-03-31 17:11 ` Corey Ashford
2009-04-01 3:48 ` Paul Mackerras
2009-04-01 7:59 ` Peter Zijlstra
2009-04-01 8:45 ` Paul Mackerras
2009-04-01 10:00 ` Ingo Molnar [this message]
2009-04-01 11:53 ` Paul Mackerras
2009-04-01 23:25 ` Corey Ashford
2009-04-02 6:43 ` Peter Zijlstra
2009-04-02 7:41 ` Peter Zijlstra
2009-04-02 9:10 ` Paul Mackerras
2009-04-02 9:14 ` Peter Zijlstra
2009-04-01 10:14 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 14/15] perf_counter: x86: callchain support Peter Zijlstra
2009-04-01 10:14 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-30 17:07 ` [PATCH 15/15] perf_counter: pmc arbitration Peter Zijlstra
2009-04-01 10:15 ` [tip:perfcounters/core] " Peter Zijlstra
2009-03-31 5:43 ` [PATCH 00/15] pending perf_counter bits Paul Mackerras
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090401100015.GD27865@elte.hu \
--to=mingo@elte.hu \
--cc=a.p.zijlstra@chello.nl \
--cc=cjashfor@linux.vnet.ibm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=paulus@samba.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.