* [GIT PULL] perf x86 updates for v3.20
@ 2015-02-16 7:48 Ingo Molnar
From: Ingo Molnar @ 2015-02-16 7:48 UTC
To: Linus Torvalds
Cc: linux-kernel, Andy Lutomirski, Peter Zijlstra, Thomas Gleixner,
Arnaldo Carvalho de Melo, Andrew Morton
Linus,
Please pull the latest perf-core-for-linus git tree from:
git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git perf-core-for-linus
# HEAD: a66734297f78707ce39d756b656bfae861d53f62 perf/x86: Add /sys/devices/cpu/rdpmc=2 to allow rdpmc for all tasks
( I'm sending these changes from Andy Lutomirski separately
because they were based on other bits that went upstream
in this cycle. )
This series tightens up RDPMC permissions: currently even
highly sandboxed x86 execution environments (such as
seccomp) have permission to execute RDPMC, which may leak
various perf events / PMU state such as timing information
and other CPU execution details.
This 'all is allowed' RDPMC mode is still preserved as the
(non-default) /sys/devices/cpu/rdpmc=2 setting. The new
default is that RDPMC access is only allowed if a perf
event is mmap-ed (which is needed to correctly interpret
RDPMC counter values in any case).
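( Illustration only, not code from this series: a minimal C sketch of
the access policy described above. The enum and function names are
hypothetical. The meaning of value 0 is an assumption; only the
default 'mapped' behaviour and the rdpmc=2 setting are described in
this mail. )

```c
/* Hypothetical names; the values mirror /sys/devices/cpu/rdpmc. */
enum rdpmc_mode {
	RDPMC_NEVER     = 0,	/* assumption: user-space RDPMC disabled */
	RDPMC_IF_MAPPED = 1,	/* new default: needs an mmap-ed perf event */
	RDPMC_ALWAYS    = 2,	/* the old 'all is allowed' behaviour */
};

/*
 * Returns 1 if a task may execute RDPMC, 0 otherwise.
 * 'mapped_events' is the number of perf events the task's mm
 * currently has mmap-ed.
 */
static int rdpmc_allowed(enum rdpmc_mode mode, int mapped_events)
{
	switch (mode) {
	case RDPMC_ALWAYS:
		return 1;
	case RDPMC_IF_MAPPED:
		return mapped_events > 0;
	default:
		return 0;
	}
}
```

Under this sketch a seccomp-style sandbox with no mapped perf event
gets rdpmc_allowed(RDPMC_IF_MAPPED, 0) == 0, while the same task under
RDPMC_ALWAYS is still allowed.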
As a side effect of these changes CR4 handling is cleaned
up in the x86 code and a shadow copy of the CR4 value is
added.
The extra CR4 manipulation adds ~ <50ns to the context
switch cost between rdpmc-capable and rdpmc-non-capable
mms.
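( Illustration only: a user-space C sketch of the shadow-copy idea,
assuming the register is written only when the requested bits actually
change. All names here are hypothetical stand-ins and not the kernel's
helpers; the simulated register and the write counter exist purely so
that the behaviour can be observed. )

```c
#define X86_CR4_PCE (1UL << 8)	/* CR4.PCE: allows RDPMC from user mode */

static unsigned long fake_cr4;		/* simulated hardware register */
static unsigned long cr4_shadow;	/* cached copy (per-CPU in a kernel) */
static unsigned int cr4_writes;		/* number of simulated CR4 writes */

static void fake_write_cr4(unsigned long val)
{
	fake_cr4 = val;
	cr4_writes++;
}

/* Set bits via the shadow; skip the register write if nothing changes. */
static void shadow_set_bits(unsigned long mask)
{
	unsigned long cr4 = cr4_shadow;

	if ((cr4 | mask) != cr4) {
		cr4 |= mask;
		cr4_shadow = cr4;
		fake_write_cr4(cr4);
	}
}

/* Clear bits via the shadow; skip the register write if nothing changes. */
static void shadow_clear_bits(unsigned long mask)
{
	unsigned long cr4 = cr4_shadow;

	if ((cr4 & ~mask) != cr4) {
		cr4 &= ~mask;
		cr4_shadow = cr4;
		fake_write_cr4(cr4);
	}
}

/* Context-switch hook: CR4 is touched only when the PCE state differs. */
static void switch_rdpmc(int next_mm_allows_rdpmc)
{
	if (next_mm_allows_rdpmc)
		shadow_set_bits(X86_CR4_PCE);
	else
		shadow_clear_bits(X86_CR4_PCE);
}
```

In this sketch, switching between two mms with the same RDPMC state
never reads or writes the register; only a transition between
rdpmc-capable and rdpmc-non-capable mms pays for a CR4 write.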
( Note: shortlog and diffstat created manually due to the
somewhat unusual merge base - hopefully the result is
still fine. )
Thanks,
Ingo
------------------>
Andy Lutomirski (7):
x86: Clean up cr4 manipulation
x86: Store a per-cpu shadow copy of CR4
x86: Add a comment clarifying LDT context switching
perf: Add pmu callbacks to track event mapping and unmapping
perf: Pass the event to arch_perf_update_userpage()
perf/x86: Only allow rdpmc if a perf_event is mapped
perf/x86: Add /sys/devices/cpu/rdpmc=2 to allow rdpmc for all tasks
Ingo Molnar (1):
Merge branch 'x86/asm' into perf/x86, to avoid conflicts with upcoming patches
arch/x86/include/asm/mmu.h | 2 ++
arch/x86/include/asm/mmu_context.h | 33 +++++++++++++++++++++-----
arch/x86/include/asm/paravirt.h | 6 ++---
arch/x86/include/asm/processor.h | 33 --------------------------
arch/x86/include/asm/special_insns.h | 6 ++---
arch/x86/include/asm/tlbflush.h | 77 ++++++++++++++++++++++++++++++++++++++++++++++++++++++------
arch/x86/include/asm/virtext.h | 5 ++--
arch/x86/kernel/acpi/sleep.c | 2 +-
arch/x86/kernel/cpu/common.c | 17 ++++++++++----
arch/x86/kernel/cpu/mcheck/mce.c | 3 ++-
arch/x86/kernel/cpu/mcheck/p5.c | 3 ++-
arch/x86/kernel/cpu/mcheck/winchip.c | 3 ++-
arch/x86/kernel/cpu/mtrr/cyrix.c | 6 ++---
arch/x86/kernel/cpu/mtrr/generic.c | 6 ++---
arch/x86/kernel/cpu/perf_event.c | 76 +++++++++++++++++++++++++++++++++++++++++++++--------------
arch/x86/kernel/cpu/perf_event.h | 2 ++
arch/x86/kernel/head32.c | 1 +
arch/x86/kernel/head64.c | 2 ++
arch/x86/kernel/i387.c | 3 ++-
arch/x86/kernel/process.c | 5 ++--
arch/x86/kernel/process_32.c | 2 +-
arch/x86/kernel/process_64.c | 2 +-
arch/x86/kernel/setup.c | 2 +-
arch/x86/kernel/xsave.c | 3 ++-
arch/x86/kvm/svm.c | 2 +-
arch/x86/kvm/vmx.c | 10 ++++----
arch/x86/mm/fault.c | 2 +-
arch/x86/mm/init.c | 13 ++++++++--
arch/x86/mm/tlb.c | 3 ---
arch/x86/power/cpu.c | 11 ++++-----
arch/x86/realmode/init.c | 2 +-
arch/x86/xen/enlighten.c | 4 ++--
drivers/lguest/x86/core.c | 5 ++--
include/linux/perf_event.h | 7 ++++++
kernel/events/core.c | 14 +++++++++--
35 files changed, 253 insertions(+), 120 deletions(-)
* Re: [GIT PULL] perf x86 updates for v3.20
@ 2015-02-16 20:55 ` Andy Lutomirski
From: Andy Lutomirski @ 2015-02-16 20:55 UTC
To: Ingo Molnar, Linus Torvalds
Cc: linux-kernel, Andy Lutomirski, Peter Zijlstra, Thomas Gleixner,
Arnaldo Carvalho de Melo, Andrew Morton
On 02/15/2015 11:48 PM, Ingo Molnar wrote:
> Linus,
>
> Please pull the latest perf-core-for-linus git tree from:
>
> git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git perf-core-for-linus
>
> # HEAD: a66734297f78707ce39d756b656bfae861d53f62 perf/x86: Add /sys/devices/cpu/rdpmc=2 to allow rdpmc for all tasks
[...]
> The extra CR4 manipulation adds ~ <50ns to the context
> switch cost between rdpmc-capable and rdpmc-non-capable
> mms.
That's about the best I could benchmark, too -- if it was more than
about 50ns, I'm pretty sure I would have seen a difference, but, as it
stands, it seems to have been lost in the noise. Maybe I should find a
better benchmark.
In any event, this series is probably a mixed bag performance-wise. In
the best case, there's a small extra cost in context switches, and, when
switching PCE, there's a CR4 write. On SVM guests, the CR4 write will suck.
To balance that out, I removed a CR4 read from VMX entry and from global
TLB flushes. The former mostly fixes a performance regression from a
security fix a few releases back, and I expect that the latter will
more than offset the added context switch overhead (especially on SVM
guests, where even CR4 reads exit AFAIK).
Anyway, I tried and failed to detect any difference at all. Context
switch timing was very noisy for me.
--Andy