From: Ingo Molnar <mingo@elte.hu>
To: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
Cc: Steven Rostedt <rostedt@goodmis.org>,
Frederic Weisbecker <fweisbec@gmail.com>,
LKML <linux-kernel@vger.kernel.org>,
Thomas Gleixner <tglx@linutronix.de>,
Peter Zijlstra <peterz@infradead.org>
Subject: Re: [PATCH 0/5] [GIT PULL] updates for tip/tracing/ftrace
Date: Sun, 22 Mar 2009 18:02:15 +0100 [thread overview]
Message-ID: <20090322170215.GA4468@elte.hu> (raw)
In-Reply-To: <20090322150657.GA17415@elte.hu>
ok, with Frederic we figured out the problem.
What helped things most was this trace-dump output:
0) + 15.281 us | }
0) | handle_irq() {
1) + 35.871 us | }
1) | timespec_to_ktime() {
0) 4.608 us | }
0) | generic_handle_irq_desc() {
1) 4.097 us | }
1) + 14.171 us | }
0) 4.450 us | _spin_lock();
1) + 60.127 us | }
1) | ktime_get() {
0) | ack_apic_edge() {
1) | getnstimeofday() {
0) 6.486 us | }
0) 5.619 us | irq_complete_move();
1) 5.158 us | jiffies_read();
0) | move_native_irq() {
1) + 15.495 us | }
1) + 26.161 us | }
0) 5.631 us | }
1) 5.549 us | set_normalized_timespec();
0) + 16.304 us | }
0) | ack_APIC_irq() {
1) + 48.377 us | }
1) | timespec_to_ktime() {
0) 5.762 us | native_apic_mem_write();
1) 5.751 us | }
0) + 16.162 us | }
1) + 16.413 us | }
0) + 27.185 us | }
1) + 81.519 us | }
0) + 80.245 us | }
1) ! 154.606 us | }
0) | _spin_unlock() {
1) 5.743 us | tick_nohz_update_jiffies();
0) 5.781 us | }
1) ! 183.912 us | }
0) 5.327 us | preempt_schedule();
1) ! 202.575 us | }
0) + 25.827 us | }
[...]
1) ! 2623.297 us | }
i.e. all CPUs spend 2-3 milliseconds to handle a single tick. This
is on a Core2 Extreme Edition 2.93 GHz CPU, so this kind of cost was
unexpected.
Until i saw this:
CONFIG_TRACE_BRANCH_PROFILING=y
CONFIG_PROFILE_ALL_BRANCHES=y
that explains it all. The above sequence is two CPUs 'lock stepped'
in a very high overhead series of cacheline ping-pongs. The
ping-pongs happen due to every branch in the kernel doing:
______f.miss_hit[______r]++;
where the branch info metadata is defined as global variables:
static struct ftrace_branch_data \
__attribute__((__aligned__(4))) \
__attribute__((section("_ftrace_branch"))) \
not only is it global, it's also false cacheline-shared due to a 4
byte alignment.
The proper solution would be to use percpu data and percpu_add()
primitives for this.
Anyway ... i turned off the branch tracer for my tests.
Ingo
next prev parent reply other threads:[~2009-03-22 17:02 UTC|newest]
Thread overview: 48+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-03-18 3:14 [PATCH 0/5] [GIT PULL] updates for tip/tracing/ftrace Steven Rostedt
2009-03-18 3:14 ` [PATCH 1/5] ring-buffer: add api to allow a tracer to change clock source Steven Rostedt
2009-03-18 3:14 ` [PATCH 2/5] tracing: add global-clock option to provide cross CPU clock to traces Steven Rostedt
2009-03-18 3:14 ` [PATCH 3/5] tracing: optimization of branch tracer Steven Rostedt
2009-03-18 3:14 ` [PATCH 4/5] tracing: make sched_switch stop/start light weight Steven Rostedt
2009-03-18 3:14 ` [PATCH 5/5] tracing: make power tracer start/stop methods lighter weight Steven Rostedt
2009-03-18 5:59 ` [PATCH 0/5] [GIT PULL] updates for tip/tracing/ftrace Ingo Molnar
2009-03-18 7:39 ` Ingo Molnar
2009-03-19 7:33 ` Ingo Molnar
2009-03-19 17:21 ` Steven Rostedt
2009-03-20 17:43 ` Paul E. McKenney
2009-03-20 18:36 ` Ingo Molnar
2009-03-20 18:38 ` Ingo Molnar
2009-03-20 19:19 ` Paul E. McKenney
2009-03-20 19:27 ` Ingo Molnar
2009-03-20 19:41 ` Paul E. McKenney
2009-03-20 19:46 ` Frederic Weisbecker
2009-03-20 19:54 ` Ingo Molnar
2009-03-20 20:48 ` Frederic Weisbecker
2009-03-20 21:05 ` Steven Rostedt
2009-03-21 10:01 ` Ingo Molnar
2009-03-21 16:58 ` Ingo Molnar
2009-03-21 17:25 ` Steven Rostedt
2009-03-21 19:07 ` Paul E. McKenney
2009-03-21 20:09 ` Ingo Molnar
2009-03-21 21:01 ` Paul E. McKenney
2009-03-22 14:24 ` Ingo Molnar
2009-03-22 15:06 ` Ingo Molnar
2009-03-22 17:02 ` Ingo Molnar [this message]
2009-03-22 18:33 ` Steven Rostedt
2009-03-22 19:52 ` Ingo Molnar
2009-03-23 18:44 ` Steven Rostedt
2009-03-21 17:32 ` Frederic Weisbecker
2009-03-21 17:44 ` Steven Rostedt
2009-03-21 17:53 ` Frederic Weisbecker
2009-03-21 18:17 ` Steven Rostedt
2009-03-21 20:03 ` Frederic Weisbecker
2009-03-21 18:18 ` Ingo Molnar
2009-03-21 20:09 ` Frederic Weisbecker
2009-03-21 20:46 ` Frederic Weisbecker
2009-03-22 19:41 ` Ingo Molnar
2009-03-22 20:41 ` Ingo Molnar
2009-03-20 21:39 ` Paul E. McKenney
2009-03-20 17:05 ` Frederic Weisbecker
2009-03-20 17:57 ` Frederic Weisbecker
2009-03-20 18:22 ` Steven Rostedt
2009-03-20 18:39 ` Frederic Weisbecker
2009-03-20 18:42 ` Ingo Molnar
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090322170215.GA4468@elte.hu \
--to=mingo@elte.hu \
--cc=fweisbec@gmail.com \
--cc=linux-kernel@vger.kernel.org \
--cc=paulmck@linux.vnet.ibm.com \
--cc=peterz@infradead.org \
--cc=rostedt@goodmis.org \
--cc=tglx@linutronix.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox