From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: "Srivatsa S. Bhat" <srivatsa.bhat@linux.vnet.ibm.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>,
Arjan van de Ven <arjan@infradead.org>,
Steven Rostedt <rostedt@goodmis.org>,
"rusty@rustcorp.com.au" <rusty@rustcorp.com.au>,
"Rafael J. Wysocki" <rjw@sisk.pl>,
Srivatsa Vaddagiri <vatsa@linux.vnet.ibm.com>,
"akpm@linux-foundation.org" <akpm@linux-foundation.org>,
Paul Gortmaker <paul.gortmaker@windriver.com>,
Milton Miller <miltonm@bga.com>, "mingo@elte.hu" <mingo@elte.hu>,
Tejun Heo <tj@kernel.org>,
KOSAKI Motohiro <kosaki.motohiro@gmail.com>,
linux-kernel <linux-kernel@vger.kernel.org>,
Linux PM mailing list <linux-pm@vger.kernel.org>,
nikunj@linux.vnet.ibm.com
Subject: Re: CPU Hotplug rework
Date: Mon, 9 Apr 2012 09:46:28 -0700
Message-ID: <20120409164628.GA2430@linux.vnet.ibm.com>
In-Reply-To: <4F7F4EF0.70305@linux.vnet.ibm.com>

On Sat, Apr 07, 2012 at 01:45:44AM +0530, Srivatsa S. Bhat wrote:
> On 04/06/2012 04:36 AM, Paul E. McKenney wrote:
>
> > Hello,
> >
> > Here is my attempt at a summary of the discussion.
> >
>
>
> Thanks for the summary, it is really helpful :-)
>
> > Srivatsa, I left out the preempt_disable() pieces, but would be happy
> > to add them in when you let me know what you are thinking to do for
> > de-stop_machine()ing CPU hotplug.
> >
>
>
> Ok..
>
>
> > Thanx, Paul
> >
> > ------------------------------------------------------------------------
> >
> > CPU-hotplug work breakout:
> >
> > 1. Read and understand the current generic code.
> > Srivatsa Bhat has done this, as have Paul E. McKenney and
> > Peter Zijlstra to a lesser extent.
>
>
> "lesser extent"?? Hell no! :-) ;-)

Certainly to a lesser extent on my part, but yes, I should not speak
for Peter.

> > 2. Read and understand the architecture-specific code, looking
> > for opportunities to consolidate additional function into
> > core code.
> >
> > a. Carry out any indicated consolidation.
> >
> > b. Convert all architectures to make use of the
> > consolidated implementation.
> >
> > Not started. Low priority from a big.LITTLE perspective.
>
>
> Recently this unexpectedly assumed high priority due to some scheduler
> changes and things got fixed up temporarily. And in that context,
> Peter Zijlstra gave some more technical pointers on what is wrong and needs
> to be done right. Link: https://lkml.org/lkml/2012/3/22/149
>
> Nikunj (in CC) has offered to work with me on this consolidation.

Very cool!  I have added the following:

------------------------------------------------------------------------

CONSOLIDATE ARCHITECTURE-SPECIFIC CPU-HOTPLUG CODE

1.  Ensure that all CPU_STARTING notifiers complete before the
    incoming CPU is marked online (the blackfin architecture
    fails to do this).

2.  Ensure that interrupts are disabled throughout the CPU_STARTING
    notifiers.  Currently, blackfin, cris, m32r, mips, sh, sparc64,
    um, and x86 fail to do this properly.

3.  Ensure that all architectures that use CONFIG_USE_GENERIC_SMP_HELPERS
    hold ipi_call_lock() over the entire CPU-online process.  Currently,
    alpha, arm, m32r, mips, sh, and sparc32 seem to fail to do this
    properly.

4.  Additional memory barriers are likely to be needed, for example,
    an smp_wmb() after setting cpu_active and an smp_rmb() in
    select_fallback_rq() before reading cpu_active.

Srivatsa Bhat (srivatsa.bhat@linux.vnet.ibm.com) and Nikunj A Dadhania
(nikunj@linux.vnet.ibm.com) are taking on this work.

------------------------------------------------------------------------

Please let me know if adjustments are needed.

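For item 4 above, here is a rough userspace sketch of the barrier pairing,
using C11 fences as stand-ins for smp_wmb() and smp_rmb().  The flag names
are made up for illustration, and "later_state_flag" is only an assumption
about what the reader looks at before it reads cpu_active; the actual
kernel state involved might be quite different.  Note also that C11
release/acquire fences are somewhat stronger than smp_wmb()/smp_rmb(),
so this is an analogy rather than a translation.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Userspace analogs; both names are hypothetical, not kernel symbols. */
static atomic_bool cpu_active_flag;   /* stands in for the cpu_active bit */
static atomic_bool later_state_flag;  /* assumed state published afterwards */

/* Writer: set cpu_active, then order that store before the later store. */
static void writer_publish(void)
{
	atomic_store_explicit(&cpu_active_flag, true, memory_order_relaxed);
	atomic_thread_fence(memory_order_release);	/* roughly smp_wmb() */
	atomic_store_explicit(&later_state_flag, true, memory_order_relaxed);
}

/*
 * Reader: returns -1 if the later state is not yet visible, otherwise
 * the observed cpu_active value (1 or 0).  With the two fences paired,
 * a reader that sees later_state_flag set must also see cpu_active set,
 * so 0 should never be returned once writer_publish() is the only writer.
 */
static int reader_check(void)
{
	if (!atomic_load_explicit(&later_state_flag, memory_order_relaxed))
		return -1;
	atomic_thread_fence(memory_order_acquire);	/* roughly smp_rmb() */
	return atomic_load_explicit(&cpu_active_flag,
				    memory_order_relaxed) ? 1 : 0;
}
```

Without either fence, the reader could in principle observe the later
store while still seeing a stale cpu_active, which is the sort of window
the added barriers are meant to close.
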
> > 3. Address the current kthread creation/teardown/migration
> > performance issues. (More details below.)
> >
> > Highest priority from a big.LITTLE perspective.
> >
> > 4. Wean CPU-hotplug offlining from stop_machine().
> > (More details below.)
> >
> > Moderate priority from a big.LITTLE perspective.
> >
> >
> > ADDRESSING KTHREAD CREATION/TEARDOWN/MIGRATION PERFORMANCE ISSUES
> >
> > 1. Evaluate approaches. Approaches currently under
> > consideration include:
> >
> > a. Park the kthreads rather than tearing them down or
> > migrating them. RCU currently takes this sort of
> > approach. Note that RCU currently relies on both
> > preempt_disable() and local_bh_disable() blocking the
> > current CPU from going offline.
> >
> > b. Allow in-kernel kthreads to avoid the delay
> > required to work around a bug in old versions of
> > bash. (This bug is a failure to expect receiving
> > a SIGCHLD signal corresponding to a child
> > created by a fork() system call that has not yet
> > returned.)
> >
> > This might be implemented using an additional
> > CLONE_ flag. This should allow kthreads to
> > be created and torn down much more quickly.
> >
> > c. Have some other TBD way to "freeze" a kthread.
> > (As in "your clever idea here".)
> >
> > 2. Implement the chosen approach or approaches. (Different
> > kernel subsystems might have different constraints, possibly
> > requiring different kthread handling.)
> >
> >
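A rough userspace illustration of approach 1a (park rather than tear
down) follows.  It uses pthreads instead of kthreads, and every name in
it is hypothetical; it only shows the general shape of the idea, namely
that a parked thread keeps its stack and state and can resume without
paying for creation again.

```c
#include <assert.h>
#include <pthread.h>
#include <sched.h>
#include <stdbool.h>

/* Hypothetical worker control block; not a kernel data structure. */
struct worker {
	pthread_t thread;
	pthread_mutex_t lock;
	pthread_cond_t cond;
	bool should_park;		/* controller asks the worker to idle */
	bool should_stop;		/* controller asks the worker to exit */
	bool parked;			/* worker acknowledges it is idle */
	unsigned long iterations;	/* stand-in for real work done */
};

static void *worker_fn(void *arg)
{
	struct worker *w = arg;

	pthread_mutex_lock(&w->lock);
	while (!w->should_stop) {
		if (w->should_park) {
			w->parked = true;
			pthread_cond_broadcast(&w->cond); /* wake worker_park() */
			while (w->should_park && !w->should_stop)
				pthread_cond_wait(&w->cond, &w->lock);
			w->parked = false;
			continue;
		}
		w->iterations++;
		pthread_mutex_unlock(&w->lock);
		sched_yield();		/* let the controller take the lock */
		pthread_mutex_lock(&w->lock);
	}
	pthread_mutex_unlock(&w->lock);
	return NULL;
}

static int worker_start(struct worker *w)
{
	pthread_mutex_init(&w->lock, NULL);
	pthread_cond_init(&w->cond, NULL);
	w->should_park = w->should_stop = w->parked = false;
	w->iterations = 0;
	return pthread_create(&w->thread, NULL, worker_fn, w);
}

/* Ask the worker to idle and wait until it has actually done so. */
static void worker_park(struct worker *w)
{
	pthread_mutex_lock(&w->lock);
	w->should_park = true;
	while (!w->parked)
		pthread_cond_wait(&w->cond, &w->lock);
	pthread_mutex_unlock(&w->lock);
}

/* Let a parked worker resume; no thread creation is needed. */
static void worker_unpark(struct worker *w)
{
	pthread_mutex_lock(&w->lock);
	w->should_park = false;
	pthread_cond_broadcast(&w->cond);
	pthread_mutex_unlock(&w->lock);
}

static void worker_stop(struct worker *w)
{
	pthread_mutex_lock(&w->lock);
	w->should_stop = true;
	pthread_cond_broadcast(&w->cond);
	pthread_mutex_unlock(&w->lock);
	pthread_join(w->thread, NULL);
}
```

In a hotplug setting, worker_park() would correspond to the offline path
and worker_unpark() to the online path, so that repeated offline/online
cycles cost a wakeup rather than a create/destroy pair.
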
> > WEAN CPU-HOTPLUG OFFLINING FROM stop_machine()
> >
> >
> > 1. CPU_DYING notifier fixes needed as of 3.2:
> >
> > o vfp_hotplug(): I believe that this works as-is.
> > o s390_nohz_notify(): I believe that this works as-is.
> > o x86_pmu_notifier(): I believe that this works as-is.
> > o perf_ibs_cpu_notifier(): I don't know enough about
> > APIC to say.
> > o tboot_cpu_callback(): I believe that this works as-is,
> > but this one returns NOTIFY_BAD to a CPU_DYING notifier,
> > which is badness. But it looks like that case is a
> > "cannot happen" case. Still needs to be fixed.
> > o clockevents_notify(): This one acquires a global lock,
> > so it should be safe as-is.
> > o console_cpu_notify(): This one takes the same action
> > for CPU_ONLINE, CPU_DEAD, CPU_DOWN_FAILED, and
> > CPU_UP_CANCELED that it does for CPU_DYING, so it
> > should be OK.
> > * rcu_cpu_notify(): This one needs adjustment as noted
> > above, but nothing major. Patch has been posted,
> > probably needs a bit of debugging.
> > o migration_call(): I defer to Peter on this one.
> > It looks to me like it is written to handle other
> > CPUs, but...
> > * workqueue_cpu_callback(): Might need help, does a
> > non-atomic OR.
> > o kvm_cpu_hotplug(): Uses a global spinlock, so should
> > be OK as-is.
> >
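To illustrate the "non-atomic OR" concern noted for
workqueue_cpu_callback(), here is a small userspace sketch.  The flag
names and variables are hypothetical and are not the workqueue code's
actual fields.  Presumably a plain OR is harmless while stop_machine()
holds the other CPUs still, but it can lose updates once other CPUs are
allowed to run concurrently.

```c
#include <assert.h>
#include <stdatomic.h>

/* Hypothetical flag bits, for illustration only. */
#define FLAG_A 0x1u
#define FLAG_B 0x4u

static unsigned int plain_flags;	/* updated with a plain |= */
static atomic_uint shared_flags;	/* updated with an atomic OR */

/*
 * Non-atomic OR: the load, the OR, and the store are separate steps.
 * If two CPUs run this concurrently on different bits, one CPU's store
 * can overwrite the other's, losing a bit.
 */
static void set_flag_nonatomic(unsigned int bit)
{
	plain_flags |= bit;
}

/*
 * Atomic OR: a single indivisible read-modify-write, so concurrent
 * callers cannot lose each other's bits.  Returns the previous value.
 */
static unsigned int set_flag_atomic(unsigned int bit)
{
	return atomic_fetch_or(&shared_flags, bit);
}
```

An alternative that avoids atomics altogether is to protect the flags
word with a lock that all updaters already hold.
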
> > 2. Evaluate designs for stop_machine()-free CPU hotplug.
> > Implement the chosen design. An outline for a particular
> > design is shown below, but the actual design might be
> > quite different.
> >
> > 3. Fix issues with CPU Hotplug callback registration. Currently
> > there is no totally-race-free way to register callbacks and do
> > setup for already online cpus.
> >
> > Srivatsa had posted an incomplete patchset some time ago
> > regarding this, which gives an idea of the direction he had
> > in mind.
> > http://thread.gmane.org/gmane.linux.kernel/1258880/focus=15826
>
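One possible shape for race-free callback registration (item 3 above) is
sketched below as a userspace toy.  This is not taken from the patchset
mentioned above, and all names are hypothetical.  The idea is simply
that registering the callback and doing setup for already-online CPUs
happen in the same critical section that CPU-online operations take, so
no CPU can come online in between and be either missed or set up twice.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_CPUS	8
#define MAX_CALLBACKS	4

typedef void (*cpu_online_cb)(int cpu);

/* Hypothetical toy state; one lock covers the online map and the list. */
static pthread_mutex_t hotplug_lock = PTHREAD_MUTEX_INITIALIZER;
static bool cpu_is_online[MAX_CPUS];
static cpu_online_cb callbacks[MAX_CALLBACKS];
static size_t nr_callbacks;

/* Mark a CPU online and notify registered callbacks, all under the lock. */
static int toy_cpu_up(int cpu)
{
	if (cpu < 0 || cpu >= MAX_CPUS)
		return -1;
	pthread_mutex_lock(&hotplug_lock);
	if (!cpu_is_online[cpu]) {
		cpu_is_online[cpu] = true;
		for (size_t i = 0; i < nr_callbacks; i++)
			callbacks[i](cpu);
	}
	pthread_mutex_unlock(&hotplug_lock);
	return 0;
}

/*
 * Register a callback and run it for every already-online CPU in one
 * critical section.  Callbacks run with the lock held, so they must not
 * call back into these functions.
 */
static int toy_register_and_setup(cpu_online_cb cb)
{
	int ret = -1;

	pthread_mutex_lock(&hotplug_lock);
	if (nr_callbacks < MAX_CALLBACKS) {
		for (int cpu = 0; cpu < MAX_CPUS; cpu++)
			if (cpu_is_online[cpu])
				cb(cpu);
		callbacks[nr_callbacks++] = cb;
		ret = 0;
	}
	pthread_mutex_unlock(&hotplug_lock);
	return ret;
}

/* Example callback: counts how many times per-CPU setup has run. */
static int setup_count;

static void count_setup(int cpu)
{
	(void)cpu;
	setup_count++;
}
```

With this arrangement each CPU sees the callback exactly once, whether
it came online before or after registration.
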
> Gah, this has been "incomplete" for quite some time now.. I'll try to speed up
> things a bit :-)

Sounds good to me!  ;-)

							Thanx, Paul