From: Peter Zijlstra <peterz@infradead.org>
To: Valentin Schneider <valentin.schneider@arm.com>
Cc: mingo@kernel.org, tglx@linutronix.de,
linux-kernel@vger.kernel.org, jiangshanlai@gmail.com,
cai@redhat.com, vincent.donnefort@arm.com, decui@microsoft.com,
paulmck@kernel.org, vincent.guittot@linaro.org,
rostedt@goodmis.org, tj@kernel.org
Subject: Re: [PATCH 7/8] sched: Fix CPU hotplug / tighten is_per_cpu_kthread()
Date: Mon, 18 Jan 2021 10:30:21 +0100 [thread overview]
Message-ID: <YAVVLRWCLbnQoXz2@hirez.programming.kicks-ass.net> (raw)
In-Reply-To: <jhjsg6z4i2w.mognet@arm.com>
On Sun, Jan 17, 2021 at 04:57:27PM +0000, Valentin Schneider wrote:
> On 16/01/21 12:30, Peter Zijlstra wrote:
> > @@ -1796,13 +1796,28 @@ static inline bool rq_has_pinned_tasks(s
> > */
> > static inline bool is_cpu_allowed(struct task_struct *p, int cpu)
> > {
> > + /* When not in the task's cpumask, no point in looking further. */
> > if (!cpumask_test_cpu(cpu, p->cpus_ptr))
> > return false;
> >
> > + /* migrate_disabled() must be allowed to finish. */
> > + if (is_migration_disabled(p))
> > return cpu_online(cpu);
> >
> > + /* Non kernel threads are not allowed during either online or offline. */
> > + if (!(p->flags & PF_KTHREAD))
> > + return cpu_active(cpu);
> > +
> > + /* KTHREAD_IS_PER_CPU is always allowed. */
> > + if (kthread_is_per_cpu(p))
> > + return cpu_online(cpu);
> > +
> > + /* Regular kernel threads don't get to stay during offline. */
> > + if (cpu_rq(cpu)->balance_callback == &balance_push_callback)
> > + return cpu_active(cpu);
>
> is_cpu_allowed(, cpu) isn't guaranteed to have cpu_rq(cpu)'s rq_lock
> held, so this can race with balance_push_set(, true). This shouldn't
> matter under normal circumstances as we'll have sched_cpu_wait_empty()
> further down the line.
>
> This might get ugly with the rollback faff - this is jumping the gun a
> bit, but that's something we'll have to address, and I think what I'm
> concerned about is close to what you mentioned in
>
> http://lore.kernel.org/r/YAM1t2Qzr7Rib3bN@hirez.programming.kicks-ass.net
>
> Here's what I'm thinking of:
>
> _cpu_up() ttwu()
> select_task_rq()
> is_cpu_allowed()
> rq->balance_callback != balance_push_callback
> smpboot_unpark_threads() // FAIL
> (now going down, set push here)
> sched_cpu_wait_empty()
> ... ttwu_queue()
> sched_cpu_dying()
> *ARGH*
>
Let me try this then...
diff --git a/kernel/sched/core.c b/kernel/sched/core.c
index 5057054b1cff..9b045296d646 100644
--- a/kernel/sched/core.c
+++ b/kernel/sched/core.c
@@ -7495,6 +7495,8 @@ int sched_cpu_activate(unsigned int cpu)
return 0;
}
+unsigned long sched_cpu_rcu_state;
+
int sched_cpu_deactivate(unsigned int cpu)
{
struct rq *rq = cpu_rq(cpu);
@@ -7519,6 +7521,11 @@ int sched_cpu_deactivate(unsigned int cpu)
*/
balance_push_set(cpu, true);
+ /*
+ * See sched_cpu_wait_empty().
+ */
+ sched_cpu_rcu_state = get_state_synchronize_rcu();
+
rq_lock_irqsave(rq, &rf);
if (rq->rd) {
update_rq_clock(rq);
@@ -7578,6 +7585,12 @@ int sched_cpu_starting(unsigned int cpu)
*/
int sched_cpu_wait_empty(unsigned int cpu)
{
+ /*
+ * Guarantee that TTWU will observe balance_push_set(true),
+ * such that all wakeups will refuse this CPU.
+ */
+ cond_synchronize_rcu(sched_cpu_rcu_state);
+
balance_hotplug_wait();
return 0;
}
next prev parent reply other threads:[~2021-01-18 9:47 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-01-16 11:30 [PATCH 0/8] sched: Fix hot-unplug regressions Peter Zijlstra
2021-01-16 11:30 ` [PATCH 1/8] sched/core: Print out straggler tasks in sched_cpu_dying() Peter Zijlstra
2021-01-16 11:30 ` [PATCH 2/8] workqueue: Use cpu_possible_mask instead of cpu_active_mask to break affinity Peter Zijlstra
2021-01-16 11:30 ` [PATCH 3/8] sched: Dont run cpu-online with balance_push() enabled Peter Zijlstra
2021-01-16 15:27 ` Peter Zijlstra
2021-01-16 11:30 ` [PATCH 4/8] kthread: Extract KTHREAD_IS_PER_CPU Peter Zijlstra
2021-01-16 11:30 ` [PATCH 5/8] workqueue: Tag bound workers with KTHREAD_IS_PER_CPU Peter Zijlstra
2021-01-16 11:30 ` [PATCH 6/8] workqueue: Restrict affinity change to rescuer Peter Zijlstra
2021-01-16 11:30 ` [PATCH 7/8] sched: Fix CPU hotplug / tighten is_per_cpu_kthread() Peter Zijlstra
2021-01-17 16:57 ` Valentin Schneider
2021-01-18 9:30 ` Peter Zijlstra [this message]
2021-01-16 11:30 ` [PATCH 8/8] sched: Relax the set_cpus_allowed_ptr() semantics Peter Zijlstra
2021-01-16 14:39 ` Lai Jiangshan
2021-01-16 15:19 ` Peter Zijlstra
2021-01-16 15:25 ` [PATCH 0/8] sched: Fix hot-unplug regressions Peter Zijlstra
2021-01-16 15:45 ` Paul E. McKenney
2021-01-16 18:51 ` Peter Zijlstra
2021-01-16 15:48 ` Paul E. McKenney
2021-01-18 5:28 ` Paul E. McKenney
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YAVVLRWCLbnQoXz2@hirez.programming.kicks-ass.net \
--to=peterz@infradead.org \
--cc=cai@redhat.com \
--cc=decui@microsoft.com \
--cc=jiangshanlai@gmail.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@kernel.org \
--cc=paulmck@kernel.org \
--cc=rostedt@goodmis.org \
--cc=tglx@linutronix.de \
--cc=tj@kernel.org \
--cc=valentin.schneider@arm.com \
--cc=vincent.donnefort@arm.com \
--cc=vincent.guittot@linaro.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox