All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] kthread: kthread_bind fails to enforce CPU affinity (fixes kernel BUG at kernel/smpboot.c:134!)
@ 2014-12-08  3:27 ` Anton Blanchard
  0 siblings, 0 replies; 33+ messages in thread
From: Anton Blanchard @ 2014-12-08  3:27 UTC (permalink / raw)
  To: lkp

[-- Attachment #1: Type: text/plain, Size: 1850 bytes --]

I have a busy ppc64le KVM box where guests sometimes hit the infamous
"kernel BUG at kernel/smpboot.c:134!" issue during boot:

BUG_ON(td->cpu != smp_processor_id());

Basically a per CPU hotplug thread scheduled on the wrong CPU. The oops
output confirms it:

CPU: 0
Comm: watchdog/130

The issue is in kthread_bind where we set the cpus_allowed mask, but do
not touch task_thread_info(p)->cpu. The scheduler assumes the previously
scheduled CPU is in the cpus_allowed mask, but in this case we are
moving a thread to another CPU so it is not.

We used to call set_task_cpu which sets task_thread_info(p)->cpu (in fact
kthread_bind still has a comment suggesting this). That was removed in
e2912009fb7b ("sched: Ensure set_task_cpu() is never called on blocked
tasks").

Since we cannot call set_task_cpu (the task is in a sleeping state),
just do an explicit set of task_thread_info(p)->cpu.

Fixes: e2912009fb7b ("sched: Ensure set_task_cpu() is never called on blocked tasks")
Cc: stable(a)vger.kernel.org
Signed-off-by: Anton Blanchard <anton@samba.org>
---
 kernel/kthread.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/kernel/kthread.c b/kernel/kthread.c
index 10e489c..e40ab1d 100644
--- a/kernel/kthread.c
+++ b/kernel/kthread.c
@@ -327,13 +327,14 @@ EXPORT_SYMBOL(kthread_create_on_node);
 
 static void __kthread_bind(struct task_struct *p, unsigned int cpu, long state)
 {
-	/* Must have done schedule() in kthread() before we set_task_cpu */
+	/* Must have done schedule() in kthread() before we change affinity */
 	if (!wait_task_inactive(p, state)) {
 		WARN_ON(1);
 		return;
 	}
 	/* It's safe because the task is inactive. */
 	do_set_cpus_allowed(p, cpumask_of(cpu));
+	task_thread_info(p)->cpu = cpu;
 	p->flags |= PF_NO_SETAFFINITY;
 }
 
-- 
2.1.0


^ permalink raw reply related	[flat|nested] 33+ messages in thread

end of thread, other threads:[~2014-12-10 23:06 UTC | newest]

Thread overview: 33+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-12-08  3:27 [PATCH] kthread: kthread_bind fails to enforce CPU affinity (fixes kernel BUG at kernel/smpboot.c:134!) Anton Blanchard
2014-12-08  3:27 ` Anton Blanchard
2014-12-08  3:27 ` Anton Blanchard
2014-12-08  4:28 ` Linus Torvalds
2014-12-08  4:28   ` Linus Torvalds
2014-12-08  4:28   ` Linus Torvalds
2014-12-08  4:46   ` Anton Blanchard
2014-12-08  4:46     ` Anton Blanchard
2014-12-08  4:46     ` Anton Blanchard
2014-12-08  8:34 ` Ingo Molnar
2014-12-08  8:34   ` Ingo Molnar
2014-12-08  8:34   ` Ingo Molnar
2014-12-08 10:18   ` Anton Blanchard
2014-12-08 10:18     ` Anton Blanchard
2014-12-08 10:18     ` Anton Blanchard
2014-12-08 23:58     ` [PATCH] powerpc: secondary CPUs signal to master before setting active and online " Anton Blanchard
2014-12-08 23:58       ` Anton Blanchard
2014-12-08 23:58       ` Anton Blanchard
2014-12-09 20:54       ` Linus Torvalds
2014-12-09 20:54         ` Linus Torvalds
2014-12-09 20:54         ` Linus Torvalds
2014-12-10 14:08         ` Thomas Gleixner
2014-12-10 14:08           ` Thomas Gleixner
2014-12-10 14:08           ` Thomas Gleixner
2014-12-10 23:06         ` Michael Ellerman
2014-12-10 23:06           ` Michael Ellerman
2014-12-10 23:06           ` Michael Ellerman
2014-12-08 13:54 ` [PATCH] kthread: kthread_bind fails to enforce CPU affinity " Steven Rostedt
2014-12-08 13:54   ` Steven Rostedt
2014-12-08 13:54   ` Steven Rostedt
2014-12-09  2:24   ` Lai Jiangshan
2014-12-09  2:24     ` Lai Jiangshan
2014-12-09  2:24     ` Lai Jiangshan

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.