public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH 1/4] 2.6.2-rc2-mm2 CPU Hotplug: cpu_active_map
@ 2004-01-31 14:16 Rusty Russell
  2004-02-02 10:08 ` Ingo Molnar
  2004-02-03 21:22 ` Matthew Dobson
  0 siblings, 2 replies; 4+ messages in thread
From: Rusty Russell @ 2004-01-31 14:16 UTC (permalink / raw)
  To: akpm; +Cc: linux-kernel, Nick Piggin, dipankar, vatsa

Hi Andrew et al.

	Forward ported to -mm tree, so there might still be bugs.
Wanted to get these aired more widely.  Especially get Nick exposed to
the sched parts.

Name: Create cpu_active_map
Author: Rusty Russell
Status: Booted on 2.6.2-rc2-bk2

D: When CPUs are going down, there is a time when cpu_online(cpu) is
D: false, but they are still scheduling and responding to interrupts
D: (we are migrating things off the CPU, shutting down per-cpu
D: threads, etc).  It turns out that RCU cares about these CPUs, so
D: the decision was made to expose this mask (previously internal to x86,
D: and only used for IPIs).
D: 
D: The semantics of this mask are as follows:
D: 1) For platforms without hot unplug of CPUs: cpu_active_map ==
D:    cpu_online_map.
D: 2) For the others, they are equal except for a CPU which is going
D:    down: cpu_online_map gets cleared by __cpu_disable(), cpu_ipi_map
D:    gets cleared in __cpu_die() once the CPU is no longer responding
D:    to interrupts.

diff -urpN --exclude TAGS -X /home/rusty/devel/kernel/kernel-patches/current-dontdiff --minimal .25707-linux-2.6.0-test11-bk7/include/linux/cpumask.h .25707-linux-2.6.0-test11-bk7.updated/include/linux/cpumask.h
--- .25707-linux-2.6.0-test11-bk7/include/linux/cpumask.h	2003-12-10 13:58:18.000000000 +1100
+++ .25707-linux-2.6.0-test11-bk7.updated/include/linux/cpumask.h	2003-12-10 14:01:29.000000000 +1100
@@ -43,6 +43,13 @@ typedef unsigned long cpumask_t;
 extern cpumask_t cpu_online_map;
 extern cpumask_t cpu_possible_map;
 
+#ifdef CONFIG_HOTPLUG_CPU
+/* Online, or on its way down but still receiving interrupts. */
+extern cpumask_t cpu_active_map;
+#else
+#define cpu_active_map cpu_online_map
+#endif
+
 #define num_online_cpus()		cpus_weight(cpu_online_map)
 #define cpu_online(cpu)			cpu_isset(cpu, cpu_online_map)
 #define cpu_possible(cpu)		cpu_isset(cpu, cpu_possible_map)
diff -urpN --exclude TAGS -X /home/rusty/devel/kernel/kernel-patches/current-dontdiff --minimal .25707-linux-2.6.0-test11-bk7/kernel/rcupdate.c .25707-linux-2.6.0-test11-bk7.updated/kernel/rcupdate.c
--- .25707-linux-2.6.0-test11-bk7/kernel/rcupdate.c	2003-12-10 13:58:19.000000000 +1100
+++ .25707-linux-2.6.0-test11-bk7.updated/kernel/rcupdate.c	2003-12-10 14:01:03.000000000 +1100
@@ -110,7 +110,7 @@ static void rcu_start_batch(long newbatc
 	    !cpus_empty(rcu_ctrlblk.rcu_cpu_mask)) {
 		return;
 	}
-	rcu_ctrlblk.rcu_cpu_mask = cpu_online_map;
+	rcu_ctrlblk.rcu_cpu_mask = cpu_active_map;
 }
 
 /*
--
  Anyone who quotes me in their sig is an idiot. -- Rusty Russell.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH 1/4] 2.6.2-rc2-mm2 CPU Hotplug: cpu_active_map
  2004-01-31 14:16 [PATCH 1/4] 2.6.2-rc2-mm2 CPU Hotplug: cpu_active_map Rusty Russell
@ 2004-02-02 10:08 ` Ingo Molnar
  2004-02-02 11:08   ` Rusty Russell
  2004-02-03 21:22 ` Matthew Dobson
  1 sibling, 1 reply; 4+ messages in thread
From: Ingo Molnar @ 2004-02-02 10:08 UTC (permalink / raw)
  To: Rusty Russell; +Cc: akpm, linux-kernel, Nick Piggin, dipankar, vatsa


* Rusty Russell <rusty@rustcorp.com.au> wrote:

> D: When CPUs are going down, there is a time when cpu_online(cpu) is
> D: false, but they are still scheduling and responding to interrupts
> D: (we are migrating things off the CPU, shutting down per-cpu
> D: threads, etc).  It turns out that RCU cares about these CPUs, so
> D: the decision was made to expose this mask (previously internal to x86,
> D: and only used for IPIs).

these kinds of problems could be avoided by making the CPU-off as much
of an atomic operation as possible. The less atomic it is, the more
kernel code is exposed to the transitional state - and since this is a
rare situation it will always have quality problems. Is there any killer
argument that makes it impossible to down a CPU atomically? Kernel
threads can get their callbacks on other CPUs just fine. Other tasks
should not care. If the migrate-off operation is done 100% atomically
then zero knowledge is needed by unrelated scheduler code about the act
of disabling a CPU.

	Ingo

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH 1/4] 2.6.2-rc2-mm2 CPU Hotplug: cpu_active_map
  2004-02-02 10:08 ` Ingo Molnar
@ 2004-02-02 11:08   ` Rusty Russell
  0 siblings, 0 replies; 4+ messages in thread
From: Rusty Russell @ 2004-02-02 11:08 UTC (permalink / raw)
  To: Ingo Molnar; +Cc: akpm, linux-kernel, Nick Piggin, dipankar, vatsa

In message <20040202100827.GA28870@elte.hu> you write:
> 
> * Rusty Russell <rusty@rustcorp.com.au> wrote:
> 
> > D: When CPUs are going down, there is a time when cpu_online(cpu) is
> > D: false, but they are still scheduling and responding to interrupts
> > D: (we are migrating things off the CPU, shutting down per-cpu
> > D: threads, etc).  It turns out that RCU cares about these CPUs, so
> > D: the decision was made to expose this mask (previously internal to x86,
> > D: and only used for IPIs).
> 
> these kinds of problems could be avoided by making the CPU-off as much
> of an atomic operation as possible. The less atomic it is, the more
> kernel code is exposed to the transitional state - and since this is a
> rare situation it will always have quality problems. Is there any killer
> argument that makes it impossible to down a CPU atomically? Kernel
> threads can get their callbacks on other CPUs just fine.

Well, we could grab control of the machine, unconditionally move all
the threads and kill the cpu.

We'd need a lot of locks, but we could just use a bogolock (a-la
module.c's schedule a thread per cpu and tell them all to disable
interrupts) which has the same effect of grabbing every spinlock.

It means that kernel threads must not use smp_processor_id(), because
it might change under them (get_cpu() would protect from this).  You'd
still need explicit shutdown code.

> Other tasks should not care. If the migrate-off operation is done
> 100% atomically then zero knowledge is needed by unrelated scheduler
> code about the act of disabling a CPU.

No, migration code still needs to check the cpu is still up after
you've slept.

I'll sleep on it and see if it still sounds like a good idea in the
morning 8)
Rusty.
--
  Anyone who quotes me in their sig is an idiot. -- Rusty Russell.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH 1/4] 2.6.2-rc2-mm2 CPU Hotplug: cpu_active_map
  2004-01-31 14:16 [PATCH 1/4] 2.6.2-rc2-mm2 CPU Hotplug: cpu_active_map Rusty Russell
  2004-02-02 10:08 ` Ingo Molnar
@ 2004-02-03 21:22 ` Matthew Dobson
  1 sibling, 0 replies; 4+ messages in thread
From: Matthew Dobson @ 2004-02-03 21:22 UTC (permalink / raw)
  To: Rusty Russell; +Cc: akpm, linux-kernel, Nick Piggin, dipankar, vatsa

Rusty Russell wrote:
> D: The semantics of this mask are as follows:
> D: 1) For platforms without hot unplug of CPUs: cpu_active_map ==
> D:    cpu_online_map.
> D: 2) For the others, they are equal except for a CPU which is going
> D:    down: cpu_online_map gets cleared by __cpu_disable(), cpu_ipi_map
> D:    gets cleared in __cpu_die() once the CPU is no longer responding
> D:    to interrupts.

Small typo, but I believe you meant to type 'cpu_active_map' in part 2, 
not 'cpu_ipi_map'.

Cheers!

-Matt


^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2004-02-03 21:28 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-01-31 14:16 [PATCH 1/4] 2.6.2-rc2-mm2 CPU Hotplug: cpu_active_map Rusty Russell
2004-02-02 10:08 ` Ingo Molnar
2004-02-02 11:08   ` Rusty Russell
2004-02-03 21:22 ` Matthew Dobson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox