* [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop
@ 2007-06-25 21:38 Loic Prylli
2007-06-25 22:05 ` Chuck Ebbert
0 siblings, 1 reply; 5+ messages in thread
From: Loic Prylli @ 2007-06-25 21:38 UTC (permalink / raw)
To: linux-kernel
Processors synchronization in set_mtrr requires the .gate field
to be set after .count field is properly initialized. Without an explicit
barrier, the compiler was reordering those memory stores. That was sometimes
causing a processor (in ipi_handler) to see the .gate change and
decrement .count before the latter is set by set_mtrr() (which
then hangs in a infinite loop with irqs disabled).
Signed-off-by: Loic Prylli <loic@myri.com>
---
arch/i386/kernel/cpu/mtrr/main.c | 4 ++++
1 files changed, 4 insertions(+), 0 deletions(-)
diff --git a/arch/i386/kernel/cpu/mtrr/main.c b/arch/i386/kernel/cpu/mtrr/main.c
index 55b0051..75dc6d5 100644
--- a/arch/i386/kernel/cpu/mtrr/main.c
+++ b/arch/i386/kernel/cpu/mtrr/main.c
@@ -229,6 +229,8 @@ static void set_mtrr(unsigned int reg, unsigned long base,
data.smp_size = size;
data.smp_type = type;
atomic_set(&data.count, num_booting_cpus() - 1);
+ /* make sure data.count is visible before unleashing other CPUs */
+ smp_wmb();
atomic_set(&data.gate,0);
/* Start the ball rolling on other CPUs */
@@ -242,6 +244,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
/* ok, reset count and toggle gate */
atomic_set(&data.count, num_booting_cpus() - 1);
+ smp_wmb();
atomic_set(&data.gate,1);
/* do our MTRR business */
@@ -260,6 +263,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
cpu_relax();
atomic_set(&data.count, num_booting_cpus() - 1);
+ smp_wmb();
atomic_set(&data.gate,0);
/*
-- 1.5.2.2
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop
2007-06-25 21:38 [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop Loic Prylli
@ 2007-06-25 22:05 ` Chuck Ebbert
2007-06-25 22:34 ` Andi Kleen
0 siblings, 1 reply; 5+ messages in thread
From: Chuck Ebbert @ 2007-06-25 22:05 UTC (permalink / raw)
To: Loic Prylli; +Cc: linux-kernel, Andi Kleen
On 06/25/2007 05:38 PM, Loic Prylli wrote:
[cc: Andi]
> Processors synchronization in set_mtrr requires the .gate field
> to be set after .count field is properly initialized. Without an explicit
> barrier, the compiler was reordering those memory stores. That was sometimes
> causing a processor (in ipi_handler) to see the .gate change and
> decrement .count before the latter is set by set_mtrr() (which
> then hangs in a infinite loop with irqs disabled).
>
> Signed-off-by: Loic Prylli <loic@myri.com>
> ---
> arch/i386/kernel/cpu/mtrr/main.c | 4 ++++
> 1 files changed, 4 insertions(+), 0 deletions(-)
>
> diff --git a/arch/i386/kernel/cpu/mtrr/main.c b/arch/i386/kernel/cpu/mtrr/main.c
> index 55b0051..75dc6d5 100644
> --- a/arch/i386/kernel/cpu/mtrr/main.c
> +++ b/arch/i386/kernel/cpu/mtrr/main.c
> @@ -229,6 +229,8 @@ static void set_mtrr(unsigned int reg, unsigned long base,
> data.smp_size = size;
> data.smp_type = type;
> atomic_set(&data.count, num_booting_cpus() - 1);
> + /* make sure data.count is visible before unleashing other CPUs */
> + smp_wmb();
> atomic_set(&data.gate,0);
>
> /* Start the ball rolling on other CPUs */
> @@ -242,6 +244,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
>
> /* ok, reset count and toggle gate */
> atomic_set(&data.count, num_booting_cpus() - 1);
> + smp_wmb();
> atomic_set(&data.gate,1);
>
> /* do our MTRR business */
> @@ -260,6 +263,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
> cpu_relax();
>
> atomic_set(&data.count, num_booting_cpus() - 1);
> + smp_wmb();
> atomic_set(&data.gate,0);
>
> /*
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop
2007-06-25 22:05 ` Chuck Ebbert
@ 2007-06-25 22:34 ` Andi Kleen
2007-06-26 1:40 ` Loic Prylli
2007-06-28 19:52 ` Chuck Ebbert
0 siblings, 2 replies; 5+ messages in thread
From: Andi Kleen @ 2007-06-25 22:34 UTC (permalink / raw)
To: Chuck Ebbert; +Cc: Loic Prylli, linux-kernel
On Tuesday 26 June 2007 00:05:17 Chuck Ebbert wrote:
> On 06/25/2007 05:38 PM, Loic Prylli wrote:
>
> [cc: Andi]
>
> > Processors synchronization in set_mtrr requires the .gate field
> > to be set after .count field is properly initialized. Without an explicit
> > barrier, the compiler was reordering those memory stores. That was sometimes
> > causing a processor (in ipi_handler) to see the .gate change and
> > decrement .count before the latter is set by set_mtrr() (which
> > then hangs in a infinite loop with irqs disabled).
Hmm, perhaps we should just put the smp_wmb into atomic_set().
Near all other atomic operations have memory barriers too. I think
that would be the better fix.
-Andi
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop
2007-06-25 22:34 ` Andi Kleen
@ 2007-06-26 1:40 ` Loic Prylli
2007-06-28 19:52 ` Chuck Ebbert
1 sibling, 0 replies; 5+ messages in thread
From: Loic Prylli @ 2007-06-26 1:40 UTC (permalink / raw)
To: Andi Kleen; +Cc: Chuck Ebbert, linux-kernel
On 6/25/2007 6:34 PM, Andi Kleen wrote:
> On Tuesday 26 June 2007 00:05:17 Chuck Ebbert wrote:
>
>> On 06/25/2007 05:38 PM, Loic Prylli wrote:
>>
>> [cc: Andi]
>>
>>
>>> Processors synchronization in set_mtrr requires the .gate field
>>> to be set after .count field is properly initialized. Without an explicit
>>> barrier, the compiler was reordering those memory stores. That was sometimes
>>> causing a processor (in ipi_handler) to see the .gate change and
>>> decrement .count before the latter is set by set_mtrr() (which
>>> then hangs in a infinite loop with irqs disabled).
>>>
>
> Hmm, perhaps we should just put the smp_wmb into atomic_set().
> Near all other atomic operations have memory barriers too. I think
> that would be the better fix.
>
> -Andi
>
In Documentation/atomic_ops.txt atomic_set/atomic_read are described as
nothing more than a type-safe assignement or reading, without any extra
semantics. For other atomic operations, the rule is that any atomic
operation that doesn't return a value doesn't come with a barrier (and
any operation that returns the atomic value must have memory barriers).
So I guess you are suggesting to change the doc and the implementation
for all arches.
I should admit I did not knew a number of atomic operations did not
imply memory-barriers before. But maybe the extra cost might not be
completely negligible, especially if, for consistency with other
"barrier-implied" atomic operations, a new memory barrier is put before
and after,
Are you suggested changing just atomic_set(), or also other barrier-free
atomic operations :"atomic_dec", "atomic_inc", "atomic_add", "atomic_sub" ?
Independently of what is done to atomic, what about not making the .gate
field an atomic_t, but a simple "int" in the mttr code, since the only
operations done on it are atomic_read and atomic_set?
Loic
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop
2007-06-25 22:34 ` Andi Kleen
2007-06-26 1:40 ` Loic Prylli
@ 2007-06-28 19:52 ` Chuck Ebbert
1 sibling, 0 replies; 5+ messages in thread
From: Chuck Ebbert @ 2007-06-28 19:52 UTC (permalink / raw)
To: Andi Kleen; +Cc: Loic Prylli, linux-kernel
On 06/25/2007 06:34 PM, Andi Kleen wrote:
> On Tuesday 26 June 2007 00:05:17 Chuck Ebbert wrote:
>> On 06/25/2007 05:38 PM, Loic Prylli wrote:
>>
>> [cc: Andi]
>>
>>> Processors synchronization in set_mtrr requires the .gate field
>>> to be set after .count field is properly initialized. Without an explicit
>>> barrier, the compiler was reordering those memory stores. That was sometimes
>>> causing a processor (in ipi_handler) to see the .gate change and
>>> decrement .count before the latter is set by set_mtrr() (which
>>> then hangs in a infinite loop with irqs disabled).
>
> Hmm, perhaps we should just put the smp_wmb into atomic_set().
> Near all other atomic operations have memory barriers too. I think
> that would be the better fix.
Can we get something merged before 2.6.22-final?
The original patch seems okay...
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2007-06-28 19:52 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-06-25 21:38 [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop Loic Prylli
2007-06-25 22:05 ` Chuck Ebbert
2007-06-25 22:34 ` Andi Kleen
2007-06-26 1:40 ` Loic Prylli
2007-06-28 19:52 ` Chuck Ebbert
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox