Re: [patch 4/6] Fix SMP poweroff hangs

public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed

* Re: [patch 4/6] Fix SMP poweroff hangs
       [not found]                 ` <4702C902.7000504@rtr.ca>
@ 2007-10-05 13:39                   ` Mark Lord
       [not found]                   ` <4702CCFB.1010000@pobox.com>
  1 sibling, 0 replies; 3+ messages in thread
From: Mark Lord @ 2007-10-05 13:39 UTC (permalink / raw)
  To: Eric W. Biederman, Linux Kernel
  Cc: Rafael J. Wysocki, Brown, Len, Linus Torvalds, Andrew Morton,
	Thomas Gleixner, simon.derr, Alexey Starikovskiy

Reposting this.. somehow left LKML off the TO/CC list.. (thanks Andrew!)

Eric W. Biederman wrote:
> Mark Lord <lkml@rtr.ca> writes:
>
>> Eric W. Biederman wrote:
>>> Mark Lord <lkml@rtr.ca> writes:
>>>>> - It could be that we changed the initialization order.
>>>>> - It could be that we changed the set of sysdev devices.
>>>>> - It could be we that cpu_down is doing something extra that we are not
>>>>>   doing.
>>>>>
>>>>> My guess is that Mark Lord, and Thomas Gleixner are sharp enough to
>>>>> have checked their command line parameters before submitting a kernel
>>>>> bugfix patch.  Although it would not hurt to have confirmation of
>>>>> that.
>>>> Right.  No fancy command line overrides here.
>>> Ok.
>>>
>>> Simple questions.
>>> - 32bit or 64bit x86?
>>> - Is this new failure mode or a regression?
>>> - By hang on power off I assume you mean that the power does
>>>   not go off?
>>> - Is there anything else you can tell me about this failure mode?
>> You could search kernel archives from last week,
>> when lots more info was posted in the original trouble reports.
>
> Sure.  I will. I was starting off a bit lazy.
>
>> My system is Core2Duo DualCore 1.86GHz, pure 32bit kernel/user.
>> System prints "Power Down." message, but power stays on.
>> Broken in 2.6.17, and in 2.6.23-rc*.  No other versions attempted.
>
> Ok.  So apparently not a regression.
>
>>> Mark, Thomas given the difficulty in reproducing the failure if
>>> someone has time I would love to see what happens if for testing:
>>> set_cpus_allowed(current, cpumask_of_cpu(1)); (instead of
>> disable_nonboot_cpus)
>>
>> System behaved fine on six consecutive poweroffs with that.
>> This could be due to subtle timing changes induced by the
>> context switches this produces, though.  I also tried it
>> as the very first line in the same function, and saw no change.
>
> Thanks.  This confirms something.  This is not an issue of which
> cpu you are running on (or else by forcing you onto the second
> core it would always fail).   At most it may be something
> happening on one cpu and the getting switched to another cpu.
>
> So this looks like something extra that cpu_down is doing.
>
>> I then removed that one line (and left out the disable_nonboot_cpus as well),
>> and it failed to power off on the very next attempt (got lucky).
>
> Ok.
>
>> Changed it to set_cpus_allowed(current, cpumask_of_cpu(0)),
>> and it survived six consecutive poweroffs.
>>
>> Changed it back to disable_nonboot_cpus(..),
>> and it also survived another four consecutive poweroffs.
>>
>> I also verified that both CPUs were enabled on entry to machine_poweroff().
>
> That both CPUS are enabled on entry to machine_poweroff is expected,
> and normal.
>
> The code path on i386 should be:
> machine_power_off
>   native_machine_power_off
>      machine_shutdown(); (which disables the other cpus)
>     smp_call_function
>        stop_this_cpu (on each cpu to be stopped.
>      pm_power_off(); (which turns off the power)
..
> This does sound like a race of some sort.

Mmm... thanks for the tour.

The cpu hotplug code appears to take great precautions against internal races
(dunno if it succeeds or not, though), but the correspond code in native_smp_send_stop()
looks a bit iffy by comparison.  I wonder if that's where it gets stuck?

static void native_smp_send_stop(void)
{
       /* Don't deadlock on the call lock in panic */
       int nolock = !spin_trylock(&call_lock);
       unsigned long flags;

       local_irq_save(flags);
       __smp_call_function(stop_this_cpu, NULL, 0, 0);
       if (!nolock)
               spin_unlock(&call_lock);
       disable_local_APIC();
       local_irq_restore(flags);
}

So basically, it tries to avoid races by grabbing the call_lock,
but then ignores that lock and proceeds anyway.  Recipe for a race?

-ml

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [patch 4/6] Fix SMP poweroff hangs
       [not found]                   ` <4702CCFB.1010000@pobox.com>
@ 2007-10-05 13:40                     ` Mark Lord
       [not found]                     ` <47052292.3090400@rtr.ca>
  1 sibling, 0 replies; 3+ messages in thread
From: Mark Lord @ 2007-10-05 13:40 UTC (permalink / raw)
  To: Linux Kernel
  Cc: Eric W. Biederman, Rafael J. Wysocki, Brown, Len, Linus Torvalds,
	Andrew Morton, Thomas Gleixner, simon.derr, Alexey Starikovskiy

(reposting this.. somehow left LKML off the TO/CC list)

Mark Lord wrote:
> Eric W. Biederman wrote:
>> Mark Lord <lkml@rtr.ca> writes:
..
>> The code path on i386 should be:
>> machine_power_off
>>   native_machine_power_off
>>      machine_shutdown(); (which disables the other cpus)
>>     smp_call_function
>>        stop_this_cpu (on each cpu to be stopped.
>>      pm_power_off(); (which turns off the power)
> ..
>> This does sound like a race of some sort.
>
> Mmm... thanks for the tour.
>
> The cpu hotplug code appears to take great precautions against internal races
> (dunno if it succeeds or not, though), but the correspond code in native_smp_send_stop()
> looks a bit iffy by comparison.  I wonder if that's where it gets stuck?
>
> static void native_smp_send_stop(void)
> {
>        /* Don't deadlock on the call lock in panic */
>        int nolock = !spin_trylock(&call_lock);
>        unsigned long flags;
>
>        local_irq_save(flags);
>        __smp_call_function(stop_this_cpu, NULL, 0, 0);
>        if (!nolock)
>                spin_unlock(&call_lock);
>        disable_local_APIC();
>        local_irq_restore(flags);
> }
>
> So basically, it tries to avoid races by grabbing the call_lock,
> but then ignores that lock and proceeds anyway.  Recipe for a race?

Hmmmm.. and that code also fails to wait for the other CPU(s)
to actually halt (not sure how it could wait, but it doesn't even try),
so the other CPU(s) could easily still be active when we then stumble
into our ACPI call to turn the power off.

I suppose that might possibly hurt something.

Cheers
-- 
Mark Lord
Real-Time Remedies Inc.
mlord@pobox.com

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [patch 4/6] Fix SMP poweroff hangs
       [not found]                       ` <m1myuyv6qu.fsf@ebiederm.dsl.xmission.com>
@ 2007-10-05 13:46                         ` Mark Lord
  0 siblings, 0 replies; 3+ messages in thread
From: Mark Lord @ 2007-10-05 13:46 UTC (permalink / raw)
  To: Eric W. Biederman
  Cc: Rafael J. Wysocki, Brown, Len, Linus Torvalds, Andrew Morton,
	Thomas Gleixner, simon.derr, Alexey Starikovskiy, Linux Kernel

Eric W. Biederman wrote:
> Mark Lord <lkml@rtr.ca> writes:
>> Comments?  Eric?
> 
> Sorry -EBUSY.
> 
> The trylock on call_lock is a recent thing (.22), added to keep panic
> from hanging.   It probably makes more sense to move bust_spinlocks(0)
> down a few lines in kernel/panic.c and to test oops_in_progress in
> smp_send_stop and make all screwy locking behavior depend on that.

I've seen this issue (to a much lessor extent) on 2.6.17 as well.
So if earlier kernels had the regular spin_lock() there (rather than trylock)
then this is still *an* issue, but not *the* issue.  And in my specific
case (only 2 CPUs), the trylock cannot be the problem here.
But with more cores it definitely could be an issue.

> As for not waiting until cpus are in their stop loop that looks easy
> enough to remedy.  I don't know if any of those is our culprit but
> it certainly looks worth cleaning up.  If you are interested in
> testing I will cook up a patch.

I suspect this one more, as there really are not many other possibilities.
One thing to try might be to just have the power_off() code do msleep(1000)
before the ACPI poweroff call.  That would give enough time for the other
core to "halt", and is easy enough to try for diagnosis purposes.

I'm off rock climbing for a week or so, but if you cook something up
in the meanwhile then I'll test it on return.

> I haven't had a chance yet to walk through the other possible x86
> sysdev devices to see if there are any other canidates.

Cheers!

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2007-10-05 13:46 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <200710010820.l918KApW006834@imap1.linux-foundation.org>
     [not found] ` <CFF307C98FEABE47A452B27C06B85BB603DA9AB8@hdsmsx411.amr.corp.intel.com>
     [not found]   ` <m1hclascwo.fsf@ebiederm.dsl.xmission.com>
     [not found]     ` <200710020048.38856.rjw@sisk.pl>
     [not found]       ` <m1myv2o13c.fsf@ebiederm.dsl.xmission.com>
     [not found]         ` <470198B1.2030003@rtr.ca>
     [not found]           ` <m1ve9pn8d1.fsf@ebiederm.dsl.xmission.com>
     [not found]             ` <470271EA.4040509@rtr.ca>
     [not found]               ` <m1ejgdif0f.fsf@ebiederm.dsl.xmission.com>
     [not found]                 ` <4702C902.7000504@rtr.ca>
2007-10-05 13:39                   ` [patch 4/6] Fix SMP poweroff hangs Mark Lord
     [not found]                   ` <4702CCFB.1010000@pobox.com>
2007-10-05 13:40                     ` Mark Lord
     [not found]                     ` <47052292.3090400@rtr.ca>
     [not found]                       ` <m1myuyv6qu.fsf@ebiederm.dsl.xmission.com>
2007-10-05 13:46                         ` Mark Lord

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox