* crash on shutdown on rs/6000 powerpc
@ 2007-10-09 13:00 Paul Mackerras
2007-10-10 15:45 ` Joel Schopp
[not found] ` <470CF38F.3080809%40austin.ibm.com>
0 siblings, 2 replies; 3+ messages in thread
From: Paul Mackerras @ 2007-10-09 13:00 UTC (permalink / raw)
To: torvalds, lkml, linuxppc-dev
I have just seen a crash at shutdown with 2.6.23-rc9 on an RS/6000
powerpc box (POWER3, 64-bit). It crashed immediately after printing
"Disabling non-boot CPUs", so I tried reverting 4047727e and that
fixed it. It's unfortunate that that commit added the
disable_nonboot_cpus for all architectures, when only x86[-64] needs
it. (At least it sounds like only x86[-64] needs it; the commit
message seems quite x86-centric.)
In case anyone on the linuxppc-dev list wants to chase this further,
the crash looks like this:
Disabling non-boot CPUs ...
cpu 0x1: Vector: 300 (Data Access) at [c00000003f923950]
pc: c00000000003bf98: .xics_set_cpu_priority+0x58/0x78
lr: c00000000003bff4: .xics_migrate_irqs_away+0x3c/0x20c
sp: c00000003f923bd0
msr: a000000000001032
dar: 4
dsisr: 42000000
current = 0xc0000000bfb6b060
paca = 0xc00000000071c600
pid = 2610, comm = kstopmachine
enter ? for help
[c00000003f923c40] c00000000003bff4 .xics_migrate_irqs_away+0x3c/0x20c
[c00000003f923d00] c000000000040d54 .pseries_cpu_disable+0x98/0xb4
[c00000003f923d80] c000000000028e4c .__cpu_disable+0x44/0x58
[c00000003f923df0] c00000000007e204 .take_cpu_down+0x34/0x60
[c00000003f923e70] c00000000008ba3c .do_stop+0x144/0x1e4
[c00000003f923f00] c00000000006fd74 .kthread+0x78/0xc4
[c00000003f923f90] c0000000000272a8 .kernel_thread+0x4c/0x68
I don't have time tonight or tomorrow to track this down further. I'm
taking the kids to the coast tommorow. :)
Paul.
^ permalink raw reply [flat|nested] 3+ messages in thread* Re: crash on shutdown on rs/6000 powerpc
2007-10-09 13:00 crash on shutdown on rs/6000 powerpc Paul Mackerras
@ 2007-10-10 15:45 ` Joel Schopp
[not found] ` <470CF38F.3080809%40austin.ibm.com>
1 sibling, 0 replies; 3+ messages in thread
From: Joel Schopp @ 2007-10-10 15:45 UTC (permalink / raw)
To: Paul Mackerras; +Cc: lkml, torvalds, linuxppc-dev
> [c00000003f923c40] c00000000003bff4 .xics_migrate_irqs_away+0x3c/0x20c
> [c00000003f923d00] c000000000040d54 .pseries_cpu_disable+0x98/0xb4
> [c00000003f923d80] c000000000028e4c .__cpu_disable+0x44/0x58
> [c00000003f923df0] c00000000007e204 .take_cpu_down+0x34/0x60
> [c00000003f923e70] c00000000008ba3c .do_stop+0x144/0x1e4
> [c00000003f923f00] c00000000006fd74 .kthread+0x78/0xc4
> [c00000003f923f90] c0000000000272a8 .kernel_thread+0x4c/0x68
>
> I don't have time tonight or tomorrow to track this down further. I'm
> taking the kids to the coast tommorow. :)
This looks like the problem Olof and Milton sent out patches to fix. The
Power3 shouldn't be doing cpu hotplug.
^ permalink raw reply [flat|nested] 3+ messages in thread[parent not found: <470CF38F.3080809%40austin.ibm.com>]
* Re: crash on shutdown on rs/6000 powerpc
[not found] ` <470CF38F.3080809%40austin.ibm.com>
@ 2007-10-10 16:18 ` Milton Miller
0 siblings, 0 replies; 3+ messages in thread
From: Milton Miller @ 2007-10-10 16:18 UTC (permalink / raw)
To: Joel Schopp; +Cc: linuxppc-dev, Paul Mackerras
Joel Schoopp wrote:
>Paulus wrote:
>> [c00000003f923c40] c00000000003bff4 .xics_migrate_irqs_away+0x3c/0x20c
>> [c00000003f923d00] c000000000040d54 .pseries_cpu_disable+0x98/0xb4
>> [c00000003f923d80] c000000000028e4c .__cpu_disable+0x44/0x58
>> [c00000003f923df0] c00000000007e204 .take_cpu_down+0x34/0x60
>> [c00000003f923e70] c00000000008ba3c .do_stop+0x144/0x1e4
>> [c00000003f923f00] c00000000006fd74 .kthread+0x78/0xc4
>> [c00000003f923f90] c0000000000272a8 .kernel_thread+0x4c
>>
>> I don't have time tonight or tomorrow to track this down further. I'm
>> taking the kids to the coast tommorow. :)
>>
> This looks like the problem Olof and Milton sent out patches to fix. The
> Power3 shouldn't be doing cpu hotplug.
Paul's comment prompted both of us.
Actually, cpu_hotplug (offline calling rtas stop-self, not dlpar
remove) should be supported on POWER3 and RS64, its just that the
pseries disable code calls over into xics directly. offline support
would help preserve system uptime under a "failing processor"
scenerio.
To fix that, we need a mpic version of migrate_irqs_away. powermac
is using the "bury head in sand" approach for that right now (ie
if you bind an interrupt and offline all the cpus in the mask, the
interrupt won't be delivered until you reset the affinity). Its also
a broken in that bringing a cpu back online adds the cpu to all
interrupts not just those who have the cpu mask bit set.
milton
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2007-10-10 16:25 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-10-09 13:00 crash on shutdown on rs/6000 powerpc Paul Mackerras
2007-10-10 15:45 ` Joel Schopp
[not found] ` <470CF38F.3080809%40austin.ibm.com>
2007-10-10 16:18 ` Milton Miller
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.