* Re: another Ultra2 SMP crash
2004-02-18 21:29 another Ultra2 SMP crash Chris Ricker
@ 2004-02-19 0:35 ` Ben Collins
2004-02-19 3:42 ` Chris Ricker
` (2 subsequent siblings)
3 siblings, 0 replies; 5+ messages in thread
From: Ben Collins @ 2004-02-19 0:35 UTC (permalink / raw)
To: sparclinux
On Wed, Feb 18, 2004 at 04:29:30PM -0500, Chris Ricker wrote:
> This is with 2.6.3-rc2, compiled SMP
>
> All I had on the serial console was
Do you have preempt enabled? I've already noticed from other people that
it is causing problems (one showed as a race conditition with autoload
of modules).
--
Debian - http://www.debian.org/
Linux 1394 - http://www.linux1394.org/
Subversion - http://subversion.tigris.org/
WatchGuard - http://www.watchguard.com/
^ permalink raw reply [flat|nested] 5+ messages in thread* Re: another Ultra2 SMP crash
2004-02-18 21:29 another Ultra2 SMP crash Chris Ricker
2004-02-19 0:35 ` Ben Collins
@ 2004-02-19 3:42 ` Chris Ricker
2004-02-20 15:49 ` Chris Ricker
2004-02-20 15:51 ` Chris Ricker
3 siblings, 0 replies; 5+ messages in thread
From: Chris Ricker @ 2004-02-19 3:42 UTC (permalink / raw)
To: sparclinux
[-- Attachment #1: Type: TEXT/PLAIN, Size: 486 bytes --]
On Wed, 18 Feb 2004, Ben Collins wrote:
> On Wed, Feb 18, 2004 at 04:29:30PM -0500, Chris Ricker wrote:
> > This is with 2.6.3-rc2, compiled SMP
> >
> > All I had on the serial console was
>
> Do you have preempt enabled? I've already noticed from other people that
> it is causing problems (one showed as a race conditition with autoload
> of modules).
I don't have it enabled.
I've attached my complete config, in case there's anything else potentially
suspicious
thanks,
chris
[-- Attachment #2: Type: APPLICATION/octet-stream, Size: 5998 bytes --]
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: another Ultra2 SMP crash
2004-02-18 21:29 another Ultra2 SMP crash Chris Ricker
2004-02-19 0:35 ` Ben Collins
2004-02-19 3:42 ` Chris Ricker
@ 2004-02-20 15:49 ` Chris Ricker
2004-02-20 15:51 ` Chris Ricker
3 siblings, 0 replies; 5+ messages in thread
From: Chris Ricker @ 2004-02-20 15:49 UTC (permalink / raw)
To: sparclinux
On Wed, 18 Feb 2004, Chris Ricker wrote:
After I upgraded to 2.6.3-smp, I got another crash on a stuck spinlock:
spin_lock(00000000006481c8) CPU#1 stuck at 0046ae40, owner PC(0046ae40):CPU(1)
spin_lock(fffff8003e291370) CPU#0 stuck at 005c5390, owner PC(005848a8):CPU(1)
spin_lock(00000000006481c8) CPU#1 stuck at 0046ae40, owner PC(0046ae40):CPU(1)
spin_lock(fffff8003e291370) CPU#0 stuck at 005c5390, owner PC(005848a8):CPU(1)
spin_lock(00000000006481c8) CPU#1 stuck at 0046ae40, owner PC(0046ae40):CPU(1)
spin_lock(fffff8003e291370) CPU#0 stuck at 005c5390, owner PC(005848a8):CPU(1)
0046ae40 is near:
000000000046ad00 t m_next
000000000046ad40 t m_stop
000000000046ad60 t m_show
000000000046ae20 T search_module_extables
000000000046aec0 T module_text_address
000000000046af40 T struct_module
000000000046af60 T kallsyms_lookup
005c5390 is near:
00000000005c4a60 t tcp_v4_checksum_init
00000000005c4be0 T tcp_v4_do_rcv
00000000005c4d40 T tcp_v4_rcv
00000000005c5820 t __tcp_v4_rehash
00000000005c5860 t tcp_v4_reselect_saddr
00000000005c5ac0 T tcp_v4_rebuild_header
00000000005c5d00 t v4_addr2sockaddr
The 006481c8 spinlock is modlist_lock:
00000000006481a0 D cpucontrol
00000000006481c8 d modlist_lock
00000000006481d8 d module_mutex
0000000000648200 d modules
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: another Ultra2 SMP crash
2004-02-18 21:29 another Ultra2 SMP crash Chris Ricker
` (2 preceding siblings ...)
2004-02-20 15:49 ` Chris Ricker
@ 2004-02-20 15:51 ` Chris Ricker
3 siblings, 0 replies; 5+ messages in thread
From: Chris Ricker @ 2004-02-20 15:51 UTC (permalink / raw)
To: sparclinux
On Fri, 20 Feb 2004, Chris Ricker wrote:
> On Wed, 18 Feb 2004, Chris Ricker wrote:
>
> After I upgraded to 2.6.3-smp, I got another crash on a stuck spinlock:
>
> spin_lock(00000000006481c8) CPU#1 stuck at 0046ae40, owner PC(0046ae40):CPU(1)
> spin_lock(fffff8003e291370) CPU#0 stuck at 005c5390, owner PC(005848a8):CPU(1)
> spin_lock(00000000006481c8) CPU#1 stuck at 0046ae40, owner PC(0046ae40):CPU(1)
> spin_lock(fffff8003e291370) CPU#0 stuck at 005c5390, owner PC(005848a8):CPU(1)
> spin_lock(00000000006481c8) CPU#1 stuck at 0046ae40, owner PC(0046ae40):CPU(1)
> spin_lock(fffff8003e291370) CPU#0 stuck at 005c5390, owner PC(005848a8):CPU(1)
>
> 0046ae40 is near:
>
> 000000000046ad00 t m_next
> 000000000046ad40 t m_stop
> 000000000046ad60 t m_show
> 000000000046ae20 T search_module_extables
> 000000000046aec0 T module_text_address
> 000000000046af40 T struct_module
> 000000000046af60 T kallsyms_lookup
>
> 005c5390 is near:
>
> 00000000005c4a60 t tcp_v4_checksum_init
> 00000000005c4be0 T tcp_v4_do_rcv
> 00000000005c4d40 T tcp_v4_rcv
> 00000000005c5820 t __tcp_v4_rehash
> 00000000005c5860 t tcp_v4_reselect_saddr
> 00000000005c5ac0 T tcp_v4_rebuild_header
> 00000000005c5d00 t v4_addr2sockaddr
>
> The 006481c8 spinlock is modlist_lock:
>
> 00000000006481a0 D cpucontrol
> 00000000006481c8 d modlist_lock
> 00000000006481d8 d module_mutex
> 0000000000648200 d modules
Oops, hit send too soon.
I was doing an scp off of the system at the time it crashed, so the network
would've been active. I can't think of anything happening on it which
would've been modules-related, though.
thanks,
chris
^ permalink raw reply [flat|nested] 5+ messages in thread