public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* Few questions about 2.5.6-pre3
@ 2002-03-11  8:01 Zwane Mwaikambo
  2002-03-12  6:46 ` Zwane Mwaikambo
  0 siblings, 1 reply; 5+ messages in thread
From: Zwane Mwaikambo @ 2002-03-11  8:01 UTC (permalink / raw)
  To: Linux Kernel

Has anyone come across the following bootime bugs?

2.5.6-pre3 UP:

  init_idle(current, smp_processor_id());
        /*
         *      We count on the initial thread going ok
         *      Like idlers init is an unlocked kernel thread, which will
         *      make syscalls (and thus be locked).
         */
        smp_init();
			<=== [1]
        /* Do the rest non-__init'ed, we're now alive */
        rest_init();

[1] If i don't put a busy loop or a printk there the machine pukes in 
do_page_fault when we try and down_read(&mm->mmap_sem); The dump shows 
that current is an invalid pointer and has crazy PID and other fields.

2.5.6-pre3 SMP:

This one is a funny one, it dies *right* after mtrr_init, and even if i 
put a BUG() after mtrr_init, it never gets executed. This one happens to 
die in the scheduler when we try and release_kernel_lock(prev, 
smp_processor_id()) It already is released so we trigger an Oops in 
spinlock.h:107

Thanks,
	Zwane Mwaikambo



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Few questions about 2.5.6-pre3
  2002-03-11  8:01 Few questions about 2.5.6-pre3 Zwane Mwaikambo
@ 2002-03-12  6:46 ` Zwane Mwaikambo
  2002-03-12  7:22   ` Robert Love
  0 siblings, 1 reply; 5+ messages in thread
From: Zwane Mwaikambo @ 2002-03-12  6:46 UTC (permalink / raw)
  To: Linux Kernel; +Cc: rml

On Mon, 11 Mar 2002, Zwane Mwaikambo wrote:

> 2.5.6-pre3 SMP:
> 
> This one is a funny one, it dies *right* after mtrr_init, and even if i 
> put a BUG() after mtrr_init, it never gets executed. This one happens to 
> die in the scheduler when we try and release_kernel_lock(prev, 
> smp_processor_id()) It already is released so we trigger an Oops in 
> spinlock.h:107

Not sure wether this is a known issue but disabling CONFIG_PREEMPT booted 
this box. Robert, here is the oops, if you want me to test a couple 
things just send them my way, but its a high latency test box ;) so you 
might have to wait a while between replies.

Cheers,
	Zwane

Right after mtrr_init in main.c, note that a BUG() after mtrr_init will 
not get triggered. Sorry the oops doesn't reveal much though, perhaps you 
need some specific information?

kernel BUG at /home/zwane/build/source/linux-2.5.6-pre/include/asm/spinlock.h:107!
invalid operand: 0000
CPU:    0
EIP:    0010:[<c011b09b>]    Not tainted
EFLAGS: 00010002
eax: 00000001   ebx: c0348000   ecx: 00000000   edx: c032c001
esi: c038b8a0   edi: 00000000   ebp: c0349f64   esp: c0349f48
ds: 0018   es: 0018   ss: 0018
Process swapper (pid: 0, threadinfo=c0348000 task=c032c060)
Stack: c0213532 c0213558 c03fe020 c032c060 c0348000 c038b8a0 00000000 c0349f88
       c011b0b9 c03c58c0 00000246 c0105000 c032c060 c03c591c 00000246 0000005c
       0008e000 c01206f0 c02e7c5b c0348000 c0105000 c0350783 c02e3800 c02e864c
Call Trace: [<c0213532>] [<c0213558>] [<c011b0b9>] [<c0105000>] [<c01206f0>]
   [<c0105000>] [<c0105000>]

Code: 0f 0b 6b 00 e0 d2 2d c0 86 15 00 a4 38 c0 ff 4b 10 8b 43 08
 <0>Kernel panic: Attempted to kill the idle task!
In idle task - not syncing

>>EIP; c011b09b <schedule+8b/470>   <=====
Trace; c0213532 <serial_console_write+142/1e0>
Trace; c0213558 <serial_console_write+168/1e0>
Trace; c011b0b9 <schedule+a9/470>
Trace; c0105000 <_stext+0/0>
Trace; c01206f0 <printk+1a0/210>
Trace; c0105000 <_stext+0/0>
Trace; c0105000 <_stext+0/0>
Code;  c011b09b <schedule+8b/470>
00000000 <_EIP>:
Code;  c011b09b <schedule+8b/470>   <=====
   0:   0f 0b                     ud2a      <=====
Code;  c011b09d <schedule+8d/470>
   2:   6b 00 e0                  imul   $0xffffffe0,(%eax),%eax
Code;  c011b0a0 <schedule+90/470>
   5:   d2 2d c0 86 15 00         shrb   %cl,0x1586c0
Code;  c011b0a6 <schedule+96/470>
   b:   a4                        movsb  %ds:(%esi),%es:(%edi)
Code;  c011b0a7 <schedule+97/470>
   c:   38 c0                     cmp    %al,%al
Code;  c011b0a9 <schedule+99/470>
   e:   ff 4b 10                  decl   0x10(%ebx)
Code;  c011b0ac <schedule+9c/470>
  11:   8b 43 08                  mov    0x8(%ebx),%eax



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Few questions about 2.5.6-pre3
  2002-03-12  6:46 ` Zwane Mwaikambo
@ 2002-03-12  7:22   ` Robert Love
  2002-03-12 14:11     ` Zwane Mwaikambo
  0 siblings, 1 reply; 5+ messages in thread
From: Robert Love @ 2002-03-12  7:22 UTC (permalink / raw)
  To: Zwane Mwaikambo; +Cc: Linux Kernel

On Tue, 2002-03-12 at 01:46, Zwane Mwaikambo wrote:

> Not sure wether this is a known issue but disabling CONFIG_PREEMPT booted 
> this box. Robert, here is the oops, if you want me to test a couple 
> things just send them my way, but its a high latency test box ;) so you 
> might have to wait a while between replies.

Thanks, Zwane.

I've never seen this.  I assume the box is SMP since you are hitting a
BUG in the spin_unlock code?  I almost want to think this is an SMP bug
(locking rules not being observed somewhere) and preemption is just
accelerating the race.

Ohh wait - this is 2.5.6-pre3 ?

Can you try 2.5.6 final (or anything later)?  There is a bug with SMP
and preempt and this could be it.  Let me know..

	Robert Love


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Few questions about 2.5.6-pre3
  2002-03-12  7:22   ` Robert Love
@ 2002-03-12 14:11     ` Zwane Mwaikambo
  2002-03-13  6:14       ` Zwane Mwaikambo
  0 siblings, 1 reply; 5+ messages in thread
From: Zwane Mwaikambo @ 2002-03-12 14:11 UTC (permalink / raw)
  To: Robert Love; +Cc: Linux Kernel

On 12 Mar 2002, Robert Love wrote:

> I've never seen this.  I assume the box is SMP since you are hitting a
> BUG in the spin_unlock code?  I almost want to think this is an SMP bug
> (locking rules not being observed somewhere) and preemption is just
> accelerating the race.
> 
> Ohh wait - this is 2.5.6-pre3 ?
> 
> Can you try 2.5.6 final (or anything later)?  There is a bug with SMP
> and preempt and this could be it.  Let me know..

Its SMP kernel on UP box, i'll be testing 2.5.6 this evening so i'll give 
you a heads up tommorrow.

Cheers,
	Zwane



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Few questions about 2.5.6-pre3
  2002-03-12 14:11     ` Zwane Mwaikambo
@ 2002-03-13  6:14       ` Zwane Mwaikambo
  0 siblings, 0 replies; 5+ messages in thread
From: Zwane Mwaikambo @ 2002-03-13  6:14 UTC (permalink / raw)
  To: Robert Love; +Cc: Linux Kernel

On Tue, 12 Mar 2002, Zwane Mwaikambo wrote:

> On 12 Mar 2002, Robert Love wrote:
> 
> > I've never seen this.  I assume the box is SMP since you are hitting a
> > BUG in the spin_unlock code?  I almost want to think this is an SMP bug
> > (locking rules not being observed somewhere) and preemption is just
> > accelerating the race.
> > 
> > Ohh wait - this is 2.5.6-pre3 ?
> > 
> > Can you try 2.5.6 final (or anything later)?  There is a bug with SMP
> > and preempt and this could be it.  Let me know..
> 
> Its SMP kernel on UP box, i'll be testing 2.5.6 this evening so i'll give 
> you a heads up tommorrow.

Ok couldn't reproduce it with -final.

Cheers,
	Zwane



^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2002-03-13  6:31 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2002-03-11  8:01 Few questions about 2.5.6-pre3 Zwane Mwaikambo
2002-03-12  6:46 ` Zwane Mwaikambo
2002-03-12  7:22   ` Robert Love
2002-03-12 14:11     ` Zwane Mwaikambo
2002-03-13  6:14       ` Zwane Mwaikambo

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox