public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* 2.5.69-mm8 improvements and one oops
@ 2003-05-22 18:14 jjs
  2003-05-22 18:35 ` Mike Fedyk
  2003-05-22 20:07 ` Andrew Morton
  0 siblings, 2 replies; 6+ messages in thread
From: jjs @ 2003-05-22 18:14 UTC (permalink / raw)
  To: linux kernel; +Cc: Andrew Morton

Hello -

This may be of interest...

kernel: 2.5.69-mm8

Linux distro: Red Hat 9 + updates

Hardware:
Celeron 1.2 Ghz on Intel Motherboard
51e MB RAM, 2x e100 ethernet

FWIW, The last few 2.5.69-mm releases would not boot for me,
but would oops while trying to load usb modules, then hang
immediately after the line announcing the activation of swap.

Happily, -mm8 compiled without module errors, and booted fine.

All seems healthy, and the services are all fully functional,
but I did spot this oops in syslog, after logging in and starting
a gnome session:

I wil gladly supply .config and all other info if desired

Best regards,

Joe

------------[ cut here ]------------
kernel BUG at kernel/sched.c:614!
invalid operand: 0000 [#1]
CPU:    0
EIP:    0060:[<c011d762>]    Not tainted VLI
EFLAGS: 00010246
EIP is at schedule_tail+0xe2/0x140
eax: 00000008   ebx: 00000000   ecx: d65d4e40   edx: d563c000
esi: d8fa0120   edi: d953a31c   ebp: d563dfb8   esp: d563df9c
ds: 007b   es: 007b   ss: 0068
Process bonobo-activati (pid: 1275, threadinfo=d563c000 task=d65d54c0)
Stack: 43297d43 7d43297d 297d4329 43297d43 7d43297d d8fa0120 d65d54c0 
d564ffbc
       c010a362 d8fa0120 01200011 00000000 00000000 00000000 40018da8 
bfffe148
       00000000 0000007b 0000007b 00000078 ffffe410 00000073 00000202 
bfffe0ec
Call Trace:
 [<c010a362>] ret_from_fork+0x6/0x14

Code: 85 d2 74 16 89 d6 83 c6 04 19 c9 39 70 18 83 d9 00 85 c9 75 05 8b 
43 7c 89 02 83 c4 14 5b 5e 5d c3 89 34 24 e8 a0 35 00 00 eb c6 <0f> 0b 
66 02 ad 7d 3e c0 eb b2 8d 74 26 00 89 d8 e8 f9 66 00 00



^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.5.69-mm8 improvements and one oops
  2003-05-22 18:14 2.5.69-mm8 improvements and one oops jjs
@ 2003-05-22 18:35 ` Mike Fedyk
  2003-05-22 19:05   ` jjs
  2003-05-22 20:07 ` Andrew Morton
  1 sibling, 1 reply; 6+ messages in thread
From: Mike Fedyk @ 2003-05-22 18:35 UTC (permalink / raw)
  To: jjs; +Cc: linux kernel, Andrew Morton

On Thu, May 22, 2003 at 11:14:57AM -0700, jjs wrote:
> ------------[ cut here ]------------
> kernel BUG at kernel/sched.c:614!
> invalid operand: 0000 [#1]
> CPU:    0
> EIP:    0060:[<c011d762>]    Not tainted VLI
> EFLAGS: 00010246
> EIP is at schedule_tail+0xe2/0x140
> eax: 00000008   ebx: 00000000   ecx: d65d4e40   edx: d563c000
> esi: d8fa0120   edi: d953a31c   ebp: d563dfb8   esp: d563df9c
> ds: 007b   es: 007b   ss: 0068
> Process bonobo-activati (pid: 1275, threadinfo=d563c000 task=d65d54c0)
> Stack: 43297d43 7d43297d 297d4329 43297d43 7d43297d d8fa0120 d65d54c0 
> d564ffbc
>       c010a362 d8fa0120 01200011 00000000 00000000 00000000 40018da8 
> bfffe148
>       00000000 0000007b 0000007b 00000078 ffffe410 00000073 00000202 
> bfffe0ec
> Call Trace:
> [<c010a362>] ret_from_fork+0x6/0x14
> 
> Code: 85 d2 74 16 89 d6 83 c6 04 19 c9 39 70 18 83 d9 00 85 c9 75 05 8b 
> 43 7c 89 02 83 c4 14 5b 5e 5d c3 89 34 24 e8 a0 35 00 00 eb c6 <0f> 0b 
> 66 02 ad 7d 3e c0 eb b2 8d 74 26 00 89 d8 e8 f9 66 00 00

You didn't run this through ksymoops.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.5.69-mm8 improvements and one oops
  2003-05-22 18:35 ` Mike Fedyk
@ 2003-05-22 19:05   ` jjs
  0 siblings, 0 replies; 6+ messages in thread
From: jjs @ 2003-05-22 19:05 UTC (permalink / raw)
  To: linux kernel; +Cc: Andrew Morton

>
>
>You didn't run this through ksymoops.
>
Sorry, how lame of me - no brain left after finals

Here is the decoded oops:

ksymoops 2.4.5 on i686 2.5.69-mm8.  Options used
     -V (default)
     -k /proc/ksyms (default)
     -l /proc/modules (default)
     -o /lib/modules/2.5.69-mm8/ (default)
     -m /boot/System.map-2.5.69-mm8 (default)

Warning: You did not tell me where to find symbol information.  I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc.  ksymoops -h explains the options.

Error (regular_file): read_ksyms stat /proc/ksyms failed
No modules in ksyms, skipping objects
No ksyms, skipping lsmod


kernel BUG at kernel/sched.c:614!
invalid operand: 0000 [#1]
CPU:    0
EIP:    0060:[<c011d762>]    Not tainted VLI
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010246
eax: 00000008   ebx: 00000000   ecx: d65d4e40   edx: d563c000
esi: d8fa0120   edi: d953a31c   ebp: d563dfb8   esp: d563df9c
ds: 007b   es: 007b   ss: 0068
Stack: 43297d43 7d43297d 297d4329 43297d43 7d43297d d8fa0120 d65d54c0 
d564ffbc
       c010a362 d8fa0120 01200011 00000000 00000000 00000000 40018da8 
bfffe148
       00000000 0000007b 0000007b 00000078 ffffe410 00000073 00000202 
bfffe0ec
 [<c010a362>] ret_from_fork+0x6/0x14
Code: 85 d2 74 16 89 d6 83 c6 04 19 c9 39 70 18 83 d9 00 85 c9 75 05 8b 
43 7c 89
 02 83 c4 14 5b 5e 5d c3 89 34 24 e8 a0 35 00 00 eb c6 <0f> 0b 66 02 ad 
7d 3e c0
 eb b2 8d 74 26 00 89 d8 e8 f9 66 00 00

 >>EIP; c011d762 <schedule_tail+e2/140>   <=====

 >>ecx; d65d4e40 <__crc_unregister_filesystem+187c64/32bd9c>
 >>edx; d563c000 <__crc_proc_dointvec_minmax+15d80d/308f7b>
 >>esi; d8fa0120 <__crc_raise_softirq+150704/241947>
 >>edi; d953a31c <__crc_acpi_os_read_pci_configuration+78bac/110b35>
 >>ebp; d563dfb8 <__crc_proc_dointvec_minmax+15f7c5/308f7b>
 >>esp; d563df9c <__crc_proc_dointvec_minmax+15f7a9/308f7b>

Code;  c011d737 <schedule_tail+b7/140>
00000000 <_EIP>:
Code;  c011d737 <schedule_tail+b7/140>
   0:   85 d2                     test   %edx,%edx
Code;  c011d739 <schedule_tail+b9/140>
   2:   74 16                     je     1a <_EIP+0x1a>
Code;  c011d73b <schedule_tail+bb/140>
   4:   89 d6                     mov    %edx,%esi
Code;  c011d73d <schedule_tail+bd/140>
   6:   83 c6 04                  add    $0x4,%esi
Code;  c011d740 <schedule_tail+c0/140>
   9:   19 c9                     sbb    %ecx,%ecx
Code;  c011d742 <schedule_tail+c2/140>
   b:   39 70 18                  cmp    %esi,0x18(%eax)
Code;  c011d745 <schedule_tail+c5/140>
   e:   83 d9 00                  sbb    $0x0,%ecx
Code;  c011d748 <schedule_tail+c8/140>
  11:   85 c9                     test   %ecx,%ecx
Code;  c011d74a <schedule_tail+ca/140>
  13:   75 05                     jne    1a <_EIP+0x1a>
Code;  c011d74c <schedule_tail+cc/140>
  15:   8b 43 7c                  mov    0x7c(%ebx),%eax
Code;  c011d74f <schedule_tail+cf/140>
  18:   89 02                     mov    %eax,(%edx)
Code;  c011d751 <schedule_tail+d1/140>
  1a:   83 c4 14                  add    $0x14,%esp
Code;  c011d754 <schedule_tail+d4/140>
  1d:   5b                        pop    %ebx
Code;  c011d755 <schedule_tail+d5/140>
  1e:   5e                        pop    %esi
Code;  c011d756 <schedule_tail+d6/140>
  1f:   5d                        pop    %ebp
Code;  c011d757 <schedule_tail+d7/140>
  20:   c3                        ret   
Code;  c011d758 <schedule_tail+d8/140>
  21:   89 34 24                  mov    %esi,(%esp,1)
Code;  c011d75b <schedule_tail+db/140>
  24:   e8 a0 35 00 00            call   35c9 <_EIP+0x35c9>
Code;  c011d760 <schedule_tail+e0/140>
  29:   eb c6                     jmp    fffffff1 <_EIP+0xfffffff1>
Code;  c011d762 <schedule_tail+e2/140>   <=====
  2b:   0f 0b                     ud2a      <=====
Code;  c011d764 <schedule_tail+e4/140>
  2d:   66                        data16
Code;  c011d765 <schedule_tail+e5/140>
  2e:   02 ad 7d 3e c0 eb         add    0xebc03e7d(%ebp),%ch
Code;  c011d76b <schedule_tail+eb/140>
  34:   b2 8d                     mov    $0x8d,%dl
Code;  c011d76d <schedule_tail+ed/140>
  36:   74 26                     je     5e <_EIP+0x5e>
Code;  c011d76f <schedule_tail+ef/140>
  38:   00 89 d8 e8 f9 66         add    %cl,0x66f9e8d8(%ecx)


1 warning and 1 error issued.  Results may not be reliable.




^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.5.69-mm8 improvements and one oops
  2003-05-22 18:14 2.5.69-mm8 improvements and one oops jjs
  2003-05-22 18:35 ` Mike Fedyk
@ 2003-05-22 20:07 ` Andrew Morton
  2003-05-22 20:41   ` jjs
  1 sibling, 1 reply; 6+ messages in thread
From: Andrew Morton @ 2003-05-22 20:07 UTC (permalink / raw)
  To: jjs; +Cc: linux-kernel

jjs <jjs@tmsusa.com> wrote:
>
> kernel BUG at kernel/sched.c:614!
> invalid operand: 0000 [#1]
> CPU:    0
> EIP:    0060:[<c011d762>]    Not tainted VLI
> EFLAGS: 00010246
> EIP is at schedule_tail+0xe2/0x140
> eax: 00000008   ebx: 00000000   ecx: d65d4e40   edx: d563c000
> esi: d8fa0120   edi: d953a31c   ebp: d563dfb8   esp: d563df9c
> ds: 007b   es: 007b   ss: 0068
> Process bonobo-activati (pid: 1275, threadinfo=d563c000 task=d65d54c0)
> Stack: 43297d43 7d43297d 297d4329 43297d43 7d43297d d8fa0120 d65d54c0 
> d564ffbc
>        c010a362 d8fa0120 01200011 00000000 00000000 00000000 40018da8 
> bfffe148
>        00000000 0000007b 0000007b 00000078 ffffe410 00000073 00000202 
> bfffe0ec
> Call Trace:
>  [<c010a362>] ret_from_fork+0x6/0x14
> 

You hit the free-of-a-freed-task_struct bug.

sched.c:614 is

static inline void finish_task_switch(task_t *prev)
{
	runqueue_t *rq = this_rq();
	struct mm_struct *mm = rq->prev_mm;

	rq->prev_mm = NULL;
	finish_arch_switch(rq, prev);
	if (mm)
		mmdrop(mm);
	if (prev->state & (TASK_DEAD | TASK_ZOMBIE))
		put_task_struct(prev);			<== here
}

and my put_task_struct is:

#define put_task_struct(tsk)					\
	do {							\
		BUG_ON((tsk)->debug == 0x6b6b6b6b);		\
		if (atomic_dec_and_test(&(tsk)->usage))		\
			__put_task_struct(tsk);			\
	} while (0)

This bug has been hanging around for ages.  It is very rare and nobody
knows what causes it.

Are you running preempt?  SMP?   Is it repeatable?


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.5.69-mm8 improvements and one oops
  2003-05-22 20:07 ` Andrew Morton
@ 2003-05-22 20:41   ` jjs
  2003-05-22 20:50     ` William Lee Irwin III
  0 siblings, 1 reply; 6+ messages in thread
From: jjs @ 2003-05-22 20:41 UTC (permalink / raw)
  To: Andrew Morton; +Cc: linux kernel

Andrew Morton wrote

>You hit the free-of-a-freed-task_struct bug.
>  
>
>This bug has been hanging around for ages.  It is very rare and nobody
>knows what causes it.
>
I'm lucky I suppose -

>
>Are you running preempt?  SMP?   Is it repeatable?
>
It's preempt, defintely, always -

But just a UP kernel on a lowly UP box -

As for repeatability, I'll see if I can induce
the oops again but there's no telling...

Joe



^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: 2.5.69-mm8 improvements and one oops
  2003-05-22 20:41   ` jjs
@ 2003-05-22 20:50     ` William Lee Irwin III
  0 siblings, 0 replies; 6+ messages in thread
From: William Lee Irwin III @ 2003-05-22 20:50 UTC (permalink / raw)
  To: jjs; +Cc: Andrew Morton, linux kernel

Andrew Morton wrote
>> You hit the free-of-a-freed-task_struct bug.
>> This bug has been hanging around for ages.  It is very rare and nobody
>> knows what causes it.

On Thu, May 22, 2003 at 01:41:06PM -0700, jjs wrote:
> I'm lucky I suppose -

The cause for this is less than obvious.


Andrew Morton wrote
>> Are you running preempt?  SMP?   Is it repeatable?

On Thu, May 22, 2003 at 01:41:06PM -0700, jjs wrote:
> It's preempt, defintely, always -
> But just a UP kernel on a lowly UP box -
> As for repeatability, I'll see if I can induce
> the oops again but there's no telling...

Preempt is a common theme in the reports I've seen on this. It appears
SMP allows the offender to clean up after the bug, but relatively long
times between racy things rescheduling on preempt triggers the issue
more readily.


-- wli

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2003-05-22 20:37 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-05-22 18:14 2.5.69-mm8 improvements and one oops jjs
2003-05-22 18:35 ` Mike Fedyk
2003-05-22 19:05   ` jjs
2003-05-22 20:07 ` Andrew Morton
2003-05-22 20:41   ` jjs
2003-05-22 20:50     ` William Lee Irwin III

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox