All of lore.kernel.org
 help / color / mirror / Atom feed
From: Reuben Farrelly <reuben-lkml@reub.net>
To: Andrew Morton <akpm@osdl.org>
Cc: Linux Kernel <linux-kernel@vger.kernel.org>, kaber@trash.net
Subject: Re: 2.6.16-rc1-mm5
Date: Wed, 08 Feb 2006 19:20:08 +1300	[thread overview]
Message-ID: <43E98D98.2090805@reub.net> (raw)
In-Reply-To: <20060203000704.3964a39f.akpm__33726.4416892596$1138954098$gmane$org@osdl.org>

On 3/02/2006 9:07 p.m., Andrew Morton wrote:
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.16-rc1/2.6.16-rc1-mm5/
> 
> 
> - The ntp/time rework patches from John Stultz have been resurrected and fixed.
> 
> - There's now a `hot-fixes' directory at the above URL.  Please look in
>   there for any updates which should be applied.
> 
> 
> Known problems:

[Andrew just released another -mm while I was typing this one up]

I've been hunting down some hard-to-catch reboots in the last two -mm releases 
that leave no traces in the log other than an abrupt halt of all logging, at 
first I thought I was chasing a Netfilter problem but after a few days of full 
Netfilter debugging on, I came to the conclusion that the problem wasn't 
necessarily Netfilter specific as I was still seeing reboots with no Netfilter 
errors logged immediately before the crash.  With most of the debugging options 
turned on in "Kernel Hacking" and the machine running like a dog for a few days, 
it seemed stable (unfortunately).

Anyway, I might have finally hit the jackpot this time, as it only ever seemed 
to crash when I was at work.  But I'm here, and it just did it...

Fedora Core release 4 (Rawhide)
Kernel 2.6.16-rc1-mm5 on an i686

tornado.reub.net login: BUG: unable to handle kernel paging request at virtual 
address b0446000
  printing eip:
b013fe7a
*pde = 018e9163
*pte = 00446000
Oops: 0002 [#1]
SMP
last sysfs file: /devices/pci0000:00/0000:00:1f.3/i2c-0/0-002e/vrm
Modules linked in: nfsd exportfs lockd sunrpc lm85 hwmon_vid eeprom ipv6 ip_gre 
iptable_filter iptable_nat ip_nat ip_conntrack nfnetlink iptable_mangle 
ip_tables binfmt_misc ide_cd cdrom serio_raw piix hw_random i2c_i801
CPU:    1
EIP:    0060:[<b013fe7a>]    Not tainted VLI
EFLAGS: 00010282   (2.6.16-rc1-mm5 #1)
EIP is at get_page_from_freelist+0x325/0x386
eax: 00000000   ebx: b10088c0   ecx: 00000400   edx: b03c2780
esi: 00000000   edi: b0446000   ebp: b2bddeb8   esp: b2bdde5c
ds: 007b   es: 007b   ss: 0068
Process check_ifopersta (pid: 21337, threadinfo=b2bdd000 task=b4974ab0)
Stack: <0>00000002 00000044 b0446000 b03c2900 00000001 00000001 00000000 b03c2780
        00000000 00000002 00000000 000280d2 b03c35ac b10088c0 00000000 b2bddfa0
        00000282 00000001 00000000 00000001 b03c35a8 b4974ab0 000280d2 b2bddef8
Call Trace:
  [<b0103bf5>] show_stack_log_lvl+0xc5/0xea
  [<b0103db8>] show_registers+0x19e/0x22b
  [<b0103f61>] die+0x11c/0x22c
  [<b01137c4>] do_page_fault+0x27a/0x5de
  [<b0103737>] error_code+0x4f/0x54
  [<b013ff30>] __alloc_pages+0x55/0x2bf
  [<b0147a53>] __handle_mm_fault+0x6e5/0x7fb
  [<b0113917>] do_page_fault+0x3cd/0x5de
  [<b0103737>] error_code+0x4f/0x54
Code: db 0f 8e 2e ff ff ff 8b 5d d8 c7 45 dc 00 00 00 00 31 f6 ba 03 00 00 00 89 
d8 e8 4f 46 fd ff 89 45 <1>BUG: unable to handle kernel paging request at 
virtual address b0444000
  printing eip:
b013fe7a
*pde = 018e9163
*pte = 00444000
ac b9 00 04 00 00 89 c7 89 f0 <f3> ab ba 03 00 00 00 8b 45 ac e8 b2 46 fd ff 83 
45 dc 01 83 c3
  <0>Oops: 0002 [#2]
note: check_ifopersta[21337] exited with preempt_count 1
SMP
last sysfs file: /devices/pci0000:00/0000:00:1f.3/i2c-0/0-002e/vrm
Modules linked in: nfsd exportfs lockd sunrpc lm85 hwmon_vid eeprom ipv6 ip_gre 
iptable_filter iptable_nat ip_nat ip_conntrack nfnetlink iptable_mangle 
ip_tables binfmt_misc ide_cd cdrom serio_raw piix hw_random i2c_i801
CPU:    0
EIP:    0060:[<b013fe7a>]    Not tainted VLI
EFLAGS: 00010282   (2.6.16-rc1-mm5 #1)
EIP is at get_page_from_freelist+0x325/0x386
eax: 00000000   ebx: b1008880   ecx: 00000400   edx: b03c2780
esi: 00000000   edi: b0444000   ebp: e2857eb8   esp: e2857e5c
ds: 007b   es: 007b   ss: 0068
Process python (pid: 2532, threadinfo=e2857000 task=de395570)
Stack: <0>00000002 00000044 b0444000 b03c2900 00000001 00000001 00000000 b03c2780
        00000000 00000002 00000000 000280d2 b03c35ac b1008880 00000000 b016629f
        00000286 00000001 00000000 00000001 b03c35a8 de395570 000280d2 e2857ef8
Call Trace:
  [<b0103bf5>] show_stack_log_lvl+0xc5/0xea
  [<b0103db8>] show_registers+0x19e/0x22b
  [<b0103f61>] die+0x11c/0x22c
  [<b01137c4>] do_page_fault+0x27a/0x5de
  [<b0103737>] error_code+0x4f/0x54
  [<b013ff30>] __alloc_pages+0x55/0x2bf
  [<b0147a53>] __handle_mm_fault+0x6e5/0x7fb
  [<b0113917>] do_page_fault+0x3cd/0x5de
  [<b0103737>] error_code+0x4f/0x54
Code: db 0f 8e 2e ff ff ff 8b 5d d8 c7 45 dc 00 00 00 00 31 f6 ba 03 00 00 00 89 
d8 e8 4f 46 fd ff 89 45 ac b9 00 04 00 00 89 c7 89 f0 <f3> ab ba 03 00 00 00 8b 
45 ac e8 b2 46 fd ff 83 45 dc 01 83 c3
  <6>note: python[2532] exited with preempt_count 1
BUG: sleeping function called from invalid context at include/linux/rwsem.h:43
in_atomic():1, irqs_disabled():0
  [<b010412b>] show_trace+0xd/0xf
  [<b0104251>] dump_stack+0x17/0x19
  [<b0114eb1>] __might_sleep+0xa0/0xa7
  [<b011d386>] exit_mm+0x32/0x164
  [<b011e9ba>] do_exit+0x108/0x845
  [<b0104071>] do_trap+0x0/0x9e
  [<b01137c4>] do_page_fault+0x27a/0x5de
  [<b0103737>] error_code+0x4f/0x54
  [<b013ff30>] __alloc_pages+0x55/0x2bf
  [<b0147a53>] __handle_mm_fault+0x6e5/0x7fb
  [<b0113917>] do_page_fault+0x3cd/0x5de
  [<b0103737>] error_code+0x4f/0x54
BUG: unable to handle kernel paging request at virtual address b0445000
BUG: unable to handle kernel paging request at virtual address b0412004
  printing eip:
b016986e
*pde = 018e9163
*pte = 00412000
Oops: 0002 [#3]
SMP
last sysfs file: /devices/pci0000:00/0000:00:1f.3/i2c-0/0-002e/vrm
Modules linked in: nfsd exportfs lockd sunrpc lm85 hwmon_vid eeprom ipv6 ip_gre 
iptable_filter iptable_nat ip_nat ip_conntrack nfnetlink iptable_mangle 
ip_tables binfmt_misc ide_cd cdrom serio_raw piix hw_random i2c_i801
CPU:    0
EIP:    0060:[<b016986e>]    Not tainted VLI
EFLAGS: 00010286   (2.6.16-rc1-mm5 #1)
EIP is at __pollwait+0x9f/0xd2
eax: b0412008   ebx: 00000014   ecx: 00000000   edx: b0412000
esi: 00000000   edi: e9b5ec50   ebp: e9b5ebf8   esp: e9b5ebe4
ds: 007b   es: 007b   ss: 0068
Process dovecot (pid: 2235, threadinfo=e9b5e000 task=efc3e570)
Stack: <0>c290b000 d25b82c0 c290b000 e9b5ec50 d25b82c0 e9b5ec14 b0162a80 cf8b4254
        00000000 00000020 d25b82c0 00000014 e9b5efa0 b0168a80 e9b5ec50 e9b5efa8
        08073ed0 000000d0 08073fa0 e9b5ee90 e9b5ee90 0000001a e9b5ec50 00000000
Call Trace:
  [<b0103bf5>] show_stack_log_lvl+0xc5/0xea
  [<b0103db8>] show_registers+0x19e/0x22b
  [<b0103f61>] die+0x11c/0x22c
  [<b01137c4>] do_page_fault+0x27a/0x5de
  [<b0103737>] error_code+0x4f/0x54
  [<b0162a80>] pipe_poll+0x2c/0xae
  [<b0168a80>] do_sys_poll+0x25f/0x47d
  [<b0168cda>] sys_poll+0x3c/0x4e
  [<b0102b3f>] sysenter_past_esp+0x54/0x75
Code: 5d c3 85 f6 74 10 8b 56 04 83 c2 1c 8d 86 00 10 00 00 39 c2 76 1f 31 d2 b8 
d0 00 00 00 e8 35 69 fd ff 89 c2 85 c0 74 18 8d 40 08 <89> 42 04 89 32 89 57 04 
89 d6 8b 4e 04 8d 41 1c 89 46 04 eb 82
  <1> printing eip:
b013fe7a
*pde = 018e9163
*pte = 00445000
Oops: 0002 [#4]
SMP
last sysfs file: /devices/pci0000:00/0000:00:1f.3/i2c-0/0-002e/vrm
Modules linked in: nfsd exportfs lockd sunrpc lm85 hwmon_vid eeprom ipv6 ip_gre 
iptable_filter iptable_nat ip_nat ip_conntrack nfnetlink iptable_mangle 
ip_tables binfmt_misc ide_cd cdrom serio_raw piix hw_random i2c_i801
CPU:    1
EIP:    0060:[<b013fe7a>]    Not tainted VLI
EFLAGS: 00010282   (2.6.16-rc1-mm5 #1)
EIP is at get_page_from_freelist+0x325/0x386
eax: 00000000   ebx: b10088a0   ecx: 00000400   edx: b03c2780
esi: 00000000   edi: b0445000   ebp: b3253cb0   esp: b3253c54
ds: 007b   es: 007b   ss: 0068
Process gdb (pid: 21399, threadinfo=b3253000 task=bedc8570)
Stack: <0>00000002 00000044 b0445000 b03c2900 00000001 00000001 00000000 b03c2780
        00000000 00000002 00000000 000280d2 b03c35ac b10088a0 00000000 00000000
        00000282 00000001 00000000 00000001 b03c35a8 bedc8570 000280d2 b3253cf0
Call Trace:
  [<b0103bf5>] show_stack_log_lvl+0xc5/0xea
  [<b0103db8>] show_registers+0x19e/0x22b
  [<b0103f61>] die+0x11c/0x22c
  [<b01137c4>] do_page_fault+0x27a/0x5de
  [<b0103737>] error_code+0x4f/0x54
  [<b013ff30>] __alloc_pages+0x55/0x2bf
  [<b0147a53>] __handle_mm_fault+0x6e5/0x7fb
  [<b0113917>] do_page_fault+0x3cd/0x5de
  [<b0103737>] error_code+0x4f/0x54
  [<b013b9b9>] do_generic_mapping_read+0x2a4/0x4bf
  [<b013c33b>] __generic_file_aio_read+0xe3/0x230
  [<b013d808>] generic_file_read+0x8e/0xae
  [<b0157dc1>] vfs_read+0x89/0x144
  [<b015828b>] sys_read+0x3d/0x64
  [<b0102b3f>] sysenter_past_esp+0x54/0x75
Code: db 0f 8e 2e ff ff ff 8b 5d d8 c7 45 dc 00 00 00 00 31 f6 ba 03 00 00 00 89 
d8 e8 4f 46 fd ff 89 45 ac b9 00 04 00 00 89 c7 89 f0 <f3> ab ba 03 00 00 00 8b 
45 ac e8 b2 46 fd ff 83 45 dc 01 83 c3
  <6>note: gdb[21399] exited with preempt_count 1
BUG: sleeping function called from invalid context at include/linux/rwsem.h:43
in_atomic():1, irqs_disabled():0
  [<b010412b>] show_trace+0xd/0xf
  [<b0104251>] dump_stack+0x17/0x19
  [<b0114eb1>] __might_sleep+0xa0/0xa7
  [<b011d386>] exit_mm+0x32/0x164
  [<b011e9ba>] do_exit+0x108/0x845
  [<b0104071>] do_trap+0x0/0x9e
  [<b01137c4>] do_page_fault+0x27a/0x5de
  [<b0103737>] error_code+0x4f/0x54
  [<b013ff30>] __alloc_pages+0x55/0x2bf
  [<b0147a53>] __handle_mm_fault+0x6e5/0x7fb
  [<b0113917>] do_page_fault+0x3cd/0x5de
  [<b0103737>] error_code+0x4f/0x54
  [<b013b9b9>] do_generic_mapping_read+0x2a4/0x4bf
  [<b013c33b>] __generic_file_aio_read+0xe3/0x230
  [<b013d808>] generic_file_read+0x8e/0xae
  [<b0157dc1>] vfs_read+0x89/0x144
  [<b015828b>] sys_read+0x3d/0x64
  [<b0102b3f>] sysenter_past_esp+0x54/0x75

The only patch applied to this release was this one in a thread started by 
Ulrich Drepper:

--- devel/fs/namei.c~nameic-unlock-missing-in-error-case        2006-02-04 15:57
:22.000000000 -0800
+++ devel-akpm/fs/namei.c       2006-02-04 16:07:56.000000000 -0800

I'll build an rc2-mm1 now and see how we go, but I guess it'd still be good to 
know if this one was ever fixed or not.

reuben

  reply	other threads:[~2006-02-08  6:20 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-02-03  8:07 2.6.16-rc1-mm5 Andrew Morton
2006-02-08  6:20 ` Reuben Farrelly [this message]
  -- strict thread matches above, loose matches on Subject: below --
2006-02-03  8:07 2.6.16-rc1-mm5 Andrew Morton
2006-02-03 10:17 ` 2.6.16-rc1-mm5 Benoit Boissinot
2006-02-03 20:13   ` 2.6.16-rc1-mm5 Benoit Boissinot
2006-02-03 20:36     ` 2.6.16-rc1-mm5 Andrew Morton
2006-02-04  7:59 ` 2.6.16-rc1-mm5 Matthias Urlichs
2006-02-04  9:29 ` 2.6.16-rc1-mm5 Sam Ravnborg
2006-02-04  9:36   ` 2.6.16-rc1-mm5 Andrew Morton

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=43E98D98.2090805@reub.net \
    --to=reuben-lkml@reub.net \
    --cc=akpm@osdl.org \
    --cc=kaber@trash.net \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.