Kernel BUG at mm/slab.c:2380 on NFS server in nfsd (2.6.21.1)

linux-kernel.vger.kernel.org archive mirror

* Kernel BUG at mm/slab.c:2380 on NFS server in nfsd (2.6.21.1)
From: Roger Heflin @ 2007-05-16 13:55 UTC
  To: NFS, Linux-Kernel

Running bonnie over nfs on a RHEL4.4 client against a 2.6.21.1 server
got me this crash after about 4 hours of running on the server:

This was running lvm -> ext3 -> nfs ---- nfsclient (RHEL4.4).

	Ideas?

                       Roger

May 15 21:10:31 vault1 kernel: ------------[ cut here ]------------
May 15 21:10:31 vault1 kernel: kernel BUG at mm/slab.c:2380!
May 15 21:10:31 vault1 kernel: invalid opcode: 0000 [1] SMP
May 15 21:10:31 vault1 kernel: CPU 0
May 15 21:10:31 vault1 kernel: Modules linked in: qla2xxx nfsd exportfs 
lockd nfs_acl sunrpc hidp l2cap bluetooth ipv6 cpufreq_ondemand jfs 
dm_mirror dm_multipath dm_mod video sbs i2c_ec dock button battery 
asus_acpi ac lp snd_intel8x0 snd_ac97_codec ac97_bus snd_seq_dummy 
snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device sg snd_pcm_oss 
floppy snd_mixer_oss snd_pcm cfi_cmdset_0002 cfi_util mtdpart snd_timer 
jedec_probe cfi_probe gen_probe snd ck804xrom sata_nv mtdcore chipreg 
i2c_nforce2 soundcore map_funcs libata snd_page_alloc pcspkr i2c_core 
k8temp hwmon forcedeth ohci1394 ieee1394 parport_pc ide_cd parport cdrom 
serio_raw scsi_transport_fc shpchp megaraid_mbox sd_mod scsi_mod 
megaraid_mm ext3 jbd ehci_hcd ohci_hcd uhci_hcd
May 15 21:10:31 vault1 kernel: Pid: 4256, comm: nfsd Not tainted 2.6.21.1 #1
May 15 21:10:31 vault1 kernel: RIP: 0010:[<ffffffff8025a710>] [<ffffffff8025a710>] cache_alloc_refill+0xe6/0x1f3
May 15 21:10:31 vault1 kernel: RSP: 0018:ffff81021dead6d0  EFLAGS: 00010002
May 15 21:10:31 vault1 kernel: RAX: 0000000000000001 RBX: ffff81012211d960 RCX: ffff810120013040
May 15 21:10:31 vault1 kernel: RDX: 000000000000000e RSI: ffff810139020000 RDI: ffff810120013040
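
For anyone triaging a log like the one above, here is a minimal sketch of pulling out the two fields that matter most when no backtrace is available: the BUG location and the faulting symbol with its offset. The helper name `parse_oops` and the regular expressions are illustrative assumptions, not part of any kernel tooling, and they only cover the line shapes seen in this report.

```python
import re

# Matches e.g. "kernel BUG at mm/slab.c:2380!"
BUG_RE = re.compile(r"kernel BUG at (?P<file>[\w./-]+):(?P<line>\d+)!")
# Matches e.g. "cache_alloc_refill+0xe6/0x1f3" (symbol+offset/function size)
SYM_RE = re.compile(
    r"(?P<sym>[A-Za-z_]\w*)\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
)


def parse_oops(text):
    """Hypothetical helper: return BUG location and faulting symbol, if present."""
    info = {}
    bug = BUG_RE.search(text)
    if bug:
        info["file"] = bug.group("file")
        info["line"] = int(bug.group("line"))
    sym = SYM_RE.search(text)
    if sym:
        info["symbol"] = sym.group("sym")
        info["offset"] = int(sym.group("off"), 16)  # bytes into the function
        info["size"] = int(sym.group("size"), 16)   # total function size in bytes
    return info


# Two lines copied from the report above (RIP line rejoined after mail wrapping).
OOPS = """\
May 15 21:10:31 vault1 kernel: kernel BUG at mm/slab.c:2380!
May 15 21:10:31 vault1 kernel: RIP: 0010:[<ffffffff8025a710>] [<ffffffff8025a710>] cache_alloc_refill+0xe6/0x1f3
"""

info = parse_oops(OOPS)
print(info)
```

The offset and size are hexadecimal byte counts, so the trap here is 0xe6 bytes into a 0x1f3-byte `cache_alloc_refill()`. The "invalid opcode" line is expected for a BUG on x86, since `BUG()` is implemented with the `ud2` instruction.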

* Re: [NFS] Kernel BUG at mm/slab.c:2380 on NFS server in nfsd (2.6.21.1)
From: J. Bruce Fields @ 2007-05-16 16:29 UTC
  To: Roger Heflin; +Cc: NFS, Linux-Kernel

On Wed, May 16, 2007 at 08:55:19AM -0500, Roger Heflin wrote:
> Running bonnie over nfs on a RHEL4.4 client against a 2.6.21.1 server
> got me this crash after about 4 hours of running on the server:
> 
> This was running lvm -> ext3 -> nfs ---- nfsclient (RHEL4.4).

Yipes.  Has this happened only once, or do you have a way to reliably
reproduce it?

Is it a new problem?  (And, if so, what changed?)

> May 15 21:10:31 vault1 kernel: ------------[ cut here ]------------
> May 15 21:10:31 vault1 kernel: kernel BUG at mm/slab.c:2380!

That's the check_spinlock_acquired() in cache_alloc_refill().  What
causes that to fail?

> May 15 21:10:31 vault1 kernel: invalid opcode: 0000 [1] SMP
> May 15 21:10:31 vault1 kernel: CPU 0
> May 15 21:10:31 vault1 kernel: Modules linked in: qla2xxx nfsd exportfs 
> lockd nfs_acl sunrpc hidp l2cap bluetooth ipv6 cpufreq_ondemand jfs 
> dm_mirror dm_multipath dm_mod video sbs i2c_ec dock button battery 
> asus_acpi ac lp snd_intel8x0 snd_ac97_codec ac97_bus snd_seq_dummy 
> snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device sg snd_pcm_oss 
> floppy snd_mixer_oss snd_pcm cfi_cmdset_0002 cfi_util mtdpart snd_timer 
> jedec_probe cfi_probe gen_probe snd ck804xrom sata_nv mtdcore chipreg 
> i2c_nforce2 soundcore map_funcs libata snd_page_alloc pcspkr i2c_core 
> k8temp hwmon forcedeth ohci1394 ieee1394 parport_pc ide_cd parport cdrom 
> serio_raw scsi_transport_fc shpchp megaraid_mbox sd_mod scsi_mod 
> megaraid_mm ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> May 15 21:10:31 vault1 kernel: Pid: 4256, comm: nfsd Not tainted 2.6.21.1 #1
> May 15 21:10:31 vault1 kernel: RIP: 0010:[<ffffffff8025a710>] [<ffffffff8025a710>] cache_alloc_refill+0xe6/0x1f3
> May 15 21:10:31 vault1 kernel: RSP: 0018:ffff81021dead6d0  EFLAGS: 00010002
> May 15 21:10:31 vault1 kernel: RAX: 0000000000000001 RBX: ffff81012211d960 RCX: ffff810120013040
> May 15 21:10:31 vault1 kernel: RDX: 000000000000000e RSI: ffff810139020000 RDI: ffff810120013040
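
One thing the register dump does show is the interrupt state at the time of the trap. A minimal sketch of decoding the EFLAGS value, assuming only the architectural x86 bit layout; the helper names are illustrative and the two-bit IOPL field (bits 12-13) is left out for brevity:

```python
# x86 EFLAGS single-bit flags by bit position (architectural layout).
EFLAGS_BITS = {
    0: "CF", 2: "PF", 4: "AF", 6: "ZF", 7: "SF", 8: "TF",
    9: "IF", 10: "DF", 11: "OF", 14: "NT", 16: "RF", 17: "VM",
    18: "AC", 19: "VIF", 20: "VIP", 21: "ID",
}


def decode_eflags(value):
    """Return the names of the single-bit flags set in an EFLAGS value."""
    return [name for bit, name in sorted(EFLAGS_BITS.items()) if value & (1 << bit)]


def interrupts_enabled(value):
    """IF is bit 9; when it is clear, maskable interrupts are disabled."""
    return bool(value & (1 << 9))


eflags = int("00010002", 16)  # value from the "EFLAGS:" field in the log
flags = decode_eflags(eflags)
irqs_on = interrupts_enabled(eflags)
print(flags, irqs_on)
```

For 00010002 only RF is set (bit 1 is reserved and always reads as 1), and IF is clear, so interrupts were disabled when the BUG fired. That is consistent with the allocator running with interrupts off, but it says nothing about whether the expected lock was held.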

There wasn't a backtrace?

--b.

* Re: [NFS] Kernel BUG at mm/slab.c:2380 on NFS server in nfsd (2.6.21.1)
From: Roger Heflin @ 2007-05-16 16:35 UTC
  To: J. Bruce Fields; +Cc: NFS, Linux-Kernel

J. Bruce Fields wrote:
> On Wed, May 16, 2007 at 08:55:19AM -0500, Roger Heflin wrote:
>> Running bonnie over nfs on a RHEL4.4 client against a 2.6.21.1 server
>> got me this crash after about 4 hours of running on the server:
>>
>> This was running lvm -> ext3 -> nfs ---- nfsclient (RHEL4.4).
> 
> Yipes.  Has this happened only once, or do you have a way to reliably
> reproduce it?

I have not reproduced it yet; I will update if I do. I suspect that
I will be able to reproduce it, but it took several hours of running
to hit it the first time.

> 
> Is it a new problem?  (And, if so, what changed?)

It is new. What changed is that I am running different tests than before.

> 
>> May 15 21:10:31 vault1 kernel: ------------[ cut here ]------------
>> May 15 21:10:31 vault1 kernel: kernel BUG at mm/slab.c:2380!
> 
> That's the check_spinlock_acquired() in cache_alloc_refill().  What
> causes that to fail?
> 
>> May 15 21:10:31 vault1 kernel: invalid opcode: 0000 [1] SMP
>> May 15 21:10:31 vault1 kernel: CPU 0
>> May 15 21:10:31 vault1 kernel: Modules linked in: qla2xxx nfsd exportfs 
>> lockd nfs_acl sunrpc hidp l2cap bluetooth ipv6 cpufreq_ondemand jfs 
>> dm_mirror dm_multipath dm_mod video sbs i2c_ec dock button battery 
>> asus_acpi ac lp snd_intel8x0 snd_ac97_codec ac97_bus snd_seq_dummy 
>> snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device sg snd_pcm_oss 
>> floppy snd_mixer_oss snd_pcm cfi_cmdset_0002 cfi_util mtdpart snd_timer 
>> jedec_probe cfi_probe gen_probe snd ck804xrom sata_nv mtdcore chipreg 
>> i2c_nforce2 soundcore map_funcs libata snd_page_alloc pcspkr i2c_core 
>> k8temp hwmon forcedeth ohci1394 ieee1394 parport_pc ide_cd parport cdrom 
>> serio_raw scsi_transport_fc shpchp megaraid_mbox sd_mod scsi_mod 
>> megaraid_mm ext3 jbd ehci_hcd ohci_hcd uhci_hcd
>> May 15 21:10:31 vault1 kernel: Pid: 4256, comm: nfsd Not tainted 2.6.21.1 #1
>> May 15 21:10:31 vault1 kernel: RIP: 0010:[<ffffffff8025a710>] [<ffffffff8025a710>] cache_alloc_refill+0xe6/0x1f3
>> May 15 21:10:31 vault1 kernel: RSP: 0018:ffff81021dead6d0  EFLAGS: 00010002
>> May 15 21:10:31 vault1 kernel: RAX: 0000000000000001 RBX: ffff81012211d960 RCX: ffff810120013040
>> May 15 21:10:31 vault1 kernel: RDX: 000000000000000e RSI: ffff810139020000 RDI: ffff810120013040
> 
> There wasn't a backtrace?
> 
> --b.
> 

That was all I had before it took the machine out. The machine is on a
serial console now, so if it happens again I should get better data.

                              Roger
