* Kernel BUG when using corrupted ext3 fs [not found] <f9be06770706080751y4bbb60d0q3a2286d8e2b576f6@mail.gmail.com> @ 2007-06-08 14:53 ` Karl Pickett 2007-06-11 11:41 ` Jan Kara 2007-06-11 20:04 ` Eric Sandeen 0 siblings, 2 replies; 4+ messages in thread From: Karl Pickett @ 2007-06-08 14:53 UTC (permalink / raw) To: linux-kernel I had to hard shutdown a fc6 machine due to vmware and nvidia doing ridiculous things to my screen resolution and locking up. ugh. Anyway, upon reboot it recovers the journal and mounts / rw fine. Then rc.sysinit tries to delete various things from /tmp (.ICE-unix, .font stuff, etc), kernel shows a BUG, and boot stops. Booting from a fc7 live cd and trying to mount /dev/sda2 rw and delete /tmp/* also causes a BUG. This dump is from fc7: eth0: no IPv6 routers present kjournald starting. Commit interval 5 seconds EXT3 FS on sda2, internal journal EXT3-fs: mounted filesystem with ordered data mode. SELinux: initialized (dev sda2, type ext3), uses xattr inode_doinit_with_dentry: getxattr returned 5 for dev=sda2 ino=13271075 EXT3-fs warning (device sda2): ext3_unlink: Deleting nonexistent file (13271075), 0 list_add corruption. next->prev should be prev (e8a1412c), but was e8b4ba78. (next=e8b4ba78). ------------[ cut here ]------------ kernel BUG at lib/list_debug.c:27! invalid opcode: 0000 [#1] SMP last sysfs file: /devices/pci0000:00/0000:00:01.0/0000:01:00.0/enable Modules linked in: ipv6 dm_mirror dm_multipath video sbs i2c_ec button dock battery ac lp iTCO_wdt tg3 iTCO_vendor_support i2c_i801 i2c_core parport_pc serio_raw parport sg pcspkr ext3 jbd mbcache squashfs dm_snapshot dm_mod loop sd_mod ata_piix ehci_hcd ahci ata_generic libata uhci_hcd sr_mod scsi_mod cdrom CPU: 0 EIP: 0060:[<c04e85bf>] Not tainted VLI EFLAGS: 00210282 (2.6.21-1.3194.fc7 #1) EIP is at __list_add+0x26/0x5c eax: 00000061 ebx: e8b4ba78 ecx: 00200086 edx: 00200000 esi: e8b4ba9c edi: c1964000 ebp: e89bf48c esp: e8a80eb8 ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 Process rm (pid: 2511, ti=e8a80000 task=dfb4c710 task.ti=e8a80000) Stack: c069e0d0 e8a1412c e8b4ba78 e8b4ba78 00000000 e8b4ba9c f8a5d704 e89bf48c e8997710 e8b4b634 e8b4b6cc e89bf48c e8b4ba04 e8b4ba78 e8997710 00000b00 00000195 e8b4ba9c 00000000 e8b4b6cc e89bf48c f8a5e593 e8a03400 00000000 Call Trace: [<f8a5d704>] ext3_orphan_add+0x123/0x153 [ext3] [<f8a5e593>] ext3_unlink+0x13a/0x188 [ext3] [<c047c6ed>] vfs_unlink+0x9f/0xd5 [<c047e0ad>] do_unlinkat+0x8b/0x119 [<c047fd52>] sys_getdents64+0x9b/0xa5 [<c0404f70>] syscall_call+0x7/0xb ======================= Code: 83 c4 0c 5b c3 56 53 89 c3 83 ec 10 8b 41 04 39 d0 74 1c 89 4c 24 0c 89 54 24 04 89 44 24 08 c7 04 24 d0 e0 69 c0 e8 8b f1 f3 ff <0f> 0b eb fe 8b 32 39 ce 74 1c 89 54 24 0c 89 74 24 08 89 4c 24 EIP: [<c04e85bf>] __list_add+0x26/0x5c SS:ESP 0068:e8a80eb8 BUG: warning at kernel/exit.c:798/do_exit() (Not tainted) [<c04294f3>] do_exit+0x54/0x6d3 [<c040655c>] die+0x21b/0x240 [<c0406967>] do_invalid_op+0x0/0xab [<c0406a09>] do_invalid_op+0xa2/0xab [<c04e85bf>] __list_add+0x26/0x5c [<c0427711>] vprintk+0x290/0x2c9 [<c049085b>] __getblk+0x3b/0x286 [<f8a20b1c>] do_get_write_access+0x444/0x471 [jbd] [<c060190c>] error_code+0x7c/0x84 [<c04e85bf>] __list_add+0x26/0x5c [<f8a5d704>] ext3_orphan_add+0x123/0x153 [ext3] [<f8a5e593>] ext3_unlink+0x13a/0x188 [ext3] [<c047c6ed>] vfs_unlink+0x9f/0xd5 [<c047e0ad>] do_unlinkat+0x8b/0x119 [<c047fd52>] sys_getdents64+0x9b/0xa5 [<c0404f70>] syscall_call+0x7/0xb ======================= I did an fsck -f from the fc7 live cd and now it boots. ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Kernel BUG when using corrupted ext3 fs 2007-06-08 14:53 ` Kernel BUG when using corrupted ext3 fs Karl Pickett @ 2007-06-11 11:41 ` Jan Kara 2007-06-11 14:05 ` Karl Pickett 2007-06-11 20:04 ` Eric Sandeen 1 sibling, 1 reply; 4+ messages in thread From: Jan Kara @ 2007-06-11 11:41 UTC (permalink / raw) To: Karl Pickett; +Cc: linux-kernel Hello, > I had to hard shutdown a fc6 machine due to vmware and nvidia doing > ridiculous things to my screen resolution and locking up. ugh. > Anyway, upon reboot it recovers the journal and mounts / rw fine. > Then rc.sysinit tries to delete various things from /tmp (.ICE-unix, > .font stuff, etc), kernel shows a BUG, and boot stops. Booting from a > fc7 live cd and trying to mount /dev/sda2 rw and delete /tmp/* also > causes a BUG. This dump is from fc7: > > > eth0: no IPv6 routers present > kjournald starting. Commit interval 5 seconds > EXT3 FS on sda2, internal journal > EXT3-fs: mounted filesystem with ordered data mode. > SELinux: initialized (dev sda2, type ext3), uses xattr > inode_doinit_with_dentry: getxattr returned 5 for dev=sda2 ino=13271075 > EXT3-fs warning (device sda2): ext3_unlink: Deleting nonexistent file > (13271075), 0 Obviously, ext3 was not consistent after replaying the journal which should never happen. Do you have somewhere available the original filesystem image or at least messages from fsck when fixing the filesystem? > list_add corruption. next->prev should be prev (e8a1412c), but was > e8b4ba78. (next=e8b4ba78). This is a corruption of superblocks orphan inode list. Probably it is a result of previous fs corruption... > ------------[ cut here ]------------ > kernel BUG at lib/list_debug.c:27! > invalid opcode: 0000 [#1] > SMP > last sysfs file: /devices/pci0000:00/0000:00:01.0/0000:01:00.0/enable > Modules linked in: ipv6 dm_mirror dm_multipath video sbs i2c_ec button > dock battery ac lp iTCO_wdt tg3 iTCO_vendor_support i2c_i801 i2c_core > parport_pc serio_raw parport sg pcspkr ext3 jbd mbcache squashfs > dm_snapshot dm_mod loop sd_mod ata_piix ehci_hcd ahci ata_generic > libata uhci_hcd sr_mod scsi_mod cdrom > CPU: 0 > EIP: 0060:[<c04e85bf>] Not tainted VLI > EFLAGS: 00210282 (2.6.21-1.3194.fc7 #1) > EIP is at __list_add+0x26/0x5c > eax: 00000061 ebx: e8b4ba78 ecx: 00200086 edx: 00200000 > esi: e8b4ba9c edi: c1964000 ebp: e89bf48c esp: e8a80eb8 > ds: 007b es: 007b fs: 00d8 gs: 0033 ss: 0068 > Process rm (pid: 2511, ti=e8a80000 task=dfb4c710 task.ti=e8a80000) > Stack: c069e0d0 e8a1412c e8b4ba78 e8b4ba78 00000000 e8b4ba9c f8a5d704 > e89bf48c > e8997710 e8b4b634 e8b4b6cc e89bf48c e8b4ba04 e8b4ba78 e8997710 > 00000b00 > 00000195 e8b4ba9c 00000000 e8b4b6cc e89bf48c f8a5e593 e8a03400 > 00000000 > Call Trace: > [<f8a5d704>] ext3_orphan_add+0x123/0x153 [ext3] > [<f8a5e593>] ext3_unlink+0x13a/0x188 [ext3] > [<c047c6ed>] vfs_unlink+0x9f/0xd5 > [<c047e0ad>] do_unlinkat+0x8b/0x119 > [<c047fd52>] sys_getdents64+0x9b/0xa5 > [<c0404f70>] syscall_call+0x7/0xb > ======================= > Code: 83 c4 0c 5b c3 56 53 89 c3 83 ec 10 8b 41 04 39 d0 74 1c 89 4c > 24 0c 89 54 24 04 89 44 24 08 c7 04 24 d0 e0 69 c0 e8 8b f1 f3 ff <0f> > 0b eb fe 8b 32 39 ce 74 1c 89 54 24 0c 89 74 24 08 89 4c 24 > EIP: [<c04e85bf>] __list_add+0x26/0x5c SS:ESP 0068:e8a80eb8 > BUG: warning at kernel/exit.c:798/do_exit() (Not tainted) > [<c04294f3>] do_exit+0x54/0x6d3 > [<c040655c>] die+0x21b/0x240 > [<c0406967>] do_invalid_op+0x0/0xab > [<c0406a09>] do_invalid_op+0xa2/0xab > [<c04e85bf>] __list_add+0x26/0x5c > [<c0427711>] vprintk+0x290/0x2c9 > [<c049085b>] __getblk+0x3b/0x286 > [<f8a20b1c>] do_get_write_access+0x444/0x471 [jbd] > [<c060190c>] error_code+0x7c/0x84 > [<c04e85bf>] __list_add+0x26/0x5c > [<f8a5d704>] ext3_orphan_add+0x123/0x153 [ext3] > [<f8a5e593>] ext3_unlink+0x13a/0x188 [ext3] > [<c047c6ed>] vfs_unlink+0x9f/0xd5 > [<c047e0ad>] do_unlinkat+0x8b/0x119 > [<c047fd52>] sys_getdents64+0x9b/0xa5 > [<c0404f70>] syscall_call+0x7/0xb > ======================= > > > I did an fsck -f from the fc7 live cd and now it boots. Honza -- Jan Kara <jack@suse.cz> SuSE CR Labs ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Kernel BUG when using corrupted ext3 fs 2007-06-11 11:41 ` Jan Kara @ 2007-06-11 14:05 ` Karl Pickett 0 siblings, 0 replies; 4+ messages in thread From: Karl Pickett @ 2007-06-11 14:05 UTC (permalink / raw) To: Jan Kara; +Cc: linux-kernel On 6/11/07, Jan Kara <jack@suse.cz> wrote: > Hello, > > > I had to hard shutdown a fc6 machine due to vmware and nvidia doing > > ridiculous things to my screen resolution and locking up. ugh. > > Anyway, upon reboot it recovers the journal and mounts / rw fine. > > Then rc.sysinit tries to delete various things from /tmp (.ICE-unix, > > .font stuff, etc), kernel shows a BUG, and boot stops. Booting from a > > fc7 live cd and trying to mount /dev/sda2 rw and delete /tmp/* also > > causes a BUG. This dump is from fc7: > > > > > > eth0: no IPv6 routers present > > kjournald starting. Commit interval 5 seconds > > EXT3 FS on sda2, internal journal > > EXT3-fs: mounted filesystem with ordered data mode. > > SELinux: initialized (dev sda2, type ext3), uses xattr > > inode_doinit_with_dentry: getxattr returned 5 for dev=sda2 ino=13271075 > > EXT3-fs warning (device sda2): ext3_unlink: Deleting nonexistent file > > (13271075), 0 > Obviously, ext3 was not consistent after replaying the journal which > should never happen. Do you have somewhere available the original > filesystem image or at least messages from fsck when fixing the > filesystem? Unfortunately, no. I did not think of logging the fsck output :( I recall hitting 'y' to fix about 30 things, with the last fixes being something like superblocks not matching each other which I hadn't seen before. This machine had also been hard powered off at least 5 times before in the last few months (due to nvidia lockups). -- Karl Pickett ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Kernel BUG when using corrupted ext3 fs 2007-06-08 14:53 ` Kernel BUG when using corrupted ext3 fs Karl Pickett 2007-06-11 11:41 ` Jan Kara @ 2007-06-11 20:04 ` Eric Sandeen 1 sibling, 0 replies; 4+ messages in thread From: Eric Sandeen @ 2007-06-11 20:04 UTC (permalink / raw) To: Karl Pickett; +Cc: linux-kernel Karl Pickett wrote: > I had to hard shutdown a fc6 machine due to vmware and nvidia doing > ridiculous things to my screen resolution and locking up. ugh. > Anyway, upon reboot it recovers the journal and mounts / rw fine. > Then rc.sysinit tries to delete various things from /tmp (.ICE-unix, > .font stuff, etc), kernel shows a BUG, and boot stops. Booting from a > fc7 live cd and trying to mount /dev/sda2 rw and delete /tmp/* also > causes a BUG. This dump is from fc7: > ... The corrupted orphan inode list bug I fixed, http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=2988a7740dc0dd9a0cb56576e8fe1d777dff0db3 is in that fc7 kernel, so that's probably not it (though your fc6 kernel probably didn't have that fix) (it's related to link & unlink racing) Perhaps this could be related to Vasily's issue w/ bad_inodes not getting removed from the list ? [RFC PATCH ext3/ext4] orphan list corruption due bad inode http://lkml.org/lkml/2007/6/4/7 I haven't looked at all the ways we could get bad inodes on the orphan inode list, and not properly removed... -Eric ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2007-06-11 20:08 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <f9be06770706080751y4bbb60d0q3a2286d8e2b576f6@mail.gmail.com>
2007-06-08 14:53 ` Kernel BUG when using corrupted ext3 fs Karl Pickett
2007-06-11 11:41 ` Jan Kara
2007-06-11 14:05 ` Karl Pickett
2007-06-11 20:04 ` Eric Sandeen
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox