* vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... @ 2002-05-05 18:34 william stinson 2002-05-05 19:04 ` Anton Altaparmakov 0 siblings, 1 reply; 13+ messages in thread From: william stinson @ 2002-05-05 18:34 UTC (permalink / raw) To: linux-kernel Hi as vanilla linux 2.5.13 compiled beautifully for me last night one I couldn't resist the temptation to boot it up and give it a whirl on my workstation (a monoprocessor AMD ATHLON on VIA motherboard with recent 20GB IDE disk and EXT2 file system, NVIDIA video card). Boot went OK until a message something like "checking filesystems - check forced -mounted 31 times without verification - verifying now". Shortly afterwards I got an OOPS message. EIP : 0010: [<c01d59cb> Not Tainted .... <0> Kernel Panic: Aiee, killing interrupt handler! In interrupt handler - not syncing Not to worry I try to reboot my stable kernel - this fails at the mount command (library's needed by mount command are missing). Impossible to login (password file must be corrupted too). With the rescue disk I run e2fsck and home partition is dead (bad superblocks) and nothing recoverable. The root file system is also corrupted (bad superblocks but not as badly as home). I have some other partitions which I haven't checked yet - maybe some of them survived. As I am not subscribed to the list please CC me in any response. If I can recover the kernel compile I will try to give some configuration options and try to decode the full oops message. More details available on request. Best regards William Stinson (wstinsonfr@yahoo.fr.nospam) P.S. The hard disk is using VIA bus master PCI IDE and the distribution is a "vanilla" mandrake 8.1. I have a REALTEK RTL8029 Ethernet Adaptor. USB is with VIA VT83C572/VT82C586 PCI to VIA Universal Host controller. ___________________________________________________________ Do You Yahoo!? -- Une adresse @yahoo.fr gratuite et en français ! Yahoo! Mail : http://fr.mail.yahoo.com ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-05 18:34 vanilla 2.5.13 severe file system corruption experienced follozing e2fsck william stinson @ 2002-05-05 19:04 ` Anton Altaparmakov 2002-05-06 5:55 ` Jens Axboe 0 siblings, 1 reply; 13+ messages in thread From: Anton Altaparmakov @ 2002-05-05 19:04 UTC (permalink / raw) To: william stinson; +Cc: linux-kernel The IDE core has a nasty PIO bug which causes it to scribble randomly over your disk (even when read-only mounted or not mounted at all). Wait for 2.5.14, that has the fix in it, or use the latest -dj kernel which has the fix, too. Note even with that fix IDE (at least TCQ) is really easy to crash when you put the system under heavier I/O (at least on my via box)... Anton At 19:34 05/05/02, william stinson wrote: >Hi > >as vanilla linux 2.5.13 compiled beautifully for me >last night one I couldn't resist the temptation to >boot it up and give it a whirl on my workstation (a >monoprocessor AMD ATHLON on VIA motherboard with >recent 20GB IDE disk and EXT2 file system, NVIDIA >video card). > >Boot went OK until a message something like "checking >filesystems - check forced -mounted 31 times without >verification - verifying now". Shortly afterwards I >got an OOPS message. > > >EIP : 0010: [<c01d59cb> Not Tainted >.... ><0> Kernel Panic: Aiee, killing interrupt handler! In >interrupt handler - not syncing > >Not to worry I try to reboot my stable kernel - this >fails at the mount command (library's needed by mount >command are missing). Impossible to login (password >file must be corrupted too). > >With the rescue disk I run e2fsck and home partition >is dead (bad superblocks) and nothing recoverable. >The root file system is also corrupted (bad >superblocks but not as badly as home). I have some >other partitions which I haven't checked yet - maybe >some of them survived. > >As I am not subscribed to the list please CC me in any >response. If I can recover the kernel compile I will >try to give some configuration options and try to >decode the full oops message. More details available >on request. > >Best regards >William Stinson (wstinsonfr@yahoo.fr.nospam) > >P.S. > > The hard disk is using VIA bus master PCI IDE and the >distribution is a "vanilla" mandrake 8.1. I have a >REALTEK RTL8029 Ethernet Adaptor. USB is with VIA >VT83C572/VT82C586 PCI to VIA Universal Host >controller. > >___________________________________________________________ >Do You Yahoo!? -- Une adresse @yahoo.fr gratuite et en français ! >Yahoo! Mail : http://fr.mail.yahoo.com >- >To unsubscribe from this list: send the line "unsubscribe linux-kernel" in >the body of a message to majordomo@vger.kernel.org >More majordomo info at http://vger.kernel.org/majordomo-info.html >Please read the FAQ at http://www.tux.org/lkml/ -- "I've not lost my mind. It's backed up on tape somewhere." - Unknown -- Anton Altaparmakov <aia21 at cantab.net> (replace at with @) Linux NTFS Maintainer / IRC: #ntfs on irc.openprojects.net WWW: http://linux-ntfs.sf.net/ & http://www-stu.christs.cam.ac.uk/~aia21/ ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-05 19:04 ` Anton Altaparmakov @ 2002-05-06 5:55 ` Jens Axboe 2002-05-06 8:47 ` Anton Altaparmakov 0 siblings, 1 reply; 13+ messages in thread From: Jens Axboe @ 2002-05-06 5:55 UTC (permalink / raw) To: Anton Altaparmakov; +Cc: william stinson, linux-kernel On Sun, May 05 2002, Anton Altaparmakov wrote: > Note even with that fix IDE (at least TCQ) is really easy to crash when you > put the system under heavier I/O (at least on my via box)... If you have stumpled upon a tcq bug, I'd like to know more about it. -- Jens Axboe ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 5:55 ` Jens Axboe @ 2002-05-06 8:47 ` Anton Altaparmakov 2002-05-06 8:50 ` Jens Axboe 0 siblings, 1 reply; 13+ messages in thread From: Anton Altaparmakov @ 2002-05-06 8:47 UTC (permalink / raw) To: Jens Axboe; +Cc: william stinson, linux-kernel At 06:55 06/05/02, Jens Axboe wrote: >On Sun, May 05 2002, Anton Altaparmakov wrote: > > Note even with that fix IDE (at least TCQ) is really easy to crash when > you > > put the system under heavier I/O (at least on my via box)... > >If you have stumpled upon a tcq bug, I'd like to know more about it. Back trace (sorry didn't have ckermit running so didn't catch the whole output and was too lazy to write it all down): blk_queue_invalidate_tags, tcq_invalidate_queue, ide_dmaq_complete, ide_dmaq_intr, ata_irq_request, ide_dmaq_intr, handle_IRQ_event, do_IRQ, ideprobe_init. At the moment I try to not use 2.5.x as much as possible and only boot into it to test ntfs or other changes I make, so when I do that next I will make sure I am capturing on the serial console and send you details if I experince the panic again... Anton -- "I've not lost my mind. It's backed up on tape somewhere." - Unknown -- Anton Altaparmakov <aia21 at cantab.net> (replace at with @) Linux NTFS Maintainer / IRC: #ntfs on irc.openprojects.net WWW: http://linux-ntfs.sf.net/ & http://www-stu.christs.cam.ac.uk/~aia21/ ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 8:47 ` Anton Altaparmakov @ 2002-05-06 8:50 ` Jens Axboe 2002-05-06 10:02 ` tcq problem details " Anton Altaparmakov 2002-05-06 13:43 ` Sebastian Droege 0 siblings, 2 replies; 13+ messages in thread From: Jens Axboe @ 2002-05-06 8:50 UTC (permalink / raw) To: Anton Altaparmakov; +Cc: william stinson, linux-kernel On Mon, May 06 2002, Anton Altaparmakov wrote: > At 06:55 06/05/02, Jens Axboe wrote: > >On Sun, May 05 2002, Anton Altaparmakov wrote: > >> Note even with that fix IDE (at least TCQ) is really easy to crash when > >you > >> put the system under heavier I/O (at least on my via box)... > > > >If you have stumpled upon a tcq bug, I'd like to know more about it. > > Back trace (sorry didn't have ckermit running so didn't catch the whole > output and was too lazy to write it all down): blk_queue_invalidate_tags, > tcq_invalidate_queue, ide_dmaq_complete, ide_dmaq_intr, ata_irq_request, > ide_dmaq_intr, handle_IRQ_event, do_IRQ, ideprobe_init. Same problem as Sebastian I'm sure, in which case the backtrace holds no info for me, the IDE messages printed _before_ the panic would be helpful though :-) > At the moment I try to not use 2.5.x as much as possible and only boot into > it to test ntfs or other changes I make, so when I do that next I will make > sure I am capturing on the serial console and send you details if I > experince the panic again... Sure, I understand. Just hang on, I'll be releasing a tcq backport for 2.4.19-pre8 today that you can test instead. -- Jens Axboe ^ permalink raw reply [flat|nested] 13+ messages in thread
* tcq problem details Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 8:50 ` Jens Axboe @ 2002-05-06 10:02 ` Anton Altaparmakov 2002-05-06 12:10 ` Jens Axboe 2002-05-06 13:43 ` Sebastian Droege 1 sibling, 1 reply; 13+ messages in thread From: Anton Altaparmakov @ 2002-05-06 10:02 UTC (permalink / raw) To: Jens Axboe; +Cc: linux-kernel Jens, I didn't get a panic in the limited testing I did just now on 2.5.14 for ntfs however I do get soemthing odd. Even when the box is fully idle proc/ide/blah/tcq shows this: TCQ currently on: yes Max queue depth: 32 Max achieved depth: 14 Max depth since last: 1 Current depth: 0 Active tags: [ 1, 3, 4, 6, 9, 11, 12, 14, 17, 19, 20, 22, 25, 27, 28, 29, 30, 31, ] Queue: released [ 1390 ] - started [ 3986 ] pending request and queue count mismatch (counted: 18) DMA status: not running Some times the number of active tags is higher, seems to vary... /me ignorant: this looks wrong. Why are there active tags when no activity? If a am right and this is a problem then perhaps tags are "leaking" some how? -- ide related msgs from boot -- ATA/ATAPI driver v7.0.0 ATA: system bus speed 33MHz ATA: interface: VIA Technologies, Inc. Bus Master IDE, on PCI slot 00:07.1 ATA: chipset rev.: 6 ATA: non-legacy mode: IRQ probe delayed VP_IDE: VIA vt82c686b (rev 40) IDE UDMA100 controller on pci00:07.1 ide0: BM-DMA at 0xd000-0xd007, BIOS settings: hda:DMA, hdb:pio ide1: BM-DMA at 0xd008-0xd00f, BIOS settings: hdc:DMA, hdd:DMA hda: IC35L040AVER07-0, ATA DISK drive hdc: LITE-ON LTR-12102B, ATAPI CD/DVD-ROM drive hdd: Maxtor 90288D2, ATA DISK drive ide0 at 0x1f0-0x1f7,0x3f6 on irq 14 ide1 at 0x170-0x177,0x376 on irq 15 hda: tagged command queueing enabled, command queue depth 32 hda: 80418240 sectors w/1916KiB Cache, CHS=79780/16/63, UDMA(100) hdd: 5627664 sectors w/256KiB Cache, CHS=5583/16/63, UDMA(33) hdc: ATAPI 40X CD-ROM CD-R/RW drive, 8192kB Cache, (U)DMA Uniform CD-ROM driver Revision: 3.12 Partition check: hda: [PTBL] [5005/255/63] hda1 hda2 < hda5 hda6 hda7 > hdd: [PTBL] [697/128/63] hdd1 hdd2 < hdd5 hdd6 hdd7 hdd8 hdd9 hdd10 > Best regards, Anton -- "I've not lost my mind. It's backed up on tape somewhere." - Unknown -- Anton Altaparmakov <aia21 at cantab.net> (replace at with @) Linux NTFS Maintainer / IRC: #ntfs on irc.openprojects.net WWW: http://linux-ntfs.sf.net/ & http://www-stu.christs.cam.ac.uk/~aia21/ ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: tcq problem details Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 10:02 ` tcq problem details " Anton Altaparmakov @ 2002-05-06 12:10 ` Jens Axboe 2002-05-06 13:18 ` Jens Axboe 0 siblings, 1 reply; 13+ messages in thread From: Jens Axboe @ 2002-05-06 12:10 UTC (permalink / raw) To: Anton Altaparmakov; +Cc: linux-kernel On Mon, May 06 2002, Anton Altaparmakov wrote: > Jens, > > I didn't get a panic in the limited testing I did just now on 2.5.14 for > ntfs however I do get soemthing odd. Even when the box is fully idle > proc/ide/blah/tcq shows this: > > TCQ currently on: yes > Max queue depth: 32 > Max achieved depth: 14 > Max depth since last: 1 > Current depth: 0 > Active tags: [ 1, 3, 4, 6, 9, 11, 12, 14, 17, 19, 20, 22, 25, > 27, 28, 29, 30, 31, ] > Queue: released [ 1390 ] - started [ 3986 ] > pending request and queue count mismatch (counted: 18) > DMA status: not running > > Some times the number of active tags is higher, seems to vary... > > /me ignorant: this looks wrong. Why are there active tags when no activity? > If a am right and this is a problem then perhaps tags are "leaking" some > how? Agrh, that's a silly bug in blk_queue_init_tags(). Could you replace the memset() of tags->tag_index in there with something ala: for (i = 0; i < depth; i++) tags->tag_index[i] = NULL; and see if that solves it? -- Jens Axboe ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: tcq problem details Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 12:10 ` Jens Axboe @ 2002-05-06 13:18 ` Jens Axboe 2002-05-06 14:36 ` Anton Altaparmakov 0 siblings, 1 reply; 13+ messages in thread From: Jens Axboe @ 2002-05-06 13:18 UTC (permalink / raw) To: Anton Altaparmakov; +Cc: linux-kernel, Linus Torvalds On Mon, May 06 2002, Jens Axboe wrote: > Agrh, that's a silly bug in blk_queue_init_tags(). Could you replace the > memset() of tags->tag_index in there with something ala: Brown paper bag time, this should make it work. Linus, please apply. # This is a BitKeeper generated patch for the following project: # Project Name: Linux kernel tree # This patch format is intended for GNU patch command version 2.5 or higher. # This patch includes the following deltas: # ChangeSet 1.549 -> 1.550 # drivers/block/ll_rw_blk.c 1.64 -> 1.65 # # The following is the BitKeeper ChangeSet Log # -------------------------------------------- # 02/05/06 axboe@burns.home.kernel.dk 1.550 # Transposed the last two arguments to memset, causing a slab poisoned # kernel not to use tagging correctly... Brown paper bag stuff. # -------------------------------------------- # diff -Nru a/drivers/block/ll_rw_blk.c b/drivers/block/ll_rw_blk.c --- a/drivers/block/ll_rw_blk.c Mon May 6 15:17:09 2002 +++ b/drivers/block/ll_rw_blk.c Mon May 6 15:17:09 2002 @@ -358,8 +358,8 @@ if (!tags->tag_map) goto fail_map; - memset(tags->tag_index, depth * sizeof(struct request *), 0); - memset(tags->tag_map, bits * sizeof(unsigned long), 0); + memset(tags->tag_index, 0, depth * sizeof(struct request *)); + memset(tags->tag_map, 0, bits * sizeof(unsigned long)); INIT_LIST_HEAD(&tags->busy_list); tags->busy = 0; tags->max_depth = depth; -- Jens Axboe ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: tcq problem details Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 13:18 ` Jens Axboe @ 2002-05-06 14:36 ` Anton Altaparmakov 0 siblings, 0 replies; 13+ messages in thread From: Anton Altaparmakov @ 2002-05-06 14:36 UTC (permalink / raw) To: Jens Axboe; +Cc: linux-kernel, Linus Torvalds At 14:18 06/05/02, Jens Axboe wrote: >On Mon, May 06 2002, Jens Axboe wrote: > > Agrh, that's a silly bug in blk_queue_init_tags(). Could you replace the > > memset() of tags->tag_index in there with something ala: > >Brown paper bag time, this should make it work. Linus, please apply. It now works. (-; running 2.5.14 with ide, tcq, highmem (1G ram), NO preemption or smp compiled in, seems to be holding up. ran already one loop of my ntfs stress test without problems. now trying second time... [aia21@drop aia21]$ sudo cat /proc/ide/ide0/hda/tcq TCQ currently on: yes Max queue depth: 32 Max achieved depth: 32 Max depth since last: 4 Current depth: 0 Active tags: [ ] Queue: released [ 3852 ] - started [ 6539 ] DMA status: not running Anton -- "I've not lost my mind. It's backed up on tape somewhere." - Unknown -- Anton Altaparmakov <aia21 at cantab.net> (replace at with @) Linux NTFS Maintainer / IRC: #ntfs on irc.openprojects.net WWW: http://linux-ntfs.sf.net/ & http://www-stu.christs.cam.ac.uk/~aia21/ ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 8:50 ` Jens Axboe 2002-05-06 10:02 ` tcq problem details " Anton Altaparmakov @ 2002-05-06 13:43 ` Sebastian Droege 2002-05-06 13:48 ` Jens Axboe 1 sibling, 1 reply; 13+ messages in thread From: Sebastian Droege @ 2002-05-06 13:43 UTC (permalink / raw) To: Jens Axboe; +Cc: aia21, wstinsonfr, linux-kernel On Mon, 6 May 2002 10:50:33 +0200 Jens Axboe <axboe@suse.de> wrote: > On Mon, May 06 2002, Anton Altaparmakov wrote: > > At 06:55 06/05/02, Jens Axboe wrote: > > >On Sun, May 05 2002, Anton Altaparmakov wrote: > > >> Note even with that fix IDE (at least TCQ) is really easy to crash when > > >you > > >> put the system under heavier I/O (at least on my via box)... > > > > > >If you have stumpled upon a tcq bug, I'd like to know more about it. > > > > Back trace (sorry didn't have ckermit running so didn't catch the whole > > output and was too lazy to write it all down): blk_queue_invalidate_tags, > > tcq_invalidate_queue, ide_dmaq_complete, ide_dmaq_intr, ata_irq_request, > > ide_dmaq_intr, handle_IRQ_event, do_IRQ, ideprobe_init. > > Same problem as Sebastian I'm sure, in which case the backtrace holds no > info for me, the IDE messages printed _before_ the panic would be > helpful though :-) Ok here they are (or do you mean the ide initialisation?): [normal stuff] ide_tcq_intr_timeout: timeout waiting for service interrupt... ide_tcq_intr_timeout: hwgroup not busy hda: invalidating pending queue (10) kernel BUG at ll_rw_blk.c:407! [panic] ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 13:43 ` Sebastian Droege @ 2002-05-06 13:48 ` Jens Axboe 2002-05-06 15:14 ` Sebastian Droege 0 siblings, 1 reply; 13+ messages in thread From: Jens Axboe @ 2002-05-06 13:48 UTC (permalink / raw) To: Sebastian Droege; +Cc: aia21, wstinsonfr, linux-kernel On Mon, May 06 2002, Sebastian Droege wrote: > On Mon, 6 May 2002 10:50:33 +0200 > Jens Axboe <axboe@suse.de> wrote: > > > On Mon, May 06 2002, Anton Altaparmakov wrote: > > > At 06:55 06/05/02, Jens Axboe wrote: > > > >On Sun, May 05 2002, Anton Altaparmakov wrote: > > > >> Note even with that fix IDE (at least TCQ) is really easy to crash when > > > >you > > > >> put the system under heavier I/O (at least on my via box)... > > > > > > > >If you have stumpled upon a tcq bug, I'd like to know more about it. > > > > > > Back trace (sorry didn't have ckermit running so didn't catch the whole > > > output and was too lazy to write it all down): blk_queue_invalidate_tags, > > > tcq_invalidate_queue, ide_dmaq_complete, ide_dmaq_intr, ata_irq_request, > > > ide_dmaq_intr, handle_IRQ_event, do_IRQ, ideprobe_init. > > > > Same problem as Sebastian I'm sure, in which case the backtrace holds no > > info for me, the IDE messages printed _before_ the panic would be > > helpful though :-) > Ok here they are (or do you mean the ide initialisation?): > > [normal stuff] > > ide_tcq_intr_timeout: timeout waiting for service interrupt... > ide_tcq_intr_timeout: hwgroup not busy > hda: invalidating pending queue (10) > kernel BUG at ll_rw_blk.c:407! Thanks, yes these were the messages I meant. Could you try 2.4.19-pre8 plus patches just posted? -- Jens Axboe ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 13:48 ` Jens Axboe @ 2002-05-06 15:14 ` Sebastian Droege 2002-05-06 15:17 ` Jens Axboe 0 siblings, 1 reply; 13+ messages in thread From: Sebastian Droege @ 2002-05-06 15:14 UTC (permalink / raw) To: Jens Axboe; +Cc: aia21, wstinsonfr, linux-kernel [-- Attachment #1: Type: text/plain, Size: 1896 bytes --] On Mon, 6 May 2002 15:48:03 +0200 Jens Axboe <axboe@suse.de> wrote: > On Mon, May 06 2002, Sebastian Droege wrote: > > On Mon, 6 May 2002 10:50:33 +0200 > > Jens Axboe <axboe@suse.de> wrote: > > > > > On Mon, May 06 2002, Anton Altaparmakov wrote: > > > > At 06:55 06/05/02, Jens Axboe wrote: > > > > >On Sun, May 05 2002, Anton Altaparmakov wrote: > > > > >> Note even with that fix IDE (at least TCQ) is really easy to crash when > > > > >you > > > > >> put the system under heavier I/O (at least on my via box)... > > > > > > > > > >If you have stumpled upon a tcq bug, I'd like to know more about it. > > > > > > > > Back trace (sorry didn't have ckermit running so didn't catch the whole > > > > output and was too lazy to write it all down): blk_queue_invalidate_tags, > > > > tcq_invalidate_queue, ide_dmaq_complete, ide_dmaq_intr, ata_irq_request, > > > > ide_dmaq_intr, handle_IRQ_event, do_IRQ, ideprobe_init. > > > > > > Same problem as Sebastian I'm sure, in which case the backtrace holds no > > > info for me, the IDE messages printed _before_ the panic would be > > > helpful though :-) > > Ok here they are (or do you mean the ide initialisation?): > > > > [normal stuff] > > > > ide_tcq_intr_timeout: timeout waiting for service interrupt... > > ide_tcq_intr_timeout: hwgroup not busy > > hda: invalidating pending queue (10) > > kernel BUG at ll_rw_blk.c:407! > > Thanks, yes these were the messages I meant. Could you try 2.4.19-pre8 > plus patches just posted? Ok... tested it panics with a NULL pointer dereference at 00000004 after hda: IBM-DTTA-351010, ATA DISK drive hdb: WDC WD800BB-00BSA0, ATA DISK drive hdc: CD-W512EB, ATAPI CD/DVD-ROM drive hdd: CD-532E-B, ATAPI CD/DVD-ROM drive ide0 at 0x1f0-0x1f7,0x3f6 on irq 14 ide1 at 0x170-0x177,0x376 on irq 15 [panic] Sorry but I have no more time for testing and oops handcopying today I'll do that tomorrow Bye [-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --] ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: vanilla 2.5.13 severe file system corruption experienced follozing e2fsck ... 2002-05-06 15:14 ` Sebastian Droege @ 2002-05-06 15:17 ` Jens Axboe 0 siblings, 0 replies; 13+ messages in thread From: Jens Axboe @ 2002-05-06 15:17 UTC (permalink / raw) To: Sebastian Droege; +Cc: aia21, wstinsonfr, linux-kernel On Mon, May 06 2002, Sebastian Droege wrote: > On Mon, 6 May 2002 15:48:03 +0200 > Jens Axboe <axboe@suse.de> wrote: > > > On Mon, May 06 2002, Sebastian Droege wrote: > > > On Mon, 6 May 2002 10:50:33 +0200 > > > Jens Axboe <axboe@suse.de> wrote: > > > > > > > On Mon, May 06 2002, Anton Altaparmakov wrote: > > > > > At 06:55 06/05/02, Jens Axboe wrote: > > > > > >On Sun, May 05 2002, Anton Altaparmakov wrote: > > > > > >> Note even with that fix IDE (at least TCQ) is really easy to crash when > > > > > >you > > > > > >> put the system under heavier I/O (at least on my via box)... > > > > > > > > > > > >If you have stumpled upon a tcq bug, I'd like to know more about it. > > > > > > > > > > Back trace (sorry didn't have ckermit running so didn't catch the whole > > > > > output and was too lazy to write it all down): blk_queue_invalidate_tags, > > > > > tcq_invalidate_queue, ide_dmaq_complete, ide_dmaq_intr, ata_irq_request, > > > > > ide_dmaq_intr, handle_IRQ_event, do_IRQ, ideprobe_init. > > > > > > > > Same problem as Sebastian I'm sure, in which case the backtrace holds no > > > > info for me, the IDE messages printed _before_ the panic would be > > > > helpful though :-) > > > Ok here they are (or do you mean the ide initialisation?): > > > > > > [normal stuff] > > > > > > ide_tcq_intr_timeout: timeout waiting for service interrupt... > > > ide_tcq_intr_timeout: hwgroup not busy > > > hda: invalidating pending queue (10) > > > kernel BUG at ll_rw_blk.c:407! > > > > Thanks, yes these were the messages I meant. Could you try 2.4.19-pre8 > > plus patches just posted? > Ok... tested > it panics with a NULL pointer dereference at 00000004 after > > hda: IBM-DTTA-351010, ATA DISK drive > hdb: WDC WD800BB-00BSA0, ATA DISK drive > hdc: CD-W512EB, ATAPI CD/DVD-ROM drive > hdd: CD-532E-B, ATAPI CD/DVD-ROM drive > ide0 at 0x1f0-0x1f7,0x3f6 on irq 14 > ide1 at 0x170-0x177,0x376 on irq 15 > [panic] > > Sorry but I have no more time for testing and oops handcopying today > I'll do that tomorrow Ok thanks, it's probably the auto_poll check though. Could you try one more boot, just commenting out the call to ide_tcq_check_autopoll() in ide_enable_queued()? It's in drivers/ide/ide-tcq.c. -- Jens Axboe ^ permalink raw reply [flat|nested] 13+ messages in thread
end of thread, other threads:[~2002-05-06 15:18 UTC | newest] Thread overview: 13+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2002-05-05 18:34 vanilla 2.5.13 severe file system corruption experienced follozing e2fsck william stinson 2002-05-05 19:04 ` Anton Altaparmakov 2002-05-06 5:55 ` Jens Axboe 2002-05-06 8:47 ` Anton Altaparmakov 2002-05-06 8:50 ` Jens Axboe 2002-05-06 10:02 ` tcq problem details " Anton Altaparmakov 2002-05-06 12:10 ` Jens Axboe 2002-05-06 13:18 ` Jens Axboe 2002-05-06 14:36 ` Anton Altaparmakov 2002-05-06 13:43 ` Sebastian Droege 2002-05-06 13:48 ` Jens Axboe 2002-05-06 15:14 ` Sebastian Droege 2002-05-06 15:17 ` Jens Axboe
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox