All of lore.kernel.org
 help / color / mirror / Atom feed
From: Laurent Riffard <laurent.riffard@free.fr>
To: James Bottomley <James.Bottomley@SteelEye.com>
Cc: Hannes Reinecke <hare@suse.de>,
	Andrew Morton <akpm@linux-foundation.org>,
	linux-kernel@vger.kernel.org, linux-ide@vger.kernel.org,
	linux-scsi@vger.kernel.org
Subject: Re: 2.6.24-rc3-mm1: I/O error, system hangs
Date: Sun, 25 Nov 2007 21:39:35 +0100	[thread overview]
Message-ID: <4749DD87.5020206@free.fr> (raw)
In-Reply-To: <1195976275.3427.6.camel@localhost.localdomain>

Le 25.11.2007 08:37, James Bottomley a écrit :
> On Sat, 2007-11-24 at 23:59 +0100, Laurent Riffard wrote:
>> Le 24.11.2007 14:26, James Bottomley a écrit :
>>> OK, could you post dmesgs again, please.  I actually tested this
>> with an
>>> aic79xx card, and for me it does cause Domain Validation to succeed
>>> again.
>> James, 
>>
>> Here is a dmesg produced by 2.6.24-rc3-mm1 + your patch "separates
>> the 
>> BLOCK and QUIESCE states
>> correctly" (http://lkml.org/lkml/2007/11/24/8).
>>
>> How to reproduce :
>> - boot
>> - switch to a text console
>> - capture dmesg in a file, sync, etc. There are 3 I/O errors, but the 
>>   system does work.
>> - switch to X console, log in the Gnome Desktop, the system partially 
>>   hangs.
>> - switch back to a text console: dmesg(1) still works, it shows some 
>>   additonal I/O errors. At this point, any disk access makes the system 
>>   completely hung.
>>
>> Additionnal data:
>> - the I/O errors always happen on the same blocks.
>>
>> plain text document attachment (dmesg-2.6.24-rc3-mm1-patched)
> [...]
>> [   25.521256] scsi0 : pata_via
>> [   25.521711] scsi1 : pata_via
>> [   25.524089] ata1: PATA max UDMA/100 cmd 0x1f0 ctl 0x3f6 bmdma 0xb800 irq 14
>> [   25.524176] ata2: PATA max UDMA/100 cmd 0x170 ctl 0x376 bmdma 0xb808 irq 15
>> [   25.683141] ata1.00: ATA-5: ST340016A, 3.75, max UDMA/100
>> [   25.683208] ata1.00: 78165360 sectors, multi 16: LBA 
>> [   25.683475] ata1.01: ATA-7: Maxtor 6Y080L0, YAR41BW0, max UDMA/133
>> [   25.684116] ata1.01: 160086528 sectors, multi 16: LBA 
>> [   25.691127] ata1.00: configured for UDMA/100
>> [   25.699142] ata1.01: configured for UDMA/100
>> [   26.170860] ata2.00: ATAPI: HL-DT-ST DVDRAM GSA-4165B, DL05, max UDMA/33
>> [   26.171562] ata2.01: ATAPI: CD-950E/AKU, A4Q, max MWDMA2, CDB intr
>> [   26.330839] ata2.00: configured for UDMA/33
>> [   26.490828] ata2.01: configured for MWDMA2
>> [   26.503014] scsi 0:0:0:0: Direct-Access     ATA      ST340016A 3.75 PQ: 0 ANSI: 5
>> [   26.504670] scsi 0:0:1:0: Direct-Access     ATA      Maxtor 6Y080L0 YAR4 PQ: 0 ANSI: 5
>> [   26.509842] scsi 1:0:0:0: CD-ROM            HL-DT-ST DVDRAM GSA-4165B DL05 PQ: 0 ANSI: 5
>> [   26.511673] scsi 1:0:1:0: CD-ROM            E-IDE    CD-950E/AKU A4Q  PQ: 0 ANSI: 5
> [...]
>> [   60.216113] sd 0:0:0:0: [sda] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
>> [   60.216124] end_request: I/O error, dev sda, sector 16460
> 
> I think this one's quite easy:  PATA devices in libata are queue depth 1
> (since they don't do NCQ).  Thus, they're peculiarly sensitive to the
> bug where we fail over queue depth requests.
> 
> On the other hand, I don't see how a filesystem request is getting
> REQ_FAILFAST ... unless there's a bio or readahead issue involved.
> Anyway, could you try this patch:
> 
> http://marc.info/?l=linux-scsi&m=119592627425498
> 
> Which should fix the queue depth issue, and see if the errors go away?

No, this one doesn't help...

-- 
laurent
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

WARNING: multiple messages have this Message-ID (diff)
From: Laurent Riffard <laurent.riffard@free.fr>
To: James Bottomley <James.Bottomley@SteelEye.com>
Cc: Hannes Reinecke <hare@suse.de>,
	Andrew Morton <akpm@linux-foundation.org>,
	linux-kernel@vger.kernel.org, linux-ide@vger.kernel.org,
	linux-scsi@vger.kernel.org
Subject: Re: 2.6.24-rc3-mm1: I/O error, system hangs
Date: Sun, 25 Nov 2007 21:39:35 +0100	[thread overview]
Message-ID: <4749DD87.5020206@free.fr> (raw)
In-Reply-To: <1195976275.3427.6.camel@localhost.localdomain>

Le 25.11.2007 08:37, James Bottomley a écrit :
> On Sat, 2007-11-24 at 23:59 +0100, Laurent Riffard wrote:
>> Le 24.11.2007 14:26, James Bottomley a écrit :
>>> OK, could you post dmesgs again, please.  I actually tested this
>> with an
>>> aic79xx card, and for me it does cause Domain Validation to succeed
>>> again.
>> James, 
>>
>> Here is a dmesg produced by 2.6.24-rc3-mm1 + your patch "separates
>> the 
>> BLOCK and QUIESCE states
>> correctly" (http://lkml.org/lkml/2007/11/24/8).
>>
>> How to reproduce :
>> - boot
>> - switch to a text console
>> - capture dmesg in a file, sync, etc. There are 3 I/O errors, but the 
>>   system does work.
>> - switch to X console, log in the Gnome Desktop, the system partially 
>>   hangs.
>> - switch back to a text console: dmesg(1) still works, it shows some 
>>   additonal I/O errors. At this point, any disk access makes the system 
>>   completely hung.
>>
>> Additionnal data:
>> - the I/O errors always happen on the same blocks.
>>
>> plain text document attachment (dmesg-2.6.24-rc3-mm1-patched)
> [...]
>> [   25.521256] scsi0 : pata_via
>> [   25.521711] scsi1 : pata_via
>> [   25.524089] ata1: PATA max UDMA/100 cmd 0x1f0 ctl 0x3f6 bmdma 0xb800 irq 14
>> [   25.524176] ata2: PATA max UDMA/100 cmd 0x170 ctl 0x376 bmdma 0xb808 irq 15
>> [   25.683141] ata1.00: ATA-5: ST340016A, 3.75, max UDMA/100
>> [   25.683208] ata1.00: 78165360 sectors, multi 16: LBA 
>> [   25.683475] ata1.01: ATA-7: Maxtor 6Y080L0, YAR41BW0, max UDMA/133
>> [   25.684116] ata1.01: 160086528 sectors, multi 16: LBA 
>> [   25.691127] ata1.00: configured for UDMA/100
>> [   25.699142] ata1.01: configured for UDMA/100
>> [   26.170860] ata2.00: ATAPI: HL-DT-ST DVDRAM GSA-4165B, DL05, max UDMA/33
>> [   26.171562] ata2.01: ATAPI: CD-950E/AKU, A4Q, max MWDMA2, CDB intr
>> [   26.330839] ata2.00: configured for UDMA/33
>> [   26.490828] ata2.01: configured for MWDMA2
>> [   26.503014] scsi 0:0:0:0: Direct-Access     ATA      ST340016A 3.75 PQ: 0 ANSI: 5
>> [   26.504670] scsi 0:0:1:0: Direct-Access     ATA      Maxtor 6Y080L0 YAR4 PQ: 0 ANSI: 5
>> [   26.509842] scsi 1:0:0:0: CD-ROM            HL-DT-ST DVDRAM GSA-4165B DL05 PQ: 0 ANSI: 5
>> [   26.511673] scsi 1:0:1:0: CD-ROM            E-IDE    CD-950E/AKU A4Q  PQ: 0 ANSI: 5
> [...]
>> [   60.216113] sd 0:0:0:0: [sda] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
>> [   60.216124] end_request: I/O error, dev sda, sector 16460
> 
> I think this one's quite easy:  PATA devices in libata are queue depth 1
> (since they don't do NCQ).  Thus, they're peculiarly sensitive to the
> bug where we fail over queue depth requests.
> 
> On the other hand, I don't see how a filesystem request is getting
> REQ_FAILFAST ... unless there's a bio or readahead issue involved.
> Anyway, could you try this patch:
> 
> http://marc.info/?l=linux-scsi&m=119592627425498
> 
> Which should fix the queue depth issue, and see if the errors go away?

No, this one doesn't help...

-- 
laurent

  reply	other threads:[~2007-11-25 20:39 UTC|newest]

Thread overview: 139+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-11-21  4:45 2.6.24-rc3-mm1 Andrew Morton
2007-11-21  5:51 ` 2.6.24-rc3-mm1 Dave Young
2007-11-21  6:00   ` 2.6.24-rc3-mm1 Andrew Morton
2007-11-21  6:03     ` 2.6.24-rc3-mm1 Dave Young
2007-11-21  6:15       ` 2.6.24-rc3-mm1 Andrew Morton
2007-11-21  6:22         ` 2.6.24-rc3-mm1 Dave Young
2007-11-21 18:35         ` 2.6.24-rc3-mm1 Kirill A. Shutemov
2007-11-21 22:25           ` 2.6.24-rc3-mm1 Andrew Morton
2007-11-26 18:48       ` 2.6.24-rc3-mm1 Rik van Riel
2007-11-26 19:33         ` 2.6.24-rc3-mm1 Jiri Slaby
2007-11-21  5:56 ` 2.6.24-rc3-mm1 - Build Failure on S390x Kamalesh Babulal
2007-11-21  6:04   ` Andrew Morton
2007-11-21  5:58 ` 2.6.24-rc3-mm1 KAMEZAWA Hiroyuki
2007-11-21  6:08   ` 2.6.24-rc3-mm1 Andrew Morton
2007-11-21 12:49     ` 2.6.24-rc3-mm1 Rene Herman
2007-11-21  6:11 ` 2.6.24-rc3-mm1 - Kernel Panic on IO-APIC Kamalesh Babulal
2007-11-21  6:18   ` Andrew Morton
2007-11-21  9:22     ` Kamalesh Babulal
2007-11-21  9:29       ` Andrew Morton
2007-11-21  9:43         ` Kamalesh Babulal
2007-11-21 19:33         ` Torsten Kaiser
2007-11-22 10:04           ` Kirill A. Shutemov
2007-11-21 19:22     ` Len Brown
2007-11-21 19:48       ` Torsten Kaiser
2007-11-24  0:49     ` Alexey Dobriyan
2007-11-26 19:39     ` Rik van Riel
2007-11-26 20:33       ` Andrew Morton
2007-11-26 20:45         ` Ingo Molnar
2007-11-26 22:08           ` Jiri Slaby
2007-11-26 22:17             ` Andrew Morton
2007-11-26 22:22               ` Jiri Slaby
2007-11-26 23:14               ` Jiri Slaby
2007-11-26 23:28                 ` Andrew Morton
2007-11-27 17:50                   ` Rik van Riel
2007-11-26 20:54         ` Rik van Riel
2007-11-26 20:56         ` Christoph Lameter
2007-11-21  8:06 ` 2.6.24-rc3-mm1- powerpc link failure Kamalesh Babulal
2007-11-21  8:06   ` Kamalesh Babulal
2007-11-21 22:52   ` Stephen Rothwell
2007-11-21 22:52     ` Stephen Rothwell
2007-11-21  8:24 ` 2.6.24-rc3-mm1 make headers_check fails Kamalesh Babulal
2007-11-21  0:32   ` Andrew Morton
2007-11-21  8:41     ` Kamalesh Babulal
2007-11-21  8:44       ` Avi Kivity
2007-11-21  8:52         ` Robert P. J. Day
2007-11-21  9:04           ` Andrew Morton
2007-11-21  9:06             ` Robert P. J. Day
2007-11-21  9:58         ` Sam Ravnborg
2007-11-21 10:00           ` Avi Kivity
2007-11-21 10:17             ` Avi Kivity
2007-11-21 10:31               ` Robert P. J. Day
2007-11-28  5:02               ` Andrew Morton
2007-12-02  8:56                 ` Avi Kivity
2007-11-24 14:34           ` Adrian Bunk
2007-11-21  8:42 ` 2.6.24-rc3-mm1 (sync is slow ?) KAMEZAWA Hiroyuki
2007-11-21  8:49   ` Andrew Morton
2007-11-22  3:06     ` KAMEZAWA Hiroyuki
2007-11-24 12:04     ` kosaki
2007-11-24 18:04       ` Gabriel C
2007-11-26  7:06         ` KAMEZAWA Hiroyuki
2007-11-21  8:49   ` KAMEZAWA Hiroyuki
2007-11-21 18:23 ` 2.6.24-rc3-mm1: usb mouse doesn't work Kirill A. Shutemov
2007-11-21 22:22   ` Andrew Morton
2007-11-22 10:17     ` Kirill A. Shutemov
2007-11-22 17:07       ` [linux-usb-devel] " Alan Stern
2007-11-22 17:41         ` Marin Mitov
2007-11-23  2:51           ` Alan Stern
2007-11-23  5:19             ` Kirill A. Shutemov
2007-11-23 16:21               ` Alan Stern
2007-12-31 21:06               ` Alan Stern
2007-11-21 21:45 ` 2.6.24-rc3-mm1: I/O error, system hangs Laurent Riffard
2007-11-21 22:41   ` Andrew Morton
2007-11-23  7:29     ` Laurent Riffard
2007-11-23  7:29       ` Laurent Riffard
2007-11-23  7:51       ` Hannes Reinecke
2007-11-23  7:51         ` Hannes Reinecke
2007-11-23 11:38         ` Hannes Reinecke
2007-11-23 17:52           ` Laurent Riffard
2007-11-24  6:42             ` James Bottomley
2007-11-24 12:57               ` Laurent Riffard
2007-11-24 13:26                 ` James Bottomley
2007-11-24 13:26                   ` James Bottomley
2007-11-24 17:54                   ` Gabriel C
2007-11-24 18:04                     ` James Bottomley
2007-11-24 18:08                       ` Gabriel C
2007-11-24 18:08                         ` Gabriel C
2007-11-24 18:28                         ` Gabriel C
2007-11-24 18:28                           ` Gabriel C
2007-11-24 22:59                   ` Laurent Riffard
2007-11-25  7:37                     ` James Bottomley
2007-11-25  7:37                       ` James Bottomley
2007-11-25 20:39                       ` Laurent Riffard [this message]
2007-11-25 20:39                         ` Laurent Riffard
2007-11-28 21:38                         ` Laurent Riffard
2007-11-24 17:44           ` James Bottomley
2007-11-26  7:54             ` Hannes Reinecke
2007-11-22 10:22 ` 2.6.24-rc3-mm1 Kirill A. Shutemov
2007-11-23  0:18   ` 2.6.24-rc3-mm1 Andrew Morton
2007-11-23  0:48     ` 2.6.24-rc3-mm1 Thomas Gleixner
2007-11-23  6:05       ` 2.6.24-rc3-mm1 Kirill A. Shutemov
2007-11-23  8:59         ` 2.6.24-rc3-mm1 Andreas Herrmann
2007-11-23  1:39 ` 2.6.24-rc3-mm1 Gabriel C
2007-11-23  4:12   ` 2.6.24-rc3-mm1 Andrew Morton
2007-11-23  5:55     ` 2.6.24-rc3-mm1 Gabriel C
2007-11-27  6:15       ` 2.6.24-rc3-mm1 Andrew Morton
2007-12-11 16:33         ` 2.6.24-rc3-mm1 James Bottomley
2007-12-12 10:08           ` 2.6.24-rc3-mm1 Boaz Harrosh
2007-12-12 11:03             ` [PATCH] REQ-flags to/from BIO-flags bugfix Boaz Harrosh
2007-12-12 15:18               ` Matthew Wilcox
2007-12-12 15:54                 ` Matthew Wilcox
2007-12-13  5:36                   ` David Chinner
2007-12-12 16:06                 ` Boaz Harrosh
2007-12-12 16:33                   ` Matthew Wilcox
2007-12-12 11:36             ` 2.6.24-rc3-mm1 Jens Axboe
2007-12-14  9:00           ` 2.6.24-rc3-mm1 Hannes Reinecke
2007-12-14  9:00             ` 2.6.24-rc3-mm1 Hannes Reinecke
2007-12-14 14:26             ` 2.6.24-rc3-mm1 James Bottomley
2008-01-07 14:05               ` Multipath failover handling (Was: Re: 2.6.24-rc3-mm1) Hannes Reinecke
2008-01-07 14:05                 ` Hannes Reinecke
2008-01-07 17:57                 ` James Bottomley
2008-01-07 18:24                   ` Mike Christie
2007-11-26 19:13 ` 2.6.24-rc3-mm1 Randy Dunlap
2007-11-26 19:34   ` 2.6.24-rc3-mm1 Christoph Lameter
2007-11-26 20:40     ` 2.6.24-rc3-mm1 Randy Dunlap
2007-11-26 20:56       ` 2.6.24-rc3-mm1 Christoph Lameter
2007-11-26 20:47     ` [PATCH -mm] x86 allnoconfig memory model Randy Dunlap
2007-11-26 21:00       ` Christoph Lameter
2007-11-26 21:17         ` Randy Dunlap
2007-11-26 21:20         ` Andrew Morton
2007-11-26 21:52           ` Christoph Lameter
2007-11-26 21:57             ` Andrew Morton
2007-11-26 23:19               ` Christoph Lameter
2007-11-27  7:16 ` 2.6.24-rc3-mm1 - brick my Dell Latitude D820 Valdis.Kletnieks
2007-11-27  7:27   ` Andrew Morton
2007-11-27  7:54     ` Valdis.Kletnieks
2007-11-27  8:17       ` Andrew Morton
2007-11-27 10:25     ` Ingo Molnar
2007-11-27  8:25   ` Dave Young
2007-11-27  8:46     ` Valdis.Kletnieks

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4749DD87.5020206@free.fr \
    --to=laurent.riffard@free.fr \
    --cc=James.Bottomley@SteelEye.com \
    --cc=akpm@linux-foundation.org \
    --cc=hare@suse.de \
    --cc=linux-ide@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-scsi@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.