public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
       [not found]   ` <20090729200714.GA20706@lifeintegrity.com>
@ 2009-07-29 23:04     ` Robert Hancock
  2009-07-31 18:40       ` Allan Wind
  0 siblings, 1 reply; 7+ messages in thread
From: Robert Hancock @ 2009-07-29 23:04 UTC (permalink / raw)
  To: linux-ide; +Cc: linux-scsi

On 07/29/2009 02:07 PM, Allan Wind wrote:
> On 2009-07-29T13:43:06, Robert Hancock wrote:
>> On 07/27/2009 12:03 AM, Allan Wind wrote:
>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
>>> using Linux 2.6.30.3 and seeing the following:
>>>
>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
>> Are there no error messages before this point? Can you post the full
>> dmesg output from bootup?
>
> Thanks for looking into this, Robert.  I do not see any relevant error
> message before this point, but made the entire 82k dmesg available here:
> http://lifeintegrity.com/~allan/dmesg

It seems like some request failed but apparently that mptsas driver 
isn't dumping out what happened for some reason. There are some of those 
"recovered error" indications but they're not near the I/O error report, 
so I'm not sure what's going on. CCing linux-scsi.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-07-29 23:04     ` LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key Robert Hancock
@ 2009-07-31 18:40       ` Allan Wind
  2009-08-03  5:08         ` Allan Wind
  0 siblings, 1 reply; 7+ messages in thread
From: Allan Wind @ 2009-07-31 18:40 UTC (permalink / raw)
  To: linux-ide, linux-scsi

On 2009-07-29T17:04:27, Robert Hancock wrote:
> On 07/29/2009 02:07 PM, Allan Wind wrote:
>> On 2009-07-29T13:43:06, Robert Hancock wrote:
>>> On 07/27/2009 12:03 AM, Allan Wind wrote:
>>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
>>>> using Linux 2.6.30.3 and seeing the following:
>>>>
>>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
>>> Are there no error messages before this point? Can you post the full
>>> dmesg output from bootup?
>>
>> Thanks for looking into this, Robert.  I do not see any relevant error
>> message before this point, but made the entire 82k dmesg available here:
>> http://lifeintegrity.com/~allan/dmesg
>
> It seems like some request failed but apparently that mptsas driver  
> isn't dumping out what happened for some reason. There are some of those  
> "recovered error" indications but they're not near the I/O error report,  
> so I'm not sure what's going on. CCing linux-scsi.

Is there any data I can help with to advance this issue?


/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-07-31 18:40       ` Allan Wind
@ 2009-08-03  5:08         ` Allan Wind
  2009-08-03 14:34           ` James Bottomley
  0 siblings, 1 reply; 7+ messages in thread
From: Allan Wind @ 2009-08-03  5:08 UTC (permalink / raw)
  To: linux-ide, linux-scsi

On 2009-07-31T14:40:23, Allan Wind wrote:
> On 2009-07-29T17:04:27, Robert Hancock wrote:
> > On 07/29/2009 02:07 PM, Allan Wind wrote:
> >> On 2009-07-29T13:43:06, Robert Hancock wrote:
> >>> On 07/27/2009 12:03 AM, Allan Wind wrote:
> >>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
> >>>> using Linux 2.6.30.3 and seeing the following:
> >>>>
> >>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
> >>> Are there no error messages before this point? Can you post the full
> >>> dmesg output from bootup?
> >>
> >> Thanks for looking into this, Robert.  I do not see any relevant error
> >> message before this point, but made the entire 82k dmesg available here:
> >> http://lifeintegrity.com/~allan/dmesg
> >
> > It seems like some request failed but apparently that mptsas driver  
> > isn't dumping out what happened for some reason. There are some of those  
> > "recovered error" indications but they're not near the I/O error report,  
> > so I'm not sure what's going on. CCing linux-scsi.
> 
> Is there any data I can help with to advance this issue?

The above complains about sector 3907028974 which is exactly 
19566 sectors greater than the size of the raid array according 
to parted.  In other words it appears to be an access to the last 
sector of the array.


Number  Start   End          Size         File system  Name  
Flags
 1      34s     19565s       19532s                          
bios_grub
 2      19566s  3907029134s  3907009569s  ext3               raid


Model: ATA WDC WD2002FYPS-0 (scsi)
Disk /dev/sdb: 3907029168s
Sector size (logical/physical): 512B/512B
Partition Table: gpt

Number  Start   End          Size         File system  Name  
Flags
 1      34s     19565s       19532s                          
bios_grub
 2      19566s  3907029134s  3907009569s  ext3               raid


Model: Unknown (unknown)
Disk /dev/md0: 3907009408s
Sector size (logical/physical): 512B/512B
Partition Table: loop

Number  Start  End          Size         File system  Flags
 1      0s     3907009407s  3907009408s  ext3


# mdadm --detail /dev/md0
/dev/md0:
        Version : 0.90
  Creation Time : Tue Jul 21 00:06:07 2009
     Raid Level : raid1
     Array Size : 1953504704 (1863.01 GiB 2000.39 GB)
  Used Dev Size : 1953504704 (1863.01 GiB 2000.39 GB)
   Raid Devices : 2
  Total Devices : 2
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Mon Aug  3 00:59:07 2009
          State : clean
 Active Devices : 2
Working Devices : 2
 Failed Devices : 0
  Spare Devices : 0

           UUID : fdc1a639:0e6cf1a1:14ef9d30:0388f34d
         Events : 0.59706

    Number   Major   Minor   RaidDevice State
       0       8        2        0      active sync   /dev/sda2
       1       8       18        1      active sync   /dev/sdb2

Disabled NCQ did not work around this issue.

 
/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-08-03  5:08         ` Allan Wind
@ 2009-08-03 14:34           ` James Bottomley
  2009-08-03 16:06             ` Allan Wind
  0 siblings, 1 reply; 7+ messages in thread
From: James Bottomley @ 2009-08-03 14:34 UTC (permalink / raw)
  To: Allan Wind; +Cc: linux-ide, linux-scsi

On Mon, 2009-08-03 at 01:08 -0400, Allan Wind wrote:
> On 2009-07-31T14:40:23, Allan Wind wrote:
> > On 2009-07-29T17:04:27, Robert Hancock wrote:
> > > On 07/29/2009 02:07 PM, Allan Wind wrote:
> > >> On 2009-07-29T13:43:06, Robert Hancock wrote:
> > >>> On 07/27/2009 12:03 AM, Allan Wind wrote:
> > >>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
> > >>>> using Linux 2.6.30.3 and seeing the following:
> > >>>>
> > >>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
> > >>> Are there no error messages before this point? Can you post the full
> > >>> dmesg output from bootup?
> > >>
> > >> Thanks for looking into this, Robert.  I do not see any relevant error
> > >> message before this point, but made the entire 82k dmesg available here:
> > >> http://lifeintegrity.com/~allan/dmesg
> > >
> > > It seems like some request failed but apparently that mptsas driver  
> > > isn't dumping out what happened for some reason. There are some of those  
> > > "recovered error" indications but they're not near the I/O error report,  
> > > so I'm not sure what's going on. CCing linux-scsi.
> > 
> > Is there any data I can help with to advance this issue?
> 
> The above complains about sector 3907028974 which is exactly 
> 19566 sectors greater than the size of the raid array according 
> to parted.  In other words it appears to be an access to the last 
> sector of the array.

If it's a read beyond the end of a partition, then it's possible it got
rejected in the partition checking logic before ever reaching the I/O
controller (which would explain why no messages from the fusion in the
log).

However, I don't think the analysis is correct.  Parted says


> Number  Start   End          Size         File system  Name  Flags
>  1      34s     19565s       19532s                          bios_grub
>  2      19566s  3907029134s  3907009569s  ext3               raid

So the absolute sector number 3907028974 is within partition 2.

I still think something went wrong in block.  Even if the fusion failed
to spit an error, the SCSI layer is usually quite chatty about failures.
To get a simple error in the log with no explanation usually tends to
indicate that it occurred in the block layer.

James



^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-08-03 14:34           ` James Bottomley
@ 2009-08-03 16:06             ` Allan Wind
  2009-08-03 16:15               ` James Bottomley
  0 siblings, 1 reply; 7+ messages in thread
From: Allan Wind @ 2009-08-03 16:06 UTC (permalink / raw)
  To: James Bottomley; +Cc: linux-ide, linux-scsi

On 2009-08-03T09:34:53, James Bottomley wrote:
> On Mon, 2009-08-03 at 01:08 -0400, Allan Wind wrote:
> >
> > The above complains about sector 3907028974 which is exactly 
> > 19566 sectors greater than the size of the raid array according 
> > to parted.  In other words it appears to be an access to the last 
> > sector of the array.
> 
> If it's a read beyond the end of a partition, then it's possible it got
> rejected in the partition checking logic before ever reaching the I/O
> controller (which would explain why no messages from the fusion in the
> log).
> 
> However, I don't think the analysis is correct.  Parted says
> 
> 
> > Number  Start   End          Size         File system  Name  Flags
> >  1      34s     19565s       19532s                          bios_grub
> >  2      19566s  3907029134s  3907009569s  ext3               raid
> 
> So the absolute sector number 3907028974 is within partition 2.

I was actually trying to make a different point.  Namely that it 
was curious that the error message complains about sector
3907028974 which is exactly the size of the array + 19566, or in 
other words 1 sector past the end of array:

/dev/sdb2
start: 19566
end: 3907029134
size: 3907029134 - 19566 + 1 = 3907009568

/dev/md0
start: 0
end: 3907009407
size: 3907009407 - 0 + 1 = 3907009408

If dm maps linear to the disk sectors:

dm  /dev/sdb2
0   19566
1   19567
...
3907009406 3907028972
3907009407 3907028973

What dm sector would map to physical sector 3907028974?


/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-08-03 16:06             ` Allan Wind
@ 2009-08-03 16:15               ` James Bottomley
  2009-08-26 21:17                 ` Allan Wind
  0 siblings, 1 reply; 7+ messages in thread
From: James Bottomley @ 2009-08-03 16:15 UTC (permalink / raw)
  To: Allan Wind; +Cc: linux-ide, linux-scsi

On Mon, 2009-08-03 at 12:06 -0400, Allan Wind wrote:
> On 2009-08-03T09:34:53, James Bottomley wrote:
> > On Mon, 2009-08-03 at 01:08 -0400, Allan Wind wrote:
> > >
> > > The above complains about sector 3907028974 which is exactly 
> > > 19566 sectors greater than the size of the raid array according 
> > > to parted.  In other words it appears to be an access to the last 
> > > sector of the array.
> > 
> > If it's a read beyond the end of a partition, then it's possible it got
> > rejected in the partition checking logic before ever reaching the I/O
> > controller (which would explain why no messages from the fusion in the
> > log).
> > 
> > However, I don't think the analysis is correct.  Parted says
> > 
> > 
> > > Number  Start   End          Size         File system  Name  Flags
> > >  1      34s     19565s       19532s                          bios_grub
> > >  2      19566s  3907029134s  3907009569s  ext3               raid
> > 
> > So the absolute sector number 3907028974 is within partition 2.
> 
> I was actually trying to make a different point.  Namely that it 
> was curious that the error message complains about sector
> 3907028974 which is exactly the size of the array + 19566, or in 
> other words 1 sector past the end of array:

That's where md stores its superblock ... but it's still within the
partition.

James



^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-08-03 16:15               ` James Bottomley
@ 2009-08-26 21:17                 ` Allan Wind
  0 siblings, 0 replies; 7+ messages in thread
From: Allan Wind @ 2009-08-26 21:17 UTC (permalink / raw)
  To: linux-ide, linux-scsi

I have not seen any problems since upgrading to 2.6.30.4.


/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2009-08-26 21:17 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <20090727060322.GB27002@lifeintegrity.com>
     [not found] ` <4A70A64A.2070605@gmail.com>
     [not found]   ` <20090729200714.GA20706@lifeintegrity.com>
2009-07-29 23:04     ` LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key Robert Hancock
2009-07-31 18:40       ` Allan Wind
2009-08-03  5:08         ` Allan Wind
2009-08-03 14:34           ` James Bottomley
2009-08-03 16:06             ` Allan Wind
2009-08-03 16:15               ` James Bottomley
2009-08-26 21:17                 ` Allan Wind

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox