linux-ide.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
@ 2009-07-27  6:03 Allan Wind
  2009-07-29 19:43 ` Robert Hancock
  0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-07-27  6:03 UTC (permalink / raw)
  To: linux-ide

I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
using Linux 2.6.30.3 and seeing the following:

[ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
[ 4907.485543] md: super_written gets error=-5, uptodate=0
[ 4907.485546] raid1: Disk failure on sdb2, disabling device.
[ 4907.485547] raid1: Operation continuing on 1 devices.
[ 4907.499157] RAID1 conf printout:
[ 4907.499159]  --- wd:1 rd:2
[ 4907.499162]  disk 0, wo:0, o:1, dev:sda2
[ 4907.499164]  disk 1, wo:1, o:0, dev:sdb2
[ 4907.503037] RAID1 conf printout:
[ 4907.503039]  --- wd:1 rd:2
[ 4907.503041]  disk 0, wo:0, o:1, dev:sda2
[ 6705.292961] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6705.292967] Descriptor sense data with sense descriptors (in hex):
[ 6705.292970]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6705.292978]         00 4f 00 c2 00 50
[ 6705.292983] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6705.359497] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6705.359502] Descriptor sense data with sense descriptors (in hex):
[ 6705.359504]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6705.359513]         00 4f 00 c2 00 50
[ 6705.359517] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6724.022616] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6724.022622] Descriptor sense data with sense descriptors (in hex):
[ 6724.022624]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6724.022633]         00 4f 00 c2 00 50
[ 6724.022638] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6724.078063] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6724.078068] Descriptor sense data with sense descriptors (in hex):
[ 6724.078070]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6724.078079]         00 4f 00 c2 00 50
[ 6724.078083] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6740.035419] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6740.035425] Descriptor sense data with sense descriptors (in hex):
[ 6740.035427]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6740.035435]         00 4f 00 c2 00 50
[ 6740.035440] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6740.090867] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6740.090872] Descriptor sense data with sense descriptors (in hex):
[ 6740.090874]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6740.090882]         00 4f 00 c2 00 50
[ 6740.090887] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6812.955958] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 6812.955965] Descriptor sense data with sense descriptors (in hex):
[ 6812.955967]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6812.955976]         00 4f 00 c2 00 50
[ 6812.955980] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 6813.011403] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 6813.011408] Descriptor sense data with sense descriptors (in hex):
[ 6813.011410]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6813.011419]         00 4f 00 c2 00 50
[ 6813.011423] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 6818.944137] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 6818.944143] Descriptor sense data with sense descriptors (in hex):
[ 6818.944146]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6818.944154]         00 4f 00 c2 00 50
[ 6818.944159] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 6818.999583] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 6818.999587] Descriptor sense data with sense descriptors (in hex):
[ 6818.999590]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6818.999598]         00 4f 00 c2 00 50
[ 6818.999602] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7036.061540] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 7036.061547] Descriptor sense data with sense descriptors (in hex):
[ 7036.061549]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7036.061558]         00 4f 00 c2 00 50
[ 7036.061562] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 7036.116986] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 7036.116991] Descriptor sense data with sense descriptors (in hex):
[ 7036.116993]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7036.117001]         00 4f 00 c2 00 50
[ 7036.117006] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 7042.247767] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 7042.247773] Descriptor sense data with sense descriptors (in hex):
[ 7042.247776]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7042.247784]         00 4f 00 c2 00 50
[ 7042.247789] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7042.303215] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 7042.303219] Descriptor sense data with sense descriptors (in hex):
[ 7042.303221]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7042.303230]         00 4f 00 c2 00 50
[ 7042.303234] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7048.136156] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 7048.136162] Descriptor sense data with sense descriptors (in hex):
[ 7048.136165]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7048.136174]         00 4f 00 c2 00 50
[ 7048.136178] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7048.191599] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 7048.191604] Descriptor sense data with sense descriptors (in hex):
[ 7048.191606]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7048.191614]         00 4f 00 c2 00 50
[ 7048.191619] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7051.153936] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 7051.153942] Descriptor sense data with sense descriptors (in hex):
[ 7051.153944]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7051.153953]         00 4f 00 c2 00 50
[ 7051.153958] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 7051.209384] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 7051.209388] Descriptor sense data with sense descriptors (in hex):
[ 7051.209390]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7051.209399]         00 4f 00 c2 00 50
[ 7051.209403] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available

SMART suggest the drives are fine:

pawan:/var/log# smartctl -l error /dev/sda
smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Error Log Version: 1
No Errors Logged

pawan:/var/log# smartctl -l error /dev/sdb
smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Error Log Version: 1
No Errors Logged

I disabled NCQ per instructions at:
http://linux-ata.org/faq.html#ncq
Btw, is there a better way than writing a init.d script for tweaking NCQ at
reboot?

Any ideas?


/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-07-27  6:03 LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key Allan Wind
@ 2009-07-29 19:43 ` Robert Hancock
  2009-07-29 20:07   ` Allan Wind
  0 siblings, 1 reply; 11+ messages in thread
From: Robert Hancock @ 2009-07-29 19:43 UTC (permalink / raw)
  To: Allan Wind; +Cc: linux-ide

On 07/27/2009 12:03 AM, Allan Wind wrote:
> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
> using Linux 2.6.30.3 and seeing the following:
>
> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974

Are there no error messages before this point? Can you post the full 
dmesg output from bootup?

> [ 4907.485543] md: super_written gets error=-5, uptodate=0
> [ 4907.485546] raid1: Disk failure on sdb2, disabling device.
> [ 4907.485547] raid1: Operation continuing on 1 devices.
> [ 4907.499157] RAID1 conf printout:
> [ 4907.499159]  --- wd:1 rd:2
> [ 4907.499162]  disk 0, wo:0, o:1, dev:sda2
> [ 4907.499164]  disk 1, wo:1, o:0, dev:sdb2
> [ 4907.503037] RAID1 conf printout:
> [ 4907.503039]  --- wd:1 rd:2
> [ 4907.503041]  disk 0, wo:0, o:1, dev:sda2
> [ 6705.292961] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6705.292967] Descriptor sense data with sense descriptors (in hex):
> [ 6705.292970]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6705.292978]         00 4f 00 c2 00 50
> [ 6705.292983] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6705.359497] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6705.359502] Descriptor sense data with sense descriptors (in hex):
> [ 6705.359504]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6705.359513]         00 4f 00 c2 00 50
> [ 6705.359517] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6724.022616] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6724.022622] Descriptor sense data with sense descriptors (in hex):
> [ 6724.022624]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6724.022633]         00 4f 00 c2 00 50
> [ 6724.022638] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6724.078063] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6724.078068] Descriptor sense data with sense descriptors (in hex):
> [ 6724.078070]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6724.078079]         00 4f 00 c2 00 50
> [ 6724.078083] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6740.035419] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6740.035425] Descriptor sense data with sense descriptors (in hex):
> [ 6740.035427]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6740.035435]         00 4f 00 c2 00 50
> [ 6740.035440] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6740.090867] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6740.090872] Descriptor sense data with sense descriptors (in hex):
> [ 6740.090874]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6740.090882]         00 4f 00 c2 00 50
> [ 6740.090887] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6812.955958] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 6812.955965] Descriptor sense data with sense descriptors (in hex):
> [ 6812.955967]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6812.955976]         00 4f 00 c2 00 50
> [ 6812.955980] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 6813.011403] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 6813.011408] Descriptor sense data with sense descriptors (in hex):
> [ 6813.011410]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6813.011419]         00 4f 00 c2 00 50
> [ 6813.011423] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 6818.944137] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 6818.944143] Descriptor sense data with sense descriptors (in hex):
> [ 6818.944146]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6818.944154]         00 4f 00 c2 00 50
> [ 6818.944159] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 6818.999583] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 6818.999587] Descriptor sense data with sense descriptors (in hex):
> [ 6818.999590]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6818.999598]         00 4f 00 c2 00 50
> [ 6818.999602] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7036.061540] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 7036.061547] Descriptor sense data with sense descriptors (in hex):
> [ 7036.061549]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7036.061558]         00 4f 00 c2 00 50
> [ 7036.061562] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 7036.116986] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 7036.116991] Descriptor sense data with sense descriptors (in hex):
> [ 7036.116993]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7036.117001]         00 4f 00 c2 00 50
> [ 7036.117006] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 7042.247767] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 7042.247773] Descriptor sense data with sense descriptors (in hex):
> [ 7042.247776]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7042.247784]         00 4f 00 c2 00 50
> [ 7042.247789] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7042.303215] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 7042.303219] Descriptor sense data with sense descriptors (in hex):
> [ 7042.303221]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7042.303230]         00 4f 00 c2 00 50
> [ 7042.303234] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7048.136156] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 7048.136162] Descriptor sense data with sense descriptors (in hex):
> [ 7048.136165]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7048.136174]         00 4f 00 c2 00 50
> [ 7048.136178] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7048.191599] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 7048.191604] Descriptor sense data with sense descriptors (in hex):
> [ 7048.191606]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7048.191614]         00 4f 00 c2 00 50
> [ 7048.191619] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7051.153936] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 7051.153942] Descriptor sense data with sense descriptors (in hex):
> [ 7051.153944]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7051.153953]         00 4f 00 c2 00 50
> [ 7051.153958] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 7051.209384] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 7051.209388] Descriptor sense data with sense descriptors (in hex):
> [ 7051.209390]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7051.209399]         00 4f 00 c2 00 50
> [ 7051.209403] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
>
> SMART suggest the drives are fine:
>
> pawan:/var/log# smartctl -l error /dev/sda
> smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
>
> === START OF READ SMART DATA SECTION ===
> SMART Error Log Version: 1
> No Errors Logged
>
> pawan:/var/log# smartctl -l error /dev/sdb
> smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
>
> === START OF READ SMART DATA SECTION ===
> SMART Error Log Version: 1
> No Errors Logged
>
> I disabled NCQ per instructions at:
> http://linux-ata.org/faq.html#ncq
> Btw, is there a better way than writing a init.d script for tweaking NCQ at
> reboot?
>
> Any ideas?
>
>
> /Allan


^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-07-29 19:43 ` Robert Hancock
@ 2009-07-29 20:07   ` Allan Wind
  2009-07-29 23:04     ` Robert Hancock
  0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-07-29 20:07 UTC (permalink / raw)
  To: linux-ide

On 2009-07-29T13:43:06, Robert Hancock wrote:
> On 07/27/2009 12:03 AM, Allan Wind wrote:
>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
>> using Linux 2.6.30.3 and seeing the following:
>>
>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
>
> Are there no error messages before this point? Can you post the full  
> dmesg output from bootup?

Thanks for looking into this, Robert.  I do not see any relevant error 
message before this point, but made the entire 82k dmesg available here:
http://lifeintegrity.com/~allan/dmesg


/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-07-29 20:07   ` Allan Wind
@ 2009-07-29 23:04     ` Robert Hancock
  2009-07-31 18:40       ` Allan Wind
  0 siblings, 1 reply; 11+ messages in thread
From: Robert Hancock @ 2009-07-29 23:04 UTC (permalink / raw)
  To: linux-ide; +Cc: linux-scsi

On 07/29/2009 02:07 PM, Allan Wind wrote:
> On 2009-07-29T13:43:06, Robert Hancock wrote:
>> On 07/27/2009 12:03 AM, Allan Wind wrote:
>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
>>> using Linux 2.6.30.3 and seeing the following:
>>>
>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
>> Are there no error messages before this point? Can you post the full
>> dmesg output from bootup?
>
> Thanks for looking into this, Robert.  I do not see any relevant error
> message before this point, but made the entire 82k dmesg available here:
> http://lifeintegrity.com/~allan/dmesg

It seems like some request failed but apparently that mptsas driver 
isn't dumping out what happened for some reason. There are some of those 
"recovered error" indications but they're not near the I/O error report, 
so I'm not sure what's going on. CCing linux-scsi.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-07-29 23:04     ` Robert Hancock
@ 2009-07-31 18:40       ` Allan Wind
  2009-08-03  5:08         ` Allan Wind
  0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-07-31 18:40 UTC (permalink / raw)
  To: linux-ide, linux-scsi

On 2009-07-29T17:04:27, Robert Hancock wrote:
> On 07/29/2009 02:07 PM, Allan Wind wrote:
>> On 2009-07-29T13:43:06, Robert Hancock wrote:
>>> On 07/27/2009 12:03 AM, Allan Wind wrote:
>>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
>>>> using Linux 2.6.30.3 and seeing the following:
>>>>
>>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
>>> Are there no error messages before this point? Can you post the full
>>> dmesg output from bootup?
>>
>> Thanks for looking into this, Robert.  I do not see any relevant error
>> message before this point, but made the entire 82k dmesg available here:
>> http://lifeintegrity.com/~allan/dmesg
>
> It seems like some request failed but apparently that mptsas driver  
> isn't dumping out what happened for some reason. There are some of those  
> "recovered error" indications but they're not near the I/O error report,  
> so I'm not sure what's going on. CCing linux-scsi.

Is there any data I can help with to advance this issue?


/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-07-31 18:40       ` Allan Wind
@ 2009-08-03  5:08         ` Allan Wind
  2009-08-03 14:34           ` James Bottomley
  0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-08-03  5:08 UTC (permalink / raw)
  To: linux-ide, linux-scsi

On 2009-07-31T14:40:23, Allan Wind wrote:
> On 2009-07-29T17:04:27, Robert Hancock wrote:
> > On 07/29/2009 02:07 PM, Allan Wind wrote:
> >> On 2009-07-29T13:43:06, Robert Hancock wrote:
> >>> On 07/27/2009 12:03 AM, Allan Wind wrote:
> >>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
> >>>> using Linux 2.6.30.3 and seeing the following:
> >>>>
> >>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
> >>> Are there no error messages before this point? Can you post the full
> >>> dmesg output from bootup?
> >>
> >> Thanks for looking into this, Robert.  I do not see any relevant error
> >> message before this point, but made the entire 82k dmesg available here:
> >> http://lifeintegrity.com/~allan/dmesg
> >
> > It seems like some request failed but apparently that mptsas driver  
> > isn't dumping out what happened for some reason. There are some of those  
> > "recovered error" indications but they're not near the I/O error report,  
> > so I'm not sure what's going on. CCing linux-scsi.
> 
> Is there any data I can help with to advance this issue?

The above complains about sector 3907028974 which is exactly 
19566 sectors greater than the size of the raid array according 
to parted.  In other words it appears to be an access to the last 
sector of the array.


Number  Start   End          Size         File system  Name  
Flags
 1      34s     19565s       19532s                          
bios_grub
 2      19566s  3907029134s  3907009569s  ext3               raid


Model: ATA WDC WD2002FYPS-0 (scsi)
Disk /dev/sdb: 3907029168s
Sector size (logical/physical): 512B/512B
Partition Table: gpt

Number  Start   End          Size         File system  Name  
Flags
 1      34s     19565s       19532s                          
bios_grub
 2      19566s  3907029134s  3907009569s  ext3               raid


Model: Unknown (unknown)
Disk /dev/md0: 3907009408s
Sector size (logical/physical): 512B/512B
Partition Table: loop

Number  Start  End          Size         File system  Flags
 1      0s     3907009407s  3907009408s  ext3


# mdadm --detail /dev/md0
/dev/md0:
        Version : 0.90
  Creation Time : Tue Jul 21 00:06:07 2009
     Raid Level : raid1
     Array Size : 1953504704 (1863.01 GiB 2000.39 GB)
  Used Dev Size : 1953504704 (1863.01 GiB 2000.39 GB)
   Raid Devices : 2
  Total Devices : 2
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Mon Aug  3 00:59:07 2009
          State : clean
 Active Devices : 2
Working Devices : 2
 Failed Devices : 0
  Spare Devices : 0

           UUID : fdc1a639:0e6cf1a1:14ef9d30:0388f34d
         Events : 0.59706

    Number   Major   Minor   RaidDevice State
       0       8        2        0      active sync   /dev/sda2
       1       8       18        1      active sync   /dev/sdb2

Disabled NCQ did not work around this issue.

 
/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-08-03  5:08         ` Allan Wind
@ 2009-08-03 14:34           ` James Bottomley
  2009-08-03 16:06             ` Allan Wind
  0 siblings, 1 reply; 11+ messages in thread
From: James Bottomley @ 2009-08-03 14:34 UTC (permalink / raw)
  To: Allan Wind; +Cc: linux-ide, linux-scsi

On Mon, 2009-08-03 at 01:08 -0400, Allan Wind wrote:
> On 2009-07-31T14:40:23, Allan Wind wrote:
> > On 2009-07-29T17:04:27, Robert Hancock wrote:
> > > On 07/29/2009 02:07 PM, Allan Wind wrote:
> > >> On 2009-07-29T13:43:06, Robert Hancock wrote:
> > >>> On 07/27/2009 12:03 AM, Allan Wind wrote:
> > >>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
> > >>>> using Linux 2.6.30.3 and seeing the following:
> > >>>>
> > >>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
> > >>> Are there no error messages before this point? Can you post the full
> > >>> dmesg output from bootup?
> > >>
> > >> Thanks for looking into this, Robert.  I do not see any relevant error
> > >> message before this point, but made the entire 82k dmesg available here:
> > >> http://lifeintegrity.com/~allan/dmesg
> > >
> > > It seems like some request failed but apparently that mptsas driver  
> > > isn't dumping out what happened for some reason. There are some of those  
> > > "recovered error" indications but they're not near the I/O error report,  
> > > so I'm not sure what's going on. CCing linux-scsi.
> > 
> > Is there any data I can help with to advance this issue?
> 
> The above complains about sector 3907028974 which is exactly 
> 19566 sectors greater than the size of the raid array according 
> to parted.  In other words it appears to be an access to the last 
> sector of the array.

If it's a read beyond the end of a partition, then it's possible it got
rejected in the partition checking logic before ever reaching the I/O
controller (which would explain why no messages from the fusion in the
log).

However, I don't think the analysis is correct.  Parted says


> Number  Start   End          Size         File system  Name  Flags
>  1      34s     19565s       19532s                          bios_grub
>  2      19566s  3907029134s  3907009569s  ext3               raid

So the absolute sector number 3907028974 is within partition 2.

I still think something went wrong in block.  Even if the fusion failed
to spit an error, the SCSI layer is usually quite chatty about failures.
To get a simple error in the log with no explanation usually tends to
indicate that it occurred in the block layer.

James



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-08-03 14:34           ` James Bottomley
@ 2009-08-03 16:06             ` Allan Wind
  2009-08-03 16:15               ` James Bottomley
  0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-08-03 16:06 UTC (permalink / raw)
  To: James Bottomley; +Cc: linux-ide, linux-scsi

On 2009-08-03T09:34:53, James Bottomley wrote:
> On Mon, 2009-08-03 at 01:08 -0400, Allan Wind wrote:
> >
> > The above complains about sector 3907028974 which is exactly 
> > 19566 sectors greater than the size of the raid array according 
> > to parted.  In other words it appears to be an access to the last 
> > sector of the array.
> 
> If it's a read beyond the end of a partition, then it's possible it got
> rejected in the partition checking logic before ever reaching the I/O
> controller (which would explain why no messages from the fusion in the
> log).
> 
> However, I don't think the analysis is correct.  Parted says
> 
> 
> > Number  Start   End          Size         File system  Name  Flags
> >  1      34s     19565s       19532s                          bios_grub
> >  2      19566s  3907029134s  3907009569s  ext3               raid
> 
> So the absolute sector number 3907028974 is within partition 2.

I was actually trying to make a different point.  Namely that it 
was curious that the error message complains about sector
3907028974 which is exactly the size of the array + 19566, or in 
other words 1 sector past the end of array:

/dev/sdb2
start: 19566
end: 3907029134
size: 3907029134 - 19566 + 1 = 3907009568

/dev/md0
start: 0
end: 3907009407
size: 3907009407 - 0 + 1 = 3907009408

If dm maps linear to the disk sectors:

dm  /dev/sdb2
0   19566
1   19567
...
3907009406 3907028972
3907009407 3907028973

What dm sector would map to physical sector 3907028974?


/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-08-03 16:06             ` Allan Wind
@ 2009-08-03 16:15               ` James Bottomley
  2009-08-26 21:17                 ` Allan Wind
  0 siblings, 1 reply; 11+ messages in thread
From: James Bottomley @ 2009-08-03 16:15 UTC (permalink / raw)
  To: Allan Wind; +Cc: linux-ide, linux-scsi

On Mon, 2009-08-03 at 12:06 -0400, Allan Wind wrote:
> On 2009-08-03T09:34:53, James Bottomley wrote:
> > On Mon, 2009-08-03 at 01:08 -0400, Allan Wind wrote:
> > >
> > > The above complains about sector 3907028974 which is exactly 
> > > 19566 sectors greater than the size of the raid array according 
> > > to parted.  In other words it appears to be an access to the last 
> > > sector of the array.
> > 
> > If it's a read beyond the end of a partition, then it's possible it got
> > rejected in the partition checking logic before ever reaching the I/O
> > controller (which would explain why no messages from the fusion in the
> > log).
> > 
> > However, I don't think the analysis is correct.  Parted says
> > 
> > 
> > > Number  Start   End          Size         File system  Name  Flags
> > >  1      34s     19565s       19532s                          bios_grub
> > >  2      19566s  3907029134s  3907009569s  ext3               raid
> > 
> > So the absolute sector number 3907028974 is within partition 2.
> 
> I was actually trying to make a different point.  Namely that it 
> was curious that the error message complains about sector
> 3907028974 which is exactly the size of the array + 19566, or in 
> other words 1 sector past the end of array:

That's where md stores its superblock ... but it's still within the
partition.

James



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-08-03 16:15               ` James Bottomley
@ 2009-08-26 21:17                 ` Allan Wind
  2009-09-11  6:20                   ` Jeppe Oland
  0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-08-26 21:17 UTC (permalink / raw)
  To: linux-ide, linux-scsi

I have not seen any problems since upgrading to 2.6.30.4.


/Allan
-- 
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>


^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
  2009-08-26 21:17                 ` Allan Wind
@ 2009-09-11  6:20                   ` Jeppe Oland
  0 siblings, 0 replies; 11+ messages in thread
From: Jeppe Oland @ 2009-09-11  6:20 UTC (permalink / raw)
  To: linux-ide

Allan Wind <allan_wind <at> lifeintegrity.com> writes:
> I have not seen any problems since upgrading to 2.6.30.4.

I just upgraded to Debian Testing with 2.6.30-1-amd64,and now I am seeing
the same problem. Didn't have any problems with Debian 5.0 stable.

[751979.174584] sd 6:0:0:0: [sda] Sense Key : Recovered Error [current]
[descriptor]
[751979.174590] Descriptor sense data with sense descriptors (in hex):
[751979.174592]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[751979.174597]         00 4f 00 c2 40 50
[751979.174600] sd 6:0:0:0: [sda] Add. Sense: ATA pass through information
available
[753779.392241] sd 6:0:0:0: [sda] Sense Key : Recovered Error [current]
[descriptor]
[753779.392257] Descriptor sense data with sense descriptors (in hex):
[753779.392260]         72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[753779.392265]         00 4f 00 c2 40 50
[753779.392268] sd 6:0:0:0: [sda] Add. Sense: ATA pass through information
available
[754954.862297] REISERFS error (device dm-9): vs-13070
reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of
[42641 42810 0x0 SD]
[754954.862326] REISERFS error (device dm-9): vs-13070
reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of
[42641 42819 0x0 SD]

and lots more like that.

SMART reports no issues, and I have not noticed any data corruption (yet).

Controller is a Dell SAS RAID controller (SCSI storage controller: LSI Logic /
Symbios Logic SAS1068E PCI-Express Fusion-MPT SAS (rev 08)) with a single drive
attached to it.

Rgards,
-Jeppe


^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2009-09-11  6:25 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-07-27  6:03 LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key Allan Wind
2009-07-29 19:43 ` Robert Hancock
2009-07-29 20:07   ` Allan Wind
2009-07-29 23:04     ` Robert Hancock
2009-07-31 18:40       ` Allan Wind
2009-08-03  5:08         ` Allan Wind
2009-08-03 14:34           ` James Bottomley
2009-08-03 16:06             ` Allan Wind
2009-08-03 16:15               ` James Bottomley
2009-08-26 21:17                 ` Allan Wind
2009-09-11  6:20                   ` Jeppe Oland

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).