* LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
@ 2009-07-27 6:03 Allan Wind
2009-07-29 19:43 ` Robert Hancock
0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-07-27 6:03 UTC (permalink / raw)
To: linux-ide
I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
using Linux 2.6.30.3 and seeing the following:
[ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
[ 4907.485543] md: super_written gets error=-5, uptodate=0
[ 4907.485546] raid1: Disk failure on sdb2, disabling device.
[ 4907.485547] raid1: Operation continuing on 1 devices.
[ 4907.499157] RAID1 conf printout:
[ 4907.499159] --- wd:1 rd:2
[ 4907.499162] disk 0, wo:0, o:1, dev:sda2
[ 4907.499164] disk 1, wo:1, o:0, dev:sdb2
[ 4907.503037] RAID1 conf printout:
[ 4907.503039] --- wd:1 rd:2
[ 4907.503041] disk 0, wo:0, o:1, dev:sda2
[ 6705.292961] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6705.292967] Descriptor sense data with sense descriptors (in hex):
[ 6705.292970] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6705.292978] 00 4f 00 c2 00 50
[ 6705.292983] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6705.359497] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6705.359502] Descriptor sense data with sense descriptors (in hex):
[ 6705.359504] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6705.359513] 00 4f 00 c2 00 50
[ 6705.359517] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6724.022616] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6724.022622] Descriptor sense data with sense descriptors (in hex):
[ 6724.022624] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6724.022633] 00 4f 00 c2 00 50
[ 6724.022638] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6724.078063] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6724.078068] Descriptor sense data with sense descriptors (in hex):
[ 6724.078070] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6724.078079] 00 4f 00 c2 00 50
[ 6724.078083] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6740.035419] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6740.035425] Descriptor sense data with sense descriptors (in hex):
[ 6740.035427] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6740.035435] 00 4f 00 c2 00 50
[ 6740.035440] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6740.090867] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 6740.090872] Descriptor sense data with sense descriptors (in hex):
[ 6740.090874] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6740.090882] 00 4f 00 c2 00 50
[ 6740.090887] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 6812.955958] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 6812.955965] Descriptor sense data with sense descriptors (in hex):
[ 6812.955967] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6812.955976] 00 4f 00 c2 00 50
[ 6812.955980] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 6813.011403] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 6813.011408] Descriptor sense data with sense descriptors (in hex):
[ 6813.011410] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6813.011419] 00 4f 00 c2 00 50
[ 6813.011423] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 6818.944137] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 6818.944143] Descriptor sense data with sense descriptors (in hex):
[ 6818.944146] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6818.944154] 00 4f 00 c2 00 50
[ 6818.944159] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 6818.999583] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 6818.999587] Descriptor sense data with sense descriptors (in hex):
[ 6818.999590] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 6818.999598] 00 4f 00 c2 00 50
[ 6818.999602] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7036.061540] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 7036.061547] Descriptor sense data with sense descriptors (in hex):
[ 7036.061549] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7036.061558] 00 4f 00 c2 00 50
[ 7036.061562] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 7036.116986] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 7036.116991] Descriptor sense data with sense descriptors (in hex):
[ 7036.116993] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7036.117001] 00 4f 00 c2 00 50
[ 7036.117006] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 7042.247767] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 7042.247773] Descriptor sense data with sense descriptors (in hex):
[ 7042.247776] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7042.247784] 00 4f 00 c2 00 50
[ 7042.247789] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7042.303215] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 7042.303219] Descriptor sense data with sense descriptors (in hex):
[ 7042.303221] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7042.303230] 00 4f 00 c2 00 50
[ 7042.303234] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7048.136156] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 7048.136162] Descriptor sense data with sense descriptors (in hex):
[ 7048.136165] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7048.136174] 00 4f 00 c2 00 50
[ 7048.136178] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7048.191599] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[ 7048.191604] Descriptor sense data with sense descriptors (in hex):
[ 7048.191606] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7048.191614] 00 4f 00 c2 00 50
[ 7048.191619] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
[ 7051.153936] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 7051.153942] Descriptor sense data with sense descriptors (in hex):
[ 7051.153944] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7051.153953] 00 4f 00 c2 00 50
[ 7051.153958] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
[ 7051.209384] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
[ 7051.209388] Descriptor sense data with sense descriptors (in hex):
[ 7051.209390] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[ 7051.209399] 00 4f 00 c2 00 50
[ 7051.209403] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
SMART suggests the drives are fine:
pawan:/var/log# smartctl -l error /dev/sda
smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF READ SMART DATA SECTION ===
SMART Error Log Version: 1
No Errors Logged
pawan:/var/log# smartctl -l error /dev/sdb
smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF READ SMART DATA SECTION ===
SMART Error Log Version: 1
No Errors Logged
I disabled NCQ per instructions at:
http://linux-ata.org/faq.html#ncq
Btw, is there a better way than writing an init.d script for tweaking NCQ at
reboot?
Any ideas?
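For reference, the NCQ workaround in that FAQ amounts to writing 1 to each disk's queue_depth attribute in sysfs. A minimal sketch of that step follows, assuming the usual /sys/block/<dev>/device/queue_depth path; the function name and the sysfs_root parameter are hypothetical and only there so the helper can be tried against a scratch directory. It still has to be run once per boot by whatever mechanism is preferred.

```python
from pathlib import Path


def set_queue_depth(devices, depth=1, sysfs_root="/sys"):
    """Write `depth` to queue_depth for each named disk (1 turns NCQ off).

    Returns the list of attribute files that were written.
    """
    written = []
    for dev in devices:
        attr = Path(sysfs_root) / "block" / dev / "device" / "queue_depth"
        attr.write_text(f"{depth}\n")
        written.append(attr)
    return written


# Intended use (as root), e.g. from a boot-time script:
#   set_queue_depth(["sda", "sdb"])
```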
/Allan
--
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-07-27 6:03 LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key Allan Wind
@ 2009-07-29 19:43 ` Robert Hancock
2009-07-29 20:07 ` Allan Wind
0 siblings, 1 reply; 11+ messages in thread
From: Robert Hancock @ 2009-07-29 19:43 UTC (permalink / raw)
To: Allan Wind; +Cc: linux-ide
On 07/27/2009 12:03 AM, Allan Wind wrote:
> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
> using Linux 2.6.30.3 and seeing the following:
>
> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
Are there no error messages before this point? Can you post the full
dmesg output from bootup?
> [ 4907.485543] md: super_written gets error=-5, uptodate=0
> [ 4907.485546] raid1: Disk failure on sdb2, disabling device.
> [ 4907.485547] raid1: Operation continuing on 1 devices.
> [ 4907.499157] RAID1 conf printout:
> [ 4907.499159] --- wd:1 rd:2
> [ 4907.499162] disk 0, wo:0, o:1, dev:sda2
> [ 4907.499164] disk 1, wo:1, o:0, dev:sdb2
> [ 4907.503037] RAID1 conf printout:
> [ 4907.503039] --- wd:1 rd:2
> [ 4907.503041] disk 0, wo:0, o:1, dev:sda2
> [ 6705.292961] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6705.292967] Descriptor sense data with sense descriptors (in hex):
> [ 6705.292970] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6705.292978] 00 4f 00 c2 00 50
> [ 6705.292983] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6705.359497] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6705.359502] Descriptor sense data with sense descriptors (in hex):
> [ 6705.359504] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6705.359513] 00 4f 00 c2 00 50
> [ 6705.359517] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6724.022616] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6724.022622] Descriptor sense data with sense descriptors (in hex):
> [ 6724.022624] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6724.022633] 00 4f 00 c2 00 50
> [ 6724.022638] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6724.078063] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6724.078068] Descriptor sense data with sense descriptors (in hex):
> [ 6724.078070] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6724.078079] 00 4f 00 c2 00 50
> [ 6724.078083] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6740.035419] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6740.035425] Descriptor sense data with sense descriptors (in hex):
> [ 6740.035427] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6740.035435] 00 4f 00 c2 00 50
> [ 6740.035440] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6740.090867] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 6740.090872] Descriptor sense data with sense descriptors (in hex):
> [ 6740.090874] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6740.090882] 00 4f 00 c2 00 50
> [ 6740.090887] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 6812.955958] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 6812.955965] Descriptor sense data with sense descriptors (in hex):
> [ 6812.955967] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6812.955976] 00 4f 00 c2 00 50
> [ 6812.955980] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 6813.011403] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 6813.011408] Descriptor sense data with sense descriptors (in hex):
> [ 6813.011410] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6813.011419] 00 4f 00 c2 00 50
> [ 6813.011423] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 6818.944137] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 6818.944143] Descriptor sense data with sense descriptors (in hex):
> [ 6818.944146] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6818.944154] 00 4f 00 c2 00 50
> [ 6818.944159] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 6818.999583] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 6818.999587] Descriptor sense data with sense descriptors (in hex):
> [ 6818.999590] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 6818.999598] 00 4f 00 c2 00 50
> [ 6818.999602] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7036.061540] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 7036.061547] Descriptor sense data with sense descriptors (in hex):
> [ 7036.061549] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7036.061558] 00 4f 00 c2 00 50
> [ 7036.061562] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 7036.116986] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 7036.116991] Descriptor sense data with sense descriptors (in hex):
> [ 7036.116993] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7036.117001] 00 4f 00 c2 00 50
> [ 7036.117006] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 7042.247767] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 7042.247773] Descriptor sense data with sense descriptors (in hex):
> [ 7042.247776] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7042.247784] 00 4f 00 c2 00 50
> [ 7042.247789] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7042.303215] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 7042.303219] Descriptor sense data with sense descriptors (in hex):
> [ 7042.303221] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7042.303230] 00 4f 00 c2 00 50
> [ 7042.303234] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7048.136156] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 7048.136162] Descriptor sense data with sense descriptors (in hex):
> [ 7048.136165] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7048.136174] 00 4f 00 c2 00 50
> [ 7048.136178] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7048.191599] sd 4:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
> [ 7048.191604] Descriptor sense data with sense descriptors (in hex):
> [ 7048.191606] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7048.191614] 00 4f 00 c2 00 50
> [ 7048.191619] sd 4:0:0:0: [sda] Add. Sense: ATA pass through information available
> [ 7051.153936] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 7051.153942] Descriptor sense data with sense descriptors (in hex):
> [ 7051.153944] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7051.153953] 00 4f 00 c2 00 50
> [ 7051.153958] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
> [ 7051.209384] sd 4:0:1:0: [sdb] Sense Key : Recovered Error [current] [descriptor]
> [ 7051.209388] Descriptor sense data with sense descriptors (in hex):
> [ 7051.209390] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
> [ 7051.209399] 00 4f 00 c2 00 50
> [ 7051.209403] sd 4:0:1:0: [sdb] Add. Sense: ATA pass through information available
>
> SMART suggests the drives are fine:
>
> pawan:/var/log# smartctl -l error /dev/sda
> smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
>
> === START OF READ SMART DATA SECTION ===
> SMART Error Log Version: 1
> No Errors Logged
>
> pawan:/var/log# smartctl -l error /dev/sdb
> smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
>
> === START OF READ SMART DATA SECTION ===
> SMART Error Log Version: 1
> No Errors Logged
>
> I disabled NCQ per instructions at:
> http://linux-ata.org/faq.html#ncq
> Btw, is there a better way than writing an init.d script for tweaking NCQ at
> reboot?
>
> Any ideas?
>
>
> /Allan
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-07-29 19:43 ` Robert Hancock
@ 2009-07-29 20:07 ` Allan Wind
2009-07-29 23:04 ` Robert Hancock
0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-07-29 20:07 UTC (permalink / raw)
To: linux-ide
On 2009-07-29T13:43:06, Robert Hancock wrote:
> On 07/27/2009 12:03 AM, Allan Wind wrote:
>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
>> using Linux 2.6.30.3 and seeing the following:
>>
>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
>
> Are there no error messages before this point? Can you post the full
> dmesg output from bootup?
Thanks for looking into this, Robert. I do not see any relevant error
message before this point, but made the entire 82k dmesg available here:
http://lifeintegrity.com/~allan/dmesg
/Allan
--
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-07-29 20:07 ` Allan Wind
@ 2009-07-29 23:04 ` Robert Hancock
2009-07-31 18:40 ` Allan Wind
0 siblings, 1 reply; 11+ messages in thread
From: Robert Hancock @ 2009-07-29 23:04 UTC (permalink / raw)
To: linux-ide; +Cc: linux-scsi
On 07/29/2009 02:07 PM, Allan Wind wrote:
> On 2009-07-29T13:43:06, Robert Hancock wrote:
>> On 07/27/2009 12:03 AM, Allan Wind wrote:
>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
>>> using Linux 2.6.30.3 and seeing the following:
>>>
>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
>> Are there no error messages before this point? Can you post the full
>> dmesg output from bootup?
>
> Thanks for looking into this, Robert. I do not see any relevant error
> message before this point, but made the entire 82k dmesg available here:
> http://lifeintegrity.com/~allan/dmesg
It seems like some request failed but apparently that mptsas driver
isn't dumping out what happened for some reason. There are some of those
"recovered error" indications but they're not near the I/O error report,
so I'm not sure what's going on. CCing linux-scsi.
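For reference, the repeated 22-byte sense buffer from the original report can be decoded by hand. The sketch below assumes the SAT "ATA Status Return" descriptor layout (descriptor code 09h, length 0Ch) with the field offsets given in the comments. Reading 4f/c2 in LBA mid/high as the SMART RETURN STATUS "threshold not exceeded" signature is also an assumption, not something confirmed in this thread. The class and function names are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class AtaStatusReturn:
    sense_key: int   # low nibble of sense byte 1 (0x1 = Recovered Error)
    asc: int         # additional sense code
    ascq: int        # additional sense code qualifier
    error: int       # ATA ERROR register
    lba_mid: int     # ATA LBA mid (7:0)
    lba_high: int    # ATA LBA high (7:0)
    device: int      # ATA DEVICE register
    status: int      # ATA STATUS register


def parse_sense(hex_dump: str) -> AtaStatusReturn:
    """Decode descriptor-format sense data carrying one ATA return descriptor."""
    raw = bytes.fromhex(hex_dump)
    if raw[0] & 0x7F != 0x72:
        raise ValueError("expected current, descriptor-format sense data")
    desc = raw[8:8 + raw[7]]          # byte 7 = additional sense length
    if len(desc) < 14 or desc[0] != 0x09 or desc[1] != 0x0C:
        raise ValueError("expected an ATA Status Return descriptor")
    return AtaStatusReturn(
        sense_key=raw[1] & 0x0F, asc=raw[2], ascq=raw[3],
        error=desc[3], lba_mid=desc[9], lba_high=desc[11],
        device=desc[12], status=desc[13],
    )


def looks_like_smart_ok(reply: AtaStatusReturn) -> bool:
    """True if the registers match the assumed SMART 'healthy' signature."""
    return (reply.lba_mid == 0x4F and reply.lba_high == 0xC2
            and not reply.status & 0x01)   # bit 0 of STATUS is ERR


SAMPLE = "72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00 00 4f 00 c2 00 50"
reply = parse_sense(SAMPLE)
print(reply)
print("SMART-status-like reply:", looks_like_smart_ok(reply))
```

If that reading is right, these entries would be completions of ATA pass-through commands (for example a SMART status poll) rather than failed reads or writes, which would fit them not lining up with the I/O error.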
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-07-29 23:04 ` Robert Hancock
@ 2009-07-31 18:40 ` Allan Wind
2009-08-03 5:08 ` Allan Wind
0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-07-31 18:40 UTC (permalink / raw)
To: linux-ide, linux-scsi
On 2009-07-29T17:04:27, Robert Hancock wrote:
> On 07/29/2009 02:07 PM, Allan Wind wrote:
>> On 2009-07-29T13:43:06, Robert Hancock wrote:
>>> On 07/27/2009 12:03 AM, Allan Wind wrote:
>>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
>>>> using Linux 2.6.30.3 and seeing the following:
>>>>
>>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
>>> Are there no error messages before this point? Can you post the full
>>> dmesg output from bootup?
>>
>> Thanks for looking into this, Robert. I do not see any relevant error
>> message before this point, but made the entire 82k dmesg available here:
>> http://lifeintegrity.com/~allan/dmesg
>
> It seems like some request failed but apparently that mptsas driver
> isn't dumping out what happened for some reason. There are some of those
> "recovered error" indications but they're not near the I/O error report,
> so I'm not sure what's going on. CCing linux-scsi.
Is there any data I can help with to advance this issue?
/Allan
--
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-07-31 18:40 ` Allan Wind
@ 2009-08-03 5:08 ` Allan Wind
2009-08-03 14:34 ` James Bottomley
0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-08-03 5:08 UTC (permalink / raw)
To: linux-ide, linux-scsi
On 2009-07-31T14:40:23, Allan Wind wrote:
> On 2009-07-29T17:04:27, Robert Hancock wrote:
> > On 07/29/2009 02:07 PM, Allan Wind wrote:
> >> On 2009-07-29T13:43:06, Robert Hancock wrote:
> >>> On 07/27/2009 12:03 AM, Allan Wind wrote:
> >>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
> >>>> using Linux 2.6.30.3 and seeing the following:
> >>>>
> >>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
> >>> Are there no error messages before this point? Can you post the full
> >>> dmesg output from bootup?
> >>
> >> Thanks for looking into this, Robert. I do not see any relevant error
> >> message before this point, but made the entire 82k dmesg available here:
> >> http://lifeintegrity.com/~allan/dmesg
> >
> > It seems like some request failed but apparently that mptsas driver
> > isn't dumping out what happened for some reason. There are some of those
> > "recovered error" indications but they're not near the I/O error report,
> > so I'm not sure what's going on. CCing linux-scsi.
>
> Is there any data I can help with to advance this issue?
The above complains about sector 3907028974 which is exactly
19566 sectors greater than the size of the raid array according
to parted. In other words it appears to be an access to the last
sector of the array.
Number  Start   End          Size         File system  Name  Flags
 1      34s     19565s       19532s                          bios_grub
 2      19566s  3907029134s  3907009569s  ext3         raid

Model: ATA WDC WD2002FYPS-0 (scsi)
Disk /dev/sdb: 3907029168s
Sector size (logical/physical): 512B/512B
Partition Table: gpt

Number  Start   End          Size         File system  Name  Flags
 1      34s     19565s       19532s                          bios_grub
 2      19566s  3907029134s  3907009569s  ext3         raid

Model: Unknown (unknown)
Disk /dev/md0: 3907009408s
Sector size (logical/physical): 512B/512B
Partition Table: loop

Number  Start  End          Size         File system  Flags
 1      0s     3907009407s  3907009408s  ext3
# mdadm --detail /dev/md0
/dev/md0:
Version : 0.90
Creation Time : Tue Jul 21 00:06:07 2009
Raid Level : raid1
Array Size : 1953504704 (1863.01 GiB 2000.39 GB)
Used Dev Size : 1953504704 (1863.01 GiB 2000.39 GB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 0
Persistence : Superblock is persistent
Update Time : Mon Aug 3 00:59:07 2009
State : clean
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0
UUID : fdc1a639:0e6cf1a1:14ef9d30:0388f34d
Events : 0.59706
Number Major Minor RaidDevice State
0 8 2 0 active sync /dev/sda2
1 8 18 1 active sync /dev/sdb2
Disabling NCQ did not work around this issue.
/Allan
--
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-08-03 5:08 ` Allan Wind
@ 2009-08-03 14:34 ` James Bottomley
2009-08-03 16:06 ` Allan Wind
0 siblings, 1 reply; 11+ messages in thread
From: James Bottomley @ 2009-08-03 14:34 UTC (permalink / raw)
To: Allan Wind; +Cc: linux-ide, linux-scsi
On Mon, 2009-08-03 at 01:08 -0400, Allan Wind wrote:
> On 2009-07-31T14:40:23, Allan Wind wrote:
> > On 2009-07-29T17:04:27, Robert Hancock wrote:
> > > On 07/29/2009 02:07 PM, Allan Wind wrote:
> > >> On 2009-07-29T13:43:06, Robert Hancock wrote:
> > >>> On 07/27/2009 12:03 AM, Allan Wind wrote:
> > >>>> I have a pair of Western Digital RE4-GP (WD2002FYPS) in RAID1 configuration
> > >>>> using Linux 2.6.30.3 and seeing the following:
> > >>>>
> > >>>> [ 4907.485324] end_request: I/O error, dev sdb, sector 3907028974
> > >>> Are there no error messages before this point? Can you post the full
> > >>> dmesg output from bootup?
> > >>
> > >> Thanks for looking into this, Robert. I do not see any relevant error
> > >> message before this point, but made the entire 82k dmesg available here:
> > >> http://lifeintegrity.com/~allan/dmesg
> > >
> > > It seems like some request failed but apparently that mptsas driver
> > > isn't dumping out what happened for some reason. There are some of those
> > > "recovered error" indications but they're not near the I/O error report,
> > > so I'm not sure what's going on. CCing linux-scsi.
> >
> > Is there any data I can help with to advance this issue?
>
> The above complains about sector 3907028974 which is exactly
> 19566 sectors greater than the size of the raid array according
> to parted. In other words it appears to be an access to the last
> sector of the array.
If it's a read beyond the end of a partition, then it's possible it got
rejected in the partition checking logic before ever reaching the I/O
controller (which would explain why no messages from the fusion in the
log).
However, I don't think the analysis is correct. Parted says
> Number Start End Size File system Name Flags
> 1 34s 19565s 19532s bios_grub
> 2 19566s 3907029134s 3907009569s ext3 raid
So the absolute sector number 3907028974 is within partition 2.
I still think something went wrong in block. Even if the fusion failed
to spit an error, the SCSI layer is usually quite chatty about failures.
To get a simple error in the log with no explanation usually tends to
indicate that it occurred in the block layer.
James
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-08-03 14:34 ` James Bottomley
@ 2009-08-03 16:06 ` Allan Wind
2009-08-03 16:15 ` James Bottomley
0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-08-03 16:06 UTC (permalink / raw)
To: James Bottomley; +Cc: linux-ide, linux-scsi
On 2009-08-03T09:34:53, James Bottomley wrote:
> On Mon, 2009-08-03 at 01:08 -0400, Allan Wind wrote:
> >
> > The above complains about sector 3907028974 which is exactly
> > 19566 sectors greater than the size of the raid array according
> > to parted. In other words it appears to be an access to the last
> > sector of the array.
>
> If it's a read beyond the end of a partition, then it's possible it got
> rejected in the partition checking logic before ever reaching the I/O
> controller (which would explain why no messages from the fusion in the
> log).
>
> However, I don't think the analysis is correct. Parted says
>
>
> > Number Start End Size File system Name Flags
> > 1 34s 19565s 19532s bios_grub
> > 2 19566s 3907029134s 3907009569s ext3 raid
>
> So the absolute sector number 3907028974 is within partition 2.
I was actually trying to make a different point. Namely that it
was curious that the error message complains about sector
3907028974 which is exactly the size of the array + 19566, or in
other words 1 sector past the end of array:
/dev/sdb2
start: 19566
end: 3907029134
size: 3907029134 - 19566 + 1 = 3907009569
/dev/md0
start: 0
end: 3907009407
size: 3907009407 - 0 + 1 = 3907009408
If md maps linearly to the disk sectors:
md /dev/sdb2
0 19566
1 19567
...
3907009406 3907028972
3907009407 3907028973
What md sector would map to physical sector 3907028974?
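The mapping above can be checked with a few lines of arithmetic. This is only a sketch: it assumes array sector n lands on disk sector 19566 + n (data starting at the first sector of the partition), and the constant names are made up; the numbers are the ones parted printed.

```python
PART_START = 19566         # first sector of /dev/sdb2
PART_END = 3907029134      # last sector of /dev/sdb2
ARRAY_SECTORS = 3907009408 # size of /dev/md0 in sectors
FAILED = 3907028974        # sector from the end_request message


def array_to_disk(array_sector: int) -> int:
    """Assumed linear mapping from an array sector to a disk sector."""
    return PART_START + array_sector


last_data = array_to_disk(ARRAY_SECTORS - 1)
offset_in_partition = FAILED - PART_START

print("last array sector on disk:", last_data)
print("failed sector, as partition offset:", offset_in_partition)
print("inside partition 2:", PART_START <= FAILED <= PART_END)
print("one past the array data:", FAILED == last_data + 1)
print("sectors from failed sector to partition end:", PART_END - FAILED + 1)
```

Under that assumption no array sector maps to 3907028974: it is the first sector after the array data, yet still 161 sectors short of the end of the partition.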
/Allan
--
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-08-03 16:06 ` Allan Wind
@ 2009-08-03 16:15 ` James Bottomley
2009-08-26 21:17 ` Allan Wind
0 siblings, 1 reply; 11+ messages in thread
From: James Bottomley @ 2009-08-03 16:15 UTC (permalink / raw)
To: Allan Wind; +Cc: linux-ide, linux-scsi
On Mon, 2009-08-03 at 12:06 -0400, Allan Wind wrote:
> On 2009-08-03T09:34:53, James Bottomley wrote:
> > On Mon, 2009-08-03 at 01:08 -0400, Allan Wind wrote:
> > >
> > > The above complains about sector 3907028974 which is exactly
> > > 19566 sectors greater than the size of the raid array according
> > > to parted. In other words it appears to be an access to the last
> > > sector of the array.
> >
> > If it's a read beyond the end of a partition, then it's possible it got
> > rejected in the partition checking logic before ever reaching the I/O
> > controller (which would explain why no messages from the fusion in the
> > log).
> >
> > However, I don't think the analysis is correct. Parted says
> >
> >
> > > Number Start End Size File system Name Flags
> > > 1 34s 19565s 19532s bios_grub
> > > 2 19566s 3907029134s 3907009569s ext3 raid
> >
> > So the absolute sector number 3907028974 is within partition 2.
>
> I was actually trying to make a different point. Namely that it
> was curious that the error message complains about sector
> 3907028974 which is exactly the size of the array + 19566, or in
> other words 1 sector past the end of array:
That's where md stores its superblock ... but it's still within the
partition.
James
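The superblock location can be reproduced from the partition size. The sketch below assumes the version 0.90 layout as I recall it from the kernel's md headers: the superblock sits in the last 64 KiB-aligned 64 KiB block of the member device. The constant and function names here are illustrative, so treat the formula as an assumption to be checked against the md source.

```python
SECTOR_BYTES = 512
MD_RESERVED_SECTORS = 64 * 1024 // SECTOR_BYTES  # 128 sectors = 64 KiB


def sb_offset_v090(member_sectors: int) -> int:
    """Assumed 0.90 superblock offset, in sectors from the member's start."""
    aligned = member_sectors & ~(MD_RESERVED_SECTORS - 1)  # round down to 64 KiB
    return aligned - MD_RESERVED_SECTORS


PART_START = 19566          # start of /dev/sdb2 (from parted)
PART_SECTORS = 3907009569   # size of /dev/sdb2 (from parted)

offset = sb_offset_v090(PART_SECTORS)
print("superblock offset within sdb2:", offset)
print("absolute sector on sdb:", PART_START + offset)
```

With the parted numbers this gives offset 3907009408 inside sdb2 and absolute sector 3907028974, matching both the mdadm array size (1953504704 KiB = 3907009408 sectors) and the sector in the end_request message, as well as the "super_written gets error=-5" line that follows it.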
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-08-03 16:15 ` James Bottomley
@ 2009-08-26 21:17 ` Allan Wind
2009-09-11 6:20 ` Jeppe Oland
0 siblings, 1 reply; 11+ messages in thread
From: Allan Wind @ 2009-08-26 21:17 UTC (permalink / raw)
To: linux-ide, linux-scsi
I have not seen any problems since upgrading to 2.6.30.4.
/Allan
--
Allan Wind
Life Integrity, LLC
<http://lifeintegrity.com>
* Re: LSISAS1068E + WDC WD2002FYPS: I/O error & Sense Key
2009-08-26 21:17 ` Allan Wind
@ 2009-09-11 6:20 ` Jeppe Oland
0 siblings, 0 replies; 11+ messages in thread
From: Jeppe Oland @ 2009-09-11 6:20 UTC (permalink / raw)
To: linux-ide
Allan Wind <allan_wind <at> lifeintegrity.com> writes:
> I have not seen any problems since upgrading to 2.6.30.4.
I just upgraded to Debian Testing with 2.6.30-1-amd64, and now I am seeing
the same problem. I didn't have any problems with Debian 5.0 stable.
[751979.174584] sd 6:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[751979.174590] Descriptor sense data with sense descriptors (in hex):
[751979.174592] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[751979.174597] 00 4f 00 c2 40 50
[751979.174600] sd 6:0:0:0: [sda] Add. Sense: ATA pass through information available
[753779.392241] sd 6:0:0:0: [sda] Sense Key : Recovered Error [current] [descriptor]
[753779.392257] Descriptor sense data with sense descriptors (in hex):
[753779.392260] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 00
[753779.392265] 00 4f 00 c2 40 50
[753779.392268] sd 6:0:0:0: [sda] Add. Sense: ATA pass through information available
[754954.862297] REISERFS error (device dm-9): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [42641 42810 0x0 SD]
[754954.862326] REISERFS error (device dm-9): vs-13070 reiserfs_read_locked_inode: i/o failure occurred trying to find stat data of [42641 42819 0x0 SD]
and lots more like that.
SMART reports no issues, and I have not noticed any data corruption (yet).
Controller is a Dell SAS RAID controller (SCSI storage controller: LSI Logic /
Symbios Logic SAS1068E PCI-Express Fusion-MPT SAS (rev 08)) with a single drive
attached to it.
Regards,
-Jeppe