* Faulty seagate drives, are going to be blacklisted?
@ 2009-01-19 23:29 Diego Calleja
2009-01-20 0:22 ` David Rees
` (3 more replies)
0 siblings, 4 replies; 16+ messages in thread
From: Diego Calleja @ 2009-01-19 23:29 UTC (permalink / raw)
To: linux-kernel, linux-ide
Tech sites are reporting everywhere a massive flaw in seagate drives that
can lock up the drive and make it unusable (the bios doesn't detect it, you
can't read the data). Haven't read anything about it here on the lists.
Seagate has ack'ed the problem:
http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207931
So, apparently there're a lot of drives on the market (including mine)
that can die any day. Are those drives going to be blacklisted? It's
still not clear if the firmware update is safe (some affected but
working drives are dying after the firmware update), so some people
like me is still waiting (and hoping that the drive doesn't die) for
more stable firmware updates...
Here is the list of drives+firmware affected, according to the support site
as of now. Some models are still being diagnosed.
Seagate Barracuda 7200.11 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207951)
Models Affected:
ST3500320AS
ST3640330AS
ST3750330AS
ST31000340AS
Firmware Affected
SD15, SD16, SD17, SD18, SD19, AD14
Recommended Firmware Update
SD1A
Seagate Barracuda 7200.11, page 2 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207957)
Models Affected:
ST31500341AS
ST31000333AS
ST3640323AS
ST3640623AS
ST3320613AS
ST3320813AS
ST3160813AS
Firmware Affected
Still Unknow
Recommended Firmware Update
Still Unknow
Seagate Barracuda ES.2 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207963)
Models Affected:
ST3250310NS
ST3500320NS
ST3750330NS
ST31000340NS
Firmware Affected
Still Unknow
Recommended Firmware Update
Still Unknow
DiamondMax 22 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207969)
Models Affected:
STM3500320AS
STM3750330AS
STM31000340AS
Firmware Affected
MX15 (or higher)
Recommended Firmware Update
MX1A
DiamondMax 22 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207975)
Models Affected:
STM31000334AS
STM3320614AS
STM3160813AS
Firmware Affected
Still Unknow
Recommended Firmware Update
Still Unknow
^ permalink raw reply [flat|nested] 16+ messages in thread* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-19 23:29 Faulty seagate drives, are going to be blacklisted? Diego Calleja @ 2009-01-20 0:22 ` David Rees 2009-01-20 2:55 ` Robert Hancock ` (2 subsequent siblings) 3 siblings, 0 replies; 16+ messages in thread From: David Rees @ 2009-01-20 0:22 UTC (permalink / raw) To: Diego Calleja; +Cc: linux-kernel, linux-ide On Mon, Jan 19, 2009 at 3:29 PM, Diego Calleja <diegocg@gmail.com> wrote: > So, apparently there're a lot of drives on the market (including mine) > that can die any day. Are those drives going to be blacklisted? It's > still not clear if the firmware update is safe (some affected but > working drives are dying after the firmware update), so some people > like me is still waiting (and hoping that the drive doesn't die) for > more stable firmware updates... What would blacklisting these buggy drives achieve? There isn't anything that can be done except warn the user that they have known buggy firmware and let them know they should contact the vendor for a firmware update. But until that bug hits, it doesn't seem to otherwise affect the performance or functionality of the drives. -Dave ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-19 23:29 Faulty seagate drives, are going to be blacklisted? Diego Calleja 2009-01-20 0:22 ` David Rees @ 2009-01-20 2:55 ` Robert Hancock 2009-01-20 15:26 ` Diego Calleja 2009-01-26 19:04 ` Felix Miata 2009-01-20 3:32 ` Valdis.Kletnieks 2009-01-21 10:27 ` Patrick Horn 3 siblings, 2 replies; 16+ messages in thread From: Robert Hancock @ 2009-01-20 2:55 UTC (permalink / raw) To: Diego Calleja; +Cc: linux-kernel, linux-ide Diego Calleja wrote: > Tech sites are reporting everywhere a massive flaw in seagate drives that > can lock up the drive and make it unusable (the bios doesn't detect it, you > can't read the data). Haven't read anything about it here on the lists. > Seagate has ack'ed the problem: > http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207931 > > So, apparently there're a lot of drives on the market (including mine) > that can die any day. Are those drives going to be blacklisted? It's > still not clear if the firmware update is safe (some affected but > working drives are dying after the firmware update), so some people > like me is still waiting (and hoping that the drive doesn't die) for > more stable firmware updates... > > Here is the list of drives+firmware affected, according to the support site > as of now. Some models are still being diagnosed. There are a few drives which are currently marked to disable NCQ and warn the user that the firmware that should be upgraded: ST31500341AS ST31000333AS ST3640623AS ST3640323AS ST3320813AS ST3320613AS all for firmware versions SD15 through SD19. > > > Seagate Barracuda 7200.11 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207951) > > Models Affected: > ST3500320AS > ST3640330AS > ST3750330AS > ST31000340AS > Firmware Affected > SD15, SD16, SD17, SD18, SD19, AD14 > Recommended Firmware Update > SD1A > > Seagate Barracuda 7200.11, page 2 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207957) > Models Affected: > ST31500341AS > ST31000333AS > ST3640323AS > ST3640623AS > ST3320613AS > ST3320813AS > ST3160813AS > Firmware Affected > Still Unknow > Recommended Firmware Update > Still Unknow > > > Seagate Barracuda ES.2 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207963) > Models Affected: > ST3250310NS > ST3500320NS > ST3750330NS > ST31000340NS > Firmware Affected > Still Unknow > Recommended Firmware Update > Still Unknow > > DiamondMax 22 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207969) > Models Affected: > STM3500320AS > STM3750330AS > STM31000340AS > Firmware Affected > MX15 (or higher) > Recommended Firmware Update > MX1A > > DiamondMax 22 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207975) > Models Affected: > STM31000334AS > STM3320614AS > STM3160813AS > Firmware Affected > Still Unknow > Recommended Firmware Update > Still Unknow > -- > To unsubscribe from this list: send the line "unsubscribe linux-ide" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-20 2:55 ` Robert Hancock @ 2009-01-20 15:26 ` Diego Calleja 2009-01-21 0:30 ` Robert Hancock 2009-01-26 19:04 ` Felix Miata 1 sibling, 1 reply; 16+ messages in thread From: Diego Calleja @ 2009-01-20 15:26 UTC (permalink / raw) To: Robert Hancock; +Cc: linux-kernel, linux-ide El Mon, 19 Jan 2009 20:55:05 -0600, Robert Hancock <hancockr@shaw.ca> escribió: > There are a few drives which are currently marked to disable NCQ and > warn the user that the firmware that should be upgraded: > > ST31500341AS > ST31000333AS > ST3640623AS > ST3640323AS > ST3320813AS > ST3320613AS > > all for firmware versions SD15 through SD19. Yes, I saw them, but apparently the NCQ bug is unrelated to this one. ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-20 15:26 ` Diego Calleja @ 2009-01-21 0:30 ` Robert Hancock 0 siblings, 0 replies; 16+ messages in thread From: Robert Hancock @ 2009-01-21 0:30 UTC (permalink / raw) To: Diego Calleja; +Cc: linux-kernel, linux-ide Diego Calleja wrote: > El Mon, 19 Jan 2009 20:55:05 -0600, Robert Hancock <hancockr@shaw.ca> escribió: > >> There are a few drives which are currently marked to disable NCQ and >> warn the user that the firmware that should be upgraded: >> >> ST31500341AS >> ST31000333AS >> ST3640623AS >> ST3640323AS >> ST3320813AS >> ST3320613AS >> >> all for firmware versions SD15 through SD19. > > > Yes, I saw them, but apparently the NCQ bug is unrelated to this one. I suspect it might be related, given that the firmware versions seem to partially overlap. With this issue though, there isn't anything the kernel can do about the problem, so blacklisting doesn't seem to really make much sense. ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-20 2:55 ` Robert Hancock 2009-01-20 15:26 ` Diego Calleja @ 2009-01-26 19:04 ` Felix Miata 2009-01-26 19:54 ` Mark Lord 1 sibling, 1 reply; 16+ messages in thread From: Felix Miata @ 2009-01-26 19:04 UTC (permalink / raw) To: linux-ide On 2009/01/19 20:55 (GMT-0600) Robert Hancock composed: > There are a few drives which are currently marked to disable NCQ and > warn the user that the firmware that should be upgraded: > ST31500341AS > ST31000333AS > ST3640623AS > ST3640323AS > ST3320813AS > ST3320613AS > all for firmware versions SD15 through SD19. I just got off the phone with Seagate tech because I could not find out whether the SD22 firmware in my ST3320613AS was an affected version. He said that it was and recommended I upgrade. The "Drive Detect software" that http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207957 directs to use to discover the firmware revision requires windoz. Anyone know if it works under Wine? Instead I used a bootable Seatools CD that came in a retail Seagate HD package. -- "Train a child in the way he should go, and when he is old he will not turn from it." Proverbs 22:6 NIV Team OS/2 ** Reg. Linux User #211409 Felix Miata *** http://fm.no-ip.com/ ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-26 19:04 ` Felix Miata @ 2009-01-26 19:54 ` Mark Lord 2009-01-26 20:57 ` Gene Heskett 0 siblings, 1 reply; 16+ messages in thread From: Mark Lord @ 2009-01-26 19:54 UTC (permalink / raw) To: Felix Miata; +Cc: linux-ide Felix Miata wrote: .. > The "Drive Detect software" that > http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207957 > directs to use to discover the firmware revision requires windoz. Anyone know > if it works under Wine? Instead I used a bootable Seatools CD that came in a > retail Seagate HD package. .. Under Linux, use this: hdparm -I /dev/sd? | head -8 ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-26 19:54 ` Mark Lord @ 2009-01-26 20:57 ` Gene Heskett 2009-01-26 21:34 ` Felix Miata 2009-01-26 23:56 ` Mark Lord 0 siblings, 2 replies; 16+ messages in thread From: Gene Heskett @ 2009-01-26 20:57 UTC (permalink / raw) To: Mark Lord; +Cc: Felix Miata, linux-ide On Monday 26 January 2009, Mark Lord wrote: >Felix Miata wrote: >.. > >> The "Drive Detect software" that >> http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207957 >> directs to use to discover the firmware revision requires windoz. Anyone >> know if it works under Wine? Instead I used a bootable Seatools CD that >> came in a retail Seagate HD package. > >.. > >Under Linux, use this: hdparm -I /dev/sd? | head -8 > Should I be worried about this one? Seagate 500GB sata /dev/sdb: ATA device, with non-removable media Model Number: ST3500320AS Serial Number: 9QM5BB7Y Firmware Revision: SD15 Transport: Serial Thank you -- Cheers, Gene "There are four boxes to be used in defense of liberty: soap, ballot, jury, and ammo. Please use in that order." -Ed Howdershelt (Author) QOTD: "I used to be an idealist, but I got mugged by reality." ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-26 20:57 ` Gene Heskett @ 2009-01-26 21:34 ` Felix Miata 2009-01-26 23:56 ` Mark Lord 1 sibling, 0 replies; 16+ messages in thread From: Felix Miata @ 2009-01-26 21:34 UTC (permalink / raw) To: linux-ide On 2009/01/26 15:57 (GMT-0500) Gene Heskett composed: > On Monday 26 January 2009, Mark Lord wrote: >>Under Linux, use this: hdparm -I /dev/sd? | head -8 Thanks much! :-) > Should I be worried about this one? Seagate 500GB sata > /dev/sdb: > ATA device, with non-removable media > Model Number: ST3500320AS > Serial Number: 9QM5BB7Y > Firmware Revision: SD15 > Transport: Serial Based upon http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207951 I would contact Seagate support ASAP. -- "Train a child in the way he should go, and when he is old he will not turn from it." Proverbs 22:6 NIV Team OS/2 ** Reg. Linux User #211409 Felix Miata *** http://fm.no-ip.com/ ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-26 20:57 ` Gene Heskett 2009-01-26 21:34 ` Felix Miata @ 2009-01-26 23:56 ` Mark Lord 1 sibling, 0 replies; 16+ messages in thread From: Mark Lord @ 2009-01-26 23:56 UTC (permalink / raw) To: Gene Heskett; +Cc: Felix Miata, linux-ide Gene Heskett wrote: .. > Should I be worried about this one? Seagate 500GB sata > > /dev/sdb: > > ATA device, with non-removable media > Model Number: ST3500320AS > Serial Number: 9QM5BB7Y > Firmware Revision: SD15 > Transport: Serial .. According to Seagate, that particular unit has a 1/320 chance of being bricked at power-on, so.. yeah, I'd worry. But there's a simple bootable .iso image with a firmware-updater on it (along with an exceptionally confusing README, which you should ignore) at these links: http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207951 http://support.seagate.com/firmware/MooseDT-SD1A-2D-8-16-32MB.ISO Cheers ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-19 23:29 Faulty seagate drives, are going to be blacklisted? Diego Calleja 2009-01-20 0:22 ` David Rees 2009-01-20 2:55 ` Robert Hancock @ 2009-01-20 3:32 ` Valdis.Kletnieks 2009-01-20 15:30 ` Diego Calleja 2009-01-21 10:27 ` Patrick Horn 3 siblings, 1 reply; 16+ messages in thread From: Valdis.Kletnieks @ 2009-01-20 3:32 UTC (permalink / raw) To: Diego Calleja; +Cc: linux-kernel, linux-ide [-- Attachment #1: Type: text/plain, Size: 782 bytes --] On Tue, 20 Jan 2009 00:29:23 +0100, Diego Calleja said: > Tech sites are reporting everywhere a massive flaw in seagate drives that > can lock up the drive and make it unusable (the bios doesn't detect it, you > can't read the data). Haven't read anything about it here on the lists. > Seagate has ack'ed the problem: > http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207931 > > So, apparently there're a lot of drives on the market (including mine) > that can die any day. Are those drives going to be blacklisted? The $64 question is, of course: What exactly should the operating system *do* if it detects one of these drives? Prohibit it from bricking later by essentially bricking it *now*? What if the drive already has a lot of production data on it? [-- Attachment #2: Type: application/pgp-signature, Size: 226 bytes --] ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-20 3:32 ` Valdis.Kletnieks @ 2009-01-20 15:30 ` Diego Calleja 2009-01-20 17:24 ` Valdis.Kletnieks 0 siblings, 1 reply; 16+ messages in thread From: Diego Calleja @ 2009-01-20 15:30 UTC (permalink / raw) To: Valdis.Kletnieks; +Cc: linux-kernel, linux-ide El Mon, 19 Jan 2009 22:32:25 -0500, Valdis.Kletnieks@vt.edu escribió: > The $64 question is, of course: What exactly should the operating system > *do* if it detects one of these drives? Prohibit it from bricking later > by essentially bricking it *now*? What if the drive already has a lot of > production data on it? Yeah, that's why I asked. Now that I think about it, it should probably be the HAL people who should add one of those desktop "bubbles" warning the users about the possible failure (they already do it for faulty batteries) ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-20 15:30 ` Diego Calleja @ 2009-01-20 17:24 ` Valdis.Kletnieks 2009-01-20 18:18 ` Diego Calleja 0 siblings, 1 reply; 16+ messages in thread From: Valdis.Kletnieks @ 2009-01-20 17:24 UTC (permalink / raw) To: Diego Calleja; +Cc: linux-kernel, linux-ide [-- Attachment #1: Type: text/plain, Size: 446 bytes --] On Tue, 20 Jan 2009 16:30:28 +0100, Diego Calleja said: > Yeah, that's why I asked. Now that I think about it, it should probably be > the HAL people who should add one of those desktop "bubbles" warning the > users about the possible failure (they already do it for faulty batteries) Probably a better approach, as long as we leave enough info visible in various /sys files for HAL to figure it out - but I'm pretty sure we already do that... [-- Attachment #2: Type: application/pgp-signature, Size: 226 bytes --] ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-20 17:24 ` Valdis.Kletnieks @ 2009-01-20 18:18 ` Diego Calleja 0 siblings, 0 replies; 16+ messages in thread From: Diego Calleja @ 2009-01-20 18:18 UTC (permalink / raw) To: Valdis.Kletnieks; +Cc: linux-kernel, linux-ide El Tue, 20 Jan 2009 12:24:07 -0500, Valdis.Kletnieks@vt.edu escribió: > Probably a better approach, as long as we leave enough info visible in various > /sys files for HAL to figure it out - but I'm pretty sure we already do that... Yeah, it's all there already, and HAL has support for it. It just needs the neccesary .fdi files. ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-19 23:29 Faulty seagate drives, are going to be blacklisted? Diego Calleja ` (2 preceding siblings ...) 2009-01-20 3:32 ` Valdis.Kletnieks @ 2009-01-21 10:27 ` Patrick Horn 2009-01-25 1:12 ` Tejun Heo 3 siblings, 1 reply; 16+ messages in thread From: Patrick Horn @ 2009-01-21 10:27 UTC (permalink / raw) To: Diego Calleja; +Cc: linux-kernel, linux-ide Diego Calleja wrote: > Tech sites are reporting everywhere a massive flaw in seagate drives that > can lock up the drive and make it unusable (the bios doesn't detect it, you > can't read the data). Haven't read anything about it here on the lists. > Seagate has ack'ed the problem: > http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207931 > > So, apparently there're a lot of drives on the market (including mine) > that can die any day. Are those drives going to be blacklisted? It's > still not clear if the firmware update is safe (some affected but > working drives are dying after the firmware update), so some people > like me is still waiting (and hoping that the drive doesn't die) for > more stable firmware updates... > > Here is the list of drives+firmware affected, according to the support site > as of now. Some models are still being diagnosed. > > > Seagate Barracuda 7200.11 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207951) > > Models Affected: > ST3500320AS > ST3640330AS > ST3750330AS > ST31000340AS > Firmware Affected > SD15, SD16, SD17, SD18, SD19, AD14 > Recommended Firmware Update > SD1A > > Seagate Barracuda 7200.11, page 2 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207957) > Models Affected: > ST31500341AS > ST31000333AS > ST3640323AS > ST3640623AS > ST3320613AS > ST3320813AS > ST3160813AS > Firmware Affected > Still Unknow > Recommended Firmware Update > Still Unknow > > > Seagate Barracuda ES.2 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207963) > Models Affected: > ST3250310NS > ST3500320NS > ST3750330NS > ST31000340NS > Firmware Affected > Still Unknow > Recommended Firmware Update > Still Unknow > > DiamondMax 22 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207969) > Models Affected: > STM3500320AS > STM3750330AS > STM31000340AS > Firmware Affected > MX15 (or higher) > Recommended Firmware Update > MX1A > > DiamondMax 22 (http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207975) > Models Affected: > STM31000334AS > STM3320614AS > STM3160813AS > Firmware Affected > Still Unknow > Recommended Firmware Update > Still Unknow > -- > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ Hi, I have another drive which doesn't seem to be on any list, and a google search comes up with very little information about this one. I have two raided SATA 1TB "MAXTOR STM31000333AS" drives, firmware MX15, one of which "failed" last weekend. I have since rebuilt the array and it has had no further problems, but I know it's only a matter of time before it happens again. I checked SMART, and both drives are essentially identical with nothing anywhere near failure. I am on Ubuntu kernel 2.6.28-4-generic #5-Ubuntu but I will be happy to build a kernel if this becomes at all reproducible. At first I thought that this NCQ problem might apply to me, but my drive is (gasp) one letter different from two of those listed (both seagate and maxtor variants): http://seagate.custkb.com/seagate/crm/selfservice/search.jsp?DocId=207931 And MX15 is listed as a faulty firmware for the STM31000340AS/334AS I have been using these drives for just three weeks up to now, before having the one drive fail (and later it gave a bunch of errors at bootup, which was solved when it reset the SATA link). The other drive has luckily not had any issues. Is this error just coincidence, or did Seagate forget to mention my drive? (And what happened to the firmware updates--they seem to be "In Validation") Is seagate the only site with information about this? Any public blacklist of every affected drive? What can I see in dmesg that indicates that NCQ is the cause? Thanks, -Patrick (I'll paste my dmesg as I don't know enough to tell if this is the same issue as the other seagate drives--I trimmed the repetitive parts) [ 7520.699730] ata2.00: exception Emask 0x10 SAct 0x7ff4f SErr 0x400100 action 0x6 frozen [ 7520.699734] ata2.00: irq_stat 0x08000000, interface fatal error [ 7520.699738] ata2: SError: { UnrecovData Handshk } [ 7520.699743] ata2.00: cmd 61/50:00:89:4b:c0/00:00:01:00:00/40 tag 0 ncq 40960 out [ 7520.699745] res 40/00:30:91:60:c0/00:00:01:00:00/40 Emask 0x10 (ATA bus error) [ 7520.699748] ata2.00: status: { DRDY } [ 7520.699752] ata2.00: cmd 61/40:08:b1:4f:c0/00:00:01:00:00/40 tag 1 ncq 32768 out [ 7520.699753] res 40/00:30:91:60:c0/00:00:01:00:00/40 Emask 0x10 (ATA bus error) [ 7520.699756] ata2.00: status: { DRDY } [ 7520.699875] ata2: hard resetting link [ 7521.180020] ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300) [ 7521.250673] ata2.00: configured for UDMA/133 [ 7521.250724] ata2: EH complete [ 7521.250812] sd 1:0:0:0: [sdb] 1953525168 512-byte hardware sectors: (1.00 TB/931 GiB) [ 7521.250832] sd 1:0:0:0: [sdb] Write Protect is off [ 7521.250835] sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00 [ 7521.250865] sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA [ 7521.258968] ata2.00: exception Emask 0x10 SAct 0x7ffff SErr 0x400100 action 0x6 frozen [ 7521.258972] ata2.00: irq_stat 0x08000000, interface fatal error [ 7521.258975] ata2: SError: { UnrecovData Handshk } ... it then goes down to 1.5 Gbps but continues to give errors until it is kicked from the raid array an hour later [10477.764175] ata2.00: status: { DRDY } [10477.764179] ata2: hard resetting link [10478.248019] ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310) [10478.318670] ata2.00: configured for UDMA/33 [10478.318679] end_request: I/O error, dev sdb, sector 989067690 [10478.318685] raid1: Disk failure on sdb3, disabling device. [10478.318686] raid1: Operation continuing on 1 devices. This drive also encountered a similar error on bootup the next day: [ 9.389771] ata2.00: exception Emask 0x10 SAct 0xf SErr 0xc00000 action 0x6 frozen [ 9.389774] ata2.00: irq_stat 0x0c000000, interface fatal error [ 9.389776] ata2: SError: { Handshk LinkSeq } [ 9.389780] ata2.00: cmd 60/02:00:3f:af:4e/00:00:00:00:00/40 tag 0 ncq 1024 in [ 9.389781] res 40/00:10:41:af:4e/00:00:00:00:00/40 Emask 0x10 (ATA bus error) [ 9.389783] ata2.00: status: { DRDY } From lspci -vvv: 0:1f.2 SATA controller: Intel Corporation 82801IR/IO/IH (ICH9R/DO/DH) 6 port SATA AHCI Controller (rev 02) (prog-if 01) Subsystem: ASUSTeK Computer Inc. Device 8277 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin B routed to IRQ 2299 Region 0: I/O ports at 9c00 [size=8] Region 1: I/O ports at 9880 [size=4] Region 2: I/O ports at 9800 [size=8] Region 3: I/O ports at 9480 [size=4] Region 4: I/O ports at 9400 [size=32] Region 5: Memory at f9ffe800 (32-bit, non-prefetchable) [size=2K] Capabilities: [80] Message Signalled Interrupts: Mask- 64bit- Queue=0/4 Enable+ Address: fee0f00c Data: 4181 Capabilities: [70] Power Management version 3 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot+,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [a8] SATA HBA <?> Capabilities: [b0] Vendor Specific Information <?> Kernel driver in use: ahci Kernel modules: ahci ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: Faulty seagate drives, are going to be blacklisted? 2009-01-21 10:27 ` Patrick Horn @ 2009-01-25 1:12 ` Tejun Heo 0 siblings, 0 replies; 16+ messages in thread From: Tejun Heo @ 2009-01-25 1:12 UTC (permalink / raw) To: Patrick Horn; +Cc: Diego Calleja, linux-kernel, linux-ide Hello, Patrick. Patrick Horn wrote: ... > Is this error just coincidence, or did Seagate forget to mention my drive? > (And what happened to the firmware updates--they seem to be "In > Validation") > Is seagate the only site with information about this? Any public > blacklist of every affected drive? What can I see in dmesg that > indicates that NCQ is the cause? I think it's coincidental. AFAIK, there was no report of increased transmission failures. Two known problems with these firmwares are 1. timeout on FLUSH if NCQ is in use on certain drives 2. bricking after power off (so, the failure is almost always during BIOS probing during boot) > (I'll paste my dmesg as I don't know enough to tell if this is the same > issue as the other seagate drives--I trimmed the repetitive parts) > > [ 7520.699730] ata2.00: exception Emask 0x10 SAct 0x7ff4f SErr 0x400100 > action 0x6 frozen > [ 7520.699734] ata2.00: irq_stat 0x08000000, interface fatal error > [ 7520.699738] ata2: SError: { UnrecovData Handshk } This is transmission error. Most common causes are power related or unreliable connection especially if backplanes are involved. Is the problem still reproducible? If so, can you please try to move it to different power connector and SATA port and see what changes? Thanks. -- tejun ^ permalink raw reply [flat|nested] 16+ messages in thread
end of thread, other threads:[~2009-01-26 23:56 UTC | newest] Thread overview: 16+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2009-01-19 23:29 Faulty seagate drives, are going to be blacklisted? Diego Calleja 2009-01-20 0:22 ` David Rees 2009-01-20 2:55 ` Robert Hancock 2009-01-20 15:26 ` Diego Calleja 2009-01-21 0:30 ` Robert Hancock 2009-01-26 19:04 ` Felix Miata 2009-01-26 19:54 ` Mark Lord 2009-01-26 20:57 ` Gene Heskett 2009-01-26 21:34 ` Felix Miata 2009-01-26 23:56 ` Mark Lord 2009-01-20 3:32 ` Valdis.Kletnieks 2009-01-20 15:30 ` Diego Calleja 2009-01-20 17:24 ` Valdis.Kletnieks 2009-01-20 18:18 ` Diego Calleja 2009-01-21 10:27 ` Patrick Horn 2009-01-25 1:12 ` Tejun Heo
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).