linux-raid.vger.kernel.org archive mirror
* unreadable drives can be synchronized?
@ 2007-05-16 15:50 Colin McCabe
From: Colin McCabe @ 2007-05-16 15:50 UTC (permalink / raw)
  To: linux-raid

Hi all,

I am running software RAID on Linux 2.6.21.

While experimenting with adding and removing devices from the RAID array, I
noticed something very troubling. I have a bad drive (let's call it drive B)
which gets random read errors. I also have a good drive, call it drive A.

B can be synchronized from A. But if I then remove A from the RAID array, A
cannot be re-added, because resyncing A would require reading from the bad
drive, B, and B cannot be read.

Basically, B appears to be "write-only"; it will never return an error on a
write, but just try to read from it, and you will be sorry.

Writing is fine:
[root@cmccabe-devel root]# dd if=/dev/zero of=/dev/sdb bs=524288
dd: writing `/dev/sdb': No space left on device
114464+0 records in
114463+0 records out

Reading is not:
[root@cmccabe-devel root]# dd if=/dev/sdb of=/dev/null bs=524288
ata1.00: exception Emask 0x0 SAct 0x3 SErr 0x0 action 0x2 frozen
ata1.00: cmd 60/00:00:00:b0:01/01:00:00:00:00/40 tag 0 cdb 0x0 data 131072 in
[ ... copious errors ... ]
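
The read test above can also be run block by block to see which offsets fail,
rather than stopping at the first error. This is only a minimal sketch in
Python; the function name find_unreadable_blocks and the 512 KiB default block
size (matching the dd runs above) are illustrative assumptions, not part of
any md or libata tooling.

```python
import os


def find_unreadable_blocks(path, block_size=512 * 1024):
    """Read `path` block by block and return the offsets that failed to read."""
    bad_offsets = []
    fd = os.open(path, os.O_RDONLY)
    try:
        # Seeking to the end gives the size for regular files and,
        # on Linux, for block devices as well.
        size = os.lseek(fd, 0, os.SEEK_END)
        offset = 0
        while offset < size:
            try:
                os.pread(fd, min(block_size, size - offset), offset)
            except OSError:
                # Record the failing offset and keep scanning.
                bad_offsets.append(offset)
            offset += block_size
    finally:
        os.close(fd)
    return bad_offsets
```

Note that reads may be served from the page cache if the data was touched
recently, so on a real device the scan is most meaningful on cold data.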

I have disabled write caching using hdparm -W0.
Both drives are: Fujitsu MHV2060BH, 60 GB, Serial ATA
The SATA controller is: ICH6

My problem is that even though B gets into the synchronized state, it is no
good at all. This is potentially misleading, and if someone removes A after
synchronizing B, the system will probably crash, since there will be no good
drives left.

I wonder if anyone else is interested in a "paranoid recovery" mode where the
md layer tests the data that has been written. Even if this doubles the
recovery time, I think that it would be desirable for many applications.

Colin
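
The "paranoid recovery" idea above amounts to write, flush, read back,
compare. The sketch below illustrates that in userspace Python against an
ordinary file or device path. It is not how md performs recovery, and the
name write_and_verify is hypothetical. It also assumes the read-back actually
reaches the disk; on a real block device the page cache would have to be
bypassed (for example with O_DIRECT) for the check to mean anything.

```python
import os


def write_and_verify(path, blocks):
    """Write each block to `path`, then re-read and compare.

    Returns the offsets whose read-back failed or did not match.
    """
    failed_offsets = []
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)
    try:
        written = []
        offset = 0
        for block in blocks:
            # Treat a short write as an error rather than retrying.
            if os.pwrite(fd, block, offset) != len(block):
                raise OSError("short write at offset %d" % offset)
            written.append((offset, block))
            offset += len(block)

        # Ask the kernel to push the data to the device before checking it.
        os.fsync(fd)

        for offset, block in written:
            try:
                if os.pread(fd, len(block), offset) != block:
                    failed_offsets.append(offset)
            except OSError:
                # A read error counts as a verification failure.
                failed_offsets.append(offset)
    finally:
        os.close(fd)
    return failed_offsets
```

An empty result means every block read back identically. The sketch keeps all
blocks in memory for simplicity, which a real recovery path would not do.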

* Re: unreadable drives can be synchronized?
@ 2007-05-18 14:47 Andrew Burgess
From: Andrew Burgess @ 2007-05-18 14:47 UTC (permalink / raw)
  To: colin.p.mccabe, linux-raid

>Basically, B appears to be "write-only"; it will never return an error on a
>write, but just try to read from it, and you will be sorry.

It would be interesting to see what SMART says about drive B, especially
the short and long self tests.
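
For reference, the SMART self-tests are usually driven with smartctl from
smartmontools. The following is a rough Python sketch that only builds the
command lines; the helper names are made up for illustration, and /dev/sdb is
assumed from the dd output earlier in the thread. A self-test runs in the
background on the drive, so the log is only meaningful once the test has
finished.

```python
import subprocess


def smart_selftest_commands(device, test="short"):
    """Return smartctl argument lists for a self-test and its results."""
    if test not in ("short", "long"):
        raise ValueError("test must be 'short' or 'long'")
    return [
        ["smartctl", "-t", test, device],        # start the self-test
        ["smartctl", "-l", "selftest", device],  # print the self-test log
        ["smartctl", "-H", device],              # overall health assessment
    ]


def run_commands(commands):
    """Run each command in turn; needs root and smartmontools installed."""
    for argv in commands:
        subprocess.run(argv, check=False)
```

Calling run_commands(smart_selftest_commands("/dev/sdb", "long")) would start
the long test; the log command would then need to be rerun after the drive
reports completion.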


