* raid5 post mortem analisys
@ 2007-09-21 16:25 Daniel Santos
2007-09-22 13:52 ` Bill Davidsen
0 siblings, 1 reply; 2+ messages in thread
From: Daniel Santos @ 2007-09-21 16:25 UTC (permalink / raw)
To: linux-raid
Hello,
I had a raid 5 array with 3 drives. (on a USB 2.0 bus :)). After some
time, on drive failed. After some more time another drive failed and the
array stopped running.
I know that the usage pattern from the first failure to the second was
read-only, i.e as a user only reads were performed.
I also know that the cause of the drive's failures was that they just
dissapeared from the USB bus (probably from a bug in the hard drive's
enclosure's USB to IDE bridge)
I trashed the array anyway, but since I am new to linux md devices, I
was wishing that you could help me understand if there was any
possibility of getting it back up assuming that there was no data
corruption.
I run kernel 2.6.17 on a debian system and use mdadm for controlling the
array from user space.
Daniel Santos
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: raid5 post mortem analisys
2007-09-21 16:25 raid5 post mortem analisys Daniel Santos
@ 2007-09-22 13:52 ` Bill Davidsen
0 siblings, 0 replies; 2+ messages in thread
From: Bill Davidsen @ 2007-09-22 13:52 UTC (permalink / raw)
To: Daniel Santos; +Cc: linux-raid
Daniel Santos wrote:
> Hello,
>
> I had a raid 5 array with 3 drives. (on a USB 2.0 bus :)). After some
> time, on drive failed. After some more time another drive failed and
> the array stopped running.
> I know that the usage pattern from the first failure to the second was
> read-only, i.e as a user only reads were performed.
Unless you were mounting the filesystem with the "noatime" parameter,
each read resulted in a write to update the inode, and possibly a
journal file depending on the filesystem type.
> I also know that the cause of the drive's failures was that they just
> dissapeared from the USB bus (probably from a bug in the hard drive's
> enclosure's USB to IDE bridge)
>
> I trashed the array anyway, but since I am new to linux md devices, I
> was wishing that you could help me understand if there was any
> possibility of getting it back up assuming that there was no data
> corruption.
You might have been able to save the data, had you grabbed it quickly,
but without reasonably stable hardware you can't have stable RAID (or
anything else). If you had stopped the array and done a power cycles,
assuming that your analysis of the failure is correct, the third drive
might have come back to life and a resync could have been done. There
might also be some hardware option which would have helped, in the mount
or at kernel boot, although nothing comes to mind.
> I run kernel 2.6.17 on a debian system and use mdadm for controlling
> the array from user space.
There have been some fixes in more recent kernels, but I don't know that
they would help if the hardware just goes walkabout. Someone else may
have thoughts on that, but having drives just go away is hard to recover.
--
bill davidsen <davidsen@tmr.com>
CTO TMR Associates, Inc
Doing interesting things with small computers since 1979
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2007-09-22 13:52 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-09-21 16:25 raid5 post mortem analisys Daniel Santos
2007-09-22 13:52 ` Bill Davidsen
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).