raid5 post mortem analisys

linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* raid5 post mortem analisys
@ 2007-09-21 16:25 Daniel Santos
  2007-09-22 13:52 ` Bill Davidsen
  0 siblings, 1 reply; 2+ messages in thread
From: Daniel Santos @ 2007-09-21 16:25 UTC (permalink / raw)
  To: linux-raid

Hello,

I had a raid 5 array with 3 drives. (on a USB 2.0 bus :)). After some 
time, on drive failed. After some more time another drive failed and the 
array stopped running.
I know that the usage pattern from the first failure to the second was 
read-only, i.e as a user only reads were performed.
I also know that the cause of the drive's failures was that they just 
dissapeared from the USB bus (probably from a bug in the hard drive's 
enclosure's USB to IDE bridge)

I trashed the array anyway, but since I am new to linux md devices, I 
was wishing that you could help me understand if there was any 
possibility of getting it back up assuming that there was no data 
corruption.
I run kernel 2.6.17 on a debian system and use mdadm for controlling the 
array from user space.

Daniel Santos

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: raid5 post mortem analisys
  2007-09-21 16:25 raid5 post mortem analisys Daniel Santos
@ 2007-09-22 13:52 ` Bill Davidsen
  0 siblings, 0 replies; 2+ messages in thread
From: Bill Davidsen @ 2007-09-22 13:52 UTC (permalink / raw)
  To: Daniel Santos; +Cc: linux-raid

Daniel Santos wrote:
> Hello,
>
> I had a raid 5 array with 3 drives. (on a USB 2.0 bus :)). After some 
> time, on drive failed. After some more time another drive failed and 
> the array stopped running.
> I know that the usage pattern from the first failure to the second was 
> read-only, i.e as a user only reads were performed.

Unless you were mounting the filesystem with the "noatime" parameter, 
each read resulted in a write to update the inode, and possibly a 
journal file depending on the filesystem type.
> I also know that the cause of the drive's failures was that they just 
> dissapeared from the USB bus (probably from a bug in the hard drive's 
> enclosure's USB to IDE bridge)
>
> I trashed the array anyway, but since I am new to linux md devices, I 
> was wishing that you could help me understand if there was any 
> possibility of getting it back up assuming that there was no data 
> corruption.

You might have been able to save the data, had you grabbed it quickly, 
but without reasonably stable hardware you can't have stable RAID (or 
anything else). If you had stopped the array and done a power cycles, 
assuming that your analysis of the failure is correct, the third drive 
might have come back to life and a resync could have been done. There 
might also be some hardware option which would have helped, in the mount 
or at kernel boot, although nothing comes to mind.
> I run kernel 2.6.17 on a debian system and use mdadm for controlling 
> the array from user space.

There have been some fixes in more recent kernels, but I don't know that 
they would help if the hardware just goes walkabout. Someone else may 
have thoughts on that, but having drives just go away is hard to recover.

-- 
bill davidsen <davidsen@tmr.com>
  CTO TMR Associates, Inc
  Doing interesting things with small computers since 1979


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2007-09-22 13:52 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-09-21 16:25 raid5 post mortem analisys Daniel Santos
2007-09-22 13:52 ` Bill Davidsen

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).