public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Wakko Warner <wakko@animx.eu.org>
To: Mikulas Patocka <mikulas@artax.karlin.mff.cuni.cz>
Cc: linux-kernel@vger.kernel.org
Subject: Re: RAID-5 design bug (or misfeature)
Date: Sun, 29 May 2005 19:01:38 -0400	[thread overview]
Message-ID: <20050529230137.GA18854@animx.eu.org> (raw)
In-Reply-To: <Pine.LNX.4.58.0505300043540.5305@artax.karlin.mff.cuni.cz>

Mikulas Patocka wrote:
> RAID-5 has rather serious design bug --- when two disks become temporarily
> inaccessible (as it happened to me because of high temperature in server
> room), linux writes information about these errors to the remaining disks
> and when failed disks are on line again, RAID-5 won't ever be accessible.

I ran into this myself, however, I had 10 disks (5 per channel) and one
chennel went down.  Ok, my array was dead at that point and I had to reboot. 
What luck, the arry wasn't usable anymore.  My /usr was on that array, but
my / was not.  I did not want to go through the initrd/initramfs thing at
the time to setup my / with raid5, plus the fact you truely cannot boot from
it (thus partitioning and setting aside a slice wasn't viable to me)

> RAID-HOWTO lists some actions that can be done in this case, but none of
> them can be done if root filesystem is on RAID --- the machine just won't
> boot.

I had to reconstruct the array by hand with mdadm.  evms wouldn't touch it. 
Fortunately, I had a copy of each disk's information and the raid5's
information in files so it was quite easy to rebuild.  I did have backups
but that wasn't really what I wanted to do.  (It did take over 2 hours
before I could return to normal.  evms can't handle a raid5 that was in
reconstruction.  I think newer versions have this fixed.)

> I think Linux should stop accessing all disks in RAID-5 array if two disks
> fail and not write "this array is dead" in superblocks on remaining disks,
> efficiently destroying the whole array.

That'd be nice =)

-- 
 Lab tests show that use of micro$oft causes cancer in lab animals

  reply	other threads:[~2005-05-29 23:05 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-05-29 22:53 RAID-5 design bug (or misfeature) Mikulas Patocka
2005-05-29 23:01 ` Wakko Warner [this message]
2005-05-29 23:58 ` Bernd Eckenfels
2005-05-30  2:47   ` Mikulas Patocka
2005-05-30  3:00     ` Bernd Eckenfels
2005-05-30 11:55     ` Alan Cox
2005-05-30 13:23       ` Stephen Frost
2005-05-30 16:09       ` Mikulas Patocka
2005-05-31  8:05         ` Helge Hafting
2005-05-31 21:39         ` Pavel Machek
2005-06-01  1:43           ` Mikulas Patocka
2005-06-01 18:18       ` Bill Davidsen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20050529230137.GA18854@animx.eu.org \
    --to=wakko@animx.eu.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mikulas@artax.karlin.mff.cuni.cz \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox