From: "Jakob Østergaard" <jakob@unthought.net>
To: Chris Andrews <chrisa@inweb.co.uk>
Cc: linux-kernel@vger.kernel.org
Subject: Re: RAID5: mkraid --force /dev/md0 doesn't work properly
Date: Tue, 2 Oct 2001 07:09:13 +0200 [thread overview]
Message-ID: <20011002070913.A5302@unthought.net> (raw)
In-Reply-To: <00c401c14a71$f8c478f0$1e1cd2d5@chris>
In-Reply-To: <00c401c14a71$f8c478f0$1e1cd2d5@chris>; from chrisa@inweb.co.uk on Mon, Oct 01, 2001 at 01:09:50PM +0100
On Mon, Oct 01, 2001 at 01:09:50PM +0100, Chris Andrews wrote:
> Evan Harris (eharris@puremagic.com) said:
>
> > I have a 6 disk RAID5 scsi array that had one disk go offline through a
> > dying power supply, taking the array into degraded mode, and then another
> > went offline a couple of hours later from what I think was a loose cable.
>
> I had much the same happen, except that I lost 6 disks out of 12 (power
> failure to one external rack of two), so I had no chance of starting in
> degraded mode. In this situation, where there are not enough disks for a
> viable raid, what is the recommended solution? In my case, there was nothing
> wrong with the six disks, but their superblock event counters were out of
> step.
The solution is exactly the same as before: re-create the array with N-1
disks, so that parity reconstruction will not begin.
Find the "oldest" disk and mark that one as failed - in case you lose an
entire rack of disks, any one of those should do.
Re-create the RAID, fsck (and don't worry about quite some inconsistency), and
most of your data should be back.
If you screw up (eg. re-order disks), your data will never know what hit them.
>
> Is the best idea to modify /etc/raidtab as discussed, and run mkraid with
> the real force option? What I actually did was to hand-edit the superblocks
> on the disks, and got the array going. That experience would lead me to
> suggest that there's room for some more options to allow the use of disks
> where there's actually nothing wrong, but right now the raid code won't use
> them. I'm thinking of a set of '--ignore' options to raidstart:
> --ignore-eventcounter, --ignore-failedflag, etc, which an admin could use as
> an alternative to trying mkraid.
re-creating the RAID does exactly that: "hand-modifies" the superblocks to
let the array run again.
Your idea is pretty good: if you did not have to re-write the superblocks from
the raidtab, you would not risk screwing up drive-ordering because of
inconsistent raidtabs.
I'd do a patch if I wasn't busy re-constructing/creating/moving/reconfiguring
RAID arrays right now ;)
>
> Right now it seems that software-raid works well, until it doesn't, at which
> point you're stuck - there's very little in the way of tools or overrides to
> sort problems out. Something other than 'try mkraid force as a last resort'
> would be useful.
You're not stuck. You have plenty of options, just as you stated in your post.
With a hardware solution you'd be *stuck* - not as in "there's no pretty tool",
but as in "game over, sucker!" ;)
But I agree with you that the process could be improved, and I really like your
suggestion with --ignore-eventcounter (or --try-recover maybe ?).
>
> (If anyone thinks this is a good idea, yes, I am volunteering to provide
> patches...)
Aha !
I think it's a great idea !
--
................................................................
: jakob@unthought.net : And I see the elder races, :
:.........................: putrid forms of man :
: Jakob Østergaard : See him rise and claim the earth, :
: OZ9ABN : his downfall is at hand. :
:.........................:............{Konkhra}...............:
next prev parent reply other threads:[~2001-10-02 5:09 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2001-10-01 12:09 RAID5: mkraid --force /dev/md0 doesn't work properly Chris Andrews
2001-10-02 5:09 ` Jakob Østergaard [this message]
-- strict thread matches above, loose matches on Subject: below --
2001-10-01 0:29 Evan Harris
2001-10-01 0:41 ` Jakob Østergaard
2001-10-01 0:51 ` Evan Harris
2001-10-01 3:56 ` Jakob Østergaard
2001-10-01 6:25 ` Evan Harris
2001-10-01 7:36 ` Peter Svensson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20011002070913.A5302@unthought.net \
--to=jakob@unthought.net \
--cc=chrisa@inweb.co.uk \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.