From: Bruce Lowekamp <brucelowekamp@gmail.com>
To: Guy Watkins <guy@watkins-home.com>
Cc: Neil Brown <neilb@cse.unsw.edu.au>, linux-raid@vger.kernel.org
Subject: Re: Bad blocks are killing us!
Date: Thu, 18 Nov 2004 11:03:12 -0500 [thread overview]
Message-ID: <e9132f820411180803472424b2@mail.gmail.com> (raw)
In-Reply-To: <200411180147.iAI1l5N02116@www.watkins-home.com>
On Wed, 17 Nov 2004 20:46:59 -0500, Guy Watkins <guy@watkins-home.com> wrote:
> 2 things about your comments:
>
> 1.
> You said:
> "no one should be using md in an RT-critical application"
>
> I am sorry to hear that! What do recommend? Windows 2000 maybe?
Very funny. I would only use hardware raid if I had a safety-related
RT system. (as much as I love md, and I do keep all of my project
data on a very large md raid5.)
>
> 2.
> You said:
> "but the md-level
> approach might be better. But I'm not sure I see the point of
> it---unless you have raid 6 with multiple parity blocks, if a disk
> actually has the wrong information recorded on it I don't think you
> can detect which drive is bad, just that one of them is."
>
> If there is a parity block that does not match the data, true you do not
> know which device has the wrong data. However, if you do not "correct" the
> parity, when a device fails, it will be constructed differently than it was
> before it failed. This will just cause more corrupt data. The parity must
> be made consistent with whatever data is on the data blocks to prevent this
> corrosion of data. With RAID6 it should be possible to determine which
> block is wrong. It would be a pain in the @$$, but I think it would be
> doable. I will explain my theory if someone asks.
The question with a raid5 parity error is, how do you correct it?
You're right that if a disk fails, the data changes, and that is bad.
But, IMHO, I don't want the raid subsystem to guess what the correct
data is if it detects that there is that sort of an error. Flag the
error and take the array offline. That system needs some sort of
diagnosis to determine if data has actually been lost. If it happened
with my /home partition, I would probably verify the data with
backups. If it was a different partition, I might just run fsck on
it. But I think the user needs to be involved if data loss was
detected.
I don't know enough about how the md raid6 implementation works, but a
naive approach of removing each drive and seeing when you find one
that disagrees with the parity of the n-1 other drives seems like it
would work. Don't think I would want to code it. Still, at least
then you can correct the data and notify the user level. But no data
was lost, so continue as normal. Of course, personally, if md told me
a drive had developed an undetected bit error, I would remove the
drive immediately for more diagnostics and let it switch to a spare,
and would probably rather that be the default behavior if there's a
hot spare. But I'm a bit paranoid...
Bruce
--
Bruce Lowekamp (lowekamp@cs.wm.edu)
Computer Science Dept, College of William and Mary
next prev parent reply other threads:[~2004-11-18 16:03 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <200411150522.iAF5MNN18341@www.watkins-home.com>
2004-11-15 22:27 ` Bad blocks are killing us! Neil Brown
2004-11-16 16:28 ` Maurilio Longo
2004-11-16 18:18 ` Guy
2004-11-16 23:04 ` Neil Brown
2004-11-16 23:07 ` Guy
2004-11-17 13:21 ` Badstripe proposal (was Re: Bad blocks are killing us!) David Greaves
2004-11-18 9:59 ` Maurilio Longo
2004-11-18 10:29 ` Robin Bowes
2004-11-19 17:12 ` Jure Pe_ar
2004-11-20 13:15 ` Maurilio Longo
2004-11-21 18:23 ` Jure Pe_ar
2004-11-16 23:29 ` Bad blocks are killing us! dean gaudet
2004-11-17 21:58 ` Bruce Lowekamp
2004-11-18 1:46 ` Guy Watkins
2004-11-18 16:03 ` Bruce Lowekamp [this message]
2004-11-19 18:47 ` Dieter Stueken
2004-11-22 8:22 ` Dieter Stueken
2004-11-22 9:17 ` Guy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=e9132f820411180803472424b2@mail.gmail.com \
--to=brucelowekamp@gmail.com \
--cc=guy@watkins-home.com \
--cc=linux-raid@vger.kernel.org \
--cc=neilb@cse.unsw.edu.au \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).