From: David Brown <david.brown@hesbynett.no>
To: Roy Sigurd Karlsbakk <roy@karlsbakk.net>
Cc: Linux Raid <linux-raid@vger.kernel.org>,
Bernd Schubert <bernd.schubert@fastmail.fm>
Subject: Re: Checksumming RAID?
Date: Tue, 27 Nov 2012 15:34:07 +0100 [thread overview]
Message-ID: <50B4CF5F.3010309@hesbynett.no> (raw)
In-Reply-To: <25425956.18.1354024591879.JavaMail.root@zimbra>
On 27/11/2012 14:56, Roy Sigurd Karlsbakk wrote:
>>> If this system is running RAID-6, recovery should be possible to
>>> check both parity chunks, right?
>>
>> Yes, of course. (And if anyone ever needs it, it is possible to
>> extend raid6 to 3 parity chunks. I've done the maths, but it is
>> not implemented - there doesn't seem to be a big need for it.) But
>> - again referring back to Neil's blog - if the low-level raid spots
>> a consistency error, it still cannot correct it reliably even with
>> 2 parity chunks, and should pass on a read error to the higher
>> level raid. Using raid6 at the low level would let you do a good
>> consistency check even in the case of a failed drive (or a known
>> read error on a drive) - or two simultaneous undetected read
>> errors. And raid6 on the higher level raid would let you correct
>> such errors, even when there are other errors around. You'd soon
>> reach the point where it is more likely for your disks to
>> spontaneously turn into a bowl of petunias than for read errors to
>> be undetected or unrecoverable.
>
> That would be nice. So what should be done here in the first place,
> is to change the code to allow parity data to be read and calculated
> also on reads?
Well, what should be done /first/ is to hope that some of the more
experienced md raid experts express an opinion on the idea - is it
possible, is it useful, and is it practical to implement?
The main aim would be to add an option to a md arrays that will turn
each read into an implicit scrub or check of the whole stripe, and that
a consistency error there would return a read error to the next layer of
md raid.
I can see plenty of scope for complications here, such as what to do on
normal (detected) read errors, or how to ensure that the upper layer
re-writes the whole stripe and not just part of it (or perhaps partial
re-writes would be enough). I am fully aware that I'm just giving a
rough idea here - it needs a lot more thought before anyone can start
changing code. But if my theory here is correct, and if it is practical
to implement, then it might be a useful tool for big data producers.
mvh.,
David
>
> Vennlige hilsener / Best regards
>
> roy
next prev parent reply other threads:[~2012-11-27 14:34 UTC|newest]
Thread overview: 32+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-11-26 13:27 Checksumming RAID? Roy Sigurd Karlsbakk
2012-11-27 9:45 ` David Brown
2012-11-27 10:17 ` Bernd Schubert
2012-11-27 11:20 ` David Brown
2012-11-27 11:39 ` Roy Sigurd Karlsbakk
2012-11-27 12:37 ` David Brown
2012-11-27 13:09 ` Roy Sigurd Karlsbakk
2012-11-27 13:20 ` David Brown
2012-11-27 13:56 ` Roy Sigurd Karlsbakk
2012-11-27 14:34 ` David Brown [this message]
2012-11-27 20:49 ` Stan Hoeppner
2012-11-28 10:58 ` Roy Sigurd Karlsbakk
2012-11-27 12:31 ` Bernd Schubert
2012-11-27 13:05 ` David Brown
2012-11-27 18:53 ` Chris Murphy
2012-11-27 19:27 ` Roy Sigurd Karlsbakk
2012-11-27 19:50 ` Chris Murphy
2012-11-28 10:56 ` Roy Sigurd Karlsbakk
2012-11-28 10:59 ` Roy Sigurd Karlsbakk
2012-11-28 13:25 ` Drew
2012-11-28 17:51 ` Roy Sigurd Karlsbakk
2012-11-28 19:16 ` Chris Murphy
2012-11-28 19:08 ` Chris Murphy
2012-11-28 19:18 ` Roy Sigurd Karlsbakk
2012-11-28 20:02 ` Chris Murphy
2012-11-27 13:54 ` Joe Landman
2012-11-27 18:48 ` Chris Murphy
2012-11-27 19:36 ` Chris Murphy
2012-12-03 12:24 ` Pasi Kärkkäinen
2012-12-03 14:09 ` Checksumming RAID? / SCSI SAS T10 PI and DIF/DIX / T13 SATA EPP Pasi Kärkkäinen
2012-12-05 19:05 ` Martin K. Petersen
2012-12-06 11:10 ` John Robinson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50B4CF5F.3010309@hesbynett.no \
--to=david.brown@hesbynett.no \
--cc=bernd.schubert@fastmail.fm \
--cc=linux-raid@vger.kernel.org \
--cc=roy@karlsbakk.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.