From: Bernd Schubert <bernd.schubert@fastmail.fm>
To: David Brown <david.brown@hesbynett.no>
Cc: Roy Sigurd Karlsbakk <roy@karlsbakk.net>,
Linux Raid <linux-raid@vger.kernel.org>
Subject: Re: Checksumming RAID?
Date: Tue, 27 Nov 2012 11:17:51 +0100 [thread overview]
Message-ID: <50B4934F.6060105@fastmail.fm> (raw)
In-Reply-To: <50B48BAA.8060903@hesbynett.no>
On 11/27/2012 10:45 AM, David Brown wrote:
> On 26/11/2012 14:27, Roy Sigurd Karlsbakk wrote:
>> Hi all
>>
>> I see from an article at
>> http://pages.cs.wisc.edu/~bpkroth/cs736/md-checksums/md-checksums-paper.pdf
>>
>> that an implementation has been made to allow for ZFS-like
>> checksumming inside Linux MD. However, this code doesn't seem to
>> exist in any kernel trees. Does anyone know the current status for
>> data checksumming in MD?
>>
>
> See <http://neil.brown.name/blog/20110227114201> for a discussion on
> data checksums.
>
> As far as I have seen on this mailing list, there has been no "official"
> work on checksums as described in that paper. I suspect it's just a
> matter of a student or two doing a project as part of their university
> degree. It's great that people can do that - they are free to take a
> copy of the kernel, and experiment with new ideas. If the ideas are
> good, then it is possible to work it back into the mainline kernel
> development.
>
> However, in this case I think there is not much support for data
> checksumming amongst the "big boys" in this part of the Linux kernel -
> as explained by Neil in his blog post.
>
> My first thought when reading the paper in question is that it doesn't
> really add much that is actually useful. md does not need checksums -
> it already has a more powerful system for error detection and correction
> through the parity blocks. If you want more checksumming than raid5
> gives you, then use raid6.
>
> What might be of interest for confirming the data integrity is so say
> that whenever a block is to be read, the stripe it is in should be
> scrubbed. This would enforce regular scrubbing of data that is
> regularly used, and give the same benefits as the article's data
> checksumming. It would lead to more disk reads when you have small
> reads, but the overhead would be small for larger reads or for RMW
> writes (since the whole stripe, minus the parity, is read in this case).
>
> However, referring to another of Neil's blog posts at
> <http://neil.brown.name/blog/20100211050355>, you have to ask yourself
> how likely is it that data will be read from the drive with an error,
> but without the disk telling you of the error - and what can you
> sensibly do about it? You don't need checksums to tell you that there
> is a problem reading data from the disk - the disk already has very
> comprehensive checking of the data, and if that fails it will report an
> error and the md layer will re-construct the data from the parity and
> the rest of the stripe.
Thats the theory, real live unfortunately teaches a different story. I
just helped to recover as much data as possible from a troublesome
infortrend raid system, which again is part of a software raid. The
stupid hardware raid decided for unknown reasons to return different
data on each read. And this is already the 4th or 5th time that happened
(its a rather big installation and each time another hardware raid
causes the trouble).
And yes, I also aready have seen several hard disks to return wrong
data. That is actually the reason why some hardware raid vendors such as
DDN do parity reads all the time and then correct wrong data or entirely
fail the disks.
I will sent patches to better handle parity mismatches during the next
weeks (for performance reasons only for background checks).
Cheers,
Bernd
next prev parent reply other threads:[~2012-11-27 10:17 UTC|newest]
Thread overview: 32+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-11-26 13:27 Checksumming RAID? Roy Sigurd Karlsbakk
2012-11-27 9:45 ` David Brown
2012-11-27 10:17 ` Bernd Schubert [this message]
2012-11-27 11:20 ` David Brown
2012-11-27 11:39 ` Roy Sigurd Karlsbakk
2012-11-27 12:37 ` David Brown
2012-11-27 13:09 ` Roy Sigurd Karlsbakk
2012-11-27 13:20 ` David Brown
2012-11-27 13:56 ` Roy Sigurd Karlsbakk
2012-11-27 14:34 ` David Brown
2012-11-27 20:49 ` Stan Hoeppner
2012-11-28 10:58 ` Roy Sigurd Karlsbakk
2012-11-27 12:31 ` Bernd Schubert
2012-11-27 13:05 ` David Brown
2012-11-27 18:53 ` Chris Murphy
2012-11-27 19:27 ` Roy Sigurd Karlsbakk
2012-11-27 19:50 ` Chris Murphy
2012-11-28 10:56 ` Roy Sigurd Karlsbakk
2012-11-28 10:59 ` Roy Sigurd Karlsbakk
2012-11-28 13:25 ` Drew
2012-11-28 17:51 ` Roy Sigurd Karlsbakk
2012-11-28 19:16 ` Chris Murphy
2012-11-28 19:08 ` Chris Murphy
2012-11-28 19:18 ` Roy Sigurd Karlsbakk
2012-11-28 20:02 ` Chris Murphy
2012-11-27 13:54 ` Joe Landman
2012-11-27 18:48 ` Chris Murphy
2012-11-27 19:36 ` Chris Murphy
2012-12-03 12:24 ` Pasi Kärkkäinen
2012-12-03 14:09 ` Checksumming RAID? / SCSI SAS T10 PI and DIF/DIX / T13 SATA EPP Pasi Kärkkäinen
2012-12-05 19:05 ` Martin K. Petersen
2012-12-06 11:10 ` John Robinson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50B4934F.6060105@fastmail.fm \
--to=bernd.schubert@fastmail.fm \
--cc=david.brown@hesbynett.no \
--cc=linux-raid@vger.kernel.org \
--cc=roy@karlsbakk.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).