From: Joe Landman <joe.landman@gmail.com>
To: David Brown <david.brown@hesbynett.no>
Cc: Bernd Schubert <bernd.schubert@fastmail.fm>,
Roy Sigurd Karlsbakk <roy@karlsbakk.net>,
Linux Raid <linux-raid@vger.kernel.org>
Subject: Re: Checksumming RAID?
Date: Tue, 27 Nov 2012 08:54:20 -0500 [thread overview]
Message-ID: <50B4C60C.6010409@gmail.com> (raw)
In-Reply-To: <50B4A215.4000203@hesbynett.no>
On 11/27/2012 06:20 AM, David Brown wrote:
> On 27/11/2012 11:17, Bernd Schubert wrote:
[...]
>> I will sent patches to better handle parity mismatches during the next
>> weeks (for performance reasons only for background checks).
>>
>> Cheers,
>> Bernd
Give me a heads up when they are ready, and I can get some testing in
for you.
>>
>>
>
> I can certainly sympathise with you, but I am not sure that data
> checksumming would help here. If your hardware raid sends out nonsense,
Well, unfortunately, Bernd (and DDN et al) are right, it is helpful. It
has to be engineered correctly to be of use. T10-DIF and PI are efforts
in this direction. This can be implemented in software.
> then it is going to be very difficult to get anything trustworthy. The
> obvious answer here is to throw out the broken hardware raid and use a
> system that works - but it is equally obvious that that is easier said
... which is not so obvious when you start dealing with hundreds to TB
and PB of data, and you have hardware which works perfectly most of the
time (apart from a random cosmic ray, power fluctuation/surge, ...)
> than done! But I would find it hard to believe that this is a common
> issue with hardware raid systems - it goes against the whole point of
> data storage.
http://storagemojo.com/2007/09/19/cerns-data-corruption-research/
(and no, I don't work for Robin, he largely ignores us :( )
> There is always a chance of undetected read errors - the question is if
> the chances of such read errors, and the consequences of them, justify
> the costs of extra checking. And if they /do/ justify extra checking,
I guess the real question is, how valuable is your data ... if you took
the trouble to store it, I am guessing that you'd like to know a) its
stored correctly, b) it is retrievable, and c) what you retrieve is
correct.
Hardware (and software) RAID help with b, and sometimes a. C is what
T10-DIF/PI and related are trying to solve.
> are data checksums the right way? I agree with Neil's post that
> end-to-end checksums (such as CRCs in a gzip file, or GPG integrity
> checks) are the best check when they are possible, but they are not
> always possible because they are not transparent.
I personally would like to push the checking more into the file system
layers than the disk block layers. Though I expect strong resistance
to that as well. File systems assume perfectly operating underlying
storage blocks in most cases, and breaking that model (or breaking it
any more than we are doing now) would be troubling to many.
Adding in crc verify on read (and crc generation and storage on write)
shouldn't be too painful at the block layer. Much of the infrastructure
is in place. I've been thinking of building a pluggable connection into
MD, so we could experiment with different mechanisms (w/o rebuilding the
kernel or MD each time). Though this is (unfortunately) pretty far down
our list of priorities at the moment.
>
> mvh.,
>
> David
>
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-raid" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2012-11-27 13:54 UTC|newest]
Thread overview: 32+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-11-26 13:27 Checksumming RAID? Roy Sigurd Karlsbakk
2012-11-27 9:45 ` David Brown
2012-11-27 10:17 ` Bernd Schubert
2012-11-27 11:20 ` David Brown
2012-11-27 11:39 ` Roy Sigurd Karlsbakk
2012-11-27 12:37 ` David Brown
2012-11-27 13:09 ` Roy Sigurd Karlsbakk
2012-11-27 13:20 ` David Brown
2012-11-27 13:56 ` Roy Sigurd Karlsbakk
2012-11-27 14:34 ` David Brown
2012-11-27 20:49 ` Stan Hoeppner
2012-11-28 10:58 ` Roy Sigurd Karlsbakk
2012-11-27 12:31 ` Bernd Schubert
2012-11-27 13:05 ` David Brown
2012-11-27 18:53 ` Chris Murphy
2012-11-27 19:27 ` Roy Sigurd Karlsbakk
2012-11-27 19:50 ` Chris Murphy
2012-11-28 10:56 ` Roy Sigurd Karlsbakk
2012-11-28 10:59 ` Roy Sigurd Karlsbakk
2012-11-28 13:25 ` Drew
2012-11-28 17:51 ` Roy Sigurd Karlsbakk
2012-11-28 19:16 ` Chris Murphy
2012-11-28 19:08 ` Chris Murphy
2012-11-28 19:18 ` Roy Sigurd Karlsbakk
2012-11-28 20:02 ` Chris Murphy
2012-11-27 13:54 ` Joe Landman [this message]
2012-11-27 18:48 ` Chris Murphy
2012-11-27 19:36 ` Chris Murphy
2012-12-03 12:24 ` Pasi Kärkkäinen
2012-12-03 14:09 ` Checksumming RAID? / SCSI SAS T10 PI and DIF/DIX / T13 SATA EPP Pasi Kärkkäinen
2012-12-05 19:05 ` Martin K. Petersen
2012-12-06 11:10 ` John Robinson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50B4C60C.6010409@gmail.com \
--to=joe.landman@gmail.com \
--cc=bernd.schubert@fastmail.fm \
--cc=david.brown@hesbynett.no \
--cc=linux-raid@vger.kernel.org \
--cc=roy@karlsbakk.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).