linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Phil Turmel <philip@turmel.org>
To: Chris Murphy <lists@colorremedies.com>
Cc: "linux-raid@vger.kernel.org List" <linux-raid@vger.kernel.org>
Subject: Re: Questions about bitrot and RAID 5/6
Date: Fri, 24 Jan 2014 13:12:32 -0500	[thread overview]
Message-ID: <52E2AD10.5080208@turmel.org> (raw)
In-Reply-To: <62EB0D79-9A50-4C50-ACBF-1C507D6F449B@colorremedies.com>

On 01/24/2014 12:59 PM, Chris Murphy wrote:
> 
> On Jan 24, 2014, at 10:03 AM, Phil Turmel <philip@turmel.org> wrote:
>>> w many bits of loss occur with one URE?
>> 
>> Complete physical sector.
> 
> 
> A complete physical sector represents 512 bytes / 4096 bits, or in
> the case of AF disks 4096 bytes / 32768 bits, of loss for one URE.
> Correct?
> 
> So a URE is either 4096 bits nonrecoverable, or 32768 bits
> nonrecoverable, for HDDs. Correct?

Yes.  Note that the specification is for an *event*, not for a specific
number of bits lost.  The error rate is not "bits lost per bits read",
it is "bits lost event per bits read".

>>>> Your comments suggest you've completely discounted the fact
>>>> that published URE rates are now close to, or within, drive
>>>> capacities.
>>>> 
>>>> Spend some time with the math and you will be very concerned.
>>> 
>>> Yeah I tried that a year ago and when it came to really super
>>> basic questions, no one was willing to answer them and the thread
>>> died as if we don't actually know what we're talking about. So I
>>> think some rather basic definitions are in order and an agreement
>>> that we don't get to redefine mathematics by saying a max error
>>> rate is a mean.
>>> 
>>> http://www.spinics.net/lists/raid/msg41669.html
>> 
>> I participated in that thread.  Some of your comments there imply
>> that the math is simple.  It's not (unless you are whiz with
>> statistics). Look at the Poisson distribution I referenced and the
>> computation examples I gave.
> 
> At the moment a Poisson distribution is out of scope because my
> questions have nothing to do with how often, when, or how many, such
> URE's will occur. At the moment I only want complete utter clarity on
> what a URE/nonrecoverable error (not even the rate) is in terms of
> quantity. That's my main problem.

Ok, but the earlier arguments in this thread over the relative merits of
raid5 versus raid6 very much depend on the error rate.

>> Note that a statement about the rate of a randomly occurring error
>> is implicitly stating an average.
> 
> Except that it has only one limiter, with the next notch a whole
> order magnitude less error. So I don't see how you get an average
> unless you're willing to just make assumptions about the bottom end.
> It doesn't make sense that a manufacturer would state a maximum error
> rate of X and then target that as an average. The average is
> certainly well below the max.

You are confused.  The specification is a maximum of an average.  An
average that changes with time, and cannot be measured from single events.

Phil

  reply	other threads:[~2014-01-24 18:12 UTC|newest]

Thread overview: 32+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-01-20 20:34 Questions about bitrot and RAID 5/6 Mason Loring Bliss
2014-01-20 21:46 ` NeilBrown
2014-01-20 22:55   ` Peter Grandi
2014-01-21  9:18   ` David Brown
2014-01-21 17:19   ` Mason Loring Bliss
2014-01-22 10:40     ` David Brown
2014-01-23  0:48       ` Chris Murphy
2014-01-23  8:18         ` David Brown
2014-01-23 17:28           ` Chris Murphy
2014-01-23 18:53             ` Phil Turmel
2014-01-23 21:38               ` Chris Murphy
2014-01-24 13:22                 ` Phil Turmel
2014-01-24 16:11                   ` Chris Murphy
2014-01-24 17:03                     ` Phil Turmel
2014-01-24 17:59                       ` Chris Murphy
2014-01-24 18:12                         ` Phil Turmel [this message]
2014-01-24 19:32                           ` Chris Murphy
2014-01-24 19:57                             ` Phil Turmel
2014-01-24 20:54                               ` Chris Murphy
2014-01-25 10:23                                 ` Dag Nygren
2014-01-25 15:48                                 ` Phil Turmel
2014-01-25 17:44                                   ` Stan Hoeppner
2014-01-27  3:34                                     ` Chris Murphy
2014-01-27  7:16                                       ` Mikael Abrahamsson
2014-01-27 18:20                                         ` Chris Murphy
2014-01-30 10:22                                           ` Mikael Abrahamsson
2014-01-30 20:59                                             ` Chris Murphy
2014-01-27  3:20                                   ` Chris Murphy
2014-01-25 17:56                                 ` Wilson Jonathan
2014-01-27  4:07                                   ` Chris Murphy
2014-01-23 22:06               ` David Brown
2014-01-23 22:02             ` David Brown

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=52E2AD10.5080208@turmel.org \
    --to=philip@turmel.org \
    --cc=linux-raid@vger.kernel.org \
    --cc=lists@colorremedies.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).