All of lore.kernel.org
 help / color / mirror / Atom feed
From: Brad Campbell <lists2009@fnarfbargle.com>
To: Roman Mamedov <rm@romanrm.net>, David Brown <david.brown@hesbynett.no>
Cc: Bart Kus <me@bartk.us>, linux-raid@vger.kernel.org
Subject: Re: md-raid paranoia mode?
Date: Thu, 12 Jun 2014 16:30:55 +0800	[thread overview]
Message-ID: <5399653F.1040503@fnarfbargle.com> (raw)
In-Reply-To: <20140612140644.352e8c69@natsu>


On 12/06/14 16:06, Roman Mamedov wrote:
> In one case which Brad was describing, it was a hardware design fault 
> in his RAID controller, resulting in it returning bad data only when 
> all ports are utilized at high speeds. If MD had online checksum 
> mismatch detection, it would alert him immediately that something's 
> going wrong, rather than have this bug happily chew through all his 
> data, with "months of read/modify/write cycles combined with corrupt 
> data spread the corruption all over the array".


Yeah, you are right it would have possibly spared some of my data. 
Having said if I'd been paying attention to the mismatch counts at the 
end of my monthly scrubs I'd have noticed it a _lot_ sooner also. I had 
the tools, I was just not using them right. My fault, not md's.

Having said that, if I'd not gone through that I'd probably still not 
have comprehensive and complete backups, and I'd not have 
developed/found tools to allow me to better monitor my systems. So while 
it was a painful experience, it was not catastrophic and (as Calvin's 
dad would say) it built some more character.

I'm a lot older, and hopefully wiser from the experience. I also know my 
time is better spent with monitoring and backups than developing code to 
build that feature into md. While that would paper over one part of the 
storage chain, backups and monitoring covers me end to end.

-- 
Dolphins are so intelligent that within a few weeks they can train 
Americans to stand at the edge of the pool and throw them fish.

  reply	other threads:[~2014-06-12  8:30 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-06-11  6:48 md-raid paranoia mode? Bart Kus
     [not found] ` <CAH3kUhH06kpJNqb-zdcv5nu2e1FeZuotcW0SjBbWDOCcasm9OA@mail.gmail.com>
2014-06-11 10:34   ` Bart Kus
2014-06-12  7:26     ` Mattias Wadenstein
2014-06-11 17:31 ` Piergiorgio Sartor
2014-06-12  2:15 ` Brad Campbell
2014-06-12  6:28   ` Roman Mamedov
2014-06-12  6:45     ` NeilBrown
2014-06-12  7:26     ` David Brown
2014-06-12  8:06       ` Roman Mamedov
2014-06-12  8:30         ` Brad Campbell [this message]
2014-06-12  8:53         ` Roman Mamedov
2014-06-12 11:27         ` David Brown

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5399653F.1040503@fnarfbargle.com \
    --to=lists2009@fnarfbargle.com \
    --cc=david.brown@hesbynett.no \
    --cc=linux-raid@vger.kernel.org \
    --cc=me@bartk.us \
    --cc=rm@romanrm.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.