From: Duncan <1i5t5.duncan@cox.net>
To: linux-btrfs@vger.kernel.org
Subject: Re: How does btrfs handle bad blocks in raid1?
Date: Fri, 10 Jan 2014 17:03:52 +0000 (UTC) [thread overview]
Message-ID: <pan$e7d15$902dd302$8a10224f$9d8c4bb8@cox.net> (raw)
In-Reply-To: DD76DCF3-93B9-4EAF-9732-C703D7C50A74@colorremedies.com
Chris Murphy posted on Thu, 09 Jan 2014 11:52:08 -0700 as excerpted:
> Understood. I'm considering a 2nd drive dying during rebuild (from a 1st
> drive dying) as essentially simultaneous failures. And in the case of
> raid10, the likelihood of a 2nd drive failure being the lonesome drive
> in a mirrored set is statistically very unlikely. The next drive to fail
> is going to be some other drive in the array, which still has a mirror.
While still statistically unlikely, the likelihood of that critical
second device[1] in a mirror-pair on a raid10 dying isn't /as/ unlikely
as you might think -- it's actually more likely than that of any one of
the still mirrored devices failing, for example.
The reason is that as soon as one of the devices in a mirror-pair fails,
the other one is suddenly doing double the work it was previously, and
twice the work any other still-paired devices in the array are doing!
And as any human who has tried to pull an 80-hour-work-week can attest,
double the work is *NOT* simply double the stress!
If both devices in the pair are from the same manufacturing run and were
installed at the same time and run under exactly the same conditions, as
quite likely unless deliberately guarded against, chances are rather
higher than you'd like that by the time one fails, suddenly piling twice
the workload on the OTHER one isn't going to end well, especially under
the increased workload of a recovery after a replacement device has been
added.
That's the well known but all too infrequently considered trap of both
raid5 and 2-way-mirrored raid1, thus the reason many admins are so
reluctant to trust them and prefer N-way-mirroring/parity, with N bumped
upward as necessary to suit the level of device-failure paranoia.
For me, that cost/benefit/paranoia balance tends toward N=3 for
mirroring, N=2 for parity (since parity parallels mirror redundancy
count, not mirror total count). =:^)
---
[1] I'm trying to train myself to use "device" in most cases where I
formerly used "drive", since "device" is generally technically correct
even if it's a logical/virtual device such as an mdraid device or even
simply a partition on a physical device, while "drive" may well be
technically incorrect, since both virtual devices such as mdraid and
partitions, and physical devices such as SSDs, are arguably not "drives"
at all. But it's definitely a process I'm still in the middle of. It's
not a formed habit yet and if I'm not thinking about that when I chose my
term...
--
Duncan - List replies preferred. No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master." Richard Stallman
next prev parent reply other threads:[~2014-01-10 17:04 UTC|newest]
Thread overview: 45+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-01-09 10:26 How does btrfs handle bad blocks in raid1? Clemens Eisserer
2014-01-09 10:42 ` Hugo Mills
2014-01-09 12:41 ` Duncan
2014-01-09 12:52 ` Austin S Hemmelgarn
2014-01-09 15:15 ` Duncan
2014-01-09 16:49 ` George Eleftheriou
2014-01-09 17:09 ` Hugo Mills
2014-01-09 17:34 ` George Eleftheriou
2014-01-09 17:43 ` Hugo Mills
2014-01-09 18:40 ` George Eleftheriou
2014-01-09 17:29 ` Chris Murphy
2014-01-09 18:00 ` George Eleftheriou
2014-01-10 15:27 ` Duncan
2014-01-10 15:46 ` George Mitchell
2014-01-09 17:31 ` Chris Murphy
2014-01-09 18:20 ` Austin S Hemmelgarn
2014-01-09 14:58 ` Chris Mason
2014-01-09 18:08 ` Chris Murphy
2014-01-09 18:22 ` Austin S Hemmelgarn
2014-01-09 18:52 ` Chris Murphy
2014-01-10 17:03 ` Duncan [this message]
2014-01-09 18:40 ` Chris Murphy
2014-01-09 19:13 ` Kyle Gates
2014-01-09 19:31 ` Chris Murphy
2014-01-09 23:24 ` George Mitchell
2014-01-10 0:08 ` Clemens Eisserer
2014-01-10 0:46 ` George Mitchell
[not found] <201401100106.s0A16CNd016476@atl4mhib27.myregisteredsite.com>
2014-01-10 1:31 ` George Mitchell
2014-01-14 19:13 ` Chris Murphy
2014-01-14 19:37 ` Roman Mamedov
2014-01-14 21:05 ` Chris Murphy
2014-01-14 21:19 ` Roman Mamedov
2014-01-14 21:37 ` Chris Murphy
2014-01-14 21:45 ` Chris Murphy
2014-01-14 21:54 ` Roman Mamedov
2014-01-14 20:29 ` George Mitchell
2014-01-14 21:00 ` Roman Mamedov
2014-01-14 21:06 ` Hugo Mills
2014-01-14 21:27 ` Chris Murphy
2014-01-14 21:27 ` George Mitchell
2014-01-14 21:28 ` George Mitchell
2014-01-14 21:14 ` Chris Murphy
2014-01-14 21:48 ` George Mitchell
2014-01-14 21:48 ` George Mitchell
2014-01-14 22:14 ` George Mitchell
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='pan$e7d15$902dd302$8a10224f$9d8c4bb8@cox.net' \
--to=1i5t5.duncan@cox.net \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.