From: Tomasz Pala <gotar@polanet.pl>
To: George Mitchell <george@chinilu.com>
Cc: linux-btrfs@vger.kernel.org
Subject: Re: Unexpected raid1 behaviour
Date: Tue, 19 Dec 2017 21:28:52 +0100 [thread overview]
Message-ID: <20171219202852.GA14726@polanet.pl> (raw)
In-Reply-To: <f6785d3b-89fc-fd3b-38fe-815613b335fb@chinilu.com>
On Tue, Dec 19, 2017 at 10:31:40 -0800, George Mitchell wrote:
> I have significant experience as a user of raid1. I spent years using
> software raid1 and then more years using hardware (3ware) raid1 and now
> around 3 years using btrfs raid1. I have not found btrfs raid1 to be
> less reliable than any of the previous implementations of raid. I have
You are aware that in order to proof something one needs only one
example? Degraded r/o is such, QED.
Doesn't matter how long did you ride on top of any RAID implementation,
unless you got them in action, i.e. had actual drive malfunction. Did you
have broken drive under btrfs raid?
> a failure, you don't just plug things back in and expect it to be fixed
> without seriously investigating what has gone wrong and potential
> unexpected consequences. I have found that even with hardware raid you
> can find ways to screw things up to the point that you lose your data.
Everything could be screwed beyond comprehension, but we're talking
about PRIMARY objectives. In case of RAID1+ it seems to be obvious:
https://en.oxforddictionaries.com/definition/redundancy
- unplugging ANY SINGLE drive MUST NOT render system unusable.
This is really as simple as that.
> I have had situations where I reconnected a drive on hardware raid1 only
> to find that the array would not sync and from there on I ended up
> having to directly attach one of the drives and recover the partition
I had a situation when replugging a drive started a sync of older data
over the newer. So what? This doesn't change a thing - the drive
reappearance or resync is RECOVERY part. RECOVERY scenarios are entirely
different thing than REDUNDANCY itself. RECOVERY phase in some
implementation could be entirely off-line process and it still would be
RAID. Remove REDUNDANCY part and it's not RAID anymore.
If one is naming thing an apple, shouldn't be surprised if others
compare it to apples, not oranges.
> table with test disk in order to regain access to my data. So NO FORM
> of raid is a replacement for backups and NO FORM of raid is a
> replacement for due diligence in recovery from failure mode. Raid gives
And who said it is?
> you a second chance when things go wrong, it does not make failures
> transparent which is seemingly what we sometimes expect from raid. And
Wouldn't want to worry you, but properly managed RAIDs make I/J-of-K
trivial-failures transparent. Just like ECC protects N/M bits transparently.
Investigating the reasons is sysadmin's job, just like other
maintenance, including restoring protection level.
--
Tomasz Pala <gotar@pld-linux.org>
next prev parent reply other threads:[~2017-12-19 20:28 UTC|newest]
Thread overview: 61+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-12-16 19:50 Unexpected raid1 behaviour Dark Penguin
2017-12-17 11:58 ` Duncan
2017-12-17 15:48 ` Peter Grandi
2017-12-17 20:42 ` Chris Murphy
2017-12-18 8:49 ` Anand Jain
2017-12-18 8:49 ` Anand Jain
2017-12-18 10:36 ` Peter Grandi
2017-12-18 12:10 ` Nikolay Borisov
2017-12-18 13:43 ` Anand Jain
2017-12-18 22:28 ` Chris Murphy
2017-12-18 22:29 ` Chris Murphy
2017-12-19 12:30 ` Adam Borowski
2017-12-19 12:54 ` Andrei Borzenkov
2017-12-19 12:59 ` Peter Grandi
2017-12-18 13:06 ` Austin S. Hemmelgarn
2017-12-18 19:43 ` Tomasz Pala
2017-12-18 22:01 ` Peter Grandi
2017-12-19 12:46 ` Austin S. Hemmelgarn
2017-12-19 12:25 ` Austin S. Hemmelgarn
2017-12-19 14:46 ` Tomasz Pala
2017-12-19 16:35 ` Austin S. Hemmelgarn
2017-12-19 17:56 ` Tomasz Pala
2017-12-19 19:47 ` Chris Murphy
2017-12-19 21:17 ` Tomasz Pala
2017-12-20 0:08 ` Chris Murphy
2017-12-23 4:08 ` Tomasz Pala
2017-12-23 5:23 ` Duncan
2017-12-20 16:53 ` Andrei Borzenkov
2017-12-20 16:57 ` Austin S. Hemmelgarn
2017-12-20 20:02 ` Chris Murphy
2017-12-20 20:07 ` Chris Murphy
2017-12-20 20:14 ` Austin S. Hemmelgarn
2017-12-21 1:34 ` Chris Murphy
2017-12-21 11:49 ` Andrei Borzenkov
2017-12-19 20:11 ` Austin S. Hemmelgarn
2017-12-19 21:58 ` Tomasz Pala
2017-12-20 13:10 ` Austin S. Hemmelgarn
2017-12-19 23:53 ` Chris Murphy
2017-12-20 13:12 ` Austin S. Hemmelgarn
2017-12-19 18:31 ` George Mitchell
2017-12-19 20:28 ` Tomasz Pala [this message]
2017-12-19 19:35 ` Chris Murphy
2017-12-19 20:41 ` Tomasz Pala
2017-12-19 20:47 ` Austin S. Hemmelgarn
2017-12-19 22:23 ` Tomasz Pala
2017-12-20 13:33 ` Austin S. Hemmelgarn
2017-12-20 17:28 ` Duncan
2017-12-21 11:44 ` Andrei Borzenkov
2017-12-21 12:27 ` Austin S. Hemmelgarn
2017-12-22 16:05 ` Tomasz Pala
2017-12-22 21:04 ` Chris Murphy
2017-12-23 2:52 ` Tomasz Pala
2017-12-23 5:40 ` Duncan
2017-12-19 23:59 ` Chris Murphy
2017-12-20 8:34 ` Tomasz Pala
2017-12-20 8:51 ` Tomasz Pala
2017-12-20 19:49 ` Chris Murphy
2017-12-18 5:11 ` Anand Jain
2017-12-18 1:20 ` Qu Wenruo
2017-12-18 13:31 ` Austin S. Hemmelgarn
2018-01-12 12:26 ` Dark Penguin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20171219202852.GA14726@polanet.pl \
--to=gotar@polanet.pl \
--cc=george@chinilu.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox