From: Brad Campbell <brad@wasp.net.au>
To: Michael Tokarev <mjt@tls.msk.ru>
Cc: mingz@ele.uri.edu, linux-raid@vger.kernel.org
Subject: Re: consistency detect
Date: Mon, 11 Oct 2004 14:58:59 +0400 [thread overview]
Message-ID: <416A6773.1080104@wasp.net.au> (raw)
In-Reply-To: <416A63B2.7020004@tls.msk.ru>
Michael Tokarev wrote:
>> In short, it does not deal with it at all. RAID will deal with a disk
>> failure, it has no guarantees about consistency on power failures,
>> hard lockups or other catastrophic events.
>
>
> This is incorrect. In-kernel raid code keeps track of arrays and
> underlying disk state during write operations. On clean shutdown,
> when everything has been written, raid superblocks on all disks
> gets updated to indicate this. In case of unclean shutdown, raid
> code will reconstruct older copies of data using most recent ones
> (ie, from a disk which has most recent "events" value in superblock).
> The same is done for all other raid levels (4, 5, 6), but onot for
> raid0 for obvious reasons (as there's no R in raid0 per se).
When does the "events" value in the superblock actually get updated? I understood it only got
updated on an event, ie raid start, raid stop, disk add/remove/fail.
I realise the system does an auto rebuild when started after an unclean shutdown, the question
really is how does it know which disk is the freshest in a raid-1? In a raid-4,5,6 it's pretty
obvious as there is really only one copy of the data, but then does the code actually ensure that
the data gets written before the updated parity? or does it just flush the lot to disk in what it
thinks is the most optimum fashion?
The In-kernel data becomes pretty moot when the kernel has just blasted a couple of large blocks out
to a couple of disks and the plug has been pulled. It's going to be pretty indeterminate as to which
disk has the most accurate image of what was actually sent to it. Thus my comment that there is
really no way of accurately dealing with a catastrophic failure, and RAID is not there to do that
anyway.
I guess if you had a hardware RAID card that had a battery backed up RAM you have a much better
chance but then you really have a mini-ups :p)
Brad
next prev parent reply other threads:[~2004-10-11 10:58 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-10-11 3:32 consistency detect Ming Zhang
2004-10-11 8:54 ` Brad Campbell
2004-10-11 10:42 ` Michael Tokarev
2004-10-11 10:58 ` Brad Campbell [this message]
2004-10-11 21:15 ` Ming Zhang
2004-10-11 23:23 ` Neil Brown
2004-10-12 0:05 ` Ming Zhang
2004-10-12 0:13 ` Neil Brown
2004-10-12 0:43 ` Ming Zhang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=416A6773.1080104@wasp.net.au \
--to=brad@wasp.net.au \
--cc=linux-raid@vger.kernel.org \
--cc=mingz@ele.uri.edu \
--cc=mjt@tls.msk.ru \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).