From: NeilBrown <neilb@suse.de>
To: Gionatan Danti <g.danti@assyoma.it>
Cc: linux-raid@vger.kernel.org, keld@keldix.com
Subject: Re: RAID 10 far and offset on-disk layouts
Date: Tue, 14 Jan 2014 09:27:51 +1100 [thread overview]
Message-ID: <20140114092751.09464b7b@notabene.brown> (raw)
In-Reply-To: <52D3BCB1.1010200@assyoma.it>
[-- Attachment #1: Type: text/plain, Size: 3060 bytes --]
On Mon, 13 Jan 2014 11:15:13 +0100 Gionatan Danti <g.danti@assyoma.it> wrote:
> On 01/13/2014 10:45 AM, NeilBrown wrote:
> > On Mon, 13 Jan 2014 09:52:50 +0100 Gionatan Danti <g.danti@assyoma.it> wrote:
> >
> >> Hi Neil,
> >> let me recap from a previous message:
> >>
> >> >FAR LAYOUT
> >> >md(4) states:
> >> >"The first copy of all data blocks will be striped across the early >part
> >> >of all drives in RAID0 fashion, and then the next copy of all blocks
> >> >will be striped across a later section of all drives, always ensuring
> >> >that all copies of any given block are on different drives"
> >> >
> >> >The "on different drives" part let me wonder _how_ are chunks
> >> >distributed. On a 4-disk array, I can imagine some different schemas:
> >> >
> >> >1) A1 A2 A3 A4
> >> > .. .. .. ..
> >> > A4 A1 A2 A3
> >> >
> >> >2) A1 A2 A3 A4
> >> > .. .. .. ..
> >> > A2 A1 A4 A3
> >> >
> >> >The first schema is the one depicted by SuSe documentation [1], while
> >> >the second is the one described by Wikipedia [2].
> >> >
> >> >Question 1: as the two schema have different reliability
> >> >characteristics, which is really used?
> >>
> >> SuSe entry:
> >> https://www.suse.com/documentation/sles11/stor_admin/data/raidmdadmr10cpx.html#b7cynnk
> >>
> >> Wikipedia entry:
> >> http://en.wikipedia.org/wiki/Linux_MD_RAID_10#LINUX-MD-RAID-10 (see how
> >> far layout is depicted)
> >>
> >> Keld kindly told me that the SuSe is simply not updated, as it depict a
> >> situation changed with newer kernels. So my two questions:
> >
> > I cannot see an important difference between the two pages you reference.
> > Both appear to be correct.
>
> Mmm... they seem different to me.
>
> SeSe FAR Layout:
>
> sda1 sdb1 sdc1 sde1
> 0 1 2 3
> 4 5 6 7
> . . .
> 3 0 1 2
> 7 4 5 6
>
> Notice how (for example) sdb1 is coupled both to sda1 (0,4) and
> sdc1(1,5). If sdb1 fails, any sda1 or sdc1 failure lead to data loss.
>
> Now, Wikipedia FAR Layout:
>
> 4 drives (sda1, sdb1, sdc1, sdd1)
> --------------------
> A1 A2 A3 A4
> A5 A6 A7 A8
> A9 A10 A11 A12
> .. .. .. ..
> A2 A1 A4 A3
> A6 A5 A8 A7
> A10 A9 A12 A11
> .. .. .. ..
>
> Notice now how a single disk (eg: sdb1) is coupled to only another
> _single_ disk (eg: sda1). In this case, if sdb1 fails, you had to lose
> sda1 to have a data loss. Losing sdc1 or sdd1 will _not_ lead to data loss.
>
Thanks for being explicit - it is much easier to answer explicit questions :-)
Yes, they are different. So the wikipedia article is wrong, or at least
misleading. That is not what the "f2" layout looks like.
The md driver does support that layout. I don't know yet what mdadm will
call it, but it won't be called "f2".
So this change:
http://en.wikipedia.org/w/index.php?title=Non-standard_RAID_levels&diff=501908270&oldid=501604733
was wrong.
NeilBrown
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]
next prev parent reply other threads:[~2014-01-13 22:27 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-12-27 14:29 RAID 10 far and offset on-disk layouts Gionatan Danti
2013-12-27 14:46 ` Peter Grandi
2013-12-27 15:16 ` Gionatan Danti
2013-12-27 17:16 ` Peter Grandi
2013-12-27 17:32 ` Gionatan Danti
2013-12-27 18:26 ` keld
2013-12-27 15:19 ` keld
2013-12-27 15:22 ` Gionatan Danti
2013-12-27 15:49 ` keld
2014-01-09 8:03 ` Gionatan Danti
2014-01-12 23:20 ` NeilBrown
2014-01-13 8:52 ` Gionatan Danti
2014-01-13 9:45 ` NeilBrown
2014-01-13 10:15 ` Gionatan Danti
2014-01-13 22:27 ` NeilBrown [this message]
2014-01-13 23:38 ` keld
2014-01-14 0:46 ` Stan Hoeppner
2014-01-14 9:38 ` keld
2014-01-14 9:06 ` Gionatan Danti
2014-01-14 9:16 ` NeilBrown
2014-01-14 9:27 ` Gionatan Danti
2014-01-14 10:06 ` keld
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20140114092751.09464b7b@notabene.brown \
--to=neilb@suse.de \
--cc=g.danti@assyoma.it \
--cc=keld@keldix.com \
--cc=linux-raid@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).