Re: 2x6 or 3x4 raid10 arrays ? - Keld Jørn Simonsen

linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

From: "Keld Jørn Simonsen" <keld@dkuug.dk>
To: Nat Makarevitch <nat@makarevitch.org>
Cc: linux-raid@vger.kernel.org
Subject: Re: 2x6 or 3x4 raid10 arrays ?
Date: Sun, 2 Mar 2008 01:30:57 +0100	[thread overview]
Message-ID: <20080302003057.GA6958@rap.rap.dk> (raw)
In-Reply-To: <loom.20080301T213935-669@post.gmane.org>

On Sat, Mar 01, 2008 at 10:05:06PM +0000, Nat Makarevitch wrote:
> Keld Jørn Simonsen <keld <at> dkuug.dk> writes:
> 
> > I believe that a full chunk is read for each read access.
> 
> I've read various definitions for "chunk". Is a "stripe" a 'cluster' (in the
> "group of disk sectors" meaning) on a single physical drive (device, let's say
> "spindle"), and a 'chunk' a set of stripes made with a single stripe of each
> spindle? For what I understand this is the definition used in the 'md' world
> (see 'man mdadm'), therefore I will use it thereafter.

I have understood chunk to be a set of sectors on a single device.
If it was to be understood as a set of space on multiple devices, then
the chunk size would normaly be relative to the number of devices
involved. And that is not what we talk about in eg mdadm chunk=256

> Yes, AFAIK a full chunk is concerned by each access.

Given your definition of chunk, I do not beleve so. A read of a chunk on
eg a 12 device raid, would not involve 12 reads, one on each device.

> > This would lead to that it is not important whether to use 
> > two arrays of 6 disks each, or 3 arrays of 4 disks each. 
> > Or for that sake 1 array of 12 disks.
> 
> I beg to disagree. Creating more than one array may be OK when you very
> precisely know your load profile per table, but in most cases this is not true,
> or this profile will vary, therefore your best bet is "to maintain, for each
> request, as much disk heads available as possible", carpet-bomb the array with
> all requests and let the elevator(s) optimize. Another way to see it, in some
> reciprocal way, is to say that you don't want to have any head sleeping when
> there is a request to serve. 

This is for bigger operations. I believe that for smaller operations,
such as a random read in a database, you would only like to have one IO
operation on one device. Seek times are important here. You can in one
average seek time of say 10 milliseconds on SATA-II transfer 800 kB
data, which is much more than normal size of a record in a table.
Having 12 operations for a record of say 1 kB would be very expensive.
The elevator is better of serving a number of concurrent requests.

> > Some other factors may be more important: such as the ability to survive
> > disk crashes
> 
> That's very true, however one may not neglect logistics. If I'm pretty sure that
> I can change a spindle in less than 2 hours after a failure I will prefer using
> all disks less one on a single array and letting the last one as a connected
> (but powered off) spare. The alarm trips, some automatic or manual procedure
> powers the spare and mounts it in the array, while the procedure aiming at
> physically extracting the failed device and replacing it (it will become the new
> spare) rolls. With more latency-prone logistics one may reserve more disks as
> spares.

Yes, rebuild time would also be a factor. Smaller raids are quicker to
rebuild, I think. Or maybe this is irrelevant, at least for raid10,f2 -
as rebuilding is done concurrently from all devices involved, and
probebly limited by the write speed of the replacing device.

Best regards
keld
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

next prev parent reply	other threads:[~2008-03-02  0:30 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-02-28 10:45 2x6 or 3x4 raid10 arrays ? Franck Routier
2008-02-28 11:22 ` Tim Southerwood
2008-02-28 16:54 ` Brendan Conoboy
2008-02-28 18:25 ` Janek Kozicki
2008-02-28 20:53   ` Janek Kozicki
2008-02-28 21:04     ` Brendan Conoboy
2008-03-01 20:07     ` Bill Davidsen
2008-03-01 20:55       ` Keld Jørn Simonsen
2008-02-28 22:36 ` Nat Makarevitch
2008-03-01 20:40   ` Keld Jørn Simonsen
2008-03-01 21:10     ` Keld Jørn Simonsen
2008-03-01 22:05     ` Nat Makarevitch
2008-03-02  0:30       ` Keld Jørn Simonsen [this message]
2008-03-02  9:00         ` Nat Makarevitch
2008-03-01 20:18 ` Bill Davidsen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20080302003057.GA6958@rap.rap.dk \
    --to=keld@dkuug.dk \
    --cc=linux-raid@vger.kernel.org \
    --cc=nat@makarevitch.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).