From: Bill Davidsen <davidsen@tmr.com>
To: Chris Green <cgreen@valvesoftware.com>
Cc: Tony Germano <tony_germano@hotmail.com>, linux-raid@vger.kernel.org
Subject: Re: Proposal: non-striping RAID4
Date: Wed, 28 May 2008 19:14:30 -0400 [thread overview]
Message-ID: <483DE756.6020004@tmr.com> (raw)
In-Reply-To: <BC2A76AABABF8741A8ADEE62DB1350F80384C398@exchange2.valvesoftware.com>
Chris Green wrote:
> I don't think this quite does it. It sounds like that would give me the
> spin down capability, but not (to me), the most
> interesting facility - the ability to have a system with
> RAID5-equivalent redundancy
> (i.e. I get N-1 drives worth of storage and can recover perfectly from
> the loss of 1 drive) but also lets me survive a multiple
> drive failure with only partial data loss.
>
Agree, you can't get that with current kernel, although I think the
parts are there to actually make it work. All it would take is some
clever calling of raid-4 operations to get it going. It's not trivial,
but if you knew the code it might fall out by laying the raid-4 out and
just changing the mapping such that all of the sequential sectors fall
on the same device. Interesting summer project for someone.
> -----Original Message-----
> From: Bill Davidsen [mailto:davidsen@tmr.com]
> Sent: Saturday, May 24, 2008 7:22 AM
> To: Chris Green
> Cc: Tony Germano; linux-raid@vger.kernel.org
> Subject: Re: Proposal: non-striping RAID4
>
> Chris Green wrote:
>> I would really like to have this functionality. Honestly, its pretty
>> much perfect for the "home server" application (which I have several
>> of), where:
>>
>> - writes are far less common than reads,
>> - The system goes hours without any reads and days without any
>> writes.
>> - single drive read speed is plenty for the applications that are
>> sitting on the other side
>> - a lot of the data is too voluminous to backup (media that can
> just
>> be re-ripped or downloaded).
>> - you want some redundancy beyond a single drive copy, but don't
> want
>> to spend a lot of drives on it. The model of "if you lose 1 disk, you
>> lose nothing, if you lose 2 disks you lose a portion" is better than
> the
>> raid5 model of losing everything with a double-disk failure.
>> - a common access pattern is to do a long sequential read at a slow
>> rate that takes hours to go through a few gigs (playing media).
>>
>
> I think you can do this right now with a touch of cleverness...
>
> Assume you create a raid-1 array, load your data, and call that
> initialized.
>
> From cron, daily or weekly, you set one drive of the array
> "write-mostly" and set the spin-down time (hdparm -S) to an hour or so.
> Now reads will go to one drive, the other will spin down, *and*, should
> you do one of those infrequent writes, the idle drive will spin back up
> and write the data (I want a bitmap of course). At the end of the time
> period you clear the write-mostly and spin-down time on the idle drive,
> put them on the other drive, and ideally you wind up with redundancy,
> splitting the disk wear evenly, and using existing capabilities.
>
> Actually you can't quite use existing capabilities, write-mostly can
> only be used at inconvenient times, like build, create, or add, so it's
> not obviously possible to change without at least shutting the array
> down. Perhaps Neil will give us his thoughts on that. However, if you
> don't mind a *really* ugly script, you might be able to mark the active
> drive failed, which would force all i/o to the previously sleeping
> drive, then remove the previously active drive, and add it back in using
>
> write-mostly. You would do a full sync (I think) but the change would be
>
> made.
>
> Better to make write-mostly a flag which can be enabled and disabled at
> will. That would be useful when a remote drive is normally operated over
>
> a fast link and has to drop to a slow backup link. I'm sure other uses
> would be found.
>
--
Bill Davidsen <davidsen@tmr.com>
"We have more to fear from the bungling of the incompetent than from
the machinations of the wicked." - from Slashdot
next prev parent reply other threads:[~2008-05-28 23:14 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-05-22 21:15 Proposal: non-striping RAID4 Tony Germano
2008-05-22 22:10 ` David Lethe
2008-05-22 22:56 ` Tony Germano
2008-05-23 15:12 ` Roger Heflin
2008-05-23 15:47 ` Chris Green
2008-05-24 14:21 ` Bill Davidsen
2008-05-24 14:19 ` Chris Green
2008-05-28 23:14 ` Bill Davidsen [this message]
2008-05-30 17:23 ` Tony Germano
-- strict thread matches above, loose matches on Subject: below --
2007-11-23 15:58 Chris Green
2007-11-10 0:57 James Lee
2007-11-12 1:29 ` Bill Davidsen
2007-11-13 23:48 ` James Lee
2007-11-14 1:06 ` James Lee
2007-11-14 23:16 ` Bill Davidsen
2007-11-15 0:24 ` James Lee
2007-11-15 6:01 ` Neil Brown
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=483DE756.6020004@tmr.com \
--to=davidsen@tmr.com \
--cc=cgreen@valvesoftware.com \
--cc=linux-raid@vger.kernel.org \
--cc=tony_germano@hotmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.