* RAID5E
@ 2006-05-31 13:47 Bill Davidsen
2006-05-31 14:27 ` RAID5E Mattias Wadenstein
2006-06-01 1:13 ` RAID5E Neil Brown
0 siblings, 2 replies; 6+ messages in thread
From: Bill Davidsen @ 2006-05-31 13:47 UTC (permalink / raw)
To: Linux Raid List
Where I was working most recently some systems were using RAID5E (RAID5
with both the parity and hot spare distributed). This seems to be highly
desirable for small arrays, where spreading head motion over one more
drive will improve performance, and in all cases where a rebuild to the
hot spare will avoid a bottleneck on a single drive.
Is there any plan to add this capability?
--
bill davidsen <davidsen@tmr.com>
CTO TMR Associates, Inc
Doing interesting things with small computers since 1979
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: RAID5E
2006-05-31 13:47 RAID5E Bill Davidsen
@ 2006-05-31 14:27 ` Mattias Wadenstein
2006-05-31 14:54 ` RAID5E Erik Mouw
2006-05-31 19:03 ` RAID5E Bill Davidsen
2006-06-01 1:13 ` RAID5E Neil Brown
1 sibling, 2 replies; 6+ messages in thread
From: Mattias Wadenstein @ 2006-05-31 14:27 UTC (permalink / raw)
To: Bill Davidsen; +Cc: Linux Raid List
On Wed, 31 May 2006, Bill Davidsen wrote:
> Where I was working most recently some systems were using RAID5E (RAID5 with
> both the parity and hot spare distributed). This seems to be highly desirable
> for small arrays, where spreading head motion over one more drive will
> improve performance, and in all cases where a rebuild to the hot spare will
> avoid a bottleneck on a single drive.
>
> Is there any plan to add this capability?
What advantage does that have over raid6? You use exactly as many drives
(n+2), with the disadvantage of having to do a rebuild without parity when
a drive fails and a raid failure at a double disk failure.
/Mattias Wadenstein
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: RAID5E
2006-05-31 14:27 ` RAID5E Mattias Wadenstein
@ 2006-05-31 14:54 ` Erik Mouw
2006-05-31 19:03 ` RAID5E Bill Davidsen
1 sibling, 0 replies; 6+ messages in thread
From: Erik Mouw @ 2006-05-31 14:54 UTC (permalink / raw)
To: Mattias Wadenstein; +Cc: Bill Davidsen, Linux Raid List
On Wed, May 31, 2006 at 04:27:54PM +0200, Mattias Wadenstein wrote:
> On Wed, 31 May 2006, Bill Davidsen wrote:
> >Where I was working most recently some systems were using RAID5E (RAID5
> >with both the parity and hot spare distributed). This seems to be highly
> >desirable for small arrays, where spreading head motion over one more
> >drive will improve performance, and in all cases where a rebuild to the
> >hot spare will avoid a bottleneck on a single drive.
> >
> >Is there any plan to add this capability?
>
> What advantage does that have over raid6? You use exactly as many drives
> (n+2), with the disadvantage of having to do a rebuild without parity when
> a drive fails and a raid failure at a double disk failure.
Advantage:
- Easier to calculate the checksum (RAID5 XOR instead of a generator
polynome with RAID6)
- Higher throughput compared to standard RAID5
- Actually uses the hot spare
Disadvantage:
- Doesn't protect against double disk failures, but RAID5 also doesn't
Note that you could also do RAID6E.
Erik
--
+-- Erik Mouw -- www.harddisk-recovery.com -- +31 70 370 12 90 --
| Lab address: Delftechpark 26, 2628 XH, Delft, The Netherlands
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: RAID5E
2006-05-31 14:27 ` RAID5E Mattias Wadenstein
2006-05-31 14:54 ` RAID5E Erik Mouw
@ 2006-05-31 19:03 ` Bill Davidsen
1 sibling, 0 replies; 6+ messages in thread
From: Bill Davidsen @ 2006-05-31 19:03 UTC (permalink / raw)
To: Mattias Wadenstein; +Cc: Linux Raid List
Mattias Wadenstein wrote:
> On Wed, 31 May 2006, Bill Davidsen wrote:
>
>> Where I was working most recently some systems were using RAID5E
>> (RAID5 with both the parity and hot spare distributed). This seems to
>> be highly desirable for small arrays, where spreading head motion
>> over one more drive will improve performance, and in all cases where
>> a rebuild to the hot spare will avoid a bottleneck on a single drive.
>>
>> Is there any plan to add this capability?
>
>
> What advantage does that have over raid6? You use exactly as many
> drives (n+2), with the disadvantage of having to do a rebuild without
> parity when a drive fails and a raid failure at a double disk failure.
The write overhead of RAID-6 vs. RAID-5 is much higher, both in terms of
CPU and disk operations generated. See below re double drive failure and
RAID-5 with spare.
As a starting point consider RAID-4, and four drives, one of which is a
hot spare (I did clearly note small arrays). For reads you only have
striping over two drives, for writes the parity drive has double the i/o
of the data drives. The spare may spin down or not, but if it fails it
will be just when you need it most.
With RAID-5 the load is evenly spread over the entire array of active
drives. This improves both read and write performance, but the hot spare
is still not used in normal operation. Note that in the two drive
failure mode the only critical time is during the rebuild to hot spare,
after that a second drive failure is tolerated without loss of data.
With RAID-5E the i/o is spread over all four drives, which gives a
further improvement in the read and write performance, particularly if
large reads and writes (relative to clunk size) are common. There is no
hot spare which is mostly unused, and rebuild to the hot spare is
distributed over all remaining drives.
Drawbacks: you can't share a single hot spare between multiple arrays.
That's not really common, and I mention it for completeness. The other
issue is that when a failed drive is replaced the rebuild appears to be
somewhat complex because the new drive doesn't just become the new hot
spare. The same issues apply regarding two drives failing, but the
rebuild time is usually shorter, so the exposure is less.
And finally, with RAID-6 and the same number of drives, if you have a
failure there's no hot spare, and the arrray runs in degraded mode until
the failed drive is replaced. That is a different balance between
reliability and performance after failure, and needs to be a
per-instance choice.
--
bill davidsen <davidsen@tmr.com>
CTO TMR Associates, Inc
Doing interesting things with small computers since 1979
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: RAID5E
2006-05-31 13:47 RAID5E Bill Davidsen
2006-05-31 14:27 ` RAID5E Mattias Wadenstein
@ 2006-06-01 1:13 ` Neil Brown
2006-06-12 13:50 ` RAID5E Bill Davidsen
1 sibling, 1 reply; 6+ messages in thread
From: Neil Brown @ 2006-06-01 1:13 UTC (permalink / raw)
To: Bill Davidsen; +Cc: Linux Raid List
On Wednesday May 31, davidsen@tmr.com wrote:
> Where I was working most recently some systems were using RAID5E (RAID5
> with both the parity and hot spare distributed). This seems to be highly
> desirable for small arrays, where spreading head motion over one more
> drive will improve performance, and in all cases where a rebuild to the
> hot spare will avoid a bottleneck on a single drive.
>
> Is there any plan to add this capability?
I thought about it briefly....
As I understand it, the layout of raid5e when non-degraded is very
similar to raid6 - however the 'Q' block is simply not used.
This would be trivial to implement.
The interesting bit comes when a device fails and you want to rebuild
that distributed spare.
There are two possible ways that you could do this:
1/ Leave the spare where it is and write the correct data into each
spare. This would be fairly easy but would leave an array with an
very ... interesting layout of data.
When you add a replacement you just move everything back.
2/ reshape the array to be a regular raid5 layout. This would be hard
to do well without NVRAM as you are moving live data, but would result
in a neat and tidy array. Ofcourse adding a drive back in would be
interesting again...
I had previously only thought of option '2', and so discarded the idea
as not worth the effort. The more I think about it, the more possible
option 1 sounds.
I've put it back on my todo list, but I don't expect to get to it this
year. Ofcourse if someone else wants to give it a try, I'm happy to
make suggestions and review code.
NeilBrown
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: RAID5E
2006-06-01 1:13 ` RAID5E Neil Brown
@ 2006-06-12 13:50 ` Bill Davidsen
0 siblings, 0 replies; 6+ messages in thread
From: Bill Davidsen @ 2006-06-12 13:50 UTC (permalink / raw)
To: Neil Brown; +Cc: Linux Raid List
Neil Brown wrote:
>On Wednesday May 31, davidsen@tmr.com wrote:
>
>
>>Where I was working most recently some systems were using RAID5E (RAID5
>>with both the parity and hot spare distributed). This seems to be highly
>>desirable for small arrays, where spreading head motion over one more
>>drive will improve performance, and in all cases where a rebuild to the
>>hot spare will avoid a bottleneck on a single drive.
>>
>>Is there any plan to add this capability?
>>
>>
>
>I thought about it briefly....
>
>As I understand it, the layout of raid5e when non-degraded is very
>similar to raid6 - however the 'Q' block is simply not used.
>This would be trivial to implement.
>
>The interesting bit comes when a device fails and you want to rebuild
>that distributed spare.
>There are two possible ways that you could do this:
>
>1/ Leave the spare where it is and write the correct data into each
> spare. This would be fairly easy but would leave an array with an
> very ... interesting layout of data.
> When you add a replacement you just move everything back.
>
>2/ reshape the array to be a regular raid5 layout. This would be hard
> to do well without NVRAM as you are moving live data, but would result
> in a neat and tidy array. Ofcourse adding a drive back in would be
> interesting again...
>
>I had previously only thought of option '2', and so discarded the idea
>as not worth the effort. The more I think about it, the more possible
>option 1 sounds.
>I've put it back on my todo list, but I don't expect to get to it this
>year. Ofcourse if someone else wants to give it a try, I'm happy to
>make suggestions and review code.
>
I do appreciate being too busy, I'm just glad I have been able to
clarify the tradeoffs of RAID5e, and get it on your list at all. I did
look at the code a bit, and it would seem that if the "rebuild to hot
spare" code is modified to handle a distributed spare, then it looks as
if RAID6e might pretty much fall out. Feel free to tell me I'm dreaming.
--
bill davidsen <davidsen@tmr.com>
CTO TMR Associates, Inc
Doing interesting things with small computers since 1979
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2006-06-12 13:50 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-05-31 13:47 RAID5E Bill Davidsen
2006-05-31 14:27 ` RAID5E Mattias Wadenstein
2006-05-31 14:54 ` RAID5E Erik Mouw
2006-05-31 19:03 ` RAID5E Bill Davidsen
2006-06-01 1:13 ` RAID5E Neil Brown
2006-06-12 13:50 ` RAID5E Bill Davidsen
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).