* mdadm - raid6 best practices ? @ 2017-09-22 3:55 Nikhil Kshirsagar 2017-09-22 18:26 ` Song Liu 2017-09-23 0:40 ` Stan Hoeppner 0 siblings, 2 replies; 6+ messages in thread From: Nikhil Kshirsagar @ 2017-09-22 3:55 UTC (permalink / raw) To: Linux RAID Hello linux-raid, A customer has some questions about raid6 best practices. They are using a 84-disk MD1280 Expansion Shelf (seagate disks). They want to use mdadm to setup a s/w raid, using raid6. The questions are a) recommended number of disks included in RAID array set? b) Is it possible to include all 84 disks in one array and leave 2 of them as host spare? c) Is there a raid6 best practices document to refer to ? Can anyone point me to me any internal docs or articles to construct a reply to the customer? Thanks, Nikhil. ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: mdadm - raid6 best practices ? 2017-09-22 3:55 mdadm - raid6 best practices ? Nikhil Kshirsagar @ 2017-09-22 18:26 ` Song Liu 2017-09-22 19:24 ` Wols Lists 2017-09-23 0:40 ` Stan Hoeppner 1 sibling, 1 reply; 6+ messages in thread From: Song Liu @ 2017-09-22 18:26 UTC (permalink / raw) To: linux-raid; +Cc: nkshirsa, Song Liu I would say this is a tradeoff between space overhead (2 parity disks per array) and data reliability (mean time before data loss, MTBDL, or MTTF). Space overhead is easy to calculate. For MTBDL, there are various of documents online, for example: https://jontse.com/courses/files/cornell/ece5730/Lecture24.pdf For your questions: > a) recommended number of disks included in RAID array set? I would personally recommend 12 to 15 HDDs per array. > b) Is it possible to include all 84 disks in one array and leave 2 of them as host spare? It is possible, but not recommended. 84 disks per array really hurts reliability. > c) Is there a raid6 best practices document to refer to ? I am not aware of such documents. Hope these help. Song ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: mdadm - raid6 best practices ? 2017-09-22 18:26 ` Song Liu @ 2017-09-22 19:24 ` Wols Lists 2017-09-22 21:44 ` John Stoffel 0 siblings, 1 reply; 6+ messages in thread From: Wols Lists @ 2017-09-22 19:24 UTC (permalink / raw) To: linux-raid; +Cc: nkshirsa On 22/09/17 19:26, Song Liu wrote: > I would say this is a tradeoff between space overhead (2 parity disks per > array) and data reliability (mean time before data loss, MTBDL, or MTTF). > Space overhead is easy to calculate. For MTBDL, there are various of > documents online, for example: > > https://jontse.com/courses/files/cornell/ece5730/Lecture24.pdf > > For your questions: > >> a) recommended number of disks included in RAID array set? > I would personally recommend 12 to 15 HDDs per array. > My gut feel too was that 84 is an awful lot of disks ... >> b) Is it possible to include all 84 disks in one array and leave 2 of > them as host spare? Do you mean hot spare, or were you thinking of raid-6's two parity disks? The problem with a hot spare is exactly that - when the array is powered up, they will be powered up. Yes, with an array that size I guess you really do want a bunch of hot spares configured, but I'm not sure that's really a good use of disks. If your main disks are beginning to die, the spares could easily be going the same way :-( > It is possible, but not recommended. 84 disks per array really hurts > reliability. > >> c) Is there a raid6 best practices document to refer to ? > I am not aware of such documents. > I don't know as it's really a "best practices" document, but there's a fair bit in the archaeology section of the website you might find interesting. More about optimisation and speed tests and stuff, but because it was all old when we started updating the site last year, that's why it ended up in archaeology. One thing you really must do is regular scrubs. And MAKE SURE that you have lots of active and passive monitoring - it would be oh so easy to lose three disks the moment the guy on watch gets casual and stops checking ... I don't have access to the stats off the top of my head, but with that many disks, losing a couple a month to failures doesn't seem that far-fetched. > Hope these help. > If you do come across some stuff (or write it yourself :-) please let me know - this is exactly the stuff that should be on the website if I can find it. > Song Cheers, Wol ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: mdadm - raid6 best practices ? 2017-09-22 19:24 ` Wols Lists @ 2017-09-22 21:44 ` John Stoffel 0 siblings, 0 replies; 6+ messages in thread From: John Stoffel @ 2017-09-22 21:44 UTC (permalink / raw) To: Wols Lists; +Cc: linux-raid, nkshirsa >>>>> "Wols" == Wols Lists <antlists@youngman.org.uk> writes: Wols> On 22/09/17 19:26, Song Liu wrote: >> I would say this is a tradeoff between space overhead (2 parity disks per >> array) and data reliability (mean time before data loss, MTBDL, or MTTF). >> Space overhead is easy to calculate. For MTBDL, there are various of >> documents online, for example: >> >> https://jontse.com/courses/files/cornell/ece5730/Lecture24.pdf >> >> For your questions: >> >>> a) recommended number of disks included in RAID array set? >> I would personally recommend 12 to 15 HDDs per array. >> Wols> My gut feel too was that 84 is an awful lot of disks ... It really is... just do the math. If you have 84 disks, and two of them are parity, what's the rate of failures that gives you three disk failures and them *boom* your data is gone. Also, with that wide a stripe, you then have *horrible* performance when you update a small 32k file, it has to read and re-write 81 disks worth of data. You're going to have horrible performance. It would be smarter to have 12-16 disks in RAID6 pairs, then you could either stripe or append to them. I'd probably not even do that unless I needed a single large volume. The goal is to minimize data loss if something goes wrong. You also don't mention what filesystem(s) you've looked into for this. Or how you plan of sharing this disk space to other systems. This is another area you need to be careful off. Also, when you have this many disks in your system, which sounds like one of the backblaze.com systems, you need to think about your controllers and whether or not you're using expanders, etc. If you're just going to maximum space for the least money, you could just go with RAID0 and stripe data across the disks. Pretty fast I bet, but if you lose one... boom! All gone. You could do five 16 disk RAID6 arrays, leaving you with four hot spares. Then I'd probably just make them all seperate XFS filesystems, especially since I don't know if you're using 4Tb disks or some other size. When you have this much space... it pays to be cautious, because users will assume it's reliable and won't lose their data. John ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: mdadm - raid6 best practices ? 2017-09-22 3:55 mdadm - raid6 best practices ? Nikhil Kshirsagar 2017-09-22 18:26 ` Song Liu @ 2017-09-23 0:40 ` Stan Hoeppner 2017-09-23 3:22 ` Nikhil Kshirsagar 1 sibling, 1 reply; 6+ messages in thread From: Stan Hoeppner @ 2017-09-23 0:40 UTC (permalink / raw) To: Nikhil Kshirsagar, Linux RAID On 09/21/2017 10:55 PM, Nikhil Kshirsagar wrote: > Hello linux-raid, > > A customer has some questions about raid6 best practices. They are > using a 84-disk MD1280 Expansion Shelf (seagate disks). They want to > use mdadm to setup a s/w raid, using raid6. > > The questions are > > a) recommended number of disks included in RAID array set? > b) Is it possible to include all 84 disks in one array and leave 2 of > them as host spare? > c) Is there a raid6 best practices document to refer to ? > > Can anyone point me to me any internal docs or articles to construct a > reply to the customer? > > Thanks, > Nikhil. The single most important question: What is the workload? Start the discussion there, or everything else is meaningless. -- Stan ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: mdadm - raid6 best practices ? 2017-09-23 0:40 ` Stan Hoeppner @ 2017-09-23 3:22 ` Nikhil Kshirsagar 0 siblings, 0 replies; 6+ messages in thread From: Nikhil Kshirsagar @ 2017-09-23 3:22 UTC (permalink / raw) To: Stan Hoeppner; +Cc: Linux RAID Thank you all for the information, I'll get some more workload details. Thanks again! -nikhil. On Sat, Sep 23, 2017 at 6:10 AM, Stan Hoeppner <stan@hardwarefreak.org> wrote: > On 09/21/2017 10:55 PM, Nikhil Kshirsagar wrote: >> >> Hello linux-raid, >> >> A customer has some questions about raid6 best practices. They are >> using a 84-disk MD1280 Expansion Shelf (seagate disks). They want to >> use mdadm to setup a s/w raid, using raid6. >> >> The questions are >> >> a) recommended number of disks included in RAID array set? >> b) Is it possible to include all 84 disks in one array and leave 2 of >> them as host spare? >> c) Is there a raid6 best practices document to refer to ? >> >> Can anyone point me to me any internal docs or articles to construct a >> reply to the customer? >> >> Thanks, >> Nikhil. > > > The single most important question: > > What is the workload? > > Start the discussion there, or everything else is meaningless. > > -- > Stan ^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2017-09-23 3:22 UTC | newest] Thread overview: 6+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2017-09-22 3:55 mdadm - raid6 best practices ? Nikhil Kshirsagar 2017-09-22 18:26 ` Song Liu 2017-09-22 19:24 ` Wols Lists 2017-09-22 21:44 ` John Stoffel 2017-09-23 0:40 ` Stan Hoeppner 2017-09-23 3:22 ` Nikhil Kshirsagar
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).