* striping of a 4 drive raid10
@ 2008-01-27 19:33 Keld Jørn Simonsen
2008-01-27 20:11 ` Peter Grandi
` (2 more replies)
0 siblings, 3 replies; 8+ messages in thread
From: Keld Jørn Simonsen @ 2008-01-27 19:33 UTC (permalink / raw)
To: linux-raid
Hi
I have tried to make a striping raid out of my new 4 x 1 TB
SATA-2 disks. I tried raid10,f2 in several ways:
1: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, md2 = raid0
of md0+md1
2: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, md2 = raid01,f2
of md0+md1
3: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, chunksize of
md0 =md1 =128 KB, md2 = raid0 of md0+md1 chunksize = 256 KB
4: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, chunksize
of md0 = md1 = 128 KB, md2 = raid01,f2 of md0+md1 chunksize = 256 KB
5: md0= raid10,f4 of sda1+sdb1+sdc1+sdd1
My new disks give a transfer rate of about 80 MB/s, so I expected
to have something like 320 MB/s for the whole raid, but I did not get
more than about 180 MB/s.
I think it may be something with the layout, that in effect
the drives should be something like:
sda1 sdb1 sdc1 sdd1
0 1 2 3
4 5 6 7
And this was not really doable for the combination of raids,
because thet combinations give different block layouts.
How can it be done? Do we need a new raid type?
Best regards
keld
^ permalink raw reply [flat|nested] 8+ messages in thread* Re: striping of a 4 drive raid10 2008-01-27 19:33 striping of a 4 drive raid10 Keld Jørn Simonsen @ 2008-01-27 20:11 ` Peter Grandi 2008-01-27 21:43 ` Keld Jørn Simonsen 2008-01-27 20:20 ` Neil Brown [not found] ` <18332.58858.191866.347099@notabene.brown> 2 siblings, 1 reply; 8+ messages in thread From: Peter Grandi @ 2008-01-27 20:11 UTC (permalink / raw) To: Linux RAID >>> On Sun, 27 Jan 2008 20:33:45 +0100, Keld Jørn Simonsen >>> <keld@dkuug.dk> said: keld> Hi I have tried to make a striping raid out of my new 4 x keld> 1 TB SATA-2 disks. I tried raid10,f2 in several ways: keld> 1: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, md2 = raid0 keld> of md0+md1 keld> 2: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, md2 = raid01,f2 keld> of md0+md1 keld> 3: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, chunksize of keld> md0 =md1 =128 KB, md2 = raid0 of md0+md1 chunksize = 256 KB keld> 4: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, chunksize keld> of md0 = md1 = 128 KB, md2 = raid01,f2 of md0+md1 chunksize = 256 KB These stacked RAID levels don't make a lot of sense. keld> 5: md0= raid10,f4 of sda1+sdb1+sdc1+sdd1 This also does not make a lot of sense. Why have four mirrors instead of two? Instead, try 'md0 = raid10,f2' for example. The first mirror of will be striped across the outer half of all four drives, and the second mirrors will be rotated in the inner half of each drive. Which of course means that reads will be quite quick, but writes and degraded operation will be slower. Consider this post for more details: http://www.spinics.net/lists/raid/msg18130.html [ ... ] - To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: striping of a 4 drive raid10 2008-01-27 20:11 ` Peter Grandi @ 2008-01-27 21:43 ` Keld Jørn Simonsen 0 siblings, 0 replies; 8+ messages in thread From: Keld Jørn Simonsen @ 2008-01-27 21:43 UTC (permalink / raw) To: Peter Grandi; +Cc: Linux RAID On Sun, Jan 27, 2008 at 08:11:35PM +0000, Peter Grandi wrote: > >>> On Sun, 27 Jan 2008 20:33:45 +0100, Keld Jørn Simonsen > >>> <keld@dkuug.dk> said: > > keld> Hi I have tried to make a striping raid out of my new 4 x > keld> 1 TB SATA-2 disks. I tried raid10,f2 in several ways: > > keld> 1: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, md2 = raid0 > keld> of md0+md1 > keld> 2: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, md2 = raid01,f2 > keld> of md0+md1 > keld> 3: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, chunksize of > keld> md0 =md1 =128 KB, md2 = raid0 of md0+md1 chunksize = 256 KB > keld> 4: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, chunksize > keld> of md0 = md1 = 128 KB, md2 = raid01,f2 of md0+md1 chunksize = 256 KB > > These stacked RAID levels don't make a lot of sense. > > keld> 5: md0= raid10,f4 of sda1+sdb1+sdc1+sdd1 > > This also does not make a lot of sense. Why have four mirrors > instead of two? My error, I did mean f2. Anyway 4 mirrors would make the disk 2 times faster than 2 disks, and given disk prices these days this could make a lot of sense. > Instead, try 'md0 = raid10,f2' for example. The first mirror of > will be striped across the outer half of all four drives, and > the second mirrors will be rotated in the inner half of each > drive. > > Which of course means that reads will be quite quick, but writes > and degraded operation will be slower. > > Consider this post for more details: > > http://www.spinics.net/lists/raid/msg18130.html Thanks for the reference. There is also more in the original article on possible layouts of what is now known as raid10,f2 http://marc.info/?l=linux-raid&m=107427614604701&w=2 including performance enhancements due to use of the faster outer sectors, and smaller average seek times because you can seek on only half the disk. best regards keld - To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: striping of a 4 drive raid10 2008-01-27 19:33 striping of a 4 drive raid10 Keld Jørn Simonsen 2008-01-27 20:11 ` Peter Grandi @ 2008-01-27 20:20 ` Neil Brown [not found] ` <479E1FD0.4000702@tmr.com> [not found] ` <18332.58858.191866.347099@notabene.brown> 2 siblings, 1 reply; 8+ messages in thread From: Neil Brown @ 2008-01-27 20:20 UTC (permalink / raw) To: Keld Jørn Simonsen; +Cc: linux-raid On Sunday January 27, keld@dkuug.dk wrote: > Hi > > I have tried to make a striping raid out of my new 4 x 1 TB > SATA-2 disks. I tried raid10,f2 in several ways: > > 1: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, md2 = raid0 > of md0+md1 > > 2: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, md2 = raid01,f2 > of md0+md1 > > 3: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, chunksize of > md0 =md1 =128 KB, md2 = raid0 of md0+md1 chunksize = 256 KB > > 4: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, chunksize > of md0 = md1 = 128 KB, md2 = raid01,f2 of md0+md1 chunksize = 256 KB > > 5: md0= raid10,f4 of sda1+sdb1+sdc1+sdd1 Try 6: md0 = raid10,f2 of sda1+sdb1+sdc1+sdd1 Also try raid10,o2 with a largeish chunksize (256KB is probably big enough). NeilBrown > > My new disks give a transfer rate of about 80 MB/s, so I expected > to have something like 320 MB/s for the whole raid, but I did not get > more than about 180 MB/s. > > I think it may be something with the layout, that in effect > the drives should be something like: > > sda1 sdb1 sdc1 sdd1 > 0 1 2 3 > 4 5 6 7 > > And this was not really doable for the combination of raids, > because thet combinations give different block layouts. > > How can it be done? Do we need a new raid type? > > Best regards > keld > - > To unsubscribe from this list: send the line "unsubscribe linux-raid" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 8+ messages in thread
[parent not found: <479E1FD0.4000702@tmr.com>]
* Re: striping of a 4 drive raid10 [not found] ` <479E1FD0.4000702@tmr.com> @ 2008-01-28 19:03 ` Keld Jørn Simonsen 0 siblings, 0 replies; 8+ messages in thread From: Keld Jørn Simonsen @ 2008-01-28 19:03 UTC (permalink / raw) To: Bill Davidsen; +Cc: Neil Brown, linux-raid On Mon, Jan 28, 2008 at 01:32:48PM -0500, Bill Davidsen wrote: > Neil Brown wrote: > >On Sunday January 27, keld@dkuug.dk wrote: > > > >>Hi > >> > >>I have tried to make a striping raid out of my new 4 x 1 TB > >>SATA-2 disks. I tried raid10,f2 in several ways: > >> > >>1: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, md2 = raid0 > >>of md0+md1 > >> > >>2: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, md2 = raid01,f2 > >>of md0+md1 > >> > >>3: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, chunksize > >>of md0 =md1 =128 KB, md2 = raid0 of md0+md1 chunksize = 256 KB > >> > >>4: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, chunksize > >>of md0 = md1 = 128 KB, md2 = raid01,f2 of md0+md1 chunksize = 256 KB > >> > >>5: md0= raid10,f4 of sda1+sdb1+sdc1+sdd1 > >> > > > >Try > > 6: md0 = raid10,f2 of sda1+sdb1+sdc1+sdd1 > > > >Also try raid10,o2 with a largeish chunksize (256KB is probably big > >enough). > > > > Looking at the issues raised, there might be some benefit from having > the mirror chunks on the slower inner tracks of a raid10, and to read > from the outer tracks if the drives with the data on the outer tracks > are idle. This would appear to offer a transfer rate benefit overall. Hmm, how do I do this? I think this is normal behaviour of a raid10,f2. Is that so? So you mean I should rather use f2 than o2? Or should I configure the f2 in some way? My hdparm -t gives: /dev/sda5: Timing buffered beginning disk reads: 82 MB in 1.00 seconds = 81.686 MB/sec Timing buffered ending disk reads: 42 MB in 1.03 seconds = 40.625 MB/sec Average seek time 13.714 msec, min=4.641, max=23.921 Average track-to-track time 28.151 msec, min=26.729, max=28.730 So, yes, there is a reason to use the faster outer tracks - and have the faster access time that f2 gives . How does o2 behave here? Does it read and search on the whole disk? As to your other comments in another mail, I could of cause install a newer kernel and mdadm, but then I would loose the support of my supported and paid system. And Neil said that there have been no performance fixes for f2 since the kernel I use (2.6.12). I thought that o2 support was included since 2.6.10 - but apparantly not so. Best regards keld ^ permalink raw reply [flat|nested] 8+ messages in thread
[parent not found: <18332.58858.191866.347099@notabene.brown>]
* Re: striping of a 4 drive raid10 [not found] ` <18332.58858.191866.347099@notabene.brown> @ 2008-01-27 20:48 ` Keld Jørn Simonsen 2008-01-27 21:57 ` Neil Brown 2008-01-28 18:24 ` Bill Davidsen 0 siblings, 2 replies; 8+ messages in thread From: Keld Jørn Simonsen @ 2008-01-27 20:48 UTC (permalink / raw) To: Neil Brown; +Cc: linux-raid On Mon, Jan 28, 2008 at 07:13:30AM +1100, Neil Brown wrote: > On Sunday January 27, keld@dkuug.dk wrote: > > Hi > > > > I have tried to make a striping raid out of my new 4 x 1 TB > > SATA-2 disks. I tried raid10,f2 in several ways: > > > > 1: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, md2 = raid0 > > of md0+md1 > > > > 2: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, md2 = raid01,f2 > > of md0+md1 > > > > 3: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, chunksize of > > md0 =md1 =128 KB, md2 = raid0 of md0+md1 chunksize = 256 KB > > > > 4: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, chunksize > > of md0 = md1 = 128 KB, md2 = raid01,f2 of md0+md1 chunksize = 256 KB > > > > 5: md0= raid10,f4 of sda1+sdb1+sdc1+sdd1 > > Try > 6: md0 = raid10,f2 of sda1+sdb1+sdc1+sdd1 That I already tried, (and I wrongly stated that I used f4 in stead of f2). I had two times a thruput of about 300 MB/s but since then I could not reproduce the behaviour. Are there errors on this that has been corrected in newer kernels? > Also try raid10,o2 with a largeish chunksize (256KB is probably big > enough). I tried that too, but my mdadm did not allow me to use the o flag. My kernel is 2.6.12 and mdadm is v1.12.0 - 14 June 2005. can I upgrade the mdadm alone to a newer version, and then which is recommendable? best regards keld ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: striping of a 4 drive raid10 2008-01-27 20:48 ` Keld Jørn Simonsen @ 2008-01-27 21:57 ` Neil Brown 2008-01-28 18:24 ` Bill Davidsen 1 sibling, 0 replies; 8+ messages in thread From: Neil Brown @ 2008-01-27 21:57 UTC (permalink / raw) To: Keld Jørn Simonsen; +Cc: linux-raid On Sunday January 27, keld@dkuug.dk wrote: > On Mon, Jan 28, 2008 at 07:13:30AM +1100, Neil Brown wrote: > > On Sunday January 27, keld@dkuug.dk wrote: > > > Hi > > > > > > I have tried to make a striping raid out of my new 4 x 1 TB > > > SATA-2 disks. I tried raid10,f2 in several ways: > > > > > > 1: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, md2 = raid0 > > > of md0+md1 > > > > > > 2: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, md2 = raid01,f2 > > > of md0+md1 > > > > > > 3: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, chunksize of > > > md0 =md1 =128 KB, md2 = raid0 of md0+md1 chunksize = 256 KB > > > > > > 4: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, chunksize > > > of md0 = md1 = 128 KB, md2 = raid01,f2 of md0+md1 chunksize = 256 KB > > > > > > 5: md0= raid10,f4 of sda1+sdb1+sdc1+sdd1 > > > > Try > > 6: md0 = raid10,f2 of sda1+sdb1+sdc1+sdd1 > > That I already tried, (and I wrongly stated that I used f4 in stead of > f2). I had two times a thruput of about 300 MB/s but since then I could > not reproduce the behaviour. Are there errors on this that has been > corrected in newer kernels? No, I don't think any performance related changes have been made to raid10 lately. You could try increasing the read-ahead size. For a 4-drive raid10 it defaults to 4 times the read-ahead setting of a single drive, but increasing substantially (e.g. 64 times) seem to increase the speed of "dd" reading a gigabyte. Whether that will actually affect your target workload is a different question. > > > > Also try raid10,o2 with a largeish chunksize (256KB is probably big > > enough). > > I tried that too, but my mdadm did not allow me to use the o flag. > > My kernel is 2.6.12 and mdadm is v1.12.0 - 14 June 2005. > can I upgrade the mdadm alone to a newer version, and then which is > recommendable? You would need a newer kernel and a newer mdadm to get raid10 - offset mode. NeilBrown ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: striping of a 4 drive raid10 2008-01-27 20:48 ` Keld Jørn Simonsen 2008-01-27 21:57 ` Neil Brown @ 2008-01-28 18:24 ` Bill Davidsen 1 sibling, 0 replies; 8+ messages in thread From: Bill Davidsen @ 2008-01-28 18:24 UTC (permalink / raw) To: Keld Jørn Simonsen; +Cc: Neil Brown, linux-raid Keld Jørn Simonsen wrote: > On Mon, Jan 28, 2008 at 07:13:30AM +1100, Neil Brown wrote: > >> On Sunday January 27, keld@dkuug.dk wrote: >> >>> Hi >>> >>> I have tried to make a striping raid out of my new 4 x 1 TB >>> SATA-2 disks. I tried raid10,f2 in several ways: >>> >>> 1: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, md2 = raid0 >>> of md0+md1 >>> >>> 2: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, md2 = raid01,f2 >>> of md0+md1 >>> >>> 3: md0 = raid10,f2 of sda1+sdb1, md1= raid10,f2 of sdc1+sdd1, chunksize of >>> md0 =md1 =128 KB, md2 = raid0 of md0+md1 chunksize = 256 KB >>> >>> 4: md0 = raid0 of sda1+sdb1, md1= raid0 of sdc1+sdd1, chunksize >>> of md0 = md1 = 128 KB, md2 = raid01,f2 of md0+md1 chunksize = 256 KB >>> >>> 5: md0= raid10,f4 of sda1+sdb1+sdc1+sdd1 >>> >> Try >> 6: md0 = raid10,f2 of sda1+sdb1+sdc1+sdd1 >> > > That I already tried, (and I wrongly stated that I used f4 in stead of > f2). I had two times a thruput of about 300 MB/s but since then I could > not reproduce the behaviour. Are there errors on this that has been > corrected in newer kernels? > > > >> Also try raid10,o2 with a largeish chunksize (256KB is probably big >> enough). >> > > I tried that too, but my mdadm did not allow me to use the o flag. > > My kernel is 2.6.12 and mdadm is v1.12.0 - 14 June 2005. > can I upgrade the mdadm alone to a newer version, and then which is > recommendable? > I doubt that updating the mdadm is going to help, the kernel is old and lacks a number of improvements in the last few years. I don't think you will see any major improvements without a kernel upgrade. -- Bill Davidsen <davidsen@tmr.com> "Woe unto the statesman who makes war without a reason that will still be valid when the war is over..." Otto von Bismark - To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2008-01-28 19:03 UTC | newest]
Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-01-27 19:33 striping of a 4 drive raid10 Keld Jørn Simonsen
2008-01-27 20:11 ` Peter Grandi
2008-01-27 21:43 ` Keld Jørn Simonsen
2008-01-27 20:20 ` Neil Brown
[not found] ` <479E1FD0.4000702@tmr.com>
2008-01-28 19:03 ` Keld Jørn Simonsen
[not found] ` <18332.58858.191866.347099@notabene.brown>
2008-01-27 20:48 ` Keld Jørn Simonsen
2008-01-27 21:57 ` Neil Brown
2008-01-28 18:24 ` Bill Davidsen
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).