linux-raid.vger.kernel.org archive mirror
* Weird RAID 1 performance
@ 2004-09-18 21:11 Andrei Badea
  2004-09-19 10:10 ` Gordon Henderson
  0 siblings, 1 reply; 5+ messages in thread
From: Andrei Badea @ 2004-09-18 21:11 UTC (permalink / raw)
  To: linux-raid

Hello,

I have a RAID 1 whose write performance I tested by writing a 10 GB file 
to it:

dd if=/dev/zero of=/mnt/data/zeros bs=1024 count=...

Looking at GKrellM I noticed the CPU usage is very jumpy, going from a 
few to 99 percent (though it usually hovers around 50 percent). 
Moreover, the transfer regularly stops for a few seconds (the CPU usage 
is then about 2 percent). The average data transfer rate was 16 MB/s, 
while the disks alone can make almost 25 MB/s.

Is this normal behavior? Can the write performance be tuned (to be less 
"jumpy")?

The reason I'm asking is that I'm encountering hangs when video capturing 
to this RAID using mencoder and huffyuv. I'm not experiencing the hangs 
when capturing to another partition (not RAID), nor did I experience 
hangs during the dd copy.

Maybe the RAID 1 is just not suited for video capture?

The filesystem is ext3 and /etc/raidtab is:

raiddev /dev/md0
         raid-level              1
         nr-raid-disks           2
         nr-spare-disks          0
         persistent-superblock   1
         device                  /dev/hde1
         raid-disk               0
         device                  /dev/hdg1
         raid-disk               1

Thank you for your help.

Andrei

-- 
andrei.badea@movzx.net # http://movzx.net # ICQ: 52641547


* Re: Weird RAID 1 performance
  2004-09-18 21:11 Weird RAID 1 performance Andrei Badea
@ 2004-09-19 10:10 ` Gordon Henderson
  2004-09-19 15:32   ` Mark Hahn
  0 siblings, 1 reply; 5+ messages in thread
From: Gordon Henderson @ 2004-09-19 10:10 UTC (permalink / raw)
  To: Andrei Badea; +Cc: linux-raid

On Sat, 18 Sep 2004, Andrei Badea wrote:

> Hello,
>
> I have a RAID 1 whose write performance I tested by writing a 10 GB file
> to it:
>
> dd if=/dev/zero of=/mnt/data/zeros bs=1024 count=...
>
> Looking at GKrellM I noticed the CPU usage is very jumpy, going from a
> few to 99 percent (though it usually hovers around 50 percent).
> Moreover, the transfer regularly stops for a few seconds (the CPU usage
> is then about 2 percent). The average data transfer rate was 16 MB/s,
> while the disks alone can make almost 25 MB/s.

Modern IDE drives with the correct cabling ought to be able to write at up
to 55 MB/sec.

  gordon @ argus: time dd if=/dev/zero of=bigfile bs=1024 count=10485760
  10485760+0 records in
  10485760+0 records out
  2.570u 77.210s 3:43.44 35.7%    0+0k 0+0io 101pf+0w
  gordon @ argus: du -h bigfile
  11G     bigfile

Doing the math on the elapsed time gives me about 45 MB/sec.
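
Spelling the arithmetic out (3:43.44 elapsed is about 223.4 seconds, and the
file is 10485760 KiB):

  10485760 KiB / 1024   = 10240 MiB written
  10240 MiB / 223.4 s  ~= 45.8 MiB/sec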

That's to an ext3, RAID 1 mount with 2 IDE drives (hda, hdc).


So the first thing to check (as it's the easiest) is the drivers. Do you
have the right drivers installed for the hardware you have?

I'm guessing it's a PCI card (from /dev/hde1 and /dev/hdg1 in your
raidtab) or maybe a 2nd on-board controller with a pseudo RAID function.

Run hdparm /dev/hde /dev/hdg and see what you get. You ought to have DMA
turned on. If it's not, then you need to make sure you have the right
driver compiled into the kernel (or the right module loaded).
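
For example (hdparm's -d flag reads or sets the DMA setting; /dev/hde and
/dev/hdg are the drives from your raidtab):

  hdparm /dev/hde /dev/hdg     # look for "using_dma = 1 (on)" on both drives
  hdparm -d1 /dev/hde          # if it's off, this turns it on
  hdparm -d1 /dev/hdg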

The next thing to look for is interrupt sharing. I've found that a lot of
motherboards appear to have broken interrupt controllers. If it's running
with an APIC, check /proc/interrupts for MIS or ERR numbers > 0. Try
turning the APIC off, either by not compiling it into the kernel or by
passing the noapic flag on the kernel boot line (append="noapic" in
lilo.conf). The BIOS might also have a mode for either PIC or APIC.

You also might want to check /proc/interrupts to try to make sure every
device has its own interrupt anyway, but this might be impossible to do
depending on your hardware. I'm guessing you have a 2-port PCI card and
are using 2 cables off it (one drive per cable, which is good), and if so
it'll probably have just one interrupt for both devices. If you have a 2nd
card, try that; then it's one drive per card, which might be better.
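
A quick way to eyeball both the sharing and the error counters:

  cat /proc/interrupts                  # devices listed on the same IRQ line share it
  grep -E 'ERR|MIS' /proc/interrupts    # non-zero counts here point at APIC trouble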

> Is this normal behavior? Can the write performance be tuned (to be less
> "jumpy")?

Interrupts (and/or more likely the controllers) seem to me to be the
biggest bug/feature of a modern motherboard )-: I've seen systems work
fine - until you load up the Ethernet interface, or start to use the
(scsi) tape drive. Maybe the video capture card is on the same interrupt
and the hardware just can't cope with it? I have observed "jumpy"
behaviour as you describe too.

> The reason I'm asking is that I'm encountering hangs when video capturing
> to this RAID using mencoder and huffyuv. I'm not experiencing the hangs
> when capturing to another partition (not RAID), nor did I experience
> hangs during the dd copy.

Writing to a RAID 1 is going to do double the IO that writing to a single
device will do. You might want to try to un-raid the devices and try to
write to just one of the disks, then the other, but my guess is that you
won't see the problem until you load the bus/interrupt system by writing
to both at the same time (which RAID 1 will try to do).

You might also want to see if there's some process stuffing data out to a
log file via syslog during the capture. Some log files get flushed out on
every write, which might affect performance. See if anything is growing in
/var/log during a capture and check its entry in /etc/syslog.conf. This
might not obviously be a problem, but it could be causing writes to a
different disk (I'm guessing you have a /dev/hda to boot from, which is
where the log files go). So writing to that non-RAID disk on its own is
fine, but load the system up with writing to that drive and the 2 RAID
drives and ...
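
If something is being logged during the capture, note that classic sysklogd
syncs a log file after every message unless the filename in syslog.conf is
prefixed with a minus, along the lines of:

  # /etc/syslog.conf -- the leading '-' turns off the sync after every message
  daemon.*        -/var/log/daemon.log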

> Maybe the RAID 1 is just not suited for video capture?

I suspect the issues with your PC are more likely to be some hardware &
software (interrupt) problem rather than the RAID1 code. I've been using
the Linux s/w RAID for many years now and not had a problem with the
drivers at all - in all cases it's been the underlying hardware (or
specific PCI/IDE hardware drivers) which has been the issue.

Good luck!

Gordon


* Re: Weird RAID 1 performance
  2004-09-19 10:10 ` Gordon Henderson
@ 2004-09-19 15:32   ` Mark Hahn
  2004-09-19 15:50     ` Gordon Henderson
  2004-09-19 16:42     ` Andrei Badea
  0 siblings, 2 replies; 5+ messages in thread
From: Mark Hahn @ 2004-09-19 15:32 UTC (permalink / raw)
  To: Gordon Henderson; +Cc: Andrei Badea, linux-raid

> > I have a RAID 1 whose write performance I tested by writing a 10 GB file

but under which kernel?

> > Looking at GKrellM I noticed the CPU usage is very jumpy, going from a

that's some sort of gui monitoring tool, right?  I usually use 
"setrealtime vmstat 1" for this kind of thing, since it's at least 
a layer or two closer to the true numbers.

> > few to 99 percent (though it usually hovers around 50 percent).
> > Moreover, the transfer regularly stops for a few seconds (the CPU usage
> > is then about 2 percent). The average data transfer rate was 16 MB/s,
> > while the disks alone can make almost 25 MB/s.

sounds a bit like a combination of poor VM (certainly the case for 
the VM in some kernels), and possibly /proc/sys/vm settings.

> The next thing to look for is interrupt sharing. I've found a lot of

I doubt this is an issue - shared interrupts can result in a few 
extra IOs per interrupt (as the wrong driver checks its device),
but I'd be very surprised to find this affecting performance unless
the device is very slow or the irq rate very high (1e5 or so).

> > Is this normal behavior? Can the write performance be tuned (to be less
> > "jumpy")?

certainly.  in 2.4 kernels, it was trivial to set bdflush to wake up 
every second, rather than every 5 seconds (the default).  I do this 
on a fairly heavily loaded fileserver, since the particular load 
rarely sees any write-coalescing benefit past a few ms.
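
roughly what i mean, as a sketch (assuming the usual 2.4 layout where the
fifth /proc/sys/vm/bdflush field is the kupdate wakeup interval in jiffies -
check Documentation/sysctl/vm.txt for your kernel before touching it):

  # re-write bdflush with only the fifth field changed to 100 jiffies (~1 s at HZ=100)
  v=$(cat /proc/sys/vm/bdflush)
  echo $v | awk '{ $5 = 100; print }' > /proc/sys/vm/bdflush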

> Interrupts (and/or more likely the controllers) seem to me to be the
> biggest bug/feature of a modern motherboard )-: I've seen systems work

modern low-end motherboard, perhaps.

> > Maybe the RAID 1 is just not suited for video capture?

it's fine; the problem, if any, is your config.  what's the bandwidth
you need to write?  what's the bandwidth of your disks (after accounting
for the fact that every block is written twice)?  you should have a couple
seconds of buffer, at least, to "speed-match" the two rates, even if 
your producer (capture) is significantly slower than the bandwidth of 
your raid1.  note also that your capture card could well be eating 
lots of cpu and/or perturbing the kernel's vm.

regards, mark hahn.



* Re: Weird RAID 1 performance
  2004-09-19 15:32   ` Mark Hahn
@ 2004-09-19 15:50     ` Gordon Henderson
  2004-09-19 16:42     ` Andrei Badea
  1 sibling, 0 replies; 5+ messages in thread
From: Gordon Henderson @ 2004-09-19 15:50 UTC (permalink / raw)
  To: Mark Hahn; +Cc: Andrei Badea, linux-raid

On Sun, 19 Sep 2004, Mark Hahn wrote:

> > > few to 99 percent (though it usually hovers around 50 percent).
> > > Moreover, the transfer regularly stops for a few seconds (the CPU usage
> > > is then about 2 percent). The average data transfer rate was 16 MB/s,
> > > while the disks alone can make almost 25 MB/s.
>
> sounds a bit like a combination of poor VM (certainly the case for
> the VM in some kernels), and possibly /proc/sys/vm settings.
>
> > The next thing to look for is interrupt sharing. I've found a lot of
>
> I doubt this is an issue - shared interrupts can result in a few
> extra IOs per interrupt (as the wrong driver checks its device),
> but I'd be very surprised to find this affecting performance unless
> the device is very slow or the irq rate very high (1e5 or so).

I'm just saying what I've seen in some of the servers I've built. Same
hardware, same kernel (2.4.23 to 27), but by positioning boards in
different PCI slots and shuffling them about to reduce the number of
devices sharing an interrupt, I can make things work better or worse.

> > > Is this normal behavior? Can the write performance be tuned (to be less
> > > "jumpy")?
>
> certainly.  in 2.4 kernels, it was trivial to set bdflush to wake up
> every second, rather than every 5 seconds (the default).  I do this
> on a fairly heavily loaded fileserver, since the particular load
> rarely sees any write-coalescing benefit past a few ms.
>
> > Interrupts (and/or more likely the controllers) seem to me to be the
> > biggest bug/feature of a modern motherboard )-: I've seen systems work
>
> modern low-end motherboard, perhaps.

Perhaps. (But we don't know what the OP's motherboard is either.) The
servers I have had problems with were dual Athlon motherboards which
supported ECC RAM. I don't know what make they were, as I wasn't involved
in the purchasing side of things )-: I do know that when I try to get as
many devices as possible onto their own interrupt on these motherboards,
things go remarkably better.

I've seen incrementing ERR interrupts in single processor systems (ASUS
A7N266 and A7N8X) until you turn the APIC off too.

I've seen some very bizarre hardware issues though. One of the above dual
Athlon servers has a motherboard that needs to have a mouse plugged in to
make it work. It doesn't use the mouse (it's inside the case!) but if it's
not plugged in, it croaks after a while (and always during an fsck, for
some weird reason). We found this by scanning the net for hardware issues
with the particular chipset on the motherboard.

Go figure!

Gordon


* Re: Weird RAID 1 performance
  2004-09-19 15:32   ` Mark Hahn
  2004-09-19 15:50     ` Gordon Henderson
@ 2004-09-19 16:42     ` Andrei Badea
  1 sibling, 0 replies; 5+ messages in thread
From: Andrei Badea @ 2004-09-19 16:42 UTC (permalink / raw)
  To: linux-raid

Hello Mark and Gordon,

Thank you both for your answers.

Mark Hahn wrote:

 >>> I have a RAID 1 whose write performance I tested by writing a 10 GB file
 >
 > but under which kernel?


Sorry :-( It's 2.6.7 with the Debian patches, self-compiled.

 >>> Looking at GKrellM I noticed the CPU usage is very jumpy, going from a
 >
 > that's some sort of gui monitoring tool, right?  I usually use
 > "setrealtime vmstat 1" for this kind of thing, since it's at least a
 > layer or two closer to the true numbers.


I'm attaching the output from "setrealtime vmstat 1". You can see the "jumps" 
caused by executing

dd if=/dev/zero of=bigfile bs=1024 count=1048576

on my root partition, which is not on RAID. So it seems I'm having problems not 
with RAID, but with any high bandwidth disk transfer.

procs -----------memory---------- ---swap-- -----io---- --system-- ----cpu----
  r  b   swpd   free   buff  cache   si   so    bi    bo   in    cs us sy id wa
  2  0      0 324528   2628  52892    0    0    54   478 1582   920 10  2 86  2
  0  0      0 324336   2628  52892    0    0     0     0 2083  1823 11  1 88  0
  0  0      0 324336   2628  52892    0    0     0     0 2093  1521  3  1 96  0
  0  1      0 324328   2664  52892    0    0    36     0 2097  1600  4  2 87  7
  0  1      0 323560   3400  52892    0    0   724   148 2262  1711  2 10  0 88
  1  0      0 278184   3808  97340    0    0   360     0 2170  1541  4 57  0 39
  1  2      0 209192   3876 164540    0    0     0 34560 2380  2028 12 88  0  0
  1  2      0 136088   3944 235776    0    0     0 33728 2351  1625  8 92  0  0
  1  2      0  60376   4016 309596    0    0     0 33280 2342  1603  6 94  0  0
  0  4      0  39576   4036 330736    0    0     0 37368 2345  1469  4 31  0 65
  1  3      0  17016    580 356036    0    0     0 29768 2334  1391  7 75  0 18
  0  4      0   3320    600 369252    0    0     8 24576 2258  1594  3 27  0 70
  0  4      0   3064    640 369124    0    0     0 32768 2338  1709  5 52  0 43
  0  4      0   2872    672 369016    0    0     8 28616 2310  1709  4 45  0 51
  0  4      0   3400    696 368508    0    0     4 23708 2287  1662  3 36  0 61
  0  5      0   3648    736 368288    0    0     8 30264 2285  1618  4 32  0 64
  0  5      0   3520    772 368384    0    0     4 30724 2328  1523  4 45  0 51
  0  6      0   3868    780 368400    0    0     0 32388 2341  1560  3 18  0 79
  1  5      0   3000    800 369820    0    0     0 28416 2303  1473  3 29  0 68
  0  6      0   3192    856 369456    0    0     0 31616 2323  1648  6 83  0 11
  1  5      0   3192    876 369264    0    0     4 32384 2333  1764  5 48  0 47
  0  7      0   3128    900 369264    0    0     0 28608 2364  1597  4 50  0 46
  4  7      0   2928    920 369348    0    0     4 29072 2304  1581  4 33  0 63
  1  6      0   3312    960 368832    0    0     0 27884 2330  1582  5 50  0 45
  0  7      0   3952    980 368536    0    0     0 31240 2325  1464  3 33  0 64
  0  8      0   3184    992 369584    0    0     4 28680 2275  1456  3 11  0 86
  1  7      0   2928   1012 370028    0    0     0 28444 2333  1476  4 45  0 51
  0  8      0   3120   1032 369904    0    0     0 32796 2346  1479  4 40  0 56
  1  6      0   4588   1052 368452    0    0     4 32820 2340  1437  4 33  0 63
  2  2      0   3432   1096 369004    0    0     0 29436 2351  1572  7 68  0 25
  0  9      0   3196   1164 368900    0    0     0 33628 2316  1939  6 91  0  3
  1  7      0   3304   1196 368852    0    0     4 25904 2318  1522  3 41  0 56
  0  8      0   3560   1212 369060    0    0     0 35584 2330  1551  4 23  0 73
  0  9      0   2920   1036 370400    0    0     0 28696 2321  1505  3 55  0 42
  1  8      0   3048   1032 370072    0    0     0 32796 2329  1483  4 38  0 58
  0  7      0   3240    972 369532    0    0     0 28684 2324  1463  5 36  0 59
  0  7      0   3240    972 369532    0    0     0 32512 2331  1436  2  3  0 95

Are these numbers normal?

Also:

root@farpoint:tam0# hdparm /dev/hda

/dev/hda:
  multcount    = 16 (on)
  IO_support   =  1 (32-bit)
  unmaskirq    =  1 (on)
  using_dma    =  1 (on)
  keepsettings =  0 (off)
  readonly     =  0 (off)
  readahead    = 256 (on)
  geometry     = 65535/16/63, sectors = 78165360, start = 0

So DMA is ok. Gordon: the disks in the RAID also have DMA turned on.

 >>> few to 99 percent (though it usually hovers around 50 percent).
 >>> Moreover, the transfer regularly stops for a few seconds (the CPU usage
 >>> is then about 2 percent). The average data transfer rate was 16 MB/s,
 >>> while the disks alone can make almost 25 MB/s.
 >
 > sounds a bit like a combination of poor VM (certainly the case for the
 > VM in some kernels), and possibly /proc/sys/vm settings.


Any thoughts, links, anything on these settings? I must admit I've never done 
this, but I'm happy to learn.

 >> The next thing to look for is interrupt sharing. I've found a lot of
 >
 > I doubt this is an issue - shared interrupts can result in a few extra
 > IOs per interrupt (as the wrong driver checks its device), but I'd be
 > very surprised to find this affecting performance unless the device is
 > very slow or the irq rate very high (1e5 or so).


To answer Gordon: I'm not using the APIC at all.

 >>> Is this normal behavior? Can the write performance be tuned (to be less
 >>> "jumpy")?
 >
 > certainly.  in 2.4 kernels, it was trivial to set bdflush to wake up
 > every second, rather than every 5 seconds (the default).  I do this on a
 > fairly heavily loaded fileserver, since the particular load rarely sees
 > any write-coalescing benefit past a few ms.


What about 2.6 kernels?

 >> Interrupts (and/or more likely the controllers) seem to me to be the
 >> biggest bug/feature of a modern motherboard )-: I've seen systems work
 >
 >>> Maybe the RAID 1 is just not suited for video capture?
 >
 > it's fine; the problem, if any, is your config.  what's the bandwidth
 > you need to write?  what's the bandwidth of your disks (after accounting
 > for the fact that every block is written twice)?  you should have a
 > couple seconds of buffer, at least, to "speed-match" the two rates, even
 > if your producer (capture) is significantly slower than the bandwidth of
 > your raid1.  note also that your capture card could well be eating lots
 > of cpu and/or perturbing the kernel's vm.


The bandwidth is about 8 MB/s. Each disk alone can write at about 25 MB/s
and they are identical, so even with every block written twice, each disk
only has to sustain about 8 MB/s. The capture is indeed taking lots of CPU
(see the next vmstat output), but is it so much that the whole thing hangs?

procs -----------memory---------- ---swap-- -----io---- --system-- ----cpu----
  r  b   swpd   free   buff  cache   si   so    bi    bo   in    cs us sy id wa
  2  0      0 157868   1584  52972    0    0  3420     0 1127   733 23  5 31 40
  0  0      0 156524   1584  52972    0    0     0     0 1138   665 67  3 30  0
  1  0      0 156396   1584  52972    0    0     0     0 1139   684 68  3 29  0
  1  0      0 156204   1592  52972    0    0     0   124 1140   704 66  5 28  1
  1  0      0 156012   1592  52972    0    0     0     0 1214   719 67  4 29  0
  1  0      0 156140   1592  52972    0    0     0   256 1336   881 71  5 24  0
  2  0      0 155116   1592  52972    0    0     0     0 1147  1202 75  4 21  0
  1  0      0 154940   1592  52972    0    0     0     0 1138   657 68  3 29  0
  2  0      0 154748   1592  52972    0    0     0     0 1138   624 66  4 30  0
  1  0      0 154556   1600  52972    0    0     0    12 1142   712 65  4 31  0
  0  0      0 171132   1600  52972    0    0     0     0 1130   697 67  3 30  0

Again, thank you for your help.

Andrei

-- 
andrei.badea@movzx.net # http://movzx.net # ICQ: 52641547


