* KVM incompatible with multipath?
@ 2009-06-27 15:57 infernix
2009-06-27 18:18 ` Anthony Liguori
0 siblings, 1 reply; 4+ messages in thread
From: infernix @ 2009-06-27 15:57 UTC (permalink / raw)
To: kvm
Hi,
I've got an Equallogic iSCSI box with 3x1GBit nics on my storage
network. I also have a server with 4 nics of which 3 are connected to
the storage network with 3x1GBit.
On this host, open-iscsi is running and connecting to the SAN over 3
interfaces. I'm seeing /dev/sdb, sdc and sdd, which are all the same
harddisk (same WWID).
With multipath I combine them to one disk to get a nice boost in
performance. This works perfectly fine (and very speedy) on the host.
Relevant multipath.conf data:
device {
vendor "EQLOGIC"
product "100E-00"
path_grouping_policy multibus
getuid_callout "/lib/udev/scsi_id -g -u -s /block/%n"
features "1 queue_if_no_path"
path_checker readsector0
failback immediate
path_selector "round-robin 0"
rr_weight uniform
rr_min_io 128
}
The problem with KVM (version 87) is that KVM cannot make use of the
multipath device in /dev/mapper.
If I boot a debootstrapped Debian Lenny KVM guest and configure it with
-hda /dev/mapper/my-multipathed-iscsi-disk, I cannot mount the root
partition. It'll complain about "Invalid target".
If I use -hda /dev/sdb or sdc or sdd (e.g. any of the multipathed disks,
which again are all the same but just through 3 separate connections),
everything works perfectly fine. Note that the behavior is the same when
using virtio.
I have tried to put LVM on top of the multipath device in the host, and
then run the KVM guest with -hda /dev/volumegroup/logicalvolume-blah but
that leads to serious data corruption. If I mount one of the lvm
partitions on the host, copy data to it, then unmount it and boot the
KVM guest, the data isn't there. If I shutdown the guest and do an
e2fsck on the host, I get lots of missing/unlinked inodes and a lot of
it is moved to /lost+found.
Does anyone have experience with KVM on multipathed disks? I'm really
puzzled here because everything works perfectly fine from the hosts
perspective.
Thanks for any help!
Regards,
infernix
^ permalink raw reply [flat|nested] 4+ messages in thread* Re: KVM incompatible with multipath? 2009-06-27 15:57 KVM incompatible with multipath? infernix @ 2009-06-27 18:18 ` Anthony Liguori 2009-06-27 22:42 ` infernix 0 siblings, 1 reply; 4+ messages in thread From: Anthony Liguori @ 2009-06-27 18:18 UTC (permalink / raw) To: infernix; +Cc: kvm infernix wrote: > Hi, > > I've got an Equallogic iSCSI box with 3x1GBit nics on my storage > network. I also have a server with 4 nics of which 3 are connected to > the storage network with 3x1GBit. > > On this host, open-iscsi is running and connecting to the SAN over 3 > interfaces. I'm seeing /dev/sdb, sdc and sdd, which are all the same > harddisk (same WWID). > > With multipath I combine them to one disk to get a nice boost in > performance. This works perfectly fine (and very speedy) on the host. > Relevant multipath.conf data: > > device { > vendor "EQLOGIC" > product "100E-00" > path_grouping_policy multibus > getuid_callout "/lib/udev/scsi_id -g -u -s /block/%n" > features "1 queue_if_no_path" > path_checker readsector0 > failback immediate > path_selector "round-robin 0" > rr_weight uniform > rr_min_io 128 > } > > The problem with KVM (version 87) is that KVM cannot make use of the > multipath device in /dev/mapper. > > If I boot a debootstrapped Debian Lenny KVM guest and configure it with > -hda /dev/mapper/my-multipathed-iscsi-disk, I cannot mount the root > partition. It'll complain about "Invalid target". How do you get the disk image onto /dev/mapper/my-multipathed-iscsi-disk? You need to create a partition table and setup grub in order to be able to use something as -hda. You don't get that automatically with debootstrap. Regards, Anthony Liguori ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: KVM incompatible with multipath? 2009-06-27 18:18 ` Anthony Liguori @ 2009-06-27 22:42 ` infernix 2009-06-27 23:00 ` infernix 0 siblings, 1 reply; 4+ messages in thread From: infernix @ 2009-06-27 22:42 UTC (permalink / raw) To: kvm Anthony Liguori wrote: > You need to create a partition table and setup grub in order to be able > to use something as -hda. You don't get that automatically with > debootstrap. Although I didn't include that in my mail, I did configure partitions and debootstrapped Lenny on part1. But that's not what's important anymore; this post kind of grew into a performance test of virtio block and net for Xen 3.2.1 and KVM 87, specifically when using multipath IO to an Equallogic iSCSI box. I made this work as a Xen 3.2.1 domU on the same box and ran some performance tests. Afterwards I retried KVM and I didn't experience the problems I had before, for reasons unknown. I boot the same multipathed disk that I used for Xen with 2.6.27.25 (+ kvm-87 modules in initrd). It actually boots now and I have no problems whatsoever. Note that this kernel I built works for xen domU, Linux native and for KVM guest. I've ran some bonnie++ tests, see below for the results. Whats interesting in these results is that KVM guest has much lower sequential block output than on the host kernel, but much better sequential input. The latter is probably due to caching and buffers in the KVM host kernel. Setting cache=writeback improves both, but still block output is ~75MB/sec slower than on the host. It seems that KVM guest write performance is CPU limited. Any advice on how to get better write speeds is highly appreciated. In any case, so far I'm unable to reproduce the data corruption. Probably some glitch in the matrix. Here's bonnie++ output with Xen domU (kernel 2.6.27.25) which has the multipathed 36090a0383049a2ac41a4643f000070c2-part1 configured as root disk xvda1: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 77918 98 115400 18 56323 8 60034 69 122466 7 422.4 0 Here's bonnie++ output with KVM guest (kernel 2.6.27.25) using virtio: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 59461 95 97238 17 60831 12 62181 95 205094 25 656.4 2 Here's host performance with Xen dom0 (kernel 2.6.26-2-amd64 from lenny): ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 66519 93 172569 50 85457 35 59671 86 164754 40 451.8 0 And here's native performance with 2.6.27.25 (no xen): ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 71796 98 182818 53 85511 29 61484 79 165302 31 668.4 1 The above tests were performed with blockdev --setra 16384 and MTU 1500. Here's native Linux with jumbo frames: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 73616 99 195577 48 96794 27 68845 84 201899 29 630.3 1 Here's KVM guest performance (with jumbo frames in the hosts iscsi interfaces), cache=writethrough: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 60814 96 93222 15 64166 12 58015 94 258557 31 649.3 2 Here's KVM guest performance (with jumbo frames in the hosts iscsi interfaces), cache=writeback and bonnie size=2.5 times host RAM: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 52627 95 120630 23 100333 22 61271 94 284889 37 464.6 2 Xen domU with jumbo in dom0: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 76316 97 116028 19 58278 9 60066 71 131953 9 282.8 0 Now for something different, iscsi multipath inside xen domU (e.g. domU gets 3 nics, each bridged to one of the nics on dom0): ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 64011 97 196484 49 91937 33 54583 81 160031 33 531.6 0 KVM guest with iscsi multipath, same bridging setup with 3 tap devices but *NO* jumbo frames: ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: KVM incompatible with multipath? 2009-06-27 22:42 ` infernix @ 2009-06-27 23:00 ` infernix 0 siblings, 0 replies; 4+ messages in thread From: infernix @ 2009-06-27 23:00 UTC (permalink / raw) To: kvm Whoops, please disregard the previous mail; I hit ctrl-enter while still composing, sorry. Anthony Liguori wrote: > You need to create a partition table and setup grub in order to be able to use something as -hda. You don't get that automatically with debootstrap. Although I didn't include that in my mail, I did configure partitions and debootstrapped Lenny on part1. But that's not what's important anymore, so far I'm unable to reproduce the data corruption. Probably some glitch in the matrix. This post kind of grew into a performance test of virtio block and net for Xen 3.2.1 and KVM 87, specifically when using multipath IO to an Equallogic iSCSI box. I added Xen 3.2.1 to the same box, installed a domU and ran some performance tests. Afterwards I retried KVM and I didn't experience the problems I had before. I boot the same multipathed disk that I used for Xen with 2.6.27.25 (+ kvm-87 modules in initrd). You can find the script below. It actually boots now and I have no problems whatsoever. Note that this kernel I built works for xen domU, Linux native and for KVM guest. I've ran some bonnie++ tests, see below for the results. At first I had thought that using iSCSI and multipath on the host, whether KVM or Xen, would be the fastest. So I ran a bunch of tests against that. Whats interesting in these results is that KVM guest has much lower sequential block output than on the host kernel, but much better sequential input. The latter is probably due to caching and buffers in the KVM host kernel. Setting cache=writeback improves both, but still block output is ~75MB/sec slower than on the host. It seems that KVM guest write performance is CPU limited. Any advice on how to get better write speeds is highly appreciated. Afterwards, I decided to try multipath+iscsi in the guest. It turns out that Xen is the big winner in the end, but with a catch. The highest performance was measured when using iscsi+multipath in the Xen domU with jumbo frames. Thus the rest of the results with the disk being mapped in the host aren't relevant at this point. Iscsi multipath to an Equallogic box with 3x1gbit, inside a xen domU (e.g. domU gets 3 nics, each bridged to one of the nics on dom0): ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 64011 97 196484 49 91937 33 54583 81 160031 33 531.6 0 KVM guest with iscsi multipath, same bridging setup with 3 tap devices but *NO* jumbo frames: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 48326 72 104770 39 57539 26 42610 79 123867 43 547.5 1 This is pretty good performance for Xen, and at 1500 byte MTU it's definitely not bad for KVM either. The catch, however, is that I was unable to turn on Jumbo frames in the KVM guest. I've applied the patch in https://bugzilla.redhat.com/show_bug.cgi?id=473114 to the guest kernel and changed some 4096 byte long buffers in vl.c and net.c to 16384, but I still couldn't configure the MTU on the guest to anything above 1500: test:~# ifconfig eth3 mtu 1500 test:~# ifconfig eth3 mtu 1501 SIOCSIFMTU: Invalid argument So my question is, how can I get virtio to play nice and accept a 9000 byte MTU? I can then finish my performance comparison and perhaps extend it with iometer. Thanks! Regards, infernix bonnie++ output with Xen domU (kernel 2.6.27.25) which has the multipathed disk configured as root disk xvda1: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 77918 98 115400 18 56323 8 60034 69 122466 7 422.4 0 Here's bonnie++ output with KVM guest (kernel 2.6.27.25) using virtio disk vda1 mapped to /dev/mapper/multipath disk: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 59461 95 97238 17 60831 12 62181 95 205094 25 656.4 2 Here's host performance with Xen dom0 (kernel 2.6.26-2-amd64 from lenny) directly on the /dev/mapper/multipath disk: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 66519 93 172569 50 85457 35 59671 86 164754 40 451.8 0 And here's native performance with 2.6.27.25 (no xen) directly on the /dev/mapper/multipath disk: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 71796 98 182818 53 85511 29 61484 79 165302 31 668.4 1 The above tests were performed with blockdev --setra 16384 and MTU 1500. Here's native Linux host with jumbo frames doing direct io on /dev/mapper/multipath disk: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 73616 99 195577 48 96794 27 68845 84 201899 29 630.3 1 Here's KVM guest performance (with jumbo frames in the hosts iscsi interfaces), cache=writethrough with the multipath disk as /dev/vda1: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 60814 96 93222 15 64166 12 58015 94 258557 31 649.3 2 Here's KVM guest performance (with jumbo frames in the hosts iscsi interfaces), cache=writeback and bonnie size=2.5*host RAM with the multipath disk as /dev/vda1: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 52627 95 120630 23 100333 22 61271 94 284889 37 464.6 2 Xen domU (kernel 2.6.27.25) which has the multipathed disk configured as root disk xvda1 + jumbo frames in dom0: ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP 76316 97 116028 19 58278 9 60066 71 131953 9 282.8 0 KVM guest script: #!/bin/sh ( sleep 2s; for i in 0 4 5 6; do brctl addif br$i tap$i; ifconfig tap$i 0.0.0.0 up promisc; ifconfig tap$i mtu 9000; done ) & /usr/local/kvm/bin/qemu-system-x86_64 -m 2048 -localtime -curses \ -net nic,model=virtio,vlan=0 -net tap,vlan=0,ifname=tap0,script=/bin/true \ -net nic,model=virtio,vlan=4,macaddr=00:16:42:51:34:a0 -net tap,vlan=4,ifname=tap4,script=/bin/true \ -net nic,model=virtio,vlan=5,macaddr=00:16:42:51:34:a1 -net tap,vlan=5,ifname=tap5,script=/bin/true \ -net nic,model=virtio,vlan=6,macaddr=00:16:42:51:34:a2 -net tap,vlan=6,ifname=tap6,script=/bin/true \ -initrd /boot/initrd.img-2.6.27.25-001.core2 \ -kernel /boot/vmlinuz-2.6.27.25-001.core2 \ -append 'root=/dev/vda1' \ -drive file=/dev/mapper/36090a0383049a2ac41a4643f000070c2,if=virtio,boot=on,cache=writeback ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2009-06-27 23:00 UTC | newest] Thread overview: 4+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2009-06-27 15:57 KVM incompatible with multipath? infernix 2009-06-27 18:18 ` Anthony Liguori 2009-06-27 22:42 ` infernix 2009-06-27 23:00 ` infernix
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox