* stability issue with KVM using SMP
@ 2008-09-16 8:25 James Stevens
2008-09-16 11:46 ` James Stevens
0 siblings, 1 reply; 2+ messages in thread
From: James Stevens @ 2008-09-16 8:25 UTC (permalink / raw)
To: kvm
Summary - I'm getting stability issues running SMP. The guest will run
for, typically, 6 to 12 hours before causing a problem.
The first time it happened the guest process that was running SMP went
Zombie, but in a spin loop clocking huge amount of CPU time, and I had
to reboot the host to clear it. Needless to say I had no access to the
guest's qmeu console (Ctrl-Alt-2) and a "kill -9 <pid>" from the host
had no effect.
However, the kernel had been compiled with a 64Gb memory model and I've
had problems with that in the past - in fact that's the first time I've
seen a kernel compiled with the 64Gb memory model actually boot under KVM.
So I replaced the kernel and the next time the guest simply locked up -
I couldn't access the guest o/s at all (no "ping", no linux console
response etc - in some cases there was a picture on the console [login
prompt] but it didn't respond to any key presses), but the qemu console
still worked. I tried a "system_powerdown" and that failed so I did a
"system_reset" and that rebooted the client.
I have now switched off SMP and the guest is working fine (just SLOW).
I've had quite a few non-SMP linux guests running on this host for some
time, using various kernels, and not seen a problem. Its only since I
tried to introduce SMP that its all gone pear shaped.
> # what cpu model (examples: Intel Core Duo, Intel Core 2 Duo, AMD
> Opteron 2210). See /proc/cpuinfo if you're not sure.
dual "Quad-Core AMD Opteron(tm) Processor 2352" with 32Gb RAM
> # what kvm version you are using. If you're using git directly,
> provide the output of 'git describe'.
"kvm-72"
> # the host kernel version
"2.6.26.2" running on Slackware 12
> # what host kernel arch you are using (i386 or x86_64)
# CONFIG_64BIT is not set
CONFIG_X86_32=y
# CONFIG_X86_64 is not set
CONFIG_X86=y
Should I be using 64 bit ?
> # what guest you are using, including OS type (Linux, Windows,
> Solaris, etc.), bitness (32 or 64), kernel version
Slackware 11, but with a new standard (no patches) kernel - 2.6.26.2
I also have an older Slackware 7 based guest with a 2.4.29 SMP kernel
that sees the "lock-up, with qemu console working" problem, but not the
Zombie issue. The 2.4.29 kernel uses a 1Gb memory model.
> # the qemu command line you are using to start the guest
/usr/local/bin/qemu-system-x86_64 -hda /opt/kvm/machine_14/vdisk1.img \
-m 1024 -vnc :14 -k en-gb -smp 4 \
-net nic,model=e1000,macaddr=52:54:00:14:00:00 -net tap \
-net nic,vlan=2,model=e1000,macaddr=52:54:00:14:00:01 \
-net tap,vlan=2,ifname=tap55 \
> # whether the problem goes away if using the -no-kvm-irqchip or
> -no-kvm-pit switch.
Not tried
> # whether the problem also appears with the -no-kvm switch.
I don't use this switch.
James
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: stability issue with KVM using SMP
2008-09-16 8:25 stability issue with KVM using SMP James Stevens
@ 2008-09-16 11:46 ` James Stevens
0 siblings, 0 replies; 2+ messages in thread
From: James Stevens @ 2008-09-16 11:46 UTC (permalink / raw)
To: kvm
Further to my previous report..
I have just noticed that, where the host is running on the dual
"Quad-Core AMD Opteron(tm) Processor 2352" (family 16, model 2) the
kernels in the *guest* machines (usually 2.6.26) are reporting "WARNING:
This combination of AMDprocessors is not suitable for SMP." - presumably
this is why my SMP isn't stable!
The kernel in that host machine (2.6.26.2) does *not* report the messages.
However, where the host is a single "AMD Athlon(tm) 64 X2 Dual Core
Processor 4000+" (family 15, model 107) the guest kernels do not report
the WARNING above - but I've not actually run any SMP guest on that
host, so don't know if it would work or not.
The warning seems to be due to a processor capability test in
"arch/x86/kernel/smpboot.c" - "add_taint(TAINT_UNSAFE_SMP);"
Its not clear to me why one guess is failing this test and the other
passing, because both guests are seeing "QEMU Virtual CPU version
0.9.1", family 6, model 2 (according to "/proc/cpuinfo").
Although most kernels on the Athlon host are running 2.6.21 not 2.6.26
James
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2008-09-16 11:47 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-09-16 8:25 stability issue with KVM using SMP James Stevens
2008-09-16 11:46 ` James Stevens
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox