From: Jan Kiszka <jan.kiszka@siemens.com>
To: "hampus.lind@ongame.com" <hampus.lind@ongame.com>
Cc: "kvm@vger.kernel.org" <kvm@vger.kernel.org>
Subject: Re: KVM hypervisor - hardware issues? - Of topic
Date: Wed, 19 Sep 2012 13:32:40 +0200
Message-ID: <5059AD58.1060000@siemens.com>
In-Reply-To: <BBF41947B7E7F04AA8F46C03A3FFA49105AC4100C6@MBX.ongame.com>
On 2012-09-19 12:42, hampus.lind@ongame.com wrote:
> Hi,
>
> We are facing big problems with our ESXi 5 and HP DL585 G7 environment. Several of our HP servers reboot or hang randomly without leaving any trace in the logs. HP hardware diagnostics show no hardware errors, and VMware support is clueless.
>
> While searching for a cause, I found that there seem to be a lot of strange reboot issues with ESX across various server vendors. A quick Google search for "esxi random reboot" returns a long list of both HP and Dell customers facing similar issues.
>
> So we continue to dig, and there seem to be a lot of issues with driver and firmware versions going back and forth between VMware and the server vendors... A real mess...
>
> Trying to narrow down the suspects, I wonder whether KVM users see the same random hypervisor reboots/hangs when running on HP DL servers (or Dell). If large, high-performance KVM environments run on those servers without reboot problems, then we can have a serious talk with VMware about these issues; otherwise we are trapped between vendors pointing fingers at each other.
>
> We have six HP DL585 G7 servers, all of which randomly reboot or hang now and then.
I recall a report of random hard lock-ups on an HP server, I think
with Xen, about a year ago. I do know that my Fujitsu Celsius H700
notebook locks up hard under certain (uncommon) guest workloads in KVM,
most probably due to bugs in the SMI handling code of the BIOS. Maybe
you are facing a similar issue.
To check the correlation between lock-ups and SMIs, you can try to
disable SMIs (warning: this may have unwanted or even dangerous side
effects for your hardware) using this tool and your chipset manual's
description of SMI control:
http://git.kiszka.org/?p=smictrl.git
In my case, the correlation is fairly telling. See also [1].
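A less invasive first step than disabling SMIs is to watch how often they fire while the suspect workload runs. The sketch below is not part of smictrl and is not taken from this thread. It assumes a Linux host with the `msr` kernel module loaded, root privileges, and an Intel CPU that implements MSR_SMI_COUNT (register 0x34, Nehalem and later). That register is Intel-specific, so whether a given server has it is an assumption to verify first. The function names are hypothetical.

```python
import os
import struct

# Assumption: Intel-only register (MSR_SMI_COUNT); bits 31:0 hold the count.
MSR_SMI_COUNT = 0x34


def decode_msr(raw: bytes) -> int:
    """Decode the 8 bytes returned by the msr device as a little-endian u64."""
    if len(raw) != 8:
        raise ValueError(f"expected 8 bytes, got {len(raw)}")
    return struct.unpack("<Q", raw)[0]


def read_smi_count(cpu: int = 0) -> int:
    """Read the SMI counter of one CPU via /dev/cpu/<cpu>/msr (needs root)."""
    fd = os.open(f"/dev/cpu/{cpu}/msr", os.O_RDONLY)
    try:
        # The file offset selects the MSR number; each read returns 8 bytes.
        return decode_msr(os.pread(fd, 8, MSR_SMI_COUNT)) & 0xFFFFFFFF
    finally:
        os.close(fd)


def smi_delta(before: int, after: int) -> int:
    """Number of SMIs between two samples, tolerating 32-bit wraparound."""
    return (after - before) & 0xFFFFFFFF
```

Sampling `read_smi_count()` before and after starting the guest workload and passing both values to `smi_delta()` gives the number of SMIs in that interval. A burst of SMIs shortly before a lock-up would support the SMI theory, although a hard hang may prevent the final sample from being taken at all.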
Jan
[1] http://permalink.gmane.org/gmane.comp.emulators.kvm.devel/60326
--
Siemens AG, Corporate Technology, CT RTC ITP SDP-DE
Corporate Competence Center Embedded Linux