From mboxrd@z Thu Jan 1 00:00:00 1970 From: James Thomason Subject: Re: Guest Hang Bugs Date: Thu, 15 Jan 2009 14:49:30 -0800 Message-ID: References: <496F55EB.6040506@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit Cc: To: Avi Kivity Return-path: Received: from multiply.divide.org ([207.218.248.89]:60036 "EHLO multiply.divide.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932531AbZAOWte (ORCPT ); Thu, 15 Jan 2009 17:49:34 -0500 In-Reply-To: <496F55EB.6040506@redhat.com> Sender: kvm-owner@vger.kernel.org List-ID: On 1/15/09 7:27 AM, "Avi Kivity" wrote: > Cam you try adding clocksource=acpi_pm to the _guest_ kernel command line? Avi, I booted the guest with clocksource=acpi_pm and it has been running under heavy load for about 4 hours with kvm -smp 12. Considering that the guest previously would not even boot with kvm -smp 12, I think this is the root cause. As another data point, I was not able to trigger the bug when running a guest with all active cores on socket 0 only, but I was able to trigger it when putting half the cores on socket 1. So in a nutshell the kernel uses an unreliable clock source by default on SMP systems? Thanks for your help! Regards, James