From mboxrd@z Thu Jan 1 00:00:00 1970 From: Philipp Hahn Subject: Re: qemu-kvm-0.13.0, 2.6.37.1 - after migration, some of guests got stuck Date: Mon, 21 Feb 2011 08:43:54 +0100 Message-ID: <201102210843.58844.hahn@univention.de> References: <20110220193235.GA3651@nik-comp.lan> Mime-Version: 1.0 Content-Type: multipart/signed; boundary="nextPart1378116.Bl1AtFxjda"; protocol="application/pgp-signature"; micalg=pgp-sha1 Content-Transfer-Encoding: 7bit Cc: KVM list , nikola.ciprich@linuxbox.cz To: Nikola Ciprich Return-path: Received: from mail.univention.de ([82.198.197.8]:2699 "EHLO mail.univention.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752276Ab1BUHoG (ORCPT ); Mon, 21 Feb 2011 02:44:06 -0500 In-Reply-To: <20110220193235.GA3651@nik-comp.lan> Sender: kvm-owner@vger.kernel.org List-ID: --nextPart1378116.Bl1AtFxjda Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Hello, I'm no KVM core developer, so please take my advise with a grain of caution. Am Sonntag 20 Februar 2011 20:32:35 schrieb Nikola Ciprich: > I've just migrated a bunch of guests from one node to another > due to upgrading host kernel from 2.6.37 to 2.6.37.1. > After that, some of guests started consuming 100% of CPU time and their os > seems to be stuck. > > Looks like all the stuck guest were running 2.6.32 with kvm paravirt > enabled. others running same as well as different kernels seem to be ok, > including windows guests. I also sometimes experience problems with migrated (in my case: suspended)= =20 VMs, see : They were e= ither=20 stuck in an interrupt storm or did not receive any interrupts any more (loo= ks=20 like some interrupts relates state gets not saved/restored properly). =46or me the following did help: 1. Suspend VM to disk 2. Restart VM with the -no-kvm-irqchip Option Perhaps you could test that as well. (An alternative would be to use qemus= =20 internal debugger or gdb to remote debug the VM and detect what your VMs ar= e=20 doing, especially "x/20i $rip" to dump the next 20 assembler instructions) Please be advised, that -no-kvm-irqchip opens a new can of worms and should= =20 only be used to diagnose the problem. You probably should also backup the=20 save file and use the -snapshot option to be able to repeatly test differen= t=20 strategies. Sincerely Philipp =2D-=20 Philipp Hahn Open Source Software Engineer hahn@univention.de Univention GmbH Linux for Your Business fon: +49 421 22 232- 0 Mary-Somerville-Str.1 28359 Bremen fax: +49 421 22 232-99 http://www.univention.de/ ** Besuchen Sie uns auf der CeBIT in Hannover ** ** Auf dem Univention Stand D36 in Halle 2 ** ** Vom 01. bis 05. M=E4rz 2011 ** --nextPart1378116.Bl1AtFxjda Content-Type: application/pgp-signature; name=signature.asc Content-Description: This is a digitally signed message part. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (GNU/Linux) iEYEABECAAYFAk1iF7oACgkQYPlgoZpUDjngzQCdEx1hq8yqefkMZErMrFw94r9N OioAoLcUr+OCozmMFIQrWtOeRMEol2sf =q52x -----END PGP SIGNATURE----- --nextPart1378116.Bl1AtFxjda--