From mboxrd@z Thu Jan 1 00:00:00 1970
From: Daniel Bareiro
Subject: Re: Swap usage with KVM
Date: Sun, 11 Jul 2010 16:12:27 -0300
Message-ID: <20100711191227.GD9267@defiant.freesoftware>
References: <20100711151257.GA13279@defiant.freesoftware>
Reply-To: dbareiro@gmx.net
Mime-Version: 1.0
Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="wxDdMuZNg1r63Hyj"
Cc: "hugh.dickins@tiscali.co.uk", Rik van Riel
To: KVM General
Return-path:
Received: from mail.gmx.net ([213.165.64.20]:59065 "HELO mail.gmx.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with SMTP id S1753983Ab0GKTMf (ORCPT ); Sun, 11 Jul 2010 15:12:35 -0400
Content-Disposition: inline
In-Reply-To: <20100711151257.GA13279@defiant.freesoftware>
Sender: kvm-owner@vger.kernel.org
List-ID:

--wxDdMuZNg1r63Hyj
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline

On Sunday, 11 July 2010 12:12:57 -0300,
Daniel Bareiro wrote:

> I have an installation with Debian GNU/Linux 5.0.4 amd64 with qemu-kvm
> 0.12.3 compiled from the source code obtained from the official KVM
> site, and Linux 2.6.32.12 compiled from the kernel.org source code.
> All this is installed on an HP ProLiant DL380 G6 with two quad-core
> Xeon E5530 processors and 16 GiB of RAM, which hosts two VMs with the
> following memory configuration:
>
> Hostname       | RAM
> ===============+===============
> Aps4           | 7 GiB
> Leela          | 7 GiB
> ===============+===============
> TOTAL          | 14 GiB
>
> Initially the host was created with a swap partition of 1 GiB, but
> today we found that swap usage had started to grow quickly.
> Therefore, as a contingency, we had to hot-add a logical volume of
> 1 GB of swap on the VMHost. Is this use of memory 'normal'?
>
> Here are the Nagios Service Log Entries for the VMHost:
>
> Event Start Time     Event End Time       Event Duration   Event/State Type         Event/State Information
> ---------------------------------------------------------------------------------------------------------------------------------
> 06-07-2010 00:00:00  07-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 100% free (956 MB out of 956 MB)
> 07-07-2010 00:00:00  08-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 100% free (956 MB out of 956 MB)
> 08-07-2010 00:00:00  08-07-2010 16:41:43  0d 16h 41m 43s   SERVICE OK (HARD)        SWAP OK - 100% free (956 MB out of 956 MB)
> 09-07-2010 00:00:00  10-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 99% free (939 MB out of 956 MB)
> 10-07-2010 00:00:00  11-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 79% free (754 MB out of 956 MB)
> 11-07-2010 00:00:00  11-07-2010 07:08:17  0d 7h 8m 17s     SERVICE OK (HARD)        SWAP OK - 51% free (482 MB out of 956 MB)
> 11-07-2010 07:08:17  11-07-2010 10:41:07  0d 3h 32m 50s    SERVICE WARNING (HARD)   SWAP WARNING - 29% free (272 MB out of 956 MB)
> 11-07-2010 10:41:07  11-07-2010 10:45:57  0d 0h 4m 50s     SERVICE CRITICAL (HARD)  SWAP CRITICAL - 9% free (83 MB out of 956 MB)
> ---------------------------------------------------------------------------------------------------------------------------------
>
> I'm not using qcow2 files. The /dev/cciss/c0d0p3 partition is a
> physical volume that holds the logical volumes used for the VMs'
> disks.
>
> The Nagios Service Log Entries for the VMs show no excessive use of
> swap in the time window when the problem occurred on the VMHost:
>
> Aps4:
>
> Event Start Time     Event End Time       Event Duration   Event/State Type         Event/State Information
> ---------------------------------------------------------------------------------------------------------------------------------
> 06-07-2010 00:00:00  07-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 100% free (2850 MB out of 2863 MB)
> 07-07-2010 00:00:00  08-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 98% free (2797 MB out of 2863 MB)
> 08-07-2010 00:00:00  08-07-2010 16:41:43  0d 16h 41m 43s   SERVICE OK (HARD)        SWAP OK - 99% free (2812 MB out of 2863 MB)
> 09-07-2010 00:00:00  10-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 98% free (2784 MB out of 2863 MB)
> 10-07-2010 00:00:00  11-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 97% free (2754 MB out of 2863 MB)
> 11-07-2010 00:00:00  11-07-2010 11:53:38  0d 11h 53m 38s+  SERVICE OK (HARD)        SWAP OK - 100% free (2839 MB out of 2863 MB)
> ---------------------------------------------------------------------------------------------------------------------------------
>
> Leela:
>
> Event Start Time     Event End Time       Event Duration   Event/State Type         Event/State Information
> ---------------------------------------------------------------------------------------------------------------------------------
> 06-07-2010 00:00:00  07-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 97% free (919 MB out of 956 MB)
> 07-07-2010 00:00:00  08-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 97% free (920 MB out of 956 MB)
> 08-07-2010 00:00:00  08-07-2010 16:41:43  0d 16h 41m 43s   SERVICE OK (HARD)        SWAP OK - 97% free (920 MB out of 956 MB)
> 08-07-2010 17:01:36  08-07-2010 17:04:16  0d 0h 2m 40s     SERVICE CRITICAL (HARD)  Connection refused by host
> 08-07-2010 17:04:16  09-07-2010 00:00:00  0d 6h 55m 44s    SERVICE OK (HARD)        SWAP OK - 97% free (921 MB out of 956 MB)
> 09-07-2010 00:00:00  10-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 97% free (921 MB out of 956 MB)
> 10-07-2010 00:00:00  11-07-2010 00:00:00  1d 0h 0m 0s      SERVICE OK (HARD)        SWAP OK - 97% free (921 MB out of 956 MB)
> 11-07-2010 00:00:00  11-07-2010 11:58:29  0d 11h 58m 29s+  SERVICE OK (HARD)        SWAP OK - 97% free (921 MB out of 956 MB)
> ---------------------------------------------------------------------------------------------------------------------------------
>
>
>
> Unfortunately I could not take much more data because we had to act
> quickly, but if you need any additional information, please feel free to
> ask.

Has anyone experienced something like this?

Avi? I remember that late last year there was a regression in Linux
swapping, and Rik and Hugh were working on it. Are you aware of any
such issue?

Thanks in advance for your reply.

Regards,
Daniel
-- 
Fingerprint: BFB3 08D6 B4D1 31B2 72B9 29CE 6696 BF1B 14E6 1D37
Powered by Debian GNU/Linux Lenny - Linux user #188.598

--wxDdMuZNg1r63Hyj
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: Digital signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)

iEYEARECAAYFAkw6F5sACgkQZpa/GxTmHTdZAQCggmsf5AWLb3YDPIkpG+pAvkDo
It8An1KX5lOo3J2rLjey4rpnTUIyoAKf
=iLzi
-----END PGP SIGNATURE-----

--wxDdMuZNg1r63Hyj--
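For anyone who wants the VMHost entries quoted above as numbers rather than percentages, here is a minimal Python sketch. It assumes the status text always has the "SWAP <STATE> - N% free (X MB out of Y MB)" shape seen in the log. The names SWAP_RE, swap_used_mb, used_series and HOST_LOG are made up for illustration and are not part of Nagios.

```python
import re

# Assumed shape of the status text, taken from the log entries in this mail,
# e.g. "SWAP WARNING - 29% free (272 MB out of 956 MB)".
SWAP_RE = re.compile(r"SWAP \w+ - (\d+)% free \((\d+) MB out of (\d+) MB\)")


def swap_used_mb(status):
    """Return swap in use (MB) for one status string, or None if it does not match."""
    match = SWAP_RE.search(status)
    if match is None:
        return None  # e.g. "Connection refused by host"
    free_mb = int(match.group(2))
    total_mb = int(match.group(3))
    return total_mb - free_mb


def used_series(entries):
    """Map status strings to used-swap values, skipping entries that do not match."""
    values = (swap_used_mb(entry) for entry in entries)
    return [value for value in values if value is not None]


# Distinct VMHost status strings copied from the log, oldest first.
HOST_LOG = [
    "SWAP OK - 100% free (956 MB out of 956 MB)",
    "SWAP OK - 99% free (939 MB out of 956 MB)",
    "SWAP OK - 79% free (754 MB out of 956 MB)",
    "SWAP OK - 51% free (482 MB out of 956 MB)",
    "SWAP WARNING - 29% free (272 MB out of 956 MB)",
    "SWAP CRITICAL - 9% free (83 MB out of 956 MB)",
]

if __name__ == "__main__":
    # Every entry in HOST_LOG matches, so the two sequences line up one-to-one.
    for entry, used in zip(HOST_LOG, used_series(HOST_LOG)):
        print(f"{used:4d} MB used  <- {entry}")
```

The same helper can be pointed at the Aps4 and Leela entries to compare the guests with the host over the same days.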