From mboxrd@z Thu Jan 1 00:00:00 1970 From: Daniel Bareiro Subject: 'swapper Not tainted' on VM Date: Thu, 23 Jul 2009 12:26:32 -0300 Message-ID: <20090723152632.GA7400@defiant.freesoftware.org> Reply-To: dbareiro@gmx.net Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="17pEHd4RhPHOinZp" To: KVM General Return-path: Received: from mail.gmx.net ([213.165.64.20]:50872 "HELO mail.gmx.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with SMTP id S1752069AbZGWP0k (ORCPT ); Thu, 23 Jul 2009 11:26:40 -0400 Received: from defiant (defiant.freesoftware.org [10.1.0.65]) by hermes.freesoftware.org (Postfix) with ESMTP id EC3A31DB for ; Thu, 23 Jul 2009 12:24:08 -0300 (ART) Content-Disposition: inline Sender: kvm-owner@vger.kernel.org List-ID: --17pEHd4RhPHOinZp Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Hi all! I'm using KVM-62 on a host with Ubuntu Hardy Heron server amd64 installed from Ubuntu repositories with a productive VM running an application server. I am observing in the VM a 'swapper tainted' in some of the processors which cause that the application server is spontaneously restarted. Can this be due to some bug of KVM? The VM is running Debian GNU/Linux Lenny 5.0.2 with kernel 2.6.26-2-686-bigmem with 4 GiB of RAM, 4 CPUs and 'pci=3Dnoacpi' kernel option in order to avoid transmit timed out from network interface which turn it inaccessible to the rest of the network. The host machine has 2.6.24-19-server. Extracted data from the VM: /var/log/syslog.1: =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Jul 23 01:47:40 aps2 kernel: [44260.559685] BUG: soft lockup - CPU#3 stuck = for 2857s! [swapper:0] Jul 23 01:47:40 aps2 kernel: [44260.559685] Modules linked in: ipv6 loop pa= rport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev e= xt3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core a= ta_generic libata scsi_mod dock thermal processor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.559685] Jul 23 01:47:40 aps2 kernel: [44260.559685] Pid: 0, comm: swapper Not taint= ed (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP: 0060:[] EFLAGS: = 00000202 CPU: 3 Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP is at run_timer_softirq+0x1= 0b/0x17c Jul 23 01:47:40 aps2 kernel: [44260.559685] EAX: f748bf28 EBX: f752f6e0 ECX= : c02c8b32 EDX: f748bf28 Jul 23 01:47:40 aps2 kernel: [44260.559685] ESI: f0d05b5c EDI: f7482000 EBP= : c013067e ESP: f748bf1c Jul 23 01:47:40 aps2 kernel: [44260.559685] DS: 007b ES: 007b FS: 00d8 GS:= 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.559685] CR0: 8005003b CR2: b7d7d050 CR3= : 00381000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR0: 00000000 DR1: 00000000 DR2= : 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? process_timeout= +0x0/0x5 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? __do_softirq+0x= 66/0xd3 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? do_softirq+0x45= /0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? irq_exit+0x35/0= x67 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? smp_apic_timer_= interrupt+0x6b/0x75 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? default_idle+0x= 0/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? apic_timer_inte= rrupt+0x28/0x30 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? default_idle+0x= 0/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? native_safe_hal= t+0x2/0x3 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? default_idle+0x= 2d/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? cpu_idle+0xab/0= xcb Jul 23 01:47:40 aps2 kernel: [44260.559685] =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Jul 23 01:47:40 aps2 kernel: [44260.559685] BUG: soft lockup - CPU#1 stuck = for 2857s! [swapper:0] Jul 23 01:47:40 aps2 kernel: [44260.559685] Modules linked in: ipv6 loop pa= rport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev e= xt3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core ata_generic libata scsi_mod dock thermal proc= essor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.559685] Jul 23 01:47:40 aps2 kernel: [44260.559685] Pid: 0, comm: swapper Not taint= ed (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP: 0060:[] EFLAGS: = 00000246 CPU: 1 Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP is at native_safe_halt+0x2/= 0x3 Jul 23 01:47:40 aps2 kernel: [44260.559685] EAX: f7474000 EBX: c0107656 ECX= : 0304e000 EDX: 007def2c Jul 23 01:47:40 aps2 kernel: [44260.559685] ESI: 00000001 EDI: 00000000 EBP= : 00000000 ESP: f7475fa8 Jul 23 01:47:40 aps2 kernel: [44260.559685] DS: 007b ES: 007b FS: 00d8 GS:= 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.559685] CR0: 8005003b CR2: 08186bf1 CR3= : 35167000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR0: 00000000 DR1: 00000000 DR2= : 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] default_idle+0x2d= /0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.559685] =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Jul 23 01:47:40 aps2 kernel: [44260.559685] BUG: soft lockup - CPU#2 stuck = for 2857s! [swapper:0] Jul 23 01:47:40 aps2 kernel: [44260.565670] Modules linked in: ipv6 loop pa= rport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev e= xt3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core ata_generic libata scsi_mod dock thermal proc= essor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.565670] Jul 23 01:47:40 aps2 kernel: [44260.565670] Pid: 0, comm: swapper Not taint= ed (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.565670] EIP: 0060:[] EFLAGS: = 00000246 CPU: 2 Jul 23 01:47:40 aps2 kernel: [44260.565670] EIP is at native_safe_halt+0x2/= 0x3 Jul 23 01:47:40 aps2 kernel: [44260.565670] EAX: f747e000 EBX: c0107656 ECX= : 03059000 EDX: 007def2c Jul 23 01:47:40 aps2 kernel: [44260.565670] ESI: 00000002 EDI: 00000000 EBP= : 00000000 ESP: f747ffa8 Jul 23 01:47:40 aps2 kernel: [44260.565670] DS: 007b ES: 007b FS: 00d8 GS:= 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.565670] CR0: 8005003b CR2: 09e89be7 CR3= : 379bf000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.565670] DR0: 00000000 DR1: 00000000 DR2= : 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.565670] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.565670] [] default_idle+0x2d= /0x53 Jul 23 01:47:40 aps2 kernel: [44260.565670] [] cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.565670] =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D /var/log/messages: =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Jul 23 01:47:40 aps2 kernel: [44260.559685] Modules linked in: ipv6 loop pa= rport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev e= xt3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core a= ta_generic libata scsi_mod dock thermal processor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.559685] Jul 23 01:47:40 aps2 kernel: [44260.559685] Pid: 0, comm: swapper Not taint= ed (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP: 0060:[] EFLAGS: = 00000202 CPU: 3 Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP is at run_timer_softirq+0x1= 0b/0x17c Jul 23 01:47:40 aps2 kernel: [44260.559685] EAX: f748bf28 EBX: f752f6e0 ECX= : c02c8b32 EDX: f748bf28 Jul 23 01:47:40 aps2 kernel: [44260.559685] ESI: f0d05b5c EDI: f7482000 EBP= : c013067e ESP: f748bf1c Jul 23 01:47:40 aps2 kernel: [44260.559685] DS: 007b ES: 007b FS: 00d8 GS:= 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.559685] CR0: 8005003b CR2: b7d7d050 CR3= : 00381000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR0: 00000000 DR1: 00000000 DR2= : 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? process_timeout= +0x0/0x5 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? __do_softirq+0x= 66/0xd3 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? do_softirq+0x45= /0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? irq_exit+0x35/0= x67 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? smp_apic_timer_= interrupt+0x6b/0x75 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? default_idle+0x= 0/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? apic_timer_inte= rrupt+0x28/0x30 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? default_idle+0x= 0/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? native_safe_hal= t+0x2/0x3 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? default_idle+0x= 2d/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] ? cpu_idle+0xab/0= xcb Jul 23 01:47:40 aps2 kernel: [44260.559685] =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Jul 23 01:47:40 aps2 kernel: [44260.559685] Modules linked in: ipv6 loop pa= rport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev e= xt3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core a= ta_generic libata scsi_mod dock thermal processor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.559685] Jul 23 01:47:40 aps2 kernel: [44260.559685] Pid: 0, comm: swapper Not taint= ed (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP: 0060:[] EFLAGS: = 00000246 CPU: 1 Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP is at native_safe_halt+0x2/= 0x3 Jul 23 01:47:40 aps2 kernel: [44260.559685] EAX: f7474000 EBX: c0107656 ECX= : 0304e000 EDX: 007def2c Jul 23 01:47:40 aps2 kernel: [44260.559685] ESI: 00000001 EDI: 00000000 EBP= : 00000000 ESP: f7475fa8 Jul 23 01:47:40 aps2 kernel: [44260.559685] DS: 007b ES: 007b FS: 00d8 GS:= 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.559685] CR0: 8005003b CR2: 08186bf1 CR3= : 35167000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR0: 00000000 DR1: 00000000 DR2= : 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] default_idle+0x2d= /0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [] cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.559685] =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Jul 23 01:47:40 aps2 kernel: [44260.565670] Modules linked in: ipv6 loop pa= rport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev e= xt3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core ata_generic libata scsi_mod dock thermal proc= essor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.565670] Jul 23 01:47:40 aps2 kernel: [44260.565670] Pid: 0, comm: swapper Not taint= ed (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.565670] EIP: 0060:[] EFLAGS: = 00000246 CPU: 2 Jul 23 01:47:40 aps2 kernel: [44260.565670] EIP is at native_safe_halt+0x2/= 0x3 Jul 23 01:47:40 aps2 kernel: [44260.565670] EAX: f747e000 EBX: c0107656 ECX= : 03059000 EDX: 007def2c Jul 23 01:47:40 aps2 kernel: [44260.565670] ESI: 00000002 EDI: 00000000 EBP= : 00000000 ESP: f747ffa8 Jul 23 01:47:40 aps2 kernel: [44260.565670] DS: 007b ES: 007b FS: 00d8 GS:= 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.565670] CR0: 8005003b CR2: 09e89be7 CR3= : 379bf000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.565670] DR0: 00000000 DR1: 00000000 DR2= : 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.565670] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.565670] [] default_idle+0x2d= /0x53 Jul 23 01:47:40 aps2 kernel: [44260.565670] [] cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.565670] =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Application server log: =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D INFO | jvm 1 | 2009/07/23 00:56:31 | : 899860K->64068K(943744K), 0.086= 4870 secs] 2231330K->1402078K(2516608K), 0.0868390 secs] INFO | jvm 1 | 2009/07/23 00:56:32 | java.lang.NullPointerException INFO | jvm 1 | 2009/07/23 00:56:32 | java.lang.NullPointerException INFO | jvm 1 | 2009/07/23 00:56:32 | java.lang.NullPointerException ERROR | wrapper | 2009/07/23 01:47:40 | JVM appears hung: Timed out waiti= ng for signal from JVM. ERROR | wrapper | 2009/07/23 01:47:40 | JVM did not exit on request, term= inated INFO | wrapper | 2009/07/23 01:47:40 | JVM exited on its own while waiti= ng to kill the application. STATUS | wrapper | 2009/07/23 01:47:40 | JVM exited in response to signal = SIGKILL (9). STATUS | wrapper | 2009/07/23 01:47:44 | Launching a JVM... INFO | jvm 2 | 2009/07/23 01:47:46 | Wrapper (Version 3.2.3) http://wr= apper.tanukisoftware.org INFO | jvm 2 | 2009/07/23 01:47:46 | Copyright 1999-2006 Tanuki Soft= ware, Inc. All Rights Reserved. INFO | jvm 2 | 2009/07/23 01:47:46 | INFO | jvm 2 | 2009/07/23 01:47:47 | Enter 's' to shutdown, 'r' to res= tart... KVM Parameters on host machine: =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D kvm -hda /dev/vm/aps2-raiz -hdb /dev/vm/aps2-space -hdc \ /dev/vm/aps2-index -hdd /dev/vm/aps2-cache -m 4096 -smp 4 -net \ nic,vlan=3D0,macaddr=3D00:16:3E:00:00:27 -net tap -daemonize -vnc :5 \ -k es -localtime -monitor telnet:localhost:4005,server,nowait \ -serial telnet:localhost:4045,server,nowait Thanks in advance for your reply: Regards, Daniel --=20 Fingerprint: BFB3 08D6 B4D1 31B2 72B9 29CE 6696 BF1B 14E6 1D37 Powered by Debian GNU/Linux Squeeze - Linux user #188.598 --17pEHd4RhPHOinZp Content-Type: application/pgp-signature; name="signature.asc" Content-Description: Digital signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (GNU/Linux) iEYEARECAAYFAkpogSgACgkQZpa/GxTmHTfiLQCbBJLvDhATkASVGZodDCxGgeGL nUgAnjWtQ64v3uCOHcTB41Rms1t4UCUq =TYaL -----END PGP SIGNATURE----- --17pEHd4RhPHOinZp--