From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@bugzilla.kernel.org Subject: [Bug 197861] Shutting down a VM with Kernel 4.14 will sometime hang and a reboot is the only way to recover. Date: Wed, 10 Jan 2018 13:21:14 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8BIT To: kvm@kernel.org Return-path: Received: from mail.wl.linuxfoundation.org ([198.145.29.98]:56104 "EHLO mail.wl.linuxfoundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752093AbeAJNVQ (ORCPT ); Wed, 10 Jan 2018 08:21:16 -0500 Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id DC96028571 for ; Wed, 10 Jan 2018 13:21:15 +0000 (UTC) In-Reply-To: Sender: kvm-owner@vger.kernel.org List-ID: https://bugzilla.kernel.org/show_bug.cgi?id=197861 bubez (michele.mase@gmail.com) changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |michele.mase@gmail.com --- Comment #37 from bubez (michele.mase@gmail.com) --- Host: ubuntu 17.10, vanilla kernel 4.14.12, nested virtualization and vhost_net workaround aplied options kvm_intel nested=1 options vhost_net experimental_zcopytx=0 Problem: can always be reproduced on redhat/centos7.x, after about 8 hour of guest uptime, guest machine hangs How to reproduce: boot a centos/redhat7.x guest vm (a minimal installation should be ok), and wait about 8hours, the period may vary. You can give a tail command on syslog to see some detailed message (for example tail -f /var/log/messages) Guest kernel: 3.10.0-693.11.6.el7.x86_64 Syslog output: /var/log/messages Jan 10 12:56:03 kvm178 dbus[756]: [system] Activating via systemd: service name='org.freedesktop.nm_dispatcher' unit='dbus-org.freedesktop.nm-dispatcher.service' Jan 10 12:56:03 kvm178 dhclient[911]: bound to 192.168.122.178 -- renewal in 1257 seconds. Jan 10 12:56:28 kvm178 dbus[756]: [system] Failed to activate service 'org.freedesktop.nm_dispatcher': timed out Jan 10 12:56:28 kvm178 dbus-daemon: dbus[756]: [system] Failed to activate service 'org.freedesktop.nm_dispatcher': timed out Jan 10 12:58:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 12:58:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 12:58:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 12:58:40 kvm178 kernel: Call Trace: Jan 10 12:58:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 12:58:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 12:58:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 12:58:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 12:58:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 12:58:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 12:58:40 kvm178 kernel: [] async_page_fault+0x28/0x30 Jan 10 13:00:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 13:00:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 13:00:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 13:00:40 kvm178 kernel: Call Trace: Jan 10 13:00:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 13:00:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 13:00:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 13:00:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 13:00:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 13:00:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 13:00:40 kvm178 kernel: [] async_page_fault+0x28/0x30 Jan 10 13:01:26 kvm178 systemd-logind: Failed to start session scope session-23.scope: Connection timed out Jan 10 13:02:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 13:02:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 13:02:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 13:02:40 kvm178 kernel: Call Trace: Jan 10 13:02:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 13:02:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 13:02:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 13:02:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 13:02:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 13:02:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 13:02:40 kvm178 kernel: [] async_page_fault+0x28/0x30 Jan 10 13:04:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 13:04:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 13:04:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 13:04:40 kvm178 kernel: Call Trace: Jan 10 13:04:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 13:04:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 13:04:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 13:04:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 13:04:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 13:04:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 13:04:40 kvm178 kernel: [] async_page_fault+0x28/0x30 Jan 10 13:06:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 13:06:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 13:06:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 13:06:40 kvm178 kernel: Call Trace: Jan 10 13:06:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 13:06:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 13:06:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 13:06:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 13:06:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 13:06:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 13:06:40 kvm178 kernel: [] async_page_fault+0x28/0x30 Jan 10 13:08:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 13:08:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 13:08:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 13:08:40 kvm178 kernel: Call Trace: Jan 10 13:08:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 13:08:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 13:08:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 13:08:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 13:08:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 13:08:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 13:08:40 kvm178 kernel: [] async_page_fault+0x28/0x30 Jan 10 13:10:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 13:10:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 13:10:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 13:10:40 kvm178 kernel: Call Trace: Jan 10 13:10:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 13:10:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 13:10:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 13:10:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 13:10:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 13:10:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 13:10:40 kvm178 kernel: [] async_page_fault+0x28/0x30 Jan 10 13:12:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 13:12:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 13:12:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 13:12:40 kvm178 kernel: Call Trace: Jan 10 13:12:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 13:12:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 13:12:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 13:12:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 13:12:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 13:12:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 13:12:40 kvm178 kernel: [] async_page_fault+0x28/0x30 Jan 10 13:14:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 13:14:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 13:14:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 13:14:40 kvm178 kernel: Call Trace: Jan 10 13:14:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 13:14:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 13:14:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 13:14:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 13:14:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 13:14:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 13:14:40 kvm178 kernel: [] async_page_fault+0x28/0x30 Jan 10 13:16:40 kvm178 kernel: INFO: task systemd:1 blocked for more than 120 seconds. Jan 10 13:16:40 kvm178 kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 10 13:16:40 kvm178 kernel: systemd D ffff88004d1d8000 0 1 0 0x00000000 Jan 10 13:16:40 kvm178 kernel: Call Trace: Jan 10 13:16:40 kvm178 kernel: [] schedule+0x29/0x70 Jan 10 13:16:40 kvm178 kernel: [] kvm_async_pf_task_wait+0x1df/0x230 Jan 10 13:16:40 kvm178 kernel: [] ? wake_up_atomic_t+0x30/0x30 Jan 10 13:16:40 kvm178 kernel: [] ? error_swapgs+0x61/0x18d Jan 10 13:16:40 kvm178 kernel: [] ? error_swapgs+0x150/0x18d Jan 10 13:16:40 kvm178 kernel: [] do_async_page_fault+0x96/0xd0 Jan 10 13:16:40 kvm178 kernel: [] async_page_fault+0x28/0x30 .... guest died, guest cpu 100%, hard reset on guest needed. Guests with redhat/centos6.x (kernel 2.6.32-696.18.7.el6.x86_64) and windows10 doesn't have problems. Hope this could help. -- You are receiving this mail because: You are watching the assignee of the bug.