From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:59366) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Xu2k2-0005ip-Gd for qemu-devel@nongnu.org; Thu, 27 Nov 2014 12:20:36 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Xu2jw-0007sN-AN for qemu-devel@nongnu.org; Thu, 27 Nov 2014 12:20:30 -0500 Received: from mx1.redhat.com ([209.132.183.28]:42981) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Xu2jw-0007sJ-2j for qemu-devel@nongnu.org; Thu, 27 Nov 2014 12:20:24 -0500 Message-ID: <54775D4A.8080709@redhat.com> Date: Thu, 27 Nov 2014 18:20:10 +0100 From: Paolo Bonzini MIME-Version: 1.0 References: <33183CC9F5247A488A2544077AF1902086E041A5@SZXEMA503-MBS.china.huawei.com> In-Reply-To: <33183CC9F5247A488A2544077AF1902086E041A5@SZXEMA503-MBS.china.huawei.com> Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit Subject: Re: [Qemu-devel] [BUG] Redhat-6.4_64bit-guest kernel panic with cpu-passthrough and guest numa List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: "Gonglei (Arei)" , "qemu-devel@nongnu.org" Cc: "Huangpeng (Peter)" , "wangxin (U)" , "Huangweidong (C)" , "benoit@irqsave.net" , "Herongguang (Stephen)" On 27/11/2014 14:00, Gonglei (Arei) wrote: >> >> Running a redhat-6.4-64bit (kernel 2.6.32-358.el6.x86_64) or elder guest on >> qemu-2.1, with kvm enabled and -cpu host, non default cpu-topology and guest >> numa >> I'm seeing a reliable kernel panic from the guest shortly after boot. It is >> happening in >> find_busiest_group(). >> >> We also found it happend since commit >> 787aaf5703a702094f395db6795e74230282cd62 by git bisect. >> >> The reproducer: >> >> (1) full qemu cmd line: >> qemu-system-x86_64 -machine pc-i440fx-2.1,accel=kvm,usb=off \ >> -cpu host -m 16384 \ >> -smp 16,sockets=2,cores=4,threads=2 \ >> -object memory-backend-ram,size=8192M,id=ram-node0 \ >> -numa node,nodeid=0,cpus=0-7,memdev=ram-node0 \ >> -object memory-backend-ram,size=8192M,id=ram-node1 \ >> -numa node,nodeid=1,cpus=8-15,memdev=ram-node1 \ >> -boot c -drive file=/data/wxin/vm/redhat_6.4_64 \ >> -vnc 0.0.0.0:0 -device >> cirrus-vga,id=video0,vgamem_mb=8,bus=pci.0,addr=0x1.0x4 \ >> -msg timestamp=on >> >> (2)the guest kernel messages: Can you find what line of kernel/sched.c it is? Thanks, Paolo >> divide error: 0000 [#1] SMP >> last sysfs file: >> CPU 0 >> Modules linked in: >> >> Pid: 1, comm: swapper Not tainted 2.6.32-358.el6.x86_64 #1 QEMU Standard >> PC (i440FX + PIIX, 1996) >> RIP: 0010:[] [] >> find_busiest_group+0x55c/0x9f0 >> RSP: 0018:ffff88023c85f9e0 EFLAGS: 00010046 >> RAX: 0000000000100000 RBX: ffff88023c85fbdc RCX: 0000000000000000 >> RDX: 0000000000000000 RSI: 0000000000000010 RDI: 0000000000000010 >> RBP: ffff88023c85fb50 R08: ffff88023ca16c10 R09: 0000000000000000 >> R10: 0000000000000001 R11: 0000000000000000 R12: 00000000ffffff01 >> R13: 0000000000016700 R14: ffffffffffffffff R15: 0000000000000000 >> FS: 0000000000000000(0000) GS:ffff880028200000(0000) >> knlGS:0000000000000000 >> CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b >> CR2: 0000000000000000 CR3: 0000000001a85000 CR4: 00000000000407f0 >> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 >> DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 >> Process swapper (pid: 1, threadinfo ffff88023c85e000, task ffff88043d27c040) >> Stack: >> ffff88023c85faf0 ffff88023c85fa60 ffff88023c85fbc8 0000000200000000 >> 0000000100000000 ffff880028210b60 0000000100000001 >> 0000000000000008 >> 0000000000016700 0000000000016700 ffff88023ca16c00 >> 0000000000016700 >> Call Trace: >> [] thread_return+0x398/0x76e >> [] schedule_timeout+0x215/0x2e0 >> [] ? enqueue_entity+0x125/0x410 >> [] wait_for_common+0x123/0x180 >> [] ? default_wake_function+0x0/0x20 >> [] wait_for_completion+0x1d/0x20 >> [] kthread_create+0x99/0x120 >> [] ? worker_thread+0x0/0x2a0 >> [] ? alternate_node_alloc+0xc9/0xe0 >> [] create_workqueue_thread+0x59/0xd0 >> [] ? mutex_lock+0x1e/0x50 >> [] __create_workqueue_key+0x14d/0x200 >> [] init_workqueues+0x9f/0xb1 >> [] kernel_init+0x25e/0x2fe >> [] child_rip+0xa/0x20 >> [] ? kernel_init+0x0/0x2fe >> [] ? child_rip+0x0/0x20 >> Code: 8b b5 b0 fe ff ff 48 8b bd b8 fe ff ff e8 9d 85 ff ff 0f 1f 44 00 00 48 8b 95 e0 >> fe ff ff 48 8b 45 a8 8b 4a 08 48 c1 e0 0a 31 d2 <48> f7 f1 48 8b 4d b0 48 89 45 a0 >> 31 c0 48 85 c9 74 0c 48 8b 45 >> RIP [] find_busiest_group+0x55c/0x9f0 >> RSP >> divide error: 0000 [#2] >> ---[ end trace d7d20afc6dd05e71 ]---