From mboxrd@z Thu Jan 1 00:00:00 1970
Message-ID: <547D34EF.4010103@huawei.com>
Date: Tue, 2 Dec 2014 11:41:35 +0800
From: Gonglei
MIME-Version: 1.0
References: <33183CC9F5247A488A2544077AF1902086E041A5@SZXEMA503-MBS.china.huawei.com> <54775D4A.8080709@redhat.com> <5477E019.6090408@huawei.com> <547C396A.4080001@redhat.com>
In-Reply-To: <547C396A.4080001@redhat.com>
Content-Type: text/plain; charset="windows-1252"
Content-Transfer-Encoding: 7bit
Subject: Re: [Qemu-devel] [BUG] Redhat-6.4_64bit-guest kernel panic with cpu-passthrough and guest numa
To: Paolo Bonzini
Cc: "Huangweidong (C)", "benoit@irqsave.net", "wangxin (U)", "qemu-devel@nongnu.org", "Huangpeng (Peter)", "Herongguang (Stephen)"

On 2014/12/1 17:48, Paolo Bonzini wrote:
>
>
> On 28/11/2014 03:38, Gonglei wrote:
>>>> Can you find what line of kernel/sched.c it is?
>> Yes, of course. See below please:
>> "sgs->avg_load = (sgs->group_load * SCHED_LOAD_SCALE) / group->cpu_power; "
>> in update_sg_lb_stats(), file sched.c, line 4094
>> And I can share the cause we found. After commit 787aaf57 (target-i386:
>> forward CPUID cache leaves when -cpu host is used), the guest gets its cpu cache
>> information from the host when -cpu host is used. But if we configure guest numa:
>> node 0 cpus 0~7
>> node 1 cpus 8~15
>> then the numa nodes lie in the same host cpu cache (cpus 0~16).
>> When the guest OS boots, it calculates group->cpu_power, but it finds that those
>> two different nodes share the same cache, so node1's group->cpu_power
>> is never assigned and keeps its initial value '0'. Then, when a vcpu is scheduled,
>> the division by 0 causes the kernel panic.
>
> Thanks. Please open a Red Hat bugzilla with the information, and Cc
> Larry Woodman who fixed a few instances of this in
> the past.
>

Hi, Paolo

A bug has been reported:
https://bugzilla.redhat.com/process_bug.cgi

Regards,
-Gonglei
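
P.S. For anyone following the thread, here is a minimal user-space sketch of the failing computation quoted above. It is not the kernel code and not a proposed fix: struct group_stats and avg_load_checked() are hypothetical names made up for illustration, and the SCHED_LOAD_SCALE value of 1024 is an assumption. It only shows why a cpu_power left at 0 turns the quoted line into a division by zero, and what a defensive check would look like.

```c
#include <stdint.h>

/* Assumed fixed-point scale, chosen here for illustration only. */
#define SCHED_LOAD_SCALE 1024UL

/* Hypothetical stand-in for the two fields used in the quoted line. */
struct group_stats {
    unsigned long group_load; /* summed load of the group */
    unsigned long cpu_power;  /* stays 0 if it is never assigned */
};

/*
 * Computes (group_load * SCHED_LOAD_SCALE) / cpu_power.
 * Returns 0 and stores the result in *avg_load on success.
 * Returns -1 without dividing when cpu_power is 0, which is the
 * case that triggers the panic in the guest.
 */
int avg_load_checked(const struct group_stats *g, unsigned long *avg_load)
{
    if (g->cpu_power == 0)
        return -1;

    *avg_load = (g->group_load * SCHED_LOAD_SCALE) / g->cpu_power;
    return 0;
}
```

With group_load = 2048 and cpu_power = 1024 the function stores 2048; with cpu_power = 0 it returns -1 instead of trapping.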