From mboxrd@z Thu Jan 1 00:00:00 1970 From: Vincent Caron Subject: Bug#603632: [Xen-devel] PVops domain 0 crash on NUMA system only Node==1 present (Was: Re: Bug#603632: linux-image-2.6.32-5-xen-amd64: Linux kernel 2.6.32/xen/amd64 booting fine on bare metal, but not as dom0 with Xen 4.0.1 (Dell R410)) Date: Fri, 03 Dec 2010 11:30:47 +0100 Message-ID: <1291372247.25131.74.camel@zerohal> References: <20101115233253.11935.35707.reportbug@zerohal> <1290513067.31507.7699.camel@zakaz.uk.xensource.com> <1290538327.9844.26.camel@localhost.localdomain> <1290550323.9844.85.camel@localhost.localdomain> <1290550726.9844.90.camel@localhost.localdomain> <1291333639.25131.48.camel@zerohal> <1291366264.5514.15039.camel@zakaz.uk.xensource.com> Reply-To: Vincent Caron , 603632@bugs.debian.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit Return-path: Resent-To: debian-bugs-dist@lists.debian.org Resent-Message-ID: In-Reply-To: <1291366264.5514.15039.camel@zakaz.uk.xensource.com> List-Post: List-Help: List-Subscribe: List-Unsubscribe: To: Ian Campbell Cc: Vincent CARON , Cris Daniluk , xen-devel , Keir Fraser , Jeremy Fitzhardinge , "603632@bugs.debian.org" <603632@bugs.debian.org> List-Id: xen-devel@lists.xenproject.org On Fri, 2010-12-03 at 08:51 +0000, Ian Campbell wrote: > On Thu, 2010-12-02 at 23:47 +0000, Vincent Caron wrote: > > On Tue, 2010-11-23 at 22:18 +0000, Ian Campbell wrote: > > > On Tue, 2010-11-23 at 22:12 +0000, Ian Campbell wrote: > > > > On Tue, 2010-11-23 at 18:52 +0000, Ian Campbell wrote: > > > > > On Tue, 2010-11-23 at 11:51 +0000, Ian Campbell wrote: > > > > > > > > > > > > Perhaps we should be passing numa_node_id() (e.g. current node) > > > > > > instead of node 0? > > > > > > > > > > I've just kicked off a build of the 2.6.32-27 Debian kernel with the > > > > > following additional patch, I will hopefully post the binaries tomorrow. > > > > > > > > Build was quicker than I thought... Vincent, Cris if you get a chance > > > > please can you test the kernel from: > > > > http://xenbits.xen.org/people/ianc/2.6.32-27+numa1/ > > > > > > Also, please can you try adding "numa=noacpi" to your kernel command > > > line when running with the standard Debian kernel (not the one above). > > > > > > Thanks! > > > > It just happens that your kernel above (2.6.32-27+numa1) boots fine > > under hypervisor _when_ passed 'numa=noacpi'. Yeah ! > > > > I then tried again with Debian Squeeze's latest 2.6.32-28, which > > crashes as -27 under hypervisor (and changelog show no xen or > > numa-related thingies). Then I added 'numa=noacpi', and it boots fine > > too. I got my 8 cores, networking, etc. > > > > Enclosed is the dmesg for the latter, Debian, kernel. > > > > Is the 'numa=noacpi' a "production acceptable" workaround ? > > Yes and in fact I think the actual fix is simply to have Xen fake out > the behaviour of numa=noacpi as below. I'll send this plus the other fix > out after I've given it a bit of proper testing. OK, I'll follow the patch until it makes it to Squeeze's kernel. Thanks a lot for your help and fix ! Now going to stress some Xen 4.0 domains...