From: Jeremy Fitzhardinge <jeremy@goop.org>
To: Ian Campbell <ijc@hellion.org.uk>
Cc: Vincent CARON <zerodeux@zerodeux.net>,
xen-devel <xen-devel@lists.xensource.com>,
Keir Fraser <keir@xen.org>, Cris Daniluk <cris.daniluk@gmail.com>,
603632@bugs.debian.org
Subject: Re: PVops domain 0 crash on NUMA system only Node==1 present (Was: Re: Bug#603632: linux-image-2.6.32-5-xen-amd64: Linux kernel 2.6.32/xen/amd64 booting fine on bare metal, but not as dom0 with Xen 4.0.1 (Dell R410))
Date: Tue, 23 Nov 2010 10:24:01 -0800 [thread overview]
Message-ID: <4CEC06C1.5010500@goop.org> (raw)
In-Reply-To: <1290513067.31507.7699.camel@zakaz.uk.xensource.com>
On 11/23/2010 03:51 AM, Ian Campbell wrote:
> I'm not sure but looking at the complete bootlog it looks as if the
> system may only have node==1 i.e. no 0 node which could plausibly lead
> to this sort of issue:
> [ 0.000000] Bootmem setup node 1 0000000000000000-0000000040000000
> [ 0.000000] NODE_DATA [0000000000008000 - 000000000000ffff]
> [ 0.000000] bootmap [0000000000010000 - 0000000000017fff] pages 8
> [ 0.000000] (8 early reservations) ==> bootmem [0000000000 - 0040000000]
> [ 0.000000] #0 [0000000000 - 0000001000] BIOS data page ==> [0000000000 - 0000001000]
> [ 0.000000] #1 [0003446000 - 0003465000] XEN PAGETABLES ==> [0003446000 - 0003465000]
> [ 0.000000] #2 [0000006000 - 0000008000] TRAMPOLINE ==> [0000006000 - 0000008000]
> [ 0.000000] #3 [0001000000 - 0001694994] TEXT DATA BSS ==> [0001000000 - 0001694994]
> [ 0.000000] #4 [00016b5000 - 0003244e00] RAMDISK ==> [00016b5000 - 0003244e00]
> [ 0.000000] #5 [0003245000 - 0003446000] XEN START INFO ==> [0003245000 - 0003446000]
> [ 0.000000] #6 [0001695000 - 000169532d] BRK ==> [0001695000 - 000169532d]
> [ 0.000000] #7 [0000100000 - 00002e0000] PGTABLE ==> [0000100000 - 00002e0000]
> [ 0.000000] found SMP MP-table at [ffff8800000fe710] fe710
> [ 0.000000] Zone PFN ranges:
> [ 0.000000] DMA 0x00000000 -> 0x00001000
> [ 0.000000] DMA32 0x00001000 -> 0x00100000
> [ 0.000000] Normal 0x00100000 -> 0x00100000
> [ 0.000000] Movable zone start PFN for each node
> [ 0.000000] early_node_map[2] active PFN ranges
> [ 0.000000] 1: 0x00000000 -> 0x000000a0
> [ 0.000000] 1: 0x00000100 -> 0x00040000
> [ 0.000000] On node 1 totalpages: 262048
> [ 0.000000] DMA zone: 56 pages used for memmap
> [ 0.000000] DMA zone: 483 pages reserved
> [ 0.000000] DMA zone: 3461 pages, LIFO batch:0
> [ 0.000000] DMA32 zone: 3528 pages used for memmap
> [ 0.000000] DMA32 zone: 254520 pages, LIFO batch:31
>
> Perhaps we should be passing numa_node_id() (e.g. current node) instead
> of node 0? There doesn't seem to be another obvious alternative to
> passing in an explicit node number to this callchain (some places cope
> with -1 but not this path AFAICT).
Does booting native get the same configuration?
> It's also not obvious if dom0 should be seeing the tables which describe
> the hosts nodes anyway or if we should be clobbering something. Given
> that dom0 sees a pseudo-physical address map I'm not convinced seeing
> the real SRAT is in any way beneficial. Perhaps we should simply be
> clobbering NUMAness until actual PV understanding of NUMA is ready?
Yes, the host SRAT is meaningless in the domain and we really should
ignore it. I'm not sure what happens if you boot on a really NUMA system.
> One thing I notice when googling R410 issues is that they apparently
> have a "Cores per CPU" BIOS option which might be worth playing with,
> since configuring a reduced number of cores might remove node 0 but not
> node 1 (odd but not invalid?). Presumably it is also worth making sure
> you have the latest BIOS etc.
Also, what's the DIMM configuration? Are the slots fully populated?
J
next prev parent reply other threads:[~2010-11-23 18:24 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20101115233253.11935.35707.reportbug@zerohal>
2010-11-23 11:51 ` PVops domain 0 crash on NUMA system only Node==1 present (Was: Re: Bug#603632: linux-image-2.6.32-5-xen-amd64: Linux kernel 2.6.32/xen/amd64 booting fine on bare metal, but not as dom0 with Xen 4.0.1 (Dell R410)) Ian Campbell
2010-11-23 12:44 ` Bug#603632: " Cris Daniluk
2010-11-23 18:24 ` Jeremy Fitzhardinge [this message]
2010-11-25 12:51 ` Bug#603632: [Xen-devel] " Vincent Caron
2010-11-25 15:01 ` Ian Campbell
2010-11-25 15:49 ` Bug#603632: [Xen-devel] " Vincent Caron
2010-11-23 18:52 ` Ian Campbell
2010-11-23 22:12 ` Ian Campbell
2010-11-23 22:18 ` Ian Campbell
2010-12-02 23:47 ` Bug#603632: [Xen-devel] " Vincent Caron
2010-12-03 0:12 ` Jeremy Fitzhardinge
2010-12-03 0:27 ` Bug#603632: [Xen-devel] " Vincent Caron
2010-12-03 8:54 ` Ian Campbell
2010-12-03 8:52 ` Ian Campbell
2010-12-03 9:20 ` Ian Campbell
2010-12-03 9:49 ` Ian Campbell
2010-12-03 9:54 ` [PATCH] xen: disable ACPI NUMA for PV guests Ian Campbell
2010-12-13 22:17 ` Vincent Caron
2010-12-03 8:51 ` PVops domain 0 crash on NUMA system only Node==1 present (Was: Re: Bug#603632: linux-image-2.6.32-5-xen-amd64: Linux kernel 2.6.32/xen/amd64 booting fine on bare metal, but not as dom0 with Xen 4.0.1 (Dell R410)) Ian Campbell
2010-12-03 10:30 ` Bug#603632: [Xen-devel] " Vincent Caron
2010-11-25 13:29 ` Bug#603632: " Vincent Caron
2010-11-25 14:44 ` Ian Campbell
2010-11-25 16:38 ` Bug#603632: " Vincent Caron
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4CEC06C1.5010500@goop.org \
--to=jeremy@goop.org \
--cc=603632@bugs.debian.org \
--cc=cris.daniluk@gmail.com \
--cc=ijc@hellion.org.uk \
--cc=keir@xen.org \
--cc=xen-devel@lists.xensource.com \
--cc=zerodeux@zerodeux.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.