All of lore.kernel.org
 help / color / mirror / Atom feed
From: Andre Przywara <andre.przywara@amd.com>
To: Jan Beulich <JBeulich@novell.com>
Cc: xen-devel@lists.xensource.com
Subject: Re: [PATCH] Fix hypervisor crash with unpopulated	 NUMA nodes
Date: Wed, 7 Oct 2009 14:13:06 +0200	[thread overview]
Message-ID: <4ACC85D2.9090100@amd.com> (raw)
In-Reply-To: <4ACC9D73020000780001879A@vpn.id2.novell.com>

Jan Beulich wrote:
>>>> Andre Przywara <andre.przywara@amd.com> 07.10.09 11:45 >>>
>> on NUMA systems with memory-less nodes Xen crashes quite early in the 
>> hypervisor (while initializing the heaps). This is not an issue if this 
>> happens to be the last node, but "inner" nodes trigger this reliably.
>> On multi-node processors it is much more likely to leave a node unequipped.
>> The attached patch fixes this by enumerating the node via the 
>> node_online_map instead of counting from 0 to num_nodes.
> 
> While I do not see anything wrong with the patch, I still wonder why it
> would be needed: It seems to indicate that node_online_map represents
> only nodes with memory, but imo should be representing nodes with
> memory or processors (leaving aside pure I/O nodes for the moment).
> So perhaps there's rather a problem with the setup of node_online_map 
> somewhere?
Yes, because the map creation is callback driven by ACPI code.
The BIOS of my machine is omitting the memory entries for memory-less 
nodes, so there is no callback triggered for these nodes. Nevertheless 
Xen uses the SRAT provided node numbers, this creates the hole.
(My setup: 2 + 0 + 2 + 0 GB per node, Xen sees two nodes named 0 and 2).
I agree that should be changed (that is what I meant with "will rework 
later"), not only because the "lonely" cores will simply be added to 
another node. But since I will be not in the office for the next two 
weeks I would like to get this patch applied for the time being.

Regards,
Andre.

-- 
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany
Tel: +49 351 448 3567 12
----to satisfy European Law for business letters:
Advanced Micro Devices GmbH
Karl-Hammerschmidt-Str. 34, 85609 Dornach b. Muenchen
Geschaeftsfuehrer: Andrew Bowd; Thomas M. McCoy; Giuliano Meroni
Sitz: Dornach, Gemeinde Aschheim, Landkreis Muenchen
Registergericht Muenchen, HRB Nr. 43632

      reply	other threads:[~2009-10-07 12:13 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-10-07  9:45 [PATCH] Fix hypervisor crash with unpopulated NUMA nodes Andre Przywara
2009-10-07 11:53 ` Jan Beulich
2009-10-07 12:13   ` Andre Przywara [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4ACC85D2.9090100@amd.com \
    --to=andre.przywara@amd.com \
    --cc=JBeulich@novell.com \
    --cc=xen-devel@lists.xensource.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.