linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: "H. Peter Anvin" <hpa@linux.intel.com>
To: David Rientjes <rientjes@google.com>
Cc: mingo@redhat.com, hpa@zytor.com, linux-kernel@vger.kernel.org,
	yinghai@kernel.org, rja@sgi.com, tglx@linutronix.de,
	linux-tip-commits@vger.kernel.org,
	Linus Torvalds <torvalds@linux-foundation.org>,
	Robin Holt <holt@sgi.com>, Thomas Gleixner <tglx@linutronix.de>,
	Ingo Molnar <mingo@elte.hu>
Subject: Re: [tip:x86/urgent] x86, numa: For each node, register the memory blocks actually used
Date: Mon, 11 Oct 2010 15:21:15 -0700	[thread overview]
Message-ID: <4CB38DDB.9050006@linux.intel.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1010111504450.5358@chino.kir.corp.google.com>

On 10/11/2010 03:05 PM, David Rientjes wrote:
>>
>> Use nodememblk_range[] instead of nodes[] in order to make sure we
>> capture the actual memory blocks registered with each node.  nodes[]
>> contains an extended range which spans all memory regions associated
>> with a node, but that does not mean that all the memory in between are
>> included.
>>
>> Reported-by: Russ Anderson <rja@sgi.com>
>> Tested-by: Russ Anderson <rja@sgi.com>
>> Signed-off-by: Yinghai Lu <yinghai@kernel.org>
>> LKML-Reference: <4CB27BDF.5000800@kernel.org>
>> Cc: David Rientjes <rientjes@google.com>
>> Cc: <stable@kernel.org> 2.6.33 .34 .35 .36
>> Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>
> 
> Acked-by: David Rientjes <rientjes@google.com>
> 
> Sorry I hadn't seen this thread earlier, I wasn't cc'd on it.

Thanks for confirming.  I don't have access to any systems on which I
can verify this condition myself, but I spent some fairly serious time
time morning on code inspection, and I'm pretty sure I grok what this
patch does and that it is the right thing.

This is not just an SGI UV problem but will in fact bite any system
which has nodes with interlaced memory blocks (for example block 0
belongs to node 0, block 1 belongs to node 1, and then block 2 belongs
to node 0 again.)

There are multiple loops after these which rely on the nodes[] range,
but in fact they rely on exactly this loop to have registered the
relevant memory ranges for the node, so fixing this loop fixes the
subsequent ones.  Of course, it *seriously* begs the question why
nodes[] carry a range at all (well, other than to support bootmem, which
seems like yet another good reason to finish off bootmem.)

Any help in testing would be highly appreciated.  Please feel free to
involve anyone else who would likely have access to the kind of large
NUMA x86 systems which are likely to be affected.

	-hpa

  reply	other threads:[~2010-10-11 22:21 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-10-08 21:34 [BUG] x86: bootmem broken on SGI UV Russ Anderson
2010-10-08 21:43 ` H. Peter Anvin
2010-10-08 22:15 ` Yinghai Lu
2010-10-08 22:57 ` Yinghai Lu
2010-10-09 12:59   ` Russ Anderson
2010-10-09 16:39     ` Robin Holt
2010-10-09 18:06       ` Yinghai Lu
2010-10-09 18:17         ` [PATCH -v2] x86, numa: Fix cross nodes memory configuration Yinghai Lu
2010-10-09 18:39         ` [BUG] x86: bootmem broken on SGI UV Linus Torvalds
2010-10-10 10:41       ` Ingo Molnar
2010-10-10 10:43         ` Ingo Molnar
2010-10-10 11:44           ` Robin Holt
2010-10-10 11:56             ` Robin Holt
2010-10-10 14:05               ` Ingo Molnar
2010-10-10 22:51                 ` Robin Holt
2010-10-11  2:52                   ` [PATCH -v3] x86, numa: Fix cross nodes memory configuration Yinghai Lu
2010-10-11 22:01                     ` [tip:x86/urgent] x86, numa: For each node, register the memory blocks actually used tip-bot for Yinghai Lu
2010-10-11 22:05                       ` David Rientjes
2010-10-11 22:21                         ` H. Peter Anvin [this message]
2010-10-11 22:28                     ` tip-bot for Yinghai Lu

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4CB38DDB.9050006@linux.intel.com \
    --to=hpa@linux.intel.com \
    --cc=holt@sgi.com \
    --cc=hpa@zytor.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-tip-commits@vger.kernel.org \
    --cc=mingo@elte.hu \
    --cc=mingo@redhat.com \
    --cc=rientjes@google.com \
    --cc=rja@sgi.com \
    --cc=tglx@linutronix.de \
    --cc=torvalds@linux-foundation.org \
    --cc=yinghai@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).