Re: Who's doing what with cpu/memory/node hotplug?

All of lore.kernel.org
 help / color / mirror / Atom feed

From: Jack Steiner <steiner@sgi.com>
To: linux-ia64@vger.kernel.org
Subject: Re: Who's doing what with cpu/memory/node hotplug?
Date: Thu, 13 May 2004 16:33:25 +0000	[thread overview]
Message-ID: <20040513163325.GB30758@sgi.com> (raw)
In-Reply-To: <20040512205107.16bb82a6.pj@sgi.com>

On Thu, May 13, 2004 at 09:01:48AM -0700, Dave Hansen wrote:
> On Thu, 2004-05-13 at 07:00, Jack Steiner wrote:
> > Where can I find a copy of the latest CONFIG_NONLINEAR patch? I recall
> > one that was posted by Dave McCracken in early Apr. Is that the one I should
> > review?
> 
> There are a few bugfixes on top of it since then, but most of it remains
> untouched.  

Ok, I grab that patch & take a look.


> 
> > When we did the initial implementation of CONFIG_DISCONTIGMEM, we
> > looked briefly at the CONFIG_NONLINEAR patch (or idea) that was
> > floating around at that time. The patch may have changed so some of 
> > my initial concerns may no longer apply, but the early patch would
> > not have performed very well on the SGI hardware. 
> > 
> > The SGI architecture has an absurdly sparse address space. The smallest
> > memory block is 64MB but the max physical address is 49 bits (500TB).  
> > IIRC, this resulted in some very large tables used to convert between 
> > logical & physical addresses. Because of the size of these tables, cache
> > misses would be common on references to these tables. Is this still a 
> > valid concern???
> 
> Our address space on ppc64 isn't that sparse, but we may be dealing with
> memory sections that are as small as 16MB.  Our problem is that any of
> those 16MB sections of memory is removable, which means that we are
> going to get some sparse data structures if a machine isn't populated
> with a significant portion of its capacity of RAM.
> 
> But, one cool/horrifying thing is that it appears that ppc64 is somewhat
> dynamic about what these section sizes are.  On power4, they're fixed at
> 256MB, but on power5, they can go as small as 16MB, determined at boot. 
> But, we don't really know what the size is going to be until we're
> booted, and we run the same binaries everywhere.
> 
> We're probably going to need something other than statically allocated
> nonlinear arrays eventually.  As you eluded to, there are 2 things that
> combined will make nonlinear bad on the cache: sparse physical addresses
> spaces and small memory sections.  However, small sections aren't really
> all that common, especailly on big machines.  Most of the machines that
> we deal with are going to be presenting their memory in pretty
> contiguous regions.  If we can make nonlinear more dynamic, we could
> shrink the arrays in all of the common cases, leaving only the
> pathological memory removal, or machines that really *do* dole out
> memory in 64MB sections, which are probably virutalized anyway. 
> 
> I don't doubt that your architecture allows memory addition in units as
> small as 64MB, but what is it in practice?  

It really is 64MB - at least that is the largest power of 2 that
evenly divides into the size of a chunk of memory.

We currently support 512MB, 1GB & 2GB DIMMs. Memory is populated in
multiples of 4 DIMMs. That means that memory will PHYSICALLY be
in chunks of 2GB, 4 GB or 8GB of contiguous memory.  The memory
on a node consists of up to 4 chunks of memory located at 
offset 0, 16GB, 32GB & 48GB of the nodes memory space. 

The memory directory (used for tracking coherency) is allocated within
each chunk of memory. The directory occupied the top 1/32 of the chunk.
This memory is not accessible to software - a bus error occurs if
software tries to access the directory. 1/32 of a 2GB chunk of memory
is where I got 64MB. Maybe there is another way to handle the
"missing" 64MB of a 2GB chunk of memory.

The other oddity of the SN architecture is that there is no memory
at physical address 0. Physical memory on node 0 starts at 384GB.



> 
> You could always sparsely allocate the nonlinear arrays, and handle
> faults on them like you do mem_map ;)
> 
> -- Dave

-- 
Thanks

Jack Steiner (steiner@sgi.com)          651-683-5302
Principal Engineer                      SGI - Silicon Graphics, Inc.




-------------------------------------------------------
This SF.Net email is sponsored by: SourceForge.net Broadband
Sign-up now for SourceForge Broadband and get the fastest
6.0/768 connection for only $19.95/mo for the first 3 months!
http://ads.osdn.com/?ad_id%62&alloc_ida84&op=click
_______________________________________________
Linux-hotplug-devel mailing list  http://linux-hotplug.sourceforge.net
Linux-hotplug-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/linux-hotplug-devel

WARNING: multiple messages have this Message-ID (diff)

From: Jack Steiner <steiner@sgi.com>
To: linux-ia64@vger.kernel.org
Subject: Re: Who's doing what with cpu/memory/node hotplug?
Date: Thu, 13 May 2004 16:33:25 +0000	[thread overview]
Message-ID: <20040513163325.GB30758@sgi.com> (raw)
In-Reply-To: <20040512205107.16bb82a6.pj@sgi.com>

On Thu, May 13, 2004 at 09:01:48AM -0700, Dave Hansen wrote:
> On Thu, 2004-05-13 at 07:00, Jack Steiner wrote:
> > Where can I find a copy of the latest CONFIG_NONLINEAR patch? I recall
> > one that was posted by Dave McCracken in early Apr. Is that the one I should
> > review?
> 
> There are a few bugfixes on top of it since then, but most of it remains
> untouched.  

Ok, I grab that patch & take a look.

> 
> > When we did the initial implementation of CONFIG_DISCONTIGMEM, we
> > looked briefly at the CONFIG_NONLINEAR patch (or idea) that was
> > floating around at that time. The patch may have changed so some of 
> > my initial concerns may no longer apply, but the early patch would
> > not have performed very well on the SGI hardware. 
> > 
> > The SGI architecture has an absurdly sparse address space. The smallest
> > memory block is 64MB but the max physical address is 49 bits (500TB).  
> > IIRC, this resulted in some very large tables used to convert between 
> > logical & physical addresses. Because of the size of these tables, cache
> > misses would be common on references to these tables. Is this still a 
> > valid concern???
> 
> Our address space on ppc64 isn't that sparse, but we may be dealing with
> memory sections that are as small as 16MB.  Our problem is that any of
> those 16MB sections of memory is removable, which means that we are
> going to get some sparse data structures if a machine isn't populated
> with a significant portion of its capacity of RAM.
> 
> But, one cool/horrifying thing is that it appears that ppc64 is somewhat
> dynamic about what these section sizes are.  On power4, they're fixed at
> 256MB, but on power5, they can go as small as 16MB, determined at boot. 
> But, we don't really know what the size is going to be until we're
> booted, and we run the same binaries everywhere.
> 
> We're probably going to need something other than statically allocated
> nonlinear arrays eventually.  As you eluded to, there are 2 things that
> combined will make nonlinear bad on the cache: sparse physical addresses
> spaces and small memory sections.  However, small sections aren't really
> all that common, especailly on big machines.  Most of the machines that
> we deal with are going to be presenting their memory in pretty
> contiguous regions.  If we can make nonlinear more dynamic, we could
> shrink the arrays in all of the common cases, leaving only the
> pathological memory removal, or machines that really *do* dole out
> memory in 64MB sections, which are probably virutalized anyway. 
> 
> I don't doubt that your architecture allows memory addition in units as
> small as 64MB, but what is it in practice?  

It really is 64MB - at least that is the largest power of 2 that
evenly divides into the size of a chunk of memory.

We currently support 512MB, 1GB & 2GB DIMMs. Memory is populated in
multiples of 4 DIMMs. That means that memory will PHYSICALLY be
in chunks of 2GB, 4 GB or 8GB of contiguous memory.  The memory
on a node consists of up to 4 chunks of memory located at 
offset 0, 16GB, 32GB & 48GB of the nodes memory space. 

The memory directory (used for tracking coherency) is allocated within
each chunk of memory. The directory occupied the top 1/32 of the chunk.
This memory is not accessible to software - a bus error occurs if
software tries to access the directory. 1/32 of a 2GB chunk of memory
is where I got 64MB. Maybe there is another way to handle the
"missing" 64MB of a 2GB chunk of memory.

The other oddity of the SN architecture is that there is no memory
at physical address 0. Physical memory on node 0 starts at 384GB.

> 
> You could always sparsely allocate the nonlinear arrays, and handle
> faults on them like you do mem_map ;)
> 
> -- Dave

-- 
Thanks

Jack Steiner (steiner@sgi.com)          651-683-5302
Principal Engineer                      SGI - Silicon Graphics, Inc.

next prev parent reply	other threads:[~2004-05-13 16:33 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-05-13  3:51 Who's doing what with cpu/memory/node hotplug? Paul Jackson
2004-05-13  3:51 ` Paul Jackson
2004-05-13  4:23 ` Dave Hansen
2004-05-13  4:23   ` Dave Hansen
2004-05-13  4:44 ` Ashok Raj
2004-05-13  4:44   ` Ashok Raj
2004-05-13  4:59 ` Paul Jackson
2004-05-13  4:59   ` Paul Jackson
2004-05-13  6:35 ` Keiichiro Tokunaga
2004-05-13  6:35   ` Keiichiro Tokunaga
2004-05-13  6:50 ` Dave Hansen
2004-05-13  6:50   ` Dave Hansen
2004-05-13  7:18 ` Paul Jackson
2004-05-13  7:18   ` Paul Jackson
2004-05-13 14:00 ` Jack Steiner
2004-05-13 14:00   ` Jack Steiner
2004-05-13 16:01 ` Dave Hansen
2004-05-13 16:01   ` Dave Hansen
2004-05-13 16:33 ` Jack Steiner [this message]
2004-05-13 16:33   ` Jack Steiner
2004-05-14  1:14 ` Keiichiro Tokunaga
2004-05-14  1:14   ` Keiichiro Tokunaga
2004-05-14 13:49 ` Russ Anderson
2004-05-14 13:49   ` Russ Anderson

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20040513163325.GB30758@sgi.com \
    --to=steiner@sgi.com \
    --cc=linux-ia64@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.