From: Eric Dumazet <dada1@cosmosbay.com>
To: Andi Kleen <ak@suse.de>
Cc: linux-kernel@vger.kernel.org
Subject: [NUMA , x86_64] Why memnode_shift is chosen with the lowest possible value ?
Date: Thu, 29 Sep 2005 15:40:38 +0200 [thread overview]
Message-ID: <433BEED6.6000008@cosmosbay.com> (raw)
In-Reply-To: <p73k6h0jjh3.fsf@verdi.suse.de>
Hi Andi
I have a dual Opteron machine, with 8GB of ram on each node.
With latest kernels I have high CPU profiles in mm/slab.c
kfree() is NUMA aware, so far so good, but the price seems heavy.
I noticed in 2.6.14-rc2 syslog :
Node 0 MemBase 0000000000000000 Limit 00000001ffffffff
Node 1 MemBase 0000000200000000 Limit 00000003ffffffff
Using 23 for the hash shift. Max adder is 3ffffffff
instead of previous (2.6.13) :
Node 0 MemBase 0000000000000000 Limit 00000001ffffffff
Node 1 MemBase 0000000200000000 Limit 00000003ffffffff
Using 27 for the hash shift. Max adder is 3ffffffff
After some code review, I see NODEMAPSIZE raised from 0xff to 0xfff
phys_to_nid() is now reading one byte out of 2048 bytes with
(memnode_shift=23, units of 8MB).
But shouldnt we try to use the highest possible value for memnode_shift ?
Using memnode_shift=33 would access only 2 bytes from this memnodemap[],
touching fewer cache lines (well , one cache line). kfree() and friends would
be slightly faster, at least cache friendly.
Another question is :
Could we add in pda (struct x8664_pda) the node of the cpu ?
We currently do :
#define numa_node_id() (cpu_to_node(raw_smp_processor_id()))
Instead of reading the processor_id from pda, then access cpu_to_node[], we
could directly get this information from pda.
#if defined(CONFIG_NUMA)
static inline __attribute_pure__ int numa_node_id() { return read_pda(node);}
#else
#define numa_node_id() 0
#endif
Thank you
Eric
next prev parent reply other threads:[~2005-09-29 13:40 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-09-28 20:25 [PATCH 0/3] Demand faulting for huge pages Adam Litke
2005-09-28 20:25 ` Adam Litke
2005-09-28 20:31 ` [PATCH 1/3 htlb-get_user_pages] " Adam Litke
2005-09-28 20:31 ` Adam Litke
2005-09-28 20:32 ` [PATCH 2/3 htlb-fault] " Adam Litke
2005-09-28 20:32 ` Adam Litke
2005-09-29 6:09 ` Andrew Morton
2005-09-29 6:09 ` Andrew Morton
2005-09-29 6:10 ` Andrew Morton
2005-09-29 6:10 ` Andrew Morton
2005-09-28 20:33 ` [PATCH 3/3 htlb-acct] " Adam Litke
2005-09-28 20:33 ` Adam Litke
2005-09-29 6:20 ` Andrew Morton
2005-09-29 6:20 ` Andrew Morton
2005-09-29 9:45 ` Andi Kleen
2005-09-29 9:45 ` Andi Kleen
2005-09-29 13:40 ` Eric Dumazet [this message]
2005-09-29 13:43 ` [NUMA , x86_64] Why memnode_shift is chosen with the lowest possible value ? Andi Kleen
2005-09-29 16:59 ` Eric Dumazet
2005-09-30 9:09 ` Eric Dumazet
2005-10-04 17:13 ` Andi Kleen
2005-10-04 21:12 ` Eric Dumazet
2005-09-29 13:32 ` [PATCH 0/3] Demand faulting for huge pages Hugh Dickins
2005-09-29 13:32 ` Hugh Dickins
2005-10-06 15:22 ` Adam Litke
2005-10-06 15:22 ` Adam Litke
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=433BEED6.6000008@cosmosbay.com \
--to=dada1@cosmosbay.com \
--cc=ak@suse.de \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.