* [patch 0/2] x86, UV: fixups for configurations with a large number of nodes.
@ 2009-10-15 22:39 Robin Holt
2009-10-15 22:40 ` [patch 1/2] x86, UV: Fix information in __uv_hub_info structure Robin Holt
` (2 more replies)
0 siblings, 3 replies; 7+ messages in thread
From: Robin Holt @ 2009-10-15 22:39 UTC (permalink / raw)
To: mingo, tglx; +Cc: linux-mm, linux-kernel, Jack Steiner, Cliff Whickman
We need the __uv_hub_info structure to contain the correct values for
n_val, gpa_mask, and lowmem_remap_*. The first patch in the series
accomplishes this. Could this be included in the stable tree as well.
Without this patch, booting a large configuration hits a problem where
the upper bits of the gnode affect the pnode and the bau will not operate.
The second patch cleans up the broadcast assist unit code a small bit.
Thanks,
Robin Holt
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 7+ messages in thread* [patch 1/2] x86, UV: Fix information in __uv_hub_info structure. 2009-10-15 22:39 [patch 0/2] x86, UV: fixups for configurations with a large number of nodes Robin Holt @ 2009-10-15 22:40 ` Robin Holt 2009-10-15 22:40 ` [patch 2/2] x86, UV: Modify bau to use uv_gpa_to_pnode() Robin Holt 2009-10-16 6:34 ` [patch 0/2] x86, UV: fixups for configurations with a large number of nodes Ingo Molnar 2 siblings, 0 replies; 7+ messages in thread From: Robin Holt @ 2009-10-15 22:40 UTC (permalink / raw) To: mingo, tglx; +Cc: linux-mm, linux-kernel, Jack Steiner, Cliff Whickman, stable [-- Attachment #1: uv_hub_info_fix --] [-- Type: text/plain, Size: 3123 bytes --] A few parts of the uv_hub_info structure are initialized incorrectly. - n_val is being loaded with m_val. - gpa_mask is initialized with a bytes instead of an unsigned long. - Handle the case where none of the alias registers are used. Lastly I converted the bau over to using the uv_hub_info->m_val which is the correct value. To: Ingo Molnar <mingo@elte.hu> To: tglx@linutronix.de Signed-off-by: Robin Holt <holt@sgi.com> Signed-off-by: Jack Steiner <steiner@sgi.com> Cc: stable@kernel.org Cc: linux-mm@kvack.org Cc: linux-kernel@vger.kernel.org --- arch/x86/kernel/apic/x2apic_uv_x.c | 8 ++++---- arch/x86/kernel/tlb_uv.c | 4 ++-- 2 files changed, 6 insertions(+), 6 deletions(-) Index: linux/arch/x86/kernel/apic/x2apic_uv_x.c =================================================================== --- linux.orig/arch/x86/kernel/apic/x2apic_uv_x.c 2009-10-15 17:02:33.000000000 -0500 +++ linux/arch/x86/kernel/apic/x2apic_uv_x.c 2009-10-15 17:24:13.000000000 -0500 @@ -352,14 +352,14 @@ static __init void get_lowmem_redirect(u for (i = 0; i < ARRAY_SIZE(redir_addrs); i++) { alias.v = uv_read_local_mmr(redir_addrs[i].alias); - if (alias.s.base == 0) { + if (alias.s.enable && alias.s.base == 0) { *size = (1UL << alias.s.m_alias); redirect.v = uv_read_local_mmr(redir_addrs[i].redirect); *base = (unsigned long)redirect.s.dest_base << DEST_SHIFT; return; } } - BUG(); + *base = *size = 0; } enum map_type {map_wb, map_uc}; @@ -619,12 +619,12 @@ void __init uv_system_init(void) uv_cpu_hub_info(cpu)->lowmem_remap_base = lowmem_redir_base; uv_cpu_hub_info(cpu)->lowmem_remap_top = lowmem_redir_size; uv_cpu_hub_info(cpu)->m_val = m_val; - uv_cpu_hub_info(cpu)->n_val = m_val; + uv_cpu_hub_info(cpu)->n_val = n_val; uv_cpu_hub_info(cpu)->numa_blade_id = blade; uv_cpu_hub_info(cpu)->blade_processor_id = lcpu; uv_cpu_hub_info(cpu)->pnode = pnode; uv_cpu_hub_info(cpu)->pnode_mask = pnode_mask; - uv_cpu_hub_info(cpu)->gpa_mask = (1 << (m_val + n_val)) - 1; + uv_cpu_hub_info(cpu)->gpa_mask = (1UL << (m_val + n_val)) - 1; uv_cpu_hub_info(cpu)->gnode_upper = gnode_upper; uv_cpu_hub_info(cpu)->gnode_extra = gnode_extra; uv_cpu_hub_info(cpu)->global_mmr_base = mmr_base; Index: linux/arch/x86/kernel/tlb_uv.c =================================================================== --- linux.orig/arch/x86/kernel/tlb_uv.c 2009-10-15 17:02:33.000000000 -0500 +++ linux/arch/x86/kernel/tlb_uv.c 2009-10-15 17:29:32.000000000 -0500 @@ -843,8 +843,8 @@ static int __init uv_bau_init(void) GFP_KERNEL, cpu_to_node(cur_cpu)); uv_bau_retry_limit = 1; - uv_nshift = uv_hub_info->n_val; - uv_mmask = (1UL << uv_hub_info->n_val) - 1; + uv_nshift = uv_hub_info->m_val; + uv_mmask = (1UL << uv_hub_info->m_val) - 1; nblades = uv_num_possible_blades(); uv_bau_table_bases = (struct bau_control **) -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a> ^ permalink raw reply [flat|nested] 7+ messages in thread
* [patch 2/2] x86, UV: Modify bau to use uv_gpa_to_pnode(). 2009-10-15 22:39 [patch 0/2] x86, UV: fixups for configurations with a large number of nodes Robin Holt 2009-10-15 22:40 ` [patch 1/2] x86, UV: Fix information in __uv_hub_info structure Robin Holt @ 2009-10-15 22:40 ` Robin Holt 2009-10-16 6:34 ` [patch 0/2] x86, UV: fixups for configurations with a large number of nodes Ingo Molnar 2 siblings, 0 replies; 7+ messages in thread From: Robin Holt @ 2009-10-15 22:40 UTC (permalink / raw) To: mingo, tglx; +Cc: linux-mm, linux-kernel, Jack Steiner, Cliff Whickman [-- Attachment #1: bau_use_gpa_to_pnode --] [-- Type: text/plain, Size: 3059 bytes --] Create an inline function to extract the pnode from a global physical address and then convert the broadcast assist unit to use the newly created uv_gpa_to_pnode function. To: Ingo Molnar <mingo@elte.hu> To: tglx@linutronix.de Signed-off-by: Robin Holt <holt@sgi.com> Acked-by: Cliff Whickman <cpw@sgi.com> Cc: linux-mm@kvack.org Cc: linux-kernel@vger.kernel.org --- arch/x86/include/asm/uv/uv_hub.h | 8 +++++++- arch/x86/kernel/tlb_uv.c | 7 ++----- 2 files changed, 9 insertions(+), 6 deletions(-) Index: linux/arch/x86/include/asm/uv/uv_hub.h =================================================================== --- linux.orig/arch/x86/include/asm/uv/uv_hub.h 2009-10-15 17:26:48.000000000 -0500 +++ linux/arch/x86/include/asm/uv/uv_hub.h 2009-10-15 17:28:46.000000000 -0500 @@ -114,7 +114,7 @@ /* * The largest possible NASID of a C or M brick (+ 2) */ -#define UV_MAX_NASID_VALUE (UV_MAX_NUMALINK_NODES * 2) +#define UV_MAX_NASID_VALUE (UV_MAX_NUMALINK_BLADES * 2) struct uv_scir_s { struct timer_list timer; @@ -230,6 +230,12 @@ static inline unsigned long uv_gpa(void return uv_soc_phys_ram_to_gpa(__pa(v)); } +/* gpa -> pnode */ +static inline int uv_gpa_to_pnode(unsigned long gpa) +{ + return gpa >> uv_hub_info->m_val & ((1UL << uv_hub_info->n_val) - 1); +} + /* pnode, offset --> socket virtual */ static inline void *uv_pnode_offset_to_vaddr(int pnode, unsigned long offset) { Index: linux/arch/x86/kernel/tlb_uv.c =================================================================== --- linux.orig/arch/x86/kernel/tlb_uv.c 2009-10-15 17:26:48.000000000 -0500 +++ linux/arch/x86/kernel/tlb_uv.c 2009-10-15 17:28:46.000000000 -0500 @@ -23,8 +23,6 @@ static struct bau_control **uv_bau_table_bases __read_mostly; static int uv_bau_retry_limit __read_mostly; -/* position of pnode (which is nasid>>1): */ -static int uv_nshift __read_mostly; /* base pnode in this partition */ static int uv_partition_base_pnode __read_mostly; @@ -723,7 +721,7 @@ uv_activation_descriptor_init(int node, BUG_ON(!adp); pa = uv_gpa(adp); /* need the real nasid*/ - n = pa >> uv_nshift; + n = uv_gpa_to_pnode(pa); m = pa & uv_mmask; uv_write_global_mmr64(pnode, UVH_LB_BAU_SB_DESCRIPTOR_BASE, @@ -778,7 +776,7 @@ uv_payload_queue_init(int node, int pnod * need the pnode of where the memory was really allocated */ pa = uv_gpa(pqp); - pn = pa >> uv_nshift; + pn = uv_gpa_to_pnode(pa); uv_write_global_mmr64(pnode, UVH_LB_BAU_INTD_PAYLOAD_QUEUE_FIRST, ((unsigned long)pn << UV_PAYLOADQ_PNODE_SHIFT) | @@ -843,7 +841,6 @@ static int __init uv_bau_init(void) GFP_KERNEL, cpu_to_node(cur_cpu)); uv_bau_retry_limit = 1; - uv_nshift = uv_hub_info->m_val; uv_mmask = (1UL << uv_hub_info->m_val) - 1; nblades = uv_num_possible_blades(); -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a> ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [patch 0/2] x86, UV: fixups for configurations with a large number of nodes. 2009-10-15 22:39 [patch 0/2] x86, UV: fixups for configurations with a large number of nodes Robin Holt 2009-10-15 22:40 ` [patch 1/2] x86, UV: Fix information in __uv_hub_info structure Robin Holt 2009-10-15 22:40 ` [patch 2/2] x86, UV: Modify bau to use uv_gpa_to_pnode() Robin Holt @ 2009-10-16 6:34 ` Ingo Molnar 2009-10-16 11:29 ` Robin Holt 2 siblings, 1 reply; 7+ messages in thread From: Ingo Molnar @ 2009-10-16 6:34 UTC (permalink / raw) To: Robin Holt; +Cc: tglx, linux-mm, linux-kernel, Jack Steiner, Cliff Whickman * Robin Holt <holt@sgi.com> wrote: > We need the __uv_hub_info structure to contain the correct values for > n_val, gpa_mask, and lowmem_remap_*. The first patch in the series > accomplishes this. Could this be included in the stable tree as well. > Without this patch, booting a large configuration hits a problem where > the upper bits of the gnode affect the pnode and the bau will not > operate. i've applied this one. > The second patch cleans up the broadcast assist unit code a small bit. Seems to be more than just a 'cleanup'. It changes: uv_nshift = uv_hub_info->m_val; to (in essence): uv_hub_info->m_val & ((1UL << uv_hub_info->n_val) - 1) which is not the same. Furthermore, the new inline is: + return gpa >> uv_hub_info->m_val & ((1UL << uv_hub_info->n_val) - 1); note that >> has higher priority than bitwise & - is that intended? I think the intention was: + return gpa >> (uv_hub_info->m_val & ((1UL << uv_hub_info->n_val) - 1)); in any case please do that cleaner by adding a separate mask variable. Ingo -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a> ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [patch 0/2] x86, UV: fixups for configurations with a large number of nodes. 2009-10-16 6:34 ` [patch 0/2] x86, UV: fixups for configurations with a large number of nodes Ingo Molnar @ 2009-10-16 11:29 ` Robin Holt 2009-10-16 12:53 ` Ingo Molnar 0 siblings, 1 reply; 7+ messages in thread From: Robin Holt @ 2009-10-16 11:29 UTC (permalink / raw) To: Ingo Molnar Cc: Robin Holt, tglx, linux-mm, linux-kernel, Jack Steiner, Cliff Whickman On Fri, Oct 16, 2009 at 08:34:05AM +0200, Ingo Molnar wrote: > > * Robin Holt <holt@sgi.com> wrote: > > > We need the __uv_hub_info structure to contain the correct values for > > n_val, gpa_mask, and lowmem_remap_*. The first patch in the series > > accomplishes this. Could this be included in the stable tree as well. > > Without this patch, booting a large configuration hits a problem where > > the upper bits of the gnode affect the pnode and the bau will not > > operate. > > i've applied this one. Thank you for applying this one. > > The second patch cleans up the broadcast assist unit code a small bit. > > Seems to be more than just a 'cleanup'. It changes: I am going to rearrange a bit: > + return gpa >> uv_hub_info->m_val & ((1UL << uv_hub_info->n_val) - 1); > > note that >> has higher priority than bitwise & - is that intended? I > think the intention was: > > + return gpa >> (uv_hub_info->m_val & ((1UL << uv_hub_info->n_val) - 1)); The intention was (gpa >> m_val) & (n_mask); I love the clarity of making it an explicitly stated mask. Much more readable. > > uv_nshift = uv_hub_info->m_val; > > to (in essence): > > uv_hub_info->m_val & ((1UL << uv_hub_info->n_val) - 1) > > which is not the same. Furthermore, the new inline is: You have an excellent point there. That was a bug as well. That may explain a few of our currently unexplained bau hangs. The value is supposed to be a pnode instead of the current gnode. Robin --- Create an inline function to extract the pnode from a global physical address and then convert the broadcast assist unit to use the newly created uv_gpa_to_pnode function. To: Ingo Molnar <mingo@elte.hu> To: tglx@linutronix.de Signed-off-by: Robin Holt <holt@sgi.com> Acked-by: Cliff Whickman <cpw@sgi.com> Cc: linux-mm@kvack.org Cc: linux-kernel@vger.kernel.org --- arch/x86/include/asm/uv/uv_hub.h | 16 +++++++++++++++- arch/x86/kernel/tlb_uv.c | 7 ++----- 2 files changed, 17 insertions(+), 6 deletions(-) Index: linux/arch/x86/include/asm/uv/uv_hub.h =================================================================== --- linux.orig/arch/x86/include/asm/uv/uv_hub.h 2009-10-16 06:02:23.000000000 -0500 +++ linux/arch/x86/include/asm/uv/uv_hub.h 2009-10-16 06:07:52.000000000 -0500 @@ -114,7 +114,7 @@ /* * The largest possible NASID of a C or M brick (+ 2) */ -#define UV_MAX_NASID_VALUE (UV_MAX_NUMALINK_NODES * 2) +#define UV_MAX_NASID_VALUE (UV_MAX_NUMALINK_BLADES * 2) struct uv_scir_s { struct timer_list timer; @@ -230,6 +230,20 @@ static inline unsigned long uv_gpa(void return uv_soc_phys_ram_to_gpa(__pa(v)); } +/* gnode -> pnode */ +static inline unsigned long uv_gpa_to_gnode(unsigned long gpa) +{ + return gpa >> uv_hub_info->m_val; +} + +/* gpa -> pnode */ +static inline int uv_gpa_to_pnode(unsigned long gpa) +{ + unsigned long n_mask = (1UL << uv_hub_info->n_val) - 1; + + return uv_gpa_to_gnode(gpa) & n_mask; +} + /* pnode, offset --> socket virtual */ static inline void *uv_pnode_offset_to_vaddr(int pnode, unsigned long offset) { Index: linux/arch/x86/kernel/tlb_uv.c =================================================================== --- linux.orig/arch/x86/kernel/tlb_uv.c 2009-10-16 06:02:27.000000000 -0500 +++ linux/arch/x86/kernel/tlb_uv.c 2009-10-16 06:02:28.000000000 -0500 @@ -23,8 +23,6 @@ static struct bau_control **uv_bau_table_bases __read_mostly; static int uv_bau_retry_limit __read_mostly; -/* position of pnode (which is nasid>>1): */ -static int uv_nshift __read_mostly; /* base pnode in this partition */ static int uv_partition_base_pnode __read_mostly; @@ -723,7 +721,7 @@ uv_activation_descriptor_init(int node, BUG_ON(!adp); pa = uv_gpa(adp); /* need the real nasid*/ - n = pa >> uv_nshift; + n = uv_gpa_to_pnode(pa); m = pa & uv_mmask; uv_write_global_mmr64(pnode, UVH_LB_BAU_SB_DESCRIPTOR_BASE, @@ -778,7 +776,7 @@ uv_payload_queue_init(int node, int pnod * need the pnode of where the memory was really allocated */ pa = uv_gpa(pqp); - pn = pa >> uv_nshift; + pn = uv_gpa_to_pnode(pa); uv_write_global_mmr64(pnode, UVH_LB_BAU_INTD_PAYLOAD_QUEUE_FIRST, ((unsigned long)pn << UV_PAYLOADQ_PNODE_SHIFT) | @@ -843,7 +841,6 @@ static int __init uv_bau_init(void) GFP_KERNEL, cpu_to_node(cur_cpu)); uv_bau_retry_limit = 1; - uv_nshift = uv_hub_info->m_val; uv_mmask = (1UL << uv_hub_info->m_val) - 1; nblades = uv_num_possible_blades(); -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a> ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [patch 0/2] x86, UV: fixups for configurations with a large number of nodes. 2009-10-16 11:29 ` Robin Holt @ 2009-10-16 12:53 ` Ingo Molnar 2009-10-16 14:55 ` Robin Holt 0 siblings, 1 reply; 7+ messages in thread From: Ingo Molnar @ 2009-10-16 12:53 UTC (permalink / raw) To: Robin Holt; +Cc: tglx, linux-mm, linux-kernel, Jack Steiner, Cliff Whickman * Robin Holt <holt@sgi.com> wrote: > > uv_nshift = uv_hub_info->m_val; > > > > to (in essence): > > > > uv_hub_info->m_val & ((1UL << uv_hub_info->n_val) - 1) > > > > which is not the same. Furthermore, the new inline is: > > You have an excellent point there. That was a bug as well. That may > explain a few of our currently unexplained bau hangs. The value is > supposed to be a pnode instead of the current gnode. So ... is the commit log message i've put into the commit below correct, or is it still only a cleanup patch? You really need to put that kind of info into your changelogs - it helps maintainers put it into the right kernel release. Ingo ------------> ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [patch 0/2] x86, UV: fixups for configurations with a large number of nodes. 2009-10-16 12:53 ` Ingo Molnar @ 2009-10-16 14:55 ` Robin Holt 0 siblings, 0 replies; 7+ messages in thread From: Robin Holt @ 2009-10-16 14:55 UTC (permalink / raw) To: Ingo Molnar Cc: Robin Holt, tglx, linux-mm, linux-kernel, Jack Steiner, Cliff Whickman On Fri, Oct 16, 2009 at 02:53:13PM +0200, Ingo Molnar wrote: ... > So ... is the commit log message i've put into the commit below correct, > or is it still only a cleanup patch? You really need to put that kind of > info into your changelogs - it helps maintainers put it into the right > kernel release. > > Ingo ... > The open-coded code was wrong as well - it might explain a > few of our unexplained bau hangs. Terrific. Thank you for fixing up my commit message. I will try to be more complete next time. Robin -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a> ^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2009-10-16 14:55 UTC | newest] Thread overview: 7+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2009-10-15 22:39 [patch 0/2] x86, UV: fixups for configurations with a large number of nodes Robin Holt 2009-10-15 22:40 ` [patch 1/2] x86, UV: Fix information in __uv_hub_info structure Robin Holt 2009-10-15 22:40 ` [patch 2/2] x86, UV: Modify bau to use uv_gpa_to_pnode() Robin Holt 2009-10-16 6:34 ` [patch 0/2] x86, UV: fixups for configurations with a large number of nodes Ingo Molnar 2009-10-16 11:29 ` Robin Holt 2009-10-16 12:53 ` Ingo Molnar 2009-10-16 14:55 ` Robin Holt
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).