* [RFC][PATCH] remove section mappinng @ 2008-01-25 16:05 Badari Pulavarty 2008-01-26 1:24 ` Paul Mackerras 0 siblings, 1 reply; 4+ messages in thread From: Badari Pulavarty @ 2008-01-25 16:05 UTC (permalink / raw) To: linuxppc-dev; +Cc: anton, Paul Mackerras, linux-mm Hi Paul/Ben/Anton, As part of making memory remove working on ppc64, I need the code to remove htab mappings for the section of the memory. Here is the code I cooked up, it seems to be working fine. But I have concerns where I need your help. In order to invalidate htab entries, we need to find the "slot". But I can only find the hpte group. Is it okay to invalidate the first entry in the group ? Do I need to invalidate the entire group ? Please help, as I would like to push this for 2.6.25/2.6.26. Thanks, Badari For memory remove, we need to remove htab mapping. Signed-off-by: Badari Pulavarty <pbadari@us.ibm.com> --- arch/powerpc/mm/hash_utils_64.c | 32 ++++++++++++++++++++++++++++++++ include/asm-powerpc/sparsemem.h | 1 + 2 files changed, 33 insertions(+) Index: linux-2.6.24-rc8/arch/powerpc/mm/hash_utils_64.c =================================================================== --- linux-2.6.24-rc8.orig/arch/powerpc/mm/hash_utils_64.c 2008-01-17 09:47:37.000000000 -0800 +++ linux-2.6.24-rc8/arch/powerpc/mm/hash_utils_64.c 2008-01-25 07:57:48.000000000 -0800 @@ -191,6 +191,33 @@ int htab_bolt_mapping(unsigned long vsta return ret < 0 ? ret : 0; } +static void htab_remove_mapping(unsigned long vstart, unsigned long vend, + int psize, int ssize) +{ + unsigned long vaddr; + unsigned int step, shift; + + shift = mmu_psize_defs[psize].shift; + step = 1 << shift; + + for (vaddr = vstart; vaddr < vend; vaddr += step) { + unsigned long hash, hpteg; + unsigned long vsid = get_kernel_vsid(vaddr, ssize); + unsigned long va = hpt_va(vaddr, vsid, ssize); + + hash = hpt_hash(va, shift, ssize); + hpteg = ((hash & htab_hash_mask) * HPTES_PER_GROUP); + + /* + * HELP - how do I find the slot ? Is it okay to + * invalidates only first entry ? Do I need to + * remove entire group instead ? + */ + BUG_ON(!ppc_md.hpte_invalidate); + ppc_md.hpte_invalidate(hpteg, va, psize, ssize, 0); + } +} + static int __init htab_dt_scan_seg_sizes(unsigned long node, const char *uname, int depth, void *data) @@ -436,6 +463,11 @@ void create_section_mapping(unsigned lon _PAGE_ACCESSED | _PAGE_DIRTY | _PAGE_COHERENT | PP_RWXX, mmu_linear_psize, mmu_kernel_ssize)); } + +void remove_section_mapping(unsigned long start, unsigned long end) +{ + htab_remove_mapping(start, end, mmu_linear_psize, mmu_kernel_ssize); +} #endif /* CONFIG_MEMORY_HOTPLUG */ static inline void make_bl(unsigned int *insn_addr, void *func) Index: linux-2.6.24-rc8/include/asm-powerpc/sparsemem.h =================================================================== --- linux-2.6.24-rc8.orig/include/asm-powerpc/sparsemem.h 2008-01-15 20:22:48.000000000 -0800 +++ linux-2.6.24-rc8/include/asm-powerpc/sparsemem.h 2008-01-24 16:20:17.000000000 -0800 @@ -20,6 +20,7 @@ #ifdef CONFIG_MEMORY_HOTPLUG extern void create_section_mapping(unsigned long start, unsigned long end); +extern void remove_section_mapping(unsigned long start, unsigned long end); #ifdef CONFIG_NUMA extern int hot_add_scn_to_nid(unsigned long scn_addr); #else ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [RFC][PATCH] remove section mappinng 2008-01-25 16:05 [RFC][PATCH] remove section mappinng Badari Pulavarty @ 2008-01-26 1:24 ` Paul Mackerras 2008-01-28 22:19 ` Badari Pulavarty 0 siblings, 1 reply; 4+ messages in thread From: Paul Mackerras @ 2008-01-26 1:24 UTC (permalink / raw) To: Badari Pulavarty; +Cc: linux-mm, linuxppc-dev, anton Badari Pulavarty writes: > Here is the code I cooked up, it seems to be working fine. > But I have concerns where I need your help. > > In order to invalidate htab entries, we need to find the "slot". > But I can only find the hpte group. Is it okay to invalidate the > first entry in the group ? Do I need to invalidate the entire group ? You do need to find the correct slot. (I suppose you could invalidate the entire group, but that would be pretty gross.) Note that in the CONFIG_DEBUG_PAGEALLOC case we use 4k pages and keep a map of the slot numbers in linear_map_hash_slots[]. But in that case I assume that the generic code would have already unmapped all the pages of the LMB that you're trying to hot-unplug. In the non-DEBUG_PAGEALLOC case on a System p machine, the hash table will be big enough that the linear mapping entries should always be in slot 0. So just invalidating slot 0 would probably work in practice, but it seems pretty fragile. We might want to use your new htab_remove_mapping() function on a bare-metal system with a smaller hash table in future, for instance. Have a look at pSeries_lpar_hpte_updateboltedpp. It calls pSeries_lpar_hpte_find to find the slot for a bolted HPTE. You could do something similar. In fact maybe the best approach is to do a pSeries_lpar_hpte_remove_bolted() and not try to solve the more general problem. Paul. ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [RFC][PATCH] remove section mappinng 2008-01-26 1:24 ` Paul Mackerras @ 2008-01-28 22:19 ` Badari Pulavarty 2008-02-07 0:09 ` Paul Mackerras 0 siblings, 1 reply; 4+ messages in thread From: Badari Pulavarty @ 2008-01-28 22:19 UTC (permalink / raw) To: Paul Mackerras; +Cc: linux-mm, linuxppc-dev, anton On Sat, 2008-01-26 at 12:24 +1100, Paul Mackerras wrote: > Badari Pulavarty writes: > > > Here is the code I cooked up, it seems to be working fine. > > But I have concerns where I need your help. > > > > In order to invalidate htab entries, we need to find the "slot". > > But I can only find the hpte group. Is it okay to invalidate the > > first entry in the group ? Do I need to invalidate the entire group ? > > You do need to find the correct slot. (I suppose you could invalidate > the entire group, but that would be pretty gross.) > > Note that in the CONFIG_DEBUG_PAGEALLOC case we use 4k pages and keep > a map of the slot numbers in linear_map_hash_slots[]. But in that > case I assume that the generic code would have already unmapped all > the pages of the LMB that you're trying to hot-unplug. > > In the non-DEBUG_PAGEALLOC case on a System p machine, the hash table > will be big enough that the linear mapping entries should always be in > slot 0. So just invalidating slot 0 would probably work in practice, > but it seems pretty fragile. We might want to use your new > htab_remove_mapping() function on a bare-metal system with a smaller > hash table in future, for instance. > > Have a look at pSeries_lpar_hpte_updateboltedpp. It calls > pSeries_lpar_hpte_find to find the slot for a bolted HPTE. You could > do something similar. In fact maybe the best approach is to do a > pSeries_lpar_hpte_remove_bolted() and not try to solve the more > general problem. Paul, Thank you for your input and suggestions. Does this look reasonable to you ? Thanks, Badari For memory remove, we need to clean up htab mappings for the section of the memory we are removing. This patch implements support for removing htab bolted mappings for ppc64 lpar. Other sub-archs, may need to implement similar functionality for the hotplug memory remove to work. Signed-off-by: Badari Pulavarty <pbadari@us.ibm.com> --- arch/powerpc/mm/hash_utils_64.c | 23 +++++++++++++++++++++++ arch/powerpc/platforms/pseries/lpar.c | 15 +++++++++++++++ include/asm-powerpc/machdep.h | 2 ++ include/asm-powerpc/sparsemem.h | 1 + 5 files changed, 44 insertions(+), 1 deletion(-) Index: linux-2.6.24-rc8/arch/powerpc/mm/hash_utils_64.c =================================================================== --- linux-2.6.24-rc8.orig/arch/powerpc/mm/hash_utils_64.c 2008-01-25 08:04:32.000000000 -0800 +++ linux-2.6.24-rc8/arch/powerpc/mm/hash_utils_64.c 2008-01-28 11:45:40.000000000 -0800 @@ -191,6 +191,24 @@ int htab_bolt_mapping(unsigned long vsta return ret < 0 ? ret : 0; } +static void htab_remove_mapping(unsigned long vstart, unsigned long vend, + int psize, int ssize) +{ + unsigned long vaddr; + unsigned int step, shift; + + shift = mmu_psize_defs[psize].shift; + step = 1 << shift; + + if (!ppc_md.hpte_removebolted) { + printk("Sub-arch doesn't implement hpte_removebolted\n"); + return; + } + + for (vaddr = vstart; vaddr < vend; vaddr += step) + ppc_md.hpte_removebolted(vaddr, psize, ssize); +} + static int __init htab_dt_scan_seg_sizes(unsigned long node, const char *uname, int depth, void *data) @@ -436,6 +454,11 @@ void create_section_mapping(unsigned lon _PAGE_ACCESSED | _PAGE_DIRTY | _PAGE_COHERENT | PP_RWXX, mmu_linear_psize, mmu_kernel_ssize)); } + +void remove_section_mapping(unsigned long start, unsigned long end) +{ + htab_remove_mapping(start, end, mmu_linear_psize, mmu_kernel_ssize); +} #endif /* CONFIG_MEMORY_HOTPLUG */ static inline void make_bl(unsigned int *insn_addr, void *func) Index: linux-2.6.24-rc8/include/asm-powerpc/sparsemem.h =================================================================== --- linux-2.6.24-rc8.orig/include/asm-powerpc/sparsemem.h 2008-01-15 20:22:48.000000000 -0800 +++ linux-2.6.24-rc8/include/asm-powerpc/sparsemem.h 2008-01-25 08:18:11.000000000 -0800 @@ -20,6 +20,7 @@ #ifdef CONFIG_MEMORY_HOTPLUG extern void create_section_mapping(unsigned long start, unsigned long end); +extern void remove_section_mapping(unsigned long start, unsigned long end); #ifdef CONFIG_NUMA extern int hot_add_scn_to_nid(unsigned long scn_addr); #else Index: linux-2.6.24-rc8/arch/powerpc/platforms/pseries/lpar.c =================================================================== --- linux-2.6.24-rc8.orig/arch/powerpc/platforms/pseries/lpar.c 2008-01-15 20:22:48.000000000 -0800 +++ linux-2.6.24-rc8/arch/powerpc/platforms/pseries/lpar.c 2008-01-28 14:10:58.000000000 -0800 @@ -520,6 +520,20 @@ static void pSeries_lpar_hpte_invalidate BUG_ON(lpar_rc != H_SUCCESS); } +static void pSeries_lpar_hpte_removebolted(unsigned long ea, + int psize, int ssize) +{ + unsigned long slot, vsid, va; + + vsid = get_kernel_vsid(ea, ssize); + va = hpt_va(ea, vsid, ssize); + + slot = pSeries_lpar_hpte_find(va, psize, ssize); + BUG_ON(slot == -1); + + pSeries_lpar_hpte_invalidate(slot, va, psize, ssize, 0); +} + /* Flag bits for H_BULK_REMOVE */ #define HBR_REQUEST 0x4000000000000000UL #define HBR_RESPONSE 0x8000000000000000UL @@ -597,6 +611,7 @@ void __init hpte_init_lpar(void) ppc_md.hpte_updateboltedpp = pSeries_lpar_hpte_updateboltedpp; ppc_md.hpte_insert = pSeries_lpar_hpte_insert; ppc_md.hpte_remove = pSeries_lpar_hpte_remove; + ppc_md.hpte_removebolted = pSeries_lpar_hpte_removebolted; ppc_md.flush_hash_range = pSeries_lpar_flush_hash_range; ppc_md.hpte_clear_all = pSeries_lpar_hptab_clear; } Index: linux-2.6.24-rc8/include/asm-powerpc/machdep.h =================================================================== --- linux-2.6.24-rc8.orig/include/asm-powerpc/machdep.h 2008-01-25 08:04:41.000000000 -0800 +++ linux-2.6.24-rc8/include/asm-powerpc/machdep.h 2008-01-28 11:45:17.000000000 -0800 @@ -68,6 +68,8 @@ struct machdep_calls { unsigned long vflags, int psize, int ssize); long (*hpte_remove)(unsigned long hpte_group); + void (*hpte_removebolted)(unsigned long ea, + int psize, int ssize); void (*flush_hash_range)(unsigned long number, int local); /* special for kexec, to be called in real mode, linar mapping is ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [RFC][PATCH] remove section mappinng 2008-01-28 22:19 ` Badari Pulavarty @ 2008-02-07 0:09 ` Paul Mackerras 0 siblings, 0 replies; 4+ messages in thread From: Paul Mackerras @ 2008-02-07 0:09 UTC (permalink / raw) To: Badari Pulavarty; +Cc: linux-mm, linuxppc-dev, anton Badari Pulavarty writes: > Thank you for your input and suggestions. Does this look reasonable > to you ? > > Thanks, > Badari > > For memory remove, we need to clean up htab mappings for the > section of the memory we are removing. > > This patch implements support for removing htab bolted mappings > for ppc64 lpar. Other sub-archs, may need to implement similar > functionality for the hotplug memory remove to work. Looks OK to me. Paul. ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2008-02-07 0:09 UTC | newest] Thread overview: 4+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2008-01-25 16:05 [RFC][PATCH] remove section mappinng Badari Pulavarty 2008-01-26 1:24 ` Paul Mackerras 2008-01-28 22:19 ` Badari Pulavarty 2008-02-07 0:09 ` Paul Mackerras
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).