linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init
@ 2024-10-08 13:27 Ritesh Harjani (IBM)
  2024-10-08 13:27 ` [RFC 2/2] fadump: Make fadump reserve_dump_area_start CMA aligned in case of holes Ritesh Harjani (IBM)
  2024-10-08 13:50 ` [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init David Hildenbrand
  0 siblings, 2 replies; 4+ messages in thread
From: Ritesh Harjani (IBM) @ 2024-10-08 13:27 UTC (permalink / raw)
  To: linuxppc-dev
  Cc: linux-mm, Sourabh Jain, Hari Bathini, Zi Yan, David Hildenbrand,
	Kirill A . Shutemov, Mahesh J Salgaonkar, Michael Ellerman,
	Madhavan Srinivasan, Aneesh Kumar K . V, Donet Tom, LKML,
	Ritesh Harjani (IBM), Sachin P Bappalige

During early init CMA_MIN_ALIGNMENT_BYTES can be PAGE_SIZE,
since pageblock_order is still zero and it gets initialized
later during paging_init() e.g.
paging_init() -> free_area_init() -> set_pageblock_order().

One such use case is -
early_setup() -> early_init_devtree() -> fadump_reserve_mem()

This causes CMA memory alignment check to be bypassed in
cma_init_reserved_mem(). Then later cma_activate_area() can hit
a VM_BUG_ON_PAGE(pfn & ((1 << order) - 1)) if the reserved memory
area was not pageblock_order aligned.

Instead of fixing it locally for fadump case on PowerPC, I believe
this should be fixed for CMA_MIN_ALIGNMENT_BYTES.

<stack trace>
==============
page: refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x10010
flags: 0x13ffff800000000(node=1|zone=0|lastcpupid=0x7ffff) CMA
raw: 013ffff800000000 5deadbeef0000100 5deadbeef0000122 0000000000000000
raw: 0000000000000000 0000000000000000 00000000ffffffff 0000000000000000
page dumped because: VM_BUG_ON_PAGE(pfn & ((1 << order) - 1))
------------[ cut here ]------------
kernel BUG at mm/page_alloc.c:778!

Call Trace:
__free_one_page+0x57c/0x7b0 (unreliable)
free_pcppages_bulk+0x1a8/0x2c8
free_unref_page_commit+0x3d4/0x4e4
free_unref_page+0x458/0x6d0
init_cma_reserved_pageblock+0x114/0x198
cma_init_reserved_areas+0x270/0x3e0
do_one_initcall+0x80/0x2f8
kernel_init_freeable+0x33c/0x530
kernel_init+0x34/0x26c
ret_from_kernel_user_thread+0x14/0x1c

Reported-by: Sachin P Bappalige <sachinpb@linux.ibm.com>
Signed-off-by: Ritesh Harjani (IBM) <ritesh.list@gmail.com>
---
 include/linux/cma.h | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/include/linux/cma.h b/include/linux/cma.h
index 9db877506ea8..20abc6561bcd 100644
--- a/include/linux/cma.h
+++ b/include/linux/cma.h
@@ -5,6 +5,7 @@
 #include <linux/init.h>
 #include <linux/types.h>
 #include <linux/numa.h>
+#include <linux/minmax.h>

 #ifdef CONFIG_CMA_AREAS
 #define MAX_CMA_AREAS	CONFIG_CMA_AREAS
@@ -17,7 +18,8 @@
  * -- can deal with only some pageblocks of a higher-order page being
  *  MIGRATE_CMA, we can use pageblock_nr_pages.
  */
-#define CMA_MIN_ALIGNMENT_PAGES pageblock_nr_pages
+#define CMA_MIN_ALIGNMENT_PAGES \
+	(1ULL << min_not_zero(MAX_PAGE_ORDER, pageblock_order))
 #define CMA_MIN_ALIGNMENT_BYTES (PAGE_SIZE * CMA_MIN_ALIGNMENT_PAGES)

 struct cma;
--
2.46.0



^ permalink raw reply related	[flat|nested] 4+ messages in thread

* [RFC 2/2] fadump: Make fadump reserve_dump_area_start CMA aligned in case of holes
  2024-10-08 13:27 [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init Ritesh Harjani (IBM)
@ 2024-10-08 13:27 ` Ritesh Harjani (IBM)
  2024-10-08 13:50 ` [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init David Hildenbrand
  1 sibling, 0 replies; 4+ messages in thread
From: Ritesh Harjani (IBM) @ 2024-10-08 13:27 UTC (permalink / raw)
  To: linuxppc-dev
  Cc: linux-mm, Sourabh Jain, Hari Bathini, Zi Yan, David Hildenbrand,
	Kirill A . Shutemov, Mahesh J Salgaonkar, Michael Ellerman,
	Madhavan Srinivasan, Aneesh Kumar K . V, Donet Tom, LKML,
	Ritesh Harjani (IBM)

Consider cma alignment into account while calculating base address for
fadump memory allocation. Physical memory ranges can have holes and
fadump_locate_reserve_mem() tries to find a suitable base address.
If CMA is enabled and fadump nocma is false then we need to consider
CMA_MIN_ALIGNMENT_BYTES for reserve_dump_area_start.

For e.g. in case of below memory layout, the most suitable base address
is 0x00000501000000 for crashkernel=4097M which is 16M (order 8) aligned
as expected by CMA_MIN_ALIGNMENT_BYTES on PPC64 during early boot
(when pageblock_order is still not initialized)

~ # cat /proc/iomem
00000000-1fffffff : System RAM
100000000-1ffffffff : System RAM
300000000-3ffffffff : System RAM
500200000-9001fffff : System RAM

~ # dmesg |grep -Ei "fadump|cma"
fadump: Reserved 4112MB of memory at 0x00000501000000 (System RAM: 25088MB)
fadump: Initialized 0x101000000 bytes cma area at 20496MB from 0x1010002a8 bytes of memory reserved for firmware-assisted dump
Kernel command line: root=/dev/vda1 console=ttyS0 nokaslr slub_max_order=0 norandmaps noreboot crashkernel=4097M fadump=on disable_radix=no debug_pagealloc=off
Memory: 21246656K/25690112K available (31872K kernel code, 4544K rwdata, 17280K rodata, 9216K init, 2212K bss, 218432K reserved, 4210688K cma-reserved)

Reported-by: Sourabh Jain <sourabhjain@linux.ibm.com>
Signed-off-by: Ritesh Harjani (IBM) <ritesh.list@gmail.com>
---
 arch/powerpc/kernel/fadump.c | 8 ++++++--
 1 file changed, 6 insertions(+), 2 deletions(-)

diff --git a/arch/powerpc/kernel/fadump.c b/arch/powerpc/kernel/fadump.c
index a612e7513a4f..15ea9c80bc03 100644
--- a/arch/powerpc/kernel/fadump.c
+++ b/arch/powerpc/kernel/fadump.c
@@ -512,6 +512,10 @@ static u64 __init fadump_locate_reserve_mem(u64 base, u64 size)
 	phys_addr_t mstart, mend;
 	int idx = 0;
 	u64 i, ret = 0;
+	unsigned long align = PAGE_SIZE;
+
+	if (IS_ENABLED(CONFIG_CMA) && !fw_dump.nocma)
+		align = CMA_MIN_ALIGNMENT_BYTES;

 	mrngs = reserved_mrange_info.mem_ranges;
 	for_each_free_mem_range(i, NUMA_NO_NODE, MEMBLOCK_NONE,
@@ -520,7 +524,7 @@ static u64 __init fadump_locate_reserve_mem(u64 base, u64 size)
 			 i, mstart, mend, base);

 		if (mstart > base)
-			base = PAGE_ALIGN(mstart);
+			base = ALIGN(mstart, align);

 		while ((mend > base) && ((mend - base) >= size)) {
 			if (!overlaps_reserved_ranges(base, base+size, &idx)) {
@@ -529,7 +533,7 @@ static u64 __init fadump_locate_reserve_mem(u64 base, u64 size)
 			}

 			base = mrngs[idx].base + mrngs[idx].size;
-			base = PAGE_ALIGN(base);
+			base = ALIGN(base, align);
 		}
 	}

--
2.46.0



^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init
  2024-10-08 13:27 [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init Ritesh Harjani (IBM)
  2024-10-08 13:27 ` [RFC 2/2] fadump: Make fadump reserve_dump_area_start CMA aligned in case of holes Ritesh Harjani (IBM)
@ 2024-10-08 13:50 ` David Hildenbrand
  2024-10-10  3:19   ` Ritesh Harjani
  1 sibling, 1 reply; 4+ messages in thread
From: David Hildenbrand @ 2024-10-08 13:50 UTC (permalink / raw)
  To: Ritesh Harjani (IBM), linuxppc-dev
  Cc: linux-mm, Sourabh Jain, Hari Bathini, Zi Yan, Kirill A . Shutemov,
	Mahesh J Salgaonkar, Michael Ellerman, Madhavan Srinivasan,
	Aneesh Kumar K . V, Donet Tom, LKML, Sachin P Bappalige

On 08.10.24 15:27, Ritesh Harjani (IBM) wrote:
> During early init CMA_MIN_ALIGNMENT_BYTES can be PAGE_SIZE,
> since pageblock_order is still zero and it gets initialized
> later during paging_init() e.g.
> paging_init() -> free_area_init() -> set_pageblock_order().
> 
> One such use case is -
> early_setup() -> early_init_devtree() -> fadump_reserve_mem()
> 
> This causes CMA memory alignment check to be bypassed in
> cma_init_reserved_mem(). Then later cma_activate_area() can hit
> a VM_BUG_ON_PAGE(pfn & ((1 << order) - 1)) if the reserved memory
> area was not pageblock_order aligned.
> 
> Instead of fixing it locally for fadump case on PowerPC, I believe
> this should be fixed for CMA_MIN_ALIGNMENT_BYTES.

I think we should add a way to catch the usage of 
CMA_MIN_ALIGNMENT_BYTES before it actually has meaning (before 
pageblock_order was set) and fix the PowerPC usage by reshuffling the 
code accordingly.

-- 
Cheers,

David / dhildenb



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init
  2024-10-08 13:50 ` [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init David Hildenbrand
@ 2024-10-10  3:19   ` Ritesh Harjani
  0 siblings, 0 replies; 4+ messages in thread
From: Ritesh Harjani @ 2024-10-10  3:19 UTC (permalink / raw)
  To: David Hildenbrand, linuxppc-dev
  Cc: linux-mm, Sourabh Jain, Hari Bathini, Zi Yan, Kirill A . Shutemov,
	Mahesh J Salgaonkar, Michael Ellerman, Madhavan Srinivasan,
	Aneesh Kumar K . V, Donet Tom, LKML, Sachin P Bappalige

David Hildenbrand <david@redhat.com> writes:

> On 08.10.24 15:27, Ritesh Harjani (IBM) wrote:
>> During early init CMA_MIN_ALIGNMENT_BYTES can be PAGE_SIZE,
>> since pageblock_order is still zero and it gets initialized
>> later during paging_init() e.g.
>> paging_init() -> free_area_init() -> set_pageblock_order().
>> 
>> One such use case is -
>> early_setup() -> early_init_devtree() -> fadump_reserve_mem()
>> 
>> This causes CMA memory alignment check to be bypassed in
>> cma_init_reserved_mem(). Then later cma_activate_area() can hit
>> a VM_BUG_ON_PAGE(pfn & ((1 << order) - 1)) if the reserved memory
>> area was not pageblock_order aligned.
>> 
>> Instead of fixing it locally for fadump case on PowerPC, I believe
>> this should be fixed for CMA_MIN_ALIGNMENT_BYTES.
>
> I think we should add a way to catch the usage of 
> CMA_MIN_ALIGNMENT_BYTES before it actually has meaning (before 
> pageblock_order was set)

Maybe by enforcing that the pageblock_order should not be zero where we
do the alignment check then?

i.e. in cma_init_reserved_mem() 

diff --git a/mm/cma.c b/mm/cma.c
index 3e9724716bad..36d753e7a0bf 100644
--- a/mm/cma.c
+++ b/mm/cma.c
@@ -182,6 +182,15 @@ int __init cma_init_reserved_mem(phys_addr_t base, phys_addr_t size,
        if (!size || !memblock_is_region_reserved(base, size))
                return -EINVAL;

+       /*
+        * CMA uses CMA_MIN_ALIGNMENT_BYTES as alignment requirement which
+        * needs pageblock_order to be initialized. Let's enforce it.
+        */
+       if (!pageblock_order) {
+               pr_err("pageblock_order not yet initialized. Called during early boot?\n");
+               return -EINVAL;
+       }
+
        /* ensure minimal alignment required by mm core */
        if (!IS_ALIGNED(base | size, CMA_MIN_ALIGNMENT_BYTES))
                return -EINVAL;


> and fix the PowerPC usage by reshuffling the 
> code accordingly.

Ok. I will submit a v2 with the above patch incldued.

Thanks for the review!
-ritesh


^ permalink raw reply related	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2024-10-10  3:22 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-10-08 13:27 [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init Ritesh Harjani (IBM)
2024-10-08 13:27 ` [RFC 2/2] fadump: Make fadump reserve_dump_area_start CMA aligned in case of holes Ritesh Harjani (IBM)
2024-10-08 13:50 ` [RFC 1/2] cma: Fix CMA_MIN_ALIGNMENT_BYTES during early_init David Hildenbrand
2024-10-10  3:19   ` Ritesh Harjani

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).