public inbox for linux-mm@kvack.org
 help / color / mirror / Atom feed
From: Muchun Song <muchun.song@linux.dev>
To: "David Hildenbrand (Arm)" <david@kernel.org>
Cc: Muchun Song <songmuchun@bytedance.com>,
	Andrew Morton <akpm@linux-foundation.org>,
	Oscar Salvador <osalvador@suse.de>,
	Michael Ellerman <mpe@ellerman.id.au>,
	Madhavan Srinivasan <maddy@linux.ibm.com>,
	Lorenzo Stoakes <ljs@kernel.org>,
	"Liam R . Howlett" <Liam.Howlett@oracle.com>,
	Vlastimil Babka <vbabka@kernel.org>,
	Mike Rapoport <rppt@kernel.org>,
	Suren Baghdasaryan <surenb@google.com>,
	Michal Hocko <mhocko@suse.com>,
	Nicholas Piggin <npiggin@gmail.com>,
	Christophe Leroy <chleroy@kernel.org>,
	aneesh.kumar@linux.ibm.com, joao.m.martins@oracle.com,
	linux-mm@kvack.org, linuxppc-dev@lists.ozlabs.org,
	linux-kernel@vger.kernel.org
Subject: Re: [PATCH v4 4/5] mm/mm_init: Fix pageblock migratetype for ZONE_DEVICE compound pages
Date: Thu, 23 Apr 2026 11:11:39 +0800	[thread overview]
Message-ID: <FE96D828-7C9F-452B-8EEE-97CA77EF8E25@linux.dev> (raw)
In-Reply-To: <95d59eee-a4fe-4d11-b335-07f3b3ef2e66@kernel.org>



> On Apr 23, 2026, at 03:03, David Hildenbrand (Arm) <david@kernel.org> wrote:
> 
> On 4/22/26 10:14, Muchun Song wrote:
>> The memmap_init_zone_device() function only initializes the migratetype
>> of the first pageblock of a compound page. If the compound page size
>> exceeds pageblock_nr_pages (e.g., 1GB hugepages with 2MB pageblocks),
>> subsequent pageblocks in the compound page remain uninitialized.
>> 
>> Move the migratetype initialization out of __init_zone_device_page()
>> and into a separate pageblock_migratetype_init_range() function. This
>> iterates over the entire PFN range of the memory, ensuring that all
>> pageblocks are correctly initialized.
>> 
>> Fixes: c4386bd8ee3a ("mm/memremap: add ZONE_DEVICE support for compound pages")
>> Signed-off-by: Muchun Song <songmuchun@bytedance.com>
>> Reviewed-by: Mike Rapoport (Microsoft) <rppt@kernel.org>
>> Reviewed-by: Oscar Salvador <osalvador@suse.de>
>> ---
>> mm/mm_init.c | 45 ++++++++++++++++++++++++++++++---------------
>> 1 file changed, 30 insertions(+), 15 deletions(-)
>> 
>> diff --git a/mm/mm_init.c b/mm/mm_init.c
>> index f9f8e1af921c..9d0fe79a94de 100644
>> --- a/mm/mm_init.c
>> +++ b/mm/mm_init.c
>> @@ -674,6 +674,21 @@ static inline void fixup_hashdist(void)
>> static inline void fixup_hashdist(void) {}
>> #endif /* CONFIG_NUMA */
>> 
>> +#ifdef CONFIG_ZONE_DEVICE
>> +static __meminit void pageblock_migratetype_init_range(unsigned long pfn,
>> +        						unsigned long nr_pages,
>> +        						int migratetype)
> 
> Two-tab indent says hi :)

No problem.

> 
>> +{
>> + 	unsigned long end = pfn + nr_pages;
> 
> Can be const.

Ah, yes.

> 
>> +
>> + 	for (pfn = pageblock_align(pfn); pfn < end; pfn += pageblock_nr_pages) {
>> + 		init_pageblock_migratetype(pfn_to_page(pfn), migratetype, false);
>> + 		if (IS_ALIGNED(pfn, PAGES_PER_SECTION))
>> + 			cond_resched();
>> + 	}
>> +}
>> +#endif
>> +
>> /*
>>  * Initialize a reserved page unconditionally, finding its zone first.
>>  */
>> @@ -1011,21 +1026,6 @@ static void __ref __init_zone_device_page(struct page *page, unsigned long pfn,
>> 	page_folio(page)->pgmap = pgmap;
>> 	page->zone_device_data = NULL;
>> 
>> - 	/*
>> - 	 * Mark the block movable so that blocks are reserved for
>> - 	 * movable at startup. This will force kernel allocations
>> - 	 * to reserve their blocks rather than leaking throughout
>> - 	 * the address space during boot when many long-lived
>> - 	 * kernel allocations are made.
>> - 	 *
>> - 	 * Please note that MEMINIT_HOTPLUG path doesn't clear memmap
>> - 	 * because this is done early in section_activate()
>> - 	 */
>> - 	if (pageblock_aligned(pfn)) {
>> - 		init_pageblock_migratetype(page, MIGRATE_MOVABLE, false);
>> - 		cond_resched();
>> - 	}
>> -
>> /*
>>  * ZONE_DEVICE pages other than MEMORY_TYPE_GENERIC are released
>>  * directly to the driver page allocator which will set the page count
>> @@ -1122,6 +1122,9 @@ void __ref memmap_init_zone_device(struct zone *zone,
>> 
>> 	__init_zone_device_page(page, pfn, zone_idx, nid, pgmap);
>> 
>> + 	if (IS_ALIGNED(pfn, PAGES_PER_SECTION))
>> + 		cond_resched();
>> +
>> 	if (pfns_per_compound == 1)
>> 		continue;
>> 
>> @@ -1129,6 +1132,18 @@ void __ref memmap_init_zone_device(struct zone *zone,
>>      compound_nr_pages(altmap, pgmap));
>> }
>> 
>> + /*
>> +  * Mark the block movable so that blocks are reserved for
>> +  * movable at startup. This will force kernel allocations
>> +  * to reserve their blocks rather than leaking throughout
>> +  * the address space during boot when many long-lived
>> +  * kernel allocations are made.
>> +  *
>> +  * Please note that MEMINIT_HOTPLUG path doesn't clear memmap
>> +  * because this is done early in section_activate()
>> +  */
> 
> I am deeply confused about the "memmap" comment here.
> 
> There is no MEMINIT_HOTPLUG special-casing. Nobody "clears memmap".

Absolutely right. This comment was originally copied from the regular
memory initialization path long ago (commit 966cf44f637e), and it has
become completely outdated and misleading over the years after several
MM hotplug refactorings.

> 
> 
> That comment was introduced in
> 
> commit 966cf44f637e6aeea7e3d01ba004bf8b5beac78f
> Author: Alexander Duyck <alexander.h.duyck@linux.intel.com>
> Date:   Fri Oct 26 15:07:52 2018 -0700
> 
>    mm: defer ZONE_DEVICE page initialization to the point where we init pgmap
> 
> Where we added memmap_init_zone_device().
> 
> 
> 
> Is that commit still valid, and this it actually belong here, above the
> migratetype setting?

To be honest, I just blindly moved this comment along with the migratetype
initialization code out of __init_zone_device_page() without deeply analyzing
its historical context and current validity.

After looking into it, I think this comment is an obsolete relic from commit
966cf44f637e and no longer makes sense here.

I will completely remove this confusing comment in v5. Thanks for catching this!

Thanks,
Muchun

> 
> 
>> + 	pageblock_migratetype_init_range(start_pfn, nr_pages, MIGRATE_MOVABLE);
>> +
>> 	pr_debug("%s initialised %lu pages in %ums\n", __func__,
>> 		nr_pages, jiffies_to_msecs(jiffies - start));
>> }
> 
> 
> -- 
> Cheers,
> 
> David




  reply	other threads:[~2026-04-23  3:12 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-04-22  8:14 [PATCH v4 0/5] mm: Fix vmemmap optimization accounting and initialization Muchun Song
2026-04-22  8:14 ` [PATCH v4 1/5] mm/sparse-vmemmap: Fix vmemmap accounting underflow Muchun Song
2026-04-22 18:47   ` David Hildenbrand (Arm)
2026-04-22  8:14 ` [PATCH v4 2/5] mm/sparse-vmemmap: Pass @pgmap argument to memory deactivation paths Muchun Song
2026-04-22 18:50   ` David Hildenbrand (Arm)
2026-04-23  2:14     ` Muchun Song
2026-04-22  8:14 ` [PATCH v4 3/5] mm/sparse-vmemmap: Fix DAX vmemmap accounting with optimization Muchun Song
2026-04-22 18:53   ` David Hildenbrand (Arm)
2026-04-23  2:17     ` Muchun Song
2026-04-22  8:14 ` [PATCH v4 4/5] mm/mm_init: Fix pageblock migratetype for ZONE_DEVICE compound pages Muchun Song
2026-04-22 19:03   ` David Hildenbrand (Arm)
2026-04-23  3:11     ` Muchun Song [this message]
2026-04-22  8:14 ` [PATCH v4 5/5] mm/mm_init: Fix uninitialized struct pages for ZONE_DEVICE Muchun Song
2026-04-22 19:12   ` David Hildenbrand (Arm)

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=FE96D828-7C9F-452B-8EEE-97CA77EF8E25@linux.dev \
    --to=muchun.song@linux.dev \
    --cc=Liam.Howlett@oracle.com \
    --cc=akpm@linux-foundation.org \
    --cc=aneesh.kumar@linux.ibm.com \
    --cc=chleroy@kernel.org \
    --cc=david@kernel.org \
    --cc=joao.m.martins@oracle.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=linuxppc-dev@lists.ozlabs.org \
    --cc=ljs@kernel.org \
    --cc=maddy@linux.ibm.com \
    --cc=mhocko@suse.com \
    --cc=mpe@ellerman.id.au \
    --cc=npiggin@gmail.com \
    --cc=osalvador@suse.de \
    --cc=rppt@kernel.org \
    --cc=songmuchun@bytedance.com \
    --cc=surenb@google.com \
    --cc=vbabka@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox