Re: [PATCH v6 0/3] mm: Free contiguous order-0 pages efficiently

public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed

From: Ryan Roberts <ryan.roberts@arm.com>
To: Johannes Weiner <hannes@cmpxchg.org>
Cc: Andrew Morton <akpm@linux-foundation.org>,
	Muhammad Usama Anjum <usama.anjum@arm.com>,
	David Hildenbrand <david@kernel.org>,
	Lorenzo Stoakes <ljs@kernel.org>,
	"Liam R . Howlett" <Liam.Howlett@oracle.com>,
	Vlastimil Babka <vbabka@kernel.org>,
	Mike Rapoport <rppt@kernel.org>,
	Suren Baghdasaryan <surenb@google.com>,
	Michal Hocko <mhocko@suse.com>,
	Brendan Jackman <jackmanb@google.com>, Zi Yan <ziy@nvidia.com>,
	Uladzislau Rezki <urezki@gmail.com>,
	Nick Terrell <terrelln@fb.com>, David Sterba <dsterba@suse.com>,
	Vishal Moola <vishal.moola@gmail.com>,
	linux-mm@kvack.org, linux-kernel@vger.kernel.org,
	bpf@vger.kernel.org, david.hildenbrand@arm.com
Subject: Re: [PATCH v6 0/3] mm: Free contiguous order-0 pages efficiently
Date: Thu, 30 Apr 2026 13:32:57 +0100	[thread overview]
Message-ID: <74bddfeb-0530-4cca-af17-a7920f17bb5b@arm.com> (raw)
In-Reply-To: <20260429135228.GA1987@cmpxchg.org>

On 29/04/2026 14:52, Johannes Weiner wrote:
> On Wed, Apr 29, 2026 at 01:31:10PM +0100, Ryan Roberts wrote:
>> On 29/04/2026 13:04, Andrew Morton wrote:
>>> On Wed, 29 Apr 2026 06:33:26 -0400 Johannes Weiner <hannes@cmpxchg.org> wrote:
>>>
>>>> On Wed, Apr 01, 2026 at 11:16:18AM +0100, Muhammad Usama Anjum wrote:
>>>>> Hi All,
>>>>>
>>>>> A recent change to vmalloc caused some performance benchmark regressions (see
>>>>> [1]). I'm attempting to fix that (and at the same time significantly improve
>>>>> beyond the baseline) by freeing a contiguous set of order-0 pages as a batch.
>>>>
>>>> I think we should revert the original patch.
>>>>
>>>> The premise is that we can save some allocator calls by requesting
>>>> higher orders and splitting them up into singles. This is a frivolous
>>>> and short-sighted use of a very coveted and expensive resource.
>>
>> I'm not sure it's that simple. First off, vmalloc has preferred to allocate high
>> order pages for quite a while, it's just that the patch you're referring to
>> makes it try even harder. So reverting the patch doesn't completely revert the
>> behaviour, it just reduces it.
>>
>> Performance benefits because those high order pages are mapped appropriately in
>> the page table - i.e. 1G PUD, 2M PMD, (or 64K CONTPTE on arm64). So it's not
>> solely about the number of cycles spent in the allocator; the HW is used more
>> efficiently. vmalloc only splits to order-0 for the benefit of the caller,
>> because there are some places that assume they can access each returned struct page.
> 
> Sure, TLB benefits can offset the cost.
> 
> PTE mapped higher orders on systems without contpte (still many) are the problem.

Perhaps vmalloc should only request the high orders when it knows it can map
them beneficially then? (which I guess was the case prior to the patch in
question?).

> 
>> And all the order-0 pages of the original high order page are freed at the same
>> time, so it's not like we are destroying the contiguous resource; it remains
>> intact for the next user (well, ignoring that some will be freed to the pcpu
>> list - this series solves that wrinkle). I've heard it argued that this approach
>> is actually _better_ for conserving contiguous blocks because it's keeping the
>> lifetime of all the constituent pages bound together and reducing fragmentation.
> 
> You're still consuming contiguity and increasing competition over
> it. That needs to pay off in a closed system, not just in one small
> part of it.
> 
> I'm a bit skeptical of that beneficial effect. Sure, if there aren't
> any small fragments and most everybody is doing larger allocations,
> then yes, this could make sense. Although in that case, even calling
> the buddy allocator repeatedly from vmalloc would give you physically
> adjacent pages due to the way splitting works (although I'm not sure
> right now if you'd get the right exact PFN order for contpte).
> 
> But as long as there is a mix of allocation sizes with mixed
> lifetimes, consuming contiguity that you don't need has a high cost
> over vacuuming up holes and fragments. Because now you're competing
> with somebody who has no choice but to *painstakingly move live pages
> around to coalesce the holes*.
> 
> That's the whole reason for the __rmqueue_smallest()-first policy in
> the page allocator. It's fine for somebody to challenge this. But it
> feels pretty strange to make a unilateral decision in vmalloc that
> works around and inverts established allocator policy, with very
> little data to boot.
> 
>> I've never seen any data though...
> 
> Yes. Considering the possible externalities of this patch, IMO we
> should have much more data on big picture behavior, under varying
> pressure situations and workloads etc.
> 
> The reason for my email was that we see this hurting in experiments
> with new code. The vmalloc higher-orders cause a sharp increase in
> compaction activity, subsequent lock contention in zsmalloc migration
> callbacks etc. I wasn't just making this up.

Fair enough.

For the avoidence of doubt, are you also suggesting that this series should not
go in or just that a06157804399 ("mm/vmalloc: request large order pages from
buddy allocator") should be reverted? If the latter, then perhaps it's best to
move the discussion to that thread to catch Vishal and Willy's attention?

Thanks,
Ryan

     prev parent reply	other threads:[~2026-04-30 12:33 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-04-01 10:16 [PATCH v6 0/3] mm: Free contiguous order-0 pages efficiently Muhammad Usama Anjum
2026-04-01 10:16 ` [PATCH v6 1/3] mm/page_alloc: Optimize free_contig_range() Muhammad Usama Anjum
2026-04-01 10:16 ` [PATCH v6 2/3] vmalloc: Optimize vfree with free_pages_bulk() Muhammad Usama Anjum
2026-04-01 10:19   ` David Hildenbrand (Arm)
2026-04-01 15:13   ` Uladzislau Rezki
2026-04-01 10:16 ` [PATCH v6 3/3] mm/page_alloc: Optimize __free_contig_frozen_range() Muhammad Usama Anjum
2026-04-22 13:42 ` [PATCH v6 0/3] mm: Free contiguous order-0 pages efficiently Ryan Roberts
2026-04-22 15:40   ` David Hildenbrand (Arm)
2026-04-29 10:33 ` Johannes Weiner
2026-04-29 12:04   ` Andrew Morton
2026-04-29 12:31     ` Ryan Roberts
2026-04-29 13:04       ` David Hildenbrand (Arm)
2026-04-30 12:09         ` Johannes Weiner
2026-04-29 13:52       ` Johannes Weiner
2026-04-30 12:32         ` Ryan Roberts [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=74bddfeb-0530-4cca-af17-a7920f17bb5b@arm.com \
    --to=ryan.roberts@arm.com \
    --cc=Liam.Howlett@oracle.com \
    --cc=akpm@linux-foundation.org \
    --cc=bpf@vger.kernel.org \
    --cc=david.hildenbrand@arm.com \
    --cc=david@kernel.org \
    --cc=dsterba@suse.com \
    --cc=hannes@cmpxchg.org \
    --cc=jackmanb@google.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=ljs@kernel.org \
    --cc=mhocko@suse.com \
    --cc=rppt@kernel.org \
    --cc=surenb@google.com \
    --cc=terrelln@fb.com \
    --cc=urezki@gmail.com \
    --cc=usama.anjum@arm.com \
    --cc=vbabka@kernel.org \
    --cc=vishal.moola@gmail.com \
    --cc=ziy@nvidia.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox