Linux-mm Archive on lore.kernel.org
From: Usama Arif <usama.arif@linux.dev>
To: Lance Yang <lance.yang@linux.dev>
Cc: david@kernel.org, ying.huang@linux.alibaba.com,
	baoquan.he@linux.dev, willy@infradead.org, youngjun.park@lge.com,
	hannes@cmpxchg.org, riel@surriel.com, ljs@kernel.org,
	shakeel.butt@linux.dev, alex@ghiti.fr, kas@kernel.org,
	baohua@kernel.org, dev.jain@arm.com,
	baolin.wang@linux.alibaba.com, npache@redhat.com,
	linux-mm@kvack.org, akpm@linux-foundation.org,
	liam@infradead.org, ryan.roberts@arm.com, chrisl@kernel.org,
	vbabka@kernel.org, linux-kernel@vger.kernel.org,
	nphamcs@gmail.com, shikemeng@huaweicloud.com,
	kernel-team@meta.com, kasong@tencent.com, ziy@nvidia.com
Subject: Re: [v2 00/16] mm: PMD-level swap entries for anonymous THPs
Date: Sat, 13 Jun 2026 20:18:34 +0100
Message-ID: <526fdbc0-1944-4328-9ff6-7922d021828d@linux.dev>
In-Reply-To: <20260613042232.93691-1-lance.yang@linux.dev>



On 13/06/2026 05:22, Lance Yang wrote:
> 
> On Wed, Jun 10, 2026 at 03:44:32PM +0100, Usama Arif wrote:
>>
>>
>> On 10/06/2026 14:48, David Hildenbrand (Arm) wrote:
>>> On 6/10/26 15:01, Lance Yang wrote:
>>>>
>>>>
>>>> On 2026/6/10 20:24, David Hildenbrand (Arm) wrote:
>>>>> On 6/9/26 16:29, Usama Arif wrote:
>>>>>>
>>>>>>
>>>>>>
>>>>>> Hello!
>>>>>>
>>>>>> Just following up if there were any reviews/comments on this series!
>>>>>>
>>>>>> I know it's a large series but was just checking if there was any
>>>>>> feedback?
>>>>>
>>>>> It shall be reviewed. We just finished the mTHP khugepaged review to get it into
>>>>> 7.2, so we've all been rather busy.
>>>>
>>>> Right, mTHP khugepaged was a rough one. Glad we got it over the line,
>>>> but yeah, there's just been a lot of THP work lately. pretty nonstop ...
>>>>
>>
>> Yeah it's definitely a lot. I have set a target of leaving review comments on
>> at least 2 patches from mm per day myself, but even that can sometimes be
>> difficult! I will try and help out more in reviews.
> 
> Awesome!
> 
>>>>> (I mean, just take a look at the THP-related flood of patches we are fighting
>>>>> with on a daily basis, it's not funny anymore)
>>>>>
>>>>> This is clearly going to be 7.3 material, so there is plenty of time given that
>>>>> the merge window is about to open soon.
>>>>
>>>> Usama, I'll try to make this one a priority too. Looks interesting :P
>>
>> Thanks Lance!
>>
>>>
>>> I have two other bigger series to review, but I should soon get to this as well.
>>>
>>
>> No worries at all! Thanks for the reviews! and yeah definitely 7.3.
>>
>> I will send this out again when 7.3-rc1 opens (rebased), so that the reviews won't be on
>> outdated code which could cause some confusion.
> 
> After skimming through the whole series, probably PMD swap entries need
> one bigger rethink ...
> 
> Emm ... same tricky bit keeps showing up ...
> 
> One PMD swap entry is easy to handle while the swapcache still has one
> PMD-sized folio behind it. Once that folio got split and reclaimed, the
> 512 swap slots need per-page handling :)
> 
> Maybe worth first pinning down the rule here.
> 
> Is a PMD swap entry supposed to mean "there is, or soon will be, one
> PMD-sized folio behind it", or is it just a compact page-table encoding for
> 512 swap slots?
> 
> Without that rule being very clear, every caller has to guess how much
> it can assume, and it is easy to miss one ...
> 
> So I stopped staring at the details for now, because the same issue keeps
> popping up wearing a slightly different hat :)
> 
> Anyway, no clever answer from me here, not a swap expert :( Just pointing
> out the pattern I keep running into.
> 

Thanks for the amazing reviews!

For the next revision I’m going to treat a PMD swap entry as just a compact
page-table encoding for 512 ordinary swap slots. It does not mean that the
swapcache still has, or will soon have, one PMD-sized folio behind it.

With that rule, whole-PMD handling is only valid when either:

1. the swapcache still has one PMD-sized folio for the range, or
2. the whole PMD swap range has no cached folios, so the caller can try a
   PMD-sized swapin and still fall back if that is not possible.

If any slot in the range has per-page cache state, the PMD entry has to be
split and the existing PTE paths need to handle the individual slots.

I am reworking the next revision around that. I added a shared helper to
classify the swapcache behind a PMD swap entry as empty, PMD-sized, or
split, then used it in the places where this assumption mattered:
mincore, UFFDIO_MOVE, swapoff, MADV_WILLNEED, and the PMD swap fault path.
UFFDIO_MOVE now checks the whole 512-slot range before moving a PMD swap
entry without a cached folio, and falls back to PTE handling if per-page
cached folios exist.
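
To make the rule concrete, here is a rough userspace sketch of the
classification. It is not the helper from the series: the names
(classify_pmd_swap_cache, struct slot_cache, pmd_swap_whole_ok) are
hypothetical, the swapcache is modelled as a plain array of per-slot
records, and PMD_ORDER of 9 assumes 4K base pages under a 2M PMD.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define PMD_SLOTS 512	/* swap slots covered by one PMD swap entry */
#define PMD_ORDER 9	/* assumption: 2^9 base pages per PMD */
#define NO_FOLIO  (-1)

/* Hypothetical model of the swapcache state behind one swap slot. */
struct slot_cache {
	int folio_id;		/* NO_FOLIO if nothing is cached for the slot */
	int folio_order;	/* order of the cached folio, unused if NO_FOLIO */
};

enum pmd_swap_cache_state {
	PMD_SWAP_CACHE_EMPTY,	/* no slot in the range has a cached folio */
	PMD_SWAP_CACHE_PMD,	/* one PMD-sized folio covers the whole range */
	PMD_SWAP_CACHE_SPLIT,	/* at least one slot has per-page cache state */
};

/* Classify the swapcache behind the PMD_SLOTS slots of a PMD swap entry. */
static enum pmd_swap_cache_state
classify_pmd_swap_cache(const struct slot_cache *slots)
{
	size_t i, cached = 0;

	assert(slots != NULL);

	for (i = 0; i < PMD_SLOTS; i++)
		if (slots[i].folio_id != NO_FOLIO)
			cached++;

	if (cached == 0)
		return PMD_SWAP_CACHE_EMPTY;
	if (cached != PMD_SLOTS)
		return PMD_SWAP_CACHE_SPLIT;

	/* Fully cached: still split unless it is one folio of PMD order. */
	for (i = 0; i < PMD_SLOTS; i++)
		if (slots[i].folio_id != slots[0].folio_id ||
		    slots[i].folio_order != PMD_ORDER)
			return PMD_SWAP_CACHE_SPLIT;

	return PMD_SWAP_CACHE_PMD;
}

/* Whole-PMD handling is only valid for the empty and PMD-sized cases. */
static bool pmd_swap_whole_ok(enum pmd_swap_cache_state state)
{
	return state != PMD_SWAP_CACHE_SPLIT;
}
```

A caller would classify first and only take the whole-PMD path when
pmd_swap_whole_ok() returns true. Otherwise it splits the PMD entry and
lets the existing PTE paths deal with the individual slots.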

Thanks!
Usama

