The Linux Kernel Mailing List
 help / color / mirror / Atom feed
From: Dev Jain <dev.jain@arm.com>
To: "David Hildenbrand (Arm)" <david@kernel.org>,
	akpm@linux-foundation.org, ljs@kernel.org, hughd@google.com,
	chrisl@kernel.org, kasong@tencent.com
Cc: riel@surriel.com, liam@infradead.org, vbabka@kernel.org,
	harry@kernel.org, jannh@google.com, linux-mm@kvack.org,
	linux-kernel@vger.kernel.org, qi.zheng@linux.dev,
	shakeel.butt@linux.dev, baohua@kernel.org,
	axelrasmussen@google.com, yuanchu@google.com, weixugc@google.com,
	rppt@kernel.org, surenb@google.com, mhocko@suse.com,
	baolin.wang@linux.alibaba.com, shikemeng@huaweicloud.com,
	nphamcs@gmail.com, bhe@redhat.com, youngjun.park@lge.com,
	pfalcato@suse.de, ryan.roberts@arm.com,
	anshuman.khandual@arm.com
Subject: Re: [PATCH v3 4/9] mm/memory: Batch set uffd-wp markers during zapping
Date: Tue, 12 May 2026 11:29:00 +0530	[thread overview]
Message-ID: <45ad2925-9d01-427a-9ac0-650c4b2d07e4@arm.com> (raw)
In-Reply-To: <4b36ddb9-c11a-4320-85d4-3e059e67384f@kernel.org>



On 11/05/26 1:07 pm, David Hildenbrand (Arm) wrote:
> On 5/6/26 11:44, Dev Jain wrote:
>> In preparation for the next patch, enable batch setting of uffd-wp ptes.
>>
>> The code paths passing nr > 1 to zap_install_uffd_wp_if_needed() produce
>> that nr through either folio_pte_batch or swap_pte_batch, guaranteeing that
>> all ptes are the same w.r.t belonging to the same type of VMA (anonymous
>> or non-anonymous, wp-armed or non-wp-armed), and all being marked with
>> uffd-wp or all being not marked.
>>
>> Note that we will have to use set_pte_at() in a loop instead of set_ptes()
>> since the latter cannot handle present->non-present conversion for
>> nr_pages > 1.
>>
>> Convert documentation of install_uffd_wp_ptes_if_needed to kerneldoc
>> format.
> 
> You should mention why the uffd_supports_wp_marker()+vma_is_anonymous() calls in
> zap_install_uffd_wp_if_needed can be dropped.

Okay.

> 
>>
>> No functional change is intended.
>>
>> Signed-off-by: Dev Jain <dev.jain@arm.com>
>> ---
>>  include/linux/mm_inline.h | 34 +++++++++++++++++++++-------------
>>  mm/memory.c               | 20 +-------------------
>>  mm/rmap.c                 |  2 +-
>>  3 files changed, 23 insertions(+), 33 deletions(-)
>>
>> diff --git a/include/linux/mm_inline.h b/include/linux/mm_inline.h
>> index a171070e15f05..6f7ecede2fb45 100644
>> --- a/include/linux/mm_inline.h
>> +++ b/include/linux/mm_inline.h
>> @@ -566,9 +566,17 @@ static inline pte_marker copy_pte_marker(
>>  	return dstm;
>>  }
>>  
>> -/*
>> - * If this pte is wr-protected by uffd-wp in any form, arm the special pte to
>> - * replace a none pte.  NOTE!  This should only be called when *pte is already
>> +/**
>> + * install_uffd_wp_ptes_if_needed - install uffd-wp marker on PTEs that map
>> + *				    consecutive pages of the same large folio.
>> + * @vma: The VMA the pages are mapped into.
>> + * @addr: Address the first page of this batch is mapped at.
>> + * @ptep: Page table pointer for the first entry of this batch.
>> + * @pteval: old value of the entry pointed to by ptep.
>> + * @nr_ptes: Number of entries to clear (batch size).
>> + *
>> + * If the ptes were wr-protected by uffd-wp in any form, arm special ptes to
>> + * replace none ptes.  NOTE!  This should only be called when *pte is already
>>   * cleared so we will never accidentally replace something valuable.  Meanwhile
>>   * none pte also means we are not demoting the pte so tlb flushed is not needed.
>>   * E.g., when pte cleared the caller should have taken care of the tlb flush.
>> @@ -576,11 +584,11 @@ static inline pte_marker copy_pte_marker(
>>   * Must be called with pgtable lock held so that no thread will see the none
>>   * pte, and if they see it, they'll fault and serialize at the pgtable lock.
>>   *
>> - * Returns true if an uffd-wp pte was installed, false otherwise.
>> + * Returns true if uffd-wp ptes were installed, false otherwise.
>>   */
>>  static inline bool
>> -pte_install_uffd_wp_if_needed(struct vm_area_struct *vma, unsigned long addr,
>> -			      pte_t *pte, pte_t pteval)
>> +install_uffd_wp_ptes_if_needed(struct vm_area_struct *vma, unsigned long addr,
>> +			      pte_t *ptep, pte_t pteval, unsigned long nr_ptes)
> 
> If we conditionally do something, what about while at it shorten it to:
> 
> 	cont_install_uffd_wp_ptes()

Why "cont_"? Did you mean to say "cond" ("conditionally").

> 
> Also, less churn in this patch if you don't change pte->ptep
> 
> (because if you do, you should then also do pteval->pte :) )
> 
> So I'd just leave that as is in this patch.

So I'll change pte->ptep and pteval->pte because I hate pte pointers
called pte : )
> 
>>  {
>>  	bool arm_uffd_pte = false;
>>  
>> @@ -588,7 +596,7 @@ pte_install_uffd_wp_if_needed(struct vm_area_struct *vma, unsigned long addr,
>>  		return false;
>>  
>>  	/* The current status of the pte should be "cleared" before calling */
>> -	WARN_ON_ONCE(!pte_none(ptep_get(pte)));
>> +	WARN_ON_ONCE(!pte_none(ptep_get(ptep)));
>>  
>>  	/*
>>  	 * NOTE: userfaultfd_wp_unpopulated() doesn't need this whole
>> @@ -610,13 +618,13 @@ pte_install_uffd_wp_if_needed(struct vm_area_struct *vma, unsigned long addr,
>>  	if (unlikely(pte_swp_uffd_wp_any(pteval)))
>>  		arm_uffd_pte = true;
>>  
>> -	if (unlikely(arm_uffd_pte)) {
>> -		set_pte_at(vma->vm_mm, addr, pte,
>> -			   make_pte_marker(PTE_MARKER_UFFD_WP));
>> -		return true;
>> -	}
>> +	if (likely(!arm_uffd_pte))
>> +		return false;
>>  
>> -	return false;
>> +	for (int i = 0; i < nr_ptes; ++i, ++ptep, addr += PAGE_SIZE)
>> +		set_pte_at(vma->vm_mm, addr, ptep, make_pte_marker(PTE_MARKER_UFFD_WP));
>> +
>> +	return true;
>>  }
> 
> I wonder whether this growing function is really appropriate to be in the header
> file? Can we just move that whole thing into mm/memory.c?
> 
> It's called on every try_to_unmap_one() invocation, but I doubt we care about a
> function call here.

Makes sense.

> 
>>  
>>  static inline bool vma_has_recency(const struct vm_area_struct *vma)
>> diff --git a/mm/memory.c b/mm/memory.c
>> index 0c9d9c2cbf0e0..f14311c4d2001 100644
>> --- a/mm/memory.c
>> +++ b/mm/memory.c
>> @@ -1610,29 +1610,11 @@ zap_install_uffd_wp_if_needed(struct vm_area_struct *vma,
>>  			      unsigned long addr, pte_t *pte, int nr,
>>  			      struct zap_details *details, pte_t pteval)
>>  {
>> -	bool was_installed = false;
>> -
>> -	if (!uffd_supports_wp_marker())
>> -		return false;
>> -
>> -	/* Zap on anonymous always means dropping everything */
>> -	if (vma_is_anonymous(vma))
>> -		return false;
>> -
>>  	if (zap_drop_markers(details))
>>  		return false;
>>  
>> -	for (;;) {
>> -		/* the PFN in the PTE is irrelevant. */
>> -		if (pte_install_uffd_wp_if_needed(vma, addr, pte, pteval))
>> -			was_installed = true;
>> -		if (--nr == 0)
>> -			break;
>> -		pte++;
>> -		addr += PAGE_SIZE;
>> -	}
>> +	return install_uffd_wp_ptes_if_needed(vma, addr, pte, pteval, nr);
>>  
>> -	return was_installed;
>>  }
>>  
>>  static __always_inline void zap_present_folio_ptes(struct mmu_gather *tlb,
>> diff --git a/mm/rmap.c b/mm/rmap.c
>> index bd4e3639e26ed..b17dce752a1ea 100644
>> --- a/mm/rmap.c
>> +++ b/mm/rmap.c
>> @@ -2266,7 +2266,7 @@ static bool try_to_unmap_one(struct folio *folio, struct vm_area_struct *vma,
>>  		 * we may want to replace a none pte with a marker pte if
>>  		 * it's file-backed, so we don't lose the tracking info.
>>  		 */
>> -		pte_install_uffd_wp_if_needed(vma, address, pvmw.pte, pteval);
>> +		install_uffd_wp_ptes_if_needed(vma, address, pvmw.pte, pteval, 1);
>>  
>>  		/* Update high watermark before we lower rss */
>>  		update_hiwater_rss(mm);
> 
> 


  reply	other threads:[~2026-05-12  5:59 UTC|newest]

Thread overview: 40+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-05-06  9:44 [PATCH v3 0/9] Optimize anonymous large folio unmapping Dev Jain
2026-05-06  9:44 ` [PATCH v3 1/9] mm/rmap: initialize nr_pages to 1 at loop start in try_to_unmap_one Dev Jain
2026-05-11  6:48   ` David Hildenbrand (Arm)
2026-05-11  8:18     ` Dev Jain
2026-05-11  8:32       ` David Hildenbrand (Arm)
2026-05-12  8:14         ` Dev Jain
2026-05-12  8:17           ` David Hildenbrand (Arm)
2026-05-12 10:49             ` Dev Jain
2026-05-12 11:01               ` David Hildenbrand (Arm)
2026-05-12 11:16                 ` Dev Jain
2026-05-06  9:44 ` [PATCH v3 2/9] mm/rmap: refactor hugetlb pte clearing " Dev Jain
2026-05-11  7:10   ` David Hildenbrand (Arm)
2026-05-11  8:53     ` Dev Jain
2026-05-11  8:59       ` David Hildenbrand (Arm)
2026-05-11 22:20     ` Barry Song
2026-05-12  5:16       ` Dev Jain
2026-05-06  9:44 ` [PATCH v3 3/9] mm/rmap: refactor some code around lazyfree folio unmapping Dev Jain
2026-05-11  7:28   ` David Hildenbrand (Arm)
2026-05-12  5:19     ` Dev Jain
2026-05-06  9:44 ` [PATCH v3 4/9] mm/memory: Batch set uffd-wp markers during zapping Dev Jain
2026-05-11  7:37   ` David Hildenbrand (Arm)
2026-05-12  5:59     ` Dev Jain [this message]
2026-05-12  6:04       ` David Hildenbrand (Arm)
2026-05-06  9:45 ` [PATCH v3 5/9] mm/rmap: batch unmap folios belonging to uffd-wp VMAs Dev Jain
2026-05-11  7:41   ` David Hildenbrand (Arm)
2026-05-06  9:45 ` [PATCH v3 6/9] mm/swapfile: Add batched version of folio_dup_swap Dev Jain
2026-05-11  7:45   ` David Hildenbrand (Arm)
2026-05-12  6:07     ` Dev Jain
2026-05-12  6:36       ` David Hildenbrand (Arm)
2026-05-06  9:45 ` [PATCH v3 7/9] mm/swapfile: Add batched version of folio_put_swap Dev Jain
2026-05-11  8:07   ` David Hildenbrand (Arm)
2026-05-06  9:45 ` [PATCH v3 8/9] mm/rmap: Add batched version of folio_try_share_anon_rmap_pte Dev Jain
2026-05-11  8:13   ` David Hildenbrand (Arm)
2026-05-11  8:14     ` David Hildenbrand (Arm)
2026-05-12  8:57     ` Dev Jain
2026-05-06  9:45 ` [PATCH v3 9/9] mm/rmap: enable batch unmapping of anonymous folios Dev Jain
2026-05-11  8:16   ` David Hildenbrand (Arm)
2026-05-12  8:59     ` Dev Jain
2026-05-08 23:38 ` [PATCH v3 0/9] Optimize anonymous large folio unmapping Andrew Morton
2026-05-11  6:21   ` Dev Jain

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=45ad2925-9d01-427a-9ac0-650c4b2d07e4@arm.com \
    --to=dev.jain@arm.com \
    --cc=akpm@linux-foundation.org \
    --cc=anshuman.khandual@arm.com \
    --cc=axelrasmussen@google.com \
    --cc=baohua@kernel.org \
    --cc=baolin.wang@linux.alibaba.com \
    --cc=bhe@redhat.com \
    --cc=chrisl@kernel.org \
    --cc=david@kernel.org \
    --cc=harry@kernel.org \
    --cc=hughd@google.com \
    --cc=jannh@google.com \
    --cc=kasong@tencent.com \
    --cc=liam@infradead.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=ljs@kernel.org \
    --cc=mhocko@suse.com \
    --cc=nphamcs@gmail.com \
    --cc=pfalcato@suse.de \
    --cc=qi.zheng@linux.dev \
    --cc=riel@surriel.com \
    --cc=rppt@kernel.org \
    --cc=ryan.roberts@arm.com \
    --cc=shakeel.butt@linux.dev \
    --cc=shikemeng@huaweicloud.com \
    --cc=surenb@google.com \
    --cc=vbabka@kernel.org \
    --cc=weixugc@google.com \
    --cc=youngjun.park@lge.com \
    --cc=yuanchu@google.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox