Re: [PATCH 1/2] KVM: arm64: Replace memslot_is_logging() with kvm_slot_dirty_track_enabled()

Linux-ARM-Kernel Archive on lore.kernel.org
 help / color / mirror / Atom feed

From: Leonardo Bras <leo.bras@arm.com>
To: Wei-Lin Chang <weilin.chang@arm.com>
Cc: Leonardo Bras <leo.bras@arm.com>,
	linux-arm-kernel@lists.infradead.org, kvmarm@lists.linux.dev,
	linux-kernel@vger.kernel.org, Marc Zyngier <maz@kernel.org>,
	Oliver Upton <oupton@kernel.org>, Joey Gouly <joey.gouly@arm.com>,
	Steffen Eiden <seiden@linux.ibm.com>,
	Suzuki K Poulose <suzuki.poulose@arm.com>,
	Zenghui Yu <yuzenghui@huawei.com>,
	Catalin Marinas <catalin.marinas@arm.com>,
	Will Deacon <will@kernel.org>, Gavin Shan <gshan@redhat.com>
Subject: Re: [PATCH 1/2] KVM: arm64: Replace memslot_is_logging() with kvm_slot_dirty_track_enabled()
Date: Tue,  9 Jun 2026 17:31:01 +0100	[thread overview]
Message-ID: <aig_xcTZKzux0OaS@devkitleo> (raw)
In-Reply-To: <aibmALTEbc7gzSZj@devkitleo>

On Mon, Jun 08, 2026 at 04:55:45PM +0100, Leonardo Bras wrote:
> Hi Wei Lin,
> 
> On Fri, Jun 05, 2026 at 04:32:47PM +0100, Wei-Lin Chang wrote:
> > When checking whether a memslot has dirty logging enabled, the
> > KVM_MEM_LOG_DIRTY_PAGES flag is the source of truth. Previously we were
> > using memslot_is_logging() which only tests dirty bitmap and did not
> > consider dirty ring. This was not detected because
> > KVM_CAP_DIRTY_LOG_RING_WITH_BITMAP was introduced together with KVM
> > arm64 dirty ring, and users need to enable it to ensure dirty
> > information is not lost for the case of VGIC LPI/ITS table changes.
> > 
> > Fix this by using kvm_slot_dirty_track_enabled() instead which checks
> > KVM_MEM_LOG_DIRTY_PAGES.
> > 
> > Note that memslot_is_logging() also treats a memslot as not logging if
> > KVM_MEM_READONLY is set, hence a memslot with both dirty logging and
> > read only would be seen as not logging for memslot_is_logging(), but
> > logging for kvm_slot_dirty_track_enabled(). This allows a read only
> > mapping of size > PAGE_SIZE to be built when memslot_is_logging() is
> > used, leading to a better read performance compared to
> > kvm_slot_dirty_track_enabled(). However memslots that have both
> > KVM_MEM_LOG_DIRTY_PAGES and KVM_MEM_READONLY set do not really make
> > sense as dirty logging is essentially nop for a read only memslot, so
> > this shouldn't affect real workloads much.
> 
> 
> It worries me a bit that we are ignoring the KVM_MEM_READONLY flag... 
> I have not yet gone through the whole s2_mmu code but IIUC we can have 
> scenarios on which a memslot can be read-only and have dirty-logging 
> enabled. 


> If a memslot is not faulted yet, IIUC it is marked as read-only 
> (so it can be mapped on write fault), and we can have dirty-logging 
> enabled for it as well (as the VMM has no idea). 
> 

Ignore above bit, I confused memslot with block/page entry.

Looking a bit more, my viewpoint is that:
- Due to dirty_ring, checking memslot.dirty_bitmap should be done only to 
  detect the existence of a dirty_bitmap, not the migration process.
- This changes how detection works, in regardas to read-only blocks:
  memslot_is_logging() -> Checks dirty-bitmap + read-only memslot
  kvm_slot_dirty_track_enabled()  -> Checks only memslot flag
- As a simpler change, we could have:

~~~
-   return memslot->dirty_bitmap && !(memslot->flags & KVM_MEM_READONLY);
+   return kvm_slot_dirty_track_enabled(memslot) && !(memslot->flags & KVM_MEM_READONLY);
~~~

Both are cheking memslot->flags, so it will be probably optimized by the 
compiler as:

~~~
return memslot->flags & 3 == 1
~~~

My main worry was that in the curent patch we are changing the behavior 
on skipping read-only memslots. So going through the users, we can see:

> > 
> > Fixes: 9cb1096f8590 ("KVM: arm64: Enable ring-based dirty memory tracking")
> > Signed-off-by: Wei-Lin Chang <weilin.chang@arm.com>
> > ---
> > It took me a long investigation to acquire the context needed to
> > understand this change, however the reason for this problem not being
> > detected is an educated guess. Please let me know if this is wrong or
> > if there are other issues, thanks!
> > 
> >  arch/arm64/kvm/mmu.c | 11 +++--------
> >  1 file changed, 3 insertions(+), 8 deletions(-)
> > 
> > diff --git a/arch/arm64/kvm/mmu.c b/arch/arm64/kvm/mmu.c
> > index 4da9281312eb..06c46124d3e7 100644
> > --- a/arch/arm64/kvm/mmu.c
> > +++ b/arch/arm64/kvm/mmu.c
> > @@ -161,11 +161,6 @@ static int kvm_mmu_split_huge_pages(struct kvm *kvm, phys_addr_t addr,
> >  	return ret;
> >  }
> >  
> > -static bool memslot_is_logging(struct kvm_memory_slot *memslot)
> > -{
> > -	return memslot->dirty_bitmap && !(memslot->flags & KVM_MEM_READONLY);
> > -}
> > -
> >  /**
> >   * kvm_arch_flush_remote_tlbs() - flush all VM TLB entries for v7/8
> >   * @kvm:	pointer to kvm structure.
> > @@ -1748,7 +1743,7 @@ static short kvm_s2_resolve_vma_size(const struct kvm_s2_fault_desc *s2fd,
> >  {
> >  	short vma_shift;
> >  
> > -	if (memslot_is_logging(s2fd->memslot)) {
> > +	if (kvm_slot_dirty_track_enabled(s2fd->memslot)) {
> >  		s2vi->max_map_size = PAGE_SIZE;
> >  		vma_shift = PAGE_SHIFT;
> >  	} else {

On the case dirty_track is enabled in a read-only slot, it will resolve to 
a smaller vma_size. The fault granule will be smaller here. This could be 
bad for performance, so maybe we could add a check for read-only block 
here:

~~~
-   if (memslot_is_logging(s2fd->memslot)) {
+   if (kvm_slot_dirty_track_enabled(s2fd->memslot) &&
+       !memslot_is_readonly(s2fd->memslot) {
~~~


> > @@ -1953,7 +1948,7 @@ static int kvm_s2_fault_compute_prot(const struct kvm_s2_fault_desc *s2fd,
> >  	*prot = KVM_PGTABLE_PROT_R;
> >  
> >  	if (s2vi->map_writable && (s2vi->device ||
> > -				   !memslot_is_logging(s2fd->memslot) ||
> > +				   !kvm_slot_dirty_track_enabled(s2fd->memslot) ||
> >  				   kvm_is_write_fault(s2fd->vcpu)))
> >  		*prot |= KVM_PGTABLE_PROT_W;
> >


On the same scenario (dirty_track enabled on readonly memslot):
This one should be safe, as kvm_is_write_fault() will check if the memslot 
is readonly and return false in this case. But then, it will have to 
actually call kvm_is_write_fault(), as the previous version would not even 
call it in that scenario.

Not sure how would that impact perforformance, though.

> > @@ -2084,7 +2079,7 @@ static int user_mem_abort(const struct kvm_s2_fault_desc *s2fd)
> >  	 * and a write fault needs to collapse a block entry into a table.
> >  	 */
> >  	memcache = get_mmu_memcache(s2fd->vcpu);
> > -	if (!perm_fault || (memslot_is_logging(s2fd->memslot) &&
> > +	if (!perm_fault || (kvm_slot_dirty_track_enabled(s2fd->memslot) &&
> >  			    kvm_is_write_fault(s2fd->vcpu))) {
> >  		ret = topup_mmu_memcache(s2fd->vcpu, memcache);
> >  		if (ret)

Same thing, if memslot is tracking and is readonly, topup_*() would be 
called with the new patch, but not with the old behavior. 

All of that depends on how the VMM uses dirty_tracking: does it enable for 
all memory, or only for memory that is writable?

I could not find anything that would prevent user from enabling 
dirty_tracking on read-only memslots, so we can either ignore this 
scenario, apply those patches and let those users carry the extra overhead, 
or do an extra test to make sure it's doing the same thing as before.

Thanks!
Leo

next prev parent reply	other threads:[~2026-06-09 16:31 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-06-05 15:32 [PATCH 0/2] KVM: arm64: Small dirty logging fixes/cleanups Wei-Lin Chang
2026-06-05 15:32 ` [PATCH 1/2] KVM: arm64: Replace memslot_is_logging() with kvm_slot_dirty_track_enabled() Wei-Lin Chang
2026-06-08 15:55   ` Leonardo Bras
2026-06-09 16:31     ` Leonardo Bras [this message]
2026-06-10  9:48       ` Alexandru Elisei
2026-06-10 11:18         ` Wei-Lin Chang
2026-06-05 15:32 ` [PATCH 2/2] KVM: arm64: Remove superfluous aligning of gfn for dirty logging Wei-Lin Chang
2026-06-08 15:23   ` Leonardo Bras

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aig_xcTZKzux0OaS@devkitleo \
    --to=leo.bras@arm.com \
    --cc=catalin.marinas@arm.com \
    --cc=gshan@redhat.com \
    --cc=joey.gouly@arm.com \
    --cc=kvmarm@lists.linux.dev \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=maz@kernel.org \
    --cc=oupton@kernel.org \
    --cc=seiden@linux.ibm.com \
    --cc=suzuki.poulose@arm.com \
    --cc=weilin.chang@arm.com \
    --cc=will@kernel.org \
    --cc=yuzenghui@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox