From: Alexandru Elisei <alexandru.elisei@arm.com>
To: Leonardo Bras <leo.bras@arm.com>
Cc: Wei-Lin Chang <weilin.chang@arm.com>,
linux-arm-kernel@lists.infradead.org, kvmarm@lists.linux.dev,
linux-kernel@vger.kernel.org, Marc Zyngier <maz@kernel.org>,
Oliver Upton <oupton@kernel.org>, Joey Gouly <joey.gouly@arm.com>,
Steffen Eiden <seiden@linux.ibm.com>,
Suzuki K Poulose <suzuki.poulose@arm.com>,
Zenghui Yu <yuzenghui@huawei.com>,
Catalin Marinas <catalin.marinas@arm.com>,
Will Deacon <will@kernel.org>, Gavin Shan <gshan@redhat.com>
Subject: Re: [PATCH 1/2] KVM: arm64: Replace memslot_is_logging() with kvm_slot_dirty_track_enabled()
Date: Wed, 10 Jun 2026 10:48:24 +0100 [thread overview]
Message-ID: <aiky6H02ArbFpwGZ@raptor> (raw)
In-Reply-To: <aig_xcTZKzux0OaS@devkitleo>
Hi Leo,
Just FYI, write faults on read-only memslots are handled as MMIO accesses in
kvm_handle_guest_abort() (gfn_to_hva_memslot_prot() sets @writable to false).
Thanks,
Alex
On Tue, Jun 09, 2026 at 05:31:01PM +0100, Leonardo Bras wrote:
> On Mon, Jun 08, 2026 at 04:55:45PM +0100, Leonardo Bras wrote:
> > Hi Wei Lin,
> >
> > On Fri, Jun 05, 2026 at 04:32:47PM +0100, Wei-Lin Chang wrote:
> > > When checking whether a memslot has dirty logging enabled, the
> > > KVM_MEM_LOG_DIRTY_PAGES flag is the source of truth. Previously we were
> > > using memslot_is_logging() which only tests dirty bitmap and did not
> > > consider dirty ring. This was not detected because
> > > KVM_CAP_DIRTY_LOG_RING_WITH_BITMAP was introduced together with KVM
> > > arm64 dirty ring, and users need to enable it to ensure dirty
> > > information is not lost for the case of VGIC LPI/ITS table changes.
> > >
> > > Fix this by using kvm_slot_dirty_track_enabled() instead which checks
> > > KVM_MEM_LOG_DIRTY_PAGES.
> > >
> > > Note that memslot_is_logging() also treats a memslot as not logging if
> > > KVM_MEM_READONLY is set, hence a memslot with both dirty logging and
> > > read only would be seen as not logging for memslot_is_logging(), but
> > > logging for kvm_slot_dirty_track_enabled(). This allows a read only
> > > mapping of size > PAGE_SIZE to be built when memslot_is_logging() is
> > > used, leading to a better read performance compared to
> > > kvm_slot_dirty_track_enabled(). However memslots that have both
> > > KVM_MEM_LOG_DIRTY_PAGES and KVM_MEM_READONLY set do not really make
> > > sense as dirty logging is essentially nop for a read only memslot, so
> > > this shouldn't affect real workloads much.
> >
> >
> > It worries me a bit that we are ignoring the KVM_MEM_READONLY flag...
> > I have not yet gone through the whole s2_mmu code but IIUC we can have
> > scenarios on which a memslot can be read-only and have dirty-logging
> > enabled.
>
>
> > If a memslot is not faulted yet, IIUC it is marked as read-only
> > (so it can be mapped on write fault), and we can have dirty-logging
> > enabled for it as well (as the VMM has no idea).
> >
>
> Ignore above bit, I confused memslot with block/page entry.
>
> Looking a bit more, my viewpoint is that:
> - Due to dirty_ring, checking memslot.dirty_bitmap should be done only to
> detect the existence of a dirty_bitmap, not the migration process.
> - This changes how detection works, in regardas to read-only blocks:
> memslot_is_logging() -> Checks dirty-bitmap + read-only memslot
> kvm_slot_dirty_track_enabled() -> Checks only memslot flag
> - As a simpler change, we could have:
>
> ~~~
> - return memslot->dirty_bitmap && !(memslot->flags & KVM_MEM_READONLY);
> + return kvm_slot_dirty_track_enabled(memslot) && !(memslot->flags & KVM_MEM_READONLY);
> ~~~
>
> Both are cheking memslot->flags, so it will be probably optimized by the
> compiler as:
>
> ~~~
> return memslot->flags & 3 == 1
> ~~~
>
> My main worry was that in the curent patch we are changing the behavior
> on skipping read-only memslots. So going through the users, we can see:
>
> > >
> > > Fixes: 9cb1096f8590 ("KVM: arm64: Enable ring-based dirty memory tracking")
> > > Signed-off-by: Wei-Lin Chang <weilin.chang@arm.com>
> > > ---
> > > It took me a long investigation to acquire the context needed to
> > > understand this change, however the reason for this problem not being
> > > detected is an educated guess. Please let me know if this is wrong or
> > > if there are other issues, thanks!
> > >
> > > arch/arm64/kvm/mmu.c | 11 +++--------
> > > 1 file changed, 3 insertions(+), 8 deletions(-)
> > >
> > > diff --git a/arch/arm64/kvm/mmu.c b/arch/arm64/kvm/mmu.c
> > > index 4da9281312eb..06c46124d3e7 100644
> > > --- a/arch/arm64/kvm/mmu.c
> > > +++ b/arch/arm64/kvm/mmu.c
> > > @@ -161,11 +161,6 @@ static int kvm_mmu_split_huge_pages(struct kvm *kvm, phys_addr_t addr,
> > > return ret;
> > > }
> > >
> > > -static bool memslot_is_logging(struct kvm_memory_slot *memslot)
> > > -{
> > > - return memslot->dirty_bitmap && !(memslot->flags & KVM_MEM_READONLY);
> > > -}
> > > -
> > > /**
> > > * kvm_arch_flush_remote_tlbs() - flush all VM TLB entries for v7/8
> > > * @kvm: pointer to kvm structure.
> > > @@ -1748,7 +1743,7 @@ static short kvm_s2_resolve_vma_size(const struct kvm_s2_fault_desc *s2fd,
> > > {
> > > short vma_shift;
> > >
> > > - if (memslot_is_logging(s2fd->memslot)) {
> > > + if (kvm_slot_dirty_track_enabled(s2fd->memslot)) {
> > > s2vi->max_map_size = PAGE_SIZE;
> > > vma_shift = PAGE_SHIFT;
> > > } else {
>
> On the case dirty_track is enabled in a read-only slot, it will resolve to
> a smaller vma_size. The fault granule will be smaller here. This could be
> bad for performance, so maybe we could add a check for read-only block
> here:
>
> ~~~
> - if (memslot_is_logging(s2fd->memslot)) {
> + if (kvm_slot_dirty_track_enabled(s2fd->memslot) &&
> + !memslot_is_readonly(s2fd->memslot) {
> ~~~
>
>
> > > @@ -1953,7 +1948,7 @@ static int kvm_s2_fault_compute_prot(const struct kvm_s2_fault_desc *s2fd,
> > > *prot = KVM_PGTABLE_PROT_R;
> > >
> > > if (s2vi->map_writable && (s2vi->device ||
> > > - !memslot_is_logging(s2fd->memslot) ||
> > > + !kvm_slot_dirty_track_enabled(s2fd->memslot) ||
> > > kvm_is_write_fault(s2fd->vcpu)))
> > > *prot |= KVM_PGTABLE_PROT_W;
> > >
>
>
> On the same scenario (dirty_track enabled on readonly memslot):
> This one should be safe, as kvm_is_write_fault() will check if the memslot
> is readonly and return false in this case. But then, it will have to
> actually call kvm_is_write_fault(), as the previous version would not even
> call it in that scenario.
>
> Not sure how would that impact perforformance, though.
>
> > > @@ -2084,7 +2079,7 @@ static int user_mem_abort(const struct kvm_s2_fault_desc *s2fd)
> > > * and a write fault needs to collapse a block entry into a table.
> > > */
> > > memcache = get_mmu_memcache(s2fd->vcpu);
> > > - if (!perm_fault || (memslot_is_logging(s2fd->memslot) &&
> > > + if (!perm_fault || (kvm_slot_dirty_track_enabled(s2fd->memslot) &&
> > > kvm_is_write_fault(s2fd->vcpu))) {
> > > ret = topup_mmu_memcache(s2fd->vcpu, memcache);
> > > if (ret)
>
> Same thing, if memslot is tracking and is readonly, topup_*() would be
> called with the new patch, but not with the old behavior.
>
> All of that depends on how the VMM uses dirty_tracking: does it enable for
> all memory, or only for memory that is writable?
>
> I could not find anything that would prevent user from enabling
> dirty_tracking on read-only memslots, so we can either ignore this
> scenario, apply those patches and let those users carry the extra overhead,
> or do an extra test to make sure it's doing the same thing as before.
>
> Thanks!
> Leo
>
next prev parent reply other threads:[~2026-06-10 9:48 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-06-05 15:32 [PATCH 0/2] KVM: arm64: Small dirty logging fixes/cleanups Wei-Lin Chang
2026-06-05 15:32 ` [PATCH 1/2] KVM: arm64: Replace memslot_is_logging() with kvm_slot_dirty_track_enabled() Wei-Lin Chang
2026-06-08 15:55 ` Leonardo Bras
2026-06-09 16:31 ` Leonardo Bras
2026-06-10 9:48 ` Alexandru Elisei [this message]
2026-06-10 11:18 ` Wei-Lin Chang
2026-06-05 15:32 ` [PATCH 2/2] KVM: arm64: Remove superfluous aligning of gfn for dirty logging Wei-Lin Chang
2026-06-08 15:23 ` Leonardo Bras
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aiky6H02ArbFpwGZ@raptor \
--to=alexandru.elisei@arm.com \
--cc=catalin.marinas@arm.com \
--cc=gshan@redhat.com \
--cc=joey.gouly@arm.com \
--cc=kvmarm@lists.linux.dev \
--cc=leo.bras@arm.com \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=maz@kernel.org \
--cc=oupton@kernel.org \
--cc=seiden@linux.ibm.com \
--cc=suzuki.poulose@arm.com \
--cc=weilin.chang@arm.com \
--cc=will@kernel.org \
--cc=yuzenghui@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox