* [PATCH] KVM: arm64: Take the SRCU lock for page table walks in fault injection and AT emulation
@ 2026-06-02 16:04 Hyunwoo Kim
2026-06-03 6:44 ` Oliver Upton
0 siblings, 1 reply; 3+ messages in thread
From: Hyunwoo Kim @ 2026-06-02 16:04 UTC (permalink / raw)
To: maz, oupton, joey.gouly, seiden, suzuki.poulose, yuzenghui,
catalin.marinas, will
Cc: linux-arm-kernel, kvmarm, stable, imv4bel
inject_abt64() rewalks the guest stage-1 page tables via
__kvm_find_s1_desc_level() when injecting an abort for a failed S1PTW, and
__kvm_at_s12() calls kvm_walk_nested_s2() to perform the stage-2
translation. Both walks reference kvm->memslots through kvm_read_guest(),
which reads the descriptors, and __kvm_at_swap_desc(), which updates the
access flag, so they must run while holding the kvm->srcu read lock.
__kvm_at_swap_desc() asserts srcu_read_lock_held() on entry, and the other
callers of these walks, handle_at_slow(), kvm_translate_vncr() and
kvm_handle_guest_abort(), take the lock before calling them.
inject_abt64() is reached from the SEA and size fault injection paths,
which run before kvm_handle_guest_abort() takes the lock, and
__kvm_at_s12() does not hold the lock across the stage-2 walk. Take the
kvm->srcu read lock with guard(srcu) in both places so that it is held for
the duration of the walk.
Cc: stable@vger.kernel.org
Fixes: 50f77dc87f13 ("KVM: arm64: Populate level on S1PTW SEA injection")
Fixes: be04cebf3e78 ("KVM: arm64: nv: Add emulation of AT S12E{0,1}{R,W}")
Signed-off-by: Hyunwoo Kim <imv4bel@gmail.com>
---
arch/arm64/kvm/at.c | 3 +++
arch/arm64/kvm/inject_fault.c | 2 ++
2 files changed, 5 insertions(+)
diff --git a/arch/arm64/kvm/at.c b/arch/arm64/kvm/at.c
index 9f8f0ae8e86e..eb334a1c2672 100644
--- a/arch/arm64/kvm/at.c
+++ b/arch/arm64/kvm/at.c
@@ -1569,6 +1569,9 @@ int __kvm_at_s12(struct kvm_vcpu *vcpu, u32 op, u64 vaddr)
/* Do the stage-2 translation */
ipa = (par & GENMASK_ULL(47, 12)) | (vaddr & GENMASK_ULL(11, 0));
out.esr = 0;
+
+ guard(srcu)(&vcpu->kvm->srcu);
+
ret = kvm_walk_nested_s2(vcpu, ipa, &out);
if (ret < 0)
return ret;
diff --git a/arch/arm64/kvm/inject_fault.c b/arch/arm64/kvm/inject_fault.c
index 89982bd3345f..868895ed0930 100644
--- a/arch/arm64/kvm/inject_fault.c
+++ b/arch/arm64/kvm/inject_fault.c
@@ -121,6 +121,8 @@ static void inject_abt64(struct kvm_vcpu *vcpu, bool is_iabt, unsigned long addr
if (hpfar == INVALID_GPA)
return;
+ guard(srcu)(&vcpu->kvm->srcu);
+
ret = __kvm_find_s1_desc_level(vcpu, addr, hpfar, &level);
if (ret)
return;
--
2.43.0
^ permalink raw reply related [flat|nested] 3+ messages in thread* Re: [PATCH] KVM: arm64: Take the SRCU lock for page table walks in fault injection and AT emulation 2026-06-02 16:04 [PATCH] KVM: arm64: Take the SRCU lock for page table walks in fault injection and AT emulation Hyunwoo Kim @ 2026-06-03 6:44 ` Oliver Upton 2026-06-03 11:05 ` Hyunwoo Kim 0 siblings, 1 reply; 3+ messages in thread From: Oliver Upton @ 2026-06-03 6:44 UTC (permalink / raw) To: Hyunwoo Kim Cc: maz, joey.gouly, seiden, suzuki.poulose, yuzenghui, catalin.marinas, will, linux-arm-kernel, kvmarm, stable Hi Hyunwoo, On Wed, Jun 03, 2026 at 01:04:20AM +0900, Hyunwoo Kim wrote: > inject_abt64() rewalks the guest stage-1 page tables via > __kvm_find_s1_desc_level() when injecting an abort for a failed S1PTW, and > __kvm_at_s12() calls kvm_walk_nested_s2() to perform the stage-2 > translation. Both walks reference kvm->memslots through kvm_read_guest(), > which reads the descriptors, and __kvm_at_swap_desc(), which updates the > access flag, so they must run while holding the kvm->srcu read lock. > __kvm_at_swap_desc() asserts srcu_read_lock_held() on entry, and the other > callers of these walks, handle_at_slow(), kvm_translate_vncr() and > kvm_handle_guest_abort(), take the lock before calling them. > > inject_abt64() is reached from the SEA and size fault injection paths, > which run before kvm_handle_guest_abort() takes the lock, and > __kvm_at_s12() does not hold the lock across the stage-2 walk. Take the > kvm->srcu read lock with guard(srcu) in both places so that it is held for > the duration of the walk. Just state the expectation that srcu is held rather than giving the play by play. Perhaps: walk_s1() and kvm_walk_nested_s2() expect to be called while holding kvm->srcu to guard against memslot changes. While this is generally the case, __kvm_at_s12() and __kvm_find_s1_desc_level() call into the respective walkers without taking kvm->srcu. Fix by acquiring kvm->srcu prior to the table walk in both instances. > Cc: stable@vger.kernel.org > Fixes: 50f77dc87f13 ("KVM: arm64: Populate level on S1PTW SEA injection") > Fixes: be04cebf3e78 ("KVM: arm64: nv: Add emulation of AT S12E{0,1}{R,W}") > Signed-off-by: Hyunwoo Kim <imv4bel@gmail.com> I'd prefer if we scoped the critical section to only the relevant calls to the software table walk, like below. -- Thanks, Oliver diff --git a/arch/arm64/kvm/at.c b/arch/arm64/kvm/at.c index 9f8f0ae8e86e..889c2c15d7bd 100644 --- a/arch/arm64/kvm/at.c +++ b/arch/arm64/kvm/at.c @@ -1569,7 +1569,8 @@ int __kvm_at_s12(struct kvm_vcpu *vcpu, u32 op, u64 vaddr) /* Do the stage-2 translation */ ipa = (par & GENMASK_ULL(47, 12)) | (vaddr & GENMASK_ULL(11, 0)); out.esr = 0; - ret = kvm_walk_nested_s2(vcpu, ipa, &out); + scoped_guard(srcu, &vcpu->kvm->srcu) + ret = kvm_walk_nested_s2(vcpu, ipa, &out); if (ret < 0) return ret; @@ -1665,7 +1666,8 @@ int __kvm_find_s1_desc_level(struct kvm_vcpu *vcpu, u64 va, u64 ipa, int *level) } /* Walk the guest's PT, looking for a match along the way */ - ret = walk_s1(vcpu, &wi, &wr, va); + scoped_guard(srcu, &vcpu->kvm->srcu) + ret = walk_s1(vcpu, &wi, &wr, va); switch (ret) { case -EINTR: /* We interrupted the walk on a match, return the level */ ^ permalink raw reply related [flat|nested] 3+ messages in thread
* Re: [PATCH] KVM: arm64: Take the SRCU lock for page table walks in fault injection and AT emulation 2026-06-03 6:44 ` Oliver Upton @ 2026-06-03 11:05 ` Hyunwoo Kim 0 siblings, 0 replies; 3+ messages in thread From: Hyunwoo Kim @ 2026-06-03 11:05 UTC (permalink / raw) To: Oliver Upton Cc: maz, joey.gouly, seiden, suzuki.poulose, yuzenghui, catalin.marinas, will, linux-arm-kernel, kvmarm, stable, imv4bel On Tue, Jun 02, 2026 at 11:44:58PM -0700, Oliver Upton wrote: > Hi Hyunwoo, > > On Wed, Jun 03, 2026 at 01:04:20AM +0900, Hyunwoo Kim wrote: > > inject_abt64() rewalks the guest stage-1 page tables via > > __kvm_find_s1_desc_level() when injecting an abort for a failed S1PTW, and > > __kvm_at_s12() calls kvm_walk_nested_s2() to perform the stage-2 > > translation. Both walks reference kvm->memslots through kvm_read_guest(), > > which reads the descriptors, and __kvm_at_swap_desc(), which updates the > > access flag, so they must run while holding the kvm->srcu read lock. > > __kvm_at_swap_desc() asserts srcu_read_lock_held() on entry, and the other > > callers of these walks, handle_at_slow(), kvm_translate_vncr() and > > kvm_handle_guest_abort(), take the lock before calling them. > > > > inject_abt64() is reached from the SEA and size fault injection paths, > > which run before kvm_handle_guest_abort() takes the lock, and > > __kvm_at_s12() does not hold the lock across the stage-2 walk. Take the > > kvm->srcu read lock with guard(srcu) in both places so that it is held for > > the duration of the walk. > > Just state the expectation that srcu is held rather than giving the > play by play. Perhaps: > > walk_s1() and kvm_walk_nested_s2() expect to be called while holding > kvm->srcu to guard against memslot changes. While this is generally > the case, __kvm_at_s12() and __kvm_find_s1_desc_level() call into the > respective walkers without taking kvm->srcu. > > Fix by acquiring kvm->srcu prior to the table walk in both instances. > > > Cc: stable@vger.kernel.org > > Fixes: 50f77dc87f13 ("KVM: arm64: Populate level on S1PTW SEA injection") > > Fixes: be04cebf3e78 ("KVM: arm64: nv: Add emulation of AT S12E{0,1}{R,W}") > > Signed-off-by: Hyunwoo Kim <imv4bel@gmail.com> > > I'd prefer if we scoped the critical section to only the relevant calls > to the software table walk, like below. Thanks for the review. I agree this direction is the better patch. I'll do some more testing and then submit a v2. > > -- > Thanks, > Oliver > > diff --git a/arch/arm64/kvm/at.c b/arch/arm64/kvm/at.c > index 9f8f0ae8e86e..889c2c15d7bd 100644 > --- a/arch/arm64/kvm/at.c > +++ b/arch/arm64/kvm/at.c > @@ -1569,7 +1569,8 @@ int __kvm_at_s12(struct kvm_vcpu *vcpu, u32 op, u64 vaddr) > /* Do the stage-2 translation */ > ipa = (par & GENMASK_ULL(47, 12)) | (vaddr & GENMASK_ULL(11, 0)); > out.esr = 0; > - ret = kvm_walk_nested_s2(vcpu, ipa, &out); > + scoped_guard(srcu, &vcpu->kvm->srcu) > + ret = kvm_walk_nested_s2(vcpu, ipa, &out); > if (ret < 0) > return ret; > > @@ -1665,7 +1666,8 @@ int __kvm_find_s1_desc_level(struct kvm_vcpu *vcpu, u64 va, u64 ipa, int *level) > } > > /* Walk the guest's PT, looking for a match along the way */ > - ret = walk_s1(vcpu, &wi, &wr, va); > + scoped_guard(srcu, &vcpu->kvm->srcu) > + ret = walk_s1(vcpu, &wi, &wr, va); > switch (ret) { > case -EINTR: > /* We interrupted the walk on a match, return the level */ Best regards, Hyunwoo Kim ^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2026-06-03 11:06 UTC | newest] Thread overview: 3+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2026-06-02 16:04 [PATCH] KVM: arm64: Take the SRCU lock for page table walks in fault injection and AT emulation Hyunwoo Kim 2026-06-03 6:44 ` Oliver Upton 2026-06-03 11:05 ` Hyunwoo Kim
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox