* [PATCH] KVM: arm64: Hold kvm->mmu_lock while initialising vcpu->arch.vncr_tlb @ 2026-06-08 8:11 Marc Zyngier 2026-06-08 16:34 ` Oliver Upton 2026-06-08 20:55 ` Yosry Ahmed 0 siblings, 2 replies; 5+ messages in thread From: Marc Zyngier @ 2026-06-08 8:11 UTC (permalink / raw) To: kvmarm, kvm, linux-arm-kernel Cc: Steffen Eiden, Joey Gouly, Suzuki K Poulose, Oliver Upton, Zenghui Yu Sashiko reports that there is a race between initialising vncr_tlb and making use of it, as we don't hold the mmu_lock at this point. Additionally, it identifies a memory leak, should userspace repeatedly invokes the KVM_RUN ioctl after a failure of kvm_arch_vcpu_run_pid_change(), as we assign vncr_tlb blindly on first run, irrespective of prior allocations. Slap the two bugs in one go by taking the kvm->mmu_lock on assigning vncr_tlb, preventing the race for good, and by checking that vncr_tlb is indeed NULL prior to allocation. Reported-by: Sashiko <sashiko-bot@kernel.org> Signed-off-by: Marc Zyngier <maz@kernel.org> Link: https://lore.kernel.org/r/20260607180815.85FBC1F00893@smtp.kernel.org --- arch/arm64/kvm/nested.c | 10 ++++++++-- 1 file changed, 8 insertions(+), 2 deletions(-) diff --git a/arch/arm64/kvm/nested.c b/arch/arm64/kvm/nested.c index 690b8e8564166..d11e36b3cfcc2 100644 --- a/arch/arm64/kvm/nested.c +++ b/arch/arm64/kvm/nested.c @@ -1253,8 +1253,14 @@ int kvm_vcpu_allocate_vncr_tlb(struct kvm_vcpu *vcpu) if (!kvm_has_feat(vcpu->kvm, ID_AA64MMFR4_EL1, NV_frac, NV2_ONLY)) return 0; - vcpu->arch.vncr_tlb = kzalloc_obj(*vcpu->arch.vncr_tlb, - GFP_KERNEL_ACCOUNT); + if (!vcpu->arch.vncr_tlb) { + struct vncr_tlb *vt = kzalloc_obj(*vcpu->arch.vncr_tlb, + GFP_KERNEL_ACCOUNT); + + scoped_guard(write_lock, &vcpu->kvm->mmu_lock) + vcpu->arch.vncr_tlb = vt; + } + if (!vcpu->arch.vncr_tlb) return -ENOMEM; -- 2.47.3 ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH] KVM: arm64: Hold kvm->mmu_lock while initialising vcpu->arch.vncr_tlb 2026-06-08 8:11 [PATCH] KVM: arm64: Hold kvm->mmu_lock while initialising vcpu->arch.vncr_tlb Marc Zyngier @ 2026-06-08 16:34 ` Oliver Upton 2026-06-08 20:55 ` Yosry Ahmed 1 sibling, 0 replies; 5+ messages in thread From: Oliver Upton @ 2026-06-08 16:34 UTC (permalink / raw) To: Marc Zyngier Cc: kvmarm, kvm, linux-arm-kernel, Steffen Eiden, Joey Gouly, Suzuki K Poulose, Zenghui Yu On Mon, Jun 08, 2026 at 09:11:08AM +0100, Marc Zyngier wrote: > Sashiko reports that there is a race between initialising vncr_tlb > and making use of it, as we don't hold the mmu_lock at this point. > > Additionally, it identifies a memory leak, should userspace repeatedly > invokes the KVM_RUN ioctl after a failure of kvm_arch_vcpu_run_pid_change(), > as we assign vncr_tlb blindly on first run, irrespective of prior > allocations. > > Slap the two bugs in one go by taking the kvm->mmu_lock on assigning > vncr_tlb, preventing the race for good, and by checking that vncr_tlb > is indeed NULL prior to allocation. > > Reported-by: Sashiko <sashiko-bot@kernel.org> > Signed-off-by: Marc Zyngier <maz@kernel.org> > Link: https://lore.kernel.org/r/20260607180815.85FBC1F00893@smtp.kernel.org Reviewed-by: Oliver Upton <oupton@kernel.org> Thanks, Oliver > --- > arch/arm64/kvm/nested.c | 10 ++++++++-- > 1 file changed, 8 insertions(+), 2 deletions(-) > > diff --git a/arch/arm64/kvm/nested.c b/arch/arm64/kvm/nested.c > index 690b8e8564166..d11e36b3cfcc2 100644 > --- a/arch/arm64/kvm/nested.c > +++ b/arch/arm64/kvm/nested.c > @@ -1253,8 +1253,14 @@ int kvm_vcpu_allocate_vncr_tlb(struct kvm_vcpu *vcpu) > if (!kvm_has_feat(vcpu->kvm, ID_AA64MMFR4_EL1, NV_frac, NV2_ONLY)) > return 0; > > - vcpu->arch.vncr_tlb = kzalloc_obj(*vcpu->arch.vncr_tlb, > - GFP_KERNEL_ACCOUNT); > + if (!vcpu->arch.vncr_tlb) { > + struct vncr_tlb *vt = kzalloc_obj(*vcpu->arch.vncr_tlb, > + GFP_KERNEL_ACCOUNT); > + > + scoped_guard(write_lock, &vcpu->kvm->mmu_lock) > + vcpu->arch.vncr_tlb = vt; > + } > + > if (!vcpu->arch.vncr_tlb) > return -ENOMEM; > > -- > 2.47.3 > ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] KVM: arm64: Hold kvm->mmu_lock while initialising vcpu->arch.vncr_tlb 2026-06-08 8:11 [PATCH] KVM: arm64: Hold kvm->mmu_lock while initialising vcpu->arch.vncr_tlb Marc Zyngier 2026-06-08 16:34 ` Oliver Upton @ 2026-06-08 20:55 ` Yosry Ahmed 2026-06-09 7:25 ` Marc Zyngier 1 sibling, 1 reply; 5+ messages in thread From: Yosry Ahmed @ 2026-06-08 20:55 UTC (permalink / raw) To: Marc Zyngier Cc: kvmarm, kvm, linux-arm-kernel, Steffen Eiden, Joey Gouly, Suzuki K Poulose, Oliver Upton, Zenghui Yu On Mon, Jun 08, 2026 at 09:11:08AM +0100, Marc Zyngier wrote: > Sashiko reports that there is a race between initialising vncr_tlb > and making use of it, as we don't hold the mmu_lock at this point. > > Additionally, it identifies a memory leak, should userspace repeatedly > invokes the KVM_RUN ioctl after a failure of kvm_arch_vcpu_run_pid_change(), > as we assign vncr_tlb blindly on first run, irrespective of prior > allocations. > > Slap the two bugs in one go by taking the kvm->mmu_lock on assigning > vncr_tlb, preventing the race for good, and by checking that vncr_tlb > is indeed NULL prior to allocation. > > Reported-by: Sashiko <sashiko-bot@kernel.org> > Signed-off-by: Marc Zyngier <maz@kernel.org> > Link: https://lore.kernel.org/r/20260607180815.85FBC1F00893@smtp.kernel.org > --- > arch/arm64/kvm/nested.c | 10 ++++++++-- > 1 file changed, 8 insertions(+), 2 deletions(-) > > diff --git a/arch/arm64/kvm/nested.c b/arch/arm64/kvm/nested.c > index 690b8e8564166..d11e36b3cfcc2 100644 > --- a/arch/arm64/kvm/nested.c > +++ b/arch/arm64/kvm/nested.c > @@ -1253,8 +1253,14 @@ int kvm_vcpu_allocate_vncr_tlb(struct kvm_vcpu *vcpu) > if (!kvm_has_feat(vcpu->kvm, ID_AA64MMFR4_EL1, NV_frac, NV2_ONLY)) > return 0; > > - vcpu->arch.vncr_tlb = kzalloc_obj(*vcpu->arch.vncr_tlb, > - GFP_KERNEL_ACCOUNT); > + if (!vcpu->arch.vncr_tlb) { > + struct vncr_tlb *vt = kzalloc_obj(*vcpu->arch.vncr_tlb, > + GFP_KERNEL_ACCOUNT); > + > + scoped_guard(write_lock, &vcpu->kvm->mmu_lock) > + vcpu->arch.vncr_tlb = vt; > + } (I am not familiar with this code at all, so apologies in advance if I am making an idiot out of myself here) IIUC, the point of holding the lock here is *not* to protect against concurrent initialization, as in this case the NULL check needs to be done under the lock. Rather, the goal is to prevent re-ordering of zeroing from kzalloc and the assignment to vcpu->arch.vncr_tlb, by depending on the barriers provided by the lock. The lock is held by the readers so holding it here conviently means we do not need to add any barriers to the readers. Is my understanding correct? If yes, I think the code looks confusing, at least to a layman like myself. It initially seems like the lock protects against concurrent initializations, but then the NULL check is not done again under the lock. The goal of the lock is not clear without the original report. Mayeb it's clearer to explicitly use barriers if the goal is preventing reordering? ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] KVM: arm64: Hold kvm->mmu_lock while initialising vcpu->arch.vncr_tlb 2026-06-08 20:55 ` Yosry Ahmed @ 2026-06-09 7:25 ` Marc Zyngier 2026-06-09 17:57 ` Yosry Ahmed 0 siblings, 1 reply; 5+ messages in thread From: Marc Zyngier @ 2026-06-09 7:25 UTC (permalink / raw) To: Yosry Ahmed Cc: kvmarm, kvm, linux-arm-kernel, Steffen Eiden, Joey Gouly, Suzuki K Poulose, Oliver Upton, Zenghui Yu On Mon, 08 Jun 2026 21:55:25 +0100, Yosry Ahmed <yosry@kernel.org> wrote: > > On Mon, Jun 08, 2026 at 09:11:08AM +0100, Marc Zyngier wrote: > > Sashiko reports that there is a race between initialising vncr_tlb > > and making use of it, as we don't hold the mmu_lock at this point. > > > > Additionally, it identifies a memory leak, should userspace repeatedly > > invokes the KVM_RUN ioctl after a failure of kvm_arch_vcpu_run_pid_change(), > > as we assign vncr_tlb blindly on first run, irrespective of prior > > allocations. > > > > Slap the two bugs in one go by taking the kvm->mmu_lock on assigning > > vncr_tlb, preventing the race for good, and by checking that vncr_tlb > > is indeed NULL prior to allocation. > > > > Reported-by: Sashiko <sashiko-bot@kernel.org> > > Signed-off-by: Marc Zyngier <maz@kernel.org> > > Link: https://lore.kernel.org/r/20260607180815.85FBC1F00893@smtp.kernel.org > > --- > > arch/arm64/kvm/nested.c | 10 ++++++++-- > > 1 file changed, 8 insertions(+), 2 deletions(-) > > > > diff --git a/arch/arm64/kvm/nested.c b/arch/arm64/kvm/nested.c > > index 690b8e8564166..d11e36b3cfcc2 100644 > > --- a/arch/arm64/kvm/nested.c > > +++ b/arch/arm64/kvm/nested.c > > @@ -1253,8 +1253,14 @@ int kvm_vcpu_allocate_vncr_tlb(struct kvm_vcpu *vcpu) > > if (!kvm_has_feat(vcpu->kvm, ID_AA64MMFR4_EL1, NV_frac, NV2_ONLY)) > > return 0; > > > > - vcpu->arch.vncr_tlb = kzalloc_obj(*vcpu->arch.vncr_tlb, > > - GFP_KERNEL_ACCOUNT); > > + if (!vcpu->arch.vncr_tlb) { > > + struct vncr_tlb *vt = kzalloc_obj(*vcpu->arch.vncr_tlb, > > + GFP_KERNEL_ACCOUNT); > > + > > + scoped_guard(write_lock, &vcpu->kvm->mmu_lock) > > + vcpu->arch.vncr_tlb = vt; > > + } > > (I am not familiar with this code at all, so apologies in advance if I > am making an idiot out of myself here) > > IIUC, the point of holding the lock here is *not* to protect against > concurrent initialization, as in this case the NULL check needs to be > done under the lock. > > Rather, the goal is to prevent re-ordering of zeroing from kzalloc and > the assignment to vcpu->arch.vncr_tlb, by depending on the barriers > provided by the lock. The lock is held by the readers so holding it here > conviently means we do not need to add any barriers to the readers. > > Is my understanding correct? It is correct. > > If yes, I think the code looks confusing, at least to a layman like > myself. It initially seems like the lock protects against concurrent > initializations, but then the NULL check is not done again under the > lock. The goal of the lock is not clear without the original report. > > Mayeb it's clearer to explicitly use barriers if the goal is preventing > reordering? This would require both the initialisation of vncr_tlb to use a store release, *and* all the other call sites to use a load acquire. I really don't think it is worth the churn, nor the (very small) burden on the readers. Thanks, M. -- Without deviation from the norm, progress is not possible. ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] KVM: arm64: Hold kvm->mmu_lock while initialising vcpu->arch.vncr_tlb 2026-06-09 7:25 ` Marc Zyngier @ 2026-06-09 17:57 ` Yosry Ahmed 0 siblings, 0 replies; 5+ messages in thread From: Yosry Ahmed @ 2026-06-09 17:57 UTC (permalink / raw) To: Marc Zyngier Cc: kvmarm, kvm, linux-arm-kernel, Steffen Eiden, Joey Gouly, Suzuki K Poulose, Oliver Upton, Zenghui Yu > > If yes, I think the code looks confusing, at least to a layman like > > myself. It initially seems like the lock protects against concurrent > > initializations, but then the NULL check is not done again under the > > lock. The goal of the lock is not clear without the original report. > > > > Mayeb it's clearer to explicitly use barriers if the goal is preventing > > reordering? > > This would require both the initialisation of vncr_tlb to use a store > release, *and* all the other call sites to use a load acquire. > > I really don't think it is worth the churn, nor the (very small) > burden on the readers. That's fair. I was mainly just pointing out my initial confusion and that others may share it. Avoiding the churn on the readers' side is understandable. Maybe a comment here would help explain why the lock needs to be held? ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2026-06-09 17:57 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2026-06-08 8:11 [PATCH] KVM: arm64: Hold kvm->mmu_lock while initialising vcpu->arch.vncr_tlb Marc Zyngier 2026-06-08 16:34 ` Oliver Upton 2026-06-08 20:55 ` Yosry Ahmed 2026-06-09 7:25 ` Marc Zyngier 2026-06-09 17:57 ` Yosry Ahmed
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox