Re: [PATCH 05/22] KVM: arm64: nv: Pass an access descriptor for stage-2 walks

All of lore.kernel.org
 help / color / mirror / Atom feed

From: sashiko-bot@kernel.org
To: "Oliver Upton" <oupton@kernel.org>
Cc: "Marc Zyngier" <maz@kernel.org>, kvmarm@lists.linux.dev
Subject: Re: [PATCH 05/22] KVM: arm64: nv: Pass an access descriptor for stage-2 walks
Date: Tue, 23 Jun 2026 19:06:07 +0000	[thread overview]
Message-ID: <20260623190607.7106B1F000E9@smtp.kernel.org> (raw)
In-Reply-To: <20260623184201.1518871-6-oupton@kernel.org>

Thank you for your contribution! Sashiko AI review found 5 potential issue(s) to consider:

Pre-existing issues:
- [Critical] __kvm_at_s12 invokes a nested stage-2 page table walk without holding the KVM SRCU read lock, causing a Use-After-Free.
- [Critical] Missing SRCU read lock in Synchronous External Abort (SEA) handling causes a Use-After-Free on memslots during __kvm_find_s1_desc_level.
- [High] kvm_s2_handle_perm_fault incorrectly checks execute permissions instead of read/write permissions for a stage-1 PTW triggered by an instruction fetch.
- [High] __kvm_at_s12 corrupts the intermediate physical address (IPA) for guests using 16KB or 64KB translation granules.
- [High] __kvm_at_s12 silently truncates 52-bit intermediate physical addresses when reading the stage-1 translation result from PAR_EL1.
--

commit 2325baae11b7348a33c6e1cec8c8a1a565d36558
Author: Oliver Upton <oupton@kernel.org>
Subject: KVM: arm64: nv: Pass an access descriptor for stage-2 walks

This commit modifies the stage-2 page table walk API to take an access
descriptor instead of just the IPA. This provides sufficient context to the
PTW to determine the correct behavior for access-dependent features like
FEAT_HAFDBS.

> diff --git a/arch/arm64/kvm/at.c b/arch/arm64/kvm/at.c
> index 7a84495a2e6d4..083014e9d86a3 100644
> --- a/arch/arm64/kvm/at.c
> +++ b/arch/arm64/kvm/at.c
> @@ -1642,9 +1655,11 @@ int __kvm_at_s12(struct kvm_vcpu *vcpu, u32 op, u64 vaddr)
>  		return 0;
>  
>  	/* Do the stage-2 translation */
> -	ipa = (par & GENMASK_ULL(47, 12)) | (vaddr & GENMASK_ULL(11, 0));
> +	access.type = WALK_ACCESS_AT;
> +	access.ia = (par & GENMASK_ULL(47, 12)) | (vaddr & GENMASK_ULL(11, 0));

[Severity: High]
This is a pre-existing issue, but does this bitwise AND operation silently
truncate 52-bit intermediate physical addresses?

If the CPU and guest support 52-bit physical addresses (FEAT_LPA or
FEAT_LPA2), the physical address occupies bits up to 51. The
GENMASK_ULL(47, 12) truncates bits [51:48], feeding a 48-bit address into
kvm_walk_nested_s2() which breaks nested guests utilizing memory addresses
beyond the 48-bit boundary.

[Severity: High]
This isn't a bug introduced by this patch, but does this assume a 4KB granule
and corrupt the IPA for guests using 16KB or 64KB translation granules?

For 16KB and 64KB translation granules, the page offset spans 14 bits and
16 bits respectively. The hardware returns a PAR_EL1 where bits [15:12] are
RES0 for a 64KB granule. Because the offset is extracted exclusively from
vaddr & GENMASK_ULL(11, 0), bits [15:12] of the reconstructed IPA will be
zeroed out, causing the stage-2 walk to use the wrong address.

>  	out.esr = 0;
> -	ret = kvm_walk_nested_s2(vcpu, ipa, &out);
> +
> +	ret = kvm_walk_nested_s2(vcpu, &access, &out);

[Severity: Critical]
This is a pre-existing issue, but does this invoke a nested stage-2 page
table walk without holding the KVM SRCU read lock?

kvm_walk_nested_s2() reads from guest memory using kvm_read_guest(), which
dereferences the RCU-protected memslots array. If host userspace unmaps or
modifies the VM's memslots concurrently while the nested hypervisor executes
an AT S12 instruction, can this result in a use-after-free condition leading
to host kernel memory corruption?

[ ... ]
> diff --git a/arch/arm64/kvm/mmu.c b/arch/arm64/kvm/mmu.c
> index f35c4ce954731..88998195274b9 100644
> --- a/arch/arm64/kvm/mmu.c
> +++ b/arch/arm64/kvm/mmu.c
> @@ -2313,9 +2313,23 @@ int kvm_handle_guest_abort(struct kvm_vcpu *vcpu)
>  	 */
>  	if (kvm_is_nested_s2_mmu(vcpu->kvm,vcpu->arch.hw_mmu) &&
>  	    vcpu->arch.hw_mmu->nested_stage2_enabled) {
> +		struct kvm_walk_access access = {
> +			.ia	= fault_ipa,
> +		};
>  		u32 esr;
>  
> -		ret = kvm_walk_nested_s2(vcpu, fault_ipa, &nested_trans);
> +		if (kvm_vcpu_abt_iss1tw(vcpu))
> +			access.type = WALK_ACCESS_S1PTW;
> +		else if (is_iabt)
> +			access.type = WALK_ACCESS_IFETCH;
> +		else if (kvm_vcpu_dabt_is_cm(vcpu))
> +			access.type = WALK_ACCESS_CMO;
> +		else
> +			access.type = WALK_ACCESS_LDST;
> +
> +		access.write = kvm_is_write_fault(vcpu);
> +
> +		ret = kvm_walk_nested_s2(vcpu, &access, &nested_trans);
>  		if (ret == -EAGAIN) {
>  			ret = 1;
>  			goto out_unlock;

[Severity: High]
This isn't a bug introduced by this patch, but does kvm_s2_handle_perm_fault()
incorrectly check execute permissions instead of read/write permissions for a
stage-1 PTW triggered by an instruction fetch?

When an instruction fetch triggers a stage-1 page table walk (S1PTW) that
causes a stage-2 permission fault, the resulting exception is reported as an
instruction abort (IABT) with the S1PTW bit set. A page table walk is a data
read, so the hardware checks stage-2 read/write permissions.

However, kvm_s2_handle_perm_fault() evaluates this as an instruction fetch:

arch/arm64/kvm/nested.c:kvm_s2_handle_perm_fault() {
    ...
	if (kvm_vcpu_trap_is_iabt(vcpu)) {
		if (vcpu_mode_priv(vcpu))
			forward_fault = !trans->px;
		else
			forward_fault = !trans->ux;
	} else {
    ...
}

Because stage-2 page tables mapping the stage-1 page tables are typically
marked Execute-Never (XN), will KVM spuriously inject a stage-2 permission
fault into the nested hypervisor?

[Severity: Critical]
This is a pre-existing issue, but does handling a Synchronous External Abort
(SEA) without the SRCU read lock cause a use-after-free during
__kvm_find_s1_desc_level()?

When KVM handles an SEA fault, kvm_handle_guest_abort() delegates to
kvm_inject_sea() before acquiring the SRCU read lock:

arch/arm64/kvm/mmu.c:kvm_handle_guest_abort() {
    ...
	if (esr_fsc_is_translation_fault(esr)) {
		/* Beyond sanitised PARange (which is the IPA limit) */
		if (fault_ipa >= BIT_ULL(get_kvm_ipa_limit())) {
			kvm_inject_size_fault(vcpu);
			return 1;
		}

		/* Falls between the IPA range and the PARange? */
		if (fault_ipa >= BIT_ULL(VTCR_EL2_IPA(vcpu->arch.hw_mmu->vtcr))) {
			fault_ipa |= FAR_TO_FIPA_OFFSET(kvm_vcpu_get_hfar(vcpu));

			return kvm_inject_sea(vcpu, is_iabt, fault_ipa);
		}
	}
    ...
}

If the SEA occurred during a stage-1 page table walk (S1PTW), it cascades
to inject_abt64(), which calls __kvm_find_s1_desc_level():

arch/arm64/kvm/inject_fault.c:inject_abt64() {
    ...
		ret = __kvm_find_s1_desc_level(vcpu, addr, hpfar, &level);
    ...
}

This ultimately calls walk_s1() which relies on kvm_read_guest() to read the
descriptors. Because the SRCU read lock is not held, can the memslot lookup
race with concurrent host userspace memslot modifications, causing a
use-after-free?

-- 
Sashiko AI review · https://sashiko.dev/#/patchset/20260623184201.1518871-1-oupton@kernel.org?part=5

next prev parent reply	other threads:[~2026-06-23 19:06 UTC|newest]

Thread overview: 40+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-06-23 18:41 [PATCH 00/22] KVM: arm64: nv: Implement FEAT_HAFDBS, FEAT_HAFT Oliver Upton
2026-06-23 18:41 ` [PATCH 01/22] KVM: arm64: nv: Introduce struct for stage-2 walk step Oliver Upton
2026-06-23 18:41 ` [PATCH 02/22] KVM: arm64: nv: Consolidate computation of stage-2 permissions Oliver Upton
2026-06-23 18:57   ` sashiko-bot
2026-06-23 18:41 ` [PATCH 03/22] KVM: arm64: nv: Get rid of kvm_s2_trans*() accessors Oliver Upton
2026-06-23 18:41 ` [PATCH 04/22] KVM: arm64: nv: Only shadow writable-dirty guest descs as writable Oliver Upton
2026-06-23 18:58   ` sashiko-bot
2026-06-23 20:05     ` Oliver Upton
2026-06-23 18:41 ` [PATCH 05/22] KVM: arm64: nv: Pass an access descriptor for stage-2 walks Oliver Upton
2026-06-23 19:06   ` sashiko-bot [this message]
2026-06-23 18:41 ` [PATCH 06/22] KVM: arm64: nv: Use a helper for stage-2 descriptor updates Oliver Upton
2026-06-23 18:41 ` [PATCH 07/22] KVM: arm64: nv: Set dirty state at stage-2 Oliver Upton
2026-06-23 19:03   ` sashiko-bot
2026-06-23 18:41 ` [PATCH 08/22] KVM: arm64: nv: Treat DBM as writable " Oliver Upton
2026-06-23 18:55   ` sashiko-bot
2026-06-23 20:08     ` Oliver Upton
2026-06-23 18:41 ` [PATCH 09/22] KVM: arm64: Compute S1 permissions as part of s1_walk() Oliver Upton
2026-06-23 18:41 ` [PATCH 10/22] KVM: arm64: Plumb through access descriptor for stage-1 Oliver Upton
2026-06-23 18:41 ` [PATCH 11/22] KVM: arm64: Use a struct for stage-1 walk context Oliver Upton
2026-06-23 18:41 ` [PATCH 12/22] KVM: arm64: Create helper for stage-1 descriptor updates Oliver Upton
2026-06-23 18:55   ` sashiko-bot
2026-06-23 18:41 ` [PATCH 13/22] KVM: arm64: Set dirty state at stage-1 Oliver Upton
2026-06-23 18:54   ` sashiko-bot
2026-06-23 18:41 ` [PATCH 14/22] KVM: arm64: Grant write permission when DBM is set at S1 Oliver Upton
2026-06-23 18:57   ` sashiko-bot
2026-06-23 18:41 ` [PATCH 15/22] KVM: arm64: Don't update descriptors for "non-arch" access Oliver Upton
2026-06-23 18:41 ` [PATCH 16/22] KVM: arm64: nv: Expose FEAT_HAFDBS Oliver Upton
2026-06-23 19:01   ` sashiko-bot
2026-06-23 18:41 ` [PATCH 17/22] KVM: arm64: Set Access flag on table descriptors at stage-1 Oliver Upton
2026-06-23 20:56   ` sashiko-bot
2026-06-23 18:41 ` [PATCH 18/22] KVM: arm64: nv: Set access flag on table descriptors at stage-2 Oliver Upton
2026-06-23 19:05   ` sashiko-bot
2026-06-23 20:14     ` Oliver Upton
2026-06-23 18:41 ` [PATCH 19/22] KVM: arm64: nv: Expose FEAT_HAFT Oliver Upton
2026-06-23 19:05   ` sashiko-bot
2026-06-23 18:41 ` [PATCH 20/22] KVM: arm64: selftests: Only test AF behavior for emulated AT insns Oliver Upton
2026-06-23 18:42 ` [PATCH 21/22] KVM: arm64: selftests: Test AT emulation for FEAT_HAFT Oliver Upton
2026-06-23 19:05   ` sashiko-bot
2026-06-23 20:17     ` Oliver Upton
2026-06-23 18:42 ` [PATCH 22/22] HACK: KVM: arm64: nv: Set the dirty state for CMOs that fetch for write Oliver Upton

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20260623190607.7106B1F000E9@smtp.kernel.org \
    --to=sashiko-bot@kernel.org \
    --cc=kvmarm@lists.linux.dev \
    --cc=maz@kernel.org \
    --cc=oupton@kernel.org \
    --cc=sashiko-reviews@lists.linux.dev \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.