From: Mostafa Saleh <smostafa@google.com>
To: Marc Zyngier <maz@kernel.org>
Cc: "Aneesh Kumar K.V" <aneesh.kumar@kernel.org>,
kvmarm@lists.linux.dev, Will Deacon <will@kernel.org>,
Quentin Perret <qperret@google.com>
Subject: Re: pkvm boot failures
Date: Tue, 10 Jun 2025 09:06:22 +0000 [thread overview]
Message-ID: <aEf1jhjFwbaWQgPP@google.com> (raw)
In-Reply-To: <87ldq0f3rx.wl-maz@kernel.org>
Hi Marc,
On Tue, Jun 10, 2025 at 08:34:58AM +0100, Marc Zyngier wrote:
> Hi Mostafa,
>
> Thanks for looking into this.
>
> On Mon, 09 Jun 2025 18:25:15 +0100,
> Mostafa Saleh <smostafa@google.com> wrote:
> >
> > On Mon, Jun 09, 2025 at 06:53:40PM +0530, Aneesh Kumar K.V wrote:
> > >
> > > I am hitting the below failure with v6.15 (I tried other kernel versions
> > > with similar results). I disabled CONFIG_PROTECTED_NVHE_STACKTRACE
> > > because with CONFIG_NVHE_EL2_DEBUG, the stack was pointing at
> > > hyp_assert_lock_held() .
> > >
> > > [ 0.664457] kvm [1]: nVHE hyp panic at: [<ffff80008050b10c>] __kvm_nvhe_handle_trap+0x34/0x10c!
> > > [ 0.664538] kvm [1]: Cannot dump pKVM nVHE stacktrace: !CONFIG_PROTECTED_NVHE_STACKTRACE
> > > [ 0.664566] kvm [1]: Hyp Offset: 0xffff000007c00000
> > > [ 0.664631] Kernel panic - not syncing: HYP panic:
> > > [ 0.664631] PS:614023c9 PC:000080007890b10c ESR:0000000096000007
> > > [ 0.664631] FAR:0000800078c252f0 HPFAR:0000000000000000 PAR:0000000000000000
> > > [ 0.664631] VCPU:0000000000000000
> > > [ 0.664938] CPU: 0 UID: 0 PID: 1 Comm: swapper/0 Not tainted 6.15.0-rc1 #594 NONE
> > > [ 0.665068] Hardware name: FVP Base RevC (DT)
> > > [ 0.665140] Call trace:
> > > [ 0.665196] show_stack+0x18/0x24 (C)
> > > [ 0.665346] dump_stack_lvl+0x3c/0x80
> > > [ 0.665468] dump_stack+0x18/0x24
> > > [ 0.665588] panic+0x124/0x2d8
> > > [ 0.665699] nvhe_hyp_panic_handler+0x108/0x180
> > > [ 0.665825] do_pkvm_init+0xb0/0x124
> > > [ 0.665957] do_pkvm_init+0xb0/0x124
> > > [ 0.666089] kvm_hyp_init_protection+0x5c/0x6c
> > > [ 0.666226] init_hyp_mode+0x760/0x790
> > > [ 0.666362] kvm_arm_init+0xac/0x23c
> > > [ 0.666492] do_one_initcall+0xa0/0x1f0
> > > [ 0.666617] do_initcall_level+0x8c/0xac
> > > [ 0.666753] do_initcalls+0x54/0x94
> > > [ 0.666885] do_basic_setup+0x18/0x24
> > > [ 0.667019] kernel_init_freeable+0xc0/0x10c
> > > [ 0.667157] kernel_init+0x20/0x118
> > > [ 0.667271] ret_from_fork+0x10/0x20
> > > [ 0.667400] SMP: stopping secondary CPUs
> > > [ 0.667475] Kernel Offset: disabled
> > > [ 0.667534] CPU features: 0x0000,00000140,064dc298,cb7a552f
> > > [ 0.667619] Memory Limit: none
> > > [ 0.667681] ---[ end Kernel panic - not syncing: HYP panic:
> > > [ 0.667681] PS:614023c9 PC:000080007890b10c ESR:0000000096000007
> > > [ 0.667681] FAR:0000800078c252f0 HPFAR:0000000000000000 PAR:0000000000000000
> > > [ 0.667681] VCPU:0000000000000000 ]
> > >
> > > I was able to locate a .config that make the pkvm work, But i am not
> > > able to identify which config dependency is making the difference. I am
> > > attaching below the working and non working kernel configs. I am using
> > > FVP to test this.
> > >
> >
> > I had a look at this and tracked the issue to "CONFIG_JUMP_LABEL=n"
> > It seems that it panics at
> > if (static_branch_unlikely(&kvm_protected_mode_initialized))
> > Where "kvm_protected_mode_initialized" is mapped in the initial PGD for the
> > hypervisor, but not mapped in the hypervisor created one.
> > As the variable is defined outside the hypervisor namespace, it doesn’t exist
> > in the hyp bss section.
> > And in case of "CONFIG_JUMP_LABEL=n" it won't be patched in this case, causing
> > next access to read the variable and panicking.
>
> It really begs the question: why do we even support JUMP_LABEL=n? It
> really feels like a backward configuration, and I'd be very glad to
> either mark it as "always on", or make KVM depend on it.
>
> > I guess moving this key to hyp would cause problems with kernel access after
> > de-privilege as cases from kvm_share_hyp(), So I can only think of having a
> > different key for the hypervisor as
> >
> > diff --git a/arch/arm64/kvm/hyp/nvhe/hyp-main.c b/arch/arm64/kvm/hyp/nvhe/hyp-main.c
> > index 8e8848de4d47..8945b335bcea 100644
> > --- a/arch/arm64/kvm/hyp/nvhe/hyp-main.c
> > +++ b/arch/arm64/kvm/hyp/nvhe/hyp-main.c
> > @@ -21,6 +21,7 @@
> > #include <nvhe/trap_handler.h>
> >
> > DEFINE_PER_CPU(struct kvm_nvhe_init_params, kvm_init_params);
> > +DEFINE_STATIC_KEY_FALSE(kvm_protected_mode_initialized_hyp);
> >
> > void __kvm_hyp_host_forward_smc(struct kvm_cpu_context *host_ctxt);
> >
> > @@ -626,7 +627,7 @@ static void handle_host_hcall(struct kvm_cpu_context *host_ctxt)
> > * basis. This is all fine, however, since __pkvm_prot_finalize
> > * returns -EPERM after the first call for a given CPU.
> > */
> > - if (static_branch_unlikely(&kvm_protected_mode_initialized))
> > + if (static_branch_unlikely(&kvm_protected_mode_initialized_hyp))
> > hcall_min = __KVM_HOST_SMCCC_FUNC___pkvm_prot_finalize;
> >
> > id &= ~ARM_SMCCC_CALL_HINTS;
> > diff --git a/arch/arm64/kvm/pkvm.c b/arch/arm64/kvm/pkvm.c
> > index fcd70bfe44fb..af0854e98902 100644
> > --- a/arch/arm64/kvm/pkvm.c
> > +++ b/arch/arm64/kvm/pkvm.c
> > @@ -17,6 +17,7 @@
> > #include "hyp_constants.h"
> >
> > DEFINE_STATIC_KEY_FALSE(kvm_protected_mode_initialized);
> > +DECLARE_STATIC_KEY_FALSE(kvm_nvhe_sym(kvm_protected_mode_initialized_hyp));
> >
> > static struct memblock_region *hyp_memory = kvm_nvhe_sym(hyp_memory);
> > static unsigned int *hyp_memblock_nr_ptr = &kvm_nvhe_sym(hyp_memblock_nr);
> > @@ -229,6 +230,7 @@ static int __init pkvm_drop_host_privileges(void)
> > * once the host stage 2 is installed.
> > */
> > static_branch_enable(&kvm_protected_mode_initialized);
> > + static_branch_enable(&kvm_nvhe_sym(kvm_protected_mode_initialized_hyp));
> > on_each_cpu(_kvm_host_prot_finalize, &ret, 1);
> > return ret;
> > }
> >
>
> I don't really enjoy this duplication, and unless we have a good
> reason not too, I'd rather have something like:
>
> diff --git a/arch/arm64/kvm/Kconfig b/arch/arm64/kvm/Kconfig
> index 713248f240e0..66d232e7c894 100644
> --- a/arch/arm64/kvm/Kconfig
> +++ b/arch/arm64/kvm/Kconfig
> @@ -37,6 +37,7 @@ menuconfig KVM
> select HAVE_KVM_VCPU_RUN_PID_CHANGE
> select SCHED_INFO
> select GUEST_PERF_EVENTS if PERF_EVENTS
> + select JUMP_LABEL
> help
> Support hosting virtualized guest machines.
>
> It should be OK now that all the supported compilers have asm goto
> support.
Yes, that looks better, I am not sure why that is not automatically
selected. AFAIK, it should be ok atleast for arm64.
There are some hidden problems though, if the condition is false, that
would panic after de-privilege, but as the key is toggled at
pkvm_drop_host_privileges() before the privilege drop, it should be fine
for now. (but not if the code is reworked)
Also, ofcourse static_branch_disable() won’t be supported.
I’d say, we can add a comment explaining that this key can't be read from
the hypervisor from memory, in case someone reworked this in the future.
Thanks,
Mostafa
>
> Thanks,
>
> M.
>
> --
> Jazz isn't dead. It just smells funny.
prev parent reply other threads:[~2025-06-10 9:06 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-06-09 13:23 pkvm boot failures Aneesh Kumar K.V
2025-06-09 16:24 ` Marc Zyngier
2025-06-09 17:25 ` Mostafa Saleh
2025-06-10 6:33 ` Aneesh Kumar K.V
2025-06-10 9:03 ` Mostafa Saleh
2025-06-10 7:34 ` Marc Zyngier
2025-06-10 9:06 ` Mostafa Saleh [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aEf1jhjFwbaWQgPP@google.com \
--to=smostafa@google.com \
--cc=aneesh.kumar@kernel.org \
--cc=kvmarm@lists.linux.dev \
--cc=maz@kernel.org \
--cc=qperret@google.com \
--cc=will@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.