From: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
To: Sean Christopherson <seanjc@google.com>
Cc: kvm@vger.kernel.org, pbonzini@redhat.com, tglx@linutronix.de,
leobras@redhat.com, linux-kernel@vger.kernel.org,
mingo@redhat.com, bp@alien8.de, dave.hansen@linux.intel.com,
x86@kernel.org
Subject: Re: [PATCH] KVM: x86: Always enable legacy fp/sse
Date: Wed, 17 Aug 2022 12:03:22 +0100 [thread overview]
Message-ID: <YvzK+slWoAvm0/Wn@work-vm> (raw)
In-Reply-To: <YvwODUu/rdzjzDjk@google.com>
* Sean Christopherson (seanjc@google.com) wrote:
> On Tue, Aug 16, 2022, Dr. David Alan Gilbert (git) wrote:
> > From: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
> >
> > A live migration under qemu is currently failing when the source
> > host is ~Nehalem era (pre-xsave) and the destination is much newer,
> > (configured with a guest CPU type of Nehalem).
> > QEMU always calls kvm_put_xsave, even on this combination because
> > KVM_CAP_CHECK_EXTENSION_VM always returns true for KVM_CAP_XSAVE.
> >
> > When QEMU calls kvm_put_xsave it's rejected by
> > fpu_copy_uabi_to_guest_fpstate->
> > copy_uabi_to_xstate->
> > validate_user_xstate_header
> >
> > when the validate checks the loaded xfeatures against
> > user_xfeatures, which it finds to be 0.
> >
> > I think our initialisation of user_xfeatures is being
> > too strict here, and we should always allow the base FP/SSE.
> >
> > Fixes: ad856280ddea ("x86/kvm/fpu: Limit guest user_xfeatures to supported bits of XCR0")
> > bz: https://bugzilla.redhat.com/show_bug.cgi?id=2079311
> >
> > Signed-off-by: Dr. David Alan Gilbert <dgilbert@redhat.com>
> > ---
> > arch/x86/kvm/cpuid.c | 3 ++-
> > 1 file changed, 2 insertions(+), 1 deletion(-)
> >
> > diff --git a/arch/x86/kvm/cpuid.c b/arch/x86/kvm/cpuid.c
> > index de6d44e07e34..3b2319cecfd1 100644
> > --- a/arch/x86/kvm/cpuid.c
> > +++ b/arch/x86/kvm/cpuid.c
> > @@ -298,7 +298,8 @@ static void kvm_vcpu_after_set_cpuid(struct kvm_vcpu *vcpu)
> > guest_supported_xcr0 =
> > cpuid_get_supported_xcr0(vcpu->arch.cpuid_entries, vcpu->arch.cpuid_nent);
> >
> > - vcpu->arch.guest_fpu.fpstate->user_xfeatures = guest_supported_xcr0;
> > + vcpu->arch.guest_fpu.fpstate->user_xfeatures = guest_supported_xcr0 |
> > + XFEATURE_MASK_FPSSE;
Hi Sean,
Thanks for the reply,
> I don't think this is correct. This will allow the guest to set the SSE bit
> even when XSAVE isn't supported due to kvm_guest_supported_xcr0() returning
> user_xfeatures.
>
> static inline u64 kvm_guest_supported_xcr0(struct kvm_vcpu *vcpu)
> {
> return vcpu->arch.guest_fpu.fpstate->user_xfeatures;
> }
>
> I believe the right place to fix this is in validate_user_xstate_header(). It's
> reachable if and only if XSAVE is supported in the host, and when XSAVE is _not_
> supported, the kernel unconditionally allows FP+SSE. So it follows that the kernel
> should also allow FP+SSE when using XSAVE too. That would also align the logic
> with fpu_copy_guest_fpstate_to_uabi(), which fordces the FPSSE flags. Ditto for
> the non-KVM save_xstate_epilog().
OK, yes, I'd followed the check that failed down to this test; although
by itself this test works until Leo's patch came along later; so I
wasn't sure where to fix it.
> Aha! And fpu__init_system_xstate() ensure the host supports FP+SSE when XSAVE
> is enabled (knew their had to be a sanity check somewhere).
>
> ---
> arch/x86/kernel/fpu/xstate.c | 9 +++++++--
> 1 file changed, 7 insertions(+), 2 deletions(-)
>
> diff --git a/arch/x86/kernel/fpu/xstate.c b/arch/x86/kernel/fpu/xstate.c
> index c8340156bfd2..83b9a9653d47 100644
> --- a/arch/x86/kernel/fpu/xstate.c
> +++ b/arch/x86/kernel/fpu/xstate.c
> @@ -399,8 +399,13 @@ int xfeature_size(int xfeature_nr)
> static int validate_user_xstate_header(const struct xstate_header *hdr,
> struct fpstate *fpstate)
> {
> - /* No unknown or supervisor features may be set */
> - if (hdr->xfeatures & ~fpstate->user_xfeatures)
> + /*
> + * No unknown or supervisor features may be set. Userspace is always
> + * allowed to restore FP+SSE state (XSAVE/XRSTOR are used by the kernel
> + * if and only if FP+SSE are supported in xstate).
> + */
> + if (hdr->xfeatures & ~fpstate->user_xfeatures &
> + ~(XFEATURE_MASK_FP | XFEATURE_MASK_SSE))
> return -EINVAL;
>
> /* Userspace must use the uncompacted format */
That passes the small smoke test for me; will you repost that then?
Thanks,
Dave
> base-commit: de3d415edca23831c5d1f24f10c74a715af7efdb
> --
>
--
Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK
next prev parent reply other threads:[~2022-08-17 11:03 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-08-16 17:59 [PATCH] KVM: x86: Always enable legacy fp/sse Dr. David Alan Gilbert (git)
2022-08-16 21:37 ` Sean Christopherson
2022-08-17 3:29 ` Leonardo Brás
2022-08-17 8:45 ` Paolo Bonzini
2022-08-17 11:03 ` Dr. David Alan Gilbert [this message]
2022-08-17 16:11 ` Sean Christopherson
2022-08-17 16:14 ` Dr. David Alan Gilbert
2022-08-23 0:15 ` Sean Christopherson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YvzK+slWoAvm0/Wn@work-vm \
--to=dgilbert@redhat.com \
--cc=bp@alien8.de \
--cc=dave.hansen@linux.intel.com \
--cc=kvm@vger.kernel.org \
--cc=leobras@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@redhat.com \
--cc=pbonzini@redhat.com \
--cc=seanjc@google.com \
--cc=tglx@linutronix.de \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox