Kernel KVM virtualization development
 help / color / mirror / Atom feed
From: Xiaoyao Li <xiaoyao.li@intel.com>
To: Binbin Wu <binbin.wu@linux.intel.com>, kvm@vger.kernel.org
Cc: pbonzini@redhat.com, seanjc@google.com,
	rick.p.edgecombe@intel.com, chao.gao@intel.com,
	kai.huang@intel.com
Subject: Re: [RFC PATCH 00/27] KVM: x86: Add a paranoid mode for CPUID verification
Date: Fri, 15 May 2026 16:08:33 +0800	[thread overview]
Message-ID: <50ccbe42-4b74-4c2a-b530-a367f7285de6@intel.com> (raw)
In-Reply-To: <20260417073610.3246316-1-binbin.wu@linux.intel.com>

On 4/17/2026 3:35 PM, Binbin Wu wrote:
> Hi,
> 
> This RFC series is to allow public capture of feedback from TDX
> developers before we have too much internal conversations on it and to
> initiate code review of Sashiko. It is not yet intended for review by
> KVM maintainers. Sean and Paolo, please feel free to ignore this version.
> 
> Originally, we had issues on TDX when a new hardware feature, which is a
> host state clobbering feature, is supported by new TDX modules/platforms.
> A host state clobbering feature requires KVM to save and restore the
> feature's related MSR(s) on host/guest transitions; otherwise, if the
> feature is used by TDs, the host state will be corrupted, leading to
> unexpected behavior on the host.
> 
> Currently KVM hardcodes a deny list for unsupported host clobbering
> features for TDX, i.e. HLE, RTM and WAITPKG. However, KVM can't keep a
> list of bits that it may not know about (e.g. the upcoming FRED support
> in TDX).
> 
> We had been working internally to propose a TDX specific solution to
> solve the host state clobbering feature issue. But during a PUCK meeting,
> Sean mentioned that KVM had a more permissive CPUID configuration
> interface than desired and there were problems due to it in the past for
> normal VMs as well. 

It will be better if some detailed example of problems on normal VMs can 
be provided.

> Sean suggested that KVM should introduce a more
> paranoid mode to check CPUID from userspace for VMs in general, as well
> as an opt-in interface for userspace. And TDX should use the
> infrastructure to enforce paranoid mode non-optionally.
 >
> This RFC patch series adds a paranoid CPUID verification mode for KVM on> x86, where KVM must be explicitly aware of every CPUID feature exposed
> to the guest. When the CPUID paranoid mode is opted-in by userspace or
> enforced, KVM will reject any unknown or unsupported feature from
> userspace. And it starts to enforce paranoid CPUID verification for TDX.

Regarding the opt-in interface (for normal VMs), I want to know what the 
benefit it brings for normal VMs when it's opted-in.

If it can make the host more robust or prevent potential attack from 
malicious userspace + guest, then it should certainly be forced on 
instead of userspace to opt-in.

If no good benefit, I doubt any userspace will opt it in. E.g, I can see 
one benefit without the paranoid mode: Userspace can expose the new 
simple x86 Instruction to guest before KVM supports it by adding one 
line of F(xxx) in kvm_initialize_cpu_caps()

> This patch series touches a lot of lines and involves many subtle CPUID
> details. We may not expect reviews on these CPUID leaf specific details
> yet, but feedback is welcome on the framework to build the CPUID overlays
> and how paranoid CPUID verification is implemented.
> 
> The changes are only tested on Intel platforms. Compile-tested only for
> SVM.

Regarding test, can you elaborate more? e.g., did you test the case for 
normal VMs? and what's the configuration of the normal VMs? e.g., the 
"-cpu xxx" parameter if you use QEMU. And if so, can you provide the 
QEMU branch that you used? (so that we can know how much change in QEMU 
is required to enable the paranoid mode)

> The series is organized in following parts:
> ===========================================
> - Patch 1 ~ 2:  Cleanup patches.
> 
> - Patch 3 ~ 11: Construct CPUID overlays
>    This part extends kvm_cpu_caps[] into a 2D array indexed by an "overlay"
>    dimension (CPUID_OL_DEFAULT, CPUID_OL_SVM, CPUID_OL_TDX), allowing
>    each overlay to maintain its own set of supported CPUID features.
>    Having separate overlays for VMX and TDX helps handle cases where
>    KVM's support for certain features differs on Intel-compatible
>    platforms, e.g., HLE, RTM and WAITPKG are not supported for TDX in
>    KVM. There will be new host state clobbering features like this in
>    the future.
>    Having separate overlays for VMX and SVM helps handle cases where a
>    common feature has support on one vendor but not the other. Setting
>    the support in common code requires additional handling in vendor
>    specific code, e.g., SVM code needs to clear IBT, BUS_LOCK_DETECT
>    and MSR_IMM.
>    More overlays could be added in the future if needed.
> 
>    KVM_GET_SUPPORTED_CPUID and KVM_GET_EMULATED_CPUID are also promoted
>    to VM-scoped IOCTLs so that userspace can query per-VM-type CPUID
>    capabilities. CPUID overlays are a KVM internal concept; the overlay is
>    decided by VM type and/or platform vendor.
> 
> - Patch 12 ~ 19: Build allowed CPUID values for different overlays
>    This part builds a comprehensive table of allowed CPUID values covering
>    the basic, extended, Centaur, and KVM paravirt CPUID ranges.
>    For each CPUID output register, the validation follows one of three
>    rules:
>    1. Ignored: the register is added to the ignored set and KVM skips
>       validation of the userspace-provided value.
>    2. Mask/value check: a new KVM-only CPUID leaf enum is defined with a
>       corresponding reverse_cpuid[] entry, and an allowed mask or fixed
>       value is initialized per-overlay.
>    3. Zero check: for reserved registers or registers where no bits are
>       supported, userspace input is checked against zero.
> 
> - Patch 20 ~ 25: Implement paranoid CPUID verification
>    This part adds CPUID paranoid verification to reject userspace CPUID
>    configurations that set unsupported or unknown bits when paranoid mode
>    is enabled for a VM.
>    Also, it adds the opt-in interface KVM_CAP_X86_CPUID_PARANOID for
>    userspace and unconditionally enforces CPUID paranoid mode for TDs.
> 
> - Patch 26 ~ 27: Remove the hardcoded filter for TDX.
>    This part removes the hardcoded deny list for unsupported host
>    clobbering features for TDX, and relies on the allowed mask for the TDX
>    overlay to filter and check generically.
> 
> Opens:
> ======
> - CPUID overlays VS. open-code checks for specific features in vendor
>    specific callbacks.
>    Open-code checks for specific features in vendor callback will have
>    less code changes, however, it tightly couples normal VM feature
>    enablement with TDX. If a new host-state-clobbering feature is added
>    for normal VMs, the developer has to remember to update the TDX filter
>    list(s). Or when a common x86 feature is added for only VMX/SVM, the
>    developer has to remember to clear the bit for the other vendor.
>    Relying solely on mailing list reviews to catch these omissions may be
>    more error-prone than using an overlay approach.

I prefer the approach in this series. Require explicitly enabling for 
each overlay will force to provide the justification when enable it.

> - This patch series uses a 2D array in common KVM code to accommodate KVM
>    CPUID capabilities for different overlays. This avoids adding init ops
>    and runtime ops to call into vendor modules for a few reasons:
>    1. kvm_ops_update() is called after ops->hardware_setup(), inside which
>       the KVM CPU capabilities are built, runtime x86 ops can not be
>       called. Need some workaround to allow it.
>    2. These inputs to build the KVM CPU capabilities for overlays are from
>       the common KVM code or via the common KVM code helpers, which make
>       the callbacks in vendor module just duplication of similar tedious
>       code.
>    But conceptually, putting vendor-specific overlay data in the related
>    vendor module is cleaner.
> 
> - This patch combines vCPU capability initialization and paranoid CPUID
>    verification. It refactors the vCPU capability initialization to iterate
>    over userspace CPUID entries rather than reverse_cpuid[], combining the
>    paranoid check with capability setup. The purpose is to avoid iterating
>    over CPUID entries twice for vCPU capability initialization and paranoid
>    check separately. However, this can change the code for vCPU capability
>    initialization a bit even when paranoid mode is disabled. It could be
>    separated if we want to minimize the change for the non-paranoid mode.

I don't think iterate twice matters. It's not hot path anyway. And 
strictly speaking, it iterates two different ranges:

- for paranoid check, it needs to iterate on cpuid_entries[]
- for capability setup, it needs to iterate on reverse_cpuid[]

What's more, I think putting the paranoid check in kvm_check_cpuid() 
fits more naturally.

> - This patch series checks a CPUID register if part of the 32-bit range
>    is reserved. I am not sure this is necessary for all cases. It could be
>    simplified if we believe these reserved bits won’t cause problems
>    according to the property of the CPUID register, so that they can be
>    treated as ignored registers.

I'm not clear on it. Do you mean this series checks the reserved bits 
must be 0?

  parent reply	other threads:[~2026-05-15  8:08 UTC|newest]

Thread overview: 41+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-04-17  7:35 [RFC PATCH 00/27] KVM: x86: Add a paranoid mode for CPUID verification Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 01/27] KVM: x86: Fix emulated CPUID features being applied to wrong sub-leaf Binbin Wu
2026-05-15  9:03   ` Xiaoyao Li
2026-04-17  7:35 ` [RFC PATCH 02/27] KVM: x86: Reorder the features for CPUID 7 Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 03/27] KVM: x86: Add definitions for CPUID overlays Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 04/27] KVM: x86: Extend F() and its variants " Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 05/27] KVM: x86: Extend kvm_cpu_cap_{set/clear}() to configure overlays Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 06/27] KVM: x86: Populate TDX CPUID overlay with supported feature bits Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 07/27] KVM: x86: Support KVM_GET_{SUPPORTED,EMULATED}_CPUID as VM scope ioctls Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 08/27] KVM: x86: Thread @kvm to KVM CPU capability helpers Binbin Wu
2026-04-21  6:18   ` Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 09/27] KVM: x86: Use overlays of KVM CPU capabilities Binbin Wu
2026-04-21  5:31   ` Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 10/27] KVM: x86: Use vendor-specific overlay flags instead of F_CPUID_DEFAULT Binbin Wu
2026-04-21  6:43   ` Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 11/27] KVM: SVM: Drop unnecessary clears of unsupported common x86 features Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 12/27] KVM: x86: Split KVM CPU cap leafs into two parts Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 13/27] KVM: x86: Add a helper to initialize CPUID multi-bit fields Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 14/27] KVM: x86: Add a helper to init multiple feature bits based on raw CPUID Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 15/27] KVM: x86: Add infrastructure to track CPUID entries ignored in paranoid mode Binbin Wu
2026-04-17  7:35 ` [RFC PATCH 16/27] KVM: x86: Init allowed masks for basic CPUID range " Binbin Wu
2026-04-21  6:51   ` Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 17/27] KVM: x86: Init allowed masks for extended " Binbin Wu
2026-04-21  7:55   ` Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 18/27] KVM: x86: Handle Centaur CPUID leafs " Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 19/27] KVM: x86: Track KVM PV CPUID features for " Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 20/27] KVM: x86: Add per-VM flag to track CPUID " Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 21/27] KVM: x86: Make kvm_vcpu_after_set_cpuid() return an error code Binbin Wu
2026-04-22  8:22   ` Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 22/27] KVM: x86: Verify userspace CPUID inputs in paranoid mode Binbin Wu
2026-04-22  8:59   ` Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 23/27] KVM: x86: Account for runtime CPUID features " Binbin Wu
2026-04-23  2:41   ` Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 24/27] KVM: x86: Skip paranoid CPUID check for KVM PV leafs when base is relocated Binbin Wu
2026-04-23  3:02   ` Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 25/27] KVM: x86: Add new KVM_CAP_X86_CPUID_PARANOID Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 26/27] KVM: x86: Add a helper to query the allowed CPUID mask Binbin Wu
2026-04-17  7:36 ` [RFC PATCH 27/27] KVM: TDX: Replace hardcoded CPUID filtering with the allowed mask Binbin Wu
2026-04-23  3:25   ` Binbin Wu
2026-05-15  8:08 ` Xiaoyao Li [this message]
2026-05-15 15:45   ` [RFC PATCH 00/27] KVM: x86: Add a paranoid mode for CPUID verification Edgecombe, Rick P

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=50ccbe42-4b74-4c2a-b530-a367f7285de6@intel.com \
    --to=xiaoyao.li@intel.com \
    --cc=binbin.wu@linux.intel.com \
    --cc=chao.gao@intel.com \
    --cc=kai.huang@intel.com \
    --cc=kvm@vger.kernel.org \
    --cc=pbonzini@redhat.com \
    --cc=rick.p.edgecombe@intel.com \
    --cc=seanjc@google.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox