From: Nikolay Borisov <nik.borisov@suse.com>
To: "Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>,
Dave Hansen <dave.hansen@linux.intel.com>,
Thomas Gleixner <tglx@linutronix.de>,
Ingo Molnar <mingo@redhat.com>, Borislav Petkov <bp@alien8.de>,
x86@kernel.org, "H. Peter Anvin" <hpa@zytor.com>
Cc: linux-coco@lists.linux.dev, linux-kernel@vger.kernel.org,
stable@vger.kernel.org
Subject: Re: [PATCHv5 3/4] x86/tdx: Dynamically disable SEPT violations from causing #VEs
Date: Wed, 3 Jul 2024 14:39:09 +0300 [thread overview]
Message-ID: <05d0b24a-2e21-48c0-85b7-a9dd935ac449@suse.com> (raw)
In-Reply-To: <20240624114149.377492-4-kirill.shutemov@linux.intel.com>
On 24.06.24 г. 14:41 ч., Kirill A. Shutemov wrote:
> Memory access #VE's are hard for Linux to handle in contexts like the
> entry code or NMIs. But other OSes need them for functionality.
> There's a static (pre-guest-boot) way for a VMM to choose one or the
> other. But VMMs don't always know which OS they are booting, so they
> choose to deliver those #VE's so the "other" OSes will work. That,
> unfortunately has left us in the lurch and exposed to these
> hard-to-handle #VEs.
>
> The TDX module has introduced a new feature. Even if the static
> configuration is "send nasty #VE's", the kernel can dynamically request
> that they be disabled.
>
> Check if the feature is available and disable SEPT #VE if possible.
>
> If the TD allowed to disable/enable SEPT #VEs, the ATTR_SEPT_VE_DISABLE
> attribute is no longer reliable. It reflects the initial state of the
> control for the TD, but it will not be updated if someone (e.g. bootloader)
> changes it before the kernel starts. Kernel must check TDCS_TD_CTLS bit to
> determine if SEPT #VEs are enabled or disabled.
>
> Signed-off-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
> Fixes: 373e715e31bf ("x86/tdx: Panic on bad configs that #VE on "private" memory access")
> Cc: stable@vger.kernel.org
> ---
> arch/x86/coco/tdx/tdx.c | 76 ++++++++++++++++++++++++-------
> arch/x86/include/asm/shared/tdx.h | 10 +++-
> 2 files changed, 69 insertions(+), 17 deletions(-)
>
> diff --git a/arch/x86/coco/tdx/tdx.c b/arch/x86/coco/tdx/tdx.c
> index 08ce488b54d0..ba3103877b21 100644
> --- a/arch/x86/coco/tdx/tdx.c
> +++ b/arch/x86/coco/tdx/tdx.c
> @@ -78,7 +78,7 @@ static inline void tdcall(u64 fn, struct tdx_module_args *args)
> }
>
> /* Read TD-scoped metadata */
> -static inline u64 __maybe_unused tdg_vm_rd(u64 field, u64 *value)
> +static inline u64 tdg_vm_rd(u64 field, u64 *value)
> {
> struct tdx_module_args args = {
> .rdx = field,
> @@ -193,6 +193,62 @@ static void __noreturn tdx_panic(const char *msg)
> __tdx_hypercall(&args);
> }
>
> +/*
> + * The kernel cannot handle #VEs when accessing normal kernel memory. Ensure
> + * that no #VE will be delivered for accesses to TD-private memory.
> + *
> + * TDX 1.0 does not allow the guest to disable SEPT #VE on its own. The VMM
> + * controls if the guest will receive such #VE with TD attribute
> + * ATTR_SEPT_VE_DISABLE.
> + *
> + * Newer TDX module allows the guest to control if it wants to receive SEPT
> + * violation #VEs.
> + *
> + * Check if the feature is available and disable SEPT #VE if possible.
> + *
> + * If the TD allowed to disable/enable SEPT #VEs, the ATTR_SEPT_VE_DISABLE
> + * attribute is no longer reliable. It reflects the initial state of the
> + * control for the TD, but it will not be updated if someone (e.g. bootloader)
> + * changes it before the kernel starts. Kernel must check TDCS_TD_CTLS bit to
> + * determine if SEPT #VEs are enabled or disabled.
> + */
> +static void disable_sept_ve(u64 td_attr)
> +{
> + const char *msg = "TD misconfiguration: SEPT #VE has to be disabled";
> + bool debug = td_attr & ATTR_DEBUG;
> + u64 config, controls;
> +
> + /* Is this TD allowed to disable SEPT #VE */
> + tdg_vm_rd(TDCS_CONFIG_FLAGS, &config);
> + if (!(config & TDCS_CONFIG_FLEXIBLE_PENDING_VE)) {
> + /* No SEPT #VE controls for the guest: check the attribute */
> + if (td_attr & ATTR_SEPT_VE_DISABLE)
> + return;
> +
> + /* Relax SEPT_VE_DISABLE check for debug TD for backtraces */
> + if (debug)
> + pr_warn("%s\n", msg);
> + else
> + tdx_panic(msg);
> + return;
> + }
> +
> + /* Check if SEPT #VE has been disabled before us */
> + tdg_vm_rd(TDCS_TD_CTLS, &controls);
> + if (controls & TD_CTLS_PENDING_VE_DISABLE)
> + return;
> +
> + /* Keep #VEs enabled for splats in debugging environments */
> + if (debug)
> + return;
> +
> + /* Disable SEPT #VEs */
> + tdg_vm_wr(TDCS_TD_CTLS, TD_CTLS_PENDING_VE_DISABLE,
> + TD_CTLS_PENDING_VE_DISABLE);
> +
> + return;
> +}
> +
> static void tdx_setup(u64 *cc_mask)
> {
> struct tdx_module_args args = {};
> @@ -218,24 +274,12 @@ static void tdx_setup(u64 *cc_mask)
> gpa_width = args.rcx & GENMASK(5, 0);
> *cc_mask = BIT_ULL(gpa_width - 1);
>
> + td_attr = args.rdx;
> +
> /* Kernel does not use NOTIFY_ENABLES and does not need random #VEs */
> tdg_vm_wr(TDCS_NOTIFY_ENABLES, 0, -1ULL);
>
> - /*
> - * The kernel can not handle #VE's when accessing normal kernel
> - * memory. Ensure that no #VE will be delivered for accesses to
> - * TD-private memory. Only VMM-shared memory (MMIO) will #VE.
> - */
> - td_attr = args.rdx;
> - if (!(td_attr & ATTR_SEPT_VE_DISABLE)) {
> - const char *msg = "TD misconfiguration: SEPT_VE_DISABLE attribute must be set.";
> -
> - /* Relax SEPT_VE_DISABLE check for debug TD. */
> - if (td_attr & ATTR_DEBUG)
> - pr_warn("%s\n", msg);
> - else
> - tdx_panic(msg);
> - }
> + disable_sept_ve(td_attr);
> }
>
> /*
> diff --git a/arch/x86/include/asm/shared/tdx.h b/arch/x86/include/asm/shared/tdx.h
> index 7e12cfa28bec..fecb2a6e864b 100644
> --- a/arch/x86/include/asm/shared/tdx.h
> +++ b/arch/x86/include/asm/shared/tdx.h
> @@ -19,9 +19,17 @@
> #define TDG_VM_RD 7
> #define TDG_VM_WR 8
>
> -/* TDCS fields. To be used by TDG.VM.WR and TDG.VM.RD module calls */
> +/* TDX TD-Scope Metadata. To be used by TDG.VM.WR and TDG.VM.RD */
> +#define TDCS_CONFIG_FLAGS 0x1110000300000016
0x9110000300000016
> +#define TDCS_TD_CTLS 0x1110000300000017
0x9110000300000017
> #define TDCS_NOTIFY_ENABLES 0x9100000000000010
>
> +/* TDCS_CONFIG_FLAGS bits */
> +#define TDCS_CONFIG_FLEXIBLE_PENDING_VE BIT_ULL(1)
> +
> +/* TDCS_TD_CTLS bits */
> +#define TD_CTLS_PENDING_VE_DISABLE BIT_ULL(0)
> +
> /* TDX hypercall Leaf IDs */
> #define TDVMCALL_MAP_GPA 0x10001
> #define TDVMCALL_GET_QUOTE 0x10002
next prev parent reply other threads:[~2024-07-03 11:39 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-06-24 11:41 [PATCHv5 0/4] x86/tdx: Adjust TD settings on boot Kirill A. Shutemov
2024-06-24 11:41 ` [PATCHv5 1/4] x86/tdx: Introduce wrappers to read and write TD metadata Kirill A. Shutemov
2024-06-24 11:41 ` [PATCHv5 2/4] x86/tdx: Rename tdx_parse_tdinfo() to tdx_setup() Kirill A. Shutemov
2024-06-24 11:41 ` [PATCHv5 3/4] x86/tdx: Dynamically disable SEPT violations from causing #VEs Kirill A. Shutemov
2024-07-03 11:39 ` Nikolay Borisov [this message]
2024-07-03 13:04 ` Kirill A. Shutemov
2024-07-03 14:49 ` Dave Hansen
2024-07-04 13:03 ` Kirill A. Shutemov
2024-07-03 12:22 ` Nikolay Borisov
2024-07-03 13:05 ` Kirill A. Shutemov
2024-06-24 11:41 ` [PATCHv5 4/4] x86/tdx: Enable CPU topology enumeration Kirill A. Shutemov
2024-07-02 16:17 ` Xiaoyao Li
2024-07-03 12:33 ` Kirill A. Shutemov
2024-07-04 10:17 ` [PATCHv5 0/4] x86/tdx: Adjust TD settings on boot Nikolay Borisov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=05d0b24a-2e21-48c0-85b7-a9dd935ac449@suse.com \
--to=nik.borisov@suse.com \
--cc=bp@alien8.de \
--cc=dave.hansen@linux.intel.com \
--cc=hpa@zytor.com \
--cc=kirill.shutemov@linux.intel.com \
--cc=linux-coco@lists.linux.dev \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@redhat.com \
--cc=stable@vger.kernel.org \
--cc=tglx@linutronix.de \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).