Kernel KVM virtualization development
 help / color / mirror / Atom feed
From: Sean Christopherson <seanjc@google.com>
To: Ackerley Tng <ackerleytng@google.com>
Cc: Michael Roth <michael.roth@amd.com>,
	sashiko-reviews@lists.linux.dev,
	 Ackerley Tng via B4 Relay
	<devnull+ackerleytng.google.com@kernel.org>,
	kvm@vger.kernel.org
Subject: Re: [PATCH v7 21/42] KVM: TDX: Make source page optional for KVM_TDX_INIT_MEM_REGION
Date: Tue, 9 Jun 2026 17:11:28 -0700	[thread overview]
Message-ID: <aiirsJFic_naddv7@google.com> (raw)
In-Reply-To: <CAEvNRgFkfqirL6hZDk5eugeaf4u7J_=a0xj6RdE=QnG5T6N29A@mail.gmail.com>

On Fri, Jun 05, 2026, Ackerley Tng wrote:
> Michael Roth <michael.roth@amd.com> writes:
> 
> >
> > [...snip...]
> >
> >> > > When KVM_TDX_INIT_MEM_REGION is called with a NULL source_addr,
> >> > > __kvm_gmem_populate() attempts an in-place conversion. If the target
> >> > > guest_memfd folio is unpopulated, __kvm_gmem_get_pfn() allocates a new folio
> >> > > using GFP_HIGHUSER, which lacks __GFP_ZERO.
> >>
> >> For in-place conversion, if src is NULL, it's generally because src *is*
> >> the target and it makes no sense to copy the page to itself because
> >> userspace already initialized it, which means userspace should have already
> >> trigged the uptodate flag to be set via kvm_gmem_fault_user_mapping().
> >>
> >> So maybe to address the malicious case that Sashiko seems to be worried
> >> about (where VMM purposely doesn't touch/init the memory to try to leak data
> >> into the guest), we just need to error out if: !src && !folio_uptodate(folio)?
> >
> > Of course immediately after posting this I think of exceptional cases, like
> > SEV-SNP can set up pre-zero'd private guest pages via this path, where
> > userspace wouldn't necessarily have done any touching/init of the page
> > in advance.
> >
> 
> Are you referring to KVM_SEV_SNP_PAGE_TYPE_ZERO?
> 
> > BUT.... in that case it would be okay, because the kernel knows
> > everything would get zero'd.
> >
> > So maybe the post-populate callbacks need to be the ones that would need
> > to implement these checks, or some API to that gmem can do it on their
> > behalf...
> >
> > So maybe your proposed patch is the more straightforward fix for now.
> >
> 
> Is KVM_SEV_SNP_PAGE_TYPE_ZERO expected to be used for a large number
> pages? If not, I think we can defer this to a future optimization, like
> in snp_launch_update(), if KVM_SEV_SNP_PAGE_TYPE_ZERO, pass a flag
> through to __kvm_gmem_get_pfn() to skip zeroing?
> 
> For all other page types, __kvm_gmem_get_pfn() MUST zero to avoid the
> issue Sashiko pointed out, right?

Or we do as Mike suggested, and outright reject the populate() call.  But I think
for ABI purposes, zeroing memory is the right approach, otherwise we'll end up
with a discrepancy between the userfault path and populate().

As I mentioned in the cover letter, initial guest image will typically be a tiny
subset of guest memory, so I don't have any concerns with unnecessarily zeroing
a few pages (memory that is ultimately overwritten by userspace).

> Currently, kvm_gmem_fault_user_mapping() will zero, and then userspace
> presumably writes the entire page. Can't avoid zeroing to avoid leaking
> uninitialized memory.
> 
> To avoid that zeroing, I guess there's the future guest_memfd write()
> syscall too, userspace can write() the entire page - write in the kernel
> can avoid zeroing since it knows which parts of the page was written and
> set uptodate. Then, kvm_gmem_fault_user_mapping() won't zero since the
> page is uptodate, and populate will not zero since it was also uptodate.



  reply	other threads:[~2026-06-10  0:11 UTC|newest]

Thread overview: 88+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-05-23  0:17 [PATCH v7 00/42] guest_memfd: In-place conversion support Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 01/42] KVM: guest_memfd: Introduce per-gmem attributes, use to guard user mappings Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 02/42] KVM: Rename KVM_GENERIC_MEMORY_ATTRIBUTES to KVM_VM_MEMORY_ATTRIBUTES Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 03/42] KVM: Enumerate support for PRIVATE memory iff kvm_arch_has_private_mem is defined Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 04/42] KVM: Stub in ability to disable per-VM memory attribute tracking Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 05/42] KVM: guest_memfd: Wire up kvm_get_memory_attributes() to per-gmem attributes Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 06/42] KVM: guest_memfd: Update kvm_gmem_populate() to use gmem attributes Ackerley Tng via B4 Relay
2026-05-23  0:59   ` sashiko-bot
2026-05-23  0:17 ` [PATCH v7 07/42] KVM: guest_memfd: Only prepare folios for private pages Ackerley Tng via B4 Relay
2026-05-23  0:52   ` sashiko-bot
2026-05-27 21:22     ` Ackerley Tng
2026-06-02 20:41     ` Ackerley Tng
2026-06-02  8:55   ` Suzuki K Poulose
2026-06-02  9:10     ` Suzuki K Poulose
2026-06-02 22:41       ` Ackerley Tng
2026-06-03  8:58         ` Suzuki K Poulose
2026-06-03 13:51           ` Michael Roth
2026-06-02 20:46     ` Ackerley Tng
2026-06-03 13:54       ` Michael Roth
2026-05-23  0:17 ` [PATCH v7 08/42] KVM: Move kvm_supported_mem_attributes() to kvm_host.h Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 09/42] KVM: guest_memfd: Add base support for KVM_SET_MEMORY_ATTRIBUTES2 Ackerley Tng via B4 Relay
2026-05-23  1:01   ` sashiko-bot
2026-05-27 21:27     ` Ackerley Tng
2026-06-01 23:14   ` Michael Roth
2026-05-23  0:17 ` [PATCH v7 10/42] KVM: guest_memfd: Ensure pages are not in use before conversion Ackerley Tng via B4 Relay
2026-05-23  0:55   ` sashiko-bot
2026-06-08  8:55   ` Vlastimil Babka (SUSE)
2026-05-23  0:17 ` [PATCH v7 11/42] KVM: guest_memfd: Call arch invalidate hooks on conversion Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 12/42] KVM: guest_memfd: Return early if range already has requested attributes Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 13/42] KVM: guest_memfd: Advertise KVM_SET_MEMORY_ATTRIBUTES2 ioctl Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 14/42] KVM: guest_memfd: Handle lru_add fbatch refcounts during conversion safety check Ackerley Tng via B4 Relay
2026-06-08  8:45   ` Vlastimil Babka (SUSE)
2026-05-23  0:17 ` [PATCH v7 15/42] KVM: guest_memfd: Use actual size for invalidation in kvm_gmem_release() Ackerley Tng via B4 Relay
2026-05-23  0:17 ` [PATCH v7 16/42] KVM: guest_memfd: Determine invalidation filter from memory attributes Ackerley Tng via B4 Relay
2026-05-23  1:06   ` sashiko-bot
     [not found]     ` <CAEvNRgH21BoKT1mOQzgmKHKpDi4xbwtbMuenGv5U1ZUSENrJmg@mail.gmail.com>
2026-05-29 20:44       ` Sean Christopherson
2026-05-23  0:17 ` [PATCH v7 17/42] KVM: Move KVM_VM_MEMORY_ATTRIBUTES config definition to x86 Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 18/42] KVM: Let userspace disable per-VM mem attributes, enable per-gmem attributes Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 19/42] KVM: guest_memfd: Enable INIT_SHARED on guest_memfd for x86 Coco VMs Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 20/42] KVM: SEV: Make 'uaddr' parameter optional for KVM_SEV_SNP_LAUNCH_UPDATE Ackerley Tng via B4 Relay
2026-05-23  0:55   ` sashiko-bot
2026-05-27 23:31     ` Ackerley Tng
2026-06-04 15:29   ` Suzuki K Poulose
2026-06-04 19:05     ` Ackerley Tng
2026-06-05  8:54       ` Suzuki K Poulose
2026-06-04 20:11     ` Michael Roth
2026-06-05  9:06       ` Suzuki K Poulose
2026-05-23  0:18 ` [PATCH v7 21/42] KVM: TDX: Make source page optional for KVM_TDX_INIT_MEM_REGION Ackerley Tng via B4 Relay
2026-05-23  1:07   ` sashiko-bot
2026-06-03 21:22     ` Ackerley Tng
2026-06-03 23:45       ` Michael Roth
2026-06-03 23:55         ` Michael Roth
2026-06-05 18:40           ` Ackerley Tng
2026-06-10  0:11             ` Sean Christopherson [this message]
2026-06-10  0:44               ` Michael Roth
2026-05-23  0:18 ` [PATCH v7 22/42] KVM: selftests: Create gmem fd before "regular" fd when adding memslot Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 23/42] KVM: selftests: Rename guest_memfd{,_offset} to gmem_{fd,offset} Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 24/42] KVM: selftests: Add support for mmap() on guest_memfd in core library Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 25/42] KVM: selftests: Add selftests global for guest memory attributes capability Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 26/42] KVM: selftests: Add helpers for calling ioctls on guest_memfd Ackerley Tng via B4 Relay
2026-05-23  0:42   ` sashiko-bot
2026-06-03 16:33     ` Ackerley Tng
2026-05-23  0:18 ` [PATCH v7 27/42] KVM: selftests: Test basic single-page conversion flow Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 28/42] KVM: selftests: Test conversion flow when INIT_SHARED Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 29/42] KVM: selftests: Test conversion precision in guest_memfd Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 30/42] KVM: selftests: Test conversion before allocation Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 31/42] KVM: selftests: Convert with allocated folios in different layouts Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 32/42] KVM: selftests: Test that truncation does not change shared/private status Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 33/42] KVM: selftests: Test that shared/private status is consistent across processes Ackerley Tng via B4 Relay
2026-05-23  1:11   ` sashiko-bot
2026-05-23  0:18 ` [PATCH v7 34/42] KVM: selftests: Test conversion with elevated page refcount Ackerley Tng via B4 Relay
2026-06-02 21:26   ` Askar Safin
2026-05-23  0:18 ` [PATCH v7 35/42] KVM: selftests: Reset shared memory after hole-punching Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 36/42] KVM: selftests: Provide function to look up guest_memfd details from gpa Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 37/42] KVM: selftests: Provide common function to set memory attributes Ackerley Tng via B4 Relay
2026-05-23  1:35   ` sashiko-bot
2026-06-03 19:01     ` Ackerley Tng
2026-05-23  0:18 ` [PATCH v7 38/42] KVM: selftests: Check fd/flags provided to mmap() when setting up memslot Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 39/42] KVM: selftests: Make TEST_EXPECT_SIGBUS thread-safe Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 40/42] KVM: selftests: Update private_mem_conversions_test to mmap() guest_memfd Ackerley Tng via B4 Relay
2026-05-23  0:18 ` [PATCH v7 41/42] KVM: selftests: Add script to exercise private_mem_conversions_test Ackerley Tng via B4 Relay
2026-05-23  1:15   ` sashiko-bot
2026-05-23  0:18 ` [PATCH v7 42/42] KVM: selftests: Update private memory exits test to work with per-gmem attributes Ackerley Tng via B4 Relay
2026-06-03 21:27 ` [PATCH v7 00/42] guest_memfd: In-place conversion support Ackerley Tng
2026-06-04 20:20   ` Sean Christopherson
2026-06-04 21:14     ` Ackerley Tng
2026-06-05 18:27       ` Sean Christopherson
2026-06-05 13:41 ` [POC] KVM: selftests: Verify conversion works with TDX Ackerley Tng

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aiirsJFic_naddv7@google.com \
    --to=seanjc@google.com \
    --cc=ackerleytng@google.com \
    --cc=devnull+ackerleytng.google.com@kernel.org \
    --cc=kvm@vger.kernel.org \
    --cc=michael.roth@amd.com \
    --cc=sashiko-reviews@lists.linux.dev \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox