From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 07B21C433FE for ; Thu, 25 Nov 2021 13:48:08 +0000 (UTC) Received: from localhost ([::1]:52700 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1mqF6d-0007a5-LX for qemu-devel@archiver.kernel.org; Thu, 25 Nov 2021 08:48:07 -0500 Received: from eggs.gnu.org ([209.51.188.92]:43118) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mqF1f-0002Df-D4 for qemu-devel@nongnu.org; Thu, 25 Nov 2021 08:42:59 -0500 Received: from us-smtp-delivery-124.mimecast.com ([170.10.129.124]:59466) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1mqF1a-00043j-61 for qemu-devel@nongnu.org; Thu, 25 Nov 2021 08:42:56 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1637847765; h=from:from:reply-to:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=Wq+RfoJ1m0DNik4Bdgd23Ykpvf4lLkIJzokFlZBFVZw=; b=inOIXD9+fZwYgHxPuBbQdpgnypXZhlbZ3jvD1w02JTWu6eC7qaNPPXHyWiCBL7QIIl3Uf5 GqJ7f30OSaI3feoXKxxa3K+jxDtR/Ck78ID4NxHYyQbV7mm/3vB5A0cT3qQH48NxciXWOo KEq3gCG52+cPFh8qA/rMA0BpSoWTJhM= Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-483-BCA-hA5KOXGB6Wsb0d5a7Q-1; Thu, 25 Nov 2021 08:42:42 -0500 X-MC-Unique: BCA-hA5KOXGB6Wsb0d5a7Q-1 Received: from smtp.corp.redhat.com (int-mx04.intmail.prod.int.phx2.redhat.com [10.5.11.14]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx01.redhat.com (Postfix) with ESMTPS id 7CA6A64083; Thu, 25 Nov 2021 13:42:41 +0000 (UTC) Received: from redhat.com (unknown [10.33.36.141]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 769615D9CA; Thu, 25 Nov 2021 13:42:29 +0000 (UTC) Date: Thu, 25 Nov 2021 13:42:26 +0000 From: Daniel =?utf-8?B?UC4gQmVycmFuZ8Op?= To: Dov Murik Subject: Re: SEV guest attestation Message-ID: References: <20211125071428.dpnavgxd3w4bzktr@mhamilton> <53dc2a76-a6bb-dc8d-1a47-faf7577e59e8@linux.ibm.com> MIME-Version: 1.0 In-Reply-To: <53dc2a76-a6bb-dc8d-1a47-faf7577e59e8@linux.ibm.com> User-Agent: Mutt/2.1.3 (2021-09-10) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.14 Authentication-Results: relay.mimecast.com; auth=pass smtp.auth=CUSA124A263 smtp.mailfrom=berrange@redhat.com X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=170.10.129.124; envelope-from=berrange@redhat.com; helo=us-smtp-delivery-124.mimecast.com X-Spam_score_int: -34 X-Spam_score: -3.5 X-Spam_bar: --- X-Spam_report: (-3.5 / 5.0 requ) BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.7, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H4=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: Daniel =?utf-8?B?UC4gQmVycmFuZ8Op?= Cc: Sergio Lopez , afrosi@redhat.com, James Bottomley , "Dr. David Alan Gilbert" , qemu-devel@nongnu.org, Hubertus Franke , Tyler Fanelli , Tobin Feldman-Fitzthum , Jim Cadden , dinechin@redhat.com, John Ferlan Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: "Qemu-devel" On Thu, Nov 25, 2021 at 02:44:51PM +0200, Dov Murik wrote: > [+cc jejb, tobin, jim, hubertus] > > > On 25/11/2021 9:14, Sergio Lopez wrote: > > On Wed, Nov 24, 2021 at 06:29:07PM +0000, Dr. David Alan Gilbert wrote: > >> * Daniel P. Berrangé (berrange@redhat.com) wrote: > >>> On Wed, Nov 24, 2021 at 11:34:16AM -0500, Tyler Fanelli wrote: > >>>> Hi, > >>>> > >>>> We recently discussed a way for remote SEV guest attestation through QEMU. > >>>> My initial approach was to get data needed for attestation through different > >>>> QMP commands (all of which are already available, so no changes required > >>>> there), deriving hashes and certificate data; and collecting all of this > >>>> into a new QMP struct (SevLaunchStart, which would include the VM's policy, > >>>> secret, and GPA) which would need to be upstreamed into QEMU. Once this is > >>>> provided, QEMU would then need to have support for attestation before a VM > >>>> is started. Upon speaking to Dave about this proposal, he mentioned that > >>>> this may not be the best approach, as some situations would render the > >>>> attestation unavailable, such as the instance where a VM is running in a > >>>> cloud, and a guest owner would like to perform attestation via QMP (a likely > >>>> scenario), yet a cloud provider cannot simply let anyone pass arbitrary QMP > >>>> commands, as this could be an issue. > >>> > >>> As a general point, QMP is a low level QEMU implementation detail, > >>> which is generally expected to be consumed exclusively on the host > >>> by a privileged mgmt layer, which will in turn expose its own higher > >>> level APIs to users or other apps. I would not expect to see QMP > >>> exposed to anything outside of the privileged host layer. > >>> > >>> We also use the QAPI protocol for QEMU guest agent commmunication, > >>> however, that is a distinct service from QMP on the host. It shares > >>> most infra with QMP but has a completely diffent command set. On the > >>> host it is not consumed inside QEMU, but instead consumed by a > >>> mgmt app like libvirt. > >>> > >>>> So I ask, does anyone involved in QEMU's SEV implementation have any input > >>>> on a quality way to perform guest attestation? If so, I'd be interested. > >>> > >>> I think what's missing is some clearer illustrations of how this > >>> feature is expected to be consumed in some real world application > >>> and the use cases we're trying to solve. > >>> > >>> I'd like to understand how it should fit in with common libvirt > >>> applications across the different virtualization management > >>> scenarios - eg virsh (command line), virt-manger (local desktop > >>> GUI), cockpit (single host web mgmt), OpenStack (cloud mgmt), etc. > >>> And of course any non-traditional virt use cases that might be > >>> relevant such as Kata. > >> > >> That's still not that clear; I know Alice and Sergio have some ideas > >> (cc'd). > >> There's also some standardisation efforts (e.g. https://www.potaroo.net/ietf/html/ids-wg-rats.html > >> and https://www.ietf.org/archive/id/draft-ietf-rats-architecture-00.html > >> ) - that I can't claim to fully understand. > >> However, there are some themes that are emerging: > >> > >> a) One use is to only allow a VM to access some private data once we > >> prove it's the VM we expect running in a secure/confidential system > >> b) (a) normally involves requesting some proof from the VM and then > >> providing it some confidential data/a key if it's OK > >> c) RATs splits the problem up: > >> https://www.ietf.org/archive/id/draft-ietf-rats-architecture-00.html#name-architectural-overview > >> I don't fully understand the split yet, but in principal there are > >> at least a few different things: > >> > >> d) The comms layer > >> e) Something that validates the attestation message (i.e. the > >> signatures are valid, the hashes all add up etc) > >> f) Something that knows what hashes to expect (i.e. oh that's a RHEL > >> 8.4 kernel, or that's a valid kernel command line) > >> g) Something that holds some secrets that can be handed out if e & f > >> are happy. > >> > >> There have also been proposals (e.g. Intel HTTPA) for an attestable > >> connection after a VM is running; that's probably quite different from > >> (g) but still involves (e) & (f). > >> > >> In the simpler setups d,e,f,g probably live in one place; but it's not > >> clear where they live - for example one scenario says that your cloud > >> management layer holds some of them, another says you don't trust your > >> cloud management layer and you keep them separate. > >> > >> So I think all we're actually interested in at the moment, is (d) and > >> (e) and the way for (g) to get the secret back to the guest. > >> > >> Unfortunately the comms and the contents of them varies heavily with > >> technology; in some you're talking to the qemu/hypervisor (SEV/SEV-ES) > >> while in some you're talking to the guest after boot (SEV-SNP/TDX maybe > >> SEV-ES in some cases). > > SEV-ES has pre-launch measurement and secret injection, just like SEV > (except that the measurement includes the initial states of all vcpus, > that is, their VMSAs. BTW that means that in order to calculate the > measurement the Attestation Server must know exactly how many vcpus are > in the VM). Does that work with CPU hotplug ? ie cold boot with -smp 4,maxcpus=8 and some time later try to enable the extra 4 cpus at runtime ? > >> So my expectation at the moment is libvirt needs to provide a transport > >> layer for the comms, to enable an external validator to retrieve the > >> measurements from the guest/hypervisor and provide data back if > >> necessary. Once this shakes out a bit, we might want libvirt to be > >> able to invoke the validator; however I expect (f) and (g) to be much > >> more complex things that don't feel like they belong in libvirt. > > > > We experimented with the attestation flow quite a bit while working on > > SEV-ES support for libkrun-tee. One important aspect we noticed quite > > early, is that there's more data that's needed to be exchange of top > > of the attestation itself. > > > > For instance, even before you start the VM, the management layer in > > charge of coordinating the confidential VM launch needs to obtain the > > Virtualization TEE capabilities of the Host (SEV-ES vs. SEV-SNP > > vs. TDX) and the platform version, to know which features are > > available and whether that host is a candidate for running the VM at > > all. > > > > With that information, the mgmt layer can build a guest policy (this > > is SEV's terminology, but I guess we'll have something similar in > > TDX) and feed it to component launching the VMM (libvirt, in this > > case). > > > > For SEV-SNP, this is pretty much the end of the story, because the > > attestation exchange is driven by an agent inside the guest. Well, > > there's also the need to have in the VM a well-known vNIC bridged to a > > network that's routed to the Attestation Server, that everyone seems > > to consider a given, but to me, from a CSP perspective, looks like > > quite a headache. In fact, I'd go as far as to suggest this > > communication should happen through an alternative channel, such as > > vsock, having a proxy on the Host, but I guess that depends on the CSP > > infrastructure. > > If we have an alternative channel (vsock?) and a proxy on the host, > maybe we can share parts of the solution between SEV and SNP. > > > > For SEV/SEV-ES, as the attestation happens at the VMM level, there's > > still the need to have some interactions with it. As Tyler pointed > > out, we basically need to retrieve the measurement and, if valid, > > inject the secret. If the measurement isn't valid, the VM must be shut > > down immediately. > > > > In libkrun-tee, this operation is driven by the VMM in libkrun, which > > contacts the Attestation Server with the measurement and receives the > > secret in exchange. I guess for QEMU/libvirt we expect this to be > > driven by the upper management layer through a delegated component in > > the Host, such as NOVA. In this case, NOVA would need to: > > > > - Based on the upper management layer info and the Host properties, > > generate a guest policy and use it while generating the compute > > instance XML. > > > > - Ask libvirt to launch the VM. > > Launch the VM with -S (suspended; so it doesn't actually begin running > guest instructions). > > > > > > - Wait for the VM to be in SEV_STATE_LAUNCH_SECRET state *. > > > > - Retrieve the measurement *. > > Note that libvirt holds the QMP socket to QEMU. So whoever fetches the > measurement needs either (a) to ask libvirt to it; or (b) to connect to > another QMP listening socket for getting the measurement and injecting > the secret. Libvirt would not be particularly happy with allowing (b) because it enables the 3rd parties to change the VM state behind libvirt's back in ways that can ultimately confuse its understanding of the state of the VM. If there's some task that needs interaction with a QEMU managed by libvirt, we need to expose suitable APIs in libvirt (if they don't already exist). Regards, Daniel -- |: https://berrange.com -o- https://www.flickr.com/photos/dberrange :| |: https://libvirt.org -o- https://fstop138.berrange.com :| |: https://entangle-photo.org -o- https://www.instagram.com/dberrange :|