From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:47061) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1elcQb-0003E6-Vc for qemu-devel@nongnu.org; Tue, 13 Feb 2018 10:23:31 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1elcQa-0008SA-V9 for qemu-devel@nongnu.org; Tue, 13 Feb 2018 10:23:29 -0500 Date: Tue, 13 Feb 2018 16:23:21 +0100 From: Kevin Wolf Message-ID: <20180213152321.GK5083@localhost.localdomain> References: <20180107122336.29333-1-richiejp@f-m.fm> <5cf19623-72ac-fb8b-2054-a60d42419ec6@redhat.com> <20180111130427.GG8326@redhat.com> <20180213105024.GC5083@localhost.localdomain> <20180213143001.GA2354@rkaganb.sw.ru> <20180213144310.GH5083@localhost.localdomain> <20180213145844.GP573@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline In-Reply-To: <20180213145844.GP573@redhat.com> Content-Transfer-Encoding: quoted-printable Subject: Re: [Qemu-devel] [Qemu-block] [PATCH 1/2] Add save-snapshot, load-snapshot and delete-snapshot to QAPI List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Daniel =?iso-8859-1?Q?P=2E_Berrang=E9?= Cc: Roman Kagan , Richard Palethorpe , Qemu-block , quintela@redhat.com, qemu-devel@nongnu.org, armbru@redhat.com, Max Reitz , rpalethorpe@suse.com, dgilbert@redhat.com, Denis Plotnikov , Denis Lunev Am 13.02.2018 um 15:58 hat Daniel P. Berrang=E9 geschrieben: > On Tue, Feb 13, 2018 at 03:43:10PM +0100, Kevin Wolf wrote: > > Am 13.02.2018 um 15:30 hat Roman Kagan geschrieben: > > > On Tue, Feb 13, 2018 at 11:50:24AM +0100, Kevin Wolf wrote: > > > > Am 11.01.2018 um 14:04 hat Daniel P. Berrange geschrieben: > > > > > Then you could just use the regular migrate QMP commands for lo= ading > > > > > and saving snapshots. > > > >=20 > > > > Yes, you could. I think for a proper implementation you would wan= t to do > > > > better, though. Live migration provides just a stream, but that's= not > > > > really well suited for snapshots. When a RAM page is dirtied, you= just > > > > want to overwrite the old version of it in a snapshot [...] > > >=20 > > > This means the point in time where the guest state is snapshotted i= s not > > > when the command is issued, but any unpredictable amount of time la= ter. > > >=20 > > > I'm not sure this is what a user expects. > >=20 > > I don't think it's necessarily a big problem as long as you set the > > expectations right, but good point anyway. > >=20 > > > A better approach for the save part appears to be to stop the vcpus= , > > > dump the device state, resume the vcpus, and save the memory conten= ts > > > in the background, prioritizing the old copies of the pages that > > > change. > >=20 > > So basically you would let the guest fault whenever it writes to a pa= ge > > that is not saved yet, and then save it first before you make the pag= e > > writable again? Essentially blockdev-backup, except for RAM. >=20 > The page fault servicing will be delayed by however long it takes to > write the page to underling storage, which could be considerable with > non-SSD. So guest performance could be significantly impacted on slow > storage with high dirtying rate. On the flip side it gurantees a live > snapshot would complete in finite time which is good. You can just use a bounce buffer for writing out the old page. Then the VM is only stopped for the duration of a malloc() + memcpy(). Kevin