From: Stefan Hajnoczi <stefanha@redhat.com>
To: Kevin Wolf <kwolf@redhat.com>
Cc: Max Reitz <mreitz@redhat.com>, He Junyan <junyan.he@intel.com>,
Pankaj Gupta <pagupta@redhat.com>,
qemu-devel@nongnu.org, qemu block <qemu-block@nongnu.org>
Subject: Re: [Qemu-devel] [Qemu-block] Some question about savem/qcow2 incremental snapshot
Date: Thu, 31 May 2018 11:48:38 +0100 [thread overview]
Message-ID: <20180531104838.GC27838@stefanha-x1.localdomain> (raw)
In-Reply-To: <20180530160719.GD4311@localhost.localdomain>
[-- Attachment #1: Type: text/plain, Size: 3933 bytes --]
On Wed, May 30, 2018 at 06:07:19PM +0200, Kevin Wolf wrote:
> Am 30.05.2018 um 16:44 hat Stefan Hajnoczi geschrieben:
> > On Mon, May 14, 2018 at 02:48:47PM +0100, Stefan Hajnoczi wrote:
> > > On Fri, May 11, 2018 at 07:25:31PM +0200, Kevin Wolf wrote:
> > > > Am 10.05.2018 um 10:26 hat Stefan Hajnoczi geschrieben:
> > > > > On Wed, May 09, 2018 at 07:54:31PM +0200, Max Reitz wrote:
> > > > > > On 2018-05-09 12:16, Stefan Hajnoczi wrote:
> > > > > > > On Tue, May 08, 2018 at 05:03:09PM +0200, Kevin Wolf wrote:
> > > > > > >> Am 08.05.2018 um 16:41 hat Eric Blake geschrieben:
> > > > > > >>> On 12/25/2017 01:33 AM, He Junyan wrote:
> > > > > > >> I think it makes sense to invest some effort into such interfaces, but
> > > > > > >> be prepared for a long journey.
> > > > > > >
> > > > > > > I like the suggestion but it needs to be followed up with a concrete
> > > > > > > design that is feasible and fair for Junyan and others to implement.
> > > > > > > Otherwise the "long journey" is really just a way of rejecting this
> > > > > > > feature.
> >
> > The discussion on NVDIMM via the block layer has runs its course. It
> > would be a big project and I don't think it's fair to ask Junyan to
> > implement it.
> >
> > My understanding is this patch series doesn't modify the qcow2 on-disk
> > file format. Rather, it just uses existing qcow2 mechanisms and extends
> > live migration to identify the NVDIMM state state region to share the
> > clusters.
> >
> > Since this feature does not involve qcow2 format changes and is just an
> > optimization (dirty blocks still need to be allocated), it can be
> > removed from QEMU in the future if a better alternative becomes
> > available.
> >
> > Junyan: Can you rebase the series and send a new revision?
> >
> > Kevin and Max: Does this sound alright?
>
> Do patches exist? I've never seen any, so I thought this was just the
> early design stage.
Sorry for the confusion, the earlier patch series was here:
https://lists.nongnu.org/archive/html/qemu-devel/2018-03/msg04530.html
> I suspect that while it wouldn't change the qcow2 on-disk format in a
> way that the qcow2 spec would have to be change, it does need to change
> the VMState format that is stored as a blob within the qcow2 file.
> At least, you need to store which other snapshot it is based upon so
> that you can actually resume a VM from the incremental state.
>
> Once you modify the VMState format/the migration stream, removing it
> from QEMU again later means that you can't load your old snapshots any
> more. Doing that, even with the two-release deprecation period, would be
> quite nasty.
>
> But you're right, depending on how the feature is implemented, it might
> not be a thing that affects qcow2 much, but one that the migration
> maintainers need to have a look at. I kind of suspect that it would
> actually touch both parts to a degree that it would need approval from
> both sides.
VMState wire format changes are minimal. The only issue is that the
previous snapshot's nvdimm vmstate can start at an arbitrary offset in
the qcow2 cluster. We can find a solution to the misalignment problem
(I think Junyan's patch series adds padding).
The approach references existing clusters in the previous snapshot's
vmstate area and only allocates new clusters for dirty NVDIMM regions.
In the non-qcow2 case we fall back to writing the entire NVDIMM
contents.
So instead of:
write(qcow2_bs, all_vmstate_data); /* duplicates nvdimm contents :( */
do:
write(bs, vmstate_data_upto_nvdimm);
if (is_qcow2(bs)) {
snapshot_clone_vmstate_range(bs, previous_snapshot,
offset_to_nvdimm_vmstate);
overwrite_nvdimm_dirty_blocks(bs, nvdimm);
} else {
write(bs, nvdimm_vmstate_data);
}
write(bs, vmstate_data_after_nvdimm);
Stefan
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 455 bytes --]
next prev parent reply other threads:[~2018-05-31 10:48 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-12-25 7:33 [Qemu-devel] Some question about savem/qcow2 incremental snapshot He Junyan
2018-05-08 14:41 ` Eric Blake
2018-05-08 15:03 ` [Qemu-devel] [Qemu-block] " Kevin Wolf
2018-05-09 10:16 ` Stefan Hajnoczi
2018-05-09 17:54 ` Max Reitz
2018-05-10 8:26 ` Stefan Hajnoczi
2018-05-11 17:25 ` Kevin Wolf
2018-05-14 13:48 ` Stefan Hajnoczi
2018-05-28 7:01 ` He, Junyan
2018-05-30 14:44 ` Stefan Hajnoczi
2018-05-30 16:07 ` Kevin Wolf
2018-05-31 10:48 ` Stefan Hajnoczi [this message]
2018-06-08 5:02 ` He, Junyan
2018-06-08 7:59 ` Pankaj Gupta
2018-06-08 15:58 ` Junyan He
2018-06-08 16:38 ` Pankaj Gupta
2018-06-08 16:49 ` Pankaj Gupta
2018-06-08 13:29 ` Stefan Hajnoczi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20180531104838.GC27838@stefanha-x1.localdomain \
--to=stefanha@redhat.com \
--cc=junyan.he@intel.com \
--cc=kwolf@redhat.com \
--cc=mreitz@redhat.com \
--cc=pagupta@redhat.com \
--cc=qemu-block@nongnu.org \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).