From: Anthony Liguori
Date: Fri, 25 Oct 2013 10:12:18 +0100
Subject: Re: [Qemu-devel] [PATCH 0/17 v2] Localhost migration with side channel for ram
To: Paolo Bonzini
Cc: Andrea Arcangeli, Lei Li, quintela@redhat.com, mdroth@linux.vnet.ibm.com, mrhines@linux.vnet.ibm.com, qemu-devel, lagarcia@br.ibm.com, rcj@linux.vnet.ibm.com


On Oct 25, 2013 8:30 AM, "Paolo Bonzini" <pbonzini@redhat.com> wrote:
>
> On 25/10/2013 06:58, Lei Li wrote:
> > Right now I just have inaccurate numbers without the new vmsplice,
> > based on the result from info migrate. As the guest ram size
> > increases, the 'total time' is several times lower than with the
> > current live migration, but the 'downtime' performs badly.
>
> Of course.
> >
> > For a 1GB ram guest,
> >
> > total time: 702 milliseconds
> > downtime: 692 milliseconds
> >
> > And when the ram size of the guest increases exponentially, those
> > numbers are proportional to it.
> >
> > I will make a list of the performance with the new vmsplice later;
> > I am sure it'd be much better than this at least.
>
> Yes, please.  Is the memory usage still 2x without vmsplice?
>
> I think you have a nice proof of concept, but on the other hand this
> probably needs to be coupled with some kind of postcopy live migration,
> that is:
>
> * the source starts sending data
>
> * but the destination starts running immediately
>
> * if the machine needs a page that is missing, the destination asks the
> source to send it
>
> * as soon as it arrives, the destination can restart
>
> Using postcopy is problematic for reliability: if the destination fails,
> the virtual machine is lost because the source doesn't have the latest
> content of memory.  However, this is a much, much smaller problem for
> live QEMU upgrade where the network cannot fail.
>
> If you do this, you can achieve pretty much instantaneous live upgrade,
> well within your original 200 ms goals.

This is actually a very nice justification for postcopy; a quick sketch of what the destination side could look like follows below.
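A minimal sketch of the destination-side fault loop, assuming a
userfaultfd-style kernel interface (the sort of mechanism Andrea has
been working on); request_page_from_source() is a made-up placeholder
for the migration side channel, not code from this series:

    #include <linux/userfaultfd.h>
    #include <poll.h>
    #include <sys/ioctl.h>
    #include <unistd.h>

    /* Placeholder: fetch one page from the source over the side channel. */
    extern void *request_page_from_source(unsigned long addr);

    static void postcopy_fault_loop(int uffd, size_t page_size)
    {
        struct pollfd pfd = { .fd = uffd, .events = POLLIN };

        for (;;) {
            struct uffd_msg msg;

            if (poll(&pfd, 1, -1) < 0)
                break;
            if (read(uffd, &msg, sizeof(msg)) != sizeof(msg) ||
                msg.event != UFFD_EVENT_PAGEFAULT)
                continue;

            /* The guest touched a page we do not have yet: fetch it. */
            unsigned long addr = msg.arg.pagefault.address &
                                 ~((unsigned long)page_size - 1);
            void *page = request_page_from_source(addr);

            /* Install the page and wake the faulting vCPU in one step. */
            struct uffdio_copy copy = {
                .dst = addr,
                .src = (unsigned long)page,
                .len = page_size,
                .mode = 0,
            };
            ioctl(uffd, UFFDIO_COPY, &copy);
        }
    }

The nice property is that the downtime per missing page is a single
round trip to the source, rather than waiting for the whole RAM
transfer up front.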

Regards,

Anthony Liguori

> But the flipping code with
> vmsplice should be needed anyway to avoid doubling memory usage, and
> it's looking pretty good in this version already!  I'm relieved that the
> RDMA code was designed right!
>
> Paolo
>
>
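For reference, the flipping trick comes down to vmsplice(2) with
SPLICE_F_GIFT: the outgoing QEMU donates its RAM pages to a pipe so
the kernel can hand the physical pages over instead of copying them,
which is what avoids the 2x memory usage. A minimal sketch of the
sending side (illustrative only, not the code from Lei's series):

    #define _GNU_SOURCE
    #include <fcntl.h>
    #include <sys/uio.h>
    #include <unistd.h>

    static int gift_ram_chunk(int pipe_wr, void *ram, size_t len)
    {
        struct iovec iov = { .iov_base = ram, .iov_len = len };

        while (iov.iov_len > 0) {
            /* SPLICE_F_GIFT lets the kernel steal the (page-aligned)
             * pages rather than copy them; the caller must not touch
             * the memory afterwards. */
            ssize_t n = vmsplice(pipe_wr, &iov, 1, SPLICE_F_GIFT);
            if (n < 0)
                return -1;
            iov.iov_base = (char *)iov.iov_base + n;
            iov.iov_len -= n;
        }
        return 0;
    }

As far as I know, the gift only pays off when the reading side can
actually steal the pages (e.g. splice() with SPLICE_F_MOVE), and
mainline does not guarantee that move path today, which is presumably
why the new vmsplice work is needed here.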
