From: Dor Laor <dlaor@redhat.com>
To: Anthony Liguori <anthony@codemonkey.ws>
Cc: kvm@vger.kernel.org, satoshi.itoh@aist.go.jp,
t.hirofuchi@aist.go.jp, qemu-devel@nongnu.org,
Orit Wasserman <owasserm@redhat.com>, Avi Kivity <avi@redhat.com>,
Isaku Yamahata <yamahata@valinux.co.jp>
Subject: Re: [RFC] postcopy livemigration proposal
Date: Mon, 08 Aug 2011 22:47:09 +0300 [thread overview]
Message-ID: <4E403D3D.80605@redhat.com> (raw)
In-Reply-To: <4E4007EA.4070104@codemonkey.ws>
On 08/08/2011 06:59 PM, Anthony Liguori wrote:
> On 08/08/2011 10:36 AM, Avi Kivity wrote:
>> On 08/08/2011 06:29 PM, Anthony Liguori wrote:
>>>
>>>>>> - Efficient, reduce needed traffic no need to re-send pages.
>>>>>
>>>>> It's not quite that simple. Post-copy needs to introduce a protocol
>>>>> capable of requesting pages.
>>>>
>>>> Just another subsection.. (kidding), still it shouldn't be too
>>>> complicated, just an offset+pagesize and return page_content/error
>>>
>>> What I meant by this is that there is potentially a lot of round trip
>>> overhead. Pre-copy migration works well with reasonable high latency
>>> network connections because the downtime is capped only by the maximum
>>> latency sending from one point to another.
>>>
>>> But with something like this, the total downtime is
>>> 2*max_latency*nb_pagefaults. That's potentially pretty high.
>>
>> Let's be generous and assume that the latency is dominated by page copy
>> time. So the total downtime is equal to the first live migration pass,
>> ~20 sec for 2GB on 1GbE. It's distributed over potentially even more
>> time, though. If the guest does a lot of I/O, it may not be noticeable
>> (esp. if we don't copy over pages read from disk). If the guest is
>> cpu/memory bound, it'll probably suck badly.
>>
>>>
>>> So it may be desirable to try to reduce nb_pagefaults by prefaulting
>>> in pages, etc. Suffice to say, this ends up getting complicated and
>>> may end up burning network traffic too.
It is complicated but can help (like pre faulting working set size
pages). Beyond that async page fault will help a bit.
Lastly, if a guest has several apps, those that are memory intensive
might suffer but light weight apps will function nicely.
It provides extra flexibility over the current protocol (that still has
value for some of the loads).
>>
>> Yeah, and prefaulting in the background adds latency to synchronous
>> requests.
>>
>> This really needs excellent networking resources to work well.
>
> Yup, it's very similar to other technologies using RDMA (single system
> image, lock step execution, etc.).
>
> Regards,
>
> Anthony Liguori
>
>>
>
>
WARNING: multiple messages have this Message-ID (diff)
From: Dor Laor <dlaor@redhat.com>
To: Anthony Liguori <anthony@codemonkey.ws>
Cc: kvm@vger.kernel.org, satoshi.itoh@aist.go.jp,
t.hirofuchi@aist.go.jp, qemu-devel@nongnu.org,
Orit Wasserman <owasserm@redhat.com>, Avi Kivity <avi@redhat.com>,
Isaku Yamahata <yamahata@valinux.co.jp>
Subject: Re: [Qemu-devel] [RFC] postcopy livemigration proposal
Date: Mon, 08 Aug 2011 22:47:09 +0300 [thread overview]
Message-ID: <4E403D3D.80605@redhat.com> (raw)
In-Reply-To: <4E4007EA.4070104@codemonkey.ws>
On 08/08/2011 06:59 PM, Anthony Liguori wrote:
> On 08/08/2011 10:36 AM, Avi Kivity wrote:
>> On 08/08/2011 06:29 PM, Anthony Liguori wrote:
>>>
>>>>>> - Efficient, reduce needed traffic no need to re-send pages.
>>>>>
>>>>> It's not quite that simple. Post-copy needs to introduce a protocol
>>>>> capable of requesting pages.
>>>>
>>>> Just another subsection.. (kidding), still it shouldn't be too
>>>> complicated, just an offset+pagesize and return page_content/error
>>>
>>> What I meant by this is that there is potentially a lot of round trip
>>> overhead. Pre-copy migration works well with reasonable high latency
>>> network connections because the downtime is capped only by the maximum
>>> latency sending from one point to another.
>>>
>>> But with something like this, the total downtime is
>>> 2*max_latency*nb_pagefaults. That's potentially pretty high.
>>
>> Let's be generous and assume that the latency is dominated by page copy
>> time. So the total downtime is equal to the first live migration pass,
>> ~20 sec for 2GB on 1GbE. It's distributed over potentially even more
>> time, though. If the guest does a lot of I/O, it may not be noticeable
>> (esp. if we don't copy over pages read from disk). If the guest is
>> cpu/memory bound, it'll probably suck badly.
>>
>>>
>>> So it may be desirable to try to reduce nb_pagefaults by prefaulting
>>> in pages, etc. Suffice to say, this ends up getting complicated and
>>> may end up burning network traffic too.
It is complicated but can help (like pre faulting working set size
pages). Beyond that async page fault will help a bit.
Lastly, if a guest has several apps, those that are memory intensive
might suffer but light weight apps will function nicely.
It provides extra flexibility over the current protocol (that still has
value for some of the loads).
>>
>> Yeah, and prefaulting in the background adds latency to synchronous
>> requests.
>>
>> This really needs excellent networking resources to work well.
>
> Yup, it's very similar to other technologies using RDMA (single system
> image, lock step execution, etc.).
>
> Regards,
>
> Anthony Liguori
>
>>
>
>
next prev parent reply other threads:[~2011-08-08 19:47 UTC|newest]
Thread overview: 54+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-08-08 3:24 [RFC] postcopy livemigration proposal Isaku Yamahata
2011-08-08 3:24 ` [Qemu-devel] " Isaku Yamahata
2011-08-08 9:20 ` Dor Laor
2011-08-08 9:20 ` [Qemu-devel] " Dor Laor
2011-08-08 9:40 ` Yaniv Kaul
2011-08-08 9:40 ` [Qemu-devel] " Yaniv Kaul
2011-08-08 21:42 ` Anthony Liguori
2011-08-08 21:42 ` Anthony Liguori
2011-08-08 10:59 ` Nadav Har'El
2011-08-08 10:59 ` [Qemu-devel] " Nadav Har'El
2011-08-08 11:47 ` Dor Laor
2011-08-08 11:47 ` [Qemu-devel] " Dor Laor
2011-08-08 16:52 ` Cleber Rosa
2011-08-08 15:52 ` Anthony Liguori
2011-08-08 12:32 ` Anthony Liguori
2011-08-08 12:32 ` [Qemu-devel] " Anthony Liguori
2011-08-08 15:11 ` Dor Laor
2011-08-08 15:11 ` Dor Laor
2011-08-08 15:29 ` Anthony Liguori
2011-08-08 15:29 ` Anthony Liguori
2011-08-08 15:36 ` Avi Kivity
2011-08-08 15:36 ` [Qemu-devel] " Avi Kivity
2011-08-08 15:59 ` Anthony Liguori
2011-08-08 15:59 ` Anthony Liguori
2011-08-08 19:47 ` Dor Laor [this message]
2011-08-08 19:47 ` Dor Laor
2011-08-09 2:07 ` Isaku Yamahata
2011-08-09 2:07 ` Isaku Yamahata
2011-08-08 9:38 ` Stefan Hajnoczi
2011-08-08 9:38 ` Stefan Hajnoczi
2011-08-08 9:43 ` Isaku Yamahata
2011-08-08 9:43 ` Isaku Yamahata
2011-08-08 12:38 ` Avi Kivity
2011-08-08 12:38 ` [Qemu-devel] " Avi Kivity
2011-08-09 2:33 ` Isaku Yamahata
2011-08-09 2:33 ` [Qemu-devel] " Isaku Yamahata
2011-08-10 13:55 ` Avi Kivity
2011-08-10 13:55 ` [Qemu-devel] " Avi Kivity
2011-08-11 2:19 ` Isaku Yamahata
2011-08-11 2:19 ` [Qemu-devel] " Isaku Yamahata
2011-08-11 16:55 ` Andrea Arcangeli
2011-08-11 16:55 ` [Qemu-devel] " Andrea Arcangeli
2011-08-12 11:07 ` [PATCH][RFC] post copy chardevice (was Re: [RFC] postcopy livemigration proposal) Isaku Yamahata
2011-08-12 11:07 ` [Qemu-devel] " Isaku Yamahata
2011-08-12 11:09 ` Isaku Yamahata
2011-08-12 11:09 ` [Qemu-devel] " Isaku Yamahata
2011-08-12 21:26 ` Blue Swirl
2011-08-12 21:26 ` Blue Swirl
2011-08-15 19:29 ` Avi Kivity
2011-08-15 19:29 ` [Qemu-devel] " Avi Kivity
2011-08-16 1:42 ` Isaku Yamahata
2011-08-16 1:42 ` [Qemu-devel] " Isaku Yamahata
2011-08-16 13:40 ` Avi Kivity
2011-08-16 13:40 ` [Qemu-devel] " Avi Kivity
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4E403D3D.80605@redhat.com \
--to=dlaor@redhat.com \
--cc=anthony@codemonkey.ws \
--cc=avi@redhat.com \
--cc=kvm@vger.kernel.org \
--cc=owasserm@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=satoshi.itoh@aist.go.jp \
--cc=t.hirofuchi@aist.go.jp \
--cc=yamahata@valinux.co.jp \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.