* [Qemu-devel] about post copy recovery
@ 2017-01-10 9:39 Li, Liang Z
2017-01-12 11:58 ` Dr. David Alan Gilbert
0 siblings, 1 reply; 3+ messages in thread
From: Li, Liang Z @ 2017-01-10 9:39 UTC (permalink / raw)
To: Dr. David Alan Gilbert (git); +Cc: qemu-devel@nongnu.org
Hi David,
I remember that some people wanted to solve the problem of postcopy recovery when the network breaks down. Do you know the latest status?
Thanks!
Liang
* Re: [Qemu-devel] about post copy recovery
From: Dr. David Alan Gilbert @ 2017-01-12 11:58 UTC (permalink / raw)
To: Li, Liang Z; +Cc: qemu-devel@nongnu.org
* Li, Liang Z (liang.z.li@intel.com) wrote:
>
> Hi David,
>
> I remember that some people wanted to solve the problem of postcopy recovery when the network breaks down. Do you know the latest status?
Hi Liang,
Yes, Haris looked at it as part of GSoC, the latest
version is what was posted:
https://lists.nongnu.org/archive/html/qemu-devel/2016-08/msg03468.html
I've not done any work on it since then; there are a couple of
hard problems to be solved. The simpler is making sure that
we always correctly detect a migration error due to networking
(rather than some other non-recoverable error); there's lots of
migration code that doesn't check for a file error straight away
and only hits the error code later on when it's too late to recover.
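To make the "check straight away" point concrete, here is a minimal sketch in C. It is not QEMU code: `classify_errno`, `struct mig_channel`, and `channel_write_all` are hypothetical names, and the set of errno values treated as recoverable is only an assumption about what a network failure might look like. The idea is to record the first error at the call that produced it, so a later stage can still tell a network drop from a fatal error.

```c
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical classification; the real policy would need care. */
enum mig_err_class { MIG_ERR_NONE, MIG_ERR_RECOVERABLE, MIG_ERR_FATAL };

/* Assumption: these errno values indicate a network failure that a
 * reconnect could recover from; everything else is treated as fatal. */
static enum mig_err_class classify_errno(int err)
{
    switch (err) {
    case 0:
        return MIG_ERR_NONE;
    case ECONNRESET:
    case ECONNABORTED:
    case EPIPE:
    case ETIMEDOUT:
    case ENETDOWN:
    case ENETUNREACH:
    case EHOSTUNREACH:
        return MIG_ERR_RECOVERABLE;
    default:
        return MIG_ERR_FATAL;
    }
}

/* Hypothetical channel wrapper that remembers the first error seen. */
struct mig_channel {
    int fd;
    int first_errno;   /* 0 while the channel is healthy */
};

/* Write the whole buffer, checking for an error after every write()
 * rather than deferring the check to some later point. */
static bool channel_write_all(struct mig_channel *ch,
                              const void *buf, size_t len)
{
    const char *p = buf;

    if (ch->first_errno) {
        return false;              /* already failed; keep first error */
    }
    while (len > 0) {
        ssize_t n = write(ch->fd, p, len);
        if (n < 0) {
            if (errno == EINTR) {
                continue;          /* interrupted, just retry */
            }
            ch->first_errno = errno;
            return false;
        }
        if (n == 0) {
            ch->first_errno = EIO; /* no progress; treat as an error */
            return false;
        }
        p += n;
        len -= (size_t)n;
    }
    return true;
}
```

A caller would check the return value of every `channel_write_all()` call and pass `ch->first_errno` to `classify_errno()` to decide between pausing for recovery and failing the migration.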
The harder problem is that we often end up with the case where
the main thread is blocked trying to access postcopied-RAM,
e.g. an emulated network driver tries to write an incoming
packet to guest RAM but finds the guest RAM hasn't arrived
yet.
With the main thread blocked it's very difficult to recover -
we can't issue any commands to trigger the recovery and even
if we could we'll have to be very careful about what things
those commands need the main thread to do.
Dave
>
> Thanks!
> Liang
--
Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK
* Re: [Qemu-devel] about post copy recovery
From: Li, Liang Z @ 2017-01-16 5:19 UTC (permalink / raw)
To: Dr. David Alan Gilbert; +Cc: qemu-devel@nongnu.org
> * Li, Liang Z (liang.z.li@intel.com) wrote:
> >
> > Hi David,
> >
> > I remember that some people wanted to solve the problem of postcopy
> > recovery when the network breaks down. Do you know the latest status?
>
> Hi Liang,
> Yes, Haris looked at it as part of GSoC, the latest version is what was posted:
>
> https://lists.nongnu.org/archive/html/qemu-devel/2016-08/msg03468.html
>
> I've not done any work on it since then; there are a couple of hard problems
> to be solved. The simpler is making sure that we always correctly detect a
> migration error due to networking (rather than some other non-recoverable
> error); there's lots of migration code that doesn't check for a file error
> straight away and only hits the error code later on when it's too late to
> recover.
>
> The harder problem is that we often end up with the case where the main
> thread is blocked trying to access postcopied-RAM, e.g. an emulated
> network driver tries to write an incoming packet to guest RAM but finds the
> guest RAM hasn't arrived yet.
> With the main thread blocked it's very difficult to recover - we can't issue any
> commands to trigger the recovery and even if we could we'll have to be very
> careful about what things those commands need the main thread to do.
>
> Dave
Thanks for the information!
I will take a look first and may get back to you later.
Liang