qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: David Gibson <david@gibson.dropbear.id.au>
To: Igor Mammedov <imammedo@redhat.com>
Cc: "Dr. David Alan Gilbert" <dgilbert@redhat.com>,
	quintela@redhat.com, qemu-ppc@nongnu.org, qemu-devel@nongnu.org,
	jdenemar@redhat.com
Subject: Re: [Qemu-devel] unplug_request and migration
Date: Fri, 9 Jun 2017 20:03:36 +1000	[thread overview]
Message-ID: <20170609100336.GJ26521@umbus.fritz.box> (raw)
In-Reply-To: <20170609110748.00c8e419@nial.brq.redhat.com>

[-- Attachment #1: Type: text/plain, Size: 3149 bytes --]

On Fri, Jun 09, 2017 at 11:09:10AM +0200, Igor Mammedov wrote:
> On Fri, 9 Jun 2017 00:41:06 +1000
> David Gibson <david@gibson.dropbear.id.au> wrote:
> 
> > Hi Dave & Juan,
> > 
> > I'm hoping one of you can answer this.
> > 
> > I'm currently grappling with (amongst other things) a pseries machine
> > racing a hot unplug operation with a migrate.  There's various issues
> > with what interim state we need, and which bits of it need to be
> > migrated that I'm still investigating.  But, there's a more general
> > question that I'm guessing must have already been addressed for x86.
> > 
> > For any "soft" unplug device - i.e. using ->unplug_request, rather
> > than ->unplug, giving a device_del command will just ask the guest
> > nicely to release the device, with the completion of the unplug
> > happening only if and when the guest indicates it's ready for the
> > device to go away.  AFAICT, the device_del command will return as soon
> > as the request is made, but if the guest is busy, the completion of
> > the hot unplug could take arbitrarily long.
> > 
> > So, what happens if there's a migration in between the unplug_request
> > and the guest completing the unplug?  How does libvirt (or whatever)
> > know whether to include the device on the destination machine command
> > line?
> > 
> 
> looking at qdev_unplug():
>     if (!migration_is_idle()) { 
>         error_setg(errp, "device_del not allowed while migrating");
>         return;
>     }
> 
> so unplug request should fail if migration is in progress , it won't reach guest
> and mgmt side will have to repeat request on migration completion.
> 
> But it's still possible to issue unplug request first and then start
> migration,

Right, that's the case I'm interested in, not the other way around.

> that's where race between DEVICE_DELETED and migration start (starting DST with
> being unplugged device) occurs.
> 
> it could be possible:
>  1: on unplug_request() set global flag that there is pending unplug and forbid
>     migration until completion. But there is no guarantee that unplug will
>     be completed nor a way to notice that it's failed/rejected by guest.
>     I'm not sure how that could be solved.
>  2: set per device pending_unplug flag and delay unplug event from guest
>     until migration is completed if migration is in progress when unplug
>     callback is called.
>     mgmt will treat the case as usual migration, i.e. start dst with being
>     unplugged device, and device will be removed on dst side on migration
>     completion.
>     (it should be generic solution as x86 is also affected), as place where
>     to put this common logic I'd suggest hotplug_handler_unplug()

So.. it seems like the short version is that racing migration and
unplug is broken already.

Which is unfortunate, but at least means I don't need to worry about
it particularly for Power.

-- 
David Gibson			| I'll have my music baroque, and my code
david AT gibson.dropbear.id.au	| minimalist, thank you.  NOT _the_ _other_
				| _way_ _around_!
http://www.ozlabs.org/~dgibson

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 819 bytes --]

  reply	other threads:[~2017-06-09 10:57 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-06-08 14:41 [Qemu-devel] unplug_request and migration David Gibson
2017-06-08 15:10 ` Dr. David Alan Gilbert
2017-06-08 15:44   ` Dr. David Alan Gilbert
2017-06-08 16:07 ` Juan Quintela
2017-06-09  9:09 ` Igor Mammedov
2017-06-09 10:03   ` David Gibson [this message]
2017-06-09 12:18     ` Juan Quintela

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20170609100336.GJ26521@umbus.fritz.box \
    --to=david@gibson.dropbear.id.au \
    --cc=dgilbert@redhat.com \
    --cc=imammedo@redhat.com \
    --cc=jdenemar@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=qemu-ppc@nongnu.org \
    --cc=quintela@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).