qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: David Gibson <david@gibson.dropbear.id.au>
To: Markus Armbruster <armbru@redhat.com>
Cc: Juan Quintela <quintela@redhat.com>,
	"Dr. David Alan Gilbert" <dgilbert@redhat.com>,
	qemu-devel@nongnu.org
Subject: Re: -only-migrate and the two different uses of migration blockers
Date: Wed, 21 Jul 2021 16:32:28 +1000	[thread overview]
Message-ID: <YPe/fIBuvGgfiyy3@yekko> (raw)
In-Reply-To: <87o8axh7rr.fsf@dusky.pond.sub.org>

[-- Attachment #1: Type: text/plain, Size: 2661 bytes --]

On Tue, Jul 20, 2021 at 07:30:16AM +0200, Markus Armbruster wrote:
> "Dr. David Alan Gilbert" <dgilbert@redhat.com> writes:
> 
> > * Markus Armbruster (armbru@redhat.com) wrote:
> >> We appear to use migration blockers in two ways:
> >> 
> >> (1) Prevent migration for an indefinite time, typically due to use of
> >> some feature that isn't compatible with migration.
> >> 
> >> (2) Delay migration for a short time.
> >> 
> >> Option -only-migrate is designed for (1).  It interferes with (2).
> >> 
> >> Example for (1): device "x-pci-proxy-dev" doesn't support migration.  It
> >> adds a migration blocker on realize, and deletes it on unrealize.  With
> >> -only-migrate, device realize fails.  Works as designed.
> >> 
> >> Example for (2): spapr_mce_req_event() makes an effort to prevent
> >> migration degrate the reporting of FWNMIs.  It adds a migration blocker
> >> when it receives one, and deletes it when it's done handling it.  This
> >> is a best effort; if migration is already in progress by the time FWNMI
> >> is received, we simply carry on, and that's okay.  However, option
> >> -only-migrate sabotages the best effort entirely.
> >
> > That's interesting; it's the first time I've heard of anyone using it as
> > 'best effort'.  I've always regarded blockers as blocking.
> 
> Me too, until I found this one.

Right, it may well have been the first usage this way, this fwnmi
stuff isn't super old.

> >> While this isn't exactly terrible, it may be a weakness in our thinking
> >> and our infrastructure.  I'm bringing it up so the people in charge are
> >> aware :)
> >
> > Thanks.
> >
> > It almost feels like they need a way to temporarily hold off
> > 'completion' of migratio - i.e. the phase where we stop the CPU and
> > write the device data;  mind you you'd also probably want it to stop
> > cold-migrates/snapshots?
> 
> Yes, a proper way to delay 'completion' for a bit would be clearer, and
> wouldn't let -only-migrate interfere.

Right.  If that becomes a thing, we should use it here.  Note that
this one use case probably isn't a very strong argument for it,
though.  The only problem here is slightly less that optimal error
reporting in a rare edge case (hardware fault occurs by chance at the
same time as a migration).


.... and, also, I half-suspect that the whole fwnmi feature exists
more to tick IBM RAS check boxes than because anyone will actually use
it.

-- 
David Gibson			| I'll have my music baroque, and my code
david AT gibson.dropbear.id.au	| minimalist, thank you.  NOT _the_ _other_
				| _way_ _around_!
http://www.ozlabs.org/~dgibson

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 833 bytes --]

  reply	other threads:[~2021-07-21  6:44 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-07-15 13:32 spapr_events: Sure we may ignore migrate_add_blocker() failure? Markus Armbruster
2021-07-19  2:31 ` David Gibson
2021-07-19  7:18   ` Markus Armbruster
2021-07-19  7:20     ` David Gibson
2021-07-19 10:41       ` Markus Armbruster
2021-07-19 11:00         ` -only-migrate and the two different uses of migration blockers (was: spapr_events: Sure we may ignore migrate_add_blocker() failure?) Markus Armbruster
2021-07-19 12:42           ` Dr. David Alan Gilbert
2021-07-20  5:30             ` -only-migrate and the two different uses of migration blockers Markus Armbruster
2021-07-21  6:32               ` David Gibson [this message]
2021-07-22 18:00                 ` Dr. David Alan Gilbert
2021-07-25  6:25                   ` David Gibson
2021-11-02 14:32                 ` Juan Quintela
2021-11-02 14:30           ` Juan Quintela
2021-07-21  6:26         ` spapr_events: Sure we may ignore migrate_add_blocker() failure? David Gibson

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=YPe/fIBuvGgfiyy3@yekko \
    --to=david@gibson.dropbear.id.au \
    --cc=armbru@redhat.com \
    --cc=dgilbert@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=quintela@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).