From: Alex Williamson <alex.williamson@redhat.com>
To: Shenming Lu <lushenming@huawei.com>
Cc: Neo Jia <cjia@nvidia.com>, Marc Zyngier <maz@kernel.org>,
Cornelia Huck <cohuck@redhat.com>,
"Dr. David Alan Gilbert" <dgilbert@redhat.com>,
qemu-devel@nongnu.org, Eric Auger <eric.auger@redhat.com>,
Kirti Wankhede <kwankhede@nvidia.com>,
qemu-arm@nongnu.org, yuzenghui@huawei.com,
wanghaibin.wang@huawei.com
Subject: Re: [PATCH RFC] vfio: Move the saving of the config space to the right place in VFIO migration
Date: Mon, 30 Nov 2020 10:03:37 -0700 [thread overview]
Message-ID: <20201130100337.4afe8eb4@w520.home> (raw)
In-Reply-To: <ed6c0920-8a26-fafe-01a6-3021c5a92adb@huawei.com>
On Thu, 26 Nov 2020 14:56:17 +0800
Shenming Lu <lushenming@huawei.com> wrote:
> Hi,
>
> After reading everyone's opinions, we have a rough idea for this issue.
>
> One key point is whether it is necessary to setup the config space before
> the device can accept further migration data. I think it is decided by
> the vendor driver, so we can simply ask the vendor driver about it in
> .save_setup, which could avoid a lot of unnecessary copies and settings.
> Once we have known the need, we can iterate the config space (before)
> along with the device migration data in .save_live_iterate and
> .save_live_complete_precopy, and if not needed, we can only migrate the
> config space in .save_state.
>
> Another key point is that the interrupt enabling should be after the
> restoring of the interrupt controller (might not only interrupts).
> My solution is to add a subflag at the beginning of the config data
> (right after VFIO_MIG_FLAG_DEV_CONFIG_STATE) to indicate the triggered
> actions on the dst (such as whether to enable interrupts).
>
> Below is it's workflow.
>
> On the save path:
> In vfio_save_setup():
> Ask the vendor driver if it needs the config space setup before it
> can accept further migration data.
How does "ask the vendor driver" actually work?
> |
> In vfio_save_iterate() (pre-copy):
> If *needed*, save the config space which would be setup on the dst
> before the migration data, but send with a subflag to instruct not
> to (such as) enable interrupts.
If not for triggering things like MSI/X configuration, isn't config
space almost entirely virtual? What visibility does the vendor driver
have to the VM machine dependencies regarding device interrupt versus
interrupt controller migration?
> |
> In vfio_save_complete_precopy() (stop-and-copy, iterable process):
> The same as that in vfio_save_iterate().
> |
> In .save_state (stop-and-copy, non-iterable process):
> If *needed*, only send a subflag to instruct to enable interrupts.
> If *not needed*, save the config space and setup everything on the dst.
Again, how does the vendor driver have visibility to know when the VM
machine can enable interrupts?
>
> Besides the above idea, we might be able to choose to let the vendor driver do
> more: qemu just sends and writes the config data (before) along with the device
> migration data every time, and it's up to the vendor driver to filter out/buffer
> the received data or reorder the settings...
There is no vendor driver in QEMU though, so are you suggesting that
QEMU follows a standard protocol and the vendor driver chooses when to
enable specific features? For instance, QEMU would call SET_IRQS and
the driver would return success, but defer that setup if necessary?
That seems quite troubling as we then have ioctls that behave
differently depending on the device state and we have no error path to
userspace should that setup fail later. The vendor driver does have
its own data stream for migration, so the vendor driver could tell the
destination version of itself what type of interrupt to use, which
might be sufficient if we were to ignore the latency if QEMU were to
defer interrupt setup until stop-and-copy.
Is the question of when to setup device interrupts versus the interrupt
controller state largely a machine issue within QEMU? If so, shouldn't
it be at QEMU's determination when to act on the config space
information on the target? IOW, if a vendor driver has a dependency on
interrupt configuration, they need to include it in their own pre-copy
data stream and decouple that dependency from userspace interrupt
configuration via the SET_IRQS ioctl. Is that possible? Thanks,
Alex
next prev parent reply other threads:[~2020-11-30 17:06 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-11-14 9:17 [PATCH RFC] vfio: Move the saving of the config space to the right place in VFIO migration Shenming Lu
2020-11-19 8:43 ` Kirti Wankhede
2020-11-19 17:41 ` Alex Williamson
2020-11-20 14:05 ` Shenming Lu
2020-11-20 22:01 ` Alex Williamson
2020-11-23 3:14 ` Shenming Lu
2020-11-23 19:33 ` Neo Jia
2020-11-23 21:46 ` Alex Williamson
2020-11-26 6:56 ` Shenming Lu
2020-11-30 17:03 ` Alex Williamson [this message]
2020-12-01 6:37 ` Shenming Lu
2020-12-01 22:21 ` Alex Williamson
2020-12-03 8:34 ` Shenming Lu
2020-12-02 10:55 ` Dr. David Alan Gilbert
2020-12-04 10:45 ` Shenming Lu
2020-11-24 11:01 ` Shenming Lu
2020-11-24 11:02 ` Dr. David Alan Gilbert
2020-11-23 13:52 ` Cornelia Huck
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20201130100337.4afe8eb4@w520.home \
--to=alex.williamson@redhat.com \
--cc=cjia@nvidia.com \
--cc=cohuck@redhat.com \
--cc=dgilbert@redhat.com \
--cc=eric.auger@redhat.com \
--cc=kwankhede@nvidia.com \
--cc=lushenming@huawei.com \
--cc=maz@kernel.org \
--cc=qemu-arm@nongnu.org \
--cc=qemu-devel@nongnu.org \
--cc=wanghaibin.wang@huawei.com \
--cc=yuzenghui@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).