From: Peter Xu <peterx@redhat.com>
To: "Dr. David Alan Gilbert (git)" <dgilbert@redhat.com>
Cc: qemu-devel@nongnu.org, maxime.coquelin@redhat.com,
a.perevalov@samsung.com, mst@redhat.com,
marcandre.lureau@redhat.com, quintela@redhat.com,
lvivier@redhat.com, aarcange@redhat.com, felipe@nutanix.com
Subject: Re: [Qemu-devel] [RFC v2 16/32] vhost+postcopy: Send address back to qemu
Date: Tue, 29 Aug 2017 16:30:03 +0800 [thread overview]
Message-ID: <20170829083003.GD2610@pxdev.xzpeter.org> (raw)
In-Reply-To: <20170824192730.8440-17-dgilbert@redhat.com>
On Thu, Aug 24, 2017 at 08:27:14PM +0100, Dr. David Alan Gilbert (git) wrote:
> From: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
>
> We need a better way, but at the moment we need the address of the
> mappings sent back to qemu so it can interpret the messages on the
> userfaultfd it reads.
>
> Note: We don't ask for the default 'ack' reply since we've got our own.
>
> Signed-off-by: Dr. David Alan Gilbert <dgilbert@redhat.com>
> ---
> contrib/libvhost-user/libvhost-user.c | 15 ++++++++-
> docs/interop/vhost-user.txt | 6 ++++
> hw/virtio/trace-events | 1 +
> hw/virtio/vhost-user.c | 57 ++++++++++++++++++++++++++++++++++-
> 4 files changed, 77 insertions(+), 2 deletions(-)
>
> diff --git a/contrib/libvhost-user/libvhost-user.c b/contrib/libvhost-user/libvhost-user.c
> index e6ab059a03..5ec54f7d60 100644
> --- a/contrib/libvhost-user/libvhost-user.c
> +++ b/contrib/libvhost-user/libvhost-user.c
> @@ -477,13 +477,26 @@ vu_set_mem_table_exec(VuDev *dev, VhostUserMsg *vmsg)
> DPRINT("%s: region %d: Registered userfault for %llx + %llx\n",
> __func__, i, reg_struct.range.start, reg_struct.range.len);
> /* TODO: Stash 'zero' support flags somewhere */
> - /* TODO: Get address back to QEMU */
>
> + /* TODO: We need to find a way for the qemu not to see the virtual
> + * addresses of the clients, so as to keep better separation.
> + */
> + /* Return the address to QEMU so that it can translate the ufd
> + * fault addresses back.
> + */
> + msg_region->userspace_addr = (uintptr_t)(mmap_addr +
> + dev_region->mmap_offset);
> }
>
> close(vmsg->fds[i]);
> }
>
> + if (dev->postcopy_listening) {
> + /* Need to return the addresses - send the updated message back */
> + vmsg->fd_num = 0;
> + return true;
> + }
> +
> return false;
> }
>
> diff --git a/docs/interop/vhost-user.txt b/docs/interop/vhost-user.txt
> index 73c3dd74db..b2a548c94d 100644
> --- a/docs/interop/vhost-user.txt
> +++ b/docs/interop/vhost-user.txt
> @@ -413,12 +413,18 @@ Master message types
> Id: 5
> Equivalent ioctl: VHOST_SET_MEM_TABLE
> Master payload: memory regions description
> + Slave payload: (postcopy only) memory regions description
>
> Sets the memory map regions on the slave so it can translate the vring
> addresses. In the ancillary data there is an array of file descriptors
> for each memory mapped region. The size and ordering of the fds matches
> the number and ordering of memory regions.
>
> + When postcopy-listening has been received, SET_MEM_TABLE replies with
> + the bases of the memory mapped regions to the master. It must have mmap'd
> + the regions and enabled userfaultfd on them. Note NEED_REPLY_MASK
> + is not set in this case.
> +
> * VHOST_USER_SET_LOG_BASE
>
> Id: 6
> diff --git a/hw/virtio/trace-events b/hw/virtio/trace-events
> index f736c7c84f..63fd4a79cf 100644
> --- a/hw/virtio/trace-events
> +++ b/hw/virtio/trace-events
> @@ -2,6 +2,7 @@
>
> # hw/virtio/vhost-user.c
> vhost_user_postcopy_listen(void) ""
> +vhost_user_set_mem_table_postcopy(uint64_t client_addr, uint64_t qhva, int reply_i, int region_i) "client:0x%"PRIx64" for hva: 0x%"PRIx64" reply %d region %d"
>
> # hw/virtio/virtio.c
> virtqueue_alloc_element(void *elem, size_t sz, unsigned in_num, unsigned out_num) "elem %p size %zd in_num %u out_num %u"
> diff --git a/hw/virtio/vhost-user.c b/hw/virtio/vhost-user.c
> index 9178271ab2..2e4eb0864a 100644
> --- a/hw/virtio/vhost-user.c
> +++ b/hw/virtio/vhost-user.c
> @@ -19,6 +19,7 @@
> #include "qemu/sockets.h"
> #include "migration/migration.h"
> #include "migration/postcopy-ram.h"
> +#include "trace.h"
>
> #include <sys/ioctl.h>
> #include <sys/socket.h>
> @@ -133,6 +134,7 @@ struct vhost_user {
> int slave_fd;
> NotifierWithReturn postcopy_notifier;
> struct PostCopyFD postcopy_fd;
> + uint64_t postcopy_client_bases[VHOST_MEMORY_MAX_NREGIONS];
> };
>
> static bool ioeventfd_enabled(void)
> @@ -300,11 +302,13 @@ static int vhost_user_set_log_base(struct vhost_dev *dev, uint64_t base,
> static int vhost_user_set_mem_table(struct vhost_dev *dev,
> struct vhost_memory *mem)
> {
> + struct vhost_user *u = dev->opaque;
> int fds[VHOST_MEMORY_MAX_NREGIONS];
> int i, fd;
> size_t fd_num = 0;
> bool reply_supported = virtio_has_feature(dev->protocol_features,
> - VHOST_USER_PROTOCOL_F_REPLY_ACK);
> + VHOST_USER_PROTOCOL_F_REPLY_ACK) &&
> + !u->postcopy_fd.handler;
(indent)
>
> VhostUserMsg msg = {
> .request = VHOST_USER_SET_MEM_TABLE,
> @@ -350,6 +354,57 @@ static int vhost_user_set_mem_table(struct vhost_dev *dev,
> return -1;
> }
>
> + if (u->postcopy_fd.handler) {
It seems that after this handler is set, we never clean it up. Do we
need to unset it somewhere? (maybe vhost_user_postcopy_end?)
> + VhostUserMsg msg_reply;
> + int region_i, reply_i;
> + if (vhost_user_read(dev, &msg_reply) < 0) {
> + return -1;
> + }
> +
> + if (msg_reply.request != VHOST_USER_SET_MEM_TABLE) {
> + error_report("%s: Received unexpected msg type."
> + "Expected %d received %d", __func__,
> + VHOST_USER_SET_MEM_TABLE, msg_reply.request);
> + return -1;
> + }
> + /* We're using the same structure, just reusing one of the
> + * fields, so it should be the same size.
> + */
> + if (msg_reply.size != msg.size) {
> + error_report("%s: Unexpected size for postcopy reply "
> + "%d vs %d", __func__, msg_reply.size, msg.size);
> + return -1;
> + }
> +
> + memset(u->postcopy_client_bases, 0,
> + sizeof(uint64_t) * VHOST_MEMORY_MAX_NREGIONS);
> +
> + /* They're in the same order as the regions that were sent
> + * but some of the regions were skipped (above) if they
> + * didn't have fd's
> + */
> + for (reply_i = 0, region_i = 0;
> + region_i < dev->mem->nregions;
> + region_i++) {
> + if (reply_i < fd_num &&
> + msg_reply.payload.memory.regions[region_i].guest_phys_addr ==
^^^^^^^^
should this be reply_i?
(And maybe we can use pointers for the regions for better readability?)
> + dev->mem->regions[region_i].guest_phys_addr) {
> + u->postcopy_client_bases[region_i] =
> + msg_reply.payload.memory.regions[reply_i].userspace_addr;
> + trace_vhost_user_set_mem_table_postcopy(
> + msg_reply.payload.memory.regions[reply_i].userspace_addr,
> + msg.payload.memory.regions[reply_i].userspace_addr,
> + reply_i, region_i);
> + reply_i++;
> + }
> + }
> + if (reply_i != fd_num) {
> + error_report("%s: postcopy reply not fully consumed "
> + "%d vs %zd",
> + __func__, reply_i, fd_num);
> + return -1;
> + }
> + }
> if (reply_supported) {
> return process_message_reply(dev, &msg);
> }
> --
> 2.13.5
>
--
Peter Xu
next prev parent reply other threads:[~2017-08-29 8:30 UTC|newest]
Thread overview: 94+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <CGME20170824192750epcas5p484df9724ca7c0a259a4dd85425a69e1d@epcas5p4.samsung.com>
2017-08-24 19:26 ` [Qemu-devel] [RFC v2 00/32] postcopy+vhost-user/shared ram Dr. David Alan Gilbert (git)
2017-08-24 19:26 ` [Qemu-devel] [RFC v2 01/32] vhu: vu_queue_started Dr. David Alan Gilbert (git)
2017-08-24 23:10 ` Marc-André Lureau
2017-08-25 14:58 ` Dr. David Alan Gilbert
2017-08-30 13:02 ` Michael S. Tsirkin
2017-08-30 13:13 ` Marc-André Lureau
2017-09-05 12:58 ` Dr. David Alan Gilbert
2017-09-05 13:01 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 02/32] vhub: Only process received packets on started queues Dr. David Alan Gilbert (git)
2017-08-30 9:59 ` Marc-André Lureau
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 03/32] migrate: Update ram_block_discard_range for shared Dr. David Alan Gilbert (git)
2017-08-29 5:30 ` Peter Xu
2017-09-18 12:18 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 04/32] qemu_ram_block_host_offset Dr. David Alan Gilbert (git)
2017-08-25 12:11 ` Philippe Mathieu-Daudé
2017-08-25 15:28 ` Dr. David Alan Gilbert
2017-08-29 5:36 ` Peter Xu
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 05/32] migration/ram: ramblock_recv_bitmap_test_byte_offset Dr. David Alan Gilbert (git)
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 06/32] postcopy: use UFFDIO_ZEROPAGE only when available Dr. David Alan Gilbert (git)
2017-08-30 9:57 ` Marc-André Lureau
2017-09-07 10:55 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 07/32] postcopy: Add notifier chain Dr. David Alan Gilbert (git)
2017-08-29 6:02 ` Peter Xu
2017-09-11 17:00 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 08/32] postcopy: Add vhost-user flag for postcopy and check it Dr. David Alan Gilbert (git)
2017-08-29 6:22 ` Peter Xu
2017-09-13 14:34 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 09/32] vhost-user: Add 'VHOST_USER_POSTCOPY_ADVISE' message Dr. David Alan Gilbert (git)
2017-08-30 10:07 ` Marc-André Lureau
2017-09-07 11:04 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 10/32] vhub: Support sending fds back to qemu Dr. David Alan Gilbert (git)
2017-08-30 10:22 ` Marc-André Lureau
2017-09-07 11:31 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 11/32] vhub: Open userfaultfd Dr. David Alan Gilbert (git)
2017-08-29 6:40 ` Peter Xu
2017-09-15 17:33 ` Dr. David Alan Gilbert
2017-08-30 10:30 ` Marc-André Lureau
2017-09-07 16:36 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 12/32] postcopy: Allow registering of fd handler Dr. David Alan Gilbert (git)
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 13/32] vhost+postcopy: Register shared ufd with postcopy Dr. David Alan Gilbert (git)
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 14/32] vhost+postcopy: Transmit 'listen' to client Dr. David Alan Gilbert (git)
2017-08-30 10:37 ` Marc-André Lureau
2017-09-07 12:10 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 15/32] vhost+postcopy: Register new regions with the ufd Dr. David Alan Gilbert (git)
2017-08-30 10:42 ` Marc-André Lureau
2017-09-08 14:50 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 16/32] vhost+postcopy: Send address back to qemu Dr. David Alan Gilbert (git)
2017-08-29 8:30 ` Peter Xu [this message]
2017-09-12 17:15 ` Dr. David Alan Gilbert
2017-09-13 4:29 ` Peter Xu
2017-09-13 12:15 ` Dr. David Alan Gilbert
2017-09-15 8:57 ` Peter Xu
2017-09-15 15:32 ` Dr. David Alan Gilbert
2017-09-18 9:31 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 17/32] vhost+postcopy: Stash RAMBlock and offset Dr. David Alan Gilbert (git)
2017-08-30 5:51 ` Peter Xu
2017-09-13 15:59 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 18/32] vhost+postcopy: Send requests to source for shared pages Dr. David Alan Gilbert (git)
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 19/32] vhost+postcopy: Resolve client address Dr. David Alan Gilbert (git)
2017-08-30 5:28 ` Peter Xu
2017-09-11 11:58 ` Dr. David Alan Gilbert
2017-09-13 5:18 ` Peter Xu
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 20/32] postcopy: wake shared Dr. David Alan Gilbert (git)
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 21/32] postcopy: postcopy_notify_shared_wake Dr. David Alan Gilbert (git)
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 22/32] vhost+postcopy: Add vhost waker Dr. David Alan Gilbert (git)
2017-08-30 5:55 ` Peter Xu
2017-09-13 13:09 ` Dr. David Alan Gilbert
2017-09-18 3:57 ` Peter Xu
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 23/32] vhost+postcopy: Call wakeups Dr. David Alan Gilbert (git)
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 24/32] vub+postcopy: madvises Dr. David Alan Gilbert (git)
2017-08-30 10:48 ` Marc-André Lureau
2017-09-07 12:30 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 25/32] vhost+postcopy: Lock around set_mem_table Dr. David Alan Gilbert (git)
2017-08-30 6:50 ` Peter Xu
2017-09-25 17:56 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 26/32] vhost: Add VHOST_USER_POSTCOPY_END message Dr. David Alan Gilbert (git)
2017-08-30 6:55 ` Peter Xu
2017-09-11 11:31 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 27/32] vhost+postcopy: Wire up POSTCOPY_END notify Dr. David Alan Gilbert (git)
2017-08-30 6:57 ` Peter Xu
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 28/32] postcopy: Allow shared memory Dr. David Alan Gilbert (git)
2017-08-30 10:39 ` Marc-André Lureau
2017-09-07 12:15 ` Dr. David Alan Gilbert
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 29/32] vhost-user: Claim support for postcopy Dr. David Alan Gilbert (git)
2017-08-30 10:50 ` Marc-André Lureau
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 30/32] vhost: Merge neighbouring hugepage regions where appropriate Dr. David Alan Gilbert (git)
2017-09-14 9:18 ` Igor Mammedov
2017-09-25 11:19 ` Dr. David Alan Gilbert
2017-10-02 13:49 ` Igor Mammedov
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 31/32] vhost: Don't break merged regions on small remove/non-adds Dr. David Alan Gilbert (git)
2017-08-24 19:27 ` [Qemu-devel] [RFC v2 32/32] postcopy shared docs Dr. David Alan Gilbert (git)
2017-09-01 13:34 ` [Qemu-devel] [RFC v2 00/32] postcopy+vhost-user/shared ram Alexey Perevalov
2017-09-01 13:42 ` Maxime Coquelin
2017-10-16 8:32 ` Alexey Perevalov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170829083003.GD2610@pxdev.xzpeter.org \
--to=peterx@redhat.com \
--cc=a.perevalov@samsung.com \
--cc=aarcange@redhat.com \
--cc=dgilbert@redhat.com \
--cc=felipe@nutanix.com \
--cc=lvivier@redhat.com \
--cc=marcandre.lureau@redhat.com \
--cc=maxime.coquelin@redhat.com \
--cc=mst@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=quintela@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).