From: Stefan Hajnoczi <stefanha@redhat.com>
To: "Michael S. Tsirkin" <mst@redhat.com>
Cc: Kevin Wolf <kwolf@redhat.com>,
Anthony Liguori <aliguori@us.ibm.com>,
Stefan Hajnoczi <stefanha@gmail.com>,
qemu-devel <qemu-devel@nongnu.org>,
Blue Swirl <blauwirbel@gmail.com>, Khoa Huynh <khoa@us.ibm.com>,
Paolo Bonzini <pbonzini@redhat.com>, Asias He <asias@redhat.com>
Subject: Re: [Qemu-devel] [PATCH v6 03/12] dataplane: add host memory mapping code
Date: Mon, 17 Dec 2012 10:09:12 +0100 [thread overview]
Message-ID: <20121217090912.GA14300@stefanha-thinkpad.redhat.com> (raw)
In-Reply-To: <20121216161114.GD15790@redhat.com>
On Sun, Dec 16, 2012 at 06:11:14PM +0200, Michael S. Tsirkin wrote:
> On Fri, Dec 14, 2012 at 12:45:16PM +0100, Stefan Hajnoczi wrote:
> > On Wed, Dec 12, 2012 at 4:49 PM, Michael S. Tsirkin <mst@redhat.com> wrote:
> > > On Wed, Dec 12, 2012 at 04:34:21PM +0100, Stefan Hajnoczi wrote:
> > >> On Tue, Dec 11, 2012 at 08:09:56PM +0200, Michael S. Tsirkin wrote:
> > >> > On Tue, Dec 11, 2012 at 10:32:28AM -0600, Anthony Liguori wrote:
> > >> > > "Michael S. Tsirkin" <mst@redhat.com> writes:
> > >> > >
> > >> > > > On Tue, Dec 11, 2012 at 04:27:49PM +0100, Stefan Hajnoczi wrote:
> > >> > > >> On Tue, Dec 11, 2012 at 3:13 PM, Michael S. Tsirkin <mst@redhat.com> wrote:
> > >> > > >> > On Mon, Dec 10, 2012 at 02:09:36PM +0100, Stefan Hajnoczi wrote:
> > >> > > >> >> The data plane thread needs to map guest physical addresses to host
> > >> > > >> >> pointers. Normally this is done with cpu_physical_memory_map() but the
> > >> > > >> >> function assumes the global mutex is held. The data plane thread does
> > >> > > >> >> not touch the global mutex and therefore needs a thread-safe memory
> > >> > > >> >> mapping mechanism.
> > >> > > >> >>
> > >> > > >> >> Hostmem registers a MemoryListener similar to how vhost collects and
> > >> > > >> >> pushes memory region information into the kernel. There is a
> > >> > > >> >> fine-grained lock on the regions list which is held during lookup and
> > >> > > >> >> when installing a new regions list.
> > >> > > >> >
> > >> > > >> > Can we export and reuse the vhost code for this?
> > >> > > >> > I think you will find this advantageous when you add migration
> > >> > > >> > support down the line.
> > >> > > >> > And if you find it necessary to use MemoryListener e.g. for performance
> > >> > > >> > reasons, then vhost will likely benefit too.
> > >> > > >>
> > >> > > >> It's technically possible and not hard to do but it prevents
> > >> > > >> integrating deeper with core QEMU as the memory API becomes
> > >> > > >> thread-safe.
> > >> > > >>
> > >> > > >> There are two ways to implement dirty logging:
> > >> > > >> 1. The vhost log approach which syncs dirty information periodically.
> > >> > > >> 2. A cheap thread-safe way to mark dirty outside the global mutex,
> > >> > > >> i.e. a thread-safe memory_region_set_dirty().
> > >> > > >
> > >> > > > You don't normally want to dirty the whole region,
> > >> > > > you want to do this to individual pages.
> > >> > > >
> > >> > > >> If we can get thread-safe guest memory load/store in QEMU then #2 is
> > >> > > >> included. We can switch to using hw/virtio.c instead of
> > >> > > >> hw/dataplane/vring.c, we get dirty logging for free, we can drop
> > >> > > >> hostmem.c completely, etc.
> > >> > > >>
> > >> > > >> Stefan
> > >> > > >
> > >> > > > So why not reuse existing code? If you drop it later it won't
> > >> > > > matter what you used ...
> > >> > >
> > >> > > Let's not lose sight of the forest for the trees here...
> > >> > >
> > >> > > This whole series is not reusing existing code. That's really the whole
> > >> > > point.
> > >> > >
> > >> > > The point is to take the code (duplication and all) and then do all of
> > >> > > the refactoring to use common code in the tree itself.
> > >> > >
> > >> > > If we want to put this in a hw/staging/ directory, that's fine by me
> > >> > > too.
> > >> > >
> > >> > > Regards,
> > >> > >
> > >> > > Anthony Liguori
> > >> >
> > >> > Yes I agree. I think lack of handling for cross regin descriptors
> > >> > bothers me a bit more.
> > >>
> > >> The two things you've mentioned both aren't handled by hw/virtio.c:
> > >>
> > >> 1. Issue: Indirect descriptors have no alignment restrictions and can
> > >> cross regions.
> > >>
> > >> hw/virtio.c uses vring_desc_flags() and other accessor functions,
> > >> which do lduw_phys() - there is no memory region boundary checking
> > >> here.
> > >
> > > Since addresses are aligned this one is fine I think.
> > >
> > >> 2. Issue: Virtio buffers can cross memory region boundaries.
> > >>
> > >> hw/virtio.c maps buffers 1:1 using virtqueue_map_sg() and exits if
> > >> mapping fails. It does not split buffers if they cross a memory
> > >> region.
> > >>
> > >> These are definitely ugly corner cases but hw/virtio.c is proof that
> > >> we're not hitting them in practice.
> > >>
> > >> Stefan
> > >
> > > Yes, this one seems ugly. Maybe add a TODO?
> > >
> > > OK let's assume we want to put it in staging/
> > > I worry about the virtio-blk changes being isolated.
> > > Can you put ifdef CONFIG_VIRTIO_BLK_DATA_PLANE around
> > > them all to avoid dependency on that header completely
> > > if configured out?
> >
> > Okay, I'll move the #ifdefs. I like the stubs in the header file
> > because it reduces the amount of #ifdefs, but this is easy to change.
> >
> > Stefan
>
> Okay.
> Another option (if you prefer stubs) is to add a stub for access to
> s->dataplane field, and surround just the field with ifdefs.
> As it is, this code:
> if (s->dataplane) {
> return;
> }
> can't be compiled out since compiler is not smart enough to
> figure out dataplane is never set.
It's okay, I have already implemented your previous suggestion in the v7
patches that I sent out on Friday and I'm okay with it.
Stefan
next prev parent reply other threads:[~2012-12-17 9:17 UTC|newest]
Thread overview: 32+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-12-10 13:09 [Qemu-devel] [PATCH v6 00/12] virtio: virtio-blk data plane Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 01/12] raw-posix: add raw_get_aio_fd() for virtio-blk-data-plane Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 02/12] configure: add CONFIG_VIRTIO_BLK_DATA_PLANE Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 03/12] dataplane: add host memory mapping code Stefan Hajnoczi
2012-12-11 14:13 ` Michael S. Tsirkin
2012-12-11 15:27 ` Stefan Hajnoczi
2012-12-11 15:42 ` Michael S. Tsirkin
2012-12-11 16:32 ` Anthony Liguori
2012-12-11 18:09 ` Michael S. Tsirkin
2012-12-12 15:34 ` Stefan Hajnoczi
2012-12-12 15:49 ` Michael S. Tsirkin
2012-12-14 11:45 ` Stefan Hajnoczi
2012-12-16 16:11 ` Michael S. Tsirkin
2012-12-17 9:09 ` Stefan Hajnoczi [this message]
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 04/12] dataplane: add virtqueue vring code Stefan Hajnoczi
2012-12-11 14:18 ` Michael S. Tsirkin
2012-12-12 15:55 ` Stefan Hajnoczi
2012-12-12 16:32 ` Michael S. Tsirkin
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 05/12] dataplane: add event loop Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 06/12] dataplane: add Linux AIO request queue Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 07/12] iov: add iov_discard_front/back() to remove data Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 08/12] test-iov: add iov_discard_front/back() testcases Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 09/12] iov: add qemu_iovec_concat_iov() Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 10/12] virtio-blk: restore VirtIOBlkConf->config_wce flag Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 11/12] dataplane: add virtio-blk data plane code Stefan Hajnoczi
2012-12-10 13:09 ` [Qemu-devel] [PATCH v6 12/12] virtio-blk: add x-data-plane=on|off performance feature Stefan Hajnoczi
2012-12-16 16:08 ` Michael S. Tsirkin
2012-12-18 14:57 ` Stefan Hajnoczi
2012-12-18 15:22 ` Michael S. Tsirkin
2012-12-20 4:04 ` Rusty Russell
2012-12-10 13:13 ` [Qemu-devel] [PATCH v6 00/12] virtio: virtio-blk data plane Stefan Hajnoczi
2012-12-11 8:53 ` Stefan Hajnoczi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20121217090912.GA14300@stefanha-thinkpad.redhat.com \
--to=stefanha@redhat.com \
--cc=aliguori@us.ibm.com \
--cc=asias@redhat.com \
--cc=blauwirbel@gmail.com \
--cc=khoa@us.ibm.com \
--cc=kwolf@redhat.com \
--cc=mst@redhat.com \
--cc=pbonzini@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=stefanha@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).