From: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
To: Peter Xu <peterx@redhat.com>
Cc: qemu-devel@nongnu.org, maxime.coquelin@redhat.com,
a.perevalov@samsung.com, mst@redhat.com,
marcandre.lureau@redhat.com, quintela@redhat.com,
lvivier@redhat.com, aarcange@redhat.com, felipe@nutanix.com
Subject: Re: [Qemu-devel] [RFC v2 03/32] migrate: Update ram_block_discard_range for shared
Date: Mon, 18 Sep 2017 13:18:10 +0100
Message-ID: <20170918121809.GH2581@work-vm>
In-Reply-To: <20170829053011.GS14174@pxdev.xzpeter.org>
* Peter Xu (peterx@redhat.com) wrote:
> On Thu, Aug 24, 2017 at 08:27:01PM +0100, Dr. David Alan Gilbert (git) wrote:
> > From: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
> >
> > The choice of call to discard a block is getting more complicated
> > for other cases. We use fallocate PUNCH_HOLE in any file cases;
> > it works for both hugepage and for tmpfs.
> > We use the DONTNEED for non-hugepage cases either where they're
> > anonymous or where they're private.
> >
> > Care should be taken when trying other backing files.
> >
> > Signed-off-by: Dr. David Alan Gilbert <dgilbert@redhat.com>
> > ---
> > exec.c | 35 ++++++++++++++++++++++++-----------
> > trace-events | 3 +++
> > 2 files changed, 27 insertions(+), 11 deletions(-)
> >
> > diff --git a/exec.c b/exec.c
> > index d20c34ca83..67df2909ce 100644
> > --- a/exec.c
> > +++ b/exec.c
> > @@ -3573,6 +3573,7 @@ int ram_block_discard_range(RAMBlock *rb, uint64_t start, size_t length)
> >      }
> >
> >      if ((start + length) <= rb->used_length) {
> > +        bool need_madvise, need_fallocate;
> >          uint8_t *host_endaddr = host_startaddr + length;
> >          if ((uintptr_t)host_endaddr & (rb->page_size - 1)) {
> >              error_report("ram_block_discard_range: Unaligned end address: %p",
> > @@ -3582,23 +3583,35 @@ int ram_block_discard_range(RAMBlock *rb, uint64_t start, size_t length)
> >
> >          errno = ENOTSUP; /* If we are missing MADVISE etc */
> >
> > -        if (rb->page_size == qemu_host_page_size) {
> > -#if defined(CONFIG_MADVISE)
> > -            /* Note: We need the madvise MADV_DONTNEED behaviour of definitely
> > -             * freeing the page.
> > -             */
> > -            ret = madvise(host_startaddr, length, MADV_DONTNEED);
> > -#endif
> > -        } else {
> > -            /* Huge page case - unfortunately it can't do DONTNEED, but
> > -             * it can do the equivalent by FALLOC_FL_PUNCH_HOLE in the
> > -             * huge page file.
> > +        /* The logic here is messy;
> > +         *    madvise DONTNEED fails for hugepages
> > +         *    fallocate works on hugepages and shmem
> > +         */
> > +        need_madvise = (rb->page_size == qemu_host_page_size);
> > +        need_fallocate = rb->fd != -1;
> > +        if (need_fallocate) {
> > +            /* For a file, this causes the area of the file to be zero'd
> > +             * if read, and for hugetlbfs also causes it to be unmapped
> > +             * so a userfault will trigger.
> >               */
> >  #ifdef CONFIG_FALLOCATE_PUNCH_HOLE
> >              ret = fallocate(rb->fd, FALLOC_FL_PUNCH_HOLE | FALLOC_FL_KEEP_SIZE,
> >                              start, length);
> >  #endif
> >          }
> > +        /* i.e. need madvise but skip it if the fallocate failed */
> > +        if (need_madvise && (!need_fallocate || (ret == 0))) {
>
> I'll slightly prefer:
>
>     trace_ram_block_discard_range();
>
>     if (need_fallocate) {
>         ret = fallocate();
>         if (ret) {
>             error_report();
>             goto err;
>         }
>     }
>
>     if (need_madvise) {
>         ret = madvise();
>         if (ret) {
>             error_report();
>             goto err;
>         }
>     }

OK, I've reworked it more like that.
(It's a little more complex because of the ifdefs.)

Dave

> But it is personal preference. For either way:
>
> Reviewed-by: Peter Xu <peterx@redhat.com>
>
> > +            /* For normal RAM this causes it to be unmapped,
> > +             * for shared memory it causes the local mapping to disappear
> > +             * and to fall back on the file contents (which we just
> > +             * fallocate'd away).
> > +             */
> > +#if defined(CONFIG_MADVISE)
> > +            ret = madvise(host_startaddr, length, MADV_DONTNEED);
> > +#endif
> > +        }
> > +        trace_ram_block_discard_range(rb->idstr, host_startaddr,
> > +                                      need_madvise, need_fallocate, ret);
> >          if (ret) {
> >              ret = -errno;
> >              error_report("ram_block_discard_range: Failed to discard range "
> > diff --git a/trace-events b/trace-events
> > index 1f50f56d9d..213ee34f89 100644
> > --- a/trace-events
> > +++ b/trace-events
> > @@ -55,6 +55,9 @@ dma_complete(void *dbs, int ret, void *cb) "dbs=%p ret=%d cb=%p"
> > dma_blk_cb(void *dbs, int ret) "dbs=%p ret=%d"
> > dma_map_wait(void *dbs) "dbs=%p"
> >
> > +# exec.c
> > +ram_block_discard_range(const char *rbname, void *hva, bool need_madvise, bool need_fallocate, int ret) "%s@%p: madvise: %d fallocate: %d ret: %d"
> > +
> > # memory.c
> > memory_region_ops_read(int cpu_index, void *mr, uint64_t addr, uint64_t value, unsigned size) "cpu %d mr %p addr 0x%"PRIx64" value 0x%"PRIx64" size %u"
> > memory_region_ops_write(int cpu_index, void *mr, uint64_t addr, uint64_t value, unsigned size) "cpu %d mr %p addr 0x%"PRIx64" value 0x%"PRIx64" size %u"
> > --
> > 2.13.5
> >
>
> --
> Peter Xu
--
Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK
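
For reference, a minimal stand-alone sketch of the sequential shape
suggested in the review: fallocate first when there is a backing fd,
then madvise when the block uses host-sized pages, with each step
returning early on failure. This is not the reworked patch, which is not
shown in this message. discard_range() and its parameters are
hypothetical names, and QEMU's CONFIG_FALLOCATE_PUNCH_HOLE and
CONFIG_MADVISE are approximated here by testing for the libc/kernel
macros directly. Linux only.

```c
#define _GNU_SOURCE
#include <assert.h>
#include <errno.h>
#include <fcntl.h>      /* fallocate(), FALLOC_FL_* (needs _GNU_SOURCE) */
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>
#include <unistd.h>

/*
 * Hypothetical helper, not QEMU's ram_block_discard_range().
 *   fd              backing file descriptor, or -1 for anonymous memory
 *   host            start of the range in this process's address space
 *   offset          offset of the range within the backing file
 *   block_page_size page size of the mapping (bigger than the host page
 *                   size for hugepages)
 * Returns 0 on success or a negative errno value.
 */
static int discard_range(int fd, void *host, off_t offset, size_t length,
                         size_t block_page_size)
{
    size_t host_page_size = (size_t)sysconf(_SC_PAGESIZE);
    /* madvise DONTNEED fails for hugepages, so only use it otherwise */
    bool need_madvise = (block_page_size == host_page_size);
    /* fallocate works for any file-backed case (hugetlbfs, shmem) */
    bool need_fallocate = (fd != -1);

    /* Page sizes are powers of two; the mask below relies on that. */
    assert(block_page_size != 0 &&
           (block_page_size & (block_page_size - 1)) == 0);

    if (((uintptr_t)host | length) & (block_page_size - 1)) {
        return -EINVAL;         /* unaligned start or length */
    }

    if (need_fallocate) {
#ifdef FALLOC_FL_PUNCH_HOLE
        if (fallocate(fd, FALLOC_FL_PUNCH_HOLE | FALLOC_FL_KEEP_SIZE,
                      offset, (off_t)length)) {
            int err = errno;
            fprintf(stderr, "discard_range: fallocate: %s\n", strerror(err));
            return -err;
        }
#else
        return -ENOTSUP;
#endif
    }

    if (need_madvise) {
#ifdef MADV_DONTNEED
        if (madvise(host, length, MADV_DONTNEED)) {
            int err = errno;
            fprintf(stderr, "discard_range: madvise: %s\n", strerror(err));
            return -err;
        }
#else
        return -ENOTSUP;
#endif
    }

    return 0;
}
```

Returning early from each step removes the need for the combined
"need_madvise && (!need_fallocate || ret == 0)" condition, at the cost
of an #else branch per step to report ENOTSUP when the call is not
available at build time.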