From: David Gibson <david@gibson.dropbear.id.au>
To: "Michael S. Tsirkin" <mst@redhat.com>
Cc: qemu-devel@nongnu.org, qemu-ppc@nongnu.org,
David Hildenbrand <david@redhat.com>
Subject: Re: [Qemu-devel] [PATCH 1/5] virtio-balloon: Remove unnecessary MADV_WILLNEED on deflate
Date: Wed, 6 Mar 2019 10:35:12 +1100 [thread overview]
Message-ID: <20190305233512.GB19715@umbus.fritz.box> (raw)
In-Reply-To: <20190305094050-mutt-send-email-mst@kernel.org>
[-- Attachment #1: Type: text/plain, Size: 5314 bytes --]
On Tue, Mar 05, 2019 at 09:41:34AM -0500, Michael S. Tsirkin wrote:
> On Tue, Mar 05, 2019 at 04:03:00PM +1100, David Gibson wrote:
> > On Mon, Mar 04, 2019 at 09:29:24PM -0500, Michael S. Tsirkin wrote:
> > > On Tue, Mar 05, 2019 at 11:52:08AM +1100, David Gibson wrote:
> > > > On Thu, Feb 28, 2019 at 08:36:58AM -0500, Michael S. Tsirkin wrote:
> > > > > On Thu, Feb 14, 2019 at 03:39:12PM +1100, David Gibson wrote:
> > > > > > When the balloon is inflated, we discard memory place in it using madvise()
> > > > > > with MADV_DONTNEED. And when we deflate it we use MADV_WILLNEED, which
> > > > > > sounds like it makes sense but is actually unnecessary.
> > > > > >
> > > > > > The misleadingly named MADV_DONTNEED just discards the memory in question,
> > > > > > it doesn't set any persistent state on it in-kernel; all that's necessary
> > > > > > to bring the memory back is to touch it. MADV_WILLNEED in contrast
> > > > > > specifically says that the memory will be used soon and faults it in.
> > > > > >
> > > > > > This patch simplify's the balloon operation by dropping the madvise()
> > > > > > on deflate. This might have an impact on performance - it will move a
> > > > > > delay at deflate time until that memory is actually touched, which
> > > > > > might be more latency sensitive. However:
> > > > > >
> > > > > > * Memory that's being given back to the guest by deflating the
> > > > > > balloon *might* be used soon, but it equally could just sit around
> > > > > > in the guest's pools until needed (or even be faulted out again if
> > > > > > the host is under memory pressure).
> > > > > >
> > > > > > * Usually, the timescale over which you'll be adjusting the balloon
> > > > > > is long enough that a few extra faults after deflation aren't
> > > > > > going to make a difference.
> > > > > >
> > > > > > Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
> > > > > > Reviewed-by: David Hildenbrand <david@redhat.com>
> > > > > > Reviewed-by: Michael S. Tsirkin <mst@redhat.com>
> > > > >
> > > > > I'm having second thoughts about this. It might affect performance but
> > > > > probably won't but we have no idea. Might cause latency jitter after
> > > > > deflate where it previously didn't happen. This kind of patch should
> > > > > really be accompanied by benchmarking results, not philosophy.
> > > >
> > > > I guess I see your point, much as it's annoying to spend time
> > > > benchmarking a device that's basically broken by design.
> > >
> > > Because of 4K page thing?
> >
> > For one thing. I believe David H has bunch of other reasons.
> >
> > > It's an annoying bug for sure. There were
> > > patches to add a feature bit to just switch to plan s/g format, but they
> > > were abandoned. You are welcome to revive them though.
> > > Additionally or alternatively, we can easily add a field specifying
> > > page size.
> >
> > We could, but I'm pretty disinclined to work on this when virtio-mem
> > is a better solution in nearly every way.
>
> Then one way would be to just let balloon be. Make it behave same as
> always and don't make changes to it :)
I'd love to, but it is in real world use, so I think we do need to fix
serious bugs in it - at least if they can be fixed on one side,
without needing to roll out both qemu and guest changes (which adding
page size negotiation would require).
>
> > > > That said.. I don't really know how I'd go about benchmarking it. Any
> > > > guesses at a suitable workload which would be most likely to show a
> > > > performance degradation here?
> > >
> > > Here's one idea - I tried to come up with a worst case scenario here.
> > > Basically based on idea by Alex Duyck. All credits are his, all bugs are
> > > mine:
> >
> > Ok. I'll try to find time to implement this and test it.
> >
> > > Setup:
> > > Memory-15837 MB
> > > Guest Memory Size-5 GB
> > > Swap-Disabled
> > > Test Program-Simple program which allocates 4GB memory via malloc, touches it via memset and exits.
> > > Use case-Number of guests that can be launched completely including the successful execution of the test program.
> > > Procedure:
> > > Setup:
> > > A first guest is launched and once its console is up,
> > > test allocation program is executed with 4 GB memory request (Due to
> > > this the guest occupies almost 4-5 GB of memory in the host)
> > > Afterwards balloon is inflated by 4Gbyte in the guest.
> > > We continue launching the guests until a guest gets
> > > killed due to low memory condition in the host.
> > >
> > >
> > > Now repeatedly, in each guest in turn, balloon is deflated and
> > > test allocation program is executed with 4 GB memory request (Due to
> > > this the guest occupies almost 4-5 GB of memory in the host)
> > > After program finishes balloon is inflated by 4GB again.
> > >
> > > Then we switch to another guest.
> > >
> > > Time how many cycles of this we can do.
> > >
> > >
> > > Hope this helps.
> > >
> > >
> > >
> >
>
>
--
David Gibson | I'll have my music baroque, and my code
david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_
| _way_ _around_!
http://www.ozlabs.org/~dgibson
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 833 bytes --]
next prev parent reply other threads:[~2019-03-05 23:51 UTC|newest]
Thread overview: 38+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-02-14 4:39 [Qemu-devel] [PATCH 0/5] Improve balloon handling of pagesizes other than 4kiB David Gibson
2019-02-14 4:39 ` [Qemu-devel] [PATCH 1/5] virtio-balloon: Remove unnecessary MADV_WILLNEED on deflate David Gibson
2019-02-28 13:36 ` Michael S. Tsirkin
2019-03-05 0:52 ` David Gibson
2019-03-05 2:29 ` Michael S. Tsirkin
2019-03-05 5:03 ` David Gibson
2019-03-05 14:41 ` Michael S. Tsirkin
2019-03-05 23:35 ` David Gibson [this message]
2019-03-06 0:14 ` Michael S. Tsirkin
2019-03-06 0:58 ` David Gibson
2019-02-14 4:39 ` [Qemu-devel] [PATCH 2/5] virtio-balloon: Corrections to address verification David Gibson
2019-02-22 9:08 ` [Qemu-devel] [Qemu-ppc] " Greg Kurz
2019-02-24 23:37 ` David Gibson
2019-02-25 9:26 ` Greg Kurz
2019-02-26 23:20 ` David Gibson
2019-02-28 9:09 ` Greg Kurz
2019-02-14 4:39 ` [Qemu-devel] [PATCH 3/5] virtio-balloon: Rework ballon_page() interface David Gibson
2019-02-14 4:39 ` [Qemu-devel] [PATCH 4/5] virtio-balloon: Use ram_block_discard_range() instead of raw madvise() David Gibson
2019-02-14 4:39 ` [Qemu-devel] [PATCH 5/5] virtio-balloon: Safely handle BALLOON_PAGE_SIZE < host page size David Gibson
2019-03-05 16:06 ` [Qemu-devel] [PULL 23/26] " Peter Maydell
2019-03-05 23:33 ` David Gibson
2019-02-28 13:39 ` [Qemu-devel] [PATCH 0/5] Improve balloon handling of pagesizes other than 4kiB Michael S. Tsirkin
2019-03-05 0:53 ` David Gibson
2019-03-05 2:13 ` Michael S. Tsirkin
2019-03-05 4:55 ` David Gibson
-- strict thread matches above, loose matches on Subject: below --
2019-02-22 2:40 [Qemu-devel] [PULL 00/26] pci, pc, virtio: fixes, cleanups, tests Michael S. Tsirkin
2019-02-22 15:47 ` Peter Maydell
2019-02-22 15:53 ` Michael S. Tsirkin
2019-02-22 16:34 ` Peter Maydell
2019-02-24 0:34 ` Michael S. Tsirkin
2019-02-24 10:21 ` Peter Maydell
2019-02-24 16:41 ` Michael S. Tsirkin
2019-02-25 16:23 ` Philippe Mathieu-Daudé
2019-02-25 17:27 ` Peter Maydell
2019-02-24 22:49 ` David Gibson
2019-02-25 15:19 ` [Qemu-devel] [PULL v2 resend " Michael S. Tsirkin
2019-03-04 10:55 ` Paolo Bonzini
2019-03-04 13:38 ` Peter Maydell
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20190305233512.GB19715@umbus.fritz.box \
--to=david@gibson.dropbear.id.au \
--cc=david@redhat.com \
--cc=mst@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=qemu-ppc@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).