From: Avi Kivity <avi@redhat.com>
To: vatsa@linux.vnet.ibm.com
Cc: Anthony Liguori <aliguori@linux.vnet.ibm.com>,
qemu-devel@nongnu.org, kvm@vger.kernel.org,
Chris Wright <chrisw@sous-sol.org>
Subject: Re: [PATCH] qemu-kvm: response to SIGUSR1 to start/stop a VCPU (v2)
Date: Wed, 01 Dec 2010 14:56:44 +0200 [thread overview]
Message-ID: <4CF6460C.5070604@redhat.com> (raw)
In-Reply-To: <20101201123742.GA3780@linux.vnet.ibm.com>
On 12/01/2010 02:37 PM, Srivatsa Vaddagiri wrote:
> On Wed, Nov 24, 2010 at 04:23:15PM +0200, Avi Kivity wrote:
> > >>I'm more concerned about lock holder preemption, and interaction
> > >>of this mechanism with any kernel solution for LHP.
> > >
> > >Can you suggest some scenarios and I'll create some test cases?
> > >I'm trying figure out the best way to evaluate this.
> >
> > Booting 64-vcpu Windows on a 64-cpu host with PLE but without
> > directed yield takes longer than forever because PLE detects
> > contention within the guest, which under our current PLE
> > implementation (usleep(100)) converts guest contention into delays.
>
> Is there any way of optimizing PLE at runtime in such special case? For ex:
> either turn off PLE feature or gradually increase (spin-)timeout when PLE should
> kick in ..
It's not a special case at all. Both host contention and guest
contention are perfectly normal, and can occur simultaneously.
> > (a directed yield implementation would find that all vcpus are
> > runnable, yielding optimal results under this test case).
>
> I would think a plain yield() (rather than usleep/directed yield) would suffice
> here (yield would realize that there is nobody else to yield to and continue
> running the same vcpu thread).
Currently yield() is a no-op on Linux.
> As regards to any concern of leaking cpu
> bandwidth because of a plain yield, I think it can be fixed by a more
> simpler modification to yield that allows a thread to reclaim whatever timeslice
> it gave up previously [1].
If some other thread used that timeslice, don't we have an accounting
problem?
> Regarding directed yield, do we have any reliable mechanism to find target of
> directed yield in this (unmodified/non-paravirtualized guest) case? IOW how do
> we determine the vcpu thread to which cycles need to be yielded upon contention?
My idea was to yield to a random starved vcpu of the same guest. There
are several cases to consider:
- we hit the right vcpu; lock is released, party.
- we hit some vcpu that is doing unrelated work. yielding thread
doesn't make progress, but we're not wasting cpu time.
- we hit another waiter for the same lock. it will also PLE exit and
trigger a directed yield. this increases the cost of directed yield by
a factor of count_of_runnable_but_not_running_vcpus, which could be
large, but not disasterously so (i.e. don't run a 64-vcpu guest on a
uniprocessor host with this)
> > So if you were to test something similar running with a 20% vcpu
> > cap, I'm sure you'd run into similar issues. It may show with fewer
> > vcpus (I've only tested 64).
> >
> > >Are you assuming the existence of a directed yield and the
> > >specific concern is what happens when a directed yield happens
> > >after a PLE and the target of the yield has been capped?
> >
> > Yes. My concern is that we will see the same kind of problems
> > directed yield was designed to fix, but without allowing directed
> > yield to fix them. Directed yield was designed to fix lock holder
> > preemption under contention,
>
> For modified guests, something like [2] seems to be the best approach to fix
> lock-holder preemption (LHP) problem, which does not require any sort of
> directed yield support. Essentially upon contention, a vcpu registers its lock
> of interest and goes to sleep (via hypercall) waiting for lock-owner to wake it
> up (again via another hypercall).
Right.
> For unmodified guests, IMHO a plain yield (or slightly enhanced yield [1])
> should fix the LHP problem.
A plain yield (ignoring no-opiness on Linux) will penalize the running
guest wrt other guests. We need to maintain fairness.
> Fyi, Xen folks also seem to be avoiding a directed yield for some of the same
> reasons [3].
I think that fails for unmodified guests, where you don't know when the
lock is released and so you don't have a wake_up notification. You lost
a large timeslice and you can't gain it back, whereas with pv the wakeup
means you only lose as much time as the lock was held.
> Given this line of thinking, hard-limiting guests (either in user-space or
> kernel-space, latter being what I prefer) should not have adverse interactions
> with LHP-related solutions.
If you hard-limit a vcpu that holds a lock, any waiting vcpus are also
halted. With directed yield you can let the lock holder make some
progress at the expense of another vcpu. A regular yield() will simply
stall the waiter.
--
error compiling committee.c: too many arguments to function
next prev parent reply other threads:[~2010-12-01 12:57 UTC|newest]
Thread overview: 40+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-11-23 16:49 [PATCH] qemu-kvm: response to SIGUSR1 to start/stop a VCPU (v2) Anthony Liguori
2010-11-23 19:35 ` [Qemu-devel] " Blue Swirl
2010-11-23 21:46 ` Anthony Liguori
2010-11-23 23:43 ` Paolo Bonzini
2010-11-24 1:15 ` Anthony Liguori
2010-11-24 2:08 ` Paolo Bonzini
2010-11-24 8:18 ` Avi Kivity
2010-11-24 13:58 ` Anthony Liguori
2010-11-24 14:23 ` Avi Kivity
2010-12-01 12:37 ` Srivatsa Vaddagiri
2010-12-01 12:56 ` Avi Kivity [this message]
2010-12-01 16:12 ` Srivatsa Vaddagiri
2010-12-01 16:25 ` Peter Zijlstra
2010-12-01 17:17 ` Chris Wright
2010-12-01 17:22 ` Peter Zijlstra
2010-12-01 17:26 ` Rik van Riel
2010-12-01 19:07 ` Peter Zijlstra
2010-12-01 19:24 ` Rik van Riel
2010-12-01 19:35 ` Peter Zijlstra
2010-12-01 19:42 ` Rik van Riel
2010-12-01 19:47 ` Peter Zijlstra
2010-12-02 9:07 ` Avi Kivity
2010-12-01 17:46 ` Chris Wright
2010-12-01 17:29 ` Srivatsa Vaddagiri
2010-12-01 17:45 ` Peter Zijlstra
2010-12-01 18:00 ` Srivatsa Vaddagiri
2010-12-01 19:09 ` Peter Zijlstra
2010-12-02 9:17 ` Avi Kivity
2010-12-02 11:47 ` Srivatsa Vaddagiri
2010-12-02 12:22 ` Srivatsa Vaddagiri
2010-12-02 12:41 ` Avi Kivity
2010-12-02 13:13 ` Srivatsa Vaddagiri
2010-12-02 13:49 ` Avi Kivity
2010-12-02 15:27 ` Srivatsa Vaddagiri
2010-12-02 15:28 ` Srivatsa Vaddagiri
2010-12-02 15:33 ` Avi Kivity
2010-12-02 15:44 ` Srivatsa Vaddagiri
2010-12-02 12:19 ` Srivatsa Vaddagiri
2010-12-02 12:42 ` Avi Kivity
2010-12-02 9:14 ` Avi Kivity
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4CF6460C.5070604@redhat.com \
--to=avi@redhat.com \
--cc=aliguori@linux.vnet.ibm.com \
--cc=chrisw@sous-sol.org \
--cc=kvm@vger.kernel.org \
--cc=qemu-devel@nongnu.org \
--cc=vatsa@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.