From: Avi Kivity <avi@redhat.com>
To: vatsa@linux.vnet.ibm.com
Cc: Anthony Liguori <aliguori@linux.vnet.ibm.com>,
qemu-devel@nongnu.org, kvm@vger.kernel.org,
Chris Wright <chrisw@sous-sol.org>
Subject: Re: [PATCH] qemu-kvm: response to SIGUSR1 to start/stop a VCPU (v2)
Date: Wed, 01 Dec 2010 14:56:44 +0200 [thread overview]
Message-ID: <4CF6460C.5070604@redhat.com> (raw)
In-Reply-To: <20101201123742.GA3780@linux.vnet.ibm.com>
On 12/01/2010 02:37 PM, Srivatsa Vaddagiri wrote:
> On Wed, Nov 24, 2010 at 04:23:15PM +0200, Avi Kivity wrote:
> > >>I'm more concerned about lock holder preemption, and interaction
> > >>of this mechanism with any kernel solution for LHP.
> > >
> > >Can you suggest some scenarios and I'll create some test cases?
> > >I'm trying figure out the best way to evaluate this.
> >
> > Booting 64-vcpu Windows on a 64-cpu host with PLE but without
> > directed yield takes longer than forever because PLE detects
> > contention within the guest, which under our current PLE
> > implementation (usleep(100)) converts guest contention into delays.
>
> Is there any way of optimizing PLE at runtime in such special case? For ex:
> either turn off PLE feature or gradually increase (spin-)timeout when PLE should
> kick in ..
It's not a special case at all. Both host contention and guest
contention are perfectly normal, and can occur simultaneously.
> > (a directed yield implementation would find that all vcpus are
> > runnable, yielding optimal results under this test case).
>
> I would think a plain yield() (rather than usleep/directed yield) would suffice
> here (yield would realize that there is nobody else to yield to and continue
> running the same vcpu thread).
Currently yield() is a no-op on Linux.
> As regards to any concern of leaking cpu
> bandwidth because of a plain yield, I think it can be fixed by a more
> simpler modification to yield that allows a thread to reclaim whatever timeslice
> it gave up previously [1].
If some other thread used that timeslice, don't we have an accounting
problem?
> Regarding directed yield, do we have any reliable mechanism to find target of
> directed yield in this (unmodified/non-paravirtualized guest) case? IOW how do
> we determine the vcpu thread to which cycles need to be yielded upon contention?
My idea was to yield to a random starved vcpu of the same guest. There
are several cases to consider:
- we hit the right vcpu; lock is released, party.
- we hit some vcpu that is doing unrelated work. yielding thread
doesn't make progress, but we're not wasting cpu time.
- we hit another waiter for the same lock. it will also PLE exit and
trigger a directed yield. this increases the cost of directed yield by
a factor of count_of_runnable_but_not_running_vcpus, which could be
large, but not disasterously so (i.e. don't run a 64-vcpu guest on a
uniprocessor host with this)
> > So if you were to test something similar running with a 20% vcpu
> > cap, I'm sure you'd run into similar issues. It may show with fewer
> > vcpus (I've only tested 64).
> >
> > >Are you assuming the existence of a directed yield and the
> > >specific concern is what happens when a directed yield happens
> > >after a PLE and the target of the yield has been capped?
> >
> > Yes. My concern is that we will see the same kind of problems
> > directed yield was designed to fix, but without allowing directed
> > yield to fix them. Directed yield was designed to fix lock holder
> > preemption under contention,
>
> For modified guests, something like [2] seems to be the best approach to fix
> lock-holder preemption (LHP) problem, which does not require any sort of
> directed yield support. Essentially upon contention, a vcpu registers its lock
> of interest and goes to sleep (via hypercall) waiting for lock-owner to wake it
> up (again via another hypercall).
Right.
> For unmodified guests, IMHO a plain yield (or slightly enhanced yield [1])
> should fix the LHP problem.
A plain yield (ignoring no-opiness on Linux) will penalize the running
guest wrt other guests. We need to maintain fairness.
> Fyi, Xen folks also seem to be avoiding a directed yield for some of the same
> reasons [3].
I think that fails for unmodified guests, where you don't know when the
lock is released and so you don't have a wake_up notification. You lost
a large timeslice and you can't gain it back, whereas with pv the wakeup
means you only lose as much time as the lock was held.
> Given this line of thinking, hard-limiting guests (either in user-space or
> kernel-space, latter being what I prefer) should not have adverse interactions
> with LHP-related solutions.
If you hard-limit a vcpu that holds a lock, any waiting vcpus are also
halted. With directed yield you can let the lock holder make some
progress at the expense of another vcpu. A regular yield() will simply
stall the waiter.
--
error compiling committee.c: too many arguments to function
next prev parent reply other threads:[~2010-12-01 12:57 UTC|newest]
Thread overview: 40+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-11-23 16:49 [PATCH] qemu-kvm: response to SIGUSR1 to start/stop a VCPU (v2) Anthony Liguori
2010-11-23 19:35 ` [Qemu-devel] " Blue Swirl
2010-11-23 21:46 ` Anthony Liguori
2010-11-23 23:43 ` Paolo Bonzini
2010-11-24 1:15 ` Anthony Liguori
2010-11-24 2:08 ` Paolo Bonzini
2010-11-24 8:18 ` Avi Kivity
2010-11-24 13:58 ` Anthony Liguori
2010-11-24 14:23 ` Avi Kivity
2010-12-01 12:37 ` Srivatsa Vaddagiri
2010-12-01 12:56 ` Avi Kivity [this message]
2010-12-01 16:12 ` Srivatsa Vaddagiri
2010-12-01 16:25 ` Peter Zijlstra
2010-12-01 17:17 ` Chris Wright
2010-12-01 17:22 ` Peter Zijlstra
2010-12-01 17:26 ` Rik van Riel
2010-12-01 19:07 ` Peter Zijlstra
2010-12-01 19:24 ` Rik van Riel
2010-12-01 19:35 ` Peter Zijlstra
2010-12-01 19:42 ` Rik van Riel
2010-12-01 19:47 ` Peter Zijlstra
2010-12-02 9:07 ` Avi Kivity
2010-12-01 17:46 ` Chris Wright
2010-12-01 17:29 ` Srivatsa Vaddagiri
2010-12-01 17:45 ` Peter Zijlstra
2010-12-01 18:00 ` Srivatsa Vaddagiri
2010-12-01 19:09 ` Peter Zijlstra
2010-12-02 9:17 ` Avi Kivity
2010-12-02 11:47 ` Srivatsa Vaddagiri
2010-12-02 12:22 ` Srivatsa Vaddagiri
2010-12-02 12:41 ` Avi Kivity
2010-12-02 13:13 ` Srivatsa Vaddagiri
2010-12-02 13:49 ` Avi Kivity
2010-12-02 15:27 ` Srivatsa Vaddagiri
2010-12-02 15:28 ` Srivatsa Vaddagiri
2010-12-02 15:33 ` Avi Kivity
2010-12-02 15:44 ` Srivatsa Vaddagiri
2010-12-02 12:19 ` Srivatsa Vaddagiri
2010-12-02 12:42 ` Avi Kivity
2010-12-02 9:14 ` Avi Kivity
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4CF6460C.5070604@redhat.com \
--to=avi@redhat.com \
--cc=aliguori@linux.vnet.ibm.com \
--cc=chrisw@sous-sol.org \
--cc=kvm@vger.kernel.org \
--cc=qemu-devel@nongnu.org \
--cc=vatsa@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox