From: Mike Galbraith <efault@gmx.de>
To: Avi Kivity <avi@redhat.com>
Cc: Rik van Riel <riel@redhat.com>,
kvm@vger.kernel.org, linux-kernel@vger.kernel.org,
Srivatsa Vaddagiri <vatsa@linux.vnet.ibm.com>,
Peter Zijlstra <a.p.zijlstra@chello.nl>,
Chris Wright <chrisw@sous-sol.org>
Subject: Re: [RFC -v2 PATCH 2/3] sched: add yield_to function
Date: Mon, 20 Dec 2010 11:33:13 +0100 [thread overview]
Message-ID: <1292841193.11946.36.camel@marge.simson.net> (raw)
In-Reply-To: <4D0F25E8.80305@redhat.com>
On Mon, 2010-12-20 at 11:46 +0200, Avi Kivity wrote:
> On 12/20/2010 11:30 AM, Mike Galbraith wrote:
> > > >
> > > > Because preempting a perfect stranger is not courteous, all tasks have
> > > > to play nice.
> > >
> > > I don't want to preempt anybody, simply make the task run before me.
> >
> > I thought you wanted to get the target to the cpu asap? You just can't
> > have he runs before me cross cpu.
>
> You're right, of course. I'm fine with running in parallel. I'm fine
> with him running before or instead of me. I'm not fine with running
> while the other guy is waiting.
Goody, maybe we're headed down a productive path then.
> > > Further, this is a kernel internal API, so no need for these types of
> > > restrictions. If we expose it to userspace, sure.
> >
> > Doesn't matter whether it's kernel or not afaikt. If virtualization has
> > to coexist peacefully with other loads, it can't just say "my hints are
> > the only ones that count", and thus shred other loads throughput.
>
> What does that have to do with being in the same group or not? I want
> to maintain fairness (needed for pure virt workloads, one guest cannot
> dominate another), but I don't see how being in the same thread group is
> relevant.
My thought is that you can shred your own throughput, but not some other
concurrent load. I'll have to let that thought stew a bit though.
> Again, I don't want more than one entitlement. I want to move part of
> my entitlement to another task.
Folks can keep trying that, but IMO it's too broken to live.
> > > > > > use cfs_rq->next to pass the scheduler a HINT of what you would LIKE to
> > > > > > happen.
> > > > >
> > > > > Hint is fine, so long as the scheduler seriously considers it.
> > > >
> > > > It will take the hint if the target the target hasn't had too much cpu.
> > >
> > > Since I'm running and the target isn't, it's clear the scheduler thinks
> > > the target had more cpu than I did [73]. That's why I want to donate
> > > cpu time.
> >
> > That's not necessarily true, in fact, it's very often false. Last/next
> > buddy will allow a task to run ahead of leftmost so we don't always
> > blindly select leftmost and shred cache.
>
> Ok.
>
> > > >
> > > > What would you suggest? There is no global execution timeline, so if
> > > > you want to definitely run after this task, you're stuck with moving to
> > > > his timezone or moving him to yours. Well, you could sleep a while, but
> > > > we know how productive sleeping is.
> > >
> > > I don't know. The whole idea of donating runtime was predicated on CFS
> > > being completely fair. Now I find that (a) it isn't (b) donating
> > > runtimes between tasks on different cpus is problematic.
> >
> > True and true. However, would you _want_ the scheduler to hold runnable
> > tasks hostage, and thus let CPU go to waste in the name of perfect
> > fairness? Perfect is the enemy of good applies to that idea imho.
>
> Sorry, I don't see how it follows.
Let's just forget theoretical views, and concentrate on a forward path.
> > > Moving tasks between cpus is expensive and sometimes prohibited by
> > > pinning. I'd like to avoid it if possible, but it's better than nothing.
> >
> > Expensive in many ways, so let's try to not do that.
> >
> > So why do you need this other task to run before you do, even cross cpu?
> > If he's a lock holder, getting him to the cpu will give him a chance to
> > drop, no? Isn't that what you want to get done? Drop that lock so you
> > or someone else can get something other than spinning done?
>
> Correct. I don't want the other task to run before me, I just don't
> want to run before it.
OK, so what I gather is that if you can preempt another of your own
threads to get the target to cpu, that would be a good thing whether
he's on the same cpu as yield_to() caller or not. If the target is
sharing a cpu with you, that's even better. Correct?
Would a kick/hint option be useful?
-Mike
next prev parent reply other threads:[~2010-12-20 10:33 UTC|newest]
Thread overview: 44+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-12-14 3:44 [RFC -v2 PATCH 0/3] directed yield for Pause Loop Exiting Rik van Riel
2010-12-14 3:45 ` [RFC -v2 PATCH 1/3] kvm: keep track of which task is running a KVM vcpu Rik van Riel
2010-12-14 3:46 ` [RFC -v2 PATCH 2/3] sched: add yield_to function Rik van Riel
2010-12-14 6:08 ` Mike Galbraith
2010-12-14 10:24 ` Srivatsa Vaddagiri
2010-12-14 11:03 ` Mike Galbraith
2010-12-14 11:26 ` Srivatsa Vaddagiri
2010-12-14 12:47 ` Mike Galbraith
2010-12-16 19:49 ` Rik van Riel
2010-12-17 6:56 ` Mike Galbraith
2010-12-17 7:15 ` Mike Galbraith
2010-12-18 17:08 ` Avi Kivity
2010-12-18 19:13 ` Mike Galbraith
2010-12-19 6:08 ` Avi Kivity
2010-12-20 15:40 ` Rik van Riel
2010-12-20 16:04 ` Mike Galbraith
2010-12-28 5:54 ` Mike Galbraith
2010-12-28 6:08 ` Gene Heskett
2010-12-28 6:16 ` Mike Galbraith
2010-12-28 16:18 ` Gene Heskett
2010-12-28 22:34 ` Rik van Riel
2010-12-17 15:09 ` Avi Kivity
2010-12-17 19:51 ` Mike Galbraith
2010-12-18 17:02 ` Avi Kivity
2010-12-18 19:06 ` Mike Galbraith
2010-12-19 6:21 ` Avi Kivity
2010-12-19 10:05 ` Mike Galbraith
2010-12-19 9:19 ` Avi Kivity
2010-12-19 11:18 ` Mike Galbraith
2010-12-20 8:39 ` Mike Galbraith
2010-12-20 8:45 ` Avi Kivity
2010-12-20 8:55 ` Mike Galbraith
2010-12-20 9:03 ` Avi Kivity
2010-12-20 9:30 ` Mike Galbraith
2010-12-20 9:46 ` Avi Kivity
2010-12-20 10:33 ` Mike Galbraith [this message]
2010-12-20 10:39 ` Avi Kivity
2010-12-20 10:46 ` Mike Galbraith
2010-12-20 10:49 ` Avi Kivity
2010-12-20 10:50 ` Mike Galbraith
2010-12-20 11:06 ` Avi Kivity
2010-12-14 12:22 ` Peter Zijlstra
2010-12-18 14:50 ` Rik van Riel
2010-12-14 3:48 ` [RFC -v2 PATCH 3/3] kvm: use yield_to instead of sleep in kvm_vcpu_on_spin Rik van Riel
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1292841193.11946.36.camel@marge.simson.net \
--to=efault@gmx.de \
--cc=a.p.zijlstra@chello.nl \
--cc=avi@redhat.com \
--cc=chrisw@sous-sol.org \
--cc=kvm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=riel@redhat.com \
--cc=vatsa@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox