From: Peter Zijlstra <peterz@infradead.org>
To: Patrick Bellasi <patrick.bellasi@arm.com>
Cc: Vincent Guittot <vincent.guittot@linaro.org>,
Ingo Molnar <mingo@kernel.org>,
linux-kernel <linux-kernel@vger.kernel.org>,
"Rafael J. Wysocki" <rjw@rjwysocki.net>,
Dietmar Eggemann <dietmar.eggemann@arm.com>,
Morten Rasmussen <Morten.Rasmussen@arm.com>,
Paul Turner <pjt@google.com>, Ben Segall <bsegall@google.com>,
Thara Gopinath <thara.gopinath@linaro.org>,
pkondeti@codeaurora.org, Quentin Perret <quentin.perret@arm.com>,
Srinivas Pandruvada <srinivas.pandruvada@linux.intel.com>
Subject: Re: [PATCH v7 2/2] sched/fair: update scale invariance of PELT
Date: Thu, 24 Jan 2019 10:07:55 +0100 [thread overview]
Message-ID: <20190124090755.GC13536@hirez.programming.kicks-ass.net> (raw)
In-Reply-To: <20181129151316.GG23094@e110439-lin>
Sorry; trying to get back to this and re-reading the old conversations.
On Thu, Nov 29, 2018 at 03:13:16PM +0000, Patrick Bellasi wrote:
> On 29-Nov 13:53, Peter Zijlstra wrote:
> > On Wed, Nov 28, 2018 at 11:53:36AM +0000, Patrick Bellasi wrote:
> >
> > > diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
> > > index ac855b2f4774..93e0cf5d8a76 100644
> > > --- a/kernel/sched/fair.c
> > > +++ b/kernel/sched/fair.c
> > > @@ -3661,6 +3661,10 @@ util_est_dequeue(struct cfs_rq *cfs_rq, struct task_struct *p, bool task_sleep)
> > > if (!task_sleep)
> > > return;
> > >
> > > + /* Skip samples which do not represent an actual utilization */
> > > + if (unlikely(task_util(p) > capacity_of(task_cpu(p))))
> > > + return;
> > > +
> > > /*
> > > * If the PELT values haven't changed since enqueue time,
> > > * skip the util_est update.
> >
> > Would you not want something like:
> >
> > min(task_util(p), capacity_of(task_cpu(p)))
> >
> > And is this the only place where we need this?
>
> Mmm... even this could be an over-estimation:
>
> I've just posted an example in my last reply to Vincent, end of:
>
> Message-ID: <20181129150020.GF23094@e110439-lin>
> https://lore.kernel.org/lkml/20181129150020.GF23094@e110439-lin/
In particular this bit:
| Seems we agree that, when there is no idle time:
| - the two 15% tasks will be overestimated
| - their utilization will reach 50% after a while
Right?
> > OTOH, if the task is always running, it will be always running
> > irrespective of where it runs.
>
> That's not what I'm concerned about. I'm concerned about small tasks
> which are running on limited capacity (e.g. due to thermal capping)
> without idle time. In this case, the new "utilization" signal could
> overestimate the real task needs.
>
> > Not storing these samples seems weird though; this is the exact
> > condition you want to record -- the task is very active, if we skip
> > these, we'll come back at a low frequency on the next wakeup.
>
> When there is not idle time, we don't know if the reported
> utilization, above the cpu capacity, is due to the task being bigger...
> or just the new utilization signal converging towards:
>
> 100% / RUNNABLE_TASKS_COUNT
So if I'm not mistaken we then have 3 cases:
1) runnable == util <= capacity
no contention, idle
2) runnable == util > capacity
no contention, no idle
3) runnable > util
contention, no idle
For 1) we can use: 'util'
For 2) we can use: 'capacity'
For 3) we can use: 'util * capacity >> 10'
(note that 2 is a special case of 3 when u=1)
This should work right?
Now, instead of doing complicated things like that, you instead figure
that when there's no idle there's also no dequeue happening and we can
simply short-cut by skipping the entire thing, forgetting everything
about 2,3.
Did I get that right?
next prev parent reply other threads:[~2019-01-24 9:08 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-11-20 10:55 [PATCH v7 0/2] sched/fair: update scale invariance of PELT Vincent Guittot
2018-11-20 10:55 ` [PATCH v7 1/2] sched/fair: move rq_of helper function Vincent Guittot
2018-11-20 10:55 ` [PATCH v7 2/2] sched/fair: update scale invariance of PELT Vincent Guittot
2018-11-28 9:54 ` Vincent Guittot
2018-11-28 10:02 ` Peter Zijlstra
2018-11-28 11:53 ` Patrick Bellasi
2018-11-28 13:33 ` Vincent Guittot
2018-11-28 13:35 ` Vincent Guittot
2018-11-28 14:40 ` Patrick Bellasi
2018-11-28 14:55 ` Vincent Guittot
2018-11-28 15:21 ` Patrick Bellasi
2018-11-28 15:42 ` Vincent Guittot
2018-11-28 16:35 ` Patrick Bellasi
2018-11-29 10:43 ` Vincent Guittot
2018-11-29 15:00 ` Patrick Bellasi
2018-11-29 16:19 ` Vincent Guittot
2019-01-10 15:30 ` Patrick Bellasi
2019-01-11 14:29 ` Vincent Guittot
2018-11-29 12:53 ` Peter Zijlstra
2018-11-29 15:13 ` Patrick Bellasi
2019-01-24 9:07 ` Peter Zijlstra [this message]
2019-01-24 14:04 ` Patrick Bellasi
2019-01-29 19:42 ` Peter Zijlstra
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20190124090755.GC13536@hirez.programming.kicks-ass.net \
--to=peterz@infradead.org \
--cc=Morten.Rasmussen@arm.com \
--cc=bsegall@google.com \
--cc=dietmar.eggemann@arm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@kernel.org \
--cc=patrick.bellasi@arm.com \
--cc=pjt@google.com \
--cc=pkondeti@codeaurora.org \
--cc=quentin.perret@arm.com \
--cc=rjw@rjwysocki.net \
--cc=srinivas.pandruvada@linux.intel.com \
--cc=thara.gopinath@linaro.org \
--cc=vincent.guittot@linaro.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox