All of lore.kernel.org
 help / color / mirror / Atom feed
From: Mike Galbraith <efault@gmx.de>
To: Vincent Guittot <vincent.guittot@linaro.org>
Cc: linux-kernel <linux-kernel@vger.kernel.org>,
	"linaro-kernel@lists.linaro.org" <linaro-kernel@lists.linaro.org>,
	"Peter Zijlstra" <peterz@infradead.org>,
	"Ingo Molnar" <mingo@kernel.org>, "Paul Turner" <pjt@google.com>,
	"Steven Rostedt" <rostedt@goodmis.org>,
	"Frédéric Weisbecker" <fweisbec@gmail.com>
Subject: Re: [PATCH Resend v6] sched: fix wrong rq's runnable_avg update with rt tasks
Date: Fri, 19 Apr 2013 11:21:44 +0200	[thread overview]
Message-ID: <1366363304.4708.75.camel@marge.simpson.net> (raw)
In-Reply-To: <CAKfTPtBuAwyiNYk17SnW066FhvGF18n+HaiKwhKLgsii8ntcHA@mail.gmail.com>

On Fri, 2013-04-19 at 10:50 +0200, Vincent Guittot wrote: 
> On 19 April 2013 10:14, Mike Galbraith <efault@gmx.de> wrote:
> > On Fri, 2013-04-19 at 09:49 +0200, Vincent Guittot wrote:
> >> On 19 April 2013 06:30, Mike Galbraith <efault@gmx.de> wrote:
> >> > On Thu, 2013-04-18 at 18:34 +0200, Vincent Guittot wrote:
> >> >> The current update of the rq's load can be erroneous when RT tasks are
> >> >> involved
> >> >>
> >> >> The update of the load of a rq that becomes idle, is done only if the avg_idle
> >> >> is less than sysctl_sched_migration_cost. If RT tasks and short idle duration
> >> >> alternate, the runnable_avg will not be updated correctly and the time will be
> >> >> accounted as idle time when a CFS task wakes up.
> >> >>
> >> >> A new idle_enter function is called when the next task is the idle function
> >> >> so the elapsed time will be accounted as run time in the load of the rq,
> >> >> whatever the average idle time is. The function update_rq_runnable_avg is
> >> >> removed from idle_balance.
> >> >>
> >> >> When a RT task is scheduled on an idle CPU, the update of the rq's load is
> >> >> not done when the rq exit idle state because CFS's functions are not
> >> >> called. Then, the idle_balance, which is called just before entering the
> >> >> idle function, updates the rq's load and makes the assumption that the
> >> >> elapsed time since the last update, was only running time.
> >> >>
> >> >> As a consequence, the rq's load of a CPU that only runs a periodic RT task,
> >> >> is close to LOAD_AVG_MAX whatever the running duration of the RT task is.
> >> >
> >> > Why do we care what rq's load says, if the only thing running is a
> >> > periodic RT task?  I _think_ I recall that stuff being put under the
> >>
> >> cfs scheduler will use a wrong rq load the next time it wants to schedule a task
> >>
> >> > throttle specifically to not waste cycles doing that on every
> >> > microscopic idle.
> >>
> >> yes but this lead to the wrong computation of runnable_avg_sum. To be
> >> more precise, we only need to call __update_entity_runnable_avg,
> >> __update_tg_runnable_avg is not mandatory in this step.
> >
> > If it only scares fair class tasks away from the periodic rt load, that
> > seems like a benefit to me, not a liability.  If we really really need
> 
> I'm not sure that such behavior that is only based on erroneous value,
> is good one.
> 
> > perfect load numbers, fine, we have to eat some cycles, but when I look
> > at it, it looks like one of those "Perfect is the enemy of good" things.
> 
> The target is not perfect number but good enough to be usable. The
> systctl_migration_cost threshold is good for idle balancing but can
> generates wrong load value

But again, why do we care?  To be able to mix rt and fair loads and
still make pretty mixed load utilization numbers?  Paying a general case
fast path price to make strange (to me) load utilization numbers pretty
is not very attractive.  If you muck about with rt classes, you need to
have a good reason for doing that.  If you do have a good reason, you
also allocated all resources, including CPU, so don't need the kernel to
balance the load for you.  Paying any fast path price to make the kernel
balance a mixed rt/fair load just seems fundamentally wrong to me.

-Mike


  reply	other threads:[~2013-04-19  9:21 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-04-18 16:34 [PATCH Resend v6] sched: fix wrong rq's runnable_avg update with rt tasks Vincent Guittot
2013-04-19  4:30 ` Mike Galbraith
2013-04-19  7:49   ` Vincent Guittot
2013-04-19  8:14     ` Mike Galbraith
2013-04-19  8:50       ` Vincent Guittot
2013-04-19  9:21         ` Mike Galbraith [this message]
2013-04-19  9:37           ` Mike Galbraith
2013-04-19 11:11           ` Vincent Guittot
2013-04-19 11:47 ` Peter Zijlstra
2013-04-19 21:28 ` Steven Rostedt
2013-04-21 12:52 ` [tip:sched/core] sched: Fix wrong rq' s " tip-bot for Vincent Guittot

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1366363304.4708.75.camel@marge.simpson.net \
    --to=efault@gmx.de \
    --cc=fweisbec@gmail.com \
    --cc=linaro-kernel@lists.linaro.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@kernel.org \
    --cc=peterz@infradead.org \
    --cc=pjt@google.com \
    --cc=rostedt@goodmis.org \
    --cc=vincent.guittot@linaro.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.