From: Peter Zijlstra <peterz@infradead.org>
To: Vincent Guittot <vincent.guittot@linaro.org>
Cc: mingo@kernel.org, linux-kernel@vger.kernel.org,
	preeti@linux.vnet.ibm.com, Morten.Rasmussen@arm.com,
	kamalesh@linux.vnet.ibm.com, riel@redhat.com, efault@gmx.de,
	nicolas.pitre@linaro.org, dietmar.eggemann@arm.com,
	linaro-kernel@lists.linaro.org
Subject: Re: [PATCH RESEND v9 08/10] sched: replace capacity_factor by usage
Date: Fri, 20 Feb 2015 12:14:04 +0100
Message-ID: <20150220111404.GM5029@twins.programming.kicks-ass.net>
In-Reply-To: <1421316570-23097-9-git-send-email-vincent.guittot@linaro.org>

On Thu, Jan 15, 2015 at 11:09:28AM +0100, Vincent Guittot wrote:

> Finally, the sched_group->sched_group_capacity->capacity_orig has been removed
> because it's no more used during load balance.

Maybe do that in a separate patch to avoid cluttering this one?

> [1] https://lkml.org/lkml/2014/8/12/295

Patch references are like:
9a5d9ba6a363 ("sched/fair: Allow calculate_imbalance() to move idle cpus")

>  /*
> + * Check whether the capacity of the rq has been noticeably reduced by side
> + * activity. The imbalance_pct is used for the threshold.
> + * Return true is the capacity is reduced
>   */
>  static inline int
> +check_cpu_capacity(struct rq *rq, struct sched_domain *sd)
>  {
> +	return ((rq->cpu_capacity * sd->imbalance_pct) <
> +				(rq->cpu_capacity_orig * 100));
>  }

How about cpu_has_capacity() to be consistent with the below function?
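
For reference, the threshold in the quoted check can be tried outside the kernel. The following is only a sketch: the struct and function names are hypothetical stand-ins (not the kernel's struct rq or struct sched_domain), and an imbalance_pct of 125 is assumed for illustration. With an original capacity of 1024, the test fires once the remaining capacity drops below 1024 * 100 / 125 = 819.2.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the two struct rq fields used by the check. */
struct rq_sketch {
	unsigned long cpu_capacity;      /* capacity currently left for CFS tasks */
	unsigned long cpu_capacity_orig; /* original capacity of the CPU */
};

/* Hypothetical stand-in for the one struct sched_domain field used. */
struct sd_sketch {
	unsigned int imbalance_pct;      /* assumed to be 125 in the example */
};

/*
 * Same arithmetic as the quoted check_cpu_capacity(): true when
 * cpu_capacity < cpu_capacity_orig * 100 / imbalance_pct, written
 * without a division.
 */
static inline bool
cpu_capacity_reduced(const struct rq_sketch *rq, const struct sd_sketch *sd)
{
	return (rq->cpu_capacity * sd->imbalance_pct) <
	       (rq->cpu_capacity_orig * 100);
}
```

With these assumed numbers, a cpu_capacity of 819 counts as reduced and 820 does not.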

This comment could use whitespace:

>  /*
> + * group_has_capacity returns true if the group has spare capacity that could
> + * be used by some tasks.

      We consider that a group has spare capacity if the
> + * number of task is smaller than the number of CPUs or if the usage is lower
> + * than the available capacity for CFS tasks.

      For the latter, we use a
> + * threshold to stabilize the state, to take into account the variance of the
> + * tasks' load and to return true if the available capacity in meaningful for
> + * the load balancer.

      As an example, an available capacity of 1% can appear
> + * but it doesn't make any benefit for the load balance.
>   */
> +static inline bool
> +group_has_capacity(struct lb_env *env, struct sg_lb_stats *sgs)
>  {
> +	if ((sgs->group_capacity * 100) >
> +			(sgs->group_usage * env->sd->imbalance_pct))
> +		return true;
>  
> +	if (sgs->sum_nr_running < sgs->group_weight)
> +		return true;
> +
> +	return false;
> +}

Would it not make sense to do the nr_running test first? It's cheaper
than the multiplication thing.
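
Something like the below, as a rough userspace sketch of the reordering. The struct is a hypothetical stand-in carrying only the four sg_lb_stats fields the quoted function reads, and imbalance_pct is passed directly instead of going through env->sd. The result is the same as the quoted version; only the order of the two tests changes.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the sg_lb_stats fields used here. */
struct sg_stats_sketch {
	unsigned long group_capacity;
	unsigned long group_usage;
	unsigned int sum_nr_running;
	unsigned int group_weight;
};

static inline bool
group_has_capacity_sketch(unsigned int imbalance_pct,
			  const struct sg_stats_sketch *sgs)
{
	/* Cheap compare first: fewer tasks than CPUs means spare capacity. */
	if (sgs->sum_nr_running < sgs->group_weight)
		return true;

	/* Only then pay for the two multiplications. */
	if ((sgs->group_capacity * 100) >
			(sgs->group_usage * imbalance_pct))
		return true;

	return false;
}
```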

> +/*
> + *  group_is_overloaded returns true if the group has more tasks than it can
> + *  handle.

       We consider that a group is overloaded if the number of tasks is
> + *  greater than the number of CPUs and the tasks already use all available
> + *  capacity for CFS tasks.

       For the latter, we use a threshold to stabilize
> + *  the state, to take into account the variance of tasks' load and to return
> + *  true if available capacity is no more meaningful for load balancer
> + */
> +static inline bool
> +group_is_overloaded(struct lb_env *env, struct sg_lb_stats *sgs)
> +{
> +	if (sgs->sum_nr_running <= sgs->group_weight)
> +		return false;
>  
> +	if ((sgs->group_capacity * 100) <
> +			(sgs->group_usage * env->sd->imbalance_pct))
> +		return true;
>  
> +	return false;
>  }

Maybe a note on the difference between group_is_overloaded() and
!group_has_capacity()?
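
To illustrate why the two are not simply each other's negation, here is a small sketch of both predicates as quoted above. The struct and function names are hypothetical, and imbalance_pct is passed as a plain argument. A group with exactly as many tasks as CPUs and usage above the threshold has no spare capacity, yet is not overloaded either.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the sg_lb_stats fields both predicates read. */
struct sg_sketch {
	unsigned long capacity;  /* group_capacity */
	unsigned long usage;     /* group_usage */
	unsigned int nr_running; /* sum_nr_running */
	unsigned int weight;     /* group_weight, i.e. number of CPUs */
};

/* Mirrors the quoted group_has_capacity(). */
static inline bool has_capacity(const struct sg_sketch *g, unsigned int pct)
{
	if ((g->capacity * 100) > (g->usage * pct))
		return true;

	if (g->nr_running < g->weight)
		return true;

	return false;
}

/* Mirrors the quoted group_is_overloaded(). */
static inline bool is_overloaded(const struct sg_sketch *g, unsigned int pct)
{
	if (g->nr_running <= g->weight)
		return false;

	if ((g->capacity * 100) < (g->usage * pct))
		return true;

	return false;
}
```

For example, assuming pct = 125, capacity = 4096, usage = 4000 and weight = 4: with nr_running = 4 both predicates return false, while with nr_running = 5 only is_overloaded() returns true.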

As to the comment, I think it can be reduced by referring to the comment
of group_has_capacity().

>  		/*
>  		 * In case the child domain prefers tasks go to siblings
> +		 * first, lower the sg capacity so that we'll try
>  		 * and move all the excess tasks away. We lower the capacity
>  		 * of a group only if the local group has the capacity to fit
> +		 * these excess tasks.

                   The extra check prevents the case where
> +		 * you always pull from the heaviest group when it is already
> +		 * under-utilized (possible with a large weight task outweighs
> +		 * the tasks on the system).
>  		 */
>  		if (prefer_sibling && sds->local &&
> +		    group_has_capacity(env, &sds->local_stat) &&
> +		    (sgs->sum_nr_running > 1)) {
> +			sgs->group_no_capacity = 1;
> +			sgs->group_type = group_overloaded;
> +		}

Looks OK otherwise I suppose.
