* [PATCH v2 1/3] sched/fair: Remove unnecessary goto in update_sd_lb_stats()
2024-02-04 4:46 [PATCH v2 0/3] sched/fair: Simplify and optimize update_sd_pick_busiest() David Vernet
@ 2024-02-04 4:46 ` David Vernet
2024-02-04 11:44 ` Vincent Guittot
2024-02-04 4:46 ` [PATCH v2 2/3] sched/fair: Do strict inequality check for busiest misfit task group David Vernet
2024-02-04 4:46 ` [PATCH v2 3/3] sched/fair: Simplify some logic in update_sd_pick_busiest() David Vernet
2 siblings, 1 reply; 8+ messages in thread
From: David Vernet @ 2024-02-04 4:46 UTC (permalink / raw)
To: linux-kernel
Cc: mingo, peterz, juri.lelli, vincent.guittot, dietmar.eggemann,
rostedt, bsegall, mgorman, bristot, vschneid, kernel-team
In update_sd_lb_stats(), when we're iterating over the sched groups that
comprise a sched domain, we're skipping the call to
update_sd_pick_busiest() for the sched group that contains the local /
destination CPU. We use a goto to skip the call, but we could just as
easily check !local_group, as there's no other logic that we need to
skip with the goto. Let's remove the goto, and check for !local_group in
the if statement instead.
Reviewed-by: Valentin Schneider <vschneid@redhat.com>
Signed-off-by: David Vernet <void@manifault.com>
---
kernel/sched/fair.c | 7 +------
1 file changed, 1 insertion(+), 6 deletions(-)
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
index b803030c3a03..e7519ea434b1 100644
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -10578,16 +10578,11 @@ static inline void update_sd_lb_stats(struct lb_env *env, struct sd_lb_stats *sd
update_sg_lb_stats(env, sds, sg, sgs, &sg_status);
- if (local_group)
- goto next_group;
-
-
- if (update_sd_pick_busiest(env, sds, sg, sgs)) {
+ if (!local_group && update_sd_pick_busiest(env, sds, sg, sgs)) {
sds->busiest = sg;
sds->busiest_stat = *sgs;
}
-next_group:
/* Now, start updating sd_lb_stats */
sds->total_load += sgs->group_load;
sds->total_capacity += sgs->group_capacity;
--
2.43.0
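
The equivalence this patch relies on can be checked outside the kernel. The following is a minimal user-space sketch, not the kernel code: `struct group`, `struct summary`, `pick_busiest()` and both `scan_*()` helpers are hypothetical stand-ins invented for illustration. It assumes, as the commit message states, that the busiest check is the only logic the goto skips.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, simplified stand-in for a sched group and its stats. */
struct group {
	bool local;		/* group contains the destination CPU */
	unsigned long load;
};

/* Hypothetical stand-in for the domain-wide summary. */
struct summary {
	int busiest;			/* index of busiest non-local group, -1 if none */
	unsigned long total_load;	/* summed over all groups, local included */
};

/* Stand-in for update_sd_pick_busiest(): strictly higher load wins. */
static bool pick_busiest(const struct group *groups, size_t i, int busiest)
{
	return busiest < 0 || groups[i].load > groups[busiest].load;
}

/* Old shape: jump over the busiest check for the local group. */
static struct summary scan_with_goto(const struct group *groups, size_t n)
{
	struct summary s = { .busiest = -1, .total_load = 0 };

	for (size_t i = 0; i < n; i++) {
		if (groups[i].local)
			goto next_group;

		if (pick_busiest(groups, i, s.busiest))
			s.busiest = (int)i;

next_group:
		/* Totals are accumulated for every group, local or not. */
		s.total_load += groups[i].load;
	}
	return s;
}

/* New shape: fold the local-group test into the condition. */
static struct summary scan_with_and(const struct group *groups, size_t n)
{
	struct summary s = { .busiest = -1, .total_load = 0 };

	for (size_t i = 0; i < n; i++) {
		/* && short-circuits, so pick_busiest() is never called for the local group. */
		if (!groups[i].local && pick_busiest(groups, i, s.busiest))
			s.busiest = (int)i;

		s.total_load += groups[i].load;
	}
	return s;
}
```

Because `&&` short-circuits, the second form never evaluates the busiest check for the local group, which is the same control flow the goto produced.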
^ permalink raw reply related [flat|nested] 8+ messages in thread
* Re: [PATCH v2 1/3] sched/fair: Remove unnecessary goto in update_sd_lb_stats()
2024-02-04 4:46 ` [PATCH v2 1/3] sched/fair: Remove unnecessary goto in update_sd_lb_stats() David Vernet
@ 2024-02-04 11:44 ` Vincent Guittot
0 siblings, 0 replies; 8+ messages in thread
From: Vincent Guittot @ 2024-02-04 11:44 UTC (permalink / raw)
To: David Vernet
Cc: linux-kernel, mingo, peterz, juri.lelli, dietmar.eggemann,
rostedt, bsegall, mgorman, bristot, vschneid, kernel-team
On Sun, 4 Feb 2024 at 05:46, David Vernet <void@manifault.com> wrote:
>
> In update_sd_lb_stats(), when we're iterating over the sched groups that
> comprise a sched domain, we're skipping the call to
> update_sd_pick_busiest() for the sched group that contains the local /
> destination CPU. We use a goto to skip the call, but we could just as
> easily check !local_group, as there's no other logic that we need to
> skip with the goto. Let's remove the goto, and check for !local_group in
> the if statement instead.
>
> Reviewed-by: Valentin Schneider <vschneid@redhat.com>
> Signed-off-by: David Vernet <void@manifault.com>
Reviewed-by: Vincent Guittot <vincent.guittot@linaro.org>
> ---
> kernel/sched/fair.c | 7 +------
> 1 file changed, 1 insertion(+), 6 deletions(-)
>
> diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
> index b803030c3a03..e7519ea434b1 100644
> --- a/kernel/sched/fair.c
> +++ b/kernel/sched/fair.c
> @@ -10578,16 +10578,11 @@ static inline void update_sd_lb_stats(struct lb_env *env, struct sd_lb_stats *sd
>
> update_sg_lb_stats(env, sds, sg, sgs, &sg_status);
>
> - if (local_group)
> - goto next_group;
> -
> -
> - if (update_sd_pick_busiest(env, sds, sg, sgs)) {
> + if (!local_group && update_sd_pick_busiest(env, sds, sg, sgs)) {
> sds->busiest = sg;
> sds->busiest_stat = *sgs;
> }
>
> -next_group:
> /* Now, start updating sd_lb_stats */
> sds->total_load += sgs->group_load;
> sds->total_capacity += sgs->group_capacity;
> --
> 2.43.0
>
^ permalink raw reply [flat|nested] 8+ messages in thread
* [PATCH v2 2/3] sched/fair: Do strict inequality check for busiest misfit task group
2024-02-04 4:46 [PATCH v2 0/3] sched/fair: Simplify and optimize update_sd_pick_busiest() David Vernet
2024-02-04 4:46 ` [PATCH v2 1/3] sched/fair: Remove unnecessary goto in update_sd_lb_stats() David Vernet
@ 2024-02-04 4:46 ` David Vernet
2024-02-04 11:45 ` Vincent Guittot
2024-02-04 4:46 ` [PATCH v2 3/3] sched/fair: Simplify some logic in update_sd_pick_busiest() David Vernet
2 siblings, 1 reply; 8+ messages in thread
From: David Vernet @ 2024-02-04 4:46 UTC (permalink / raw)
To: linux-kernel
Cc: mingo, peterz, juri.lelli, vincent.guittot, dietmar.eggemann,
rostedt, bsegall, mgorman, bristot, vschneid, kernel-team
In update_sd_pick_busiest(), when comparing two sched groups that are
both of type group_misfit_task, we currently consider the new group as
busier than the current busiest group even if the new group has the
same misfit task load as the current busiest group. We can avoid some
unnecessary writes if we instead only consider the newest group to be
the busiest if it has a higher load than the current busiest. This
matches the behavior of other group types where we compare load, such as
two groups that are both overloaded.
Let's update the group_misfit_task type comparison to also only update
the busiest group in the event of strict inequality.
Signed-off-by: David Vernet <void@manifault.com>
---
kernel/sched/fair.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
index e7519ea434b1..76d03106040d 100644
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -10028,7 +10028,7 @@ static bool update_sd_pick_busiest(struct lb_env *env,
* If we have more than one misfit sg go with the biggest
* misfit.
*/
- if (sgs->group_misfit_task_load < busiest->group_misfit_task_load)
+ if (sgs->group_misfit_task_load <= busiest->group_misfit_task_load)
return false;
break;
--
2.43.0
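
To make the "unnecessary writes" argument concrete, here is a small user-space sketch, assuming a scan that keeps a single running "busiest" value. The helpers `pick_non_strict()`, `pick_strict()` and `count_updates()` are hypothetical and only mirror the shape of the comparison in the diff above; they are not kernel functions.

```c
#include <stdbool.h>
#include <stddef.h>

/* Old comparison: a candidate with an equal load still replaces the busiest. */
static bool pick_non_strict(unsigned long cand, unsigned long busiest)
{
	if (cand < busiest)
		return false;
	return true;
}

/* New comparison: only a strictly higher load replaces the busiest. */
static bool pick_strict(unsigned long cand, unsigned long busiest)
{
	if (cand <= busiest)
		return false;
	return true;
}

typedef bool (*pick_fn)(unsigned long cand, unsigned long busiest);

/*
 * Count how many times the running "busiest" value is written while
 * scanning the loads in order. The first element always seeds it.
 */
static size_t count_updates(const unsigned long *loads, size_t n, pick_fn pick)
{
	unsigned long busiest;
	size_t updates;

	if (n == 0)
		return 0;

	busiest = loads[0];
	updates = 1;

	for (size_t i = 1; i < n; i++) {
		if (pick(loads[i], busiest)) {
			busiest = loads[i];
			updates++;
		}
	}
	return updates;
}
```

In this sketch, ties no longer trigger a rewrite under the strict comparison, so the first group seen with a given load is retained rather than the last one.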
^ permalink raw reply related [flat|nested] 8+ messages in thread
* Re: [PATCH v2 2/3] sched/fair: Do strict inequality check for busiest misfit task group
2024-02-04 4:46 ` [PATCH v2 2/3] sched/fair: Do strict inequality check for busiest misfit task group David Vernet
@ 2024-02-04 11:45 ` Vincent Guittot
0 siblings, 0 replies; 8+ messages in thread
From: Vincent Guittot @ 2024-02-04 11:45 UTC (permalink / raw)
To: David Vernet
Cc: linux-kernel, mingo, peterz, juri.lelli, dietmar.eggemann,
rostedt, bsegall, mgorman, bristot, vschneid, kernel-team
On Sun, 4 Feb 2024 at 05:46, David Vernet <void@manifault.com> wrote:
>
> In update_sd_pick_busiest(), when comparing two sched groups that are
> both of type group_misfit_task, we currently consider the new group as
> busier than the current busiest group even if the new group has the
> same misfit task load as the current busiest group. We can avoid some
> unnecessary writes if we instead only consider the newest group to be
> the busiest if it has a higher load than the current busiest. This
> matches the behavior of other group types where we compare load, such as
> two groups that are both overloaded.
>
> Let's update the group_misfit_task type comparison to also only update
> the busiest group in the event of strict inequality.
fair enough
Reviewed-by: Vincent Guittot <vincent.guittot@linaro.org>
>
> Signed-off-by: David Vernet <void@manifault.com>
> ---
> kernel/sched/fair.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
> index e7519ea434b1..76d03106040d 100644
> --- a/kernel/sched/fair.c
> +++ b/kernel/sched/fair.c
> @@ -10028,7 +10028,7 @@ static bool update_sd_pick_busiest(struct lb_env *env,
> * If we have more than one misfit sg go with the biggest
> * misfit.
> */
> - if (sgs->group_misfit_task_load < busiest->group_misfit_task_load)
> + if (sgs->group_misfit_task_load <= busiest->group_misfit_task_load)
> return false;
> break;
>
> --
> 2.43.0
>
^ permalink raw reply [flat|nested] 8+ messages in thread
* [PATCH v2 3/3] sched/fair: Simplify some logic in update_sd_pick_busiest()
2024-02-04 4:46 [PATCH v2 0/3] sched/fair: Simplify and optimize update_sd_pick_busiest() David Vernet
2024-02-04 4:46 ` [PATCH v2 1/3] sched/fair: Remove unnecessary goto in update_sd_lb_stats() David Vernet
2024-02-04 4:46 ` [PATCH v2 2/3] sched/fair: Do strict inequality check for busiest misfit task group David Vernet
@ 2024-02-04 4:46 ` David Vernet
2024-02-04 11:48 ` Vincent Guittot
2 siblings, 1 reply; 8+ messages in thread
From: David Vernet @ 2024-02-04 4:46 UTC (permalink / raw)
To: linux-kernel
Cc: mingo, peterz, juri.lelli, vincent.guittot, dietmar.eggemann,
rostedt, bsegall, mgorman, bristot, vschneid, kernel-team
When comparing the current struct sched_group with the yet-busiest
domain in update_sd_pick_busiest(), if the two groups have the same
group type, we're currently doing a bit of unnecessary work for any
group >= group_misfit_task. We're comparing the two groups, and then
returning only if false (the group in question is not the busiest).
Otherwise, we break, do an extra unnecessary conditional check that's
vacuously false for any group type > group_fully_busy, and then always
return true.
Let's just return directly in the switch statement instead. This doesn't
change the size of vmlinux with llvm 17 (not surprising given that all
of this is inlined in load_balance()), but it does shrink load_balance()
by 88 bytes on x86. Given that it also improves readability, this seems
worth doing.
As a bonus, remove an unnecessary goto in update_sd_lb_stats().
Reviewed-by: Valentin Schneider <vschneid@redhat.com>
Signed-off-by: David Vernet <void@manifault.com>
---
kernel/sched/fair.c | 12 +++---------
1 file changed, 3 insertions(+), 9 deletions(-)
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
index 76d03106040d..fa049f866461 100644
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -10006,9 +10006,7 @@ static bool update_sd_pick_busiest(struct lb_env *env,
switch (sgs->group_type) {
case group_overloaded:
/* Select the overloaded group with highest avg_load. */
- if (sgs->avg_load <= busiest->avg_load)
- return false;
- break;
+ return sgs->avg_load > busiest->avg_load;
case group_imbalanced:
/*
@@ -10019,18 +10017,14 @@ static bool update_sd_pick_busiest(struct lb_env *env,
case group_asym_packing:
/* Prefer to move from lowest priority CPU's work */
- if (sched_asym_prefer(sg->asym_prefer_cpu, sds->busiest->asym_prefer_cpu))
- return false;
- break;
+ return sched_asym_prefer(sds->busiest->asym_prefer_cpu, sg->asym_prefer_cpu);
case group_misfit_task:
/*
* If we have more than one misfit sg go with the biggest
* misfit.
*/
- if (sgs->group_misfit_task_load <= busiest->group_misfit_task_load)
- return false;
- break;
+ return sgs->group_misfit_task_load > busiest->group_misfit_task_load;
case group_smt_balance:
/*
--
2.43.0
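
The control-flow change described in this patch can be sketched in user space as follows. Everything here is a hypothetical simplification for illustration: the three-value `enum sketch_type`, `struct stats`, and the `low_type_veto` flag (standing in for the trailing conditional that only applies to group types at or below "fully busy") are assumptions, not the kernel's definitions.

```c
#include <stdbool.h>

/* Hypothetical, reduced ordering of group types (lowest to highest). */
enum sketch_type { T_FULLY_BUSY, T_MISFIT, T_OVERLOADED };

struct stats {
	enum sketch_type type;
	unsigned long avg_load;
	unsigned long misfit_load;
};

/* Old shape: return false early, otherwise break and fall to the tail. */
static bool pick_with_break(const struct stats *sgs, const struct stats *busiest,
			    bool low_type_veto)
{
	switch (sgs->type) {
	case T_OVERLOADED:
		if (sgs->avg_load <= busiest->avg_load)
			return false;
		break;
	case T_MISFIT:
		if (sgs->misfit_load <= busiest->misfit_load)
			return false;
		break;
	default:
		break;
	}

	/* Tail check: can only fire for types <= T_FULLY_BUSY. */
	if (sgs->type <= T_FULLY_BUSY && low_type_veto)
		return false;

	return true;
}

/* New shape: higher group types return the comparison result directly. */
static bool pick_direct(const struct stats *sgs, const struct stats *busiest,
			bool low_type_veto)
{
	switch (sgs->type) {
	case T_OVERLOADED:
		return sgs->avg_load > busiest->avg_load;
	case T_MISFIT:
		return sgs->misfit_load > busiest->misfit_load;
	default:
		break;
	}

	if (sgs->type <= T_FULLY_BUSY && low_type_veto)
		return false;

	return true;
}
```

For any type above `T_FULLY_BUSY` the tail check is false regardless of `low_type_veto`, so both functions return the same value; the direct form simply skips evaluating it.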
^ permalink raw reply related [flat|nested] 8+ messages in thread
* Re: [PATCH v2 3/3] sched/fair: Simplify some logic in update_sd_pick_busiest()
2024-02-04 4:46 ` [PATCH v2 3/3] sched/fair: Simplify some logic in update_sd_pick_busiest() David Vernet
@ 2024-02-04 11:48 ` Vincent Guittot
2024-02-05 15:10 ` David Vernet
0 siblings, 1 reply; 8+ messages in thread
From: Vincent Guittot @ 2024-02-04 11:48 UTC (permalink / raw)
To: David Vernet
Cc: linux-kernel, mingo, peterz, juri.lelli, dietmar.eggemann,
rostedt, bsegall, mgorman, bristot, vschneid, kernel-team
On Sun, 4 Feb 2024 at 05:46, David Vernet <void@manifault.com> wrote:
>
> When comparing the current struct sched_group with the yet-busiest
> domain in update_sd_pick_busiest(), if the two groups have the same
> group type, we're currently doing a bit of unnecessary work for any
> group >= group_misfit_task. We're comparing the two groups, and then
> returning only if false (the group in question is not the busiest).
> Otherwise, we break, do an extra unnecessary conditional check that's
> vacuously false for any group type > group_fully_busy, and then always
> return true.
>
> Let's just return directly in the switch statement instead. This doesn't
> change the size of vmlinux with llvm 17 (not surprising given that all
> of this is inlined in load_balance()), but it does shrink load_balance()
> by 88 bytes on x86. Given that it also improves readability, this seems
> worth doing.
>
> As a bonus, remove an unnecessary goto in update_sd_lb_stats().
The line above is not relevant to the content of the patch.
Other than that
Reviewed-by: Vincent Guittot <vincent.guittot@linaro.org>
>
> Reviewed-by: Valentin Schneider <vschneid@redhat.com>
> Signed-off-by: David Vernet <void@manifault.com>
> ---
> kernel/sched/fair.c | 12 +++---------
> 1 file changed, 3 insertions(+), 9 deletions(-)
>
> diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
> index 76d03106040d..fa049f866461 100644
> --- a/kernel/sched/fair.c
> +++ b/kernel/sched/fair.c
> @@ -10006,9 +10006,7 @@ static bool update_sd_pick_busiest(struct lb_env *env,
> switch (sgs->group_type) {
> case group_overloaded:
> /* Select the overloaded group with highest avg_load. */
> - if (sgs->avg_load <= busiest->avg_load)
> - return false;
> - break;
> + return sgs->avg_load > busiest->avg_load;
>
> case group_imbalanced:
> /*
> @@ -10019,18 +10017,14 @@ static bool update_sd_pick_busiest(struct lb_env *env,
>
> case group_asym_packing:
> /* Prefer to move from lowest priority CPU's work */
> - if (sched_asym_prefer(sg->asym_prefer_cpu, sds->busiest->asym_prefer_cpu))
> - return false;
> - break;
> + return sched_asym_prefer(sds->busiest->asym_prefer_cpu, sg->asym_prefer_cpu);
>
> case group_misfit_task:
> /*
> * If we have more than one misfit sg go with the biggest
> * misfit.
> */
> - if (sgs->group_misfit_task_load <= busiest->group_misfit_task_load)
> - return false;
> - break;
> + return sgs->group_misfit_task_load > busiest->group_misfit_task_load;
>
> case group_smt_balance:
> /*
> --
> 2.43.0
>
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH v2 3/3] sched/fair: Simplify some logic in update_sd_pick_busiest()
2024-02-04 11:48 ` Vincent Guittot
@ 2024-02-05 15:10 ` David Vernet
0 siblings, 0 replies; 8+ messages in thread
From: David Vernet @ 2024-02-05 15:10 UTC (permalink / raw)
To: Vincent Guittot
Cc: linux-kernel, mingo, peterz, juri.lelli, dietmar.eggemann,
rostedt, bsegall, mgorman, bristot, vschneid, kernel-team
On Sun, Feb 04, 2024 at 12:48:11PM +0100, Vincent Guittot wrote:
> On Sun, 4 Feb 2024 at 05:46, David Vernet <void@manifault.com> wrote:
> >
> > When comparing the current struct sched_group with the yet-busiest
> > domain in update_sd_pick_busiest(), if the two groups have the same
> > group type, we're currently doing a bit of unnecessary work for any
> > group >= group_misfit_task. We're comparing the two groups, and then
> > returning only if false (the group in question is not the busiest).
> > Otherwise, we break, do an extra unnecessary conditional check that's
> > vacuously false for any group type > group_fully_busy, and then always
> > return true.
> >
> > Let's just return directly in the switch statement instead. This doesn't
> > change the size of vmlinux with llvm 17 (not surprising given that all
> > of this is inlined in load_balance()), but it does shrink load_balance()
> > by 88 bytes on x86. Given that it also improves readability, this seems
> > worth doing.
> >
> > As a bonus, remove an unnecessary goto in update_sd_lb_stats().
>
> The line above is not relevant to the content of the patch.
Ah, thanks for catching that.
Should I send a v3 of the patch set? Or should I just let whoever
applies it remove that line?
> Other than that
>
> Reviewed-by: Vincent Guittot <vincent.guittot@linaro.org>
Thanks,
David
^ permalink raw reply [flat|nested] 8+ messages in thread