public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH 0/2] Two fixes for sysctl_sched_rr_timeslice
@ 2023-07-19 10:37 Cyril Hrubis
  2023-07-19 10:37 ` [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value Cyril Hrubis
  2023-07-19 10:37 ` [PATCH 2/2] sched/rt: sysctl_sched_rr_timeslice show default timeslice after reset Cyril Hrubis
  0 siblings, 2 replies; 8+ messages in thread
From: Cyril Hrubis @ 2023-07-19 10:37 UTC (permalink / raw)
  To: Ingo Molnar, Peter Zijlstra, Juri Lelli, Vincent Guittot,
	Dietmar Eggemann, Steven Rostedt, Ben Segall, Mel Gorman,
	Daniel Bristot de Oliveira, Valentin Schneider, linux-kernel
  Cc: ltp, Cyril Hrubis

- Fixes rounding error for initial value with CONFIG_HZ_300

- Fixes read from the file after reset to default (by writing val <= 0)

Cyril Hrubis (2):
  sched/rt: Fix sysctl_sched_rr_timeslice intial value
  sched/rt: sysctl_sched_rr_timeslice show default timeslice after reset

 kernel/sched/rt.c | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

-- 
2.41.0


^ permalink raw reply	[flat|nested] 8+ messages in thread

* [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value
  2023-07-19 10:37 [PATCH 0/2] Two fixes for sysctl_sched_rr_timeslice Cyril Hrubis
@ 2023-07-19 10:37 ` Cyril Hrubis
  2023-07-19 11:21   ` [LTP] " Petr Vorel
                     ` (2 more replies)
  2023-07-19 10:37 ` [PATCH 2/2] sched/rt: sysctl_sched_rr_timeslice show default timeslice after reset Cyril Hrubis
  1 sibling, 3 replies; 8+ messages in thread
From: Cyril Hrubis @ 2023-07-19 10:37 UTC (permalink / raw)
  To: Ingo Molnar, Peter Zijlstra, Juri Lelli, Vincent Guittot,
	Dietmar Eggemann, Steven Rostedt, Ben Segall, Mel Gorman,
	Daniel Bristot de Oliveira, Valentin Schneider, linux-kernel
  Cc: ltp, Cyril Hrubis, Jiri Bohac

Thre is 10% rounding error in the intial value of the
sysctl_sched_rr_timeslice with  CONFIG_HZ_300=y.

This was found with LTP test sched_rr_get_interval01:

sched_rr_get_interval01.c:57: TPASS: sched_rr_get_interval() passed
sched_rr_get_interval01.c:64: TPASS: Time quantum 0s 99999990ns
sched_rr_get_interval01.c:72: TFAIL: /proc/sys/kernel/sched_rr_timeslice_ms != 100 got 90
sched_rr_get_interval01.c:57: TPASS: sched_rr_get_interval() passed
sched_rr_get_interval01.c:64: TPASS: Time quantum 0s 99999990ns
sched_rr_get_interval01.c:72: TFAIL: /proc/sys/kernel/sched_rr_timeslice_ms != 100 got 90

What this test does is to compare the return value from the
sched_rr_get_interval() and the sched_rr_timeslice_ms sysctl file and
fails if they do not match.

The prolem it found is the intial sysctl file value which was computed as:

static int sysctl_sched_rr_timeslice = (MSEC_PER_SEC / HZ) * RR_TIMESLICE;

which works fine as long as MSEC_PER_SEC is multiple of HZ, however it
introduces 10% rounding error for CONFIG_HZ_300:

(MSEC_PER_SEC / HZ) * (100 * HZ / 1000)

(1000 / 300) * (100 * 300 / 1000)

3 * 30 = 90

This can be easily fixed by reversing the order of the multiplication
and division. After this fix we get:

(MSEC_PER_SEC * (100 * HZ / 1000)) / HZ

(1000 * (100 * 300 / 1000)) / 300

(1000 * 30) / 300 = 100

Signed-off-by: Cyril Hrubis <chrubis@suse.cz>
CC: Jiri Bohac <jbohac@suse.cz>
---
 kernel/sched/rt.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/kernel/sched/rt.c b/kernel/sched/rt.c
index 00e0e5074115..185d3d749f6b 100644
--- a/kernel/sched/rt.c
+++ b/kernel/sched/rt.c
@@ -25,7 +25,7 @@ unsigned int sysctl_sched_rt_period = 1000000;
 int sysctl_sched_rt_runtime = 950000;
 
 #ifdef CONFIG_SYSCTL
-static int sysctl_sched_rr_timeslice = (MSEC_PER_SEC / HZ) * RR_TIMESLICE;
+static int sysctl_sched_rr_timeslice = (MSEC_PER_SEC * RR_TIMESLICE) / HZ;
 static int sched_rt_handler(struct ctl_table *table, int write, void *buffer,
 		size_t *lenp, loff_t *ppos);
 static int sched_rr_handler(struct ctl_table *table, int write, void *buffer,
-- 
2.41.0


^ permalink raw reply related	[flat|nested] 8+ messages in thread

* [PATCH 2/2] sched/rt: sysctl_sched_rr_timeslice show default timeslice after reset
  2023-07-19 10:37 [PATCH 0/2] Two fixes for sysctl_sched_rr_timeslice Cyril Hrubis
  2023-07-19 10:37 ` [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value Cyril Hrubis
@ 2023-07-19 10:37 ` Cyril Hrubis
  2023-07-20 10:00   ` Mel Gorman
  2023-07-21 16:14   ` Petr Vorel
  1 sibling, 2 replies; 8+ messages in thread
From: Cyril Hrubis @ 2023-07-19 10:37 UTC (permalink / raw)
  To: Ingo Molnar, Peter Zijlstra, Juri Lelli, Vincent Guittot,
	Dietmar Eggemann, Steven Rostedt, Ben Segall, Mel Gorman,
	Daniel Bristot de Oliveira, Valentin Schneider, linux-kernel
  Cc: ltp, Cyril Hrubis, Jiri Bohac

The sched_rr_timeslice can be reset to default by writing value that is
<= 0. However after reading from this file we always got the last value
written, which is not useful at all.

$ echo -1 > /proc/sys/kernel/sched_rr_timeslice_ms
$ cat /proc/sys/kernel/sched_rr_timeslice_ms
-1

Fix this by setting the variable that holds the sysctl file value to the
jiffies_to_msecs(RR_TIMESLICE) in case that <= 0 value was written.

Signed-off-by: Cyril Hrubis <chrubis@suse.cz>
CC: Jiri Bohac <jbohac@suse.cz>
---
 kernel/sched/rt.c | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/kernel/sched/rt.c b/kernel/sched/rt.c
index 185d3d749f6b..0597ba0f85ff 100644
--- a/kernel/sched/rt.c
+++ b/kernel/sched/rt.c
@@ -3062,6 +3062,9 @@ static int sched_rr_handler(struct ctl_table *table, int write, void *buffer,
 		sched_rr_timeslice =
 			sysctl_sched_rr_timeslice <= 0 ? RR_TIMESLICE :
 			msecs_to_jiffies(sysctl_sched_rr_timeslice);
+
+		if (sysctl_sched_rr_timeslice <= 0)
+			sysctl_sched_rr_timeslice = jiffies_to_msecs(RR_TIMESLICE);
 	}
 	mutex_unlock(&mutex);
 
-- 
2.41.0


^ permalink raw reply related	[flat|nested] 8+ messages in thread

* Re: [LTP] [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value
  2023-07-19 10:37 ` [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value Cyril Hrubis
@ 2023-07-19 11:21   ` Petr Vorel
  2023-07-20  9:57   ` Mel Gorman
  2023-07-21 16:16   ` [LTP] " Petr Vorel
  2 siblings, 0 replies; 8+ messages in thread
From: Petr Vorel @ 2023-07-19 11:21 UTC (permalink / raw)
  To: Cyril Hrubis
  Cc: Ingo Molnar, Peter Zijlstra, Juri Lelli, Vincent Guittot,
	Dietmar Eggemann, Steven Rostedt, Ben Segall, Mel Gorman,
	Daniel Bristot de Oliveira, Valentin Schneider, linux-kernel,
	Jiri Bohac, ltp, Shile Zhang, Shile Zhang

Hi,

[ Cc Shile Zhang ]

> Thre is 10% rounding error in the intial value of the
> sysctl_sched_rr_timeslice with  CONFIG_HZ_300=y.

> This was found with LTP test sched_rr_get_interval01:

> sched_rr_get_interval01.c:57: TPASS: sched_rr_get_interval() passed
> sched_rr_get_interval01.c:64: TPASS: Time quantum 0s 99999990ns
> sched_rr_get_interval01.c:72: TFAIL: /proc/sys/kernel/sched_rr_timeslice_ms != 100 got 90
> sched_rr_get_interval01.c:57: TPASS: sched_rr_get_interval() passed
> sched_rr_get_interval01.c:64: TPASS: Time quantum 0s 99999990ns
> sched_rr_get_interval01.c:72: TFAIL: /proc/sys/kernel/sched_rr_timeslice_ms != 100 got 90

> What this test does is to compare the return value from the
> sched_rr_get_interval() and the sched_rr_timeslice_ms sysctl file and
> fails if they do not match.

> The prolem it found is the intial sysctl file value which was computed as:

> static int sysctl_sched_rr_timeslice = (MSEC_PER_SEC / HZ) * RR_TIMESLICE;

> which works fine as long as MSEC_PER_SEC is multiple of HZ, however it
> introduces 10% rounding error for CONFIG_HZ_300:

> (MSEC_PER_SEC / HZ) * (100 * HZ / 1000)

> (1000 / 300) * (100 * 300 / 1000)

> 3 * 30 = 90

> This can be easily fixed by reversing the order of the multiplication
> and division. After this fix we get:

> (MSEC_PER_SEC * (100 * HZ / 1000)) / HZ

> (1000 * (100 * 300 / 1000)) / 300

> (1000 * 30) / 300 = 100

> Signed-off-by: Cyril Hrubis <chrubis@suse.cz>
> CC: Jiri Bohac <jbohac@suse.cz>
> ---
>  kernel/sched/rt.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)

> diff --git a/kernel/sched/rt.c b/kernel/sched/rt.c
> index 00e0e5074115..185d3d749f6b 100644
> --- a/kernel/sched/rt.c
> +++ b/kernel/sched/rt.c
> @@ -25,7 +25,7 @@ unsigned int sysctl_sched_rt_period = 1000000;
>  int sysctl_sched_rt_runtime = 950000;

>  #ifdef CONFIG_SYSCTL
> -static int sysctl_sched_rr_timeslice = (MSEC_PER_SEC / HZ) * RR_TIMESLICE;
> +static int sysctl_sched_rr_timeslice = (MSEC_PER_SEC * RR_TIMESLICE) / HZ;

It looks like very old bug, from v4.11-rc1. I guess this should go to all stable
and LTS kernels.

Fixes: 975e155ed873 ("sched/rt: Show the 'sched_rr_timeslice' SCHED_RR timeslice tuning knob in milliseconds")

Reviewed-by: Petr Vorel <pvorel@suse.cz>

Kind regards,
Petr

>  static int sched_rt_handler(struct ctl_table *table, int write, void *buffer,
>  		size_t *lenp, loff_t *ppos);
>  static int sched_rr_handler(struct ctl_table *table, int write, void *buffer,

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value
  2023-07-19 10:37 ` [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value Cyril Hrubis
  2023-07-19 11:21   ` [LTP] " Petr Vorel
@ 2023-07-20  9:57   ` Mel Gorman
  2023-07-21 16:16   ` [LTP] " Petr Vorel
  2 siblings, 0 replies; 8+ messages in thread
From: Mel Gorman @ 2023-07-20  9:57 UTC (permalink / raw)
  To: Cyril Hrubis
  Cc: Ingo Molnar, Peter Zijlstra, Juri Lelli, Vincent Guittot,
	Dietmar Eggemann, Steven Rostedt, Ben Segall,
	Daniel Bristot de Oliveira, Valentin Schneider, linux-kernel, ltp,
	Jiri Bohac

On Wed, Jul 19, 2023 at 12:37:42PM +0200, Cyril Hrubis wrote:
> Thre is 10% rounding error in the intial value of the
> sysctl_sched_rr_timeslice with  CONFIG_HZ_300=y.
> 
> This was found with LTP test sched_rr_get_interval01:
> 
> sched_rr_get_interval01.c:57: TPASS: sched_rr_get_interval() passed
> sched_rr_get_interval01.c:64: TPASS: Time quantum 0s 99999990ns
> sched_rr_get_interval01.c:72: TFAIL: /proc/sys/kernel/sched_rr_timeslice_ms != 100 got 90
> sched_rr_get_interval01.c:57: TPASS: sched_rr_get_interval() passed
> sched_rr_get_interval01.c:64: TPASS: Time quantum 0s 99999990ns
> sched_rr_get_interval01.c:72: TFAIL: /proc/sys/kernel/sched_rr_timeslice_ms != 100 got 90
> 
> What this test does is to compare the return value from the
> sched_rr_get_interval() and the sched_rr_timeslice_ms sysctl file and
> fails if they do not match.
> 
> The prolem it found is the intial sysctl file value which was computed as:
> 
> static int sysctl_sched_rr_timeslice = (MSEC_PER_SEC / HZ) * RR_TIMESLICE;
> 
> which works fine as long as MSEC_PER_SEC is multiple of HZ, however it
> introduces 10% rounding error for CONFIG_HZ_300:
> 
> (MSEC_PER_SEC / HZ) * (100 * HZ / 1000)
> 
> (1000 / 300) * (100 * 300 / 1000)
> 
> 3 * 30 = 90
> 
> This can be easily fixed by reversing the order of the multiplication
> and division. After this fix we get:
> 
> (MSEC_PER_SEC * (100 * HZ / 1000)) / HZ
> 
> (1000 * (100 * 300 / 1000)) / 300
> 
> (1000 * 30) / 300 = 100
> 
> Signed-off-by: Cyril Hrubis <chrubis@suse.cz>
> CC: Jiri Bohac <jbohac@suse.cz>

Acked-by: Mel Gorman <mgorman@suse.de>

-- 
Mel Gorman
SUSE Labs

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH 2/2] sched/rt: sysctl_sched_rr_timeslice show default timeslice after reset
  2023-07-19 10:37 ` [PATCH 2/2] sched/rt: sysctl_sched_rr_timeslice show default timeslice after reset Cyril Hrubis
@ 2023-07-20 10:00   ` Mel Gorman
  2023-07-21 16:14   ` Petr Vorel
  1 sibling, 0 replies; 8+ messages in thread
From: Mel Gorman @ 2023-07-20 10:00 UTC (permalink / raw)
  To: Cyril Hrubis
  Cc: Ingo Molnar, Peter Zijlstra, Juri Lelli, Vincent Guittot,
	Dietmar Eggemann, Steven Rostedt, Ben Segall,
	Daniel Bristot de Oliveira, Valentin Schneider, linux-kernel, ltp,
	Jiri Bohac

On Wed, Jul 19, 2023 at 12:37:43PM +0200, Cyril Hrubis wrote:
> The sched_rr_timeslice can be reset to default by writing value that is
> <= 0. However after reading from this file we always got the last value
> written, which is not useful at all.
> 
> $ echo -1 > /proc/sys/kernel/sched_rr_timeslice_ms
> $ cat /proc/sys/kernel/sched_rr_timeslice_ms
> -1
> 
> Fix this by setting the variable that holds the sysctl file value to the
> jiffies_to_msecs(RR_TIMESLICE) in case that <= 0 value was written.
> 
> Signed-off-by: Cyril Hrubis <chrubis@suse.cz>
> CC: Jiri Bohac <jbohac@suse.cz>

Acked-by: Mel Gorman <mgorman@suse.de>

-- 
Mel Gorman
SUSE Labs

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH 2/2] sched/rt: sysctl_sched_rr_timeslice show default timeslice after reset
  2023-07-19 10:37 ` [PATCH 2/2] sched/rt: sysctl_sched_rr_timeslice show default timeslice after reset Cyril Hrubis
  2023-07-20 10:00   ` Mel Gorman
@ 2023-07-21 16:14   ` Petr Vorel
  1 sibling, 0 replies; 8+ messages in thread
From: Petr Vorel @ 2023-07-21 16:14 UTC (permalink / raw)
  To: chrubis
  Cc: bristot, bsegall, dietmar.eggemann, jbohac, juri.lelli,
	linux-kernel, ltp, mgorman, mingo, peterz, rostedt,
	vincent.guittot, vschneid, Petr Vorel

Reviewed-by: Petr Vorel <pvorel@suse.cz>
Tested-by: Petr Vorel <pvorel@suse.cz>

Kind regards,
Petr

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [LTP] [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value
  2023-07-19 10:37 ` [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value Cyril Hrubis
  2023-07-19 11:21   ` [LTP] " Petr Vorel
  2023-07-20  9:57   ` Mel Gorman
@ 2023-07-21 16:16   ` Petr Vorel
  2 siblings, 0 replies; 8+ messages in thread
From: Petr Vorel @ 2023-07-21 16:16 UTC (permalink / raw)
  To: chrubis
  Cc: bristot, bsegall, dietmar.eggemann, jbohac, juri.lelli,
	linux-kernel, ltp, mgorman, mingo, peterz, rostedt,
	vincent.guittot, vschneid, Petr Vorel

Tested-by: Petr Vorel <pvorel@suse.cz>

Kind regards,
Petr

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2023-07-21 16:18 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-07-19 10:37 [PATCH 0/2] Two fixes for sysctl_sched_rr_timeslice Cyril Hrubis
2023-07-19 10:37 ` [PATCH 1/2] sched/rt: Fix sysctl_sched_rr_timeslice intial value Cyril Hrubis
2023-07-19 11:21   ` [LTP] " Petr Vorel
2023-07-20  9:57   ` Mel Gorman
2023-07-21 16:16   ` [LTP] " Petr Vorel
2023-07-19 10:37 ` [PATCH 2/2] sched/rt: sysctl_sched_rr_timeslice show default timeslice after reset Cyril Hrubis
2023-07-20 10:00   ` Mel Gorman
2023-07-21 16:14   ` Petr Vorel

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox