public inbox for sched-ext@lists.linux.dev
 help / color / mirror / Atom feed
From: Andrea Righi <arighi@nvidia.com>
To: Tejun Heo <tj@kernel.org>
Cc: David Vernet <void@manifault.com>,
	Changwoo Min <changwoo@igalia.com>, Shuah Khan <shuah@kernel.org>,
	Ihor Solodrai <ihor.solodrai@linux.dev>,
	Emil Tsalapatis <emil@etsalapatis.com>,
	sched-ext@lists.linux.dev, linux-kselftest@vger.kernel.org,
	linux-kernel@vger.kernel.org
Subject: Re: [PATCH sched_ext/for-6.20] selftests/sched_ext: Fix init_enable_count flakiness
Date: Mon, 2 Feb 2026 21:20:57 +0100	[thread overview]
Message-ID: <aYEHKTYHBVuQflNu@gpd4> (raw)
In-Reply-To: <03982d7b5642c0ad003668178f9e5df7@kernel.org>

On Mon, Feb 02, 2026 at 09:40:22AM -1000, Tejun Heo wrote:
> The init_enable_count test is flaky. The test forks 1024 children before
> attaching the scheduler to verify that existing tasks get ops.init_task()
> called. The children were using sleep(1) before exiting.
> 
> 7900aa699c34 ("sched_ext: Fix cgroup exit ordering by moving sched_ext_free()
> to finish_task_switch()") changed when tasks are removed from scx_tasks -
> previously when the task_struct was freed, now immediately in
> finish_task_switch() when the task dies.
> 
> Before the commit, pre-forked children would linger on scx_tasks until freed
> regardless of when they exited, so the scheduler would always see them during
> iteration. The sleep(1) was unnecessary. After the commit, children are
> removed as soon as they die. The sleep(1) masks the problem in most cases but
> the test becomes flaky depending on timing.
> 
> Fix by synchronizing properly using a pipe. All children block on read() and
> the parent signals them to exit by closing the write end after attaching the
> scheduler. The children are auto-reaped so there's no need to wait on them.
> 
> Reported-by: Ihor Solodrai <ihor.solodrai@linux.dev>
> Cc: David Vernet <void@manifault.com>
> Cc: Andrea Righi <arighi@nvidia.com>
> Cc: Changwoo Min <changwoo@igalia.com>
> Cc: Emil Tsalapatis <emil@etsalapatis.com>
> Signed-off-by: Tejun Heo <tj@kernel.org>

LGTM.

Acked-by: Andrea Righi <arighi@nvidia.com>

-Andrea

> ---
>  tools/testing/selftests/sched_ext/init_enable_count.c |   34 ++++++++++++------
>  1 file changed, 23 insertions(+), 11 deletions(-)
> 
> --- a/tools/testing/selftests/sched_ext/init_enable_count.c
> +++ b/tools/testing/selftests/sched_ext/init_enable_count.c
> @@ -4,6 +4,7 @@
>   * Copyright (c) 2023 David Vernet <dvernet@meta.com>
>   * Copyright (c) 2023 Tejun Heo <tj@kernel.org>
>   */
> +#include <signal.h>
>  #include <stdio.h>
>  #include <unistd.h>
>  #include <sched.h>
> @@ -23,6 +24,9 @@ static enum scx_test_status run_test(boo
>  	int ret, i, status;
>  	struct sched_param param = {};
>  	pid_t pids[num_pre_forks];
> +	int pipe_fds[2];
> +
> +	SCX_FAIL_IF(pipe(pipe_fds) < 0, "Failed to create pipe");
> 
>  	skel = init_enable_count__open();
>  	SCX_FAIL_IF(!skel, "Failed to open");
> @@ -38,26 +42,34 @@ static enum scx_test_status run_test(boo
>  	 * ensure (at least in practical terms) that there are more tasks that
>  	 * transition from SCHED_OTHER -> SCHED_EXT than there are tasks that
>  	 * take the fork() path either below or in other processes.
> +	 *
> +	 * All children will block on read() on the pipe until the parent closes
> +	 * the write end after attaching the scheduler, which signals all of
> +	 * them to exit simultaneously. Auto-reap so we don't have to wait on
> +	 * them.
>  	 */
> +	signal(SIGCHLD, SIG_IGN);
>  	for (i = 0; i < num_pre_forks; i++) {
> -		pids[i] = fork();
> -		SCX_FAIL_IF(pids[i] < 0, "Failed to fork child");
> -		if (pids[i] == 0) {
> -			sleep(1);
> +		pid_t pid = fork();
> +
> +		SCX_FAIL_IF(pid < 0, "Failed to fork child");
> +		if (pid == 0) {
> +			char buf;
> +
> +			close(pipe_fds[1]);
> +			read(pipe_fds[0], &buf, 1);
> +			close(pipe_fds[0]);
>  			exit(0);
>  		}
>  	}
> +	close(pipe_fds[0]);
> 
>  	link = bpf_map__attach_struct_ops(skel->maps.init_enable_count_ops);
>  	SCX_FAIL_IF(!link, "Failed to attach struct_ops");
> 
> -	for (i = 0; i < num_pre_forks; i++) {
> -		SCX_FAIL_IF(waitpid(pids[i], &status, 0) != pids[i],
> -			    "Failed to wait for pre-forked child\n");
> -
> -		SCX_FAIL_IF(status != 0, "Pre-forked child %d exited with status %d\n", i,
> -			    status);
> -	}
> +	/* Signal all pre-forked children to exit. */
> +	close(pipe_fds[1]);
> +	signal(SIGCHLD, SIG_DFL);
> 
>  	bpf_link__destroy(link);
>  	SCX_GE(skel->bss->init_task_cnt, num_pre_forks);

      parent reply	other threads:[~2026-02-02 20:21 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-02-02 19:40 [PATCH sched_ext/for-6.20] selftests/sched_ext: Fix init_enable_count flakiness Tejun Heo
2026-02-02 19:49 ` Tejun Heo
2026-02-02 20:20 ` Andrea Righi [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aYEHKTYHBVuQflNu@gpd4 \
    --to=arighi@nvidia.com \
    --cc=changwoo@igalia.com \
    --cc=emil@etsalapatis.com \
    --cc=ihor.solodrai@linux.dev \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-kselftest@vger.kernel.org \
    --cc=sched-ext@lists.linux.dev \
    --cc=shuah@kernel.org \
    --cc=tj@kernel.org \
    --cc=void@manifault.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox