From: Andrea Righi <arighi@nvidia.com>
To: Tejun Heo <tj@kernel.org>
Cc: sched-ext@lists.linux.dev, David Vernet <void@manifault.com>,
Changwoo Min <changwoo@igalia.com>,
Cheng-Yang Chou <yphbchou0911@gmail.com>,
Juntong Deng <juntong.deng@outlook.com>,
Ching-Chun Huang <jserv@ccns.ncku.edu.tw>,
Chia-Ping Tsai <chia7712@gmail.com>,
Emil Tsalapatis <emil@etsalapatis.com>,
linux-kernel@vger.kernel.org
Subject: Re: [PATCH 10/10] sched_ext: Warn on task-based SCX op recursion
Date: Fri, 10 Apr 2026 19:38:42 +0200 [thread overview]
Message-ID: <adk1ojwzqRNFmGEq@gpd4> (raw)
In-Reply-To: <20260410063046.3556100-11-tj@kernel.org>
On Thu, Apr 09, 2026 at 08:30:46PM -1000, Tejun Heo wrote:
> The kf_tasks[] design assumes task-based SCX ops don't nest - if they
> did, kf_tasks[0] would get clobbered. The old scx_kf_allow() WARN_ONCE
> caught invalid nesting via kf_mask, but that machinery is gone now.
>
> Add a WARN_ON_ONCE(current->scx.kf_tasks[0]) at the top of each
> SCX_CALL_OP_TASK*() macro. Checking kf_tasks[0] alone is sufficient: all
> three variants (SCX_CALL_OP_TASK, SCX_CALL_OP_TASK_RET,
> SCX_CALL_OP_2TASKS_RET) write to kf_tasks[0], so a non-NULL value at
> entry to any of the three means re-entry from somewhere in the family.
>
> Signed-off-by: Tejun Heo <tj@kernel.org>
Acked-by: Andrea Righi <arighi@nvidia.com>
Thanks,
-Andrea
> ---
> kernel/sched/ext.c | 7 ++++++-
> 1 file changed, 6 insertions(+), 1 deletion(-)
>
> diff --git a/kernel/sched/ext.c b/kernel/sched/ext.c
> index 27091ae075a3..99760d1fbbd4 100644
> --- a/kernel/sched/ext.c
> +++ b/kernel/sched/ext.c
> @@ -502,10 +502,13 @@ do { \
> * held by try_to_wake_up() with rq tracking via scx_rq.in_select_cpu. So if
> * kf_tasks[] is set, @p's scheduler-protected fields are stable.
> *
> - * These macros only work for non-nesting ops since kf_tasks[] is not stacked.
> + * kf_tasks[] can not stack, so task-based SCX ops must not nest. The
> + * WARN_ON_ONCE() in each macro catches a re-entry of any of the three variants
> + * while a previous one is still in progress.
> */
> #define SCX_CALL_OP_TASK(sch, op, rq, task, args...) \
> do { \
> + WARN_ON_ONCE(current->scx.kf_tasks[0]); \
> current->scx.kf_tasks[0] = task; \
> SCX_CALL_OP((sch), op, rq, task, ##args); \
> current->scx.kf_tasks[0] = NULL; \
> @@ -514,6 +517,7 @@ do { \
> #define SCX_CALL_OP_TASK_RET(sch, op, rq, task, args...) \
> ({ \
> __typeof__((sch)->ops.op(task, ##args)) __ret; \
> + WARN_ON_ONCE(current->scx.kf_tasks[0]); \
> current->scx.kf_tasks[0] = task; \
> __ret = SCX_CALL_OP_RET((sch), op, rq, task, ##args); \
> current->scx.kf_tasks[0] = NULL; \
> @@ -523,6 +527,7 @@ do { \
> #define SCX_CALL_OP_2TASKS_RET(sch, op, rq, task0, task1, args...) \
> ({ \
> __typeof__((sch)->ops.op(task0, task1, ##args)) __ret; \
> + WARN_ON_ONCE(current->scx.kf_tasks[0]); \
> current->scx.kf_tasks[0] = task0; \
> current->scx.kf_tasks[1] = task1; \
> __ret = SCX_CALL_OP_RET((sch), op, rq, task0, task1, ##args); \
> --
> 2.53.0
>
next prev parent reply other threads:[~2026-04-10 17:38 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-04-10 6:30 [PATCHSET sched_ext/for-7.1] sched_ext: Add verifier-time kfunc context filter Tejun Heo
2026-04-10 6:30 ` [PATCH 01/10] sched_ext: Drop TRACING access to select_cpu kfuncs Tejun Heo
2026-04-10 16:04 ` Andrea Righi
2026-04-10 6:30 ` [PATCH 02/10] sched_ext: Add select_cpu kfuncs to scx_kfunc_ids_unlocked Tejun Heo
2026-04-10 16:07 ` Andrea Righi
2026-04-10 17:51 ` [PATCH v2 " Tejun Heo
2026-04-10 6:30 ` [PATCH 03/10] sched_ext: Track @p's rq lock across set_cpus_allowed_scx -> ops.set_cpumask Tejun Heo
2026-04-10 16:12 ` Andrea Righi
2026-04-10 17:51 ` [PATCH v2 " Tejun Heo
2026-04-10 6:30 ` [PATCH 04/10] sched_ext: Fix ops.cgroup_move() invocation kf_mask and rq tracking Tejun Heo
2026-04-10 16:16 ` Andrea Righi
2026-04-10 17:51 ` [PATCH v2 " Tejun Heo
2026-04-10 6:30 ` [PATCH 05/10] sched_ext: Decouple kfunc unlocked-context check from kf_mask Tejun Heo
2026-04-10 16:34 ` Andrea Righi
2026-04-10 17:51 ` [PATCH v2 " Tejun Heo
2026-04-10 6:30 ` [PATCH 06/10] sched_ext: Drop redundant rq-locked check from scx_bpf_task_cgroup() Tejun Heo
2026-04-10 16:36 ` Andrea Righi
2026-04-10 6:30 ` [PATCH 07/10] sched_ext: Add verifier-time kfunc context filter Tejun Heo
2026-04-10 16:49 ` Andrea Righi
2026-04-10 6:30 ` [PATCH 08/10] sched_ext: Remove runtime kfunc mask enforcement Tejun Heo
2026-04-10 16:50 ` Andrea Righi
2026-04-10 6:30 ` [PATCH 09/10] sched_ext: Rename scx_kf_allowed_on_arg_tasks() to scx_kf_arg_task_ok() Tejun Heo
2026-04-10 16:55 ` Andrea Righi
2026-04-10 6:30 ` [PATCH 10/10] sched_ext: Warn on task-based SCX op recursion Tejun Heo
2026-04-10 17:38 ` Andrea Righi [this message]
2026-04-10 17:45 ` [PATCHSET sched_ext/for-7.1] sched_ext: Add verifier-time kfunc context filter Andrea Righi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=adk1ojwzqRNFmGEq@gpd4 \
--to=arighi@nvidia.com \
--cc=changwoo@igalia.com \
--cc=chia7712@gmail.com \
--cc=emil@etsalapatis.com \
--cc=jserv@ccns.ncku.edu.tw \
--cc=juntong.deng@outlook.com \
--cc=linux-kernel@vger.kernel.org \
--cc=sched-ext@lists.linux.dev \
--cc=tj@kernel.org \
--cc=void@manifault.com \
--cc=yphbchou0911@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox