All of lore.kernel.org
 help / color / mirror / Atom feed
From: Andrea Righi <arighi@nvidia.com>
To: Juri Lelli <juri.lelli@redhat.com>
Cc: Ingo Molnar <mingo@redhat.com>,
	Peter Zijlstra <peterz@infradead.org>,
	Vincent Guittot <vincent.guittot@linaro.org>,
	Dietmar Eggemann <dietmar.eggemann@arm.com>,
	Steven Rostedt <rostedt@goodmis.org>,
	Ben Segall <bsegall@google.com>, Mel Gorman <mgorman@suse.de>,
	Valentin Schneider <vschneid@redhat.com>,
	Joel Fernandes <joelagnelf@nvidia.com>, Tejun Heo <tj@kernel.org>,
	David Vernet <void@manifault.com>,
	Changwoo Min <changwoo@igalia.com>, Shuah Khan <shuah@kernel.org>,
	sched-ext@lists.linux.dev, bpf@vger.kernel.org,
	linux-kernel@vger.kernel.org,
	Luca Abeni <luca.abeni@santannapisa.it>,
	Yuri Andriaccio <yurand2000@gmail.com>
Subject: Re: [PATCH 05/16] sched/deadline: Return EBUSY if dl_bw_cpus is zero
Date: Wed, 3 Sep 2025 17:24:45 +0200	[thread overview]
Message-ID: <aLhdvY1D_RZF_ahw@gpd4> (raw)
In-Reply-To: <aLhbhv1oiwxQ2E6b@jlelli-thinkpadt14gen4.remote.csb>

On Wed, Sep 03, 2025 at 05:15:18PM +0200, Juri Lelli wrote:
> On 03/09/25 17:10, Andrea Righi wrote:
> > On Wed, Sep 03, 2025 at 04:53:59PM +0200, Juri Lelli wrote:
> > > Hi,
> > > 
> > > On 03/09/25 11:33, Andrea Righi wrote:
> > > > From: Joel Fernandes <joelagnelf@nvidia.com>
> > > > 
> > > > Hotplugged CPUs coming online do an enqueue but are not a part of any
> > > > root domain containing cpu_active() CPUs. So in this case, don't mess
> > > > with accounting and we can retry later. Without this patch, we see
> > > > crashes with sched_ext selftest's hotplug test due to divide by zero.
> > > > 
> > > > Signed-off-by: Joel Fernandes <joelagnelf@nvidia.com>
> > > > ---
> > > >  kernel/sched/deadline.c | 7 ++++++-
> > > >  1 file changed, 6 insertions(+), 1 deletion(-)
> > > > 
> > > > diff --git a/kernel/sched/deadline.c b/kernel/sched/deadline.c
> > > > index 3c478a1b2890d..753e50b1e86fc 100644
> > > > --- a/kernel/sched/deadline.c
> > > > +++ b/kernel/sched/deadline.c
> > > > @@ -1689,7 +1689,12 @@ int dl_server_apply_params(struct sched_dl_entity *dl_se, u64 runtime, u64 perio
> > > >  	cpus = dl_bw_cpus(cpu);
> > > >  	cap = dl_bw_capacity(cpu);
> > > >  
> > > > -	if (__dl_overflow(dl_b, cap, old_bw, new_bw))
> > > > +	/*
> > > > +	 * Hotplugged CPUs coming online do an enqueue but are not a part of any
> > > > +	 * root domain containing cpu_active() CPUs. So in this case, don't mess
> > > > +	 * with accounting and we can retry later.
> > > > +	 */
> > > > +	if (!cpus || __dl_overflow(dl_b, cap, old_bw, new_bw))
> > > >  		return -EBUSY;
> > > >  
> > > >  	if (init) {
> > > 
> > > Yuri is proposing to ignore dl-servers bandwidth contribution from
> > > admission control (as they essentially operate on the remaining
> > > bandwidth portion not available to RT/DEADLINE tasks):
> > > 
> > > https://lore.kernel.org/lkml/20250903114448.664452-1-yurand2000@gmail.com/
> > > 
> > > His patch should make this patch not required. Would you be able and
> > > willing to test this assumption?
> > 
> > I'll run some tests with Yuri's patch applied and dropping this one (and we
> > may also need to drop "[PATCH 10/16] sched/deadline: Account ext server
> > bandwidth").
> 
> Please mind that Yuri's change is still under discussion! :))
> 
> I just wanted to mention it here as it might change how we account for
> dl-servers if we decide to go that way.

That's fine, I've already done a quick test. :)

It seems to work (more or less), meaning that in case of RT/sched_ext
contention the sched_ext tasks seem to get the right amount of CPU
bandwidth (5%), but the total_bw kselftest is quite broken and it's always
reporting a bw value of 0... in any case, even if we go this way there's no
major disruption apparently.

-Andrea

  reply	other threads:[~2025-09-03 15:24 UTC|newest]

Thread overview: 38+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-09-03  9:33 [PATCHSET v8 sched_ext/for-6.18] Add a deadline server for sched_ext tasks Andrea Righi
2025-09-03  9:33 ` [PATCH 01/16] sched_ext: Exit early on hotplug events during attach Andrea Righi
2025-09-03 19:44   ` Tejun Heo
2025-09-03 21:40     ` Andrea Righi
2025-09-03  9:33 ` [PATCH 02/16] sched/debug: Fix updating of ppos on server write ops Andrea Righi
2025-09-03  9:33 ` [PATCH 03/16] sched/debug: Stop and start server based on if it was active Andrea Righi
2025-09-03 14:43   ` Juri Lelli
2025-09-03 15:02     ` Andrea Righi
2025-09-03  9:33 ` [PATCH 04/16] sched/deadline: Clear the defer params Andrea Righi
2025-09-03 14:44   ` Juri Lelli
2025-09-03  9:33 ` [PATCH 05/16] sched/deadline: Return EBUSY if dl_bw_cpus is zero Andrea Righi
2025-09-03 14:53   ` Juri Lelli
2025-09-03 15:10     ` Andrea Righi
2025-09-03 15:15       ` Juri Lelli
2025-09-03 15:24         ` Andrea Righi [this message]
2025-09-03 20:05     ` Peter Zijlstra
2025-09-04  7:12       ` luca abeni
2025-09-04  7:17       ` Juri Lelli
2025-09-03  9:33 ` [PATCH 06/16] sched: Add a server arg to dl_server_update_idle_time() Andrea Righi
2025-09-03  9:33 ` [PATCH 07/16] sched_ext: Add a DL server for sched_ext tasks Andrea Righi
2025-09-03 19:54   ` Tejun Heo
2025-09-03 20:08     ` Peter Zijlstra
2025-09-03 20:41       ` Tejun Heo
2025-09-03 20:56         ` Peter Zijlstra
2025-09-04 20:28           ` Peter Zijlstra
2025-09-04 21:43             ` Tejun Heo
2025-09-04 22:02               ` Peter Zijlstra
2025-09-10 16:01               ` Peter Zijlstra
2025-09-03 21:33         ` Andrea Righi
2025-09-03  9:33 ` [PATCH 08/16] sched/debug: Add support to change sched_ext server params Andrea Righi
2025-09-03  9:33 ` [PATCH 09/16] sched/deadline: Add support to remove DL server's bandwidth contribution Andrea Righi
2025-09-03  9:33 ` [PATCH 10/16] sched/deadline: Account ext server bandwidth Andrea Righi
2025-09-03  9:33 ` [PATCH 11/16] sched/deadline: Allow to initialize DL server when needed Andrea Righi
2025-09-03  9:33 ` [PATCH 12/16] sched_ext: Selectively enable ext and fair DL servers Andrea Righi
2025-09-03  9:33 ` [PATCH 13/16] sched/deadline: Fix DL server crash in inactive_timer callback Andrea Righi
2025-09-03  9:33 ` [PATCH 14/16] sched/deadline: De-couple balance and pick_task Andrea Righi
2025-09-03  9:33 ` [PATCH 15/16] selftests/sched_ext: Add test for sched_ext dl_server Andrea Righi
2025-09-03  9:33 ` [PATCH 16/16] selftests/sched_ext: Add test for DL server total_bw consistency Andrea Righi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aLhdvY1D_RZF_ahw@gpd4 \
    --to=arighi@nvidia.com \
    --cc=bpf@vger.kernel.org \
    --cc=bsegall@google.com \
    --cc=changwoo@igalia.com \
    --cc=dietmar.eggemann@arm.com \
    --cc=joelagnelf@nvidia.com \
    --cc=juri.lelli@redhat.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=luca.abeni@santannapisa.it \
    --cc=mgorman@suse.de \
    --cc=mingo@redhat.com \
    --cc=peterz@infradead.org \
    --cc=rostedt@goodmis.org \
    --cc=sched-ext@lists.linux.dev \
    --cc=shuah@kernel.org \
    --cc=tj@kernel.org \
    --cc=vincent.guittot@linaro.org \
    --cc=void@manifault.com \
    --cc=vschneid@redhat.com \
    --cc=yurand2000@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.