From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id A418F1B6D18 for ; Tue, 18 Feb 2025 10:30:21 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=170.10.133.124 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1739874623; cv=none; b=Spgsan2xe6tFlqFl9ToLUfA00LWi1cfnFZbSsDefqqgCxrHGBVMlH3Suw0xy9mO5UtmkyRCM6TJEn27WZzSe85JeYW4+Z2I4z/LmPNYD3itB4hSyNHpKxG3xtmbpt3JAu+4YpfA+slHCaZregqep28LuiPizCC/8v++hexBcD+I= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1739874623; c=relaxed/simple; bh=1v1+yGs8+/N/RADI56i+Ihhjc+nbn3bPS4R/90SHXDw=; h=Date:From:To:Cc:Subject:Message-ID:References:MIME-Version: Content-Type:Content-Disposition:In-Reply-To; b=D0GZR8XdyE8/ZO19Cgf6EQFaLMcUhXqEfs4HwUpwnmjg4uESrsRNNOMCrYh8UhravbHow20HZErJNbnESj1NOMs/TTUOHPcl4fZnSMEz4nwdHiVYrI/RX4Bt5fjR3r/u3R2D0p0HYjmWdwtybUN0TlTAZN5HLgActikTbN4CJSE= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=redhat.com; spf=pass smtp.mailfrom=redhat.com; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b=UjJLc63e; arc=none smtp.client-ip=170.10.133.124 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=redhat.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=redhat.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b="UjJLc63e" DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1739874620; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=LauAAAlEvHKS7dRj8knYYOQs4SxKuXcQYbrkVAvMEW0=; b=UjJLc63es1tjDpBjdVGRKBD563OvmCfV6jWZ8YrG+aX346JKo/e1epgxwpIK9hpg27VX5i 0s/t39LywdMdfuxd3uwJAnR4x500zLffvhdkuK+Jb4vstdA0mfmdv8bnnDjym7/1OAzlAX u9fm6x4yQc9o9saufcEk3oClAmUXIyk= Received: from mail-wm1-f70.google.com (mail-wm1-f70.google.com [209.85.128.70]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-584-Wgc8nIpdP4Of9ChsjugeiQ-1; Tue, 18 Feb 2025 05:30:19 -0500 X-MC-Unique: Wgc8nIpdP4Of9ChsjugeiQ-1 X-Mimecast-MFC-AGG-ID: Wgc8nIpdP4Of9ChsjugeiQ_1739874618 Received: by mail-wm1-f70.google.com with SMTP id 5b1f17b1804b1-4394b2c19ccso42454545e9.1 for ; Tue, 18 Feb 2025 02:30:18 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1739874618; x=1740479418; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:x-gm-message-state:from:to:cc:subject:date :message-id:reply-to; bh=LauAAAlEvHKS7dRj8knYYOQs4SxKuXcQYbrkVAvMEW0=; b=lPUTnYonNShN35g/L5dS/Fch0gHkZ/Mp7kmE6DngZ4a/pT4Yiv9ZQP1Ke8Tze0YbHQ xi9OX/uv5t/d33HGXtzv/Jo4e79IMeS6ORW/urpMt51wN0lh67UECX2+BILASWc3A8pX c1vbMTMkYqMX1sOeiOPcMxlTxBv4MwroHPuJyH/ShMNvjKp2LMC2BSPPfFYBCVXt3w3O QqfZgNN/q52KHD3cUxmDVV18aFOlYwLdMiCaETp0t74AacN1WJVpVaN78zgP1+Zs6BV+ o2D5w+q/TV1JIQnh6ziFzLF0ZVs6BfSnWPXtDJZaLJ1uigJl7Kh4FN3RzCWm8+s0D5LC LXug== X-Forwarded-Encrypted: i=1; AJvYcCUO9Kblt5eC8cukiQ8RtBiBWunImXHJDM8ODbb5umS3sQ3AooMxeXHNoY9mTXxviqK7znTfKBOkcvyatA==@vger.kernel.org X-Gm-Message-State: AOJu0YwALZ7jaGIp9ZOHpCMW3ZVX6nc0PyEHtAgVXCKMERKrk2GUYQbc A0yAgy7kLjGwHnz7wrfTNsnYUKi3aVJjpWUgpLO0ZjYVEJr7tp3R2urSPsdsfQ/ey+XSOi0Pw5n +llOvL84rF8toS2JRrvsFVH7h4Ybhn8Cnhas4MdKOWsV6+1yA2/Iahke2nBoH X-Gm-Gg: ASbGncsbd3u2pFNBWtdNAEmnAk3e2ERQ6z31vS8dFcyrCB1SOmPtzJgaESm/Og78O03 38s2E9RMhHB6EtTRbhuGzg7vV3NHiTUU44LXBLVABqFzq7b+6BADOHNWRbrVhJp5JQuqeI6girI wYoijHxwKajEoH9mqMEJNDg6Wv79Npw/HkzgxiY+EMu+xTSM+fBK+Cfr9fhfFazajrbkYwdxwjD F4nUJKhn94+uzEGzUeAsD614K5X+sgfrtJd2jXmMBJPSDFF50TD6GCyoJRzaI4bwRPZF4tiMxEx kOyBzOXNgQ57oDizAfgkaB35Allt9d81MA== X-Received: by 2002:a05:600c:154c:b0:439:89d1:30dc with SMTP id 5b1f17b1804b1-43989d1328bmr59335775e9.10.1739874617864; Tue, 18 Feb 2025 02:30:17 -0800 (PST) X-Google-Smtp-Source: AGHT+IG7aSbjVJTqNJv0ShEPIOLOehhZvYvDsEaV+3YYWlbmMTitPYqsZW8oRj+7KN2zheWWGsk4LA== X-Received: by 2002:a05:600c:154c:b0:439:89d1:30dc with SMTP id 5b1f17b1804b1-43989d1328bmr59335065e9.10.1739874617351; Tue, 18 Feb 2025 02:30:17 -0800 (PST) Received: from jlelli-thinkpadt14gen4.remote.csb ([151.29.34.42]) by smtp.gmail.com with ESMTPSA id 5b1f17b1804b1-4395a04f22fsm176610635e9.5.2025.02.18.02.30.15 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 18 Feb 2025 02:30:16 -0800 (PST) Date: Tue, 18 Feb 2025 11:30:14 +0100 From: Juri Lelli To: Jon Hunter Cc: Christian Loehle , Dietmar Eggemann , Thierry Reding , Waiman Long , Tejun Heo , Johannes Weiner , Michal Koutny , Ingo Molnar , Peter Zijlstra , Vincent Guittot , Steven Rostedt , Ben Segall , Mel Gorman , Valentin Schneider , Phil Auld , Qais Yousef , Sebastian Andrzej Siewior , "Joel Fernandes (Google)" , Suleiman Souhlal , Aashish Sharma , Shin Kawamura , Vineeth Remanan Pillai , linux-kernel@vger.kernel.org, cgroups@vger.kernel.org, "linux-tegra@vger.kernel.org" Subject: Re: [PATCH v2 3/2] sched/deadline: Check bandwidth overflow earlier for hotplug Message-ID: References: <8ff19556-a656-4f11-a10c-6f9b92ec9cea@arm.com> <78f627fe-dd1e-4816-bbf3-58137fdceda6@nvidia.com> <30a8cda5-0fd0-4e47-bafe-5deefc561f0c@nvidia.com> <151884eb-ad6d-458e-a325-92cbe5b8b33f@nvidia.com> Precedence: bulk X-Mailing-List: linux-tegra@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: On 18/02/25 10:58, Juri Lelli wrote: > Hi! > > On 17/02/25 17:08, Juri Lelli wrote: > > On 14/02/25 10:05, Jon Hunter wrote: > > ... > > > At this point I believe you triggered suspend. > > > > > [ 57.290150] Freezing remaining freezable tasks completed (elapsed 0.001 seconds) > > > [ 57.335619] tegra-xusb 3530000.usb: Firmware timestamp: 2020-07-06 13:39:28 UTC > > > [ 57.353364] dwc-eth-dwmac 2490000.ethernet eth0: Link is Down > > > [ 57.397022] Disabling non-boot CPUs ... > > > > Offlining CPU5. > > > > > [ 57.400904] dl_bw_manage: cpu=5 cap=3072 fair_server_bw=52428 total_bw=209712 dl_bw_cpus=4 type=DYN span=0,3-5 > > > [ 57.400949] CPU0 attaching NULL sched-domain. > > > [ 57.415298] span=1-2 > > > [ 57.417483] __dl_sub: cpus=3 tsk_bw=52428 total_bw=157284 span=0,3-5 type=DYN > > > [ 57.417487] __dl_server_detach_root: cpu=0 rd_span=0,3-5 total_bw=157284 > > > [ 57.417496] rq_attach_root: cpu=0 old_span=NULL new_span=1-2 > > > [ 57.417501] __dl_add: cpus=3 tsk_bw=52428 total_bw=157284 span=0-2 type=DEF > > > [ 57.417504] __dl_server_attach_root: cpu=0 rd_span=0-2 total_bw=157284 > > > [ 57.417507] CPU3 attaching NULL sched-domain. > > > [ 57.454804] span=0-2 > > > [ 57.456987] __dl_sub: cpus=2 tsk_bw=52428 total_bw=104856 span=3-5 type=DYN > > > [ 57.456990] __dl_server_detach_root: cpu=3 rd_span=3-5 total_bw=104856 > > > [ 57.456998] rq_attach_root: cpu=3 old_span=NULL new_span=0-2 > > > [ 57.457000] __dl_add: cpus=4 tsk_bw=52428 total_bw=209712 span=0-3 type=DEF > > > [ 57.457003] __dl_server_attach_root: cpu=3 rd_span=0-3 total_bw=209712 > > > [ 57.457006] CPU4 attaching NULL sched-domain. > > > [ 57.493964] span=0-3 > > > [ 57.496152] __dl_sub: cpus=1 tsk_bw=52428 total_bw=52428 span=4-5 type=DYN > > > [ 57.496156] __dl_server_detach_root: cpu=4 rd_span=4-5 total_bw=52428 > > > [ 57.496162] rq_attach_root: cpu=4 old_span=NULL new_span=0-3 > > > [ 57.496165] __dl_add: cpus=5 tsk_bw=52428 total_bw=262140 span=0-4 type=DEF > > > [ 57.496168] __dl_server_attach_root: cpu=4 rd_span=0-4 total_bw=262140 > > > [ 57.496171] CPU5 attaching NULL sched-domain. > > > [ 57.532952] span=0-4 > > > [ 57.535143] rq_attach_root: cpu=5 old_span= new_span=0-4 > > > [ 57.535147] __dl_add: cpus=5 tsk_bw=52428 total_bw=314568 span=0-5 type=DEF > > > > Maybe we shouldn't add the dl_server contribution of a CPU that is going > > to be offline. > > I tried to implement this idea and ended up with the following. As usual > also pushed it to the branch on github. Could you please update and > re-test? And now for the actual change --- kernel/sched/topology.c | 27 +++++++++++++++------------ 1 file changed, 15 insertions(+), 12 deletions(-) diff --git a/kernel/sched/topology.c b/kernel/sched/topology.c index 8830acb4f1b2..c6a140d8d851 100644 --- a/kernel/sched/topology.c +++ b/kernel/sched/topology.c @@ -497,12 +497,14 @@ void rq_attach_root(struct rq *rq, struct root_domain *rd) if (rq->rd) { old_rd = rq->rd; - if (rq->fair_server.dl_server) - __dl_server_detach_root(&rq->fair_server, rq); - - if (cpumask_test_cpu(rq->cpu, old_rd->online)) + if (cpumask_test_cpu(rq->cpu, old_rd->online)) { set_rq_offline(rq); + if (rq->fair_server.dl_server) + __dl_server_detach_root(&rq->fair_server, rq); + } + + cpumask_clear_cpu(rq->cpu, old_rd->span); /* @@ -529,16 +531,17 @@ void rq_attach_root(struct rq *rq, struct root_domain *rd) } cpumask_set_cpu(rq->cpu, rd->span); - if (cpumask_test_cpu(rq->cpu, cpu_active_mask)) + if (cpumask_test_cpu(rq->cpu, cpu_active_mask)) { set_rq_online(rq); - /* - * Because the rq is not a task, dl_add_task_root_domain() did not - * move the fair server bw to the rd if it already started. - * Add it now. - */ - if (rq->fair_server.dl_server) - __dl_server_attach_root(&rq->fair_server, rq); + /* + * Because the rq is not a task, dl_add_task_root_domain() did not + * move the fair server bw to the rd if it already started. + * Add it now. + */ + if (rq->fair_server.dl_server) + __dl_server_attach_root(&rq->fair_server, rq); + } rq_unlock_irqrestore(rq, &rf);