Linux cgroups development
 help / color / mirror / Atom feed
From: Jing Wu <realwujing@gmail.com>
To: Waiman Long <longman@redhat.com>,
	Frederic Weisbecker <frederic@kernel.org>
Cc: Jing Wu <realwujing@gmail.com>, Thomas Gleixner <tglx@kernel.org>,
	linux-kernel@vger.kernel.org, rcu@vger.kernel.org,
	cgroups@vger.kernel.org, Qiliang Yuan <yuanql9@chinatelecom.cn>
Subject: Re: [PATCH-next 00/23] cgroup/cpuset: Enable runtime update of nohz_full and managed_irq CPUs
Date: Thu,  2 Jul 2026 11:39:33 +0800	[thread overview]
Message-ID: <20260702033934.984512-1-realwujing@gmail.com> (raw)
In-Reply-To: <fe35dd41-7068-4cf0-9ee9-eb9c12017b42@redhat.com>

On 7/1/26 14:56, Waiman Long wrote:
> On 7/1/26 10:22 AM, Frederic Weisbecker wrote:
> > > I know RCU support changing the nocb mask for fully offline CPUs, I
> > > will need to find out if it possible to do that for partially
> > > offline CPUs.
> > No because callbacks can still be enqueued at this stage. But we could
> > manage to make it work with CPUHP_AP_IDLE_DEAD.
>
> If we can only go as high as CPUHP_AP_IDLE_DEAD, we may as well go down
> all the way to CPUHP_OFFLINE [...] we may have to break RCU out from
> HK_TYPE_KERNEL_NOISE and add a cpuset control switch [...]

A data point from the DHM side that corroborates this.

Our RCU/nocb prototype toggled the NOCB state by fully offlining each
affected CPU, one at a time:

	remove_cpu(cpu);
	rcu_nocb_cpu_offload(cpu) / rcu_nocb_cpu_deoffload(cpu);
	add_cpu(cpu);

i.e. it went all the way to CPUHP_OFFLINE, which matches Frederic's
point that the nocb mask change needs the CPU at least at
CPUHP_AP_IDLE_DEAD - callbacks are still enqueued before that.  It worked
functionally, but it also confirmed the cost Waiman describes: each
remove_cpu() pays the stop_machine price, so doing this while another
isolated partition is running latency-sensitive work is disruptive.

We are reworking that code precisely because doing it asynchronously
raced with concurrent CPU hotplug (a TOCTOU on cpu_online() and the nocb
state), so +1 that this has to be serialized against hotplug the way
Thomas outlined.

So decoupling RCU from HK_TYPE_KERNEL_NOISE and gating the "pay the
stop_machine spike" behaviour behind an explicit cpuset switch sounds
right to us.  RCU seems to be the only kernel-noise type that needs to go
that deep; tick, managed_irq and the watchdog appear to only need the CPU
to cycle through the existing online-side callbacks, not a forced
IDLE_DEAD - please correct us if that is wrong.

On Frederic's idea of asking userspace to offline the target CPUs before
toggling isolation: that cleanly removes the kernel-internal offline for
the isolate direction.  How would you see the de-isolate direction - the
admin brings the CPU back online and the housekeeping/nocb masks are
recomputed in the online path?

Waiman, since you mentioned you have not started on RCU and it carries
the deepest hotplug constraints, we are happy to take the RCU/nocb
decoupling piece and build it on top of your CPU down/up primitives,
following Frederic's CPUHP_AP_IDLE_DEAD guidance.  That keeps the
subsystem split clean and avoids duplicating your tick/irq work.

Thanks,
Jing Wu
Qiliang Yuan

  reply	other threads:[~2026-07-02  3:39 UTC|newest]

Thread overview: 52+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-04-21  3:03 [PATCH-next 00/23] cgroup/cpuset: Enable runtime update of nohz_full and managed_irq CPUs Waiman Long
2026-04-21  3:03 ` [PATCH 01/23] sched/isolation: Add HK_TYPE_KERNEL_NOISE_BOOT & HK_TYPE_MANAGED_IRQ_BOOT Waiman Long
2026-04-21  3:03 ` [PATCH 02/23] sched/isolation: Enhance housekeeping_update() to support updating more than one HK cpumask Waiman Long
2026-04-22  6:39   ` Chen Ridong
2026-04-21  3:03 ` [PATCH 03/23] tick/nohz: Make nohz_full parameter optional Waiman Long
2026-04-21  8:32   ` Thomas Gleixner
2026-04-21 14:14     ` Waiman Long
2026-04-24 15:57       ` Frederic Weisbecker
2026-04-21  3:03 ` [PATCH 04/23] tick/nohz: Allow runtime changes in full dynticks CPUs Waiman Long
2026-04-21  8:50   ` Thomas Gleixner
2026-04-21 14:24     ` Waiman Long
2026-05-13 13:04     ` Frederic Weisbecker
2026-04-21  3:03 ` [PATCH 05/23] tick: Pass timer tick job to an online HK CPU in tick_cpu_dying() Waiman Long
2026-04-21  8:55   ` Thomas Gleixner
2026-04-21 14:22     ` Waiman Long
2026-04-21  3:03 ` [PATCH 06/23] rcu/nocbs: Allow runtime changes in RCU NOCBS cpumask Waiman Long
2026-04-21  3:03 ` [PATCH 07/23] watchdog: Sync up with runtime change of isolated CPUs Waiman Long
2026-04-21  3:03 ` [PATCH 08/23] arm64: topology: Use RCU to protect access to HK_TYPE_TICK cpumask Waiman Long
2026-04-22  9:34   ` Chen Ridong
2026-05-13 16:19   ` Frederic Weisbecker
2026-04-21  3:03 ` [PATCH 09/23] workqueue: Use RCU to protect access of HK_TYPE_TIMER cpumask Waiman Long
2026-04-21  3:03 ` [PATCH 10/23] cpu: " Waiman Long
2026-04-21  8:57   ` Thomas Gleixner
2026-04-21 14:25     ` Waiman Long
2026-04-21  3:03 ` [PATCH 11/23] hrtimer: " Waiman Long
2026-04-21  8:59   ` Thomas Gleixner
2026-04-21  3:03 ` [PATCH 12/23] net: Use boot time housekeeping cpumask settings for now Waiman Long
2026-04-21  3:03 ` [PATCH 13/23] sched/core: Use RCU to protect access of HK_TYPE_KERNEL_NOISE cpumask Waiman Long
2026-04-21  3:03 ` [PATCH 14/23] hwmon/coretemp: Use RCU to protect access of HK_TYPE_MISC cpumask Waiman Long
2026-04-21  3:03 ` [PATCH 15/23] Drivers: hv: Use RCU to protect access of HK_TYPE_MANAGED_IRQ cpumask Waiman Long
2026-04-21  3:03 ` [PATCH 16/23] genirq/cpuhotplug: " Waiman Long
2026-04-21  9:02   ` Thomas Gleixner
2026-04-21 14:29     ` Waiman Long
2026-04-21  3:03 ` [PATCH 17/23] sched/isolation: Extend housekeeping_dereference_check() to cover changes in nohz_full or manged_irqs cpumasks Waiman Long
2026-04-21  3:03 ` [PATCH 18/23] cpu/hotplug: Add a new cpuhp_offline_cb() API Waiman Long
2026-04-21 16:17   ` Thomas Gleixner
2026-04-21 17:29     ` Waiman Long
2026-04-21 18:43       ` Thomas Gleixner
2026-04-21  3:03 ` [PATCH 19/23] cgroup/cpuset: Improve check for calling housekeeping_update() Waiman Long
2026-04-23  1:10   ` Chen Ridong
2026-04-24 18:32     ` Waiman Long
2026-04-21  3:03 ` [PATCH 20/23] cgroup/cpuset: Enable runtime update of HK_TYPE_{KERNEL_NOISE,MANAGED_IRQ} cpumasks Waiman Long
2026-04-21  3:03 ` [PATCH 21/23] cgroup/cpuset: Limit the side effect of using CPU hotplug on isolated partition Waiman Long
2026-04-21  3:03 ` [PATCH 22/23] cgroup/cpuset: Prevent offline_disabled CPUs from being used in " Waiman Long
2026-04-21  3:03 ` [PATCH 23/23] cgroup/cpuset: Documentation and kselftest updates Waiman Long
2026-06-24  6:34 ` [PATCH-next 00/23] cgroup/cpuset: Enable runtime update of nohz_full and managed_irq CPUs Jing Wu
2026-06-25  5:27   ` Waiman Long
2026-07-01 14:22     ` Frederic Weisbecker
2026-07-01 18:56       ` Waiman Long
2026-07-02  3:39         ` Jing Wu [this message]
2026-07-02 15:00       ` Thomas Gleixner
2026-07-02 23:07         ` Paul E. McKenney

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20260702033934.984512-1-realwujing@gmail.com \
    --to=realwujing@gmail.com \
    --cc=cgroups@vger.kernel.org \
    --cc=frederic@kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=longman@redhat.com \
    --cc=rcu@vger.kernel.org \
    --cc=tglx@kernel.org \
    --cc=yuanql9@chinatelecom.cn \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox