From: Waiman Long <longman@redhat.com>
To: sashiko@lists.linux.dev
Cc: mkoutny@suse.com, tj@kernel.org, hannes@cmpxchg.org,
linux-hwmon@vger.kernel.org
Subject: Re: [PATCH 06/23] rcu/nocbs: Allow runtime changes in RCU NOCBS cpumask
Date: Wed, 22 Apr 2026 22:05:42 -0400 [thread overview]
Message-ID: <9b164822-b4f3-4a97-b602-e49c495b54b8@redhat.com> (raw)
In-Reply-To: <20260422030856.4C8BEC2BCB0@smtp.kernel.org>
On 4/21/26 11:08 PM, sashiko-bot@kernel.org wrote:
> Sashiko AI review found 1 potential issue(s):
> - [High] Dynamic RCU NOCB update attempts to de-offload online `rcu_nocbs=` CPUs, triggering a WARN_ON_ONCE and leaving isolation state inconsistent.
> --
>
> commit a58a92fdc96e43de871fd44a78e4e6da373179d5
> Author: Waiman Long <longman@redhat.com>
>
> rcu/nocbs: Allow runtime changes in RCU NOCBS cpumask
>
> This commit enables dynamic runtime updates to the RCU NO-CB CPU offloading
> state when isolated CPUs are added or removed. It introduces a new helper,
> rcu_nocb_update_cpus(), which is called directly from housekeeping_update()
> to synchronize the offloaded CPUs with the HK_TYPE_KERNEL_NOISE cpumask.
>
>> diff --git a/kernel/rcu/tree_nocb.h b/kernel/rcu/tree_nocb.h
>> index 2d06dcb61f379..b2daba1e5cb98 100644
>> --- a/kernel/rcu/tree_nocb.h
>> +++ b/kernel/rcu/tree_nocb.h
> [ ... ]
>> +void rcu_nocb_update_cpus(struct cpumask *cpumask)
>> +{
>> + int cpu, ret;
>> +
>> + if (!rcu_state.nocb_is_setup) {
>> + pr_warn_once("Dynamic RCU NOCB cannot be enabled without nohz_full/rcu_nocbs kernel boot parameter!\n");
>> + return;
>> + }
>> +
>> + for_each_cpu_andnot(cpu, cpumask, rcu_nocb_mask) {
>> + ret = rcu_nocb_cpu_offload(cpu);
>> + if (WARN_ON_ONCE(ret))
>> + return;
>> + }
>> +
>> + for_each_cpu_andnot(cpu, rcu_nocb_mask, cpumask) {
>> + ret = rcu_nocb_cpu_deoffload(cpu);
> What happens here if a system booted with explicit rcu_nocbs= CPUs, and
> later a user dynamically updates an isolated cpuset?
>
> It looks like this loop assumes the active rcu_nocb_mask should identically
> match the new isol_mask. If the boot-time rcu_nocbs= CPUs are not included
> in the new isolation mask, this loop will attempt to de-offload them.
>
> Because the cpuset subsystem is unaware of the independent rcu_nocbs= mask,
> wouldn't those original CPUs likely remain online?
>
Yes, the code should properly handle a pre-existing "rcu_nocbs" boot
option with a different CPU list. I will update the patch to add a boot
version of rcu_nocb_mask and make sure that CPUs that are in
rcu_nocb_mask_boot will not be de-offloaded.
>> + if (WARN_ON_ONCE(ret))
>> + return;
> If rcu_nocb_cpu_deoffload() is called on an online CPU, it returns -EINVAL.
> Could this unconditionally trigger this WARN_ON_ONCE and abort the rest of
> the update loop?
It should only be called for an offline CPU. The WARN_ON_ONCE() is there
to just catch unexpected bug in the code.
>
> If the loop aborts early, since housekeeping_update() already committed the
> new mask to housekeeping.cpumasks[], does this leave the system in a
> desynchronized state where the scheduler assumes CPUs are isolated but
> RCU callback offloading is not properly updated?
Any triggered WARN_ON_ONCE() is a bug that needs to be investigated and
fixed.
Cheers,
Longman
>
>> + }
>> +}
next prev parent reply other threads:[~2026-04-23 2:05 UTC|newest]
Thread overview: 66+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-04-21 3:03 [PATCH-next 00/23] cgroup/cpuset: Enable runtime update of nohz_full and managed_irq CPUs Waiman Long
2026-04-21 3:03 ` [PATCH 01/23] sched/isolation: Add HK_TYPE_KERNEL_NOISE_BOOT & HK_TYPE_MANAGED_IRQ_BOOT Waiman Long
2026-04-21 3:03 ` [PATCH 02/23] sched/isolation: Enhance housekeeping_update() to support updating more than one HK cpumask Waiman Long
2026-04-22 3:08 ` sashiko-bot
2026-04-22 6:39 ` Chen Ridong
2026-04-21 3:03 ` [PATCH 03/23] tick/nohz: Make nohz_full parameter optional Waiman Long
2026-04-21 8:32 ` Thomas Gleixner
2026-04-21 14:14 ` Waiman Long
2026-04-24 15:57 ` Frederic Weisbecker
2026-04-22 3:08 ` sashiko-bot
2026-04-21 3:03 ` [PATCH 04/23] tick/nohz: Allow runtime changes in full dynticks CPUs Waiman Long
2026-04-21 8:50 ` Thomas Gleixner
2026-04-21 14:24 ` Waiman Long
2026-05-13 13:04 ` Frederic Weisbecker
2026-04-22 3:08 ` sashiko-bot
2026-04-21 3:03 ` [PATCH 05/23] tick: Pass timer tick job to an online HK CPU in tick_cpu_dying() Waiman Long
2026-04-21 8:55 ` Thomas Gleixner
2026-04-21 14:22 ` Waiman Long
2026-04-21 3:03 ` [PATCH 06/23] rcu/nocbs: Allow runtime changes in RCU NOCBS cpumask Waiman Long
2026-04-22 3:08 ` sashiko-bot
2026-04-23 2:05 ` Waiman Long [this message]
2026-04-21 3:03 ` [PATCH 07/23] watchdog: Sync up with runtime change of isolated CPUs Waiman Long
2026-04-22 3:08 ` sashiko-bot
2026-04-23 2:14 ` Waiman Long
2026-04-21 3:03 ` [PATCH 08/23] arm64: topology: Use RCU to protect access to HK_TYPE_TICK cpumask Waiman Long
2026-04-22 3:08 ` sashiko-bot
2026-04-22 9:34 ` Chen Ridong
2026-05-13 16:19 ` Frederic Weisbecker
2026-04-21 3:03 ` [PATCH 09/23] workqueue: Use RCU to protect access of HK_TYPE_TIMER cpumask Waiman Long
2026-04-21 3:03 ` [PATCH 10/23] cpu: " Waiman Long
2026-04-21 8:57 ` Thomas Gleixner
2026-04-21 14:25 ` Waiman Long
2026-04-21 3:03 ` [PATCH 11/23] hrtimer: " Waiman Long
2026-04-21 8:59 ` Thomas Gleixner
2026-04-22 3:09 ` sashiko-bot
2026-04-21 3:03 ` [PATCH 12/23] net: Use boot time housekeeping cpumask settings for now Waiman Long
2026-04-21 3:03 ` [PATCH 13/23] sched/core: Use RCU to protect access of HK_TYPE_KERNEL_NOISE cpumask Waiman Long
2026-04-22 3:09 ` sashiko-bot
2026-04-23 14:37 ` Waiman Long
2026-04-21 3:03 ` [PATCH 14/23] hwmon/coretemp: Use RCU to protect access of HK_TYPE_MISC cpumask Waiman Long
2026-04-22 3:09 ` sashiko-bot
2026-04-21 3:03 ` [PATCH 15/23] Drivers: hv: Use RCU to protect access of HK_TYPE_MANAGED_IRQ cpumask Waiman Long
2026-04-22 3:09 ` sashiko-bot
2026-04-23 17:14 ` Waiman Long
2026-04-21 3:03 ` [PATCH 16/23] genirq/cpuhotplug: " Waiman Long
2026-04-21 9:02 ` Thomas Gleixner
2026-04-21 14:29 ` Waiman Long
2026-04-21 3:03 ` [PATCH 17/23] sched/isolation: Extend housekeeping_dereference_check() to cover changes in nohz_full or manged_irqs cpumasks Waiman Long
2026-04-22 3:09 ` sashiko-bot
2026-04-23 17:30 ` Waiman Long
2026-04-21 3:03 ` [PATCH 18/23] cpu/hotplug: Add a new cpuhp_offline_cb() API Waiman Long
2026-04-21 16:17 ` Thomas Gleixner
2026-04-21 17:29 ` Waiman Long
2026-04-21 18:43 ` Thomas Gleixner
2026-04-22 3:09 ` sashiko-bot
2026-04-21 3:03 ` [PATCH 19/23] cgroup/cpuset: Improve check for calling housekeeping_update() Waiman Long
2026-04-23 1:10 ` Chen Ridong
2026-04-24 18:32 ` Waiman Long
2026-04-21 3:03 ` [PATCH 20/23] cgroup/cpuset: Enable runtime update of HK_TYPE_{KERNEL_NOISE,MANAGED_IRQ} cpumasks Waiman Long
2026-04-22 3:09 ` sashiko-bot
2026-04-21 3:03 ` [PATCH 21/23] cgroup/cpuset: Limit the side effect of using CPU hotplug on isolated partition Waiman Long
2026-04-22 3:09 ` sashiko-bot
2026-04-21 3:03 ` [PATCH 22/23] cgroup/cpuset: Prevent offline_disabled CPUs from being used in " Waiman Long
2026-04-22 3:09 ` sashiko-bot
2026-04-21 3:03 ` [PATCH 23/23] cgroup/cpuset: Documentation and kselftest updates Waiman Long
2026-04-22 3:09 ` sashiko-bot
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=9b164822-b4f3-4a97-b602-e49c495b54b8@redhat.com \
--to=longman@redhat.com \
--cc=hannes@cmpxchg.org \
--cc=linux-hwmon@vger.kernel.org \
--cc=mkoutny@suse.com \
--cc=sashiko@lists.linux.dev \
--cc=tj@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox