From: Lukasz Luba <lukasz.luba@arm.com>
To: "Rafael J. Wysocki" <rafael@kernel.org>,
Mauricio Faria de Oliveira <mfo@igalia.com>
Cc: Daniel Lezcano <daniel.lezcano@linaro.org>,
LKML <linux-kernel@vger.kernel.org>,
Linux PM <linux-pm@vger.kernel.org>
Subject: Re: [PATCH v2] thermal: core: Address thermal zone removal races with resume
Date: Mon, 30 Mar 2026 10:15:04 +0100 [thread overview]
Message-ID: <5321acf9-0de4-4b8a-811d-e1d71b152887@arm.com> (raw)
In-Reply-To: <6267615.lOV4Wx5bFT@rafael.j.wysocki>
On 3/27/26 09:49, Rafael J. Wysocki wrote:
> From: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
>
> Since thermal_zone_pm_complete() and thermal_zone_device_resume()
> re-initialize the poll_queue delayed work for the given thermal zone,
> the cancel_delayed_work_sync() in thermal_zone_device_unregister()
> may miss some already running work items and the thermal zone may
> be freed prematurely [1].
>
> There are two failing scenarios that both start with
> running thermal_pm_notify_complete() right before invoking
> thermal_zone_device_unregister() for one of the thermal zones.
>
> In the first scenario, there is a work item already running for
> the given thermal zone when thermal_pm_notify_complete() calls
> thermal_zone_pm_complete() for that thermal zone and it continues to
> run when thermal_zone_device_unregister() starts. Since the poll_queue
> delayed work has been re-initialized by thermal_pm_notify_complete(), the
> running work item will be missed by the cancel_delayed_work_sync() in
> thermal_zone_device_unregister() and if it continues to run past the
> freeing of the thermal zone object, a use-after-free will occur.
>
> In the second scenario, thermal_zone_device_resume() queued up by
> thermal_pm_notify_complete() runs right after the thermal_zone_exit()
> called by thermal_zone_device_unregister() has returned. The poll_queue
> delayed work is re-initialized by it before cancel_delayed_work_sync() is
> called by thermal_zone_device_unregister(), so it may continue to run
> after the freeing of the thermal zone object, which also leads to a
> use-after-free.
>
> Address the first failing scenario by ensuring that no thermal work
> items will be running when thermal_pm_notify_complete() is called.
> For this purpose, first move the cancel_delayed_work() call from
> thermal_zone_pm_complete() to thermal_zone_pm_prepare() to prevent
> new work from entering the workqueue going forward. Next, switch
> over to using a dedicated workqueue for thermal events and update
> the code in thermal_pm_notify() to flush that workqueue after
> thermal_pm_notify_prepare() has returned which will take care of
> all leftover thermal work already on the workqueue (that leftover
> work would do nothing useful anyway because all of the thermal zones
> have been flagged as suspended).
>
> The second failing scenario is addressed by adding a tz->state check
> to thermal_zone_device_resume() to prevent it from re-initializing
> the poll_queue delayed work if the thermal zone is going away.
>
> Note that the above changes will also facilitate relocating the suspend
> and resume of thermal zones closer to the suspend and resume of devices,
> respectively.
>
> Fixes: 5a5efdaffda5 ("thermal: core: Resume thermal zones asynchronously")
> Reported-by: Mauricio Faria de Oliveira <mfo@igalia.com>
> Closes: https://lore.kernel.org/linux-pm/20260324-thermal-core-uaf-init_delayed_work-v1-1-6611ae76a8a1@igalia.com/ [1]
> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
> ---
>
> v1 -> v2: Return -ENOMEM from thermal_init() when the workqueue allocation
> fails (Sashiko)
>
> Lukasz, Daniel, I'm quite confident about this patch, but I would appreciate
> your feedback.
My apologies for delay...
>
> ---
> drivers/thermal/thermal_core.c | 31 ++++++++++++++++++++++++++-----
> 1 file changed, 26 insertions(+), 5 deletions(-)
>
> --- a/drivers/thermal/thermal_core.c
> +++ b/drivers/thermal/thermal_core.c
> @@ -42,6 +42,8 @@ static struct thermal_governor *def_gove
>
> static bool thermal_pm_suspended;
>
> +static struct workqueue_struct *thermal_wq __ro_after_init;
> +
> /*
> * Governor section: set of functions to handle thermal governors
> *
> @@ -314,7 +316,7 @@ static void thermal_zone_device_set_poll
> if (delay > HZ)
> delay = round_jiffies_relative(delay);
>
> - mod_delayed_work(system_freezable_power_efficient_wq, &tz->poll_queue, delay);
> + mod_delayed_work(thermal_wq, &tz->poll_queue, delay);
> }
>
> static void thermal_zone_recheck(struct thermal_zone_device *tz, int error)
> @@ -1795,6 +1797,10 @@ static void thermal_zone_device_resume(s
>
> guard(thermal_zone)(tz);
>
> + /* If the thermal zone is going away, there's nothing to do. */
> + if (tz->state & TZ_STATE_FLAG_EXIT)
> + return;
> +
> tz->state &= ~(TZ_STATE_FLAG_SUSPENDED | TZ_STATE_FLAG_RESUMING);
>
> thermal_debug_tz_resume(tz);
> @@ -1825,6 +1831,9 @@ static void thermal_zone_pm_prepare(stru
> }
>
> tz->state |= TZ_STATE_FLAG_SUSPENDED;
> +
> + /* Prevent new work from getting to the workqueue subsequently. */
> + cancel_delayed_work(&tz->poll_queue);
> }
>
> static void thermal_pm_notify_prepare(void)
> @@ -1843,8 +1852,6 @@ static void thermal_zone_pm_complete(str
> {
> guard(thermal_zone)(tz);
>
> - cancel_delayed_work(&tz->poll_queue);
> -
> reinit_completion(&tz->resume);
> tz->state |= TZ_STATE_FLAG_RESUMING;
>
> @@ -1854,7 +1861,7 @@ static void thermal_zone_pm_complete(str
> */
> INIT_DELAYED_WORK(&tz->poll_queue, thermal_zone_device_resume);
> /* Queue up the work without a delay. */
> - mod_delayed_work(system_freezable_power_efficient_wq, &tz->poll_queue, 0);
> + mod_delayed_work(thermal_wq, &tz->poll_queue, 0);
> }
>
> static void thermal_pm_notify_complete(void)
> @@ -1877,6 +1884,11 @@ static int thermal_pm_notify(struct noti
> case PM_RESTORE_PREPARE:
> case PM_SUSPEND_PREPARE:
> thermal_pm_notify_prepare();
> + /*
> + * Allow any leftover thermal work items already on the
> + * worqueue to complete so they don't get in the way later.
> + */
> + flush_workqueue(thermal_wq);
> break;
> case PM_POST_HIBERNATION:
> case PM_POST_RESTORE:
> @@ -1909,9 +1921,16 @@ static int __init thermal_init(void)
> if (result)
> goto error;
>
> + thermal_wq = alloc_workqueue("thermal_events",
> + WQ_FREEZABLE | WQ_POWER_EFFICIENT | WQ_PERCPU, 0);
Why we need those workqueue instances per-cpu?
It isn't a concurrent work, only one CPU can manage the cpufreq domain.
I would drop that flag and allow to have single instance and migrate
between cpus (to not wake up the one which queued the work).
> + if (!thermal_wq) {
> + result = -ENOMEM;
> + goto unregister_netlink;
> + }
> +
> result = thermal_register_governors();
> if (result)
> - goto unregister_netlink;
> + goto destroy_workqueue;
>
> thermal_class = kzalloc_obj(*thermal_class);
> if (!thermal_class) {
> @@ -1938,6 +1957,8 @@ static int __init thermal_init(void)
>
> unregister_governors:
> thermal_unregister_governors();
> +destroy_workqueue:
> + destroy_workqueue(thermal_wq);
> unregister_netlink:
> thermal_netlink_exit();
> error:
>
>
>
next prev parent reply other threads:[~2026-03-30 9:14 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-03-27 9:49 [PATCH v2] thermal: core: Address thermal zone removal races with resume Rafael J. Wysocki
2026-03-27 14:47 ` Mauricio Faria de Oliveira
2026-03-27 14:50 ` Rafael J. Wysocki
2026-03-30 9:15 ` Lukasz Luba [this message]
2026-03-30 11:41 ` Rafael J. Wysocki
2026-03-31 9:28 ` Lukasz Luba
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5321acf9-0de4-4b8a-811d-e1d71b152887@arm.com \
--to=lukasz.luba@arm.com \
--cc=daniel.lezcano@linaro.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-pm@vger.kernel.org \
--cc=mfo@igalia.com \
--cc=rafael@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox