From: Lukasz Luba <lukasz.luba@arm.com>
To: Gaurav Kohli <gaurav.kohli@oss.qualcomm.com>
Cc: linux-arm-msm@vger.kernel.org, krzk+dt@kernel.org,
casey.connolly@linaro.org, amitk@kernel.org,
konradybcio@kernel.org, rui.zhang@intel.com,
daniel.lezcano@linaro.org, rafael@kernel.org,
conor+dt@kernel.org, andersson@kernel.org, robh@kernel.org,
devicetree@vger.kernel.org, linux-kernel@vger.kernel.org,
linux-pm@vger.kernel.org,
Amit Kucheria <amit.kucheria@oss.qualcomm.com>,
mathieu.poirier@linaro.org, mani@kernel.org
Subject: Re: [PATCH v1 1/8] thermal: Add Remote Proc cooling driver
Date: Mon, 2 Feb 2026 10:59:27 +0000 [thread overview]
Message-ID: <a27b755d-00d2-4350-98ab-0b68ab754770@arm.com> (raw)
In-Reply-To: <20251223123227.1317244-2-gaurav.kohli@oss.qualcomm.com>
Hi Gaurav,
On 12/23/25 12:32, Gaurav Kohli wrote:
> Add a new generic driver for thermal cooling devices that control
> remote processors (modem, DSP, etc.) through various communication
> channels.
>
> This driver provides an abstraction layer between the thermal
> subsystem and vendor-specific remote processor communication
> mechanisms.
Is this the patch about proposing the new cooling
device type at last LPC2025 conference (what we've discussed with Amit)?
There was some feedback asking you to add a bit more description
into this patch header, please do that (with some background as well).
>
> Suggested-by: Amit Kucheria <amit.kucheria@oss.qualcomm.com>
> Signed-off-by: Gaurav Kohli <gaurav.kohli@oss.qualcomm.com>
> ---
> MAINTAINERS | 8 ++
> drivers/thermal/Kconfig | 11 ++
> drivers/thermal/Makefile | 2 +
> drivers/thermal/remoteproc_cooling.c | 154 +++++++++++++++++++++++++++
> include/linux/remoteproc_cooling.h | 52 +++++++++
> 5 files changed, 227 insertions(+)
> create mode 100644 drivers/thermal/remoteproc_cooling.c
> create mode 100644 include/linux/remoteproc_cooling.h
>
> diff --git a/MAINTAINERS b/MAINTAINERS
> index 679e5f11e672..c1ba87315cdf 100644
> --- a/MAINTAINERS
> +++ b/MAINTAINERS
> @@ -25935,6 +25935,14 @@ F: drivers/thermal/cpufreq_cooling.c
> F: drivers/thermal/cpuidle_cooling.c
> F: include/linux/cpu_cooling.h
>
> +THERMAL/REMOTEPROC_COOLING
> +M: Gaurav Kohli <gaurav.kohli@oss.qualcomm.com>
> +L: linux-pm@vger.kernel.org
> +S: Supported
> +F: drivers/thermal/remoteproc_cooling.c
> +F: include/linux/remoteproc_cooling.h
> +
> +
> THERMAL/POWER_ALLOCATOR
> M: Lukasz Luba <lukasz.luba@arm.com>
> L: linux-pm@vger.kernel.org
> diff --git a/drivers/thermal/Kconfig b/drivers/thermal/Kconfig
> index b10080d61860..31e92be34387 100644
> --- a/drivers/thermal/Kconfig
> +++ b/drivers/thermal/Kconfig
> @@ -229,6 +229,17 @@ config PCIE_THERMAL
>
> If you want this support, you should say Y here.
>
> +
> +config REMOTEPROC_THERMAL
> + bool "Remote processor cooling support"
> + help
> + This implements a generic cooling mechanism for remote processors
> + (modem, DSP, etc.) that allows vendor-specific implementations to
> + register thermal cooling devices and provide callbacks for thermal
> + mitigation.
> +
> + If you want this support, you should say Y here.
> +
> config THERMAL_EMULATION
> bool "Thermal emulation mode support"
> help
> diff --git a/drivers/thermal/Makefile b/drivers/thermal/Makefile
> index bb21e7ea7fc6..ae747dde54fe 100644
> --- a/drivers/thermal/Makefile
> +++ b/drivers/thermal/Makefile
> @@ -34,6 +34,8 @@ thermal_sys-$(CONFIG_DEVFREQ_THERMAL) += devfreq_cooling.o
>
> thermal_sys-$(CONFIG_PCIE_THERMAL) += pcie_cooling.o
>
> +thermal_sys-$(CONFIG_REMOTEPROC_THERMAL) += remoteproc_cooling.o
> +
> obj-$(CONFIG_K3_THERMAL) += k3_bandgap.o k3_j72xx_bandgap.o
> # platform thermal drivers
> obj-y += broadcom/
> diff --git a/drivers/thermal/remoteproc_cooling.c b/drivers/thermal/remoteproc_cooling.c
> new file mode 100644
> index 000000000000..a1f948cbde0f
> --- /dev/null
> +++ b/drivers/thermal/remoteproc_cooling.c
> @@ -0,0 +1,154 @@
> +// SPDX-License-Identifier: GPL-2.0
> +/*
> + * Remote Processor Cooling Device
> + *
> + * Copyright (c) 2025, Qualcomm Innovation Center, Inc. All rights reserved.
> + */
> +
> +#include <linux/err.h>
> +#include <linux/export.h>
> +#include <linux/module.h>
> +#include <linux/mutex.h>
> +#include <linux/of.h>
> +#include <linux/slab.h>
> +#include <linux/thermal.h>
> +
> +#define REMOTEPROC_PREFIX "rproc_"
> +
> +struct remoteproc_cooling_ops {
> + int (*get_max_level)(void *devdata, unsigned long *level);
> + int (*get_cur_level)(void *devdata, unsigned long *level);
> + int (*set_cur_level)(void *devdata, unsigned long level);
> +};
1. There is no comment for struct and the functions like you did below.
2. Why you need those 3 callbacks?
It looks like they are simple wrappers on stuff in
'struct thermal_cooling_device_ops'.
Please try to get rid of them and re-use the existing fwk callbacks.
> +
> +/**
> + * struct remoteproc_cdev - Remote processor cooling device
> + * @cdev: Thermal cooling device handle
> + * @ops: Vendor-specific operation callbacks
> + * @devdata: Private data for vendor implementation
> + * @np: Device tree node associated with this cooling device
> + * @lock: Mutex to protect cooling device operations
> + */
> +struct remoteproc_cdev {
Please use the full naming:
remoteproc_cooling_device
> + struct thermal_cooling_device *cdev;
You don't need to keep it here. AFAICS it's only
used in the 'unregister' function. Please check my
comment here and then remove this pointer.
(It creates uneseccery linkage between those devices).
> + const struct remoteproc_cooling_ops *ops;
So here it can be simply:
struct thermal_cooling_device_ops cooling_ops;
> + void *devdata;
> + struct device_node *np;
This 'np' is also not used, remove it please.
> + struct mutex lock;
> +};
> +
> +
> +/* Thermal cooling device callbacks */
> +
> +static int remoteproc_get_max_state(struct thermal_cooling_device *cdev,
> + unsigned long *state)
> +{
> + struct remoteproc_cdev *rproc_cdev = cdev->devdata;
> + int ret;
> +
> + if (!rproc_cdev || !rproc_cdev->ops)
> + return -EINVAL;
This mustn't be changed in runtime accidenly. We don't guard in
cpufreq-/devfreq- cooling these callbacks that way. Please drop them.
> +
> + mutex_lock(&rproc_cdev->lock);
> + ret = rproc_cdev->ops->get_max_level(rproc_cdev->devdata, state);
> + mutex_unlock(&rproc_cdev->lock);
> +
> + return ret;
> +}
> +
> +static int remoteproc_get_cur_state(struct thermal_cooling_device *cdev,
> + unsigned long *state)
> +{
> + struct remoteproc_cdev *rproc_cdev = cdev->devdata;
> + int ret;
> +
> + if (!rproc_cdev || !rproc_cdev->ops)
> + return -EINVAL;
as above
> +
> + mutex_lock(&rproc_cdev->lock);
> + ret = rproc_cdev->ops->get_cur_level(rproc_cdev->devdata, state);
> + mutex_unlock(&rproc_cdev->lock);
> +
> + return ret;
> +}
> +
> +static int remoteproc_set_cur_state(struct thermal_cooling_device *cdev,
> + unsigned long state)
> +{
> + struct remoteproc_cdev *rproc_cdev = cdev->devdata;
> + int ret;
> +
> + if (!rproc_cdev || !rproc_cdev->ops)
> + return -EINVAL;
as above
> +
> + mutex_lock(&rproc_cdev->lock);
> + ret = rproc_cdev->ops->set_cur_level(rproc_cdev->devdata, state);
> + mutex_unlock(&rproc_cdev->lock);
> +
> + return ret;
> +}
> +
> +static const struct thermal_cooling_device_ops remoteproc_cooling_ops = {
> + .get_max_state = remoteproc_get_max_state,
> + .get_cur_state = remoteproc_get_cur_state,
> + .set_cur_state = remoteproc_set_cur_state,
> +};
> +
> +struct remoteproc_cdev *
> +remoteproc_cooling_register(struct device_node *np,
> + const char *name, const struct remoteproc_cooling_ops *ops,
> + void *devdata)
Since this is based on device_node, please align to the naming
convention from cpufreq-/devfreq-cooling and use prefix 'of_'
of_remoteproc_cooling_register()
> +{
> + struct remoteproc_cdev *rproc_cdev;
> + struct thermal_cooling_device *cdev;
> + int ret;
> +
> + if (!name || !ops) {
IMO you should check the '!np' here, not the lines below.
We can simply bail out very early.
> + return ERR_PTR(-EINVAL);
> + }
> +
> + rproc_cdev = kzalloc(sizeof(*rproc_cdev), GFP_KERNEL);
> + if (!rproc_cdev)
> + return ERR_PTR(-ENOMEM);
> +
> + rproc_cdev->ops = ops;
> + rproc_cdev->devdata = devdata;
> + rproc_cdev->np = np;
> + mutex_init(&rproc_cdev->lock);
> +
> + char *rproc_name __free(kfree) =
> + kasprintf(GFP_KERNEL, REMOTEPROC_PREFIX "%s", name);
> + /* Register with thermal framework */
> + if (np) {
Too late to test 'np'.
> + cdev = thermal_of_cooling_device_register(np, rproc_name, rproc_cdev,
> + &remoteproc_cooling_ops);
> + }
> +
> + if (IS_ERR(cdev)) {
This check should be just below the line when you get the 'cdev'
> + ret = PTR_ERR(cdev);
> + goto free_rproc_cdev;
> + }
> +
> + rproc_cdev->cdev = cdev;
> +
> + return rproc_cdev;
> +
> +free_rproc_cdev:
> + kfree(rproc_cdev);
> + return ERR_PTR(ret);
> +}
> +EXPORT_SYMBOL_GPL(remoteproc_cooling_register);
> +
> +void remoteproc_cooling_unregister(struct remoteproc_cdev *rproc_cdev)
Change the API to be alined with cpufreq-cooling and devfreq-cooling
types of devices, so:
void remoteproc_cooling_unregister(struct thermal_cooling_device *cdev)
You still should be able to get the rptoc_cdev like:
rproc_cdev = cdev->devdata;
and free it.
> +{
> + if (!rproc_cdev)
> + return;
> +
> + thermal_cooling_device_unregister(rproc_cdev->cdev);
> + mutex_destroy(&rproc_cdev->lock);
> + kfree(rproc_cdev);
> +}
> +EXPORT_SYMBOL_GPL(remoteproc_cooling_unregister);
> +
> +MODULE_LICENSE("GPL");
> +MODULE_DESCRIPTION("Remote Processor Cooling Device");
> diff --git a/include/linux/remoteproc_cooling.h b/include/linux/remoteproc_cooling.h
> new file mode 100644
> index 000000000000..ef94019d220d
> --- /dev/null
> +++ b/include/linux/remoteproc_cooling.h
> @@ -0,0 +1,52 @@
> +/* SPDX-License-Identifier: GPL-2.0 */
> +/*
> + * Remote Processor Cooling Device
> + *
> + * Copyright (c) 2025, Qualcomm Innovation Center
> + */
> +
> +#ifndef __REMOTEPROC_COOLING_H__
> +#define __REMOTEPROC_COOLING_H__
> +
> +#include <linux/thermal.h>
> +
> +struct device;
> +struct device_node;
> +
> +struct remoteproc_cooling_ops {
> + int (*get_max_level)(void *devdata, unsigned long *level);
> + int (*get_cur_level)(void *devdata, unsigned long *level);
> + int (*set_cur_level)(void *devdata, unsigned long level);
> +};
That duplicate w/ .c file content.
We don't need this in the header, please follow the cpufreq-/devfreq-
design.
> +
> +struct remoteproc_cdev;
> +
> +#ifdef CONFIG_REMOTEPROC_THERMAL
> +
> +struct remoteproc_cdev *
> +remoteproc_cooling_register(struct device_node *np,
> + const char *name,
> + const struct remoteproc_cooling_ops *ops,
> + void *devdata);
> +
> +void remoteproc_cooling_unregister(struct remoteproc_cdev *rproc_cdev);
> +
> +#else /* !CONFIG_REMOTEPROC_THERMAL */
> +
> +static inline struct remoteproc_cdev *
> +remoteproc_cooling_register(struct device_node *np,
> + const char *name,
> + const struct remoteproc_cooling_ops *ops,
> + void *devdata)
> +{
> + return ERR_PTR(-EINVAL);
> +}
Function naming convention here as well
Regards,
Lukasz
next prev parent reply other threads:[~2026-02-02 10:59 UTC|newest]
Thread overview: 67+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-12-23 12:32 [PATCH v1 0/8] Add RemoteProc cooling support Gaurav Kohli
2025-12-23 12:32 ` [PATCH v1 1/8] thermal: Add Remote Proc cooling driver Gaurav Kohli
2025-12-23 19:23 ` Dmitry Baryshkov
2025-12-24 8:20 ` Gaurav Kohli
2026-01-03 15:05 ` Bjorn Andersson
2026-01-05 5:18 ` Gaurav Kohli
2026-01-08 11:59 ` Zhongqiu Han
2026-01-29 5:40 ` Gaurav Kohli
2026-02-02 10:59 ` Lukasz Luba [this message]
2026-02-09 5:28 ` Gaurav Kohli
2026-02-09 10:21 ` Lukasz Luba
2025-12-23 12:32 ` [PATCH v1 2/8] remoteproc: qcom: probe all child devices Gaurav Kohli
2025-12-23 19:26 ` Dmitry Baryshkov
2026-01-03 14:56 ` Bjorn Andersson
2026-01-08 7:07 ` Gaurav Kohli
2026-01-23 13:53 ` Gaurav Kohli
2026-01-23 19:03 ` Dmitry Baryshkov
2026-01-27 16:12 ` Gaurav Kohli
2026-01-27 16:41 ` Dmitry Baryshkov
2026-01-28 9:39 ` Gaurav Kohli
2026-01-28 9:45 ` Konrad Dybcio
2026-01-30 7:03 ` Gaurav Kohli
2026-01-30 9:13 ` Konrad Dybcio
2026-01-31 8:06 ` Dmitry Baryshkov
2026-01-31 10:11 ` Gaurav Kohli
2026-01-31 11:40 ` Dmitry Baryshkov
2026-01-31 11:45 ` Gaurav Kohli
2025-12-23 12:32 ` [PATCH v1 3/8] dt-bindings: thermal: Add qcom,qmi-cooling yaml bindings Gaurav Kohli
2025-12-23 13:59 ` Rob Herring (Arm)
2025-12-24 8:20 ` Gaurav Kohli
2025-12-23 19:30 ` Dmitry Baryshkov
2025-12-24 8:24 ` Gaurav Kohli
2025-12-24 9:31 ` Dmitry Baryshkov
2026-01-03 15:08 ` Bjorn Andersson
2025-12-23 19:52 ` Dmitry Baryshkov
2025-12-24 8:57 ` Krzysztof Kozlowski
2025-12-24 10:08 ` Gaurav Kohli
2025-12-24 10:24 ` Krzysztof Kozlowski
2025-12-31 6:42 ` Gaurav Kohli
2025-12-31 7:35 ` Krzysztof Kozlowski
2025-12-31 7:47 ` Dmitry Baryshkov
2025-12-31 7:52 ` Gaurav Kohli
2025-12-31 7:55 ` Dmitry Baryshkov
2025-12-24 9:02 ` Krzysztof Kozlowski
2025-12-31 11:59 ` Konrad Dybcio
2026-01-08 8:43 ` Gaurav Kohli
2025-12-23 12:32 ` [PATCH v1 4/8] thermal: qcom: add qmi-cooling driver Gaurav Kohli
2025-12-23 19:49 ` Dmitry Baryshkov
2025-12-31 6:28 ` Gaurav Kohli
2025-12-31 6:33 ` Dmitry Baryshkov
2025-12-24 9:01 ` Krzysztof Kozlowski
2025-12-31 6:32 ` Gaurav Kohli
2025-12-23 12:32 ` [PATCH v1 5/8] arm64: dts: qcom: Enable cdsp qmi tmd devices for lemans Gaurav Kohli
2025-12-23 19:32 ` Dmitry Baryshkov
2025-12-23 12:32 ` [PATCH v1 6/8] arm64: dts: qcom: Enable cdsp qmi tmd devices for talos Gaurav Kohli
2026-01-03 15:13 ` Bjorn Andersson
2025-12-23 12:32 ` [PATCH v1 7/8] arm64: dts: qcom: Enable cdsp qmi tmd devices for kodiak Gaurav Kohli
2026-01-03 15:14 ` Bjorn Andersson
2025-12-23 12:32 ` [PATCH v1 8/8] arm64: dts: qcom: Enable cdsp qmi tmd devices for monaco Gaurav Kohli
2025-12-24 8:58 ` Krzysztof Kozlowski
2025-12-24 10:11 ` Gaurav Kohli
2026-01-10 16:13 ` [PATCH v1 0/8] Add RemoteProc cooling support Casey Connolly
2026-01-13 9:33 ` Gaurav Kohli
2026-02-01 20:20 ` Trilok Soni
2026-02-02 9:53 ` Konrad Dybcio
2026-02-09 10:22 ` Gaurav Kohli
2026-02-09 5:33 ` Gaurav Kohli
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=a27b755d-00d2-4350-98ab-0b68ab754770@arm.com \
--to=lukasz.luba@arm.com \
--cc=amit.kucheria@oss.qualcomm.com \
--cc=amitk@kernel.org \
--cc=andersson@kernel.org \
--cc=casey.connolly@linaro.org \
--cc=conor+dt@kernel.org \
--cc=daniel.lezcano@linaro.org \
--cc=devicetree@vger.kernel.org \
--cc=gaurav.kohli@oss.qualcomm.com \
--cc=konradybcio@kernel.org \
--cc=krzk+dt@kernel.org \
--cc=linux-arm-msm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-pm@vger.kernel.org \
--cc=mani@kernel.org \
--cc=mathieu.poirier@linaro.org \
--cc=rafael@kernel.org \
--cc=robh@kernel.org \
--cc=rui.zhang@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox