* Re: [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails
2023-01-18 8:38 ` [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails Viresh Kumar
@ 2023-01-18 19:57 ` Frank Rowand
2023-01-18 19:58 ` Rafael J. Wysocki
` (2 subsequent siblings)
3 siblings, 0 replies; 13+ messages in thread
From: Frank Rowand @ 2023-01-18 19:57 UTC (permalink / raw)
To: Viresh Kumar, Rafael J. Wysocki, Daniel Lezcano, Amit Kucheria,
Zhang Rui, Yang Yingliang
Cc: linux-pm, Vincent Guittot, Caleb Connolly, Rafael J. Wysocki,
linux-kernel
On 1/18/23 02:38, Viresh Kumar wrote:
> put_device() shouldn't be called before a prior call to
> device_register(). __thermal_cooling_device_register() doesn't follow
> that properly and needs fixing. Also
> thermal_cooling_device_destroy_sysfs() is getting called unnecessarily
> on few error paths.
>
> Fix all this by placing the calls at the right place.
>
> Based on initial work done by Caleb Connolly.
>
> Fixes: 4748f9687caa ("thermal: core: fix some possible name leaks in error paths")
> Fixes: c408b3d1d9bb ("thermal: Validate new state in cur_state_store()")
> Reported-by: Caleb Connolly <caleb.connolly@linaro.org>
> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org>
> ---
> For v6.2-rc.
>
> V3->V4:
> - The first three versions were sent by Caleb.
> - The new version fixes the current bugs, without looking to optimize the
> code any further, which is done separately in the next two patches.
>
> drivers/thermal/thermal_core.c | 13 ++++++++++---
> 1 file changed, 10 insertions(+), 3 deletions(-)
>
> diff --git a/drivers/thermal/thermal_core.c b/drivers/thermal/thermal_core.c
> index f17ab2316dbd..77bd47d976a2 100644
> --- a/drivers/thermal/thermal_core.c
> +++ b/drivers/thermal/thermal_core.c
> @@ -909,15 +909,20 @@ __thermal_cooling_device_register(struct device_node *np,
> cdev->devdata = devdata;
>
> ret = cdev->ops->get_max_state(cdev, &cdev->max_state);
> - if (ret)
> - goto out_kfree_type;
> + if (ret) {
> + kfree(cdev->type);
> + goto out_ida_remove;
> + }
>
> thermal_cooling_device_setup_sysfs(cdev);
> +
> ret = dev_set_name(&cdev->device, "cooling_device%d", cdev->id);
> if (ret) {
> + kfree(cdev->type);
> thermal_cooling_device_destroy_sysfs(cdev);
> - goto out_kfree_type;
> + goto out_ida_remove;
> }
> +
> ret = device_register(&cdev->device);
> if (ret)
> goto out_kfree_type;
> @@ -943,6 +948,8 @@ __thermal_cooling_device_register(struct device_node *np,
> thermal_cooling_device_destroy_sysfs(cdev);
> kfree(cdev->type);
> put_device(&cdev->device);
> +
> + /* thermal_release() takes care of the rest */
> cdev = NULL;
> out_ida_remove:
> ida_free(&thermal_cdev_ida, id);
My testing:
Applied on top of v6.2-rc1
The configuration is qcom_defconfig
The system is a Qualcomm Dragon 8074
The two WARNING stack traces no longer occur after applying the patch.
Tested-by: Frank Rowand <frowand.list@gmail.com>
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails
2023-01-18 8:38 ` [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails Viresh Kumar
2023-01-18 19:57 ` Frank Rowand
@ 2023-01-18 19:58 ` Rafael J. Wysocki
2023-01-19 5:16 ` Viresh Kumar
2023-01-19 8:13 ` Yang Yingliang
2023-01-19 15:02 ` Caleb Connolly
3 siblings, 1 reply; 13+ messages in thread
From: Rafael J. Wysocki @ 2023-01-18 19:58 UTC (permalink / raw)
To: Viresh Kumar
Cc: Rafael J. Wysocki, Daniel Lezcano, Amit Kucheria, Zhang Rui,
Yang Yingliang, linux-pm, Vincent Guittot, Caleb Connolly,
Rafael J. Wysocki, linux-kernel
On Wed, Jan 18, 2023 at 9:38 AM Viresh Kumar <viresh.kumar@linaro.org> wrote:
>
> put_device() shouldn't be called before a prior call to
> device_register(). __thermal_cooling_device_register() doesn't follow
> that properly and needs fixing. Also
> thermal_cooling_device_destroy_sysfs() is getting called unnecessarily
> on few error paths.
>
> Fix all this by placing the calls at the right place.
>
> Based on initial work done by Caleb Connolly.
>
> Fixes: 4748f9687caa ("thermal: core: fix some possible name leaks in error paths")
> Fixes: c408b3d1d9bb ("thermal: Validate new state in cur_state_store()")
> Reported-by: Caleb Connolly <caleb.connolly@linaro.org>
> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org>
OK, so I think that this patch is needed for 6.2 and the other two may
be queued up for later (they do depend on this one, though, of
course). Is my understanding correct?
> ---
> For v6.2-rc.
>
> V3->V4:
> - The first three versions were sent by Caleb.
> - The new version fixes the current bugs, without looking to optimize the
> code any further, which is done separately in the next two patches.
>
> drivers/thermal/thermal_core.c | 13 ++++++++++---
> 1 file changed, 10 insertions(+), 3 deletions(-)
>
> diff --git a/drivers/thermal/thermal_core.c b/drivers/thermal/thermal_core.c
> index f17ab2316dbd..77bd47d976a2 100644
> --- a/drivers/thermal/thermal_core.c
> +++ b/drivers/thermal/thermal_core.c
> @@ -909,15 +909,20 @@ __thermal_cooling_device_register(struct device_node *np,
> cdev->devdata = devdata;
>
> ret = cdev->ops->get_max_state(cdev, &cdev->max_state);
> - if (ret)
> - goto out_kfree_type;
> + if (ret) {
> + kfree(cdev->type);
> + goto out_ida_remove;
> + }
>
> thermal_cooling_device_setup_sysfs(cdev);
> +
> ret = dev_set_name(&cdev->device, "cooling_device%d", cdev->id);
> if (ret) {
> + kfree(cdev->type);
> thermal_cooling_device_destroy_sysfs(cdev);
> - goto out_kfree_type;
> + goto out_ida_remove;
> }
> +
> ret = device_register(&cdev->device);
> if (ret)
> goto out_kfree_type;
> @@ -943,6 +948,8 @@ __thermal_cooling_device_register(struct device_node *np,
> thermal_cooling_device_destroy_sysfs(cdev);
> kfree(cdev->type);
> put_device(&cdev->device);
> +
> + /* thermal_release() takes care of the rest */
> cdev = NULL;
> out_ida_remove:
> ida_free(&thermal_cdev_ida, id);
> --
> 2.31.1.272.g89b43f80a514
>
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails
2023-01-18 19:58 ` Rafael J. Wysocki
@ 2023-01-19 5:16 ` Viresh Kumar
2023-01-19 20:09 ` Rafael J. Wysocki
0 siblings, 1 reply; 13+ messages in thread
From: Viresh Kumar @ 2023-01-19 5:16 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Daniel Lezcano, Amit Kucheria, Zhang Rui, Yang Yingliang,
linux-pm, Vincent Guittot, Caleb Connolly, Rafael J. Wysocki,
linux-kernel
On 18-01-23, 20:58, Rafael J. Wysocki wrote:
> On Wed, Jan 18, 2023 at 9:38 AM Viresh Kumar <viresh.kumar@linaro.org> wrote:
> >
> > put_device() shouldn't be called before a prior call to
> > device_register(). __thermal_cooling_device_register() doesn't follow
> > that properly and needs fixing. Also
> > thermal_cooling_device_destroy_sysfs() is getting called unnecessarily
> > on few error paths.
> >
> > Fix all this by placing the calls at the right place.
> >
> > Based on initial work done by Caleb Connolly.
> >
> > Fixes: 4748f9687caa ("thermal: core: fix some possible name leaks in error paths")
> > Fixes: c408b3d1d9bb ("thermal: Validate new state in cur_state_store()")
> > Reported-by: Caleb Connolly <caleb.connolly@linaro.org>
> > Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org>
>
> OK, so I think that this patch is needed for 6.2 and the other two may
> be queued up for later (they do depend on this one, though, of
> course). Is my understanding correct?
Right.
--
viresh
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails
2023-01-19 5:16 ` Viresh Kumar
@ 2023-01-19 20:09 ` Rafael J. Wysocki
2023-01-24 19:26 ` Rafael J. Wysocki
0 siblings, 1 reply; 13+ messages in thread
From: Rafael J. Wysocki @ 2023-01-19 20:09 UTC (permalink / raw)
To: Viresh Kumar
Cc: Rafael J. Wysocki, Daniel Lezcano, Amit Kucheria, Zhang Rui,
Yang Yingliang, linux-pm, Vincent Guittot, Caleb Connolly,
Rafael J. Wysocki, linux-kernel
On Thu, Jan 19, 2023 at 6:16 AM Viresh Kumar <viresh.kumar@linaro.org> wrote:
>
> On 18-01-23, 20:58, Rafael J. Wysocki wrote:
> > On Wed, Jan 18, 2023 at 9:38 AM Viresh Kumar <viresh.kumar@linaro.org> wrote:
> > >
> > > put_device() shouldn't be called before a prior call to
> > > device_register(). __thermal_cooling_device_register() doesn't follow
> > > that properly and needs fixing. Also
> > > thermal_cooling_device_destroy_sysfs() is getting called unnecessarily
> > > on few error paths.
> > >
> > > Fix all this by placing the calls at the right place.
> > >
> > > Based on initial work done by Caleb Connolly.
> > >
> > > Fixes: 4748f9687caa ("thermal: core: fix some possible name leaks in error paths")
> > > Fixes: c408b3d1d9bb ("thermal: Validate new state in cur_state_store()")
> > > Reported-by: Caleb Connolly <caleb.connolly@linaro.org>
> > > Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org>
> >
> > OK, so I think that this patch is needed for 6.2 and the other two may
> > be queued up for later (they do depend on this one, though, of
> > course). Is my understanding correct?
>
> Right.
OK, applied as 6.2-rc material and I'll get to the other two when this goes in.
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails
2023-01-19 20:09 ` Rafael J. Wysocki
@ 2023-01-24 19:26 ` Rafael J. Wysocki
0 siblings, 0 replies; 13+ messages in thread
From: Rafael J. Wysocki @ 2023-01-24 19:26 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Viresh Kumar, Daniel Lezcano, Amit Kucheria, Zhang Rui,
Yang Yingliang, linux-pm, Vincent Guittot, Caleb Connolly,
Rafael J. Wysocki, linux-kernel
On Thu, Jan 19, 2023 at 9:09 PM Rafael J. Wysocki <rafael@kernel.org> wrote:
>
> On Thu, Jan 19, 2023 at 6:16 AM Viresh Kumar <viresh.kumar@linaro.org> wrote:
> >
> > On 18-01-23, 20:58, Rafael J. Wysocki wrote:
> > > On Wed, Jan 18, 2023 at 9:38 AM Viresh Kumar <viresh.kumar@linaro.org> wrote:
> > > >
> > > > put_device() shouldn't be called before a prior call to
> > > > device_register(). __thermal_cooling_device_register() doesn't follow
> > > > that properly and needs fixing. Also
> > > > thermal_cooling_device_destroy_sysfs() is getting called unnecessarily
> > > > on few error paths.
> > > >
> > > > Fix all this by placing the calls at the right place.
> > > >
> > > > Based on initial work done by Caleb Connolly.
> > > >
> > > > Fixes: 4748f9687caa ("thermal: core: fix some possible name leaks in error paths")
> > > > Fixes: c408b3d1d9bb ("thermal: Validate new state in cur_state_store()")
> > > > Reported-by: Caleb Connolly <caleb.connolly@linaro.org>
> > > > Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org>
> > >
> > > OK, so I think that this patch is needed for 6.2 and the other two may
> > > be queued up for later (they do depend on this one, though, of
> > > course). Is my understanding correct?
> >
> > Right.
>
> OK, applied as 6.2-rc material and I'll get to the other two when this goes in.
Patches [2-3/3] from this series have been applied as 6.3 material now, thanks!
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails
2023-01-18 8:38 ` [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails Viresh Kumar
2023-01-18 19:57 ` Frank Rowand
2023-01-18 19:58 ` Rafael J. Wysocki
@ 2023-01-19 8:13 ` Yang Yingliang
2023-01-19 15:02 ` Caleb Connolly
3 siblings, 0 replies; 13+ messages in thread
From: Yang Yingliang @ 2023-01-19 8:13 UTC (permalink / raw)
To: Viresh Kumar, Rafael J. Wysocki, Daniel Lezcano, Amit Kucheria,
Zhang Rui
Cc: linux-pm, Vincent Guittot, Caleb Connolly, Rafael J. Wysocki,
linux-kernel
On 2023/1/18 16:38, Viresh Kumar wrote:
> put_device() shouldn't be called before a prior call to
> device_register(). __thermal_cooling_device_register() doesn't follow
> that properly and needs fixing. Also
> thermal_cooling_device_destroy_sysfs() is getting called unnecessarily
> on few error paths.
>
> Fix all this by placing the calls at the right place.
>
> Based on initial work done by Caleb Connolly.
>
> Fixes: 4748f9687caa ("thermal: core: fix some possible name leaks in error paths")
> Fixes: c408b3d1d9bb ("thermal: Validate new state in cur_state_store()")
> Reported-by: Caleb Connolly <caleb.connolly@linaro.org>
> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org>
> ---
Reviewed-by: Yang Yingliang <yangyingliang@huawei.com>
> For v6.2-rc.
>
> V3->V4:
> - The first three versions were sent by Caleb.
> - The new version fixes the current bugs, without looking to optimize the
> code any further, which is done separately in the next two patches.
>
> drivers/thermal/thermal_core.c | 13 ++++++++++---
> 1 file changed, 10 insertions(+), 3 deletions(-)
>
> diff --git a/drivers/thermal/thermal_core.c b/drivers/thermal/thermal_core.c
> index f17ab2316dbd..77bd47d976a2 100644
> --- a/drivers/thermal/thermal_core.c
> +++ b/drivers/thermal/thermal_core.c
> @@ -909,15 +909,20 @@ __thermal_cooling_device_register(struct device_node *np,
> cdev->devdata = devdata;
>
> ret = cdev->ops->get_max_state(cdev, &cdev->max_state);
> - if (ret)
> - goto out_kfree_type;
> + if (ret) {
> + kfree(cdev->type);
> + goto out_ida_remove;
> + }
>
> thermal_cooling_device_setup_sysfs(cdev);
> +
> ret = dev_set_name(&cdev->device, "cooling_device%d", cdev->id);
> if (ret) {
> + kfree(cdev->type);
> thermal_cooling_device_destroy_sysfs(cdev);
> - goto out_kfree_type;
> + goto out_ida_remove;
> }
> +
> ret = device_register(&cdev->device);
> if (ret)
> goto out_kfree_type;
> @@ -943,6 +948,8 @@ __thermal_cooling_device_register(struct device_node *np,
> thermal_cooling_device_destroy_sysfs(cdev);
> kfree(cdev->type);
> put_device(&cdev->device);
> +
> + /* thermal_release() takes care of the rest */
> cdev = NULL;
> out_ida_remove:
> ida_free(&thermal_cdev_ida, id);
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails
2023-01-18 8:38 ` [PATCH V4 1/3] thermal: core: call put_device() only after device_register() fails Viresh Kumar
` (2 preceding siblings ...)
2023-01-19 8:13 ` Yang Yingliang
@ 2023-01-19 15:02 ` Caleb Connolly
3 siblings, 0 replies; 13+ messages in thread
From: Caleb Connolly @ 2023-01-19 15:02 UTC (permalink / raw)
To: Viresh Kumar, Rafael J. Wysocki, Daniel Lezcano, Amit Kucheria,
Zhang Rui, Yang Yingliang
Cc: linux-pm, Vincent Guittot, Rafael J. Wysocki, linux-kernel
On 18/01/2023 08:38, Viresh Kumar wrote:
> put_device() shouldn't be called before a prior call to
> device_register(). __thermal_cooling_device_register() doesn't follow
> that properly and needs fixing. Also
> thermal_cooling_device_destroy_sysfs() is getting called unnecessarily
> on few error paths.
>
> Fix all this by placing the calls at the right place.
>
> Based on initial work done by Caleb Connolly.
>
> Fixes: 4748f9687caa ("thermal: core: fix some possible name leaks in error paths")
> Fixes: c408b3d1d9bb ("thermal: Validate new state in cur_state_store()")
> Reported-by: Caleb Connolly <caleb.connolly@linaro.org>
> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org>
Tested-by: Caleb Connolly <caleb.connolly@linaro.org>
Thanks for sending this, with this I no longer hit the splats when
get_max_state() fails.
> ---
> For v6.2-rc.
>
> V3->V4:
> - The first three versions were sent by Caleb.
> - The new version fixes the current bugs, without looking to optimize the
> code any further, which is done separately in the next two patches.
>
> drivers/thermal/thermal_core.c | 13 ++++++++++---
> 1 file changed, 10 insertions(+), 3 deletions(-)
>
> diff --git a/drivers/thermal/thermal_core.c b/drivers/thermal/thermal_core.c
> index f17ab2316dbd..77bd47d976a2 100644
> --- a/drivers/thermal/thermal_core.c
> +++ b/drivers/thermal/thermal_core.c
> @@ -909,15 +909,20 @@ __thermal_cooling_device_register(struct device_node *np,
> cdev->devdata = devdata;
>
> ret = cdev->ops->get_max_state(cdev, &cdev->max_state);
> - if (ret)
> - goto out_kfree_type;
> + if (ret) {
> + kfree(cdev->type);
> + goto out_ida_remove;
> + }
>
> thermal_cooling_device_setup_sysfs(cdev);
> +
> ret = dev_set_name(&cdev->device, "cooling_device%d", cdev->id);
> if (ret) {
> + kfree(cdev->type);
> thermal_cooling_device_destroy_sysfs(cdev);
> - goto out_kfree_type;
> + goto out_ida_remove;
> }
> +
> ret = device_register(&cdev->device);
> if (ret)
> goto out_kfree_type;
> @@ -943,6 +948,8 @@ __thermal_cooling_device_register(struct device_node *np,
> thermal_cooling_device_destroy_sysfs(cdev);
> kfree(cdev->type);
> put_device(&cdev->device);
> +
> + /* thermal_release() takes care of the rest */
> cdev = NULL;
> out_ida_remove:
> ida_free(&thermal_cdev_ida, id);
--
Kind Regards,
Caleb (they/them)
^ permalink raw reply [flat|nested] 13+ messages in thread