* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-09 19:59 ` Rafael J. Wysocki
@ 2025-06-11 9:11 ` Claudiu Beznea
2025-06-11 16:27 ` Jonathan Cameron
2025-06-11 16:23 ` Jonathan Cameron
2025-06-13 7:39 ` Claudiu Beznea
2 siblings, 1 reply; 22+ messages in thread
From: Claudiu Beznea @ 2025-06-11 9:11 UTC (permalink / raw)
To: Rafael J. Wysocki, Jonathan Cameron
Cc: Dmitry Torokhov, gregkh, dakr, len.brown, pavel, ulf.hansson,
daniel.lezcano, linux-kernel, linux-pm, bhelgaas, geert,
linux-iio, linux-renesas-soc, fabrizio.castro.jz, Claudiu Beznea
Hi, Rafael,
On 09.06.2025 22:59, Rafael J. Wysocki wrote:
> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
>>
>> On Fri, 6 Jun 2025 22:01:52 +0200
>> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
>>
>> Hi Rafael,
>>
>>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
>>> <dmitry.torokhov@gmail.com> wrote:
>>>>
>>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
>>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
>>>>>>
>>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
>>>>>>
>>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
>>>>>> dev_pm_domain_detach(), often following patterns like:
>>>>>>
>>>>>> static int bus_probe(struct device *_dev)
>>>>>> {
>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>> int ret;
>>>>>>
>>>>>> // ...
>>>>>>
>>>>>> ret = dev_pm_domain_attach(_dev, true);
>>>>>> if (ret)
>>>>>> return ret;
>>>>>>
>>>>>> if (drv->probe)
>>>>>> ret = drv->probe(dev);
>>>>>>
>>>>>> // ...
>>>>>> }
>>>>>>
>>>>>> static void bus_remove(struct device *_dev)
>>>>>> {
>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>>
>>>>>> if (drv->remove)
>>>>>> drv->remove(dev);
>>>>>> dev_pm_domain_detach(_dev);
>>>>>> }
>>>>>>
>>>>>> When the driver's probe function uses devres-managed resources that depend
>>>>>> on the power domain state, those resources are released later during
>>>>>> device_unbind_cleanup().
>>>>>>
>>>>>> Releasing devres-managed resources that depend on the power domain state
>>>>>> after detaching the device from its PM domain can cause failures.
>>>>>>
>>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
>>>>>> function, and the device's clocks are managed by the PM domain, then
>>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
>>>>>> the clocks have been removed from the PM domain. It may happen that the
>>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
>>>>>
>>>>> Don't use devm_pm_runtime_enable() then.
>>>>
>>>> What about other devm_ APIs? Are you suggesting that platform drivers
>>>> should not be using devm_clk*(), devm_regulator_*(),
>>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
>>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
>>>> off the device too early, before cleanup code has a chance to execute
>>>> proper cleanup.
>>>>
>>>> The issue is not limited to runtime PM.
>>>>
>>>>>
>>>>>> If the driver specific runtime PM APIs access registers directly, this
>>>>>> will lead to accessing device registers without clocks being enabled.
>>>>>> Similar issues may occur with other devres actions that access device
>>>>>> registers.
>>>>>>
>>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
>>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
>>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
>>>>>> after all driver's devres-managed resources have been release.
>>>>>>
>>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
>>>>>> arguments, one for the domain state on attach, one for the domain state on
>>>>>> detach.
>>>>>
>>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
>>>>
>>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
>>>
>>> Yes, among other things.
>>
>> Maybe naming could make abuse at least obvious to spot? e.g.
>> pm_domain_attach_with_devm_release()
>
> If I'm not mistaken, it is not even necessary to use devres for this.
>
> You might as well add a dev_pm_domain_detach() call to
> device_unbind_cleanup() after devres_release_all(). There is a slight
> complication related to the second argument of it, but I suppose that
> this can be determined at the attach time and stored in a new device
> PM flag, or similar.
I can try this as well.
Another option I see at the moment would be keep the code added in
drivers/base/power/common.c in drivers/base/platform.c, something like:
diff --git a/drivers/base/platform.c b/drivers/base/platform.c
index 075ec1d1b73a..391d725cd4c7 100644
--- a/drivers/base/platform.c
+++ b/drivers/base/platform.c
@@ -1376,10 +1376,18 @@ static int platform_uevent(const struct device
*dev, struct kobj_uevent_env *env
return 0;
}
+static void platform_dev_pm_domain_detach(struct device *dev, void *res)
+{
+ bool *power_off = res;
+
+ dev_pm_domain_detach(dev, *power_off);
+}
+
static int platform_probe(struct device *_dev)
{
struct platform_driver *drv = to_platform_driver(_dev->driver);
struct platform_device *dev = to_platform_device(_dev);
+ bool *power_off;
int ret;
/*
@@ -1396,15 +1404,22 @@ static int platform_probe(struct device *_dev)
if (ret < 0)
return ret;
+ power_off = devres_alloc(platform_dev_pm_domain_detach,
sizeof(*power_off),
+ GFP_KERNEL);
+ if (!power_off)
+ return -ENOMEM;
+
ret = dev_pm_domain_attach(_dev, true);
- if (ret)
+ if (ret) {
+ devres_free(power_off);
goto out;
+ }
- if (drv->probe) {
+ *power_off = true;
+ devres_add(_dev, power_off);
+
+ if (drv->probe)
ret = drv->probe(dev);
- if (ret)
- dev_pm_domain_detach(_dev, true);
- }
out:
if (drv->prevent_deferred_probe && ret == -EPROBE_DEFER) {
@@ -1422,7 +1437,6 @@ static void platform_remove(struct device *_dev)
if (drv->remove)
drv->remove(dev);
- dev_pm_domain_detach(_dev, true);
}
but this would involve duplicating code, as, sooner or later, this would
have to be done for other busses as well.
Could you please let me know what option would you prefer so that I can go
forward with it?
Thank you for your review,
Claudiu
>
> Note that dev->pm_domain is expected to be cleared by ->detach(), so
> this should not cause the domain to be detached twice in a row from
> the same device, but that needs to be double-checked.
>
> Thanks!
^ permalink raw reply related [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-11 9:11 ` Claudiu Beznea
@ 2025-06-11 16:27 ` Jonathan Cameron
0 siblings, 0 replies; 22+ messages in thread
From: Jonathan Cameron @ 2025-06-11 16:27 UTC (permalink / raw)
To: Claudiu Beznea
Cc: Rafael J. Wysocki, Dmitry Torokhov, gregkh, dakr, len.brown,
pavel, ulf.hansson, daniel.lezcano, linux-kernel, linux-pm,
bhelgaas, geert, linux-iio, linux-renesas-soc, fabrizio.castro.jz,
Claudiu Beznea
On Wed, 11 Jun 2025 12:11:08 +0300
Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
> Hi, Rafael,
>
> On 09.06.2025 22:59, Rafael J. Wysocki wrote:
> > On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
> >>
> >> On Fri, 6 Jun 2025 22:01:52 +0200
> >> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
> >>
> >> Hi Rafael,
> >>
> >>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
> >>> <dmitry.torokhov@gmail.com> wrote:
> >>>>
> >>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
> >>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
> >>>>>>
> >>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
> >>>>>>
> >>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
> >>>>>> dev_pm_domain_detach(), often following patterns like:
> >>>>>>
> >>>>>> static int bus_probe(struct device *_dev)
> >>>>>> {
> >>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> >>>>>> struct bus_device *dev = to_bus_device(_dev);
> >>>>>> int ret;
> >>>>>>
> >>>>>> // ...
> >>>>>>
> >>>>>> ret = dev_pm_domain_attach(_dev, true);
> >>>>>> if (ret)
> >>>>>> return ret;
> >>>>>>
> >>>>>> if (drv->probe)
> >>>>>> ret = drv->probe(dev);
> >>>>>>
> >>>>>> // ...
> >>>>>> }
> >>>>>>
> >>>>>> static void bus_remove(struct device *_dev)
> >>>>>> {
> >>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> >>>>>> struct bus_device *dev = to_bus_device(_dev);
> >>>>>>
> >>>>>> if (drv->remove)
> >>>>>> drv->remove(dev);
> >>>>>> dev_pm_domain_detach(_dev);
> >>>>>> }
> >>>>>>
> >>>>>> When the driver's probe function uses devres-managed resources that depend
> >>>>>> on the power domain state, those resources are released later during
> >>>>>> device_unbind_cleanup().
> >>>>>>
> >>>>>> Releasing devres-managed resources that depend on the power domain state
> >>>>>> after detaching the device from its PM domain can cause failures.
> >>>>>>
> >>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
> >>>>>> function, and the device's clocks are managed by the PM domain, then
> >>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
> >>>>>> the clocks have been removed from the PM domain. It may happen that the
> >>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
> >>>>>
> >>>>> Don't use devm_pm_runtime_enable() then.
> >>>>
> >>>> What about other devm_ APIs? Are you suggesting that platform drivers
> >>>> should not be using devm_clk*(), devm_regulator_*(),
> >>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
> >>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
> >>>> off the device too early, before cleanup code has a chance to execute
> >>>> proper cleanup.
> >>>>
> >>>> The issue is not limited to runtime PM.
> >>>>
> >>>>>
> >>>>>> If the driver specific runtime PM APIs access registers directly, this
> >>>>>> will lead to accessing device registers without clocks being enabled.
> >>>>>> Similar issues may occur with other devres actions that access device
> >>>>>> registers.
> >>>>>>
> >>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
> >>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
> >>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
> >>>>>> after all driver's devres-managed resources have been release.
> >>>>>>
> >>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
> >>>>>> arguments, one for the domain state on attach, one for the domain state on
> >>>>>> detach.
> >>>>>
> >>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
> >>>>
> >>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
> >>>
> >>> Yes, among other things.
> >>
> >> Maybe naming could make abuse at least obvious to spot? e.g.
> >> pm_domain_attach_with_devm_release()
> >
> > If I'm not mistaken, it is not even necessary to use devres for this.
> >
> > You might as well add a dev_pm_domain_detach() call to
> > device_unbind_cleanup() after devres_release_all(). There is a slight
> > complication related to the second argument of it, but I suppose that
> > this can be determined at the attach time and stored in a new device
> > PM flag, or similar.
>
> I can try this as well.
>
> Another option I see at the moment would be keep the code added in
> drivers/base/power/common.c in drivers/base/platform.c, something like:
Whilst this avoids the exposure of a devm interface by just
pushing it down to each bus, we'll for ever be rejecting cleanups
that unify it. So I'd rather explore Rafael's suggestion to just
handle this one in the driver core.
That only deals with the pm domain issue and not the other ones
Dmitry refers to where devres being handled after bus->remove()
bites us, but maybe that is the right way forwards for now at least.
Jonathan
>
> diff --git a/drivers/base/platform.c b/drivers/base/platform.c
> index 075ec1d1b73a..391d725cd4c7 100644
> --- a/drivers/base/platform.c
> +++ b/drivers/base/platform.c
> @@ -1376,10 +1376,18 @@ static int platform_uevent(const struct device
> *dev, struct kobj_uevent_env *env
> return 0;
> }
>
> +static void platform_dev_pm_domain_detach(struct device *dev, void *res)
> +{
> + bool *power_off = res;
> +
> + dev_pm_domain_detach(dev, *power_off);
> +}
> +
> static int platform_probe(struct device *_dev)
> {
> struct platform_driver *drv = to_platform_driver(_dev->driver);
> struct platform_device *dev = to_platform_device(_dev);
> + bool *power_off;
> int ret;
>
> /*
> @@ -1396,15 +1404,22 @@ static int platform_probe(struct device *_dev)
> if (ret < 0)
> return ret;
>
> + power_off = devres_alloc(platform_dev_pm_domain_detach,
> sizeof(*power_off),
> + GFP_KERNEL);
> + if (!power_off)
> + return -ENOMEM;
> +
> ret = dev_pm_domain_attach(_dev, true);
> - if (ret)
> + if (ret) {
> + devres_free(power_off);
> goto out;
> + }
>
> - if (drv->probe) {
> + *power_off = true;
> + devres_add(_dev, power_off);
> +
> + if (drv->probe)
> ret = drv->probe(dev);
> - if (ret)
> - dev_pm_domain_detach(_dev, true);
> - }
>
> out:
> if (drv->prevent_deferred_probe && ret == -EPROBE_DEFER) {
> @@ -1422,7 +1437,6 @@ static void platform_remove(struct device *_dev)
>
> if (drv->remove)
> drv->remove(dev);
> - dev_pm_domain_detach(_dev, true);
> }
>
> but this would involve duplicating code, as, sooner or later, this would
> have to be done for other busses as well.
>
> Could you please let me know what option would you prefer so that I can go
> forward with it?
>
> Thank you for your review,
> Claudiu
>
> >
> > Note that dev->pm_domain is expected to be cleared by ->detach(), so
> > this should not cause the domain to be detached twice in a row from
> > the same device, but that needs to be double-checked.
> >
> > Thanks!
>
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-09 19:59 ` Rafael J. Wysocki
2025-06-11 9:11 ` Claudiu Beznea
@ 2025-06-11 16:23 ` Jonathan Cameron
2025-06-11 16:35 ` Dmitry Torokhov
2025-06-13 7:39 ` Claudiu Beznea
2 siblings, 1 reply; 22+ messages in thread
From: Jonathan Cameron @ 2025-06-11 16:23 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Dmitry Torokhov, Claudiu, gregkh, dakr, len.brown, pavel,
ulf.hansson, daniel.lezcano, linux-kernel, linux-pm, bhelgaas,
geert, linux-iio, linux-renesas-soc, fabrizio.castro.jz,
Claudiu Beznea
On Mon, 9 Jun 2025 21:59:57 +0200
"Rafael J. Wysocki" <rafael@kernel.org> wrote:
> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
> >
> > On Fri, 6 Jun 2025 22:01:52 +0200
> > "Rafael J. Wysocki" <rafael@kernel.org> wrote:
> >
> > Hi Rafael,
> >
> > > On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
> > > <dmitry.torokhov@gmail.com> wrote:
> > > >
> > > > On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
> > > > > On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
> > > > > >
> > > > > > From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
> > > > > >
> > > > > > The dev_pm_domain_attach() function is typically used in bus code alongside
> > > > > > dev_pm_domain_detach(), often following patterns like:
> > > > > >
> > > > > > static int bus_probe(struct device *_dev)
> > > > > > {
> > > > > > struct bus_driver *drv = to_bus_driver(dev->driver);
> > > > > > struct bus_device *dev = to_bus_device(_dev);
> > > > > > int ret;
> > > > > >
> > > > > > // ...
> > > > > >
> > > > > > ret = dev_pm_domain_attach(_dev, true);
> > > > > > if (ret)
> > > > > > return ret;
> > > > > >
> > > > > > if (drv->probe)
> > > > > > ret = drv->probe(dev);
> > > > > >
> > > > > > // ...
> > > > > > }
> > > > > >
> > > > > > static void bus_remove(struct device *_dev)
> > > > > > {
> > > > > > struct bus_driver *drv = to_bus_driver(dev->driver);
> > > > > > struct bus_device *dev = to_bus_device(_dev);
> > > > > >
> > > > > > if (drv->remove)
> > > > > > drv->remove(dev);
> > > > > > dev_pm_domain_detach(_dev);
> > > > > > }
> > > > > >
> > > > > > When the driver's probe function uses devres-managed resources that depend
> > > > > > on the power domain state, those resources are released later during
> > > > > > device_unbind_cleanup().
> > > > > >
> > > > > > Releasing devres-managed resources that depend on the power domain state
> > > > > > after detaching the device from its PM domain can cause failures.
> > > > > >
> > > > > > For example, if the driver uses devm_pm_runtime_enable() in its probe
> > > > > > function, and the device's clocks are managed by the PM domain, then
> > > > > > during removal the runtime PM is disabled in device_unbind_cleanup() after
> > > > > > the clocks have been removed from the PM domain. It may happen that the
> > > > > > devm_pm_runtime_enable() action causes the device to be runtime-resumed.
> > > > >
> > > > > Don't use devm_pm_runtime_enable() then.
> > > >
> > > > What about other devm_ APIs? Are you suggesting that platform drivers
> > > > should not be using devm_clk*(), devm_regulator_*(),
> > > > devm_request_*_irq() and devm_add_action_or_reset()? Because again,
> > > > dev_pm_domain_detach() that is called by platform bus_remove() may shut
> > > > off the device too early, before cleanup code has a chance to execute
> > > > proper cleanup.
> > > >
> > > > The issue is not limited to runtime PM.
> > > >
> > > > >
> > > > > > If the driver specific runtime PM APIs access registers directly, this
> > > > > > will lead to accessing device registers without clocks being enabled.
> > > > > > Similar issues may occur with other devres actions that access device
> > > > > > registers.
> > > > > >
> > > > > > Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
> > > > > > dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
> > > > > > device is detached from its PM domain in device_unbind_cleanup(), only
> > > > > > after all driver's devres-managed resources have been release.
> > > > > >
> > > > > > For flexibility, the implemented devm_pm_domain_attach() has 2 state
> > > > > > arguments, one for the domain state on attach, one for the domain state on
> > > > > > detach.
> > > > >
> > > > > dev_pm_domain_attach() is not part driver API and I'm not convinced at
> > > >
> > > > Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
> > >
> > > Yes, among other things.
> >
> > Maybe naming could make abuse at least obvious to spot? e.g.
> > pm_domain_attach_with_devm_release()
>
> If I'm not mistaken, it is not even necessary to use devres for this.
>
> You might as well add a dev_pm_domain_detach() call to
> device_unbind_cleanup() after devres_release_all(). There is a slight
> complication related to the second argument of it, but I suppose that
> this can be determined at the attach time and stored in a new device
> PM flag, or similar.
That options sounds good to me. I think this moves dev_pm_domain_detach()
call into the the driver core / perhaps device_unbind_cleanup(). It's a noop
if a domain was never attached so that should be fine.
Given that second parameter, I guess we can't move the dev_pm_domain_attach()
into the driver core as well so it is a little odd wrt to balance,
but with some documentation that is probably fine. I don't think we
really want a bus->remove_after_devres() callback for just this.
Ulf what do you think of this approach?
Jonathan
>
> Note that dev->pm_domain is expected to be cleared by ->detach(), so
> this should not cause the domain to be detached twice in a row from
> the same device, but that needs to be double-checked.
>
> Thanks!
>
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-11 16:23 ` Jonathan Cameron
@ 2025-06-11 16:35 ` Dmitry Torokhov
0 siblings, 0 replies; 22+ messages in thread
From: Dmitry Torokhov @ 2025-06-11 16:35 UTC (permalink / raw)
To: Jonathan Cameron
Cc: Rafael J. Wysocki, Claudiu, gregkh, dakr, len.brown, pavel,
ulf.hansson, daniel.lezcano, linux-kernel, linux-pm, bhelgaas,
geert, linux-iio, linux-renesas-soc, fabrizio.castro.jz,
Claudiu Beznea
On Wed, Jun 11, 2025 at 05:23:07PM +0100, Jonathan Cameron wrote:
> On Mon, 9 Jun 2025 21:59:57 +0200
> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
>
> > On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
> > >
> > > On Fri, 6 Jun 2025 22:01:52 +0200
> > > "Rafael J. Wysocki" <rafael@kernel.org> wrote:
> > >
> > > Hi Rafael,
> > >
> > > > On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
> > > > <dmitry.torokhov@gmail.com> wrote:
> > > > >
> > > > > On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
> > > > > > On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
> > > > > > >
> > > > > > > From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
> > > > > > >
> > > > > > > The dev_pm_domain_attach() function is typically used in bus code alongside
> > > > > > > dev_pm_domain_detach(), often following patterns like:
> > > > > > >
> > > > > > > static int bus_probe(struct device *_dev)
> > > > > > > {
> > > > > > > struct bus_driver *drv = to_bus_driver(dev->driver);
> > > > > > > struct bus_device *dev = to_bus_device(_dev);
> > > > > > > int ret;
> > > > > > >
> > > > > > > // ...
> > > > > > >
> > > > > > > ret = dev_pm_domain_attach(_dev, true);
> > > > > > > if (ret)
> > > > > > > return ret;
> > > > > > >
> > > > > > > if (drv->probe)
> > > > > > > ret = drv->probe(dev);
> > > > > > >
> > > > > > > // ...
> > > > > > > }
> > > > > > >
> > > > > > > static void bus_remove(struct device *_dev)
> > > > > > > {
> > > > > > > struct bus_driver *drv = to_bus_driver(dev->driver);
> > > > > > > struct bus_device *dev = to_bus_device(_dev);
> > > > > > >
> > > > > > > if (drv->remove)
> > > > > > > drv->remove(dev);
> > > > > > > dev_pm_domain_detach(_dev);
> > > > > > > }
> > > > > > >
> > > > > > > When the driver's probe function uses devres-managed resources that depend
> > > > > > > on the power domain state, those resources are released later during
> > > > > > > device_unbind_cleanup().
> > > > > > >
> > > > > > > Releasing devres-managed resources that depend on the power domain state
> > > > > > > after detaching the device from its PM domain can cause failures.
> > > > > > >
> > > > > > > For example, if the driver uses devm_pm_runtime_enable() in its probe
> > > > > > > function, and the device's clocks are managed by the PM domain, then
> > > > > > > during removal the runtime PM is disabled in device_unbind_cleanup() after
> > > > > > > the clocks have been removed from the PM domain. It may happen that the
> > > > > > > devm_pm_runtime_enable() action causes the device to be runtime-resumed.
> > > > > >
> > > > > > Don't use devm_pm_runtime_enable() then.
> > > > >
> > > > > What about other devm_ APIs? Are you suggesting that platform drivers
> > > > > should not be using devm_clk*(), devm_regulator_*(),
> > > > > devm_request_*_irq() and devm_add_action_or_reset()? Because again,
> > > > > dev_pm_domain_detach() that is called by platform bus_remove() may shut
> > > > > off the device too early, before cleanup code has a chance to execute
> > > > > proper cleanup.
> > > > >
> > > > > The issue is not limited to runtime PM.
> > > > >
> > > > > >
> > > > > > > If the driver specific runtime PM APIs access registers directly, this
> > > > > > > will lead to accessing device registers without clocks being enabled.
> > > > > > > Similar issues may occur with other devres actions that access device
> > > > > > > registers.
> > > > > > >
> > > > > > > Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
> > > > > > > dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
> > > > > > > device is detached from its PM domain in device_unbind_cleanup(), only
> > > > > > > after all driver's devres-managed resources have been release.
> > > > > > >
> > > > > > > For flexibility, the implemented devm_pm_domain_attach() has 2 state
> > > > > > > arguments, one for the domain state on attach, one for the domain state on
> > > > > > > detach.
> > > > > >
> > > > > > dev_pm_domain_attach() is not part driver API and I'm not convinced at
> > > > >
> > > > > Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
> > > >
> > > > Yes, among other things.
> > >
> > > Maybe naming could make abuse at least obvious to spot? e.g.
> > > pm_domain_attach_with_devm_release()
> >
> > If I'm not mistaken, it is not even necessary to use devres for this.
> >
> > You might as well add a dev_pm_domain_detach() call to
> > device_unbind_cleanup() after devres_release_all(). There is a slight
> > complication related to the second argument of it, but I suppose that
> > this can be determined at the attach time and stored in a new device
> > PM flag, or similar.
>
> That options sounds good to me. I think this moves dev_pm_domain_detach()
> call into the the driver core / perhaps device_unbind_cleanup(). It's a noop
> if a domain was never attached so that should be fine.
>
> Given that second parameter, I guess we can't move the dev_pm_domain_attach()
> into the driver core as well so it is a little odd wrt to balance,
> but with some documentation that is probably fine.
It is going to be confusing IMO and might lead to ordering issues again
if there are more resources allocated by bus probe() before domain
attach is called.
I know Rafael does not consider dev_pm_domain_attach() a "driver" API
(although I do not see much difference between driver and bus probe
code), but maybe we can solve this by using different name
(devres_controlled_pm_domain_attach() instead of
devm_pm_domain_attach()?).
OTOH we have devm_pm_domain_attach_list() already which is I guess
driver-level API...
Thanks.
--
Dmitry
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-09 19:59 ` Rafael J. Wysocki
2025-06-11 9:11 ` Claudiu Beznea
2025-06-11 16:23 ` Jonathan Cameron
@ 2025-06-13 7:39 ` Claudiu Beznea
2025-06-13 10:02 ` Rafael J. Wysocki
2 siblings, 1 reply; 22+ messages in thread
From: Claudiu Beznea @ 2025-06-13 7:39 UTC (permalink / raw)
To: Rafael J. Wysocki, Jonathan Cameron
Cc: Dmitry Torokhov, gregkh, dakr, len.brown, pavel, ulf.hansson,
daniel.lezcano, linux-kernel, linux-pm, bhelgaas, geert,
linux-iio, linux-renesas-soc, fabrizio.castro.jz, Claudiu Beznea
Hi, Rafael,
On 09.06.2025 22:59, Rafael J. Wysocki wrote:
> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
>>
>> On Fri, 6 Jun 2025 22:01:52 +0200
>> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
>>
>> Hi Rafael,
>>
>>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
>>> <dmitry.torokhov@gmail.com> wrote:
>>>>
>>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
>>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
>>>>>>
>>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
>>>>>>
>>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
>>>>>> dev_pm_domain_detach(), often following patterns like:
>>>>>>
>>>>>> static int bus_probe(struct device *_dev)
>>>>>> {
>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>> int ret;
>>>>>>
>>>>>> // ...
>>>>>>
>>>>>> ret = dev_pm_domain_attach(_dev, true);
>>>>>> if (ret)
>>>>>> return ret;
>>>>>>
>>>>>> if (drv->probe)
>>>>>> ret = drv->probe(dev);
>>>>>>
>>>>>> // ...
>>>>>> }
>>>>>>
>>>>>> static void bus_remove(struct device *_dev)
>>>>>> {
>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>>
>>>>>> if (drv->remove)
>>>>>> drv->remove(dev);
>>>>>> dev_pm_domain_detach(_dev);
>>>>>> }
>>>>>>
>>>>>> When the driver's probe function uses devres-managed resources that depend
>>>>>> on the power domain state, those resources are released later during
>>>>>> device_unbind_cleanup().
>>>>>>
>>>>>> Releasing devres-managed resources that depend on the power domain state
>>>>>> after detaching the device from its PM domain can cause failures.
>>>>>>
>>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
>>>>>> function, and the device's clocks are managed by the PM domain, then
>>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
>>>>>> the clocks have been removed from the PM domain. It may happen that the
>>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
>>>>>
>>>>> Don't use devm_pm_runtime_enable() then.
>>>>
>>>> What about other devm_ APIs? Are you suggesting that platform drivers
>>>> should not be using devm_clk*(), devm_regulator_*(),
>>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
>>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
>>>> off the device too early, before cleanup code has a chance to execute
>>>> proper cleanup.
>>>>
>>>> The issue is not limited to runtime PM.
>>>>
>>>>>
>>>>>> If the driver specific runtime PM APIs access registers directly, this
>>>>>> will lead to accessing device registers without clocks being enabled.
>>>>>> Similar issues may occur with other devres actions that access device
>>>>>> registers.
>>>>>>
>>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
>>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
>>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
>>>>>> after all driver's devres-managed resources have been release.
>>>>>>
>>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
>>>>>> arguments, one for the domain state on attach, one for the domain state on
>>>>>> detach.
>>>>>
>>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
>>>>
>>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
>>>
>>> Yes, among other things.
>>
>> Maybe naming could make abuse at least obvious to spot? e.g.
>> pm_domain_attach_with_devm_release()
>
> If I'm not mistaken, it is not even necessary to use devres for this.
>
> You might as well add a dev_pm_domain_detach() call to
> device_unbind_cleanup() after devres_release_all(). There is a slight
> complication related to the second argument of it, but I suppose that
> this can be determined at the attach time and stored in a new device
> PM flag, or similar.
>
I looked into this solution. I've tested it for all my failure cases and
went good.
> Note that dev->pm_domain is expected to be cleared by ->detach(), so
> this should not cause the domain to be detached twice in a row from
> the same device, but that needs to be double-checked.
The genpd_dev_pm_detach() calls genpd_remove_device() ->
dev_pm_domain_set(dev, NULL) which sets the dev->pm_domain = NULL. I can't
find any other detach function in the current code base.
The code I've tested for this solution is this one:
diff --git a/drivers/base/dd.c b/drivers/base/dd.c
index b526e0e0f52d..5e9750d007b4 100644
--- a/drivers/base/dd.c
+++ b/drivers/base/dd.c
@@ -25,6 +25,7 @@
#include <linux/kthread.h>
#include <linux/wait.h>
#include <linux/async.h>
+#include <linux/pm_domain.h>
#include <linux/pm_runtime.h>
#include <linux/pinctrl/devinfo.h>
#include <linux/slab.h>
@@ -552,8 +553,11 @@ static void device_unbind_cleanup(struct device *dev)
dev->dma_range_map = NULL;
device_set_driver(dev, NULL);
dev_set_drvdata(dev, NULL);
- if (dev->pm_domain && dev->pm_domain->dismiss)
- dev->pm_domain->dismiss(dev);
+ if (dev->pm_domain) {
+ if (dev->pm_domain->dismiss)
+ dev->pm_domain->dismiss(dev);
+ dev_pm_domain_detach(dev, dev->pm_domain->detach_power_off);
+ }
pm_runtime_reinit(dev);
dev_pm_set_driver_flags(dev, 0);
}
diff --git a/drivers/base/platform.c b/drivers/base/platform.c
index 075ec1d1b73a..2459be6aecf4 100644
--- a/drivers/base/platform.c
+++ b/drivers/base/platform.c
@@ -1400,11 +1400,8 @@ static int platform_probe(struct device *_dev)
if (ret)
goto out;
- if (drv->probe) {
+ if (drv->probe)
ret = drv->probe(dev);
- if (ret)
- dev_pm_domain_detach(_dev, true);
- }
out:
if (drv->prevent_deferred_probe && ret == -EPROBE_DEFER) {
@@ -1422,7 +1419,6 @@ static void platform_remove(struct device *_dev)
if (drv->remove)
drv->remove(dev);
- dev_pm_domain_detach(_dev, true);
}
static void platform_shutdown(struct device *_dev)
diff --git a/drivers/base/power/common.c b/drivers/base/power/common.c
index 781968a128ff..4bd1e3c7f401 100644
--- a/drivers/base/power/common.c
+++ b/drivers/base/power/common.c
@@ -111,6 +111,9 @@ int dev_pm_domain_attach(struct device *dev, bool power_on)
if (!ret)
ret = genpd_dev_pm_attach(dev);
+ if (dev->pm_domain)
+ dev->pm_domain->detach_power_off = power_on;
+
return ret < 0 ? ret : 0;
}
EXPORT_SYMBOL_GPL(dev_pm_domain_attach);
diff --git a/include/linux/pm.h b/include/linux/pm.h
index f0bd8fbae4f2..12e97e09e85c 100644
--- a/include/linux/pm.h
+++ b/include/linux/pm.h
@@ -748,6 +748,7 @@ struct dev_pm_domain {
void (*sync)(struct device *dev);
void (*dismiss)(struct device *dev);
int (*set_performance_state)(struct device *dev, unsigned int state);
+ bool detach_power_off;
};
Rafael, Ulf, Dmitry, Jonathan, all,
Could you please let me know how do you consider this approach?
Thank you,
Claudiu
^ permalink raw reply related [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-13 7:39 ` Claudiu Beznea
@ 2025-06-13 10:02 ` Rafael J. Wysocki
2025-06-13 12:44 ` Claudiu Beznea
2025-06-16 9:37 ` Claudiu Beznea
0 siblings, 2 replies; 22+ messages in thread
From: Rafael J. Wysocki @ 2025-06-13 10:02 UTC (permalink / raw)
To: Claudiu Beznea
Cc: Rafael J. Wysocki, Jonathan Cameron, Dmitry Torokhov, gregkh,
dakr, len.brown, pavel, ulf.hansson, daniel.lezcano, linux-kernel,
linux-pm, bhelgaas, geert, linux-iio, linux-renesas-soc,
fabrizio.castro.jz, Claudiu Beznea
On Fri, Jun 13, 2025 at 9:39 AM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
>
> Hi, Rafael,
>
> On 09.06.2025 22:59, Rafael J. Wysocki wrote:
> > On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
> >>
> >> On Fri, 6 Jun 2025 22:01:52 +0200
> >> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
> >>
> >> Hi Rafael,
> >>
> >>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
> >>> <dmitry.torokhov@gmail.com> wrote:
> >>>>
> >>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
> >>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
> >>>>>>
> >>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
> >>>>>>
> >>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
> >>>>>> dev_pm_domain_detach(), often following patterns like:
> >>>>>>
> >>>>>> static int bus_probe(struct device *_dev)
> >>>>>> {
> >>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> >>>>>> struct bus_device *dev = to_bus_device(_dev);
> >>>>>> int ret;
> >>>>>>
> >>>>>> // ...
> >>>>>>
> >>>>>> ret = dev_pm_domain_attach(_dev, true);
> >>>>>> if (ret)
> >>>>>> return ret;
> >>>>>>
> >>>>>> if (drv->probe)
> >>>>>> ret = drv->probe(dev);
> >>>>>>
> >>>>>> // ...
> >>>>>> }
> >>>>>>
> >>>>>> static void bus_remove(struct device *_dev)
> >>>>>> {
> >>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> >>>>>> struct bus_device *dev = to_bus_device(_dev);
> >>>>>>
> >>>>>> if (drv->remove)
> >>>>>> drv->remove(dev);
> >>>>>> dev_pm_domain_detach(_dev);
> >>>>>> }
> >>>>>>
> >>>>>> When the driver's probe function uses devres-managed resources that depend
> >>>>>> on the power domain state, those resources are released later during
> >>>>>> device_unbind_cleanup().
> >>>>>>
> >>>>>> Releasing devres-managed resources that depend on the power domain state
> >>>>>> after detaching the device from its PM domain can cause failures.
> >>>>>>
> >>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
> >>>>>> function, and the device's clocks are managed by the PM domain, then
> >>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
> >>>>>> the clocks have been removed from the PM domain. It may happen that the
> >>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
> >>>>>
> >>>>> Don't use devm_pm_runtime_enable() then.
> >>>>
> >>>> What about other devm_ APIs? Are you suggesting that platform drivers
> >>>> should not be using devm_clk*(), devm_regulator_*(),
> >>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
> >>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
> >>>> off the device too early, before cleanup code has a chance to execute
> >>>> proper cleanup.
> >>>>
> >>>> The issue is not limited to runtime PM.
> >>>>
> >>>>>
> >>>>>> If the driver specific runtime PM APIs access registers directly, this
> >>>>>> will lead to accessing device registers without clocks being enabled.
> >>>>>> Similar issues may occur with other devres actions that access device
> >>>>>> registers.
> >>>>>>
> >>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
> >>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
> >>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
> >>>>>> after all driver's devres-managed resources have been release.
> >>>>>>
> >>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
> >>>>>> arguments, one for the domain state on attach, one for the domain state on
> >>>>>> detach.
> >>>>>
> >>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
> >>>>
> >>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
> >>>
> >>> Yes, among other things.
> >>
> >> Maybe naming could make abuse at least obvious to spot? e.g.
> >> pm_domain_attach_with_devm_release()
> >
> > If I'm not mistaken, it is not even necessary to use devres for this.
> >
> > You might as well add a dev_pm_domain_detach() call to
> > device_unbind_cleanup() after devres_release_all(). There is a slight
> > complication related to the second argument of it, but I suppose that
> > this can be determined at the attach time and stored in a new device
> > PM flag, or similar.
> >
>
> I looked into this solution. I've tested it for all my failure cases and
> went good.
OK
> > Note that dev->pm_domain is expected to be cleared by ->detach(), so
> > this should not cause the domain to be detached twice in a row from
> > the same device, but that needs to be double-checked.
>
> The genpd_dev_pm_detach() calls genpd_remove_device() ->
> dev_pm_domain_set(dev, NULL) which sets the dev->pm_domain = NULL. I can't
> find any other detach function in the current code base.
There is also acpi_dev_pm_detach() which can be somewhat hard to find,
but it calls dev_pm_domain_set(dev, NULL) either.
> The code I've tested for this solution is this one:
>
> diff --git a/drivers/base/dd.c b/drivers/base/dd.c
> index b526e0e0f52d..5e9750d007b4 100644
> --- a/drivers/base/dd.c
> +++ b/drivers/base/dd.c
> @@ -25,6 +25,7 @@
> #include <linux/kthread.h>
> #include <linux/wait.h>
> #include <linux/async.h>
> +#include <linux/pm_domain.h>
> #include <linux/pm_runtime.h>
> #include <linux/pinctrl/devinfo.h>
> #include <linux/slab.h>
> @@ -552,8 +553,11 @@ static void device_unbind_cleanup(struct device *dev)
> dev->dma_range_map = NULL;
> device_set_driver(dev, NULL);
> dev_set_drvdata(dev, NULL);
> - if (dev->pm_domain && dev->pm_domain->dismiss)
> - dev->pm_domain->dismiss(dev);
> + if (dev->pm_domain) {
> + if (dev->pm_domain->dismiss)
> + dev->pm_domain->dismiss(dev);
> + dev_pm_domain_detach(dev, dev->pm_domain->detach_power_off);
I would do the "detach" before the "dismiss" to retain the current ordering.
Also it is interesting that you ended up calling them both in one
place. It kind of indicates that the PM domains attached via
dev_pm_domain_attach() should be attached earlier and just use the
->activate() and ->dismiss() callbacks.
> + }
> pm_runtime_reinit(dev);
> dev_pm_set_driver_flags(dev, 0);
> }
> diff --git a/drivers/base/platform.c b/drivers/base/platform.c
> index 075ec1d1b73a..2459be6aecf4 100644
> --- a/drivers/base/platform.c
> +++ b/drivers/base/platform.c
> @@ -1400,11 +1400,8 @@ static int platform_probe(struct device *_dev)
> if (ret)
> goto out;
>
> - if (drv->probe) {
> + if (drv->probe)
> ret = drv->probe(dev);
> - if (ret)
> - dev_pm_domain_detach(_dev, true);
> - }
>
> out:
> if (drv->prevent_deferred_probe && ret == -EPROBE_DEFER) {
> @@ -1422,7 +1419,6 @@ static void platform_remove(struct device *_dev)
>
> if (drv->remove)
> drv->remove(dev);
> - dev_pm_domain_detach(_dev, true);
> }
>
> static void platform_shutdown(struct device *_dev)
> diff --git a/drivers/base/power/common.c b/drivers/base/power/common.c
> index 781968a128ff..4bd1e3c7f401 100644
> --- a/drivers/base/power/common.c
> +++ b/drivers/base/power/common.c
> @@ -111,6 +111,9 @@ int dev_pm_domain_attach(struct device *dev, bool power_on)
> if (!ret)
> ret = genpd_dev_pm_attach(dev);
>
> + if (dev->pm_domain)
> + dev->pm_domain->detach_power_off = power_on;
This will not work for acpi_general_pm_domain because it is shared by
all of its users.
It is likely to not work for shared PM domains in general.
I would put the new flag into struct dev_pm_info.
> +
> return ret < 0 ? ret : 0;
> }
> EXPORT_SYMBOL_GPL(dev_pm_domain_attach);
> diff --git a/include/linux/pm.h b/include/linux/pm.h
> index f0bd8fbae4f2..12e97e09e85c 100644
> --- a/include/linux/pm.h
> +++ b/include/linux/pm.h
> @@ -748,6 +748,7 @@ struct dev_pm_domain {
> void (*sync)(struct device *dev);
> void (*dismiss)(struct device *dev);
> int (*set_performance_state)(struct device *dev, unsigned int state);
> + bool detach_power_off;
> };
>
> Rafael, Ulf, Dmitry, Jonathan, all,
>
> Could you please let me know how do you consider this approach?
Please see my comments above.
Thanks!
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-13 10:02 ` Rafael J. Wysocki
@ 2025-06-13 12:44 ` Claudiu Beznea
2025-06-16 9:37 ` Claudiu Beznea
1 sibling, 0 replies; 22+ messages in thread
From: Claudiu Beznea @ 2025-06-13 12:44 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Jonathan Cameron, Dmitry Torokhov, gregkh, dakr, len.brown, pavel,
ulf.hansson, daniel.lezcano, linux-kernel, linux-pm, bhelgaas,
geert, linux-iio, linux-renesas-soc, fabrizio.castro.jz,
Claudiu Beznea
Hi, Rafael,
On 13.06.2025 13:02, Rafael J. Wysocki wrote:
> On Fri, Jun 13, 2025 at 9:39 AM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
>>
>> Hi, Rafael,
>>
>> On 09.06.2025 22:59, Rafael J. Wysocki wrote:
>>> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
>>>>
>>>> On Fri, 6 Jun 2025 22:01:52 +0200
>>>> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
>>>>
>>>> Hi Rafael,
>>>>
>>>>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
>>>>> <dmitry.torokhov@gmail.com> wrote:
>>>>>>
>>>>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
>>>>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
>>>>>>>>
>>>>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
>>>>>>>>
>>>>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
>>>>>>>> dev_pm_domain_detach(), often following patterns like:
>>>>>>>>
>>>>>>>> static int bus_probe(struct device *_dev)
>>>>>>>> {
>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>>>> int ret;
>>>>>>>>
>>>>>>>> // ...
>>>>>>>>
>>>>>>>> ret = dev_pm_domain_attach(_dev, true);
>>>>>>>> if (ret)
>>>>>>>> return ret;
>>>>>>>>
>>>>>>>> if (drv->probe)
>>>>>>>> ret = drv->probe(dev);
>>>>>>>>
>>>>>>>> // ...
>>>>>>>> }
>>>>>>>>
>>>>>>>> static void bus_remove(struct device *_dev)
>>>>>>>> {
>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>>>>
>>>>>>>> if (drv->remove)
>>>>>>>> drv->remove(dev);
>>>>>>>> dev_pm_domain_detach(_dev);
>>>>>>>> }
>>>>>>>>
>>>>>>>> When the driver's probe function uses devres-managed resources that depend
>>>>>>>> on the power domain state, those resources are released later during
>>>>>>>> device_unbind_cleanup().
>>>>>>>>
>>>>>>>> Releasing devres-managed resources that depend on the power domain state
>>>>>>>> after detaching the device from its PM domain can cause failures.
>>>>>>>>
>>>>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
>>>>>>>> function, and the device's clocks are managed by the PM domain, then
>>>>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
>>>>>>>> the clocks have been removed from the PM domain. It may happen that the
>>>>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
>>>>>>>
>>>>>>> Don't use devm_pm_runtime_enable() then.
>>>>>>
>>>>>> What about other devm_ APIs? Are you suggesting that platform drivers
>>>>>> should not be using devm_clk*(), devm_regulator_*(),
>>>>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
>>>>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
>>>>>> off the device too early, before cleanup code has a chance to execute
>>>>>> proper cleanup.
>>>>>>
>>>>>> The issue is not limited to runtime PM.
>>>>>>
>>>>>>>
>>>>>>>> If the driver specific runtime PM APIs access registers directly, this
>>>>>>>> will lead to accessing device registers without clocks being enabled.
>>>>>>>> Similar issues may occur with other devres actions that access device
>>>>>>>> registers.
>>>>>>>>
>>>>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
>>>>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
>>>>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
>>>>>>>> after all driver's devres-managed resources have been release.
>>>>>>>>
>>>>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
>>>>>>>> arguments, one for the domain state on attach, one for the domain state on
>>>>>>>> detach.
>>>>>>>
>>>>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
>>>>>>
>>>>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
>>>>>
>>>>> Yes, among other things.
>>>>
>>>> Maybe naming could make abuse at least obvious to spot? e.g.
>>>> pm_domain_attach_with_devm_release()
>>>
>>> If I'm not mistaken, it is not even necessary to use devres for this.
>>>
>>> You might as well add a dev_pm_domain_detach() call to
>>> device_unbind_cleanup() after devres_release_all(). There is a slight
>>> complication related to the second argument of it, but I suppose that
>>> this can be determined at the attach time and stored in a new device
>>> PM flag, or similar.
>>>
>>
>> I looked into this solution. I've tested it for all my failure cases and
>> went good.
>
> OK
>
>>> Note that dev->pm_domain is expected to be cleared by ->detach(), so
>>> this should not cause the domain to be detached twice in a row from
>>> the same device, but that needs to be double-checked.
>>
>> The genpd_dev_pm_detach() calls genpd_remove_device() ->
>> dev_pm_domain_set(dev, NULL) which sets the dev->pm_domain = NULL. I can't
>> find any other detach function in the current code base.
>
> There is also acpi_dev_pm_detach() which can be somewhat hard to find,
> but it calls dev_pm_domain_set(dev, NULL) either.
Thank you for the pointer.
>
>> The code I've tested for this solution is this one:
>>
>> diff --git a/drivers/base/dd.c b/drivers/base/dd.c
>> index b526e0e0f52d..5e9750d007b4 100644
>> --- a/drivers/base/dd.c
>> +++ b/drivers/base/dd.c
>> @@ -25,6 +25,7 @@
>> #include <linux/kthread.h>
>> #include <linux/wait.h>
>> #include <linux/async.h>
>> +#include <linux/pm_domain.h>
>> #include <linux/pm_runtime.h>
>> #include <linux/pinctrl/devinfo.h>
>> #include <linux/slab.h>
>> @@ -552,8 +553,11 @@ static void device_unbind_cleanup(struct device *dev)
>> dev->dma_range_map = NULL;
>> device_set_driver(dev, NULL);
>> dev_set_drvdata(dev, NULL);
>> - if (dev->pm_domain && dev->pm_domain->dismiss)
>> - dev->pm_domain->dismiss(dev);
>> + if (dev->pm_domain) {
>> + if (dev->pm_domain->dismiss)
>> + dev->pm_domain->dismiss(dev);
>> + dev_pm_domain_detach(dev, dev->pm_domain->detach_power_off);
>
> I would do the "detach" before the "dismiss" to retain the current ordering.
OK.
>
> Also it is interesting that you ended up calling them both in one
> place. It kind of indicates that the PM domains attached via
> dev_pm_domain_attach() should be attached earlier and just use the
> ->activate() and ->dismiss() callbacks.
>
>> + }
>> pm_runtime_reinit(dev);
>> dev_pm_set_driver_flags(dev, 0);
>> }
>> diff --git a/drivers/base/platform.c b/drivers/base/platform.c
>> index 075ec1d1b73a..2459be6aecf4 100644
>> --- a/drivers/base/platform.c
>> +++ b/drivers/base/platform.c
>> @@ -1400,11 +1400,8 @@ static int platform_probe(struct device *_dev)
>> if (ret)
>> goto out;
>>
>> - if (drv->probe) {
>> + if (drv->probe)
>> ret = drv->probe(dev);
>> - if (ret)
>> - dev_pm_domain_detach(_dev, true);
>> - }
>>
>> out:
>> if (drv->prevent_deferred_probe && ret == -EPROBE_DEFER) {
>> @@ -1422,7 +1419,6 @@ static void platform_remove(struct device *_dev)
>>
>> if (drv->remove)
>> drv->remove(dev);
>> - dev_pm_domain_detach(_dev, true);
>> }
>>
>> static void platform_shutdown(struct device *_dev)
>> diff --git a/drivers/base/power/common.c b/drivers/base/power/common.c
>> index 781968a128ff..4bd1e3c7f401 100644
>> --- a/drivers/base/power/common.c
>> +++ b/drivers/base/power/common.c
>> @@ -111,6 +111,9 @@ int dev_pm_domain_attach(struct device *dev, bool power_on)
>> if (!ret)
>> ret = genpd_dev_pm_attach(dev);
>>
>> + if (dev->pm_domain)
>> + dev->pm_domain->detach_power_off = power_on;
>
> This will not work for acpi_general_pm_domain because it is shared by
> all of its users.
>
> It is likely to not work for shared PM domains in general.
>
> I would put the new flag into struct dev_pm_info.
OK, I'll do it.
Thank you for your input,
Claudiu
>
>> +
>> return ret < 0 ? ret : 0;
>> }
>> EXPORT_SYMBOL_GPL(dev_pm_domain_attach);
>> diff --git a/include/linux/pm.h b/include/linux/pm.h
>> index f0bd8fbae4f2..12e97e09e85c 100644
>> --- a/include/linux/pm.h
>> +++ b/include/linux/pm.h
>> @@ -748,6 +748,7 @@ struct dev_pm_domain {
>> void (*sync)(struct device *dev);
>> void (*dismiss)(struct device *dev);
>> int (*set_performance_state)(struct device *dev, unsigned int state);
>> + bool detach_power_off;
>> };
>>
>> Rafael, Ulf, Dmitry, Jonathan, all,
>>
>> Could you please let me know how do you consider this approach?
>
> Please see my comments above.
>
> Thanks!
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-13 10:02 ` Rafael J. Wysocki
2025-06-13 12:44 ` Claudiu Beznea
@ 2025-06-16 9:37 ` Claudiu Beznea
2025-06-16 11:18 ` Rafael J. Wysocki
1 sibling, 1 reply; 22+ messages in thread
From: Claudiu Beznea @ 2025-06-16 9:37 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Jonathan Cameron, Dmitry Torokhov, gregkh, dakr, len.brown, pavel,
ulf.hansson, daniel.lezcano, linux-kernel, linux-pm, bhelgaas,
geert, linux-iio, linux-renesas-soc, fabrizio.castro.jz,
Claudiu Beznea
Hi, Rafael,
On 13.06.2025 13:02, Rafael J. Wysocki wrote:
> On Fri, Jun 13, 2025 at 9:39 AM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
>>
>> Hi, Rafael,
>>
>> On 09.06.2025 22:59, Rafael J. Wysocki wrote:
>>> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
>>>>
>>>> On Fri, 6 Jun 2025 22:01:52 +0200
>>>> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
>>>>
>>>> Hi Rafael,
>>>>
>>>>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
>>>>> <dmitry.torokhov@gmail.com> wrote:
>>>>>>
>>>>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
>>>>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
>>>>>>>>
>>>>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
>>>>>>>>
>>>>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
>>>>>>>> dev_pm_domain_detach(), often following patterns like:
>>>>>>>>
>>>>>>>> static int bus_probe(struct device *_dev)
>>>>>>>> {
>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>>>> int ret;
>>>>>>>>
>>>>>>>> // ...
>>>>>>>>
>>>>>>>> ret = dev_pm_domain_attach(_dev, true);
>>>>>>>> if (ret)
>>>>>>>> return ret;
>>>>>>>>
>>>>>>>> if (drv->probe)
>>>>>>>> ret = drv->probe(dev);
>>>>>>>>
>>>>>>>> // ...
>>>>>>>> }
>>>>>>>>
>>>>>>>> static void bus_remove(struct device *_dev)
>>>>>>>> {
>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>>>>
>>>>>>>> if (drv->remove)
>>>>>>>> drv->remove(dev);
>>>>>>>> dev_pm_domain_detach(_dev);
>>>>>>>> }
>>>>>>>>
>>>>>>>> When the driver's probe function uses devres-managed resources that depend
>>>>>>>> on the power domain state, those resources are released later during
>>>>>>>> device_unbind_cleanup().
>>>>>>>>
>>>>>>>> Releasing devres-managed resources that depend on the power domain state
>>>>>>>> after detaching the device from its PM domain can cause failures.
>>>>>>>>
>>>>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
>>>>>>>> function, and the device's clocks are managed by the PM domain, then
>>>>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
>>>>>>>> the clocks have been removed from the PM domain. It may happen that the
>>>>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
>>>>>>>
>>>>>>> Don't use devm_pm_runtime_enable() then.
>>>>>>
>>>>>> What about other devm_ APIs? Are you suggesting that platform drivers
>>>>>> should not be using devm_clk*(), devm_regulator_*(),
>>>>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
>>>>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
>>>>>> off the device too early, before cleanup code has a chance to execute
>>>>>> proper cleanup.
>>>>>>
>>>>>> The issue is not limited to runtime PM.
>>>>>>
>>>>>>>
>>>>>>>> If the driver specific runtime PM APIs access registers directly, this
>>>>>>>> will lead to accessing device registers without clocks being enabled.
>>>>>>>> Similar issues may occur with other devres actions that access device
>>>>>>>> registers.
>>>>>>>>
>>>>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
>>>>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
>>>>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
>>>>>>>> after all driver's devres-managed resources have been release.
>>>>>>>>
>>>>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
>>>>>>>> arguments, one for the domain state on attach, one for the domain state on
>>>>>>>> detach.
>>>>>>>
>>>>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
>>>>>>
>>>>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
>>>>>
>>>>> Yes, among other things.
>>>>
>>>> Maybe naming could make abuse at least obvious to spot? e.g.
>>>> pm_domain_attach_with_devm_release()
>>>
>>> If I'm not mistaken, it is not even necessary to use devres for this.
>>>
>>> You might as well add a dev_pm_domain_detach() call to
>>> device_unbind_cleanup() after devres_release_all(). There is a slight
>>> complication related to the second argument of it, but I suppose that
>>> this can be determined at the attach time and stored in a new device
>>> PM flag, or similar.
>>>
>>
>> I looked into this solution. I've tested it for all my failure cases and
>> went good.
>
> OK
>
>>> Note that dev->pm_domain is expected to be cleared by ->detach(), so
>>> this should not cause the domain to be detached twice in a row from
>>> the same device, but that needs to be double-checked.
>>
>> The genpd_dev_pm_detach() calls genpd_remove_device() ->
>> dev_pm_domain_set(dev, NULL) which sets the dev->pm_domain = NULL. I can't
>> find any other detach function in the current code base.
>
> There is also acpi_dev_pm_detach() which can be somewhat hard to find,
> but it calls dev_pm_domain_set(dev, NULL) either.
>
>> The code I've tested for this solution is this one:
>>
>> diff --git a/drivers/base/dd.c b/drivers/base/dd.c
>> index b526e0e0f52d..5e9750d007b4 100644
>> --- a/drivers/base/dd.c
>> +++ b/drivers/base/dd.c
>> @@ -25,6 +25,7 @@
>> #include <linux/kthread.h>
>> #include <linux/wait.h>
>> #include <linux/async.h>
>> +#include <linux/pm_domain.h>
>> #include <linux/pm_runtime.h>
>> #include <linux/pinctrl/devinfo.h>
>> #include <linux/slab.h>
>> @@ -552,8 +553,11 @@ static void device_unbind_cleanup(struct device *dev)
>> dev->dma_range_map = NULL;
>> device_set_driver(dev, NULL);
>> dev_set_drvdata(dev, NULL);
>> - if (dev->pm_domain && dev->pm_domain->dismiss)
>> - dev->pm_domain->dismiss(dev);
>> + if (dev->pm_domain) {
>> + if (dev->pm_domain->dismiss)
>> + dev->pm_domain->dismiss(dev);
>> + dev_pm_domain_detach(dev, dev->pm_domain->detach_power_off);
>
> I would do the "detach" before the "dismiss" to retain the current ordering.
I applied on my local development branch all your suggestions except this
one because genpd_dev_pm_detach() as well as acpi_dev_pm_detach() set
dev->pm_domain = NULL.
Due to this I would call first ->dismiss() then ->detach(), as initially
proposed. Please let me know if you consider it otherwise.
Thank you,
Claudiu
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-16 9:37 ` Claudiu Beznea
@ 2025-06-16 11:18 ` Rafael J. Wysocki
2025-06-16 11:37 ` Claudiu Beznea
0 siblings, 1 reply; 22+ messages in thread
From: Rafael J. Wysocki @ 2025-06-16 11:18 UTC (permalink / raw)
To: Claudiu Beznea
Cc: Rafael J. Wysocki, Jonathan Cameron, Dmitry Torokhov, gregkh,
dakr, len.brown, pavel, ulf.hansson, daniel.lezcano, linux-kernel,
linux-pm, bhelgaas, geert, linux-iio, linux-renesas-soc,
fabrizio.castro.jz, Claudiu Beznea
On Mon, Jun 16, 2025 at 11:37 AM Claudiu Beznea
<claudiu.beznea@tuxon.dev> wrote:
>
> Hi, Rafael,
>
> On 13.06.2025 13:02, Rafael J. Wysocki wrote:
> > On Fri, Jun 13, 2025 at 9:39 AM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
> >>
> >> Hi, Rafael,
> >>
> >> On 09.06.2025 22:59, Rafael J. Wysocki wrote:
> >>> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
> >>>>
> >>>> On Fri, 6 Jun 2025 22:01:52 +0200
> >>>> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
> >>>>
> >>>> Hi Rafael,
> >>>>
> >>>>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
> >>>>> <dmitry.torokhov@gmail.com> wrote:
> >>>>>>
> >>>>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
> >>>>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
> >>>>>>>>
> >>>>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
> >>>>>>>>
> >>>>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
> >>>>>>>> dev_pm_domain_detach(), often following patterns like:
> >>>>>>>>
> >>>>>>>> static int bus_probe(struct device *_dev)
> >>>>>>>> {
> >>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> >>>>>>>> struct bus_device *dev = to_bus_device(_dev);
> >>>>>>>> int ret;
> >>>>>>>>
> >>>>>>>> // ...
> >>>>>>>>
> >>>>>>>> ret = dev_pm_domain_attach(_dev, true);
> >>>>>>>> if (ret)
> >>>>>>>> return ret;
> >>>>>>>>
> >>>>>>>> if (drv->probe)
> >>>>>>>> ret = drv->probe(dev);
> >>>>>>>>
> >>>>>>>> // ...
> >>>>>>>> }
> >>>>>>>>
> >>>>>>>> static void bus_remove(struct device *_dev)
> >>>>>>>> {
> >>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> >>>>>>>> struct bus_device *dev = to_bus_device(_dev);
> >>>>>>>>
> >>>>>>>> if (drv->remove)
> >>>>>>>> drv->remove(dev);
> >>>>>>>> dev_pm_domain_detach(_dev);
> >>>>>>>> }
> >>>>>>>>
> >>>>>>>> When the driver's probe function uses devres-managed resources that depend
> >>>>>>>> on the power domain state, those resources are released later during
> >>>>>>>> device_unbind_cleanup().
> >>>>>>>>
> >>>>>>>> Releasing devres-managed resources that depend on the power domain state
> >>>>>>>> after detaching the device from its PM domain can cause failures.
> >>>>>>>>
> >>>>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
> >>>>>>>> function, and the device's clocks are managed by the PM domain, then
> >>>>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
> >>>>>>>> the clocks have been removed from the PM domain. It may happen that the
> >>>>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
> >>>>>>>
> >>>>>>> Don't use devm_pm_runtime_enable() then.
> >>>>>>
> >>>>>> What about other devm_ APIs? Are you suggesting that platform drivers
> >>>>>> should not be using devm_clk*(), devm_regulator_*(),
> >>>>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
> >>>>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
> >>>>>> off the device too early, before cleanup code has a chance to execute
> >>>>>> proper cleanup.
> >>>>>>
> >>>>>> The issue is not limited to runtime PM.
> >>>>>>
> >>>>>>>
> >>>>>>>> If the driver specific runtime PM APIs access registers directly, this
> >>>>>>>> will lead to accessing device registers without clocks being enabled.
> >>>>>>>> Similar issues may occur with other devres actions that access device
> >>>>>>>> registers.
> >>>>>>>>
> >>>>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
> >>>>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
> >>>>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
> >>>>>>>> after all driver's devres-managed resources have been release.
> >>>>>>>>
> >>>>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
> >>>>>>>> arguments, one for the domain state on attach, one for the domain state on
> >>>>>>>> detach.
> >>>>>>>
> >>>>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
> >>>>>>
> >>>>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
> >>>>>
> >>>>> Yes, among other things.
> >>>>
> >>>> Maybe naming could make abuse at least obvious to spot? e.g.
> >>>> pm_domain_attach_with_devm_release()
> >>>
> >>> If I'm not mistaken, it is not even necessary to use devres for this.
> >>>
> >>> You might as well add a dev_pm_domain_detach() call to
> >>> device_unbind_cleanup() after devres_release_all(). There is a slight
> >>> complication related to the second argument of it, but I suppose that
> >>> this can be determined at the attach time and stored in a new device
> >>> PM flag, or similar.
> >>>
> >>
> >> I looked into this solution. I've tested it for all my failure cases and
> >> went good.
> >
> > OK
> >
> >>> Note that dev->pm_domain is expected to be cleared by ->detach(), so
> >>> this should not cause the domain to be detached twice in a row from
> >>> the same device, but that needs to be double-checked.
> >>
> >> The genpd_dev_pm_detach() calls genpd_remove_device() ->
> >> dev_pm_domain_set(dev, NULL) which sets the dev->pm_domain = NULL. I can't
> >> find any other detach function in the current code base.
> >
> > There is also acpi_dev_pm_detach() which can be somewhat hard to find,
> > but it calls dev_pm_domain_set(dev, NULL) either.
> >
> >> The code I've tested for this solution is this one:
> >>
> >> diff --git a/drivers/base/dd.c b/drivers/base/dd.c
> >> index b526e0e0f52d..5e9750d007b4 100644
> >> --- a/drivers/base/dd.c
> >> +++ b/drivers/base/dd.c
> >> @@ -25,6 +25,7 @@
> >> #include <linux/kthread.h>
> >> #include <linux/wait.h>
> >> #include <linux/async.h>
> >> +#include <linux/pm_domain.h>
> >> #include <linux/pm_runtime.h>
> >> #include <linux/pinctrl/devinfo.h>
> >> #include <linux/slab.h>
> >> @@ -552,8 +553,11 @@ static void device_unbind_cleanup(struct device *dev)
> >> dev->dma_range_map = NULL;
> >> device_set_driver(dev, NULL);
> >> dev_set_drvdata(dev, NULL);
> >> - if (dev->pm_domain && dev->pm_domain->dismiss)
> >> - dev->pm_domain->dismiss(dev);
> >> + if (dev->pm_domain) {
> >> + if (dev->pm_domain->dismiss)
> >> + dev->pm_domain->dismiss(dev);
> >> + dev_pm_domain_detach(dev, dev->pm_domain->detach_power_off);
> >
> > I would do the "detach" before the "dismiss" to retain the current ordering.
>
> I applied on my local development branch all your suggestions except this
> one because genpd_dev_pm_detach() as well as acpi_dev_pm_detach() set
> dev->pm_domain = NULL.
>
> Due to this I would call first ->dismiss() then ->detach(), as initially
> proposed. Please let me know if you consider it otherwise.
This is a matter of adding one more dev->pm_domain check AFAICS, but OK.
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-16 11:18 ` Rafael J. Wysocki
@ 2025-06-16 11:37 ` Claudiu Beznea
2025-06-16 11:47 ` Rafael J. Wysocki
0 siblings, 1 reply; 22+ messages in thread
From: Claudiu Beznea @ 2025-06-16 11:37 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Jonathan Cameron, Dmitry Torokhov, gregkh, dakr, len.brown, pavel,
ulf.hansson, daniel.lezcano, linux-kernel, linux-pm, bhelgaas,
geert, linux-iio, linux-renesas-soc, fabrizio.castro.jz,
Claudiu Beznea
On 16.06.2025 14:18, Rafael J. Wysocki wrote:
> On Mon, Jun 16, 2025 at 11:37 AM Claudiu Beznea
> <claudiu.beznea@tuxon.dev> wrote:
>>
>> Hi, Rafael,
>>
>> On 13.06.2025 13:02, Rafael J. Wysocki wrote:
>>> On Fri, Jun 13, 2025 at 9:39 AM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
>>>>
>>>> Hi, Rafael,
>>>>
>>>> On 09.06.2025 22:59, Rafael J. Wysocki wrote:
>>>>> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
>>>>>>
>>>>>> On Fri, 6 Jun 2025 22:01:52 +0200
>>>>>> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
>>>>>>
>>>>>> Hi Rafael,
>>>>>>
>>>>>>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
>>>>>>> <dmitry.torokhov@gmail.com> wrote:
>>>>>>>>
>>>>>>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
>>>>>>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
>>>>>>>>>>
>>>>>>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
>>>>>>>>>>
>>>>>>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
>>>>>>>>>> dev_pm_domain_detach(), often following patterns like:
>>>>>>>>>>
>>>>>>>>>> static int bus_probe(struct device *_dev)
>>>>>>>>>> {
>>>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>>>>>> int ret;
>>>>>>>>>>
>>>>>>>>>> // ...
>>>>>>>>>>
>>>>>>>>>> ret = dev_pm_domain_attach(_dev, true);
>>>>>>>>>> if (ret)
>>>>>>>>>> return ret;
>>>>>>>>>>
>>>>>>>>>> if (drv->probe)
>>>>>>>>>> ret = drv->probe(dev);
>>>>>>>>>>
>>>>>>>>>> // ...
>>>>>>>>>> }
>>>>>>>>>>
>>>>>>>>>> static void bus_remove(struct device *_dev)
>>>>>>>>>> {
>>>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
>>>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
>>>>>>>>>>
>>>>>>>>>> if (drv->remove)
>>>>>>>>>> drv->remove(dev);
>>>>>>>>>> dev_pm_domain_detach(_dev);
>>>>>>>>>> }
>>>>>>>>>>
>>>>>>>>>> When the driver's probe function uses devres-managed resources that depend
>>>>>>>>>> on the power domain state, those resources are released later during
>>>>>>>>>> device_unbind_cleanup().
>>>>>>>>>>
>>>>>>>>>> Releasing devres-managed resources that depend on the power domain state
>>>>>>>>>> after detaching the device from its PM domain can cause failures.
>>>>>>>>>>
>>>>>>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
>>>>>>>>>> function, and the device's clocks are managed by the PM domain, then
>>>>>>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
>>>>>>>>>> the clocks have been removed from the PM domain. It may happen that the
>>>>>>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
>>>>>>>>>
>>>>>>>>> Don't use devm_pm_runtime_enable() then.
>>>>>>>>
>>>>>>>> What about other devm_ APIs? Are you suggesting that platform drivers
>>>>>>>> should not be using devm_clk*(), devm_regulator_*(),
>>>>>>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
>>>>>>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
>>>>>>>> off the device too early, before cleanup code has a chance to execute
>>>>>>>> proper cleanup.
>>>>>>>>
>>>>>>>> The issue is not limited to runtime PM.
>>>>>>>>
>>>>>>>>>
>>>>>>>>>> If the driver specific runtime PM APIs access registers directly, this
>>>>>>>>>> will lead to accessing device registers without clocks being enabled.
>>>>>>>>>> Similar issues may occur with other devres actions that access device
>>>>>>>>>> registers.
>>>>>>>>>>
>>>>>>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
>>>>>>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
>>>>>>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
>>>>>>>>>> after all driver's devres-managed resources have been release.
>>>>>>>>>>
>>>>>>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
>>>>>>>>>> arguments, one for the domain state on attach, one for the domain state on
>>>>>>>>>> detach.
>>>>>>>>>
>>>>>>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
>>>>>>>>
>>>>>>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
>>>>>>>
>>>>>>> Yes, among other things.
>>>>>>
>>>>>> Maybe naming could make abuse at least obvious to spot? e.g.
>>>>>> pm_domain_attach_with_devm_release()
>>>>>
>>>>> If I'm not mistaken, it is not even necessary to use devres for this.
>>>>>
>>>>> You might as well add a dev_pm_domain_detach() call to
>>>>> device_unbind_cleanup() after devres_release_all(). There is a slight
>>>>> complication related to the second argument of it, but I suppose that
>>>>> this can be determined at the attach time and stored in a new device
>>>>> PM flag, or similar.
>>>>>
>>>>
>>>> I looked into this solution. I've tested it for all my failure cases and
>>>> went good.
>>>
>>> OK
>>>
>>>>> Note that dev->pm_domain is expected to be cleared by ->detach(), so
>>>>> this should not cause the domain to be detached twice in a row from
>>>>> the same device, but that needs to be double-checked.
>>>>
>>>> The genpd_dev_pm_detach() calls genpd_remove_device() ->
>>>> dev_pm_domain_set(dev, NULL) which sets the dev->pm_domain = NULL. I can't
>>>> find any other detach function in the current code base.
>>>
>>> There is also acpi_dev_pm_detach() which can be somewhat hard to find,
>>> but it calls dev_pm_domain_set(dev, NULL) either.
>>>
>>>> The code I've tested for this solution is this one:
>>>>
>>>> diff --git a/drivers/base/dd.c b/drivers/base/dd.c
>>>> index b526e0e0f52d..5e9750d007b4 100644
>>>> --- a/drivers/base/dd.c
>>>> +++ b/drivers/base/dd.c
>>>> @@ -25,6 +25,7 @@
>>>> #include <linux/kthread.h>
>>>> #include <linux/wait.h>
>>>> #include <linux/async.h>
>>>> +#include <linux/pm_domain.h>
>>>> #include <linux/pm_runtime.h>
>>>> #include <linux/pinctrl/devinfo.h>
>>>> #include <linux/slab.h>
>>>> @@ -552,8 +553,11 @@ static void device_unbind_cleanup(struct device *dev)
>>>> dev->dma_range_map = NULL;
>>>> device_set_driver(dev, NULL);
>>>> dev_set_drvdata(dev, NULL);
>>>> - if (dev->pm_domain && dev->pm_domain->dismiss)
>>>> - dev->pm_domain->dismiss(dev);
>>>> + if (dev->pm_domain) {
>>>> + if (dev->pm_domain->dismiss)
>>>> + dev->pm_domain->dismiss(dev);
>>>> + dev_pm_domain_detach(dev, dev->pm_domain->detach_power_off);
>>>
>>> I would do the "detach" before the "dismiss" to retain the current ordering.
>>
>> I applied on my local development branch all your suggestions except this
>> one because genpd_dev_pm_detach() as well as acpi_dev_pm_detach() set
>> dev->pm_domain = NULL.
>>
>> Due to this I would call first ->dismiss() then ->detach(), as initially
>> proposed. Please let me know if you consider it otherwise.
>
> This is a matter of adding one more dev->pm_domain check AFAICS, but OK.
I don't know all the subtleties around this, my concern with adding one
more check on dev->pm_domain was that the
dev->pm_domain->dismiss() will never be called if the ->detach() function
will be called before ->dismiss() and it will set dev->pm_domain = NULL (as
it does today (though genpd_dev_pm_detach() and acpi_dev_pm_detach())).
For platform drivers that used to call dev_pm_domain_detach() in platform
bus remove function, if I'm not wrong, the dev->pm_domain->dismiss() was
never called previously. If that is a valid scenario, the code proposed in
this thread will change the behavior for devices that have ->dismiss()
implemented.
Thank you,
Claudiu
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-16 11:37 ` Claudiu Beznea
@ 2025-06-16 11:47 ` Rafael J. Wysocki
2025-06-19 12:21 ` Ulf Hansson
0 siblings, 1 reply; 22+ messages in thread
From: Rafael J. Wysocki @ 2025-06-16 11:47 UTC (permalink / raw)
To: Claudiu Beznea
Cc: Rafael J. Wysocki, Jonathan Cameron, Dmitry Torokhov, gregkh,
dakr, len.brown, pavel, ulf.hansson, daniel.lezcano, linux-kernel,
linux-pm, bhelgaas, geert, linux-iio, linux-renesas-soc,
fabrizio.castro.jz, Claudiu Beznea
On Mon, Jun 16, 2025 at 1:37 PM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
>
>
>
> On 16.06.2025 14:18, Rafael J. Wysocki wrote:
> > On Mon, Jun 16, 2025 at 11:37 AM Claudiu Beznea
> > <claudiu.beznea@tuxon.dev> wrote:
> >>
> >> Hi, Rafael,
> >>
> >> On 13.06.2025 13:02, Rafael J. Wysocki wrote:
> >>> On Fri, Jun 13, 2025 at 9:39 AM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
> >>>>
> >>>> Hi, Rafael,
> >>>>
> >>>> On 09.06.2025 22:59, Rafael J. Wysocki wrote:
> >>>>> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
> >>>>>>
> >>>>>> On Fri, 6 Jun 2025 22:01:52 +0200
> >>>>>> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
> >>>>>>
> >>>>>> Hi Rafael,
> >>>>>>
> >>>>>>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
> >>>>>>> <dmitry.torokhov@gmail.com> wrote:
> >>>>>>>>
> >>>>>>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
> >>>>>>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
> >>>>>>>>>>
> >>>>>>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
> >>>>>>>>>>
> >>>>>>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
> >>>>>>>>>> dev_pm_domain_detach(), often following patterns like:
> >>>>>>>>>>
> >>>>>>>>>> static int bus_probe(struct device *_dev)
> >>>>>>>>>> {
> >>>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> >>>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
> >>>>>>>>>> int ret;
> >>>>>>>>>>
> >>>>>>>>>> // ...
> >>>>>>>>>>
> >>>>>>>>>> ret = dev_pm_domain_attach(_dev, true);
> >>>>>>>>>> if (ret)
> >>>>>>>>>> return ret;
> >>>>>>>>>>
> >>>>>>>>>> if (drv->probe)
> >>>>>>>>>> ret = drv->probe(dev);
> >>>>>>>>>>
> >>>>>>>>>> // ...
> >>>>>>>>>> }
> >>>>>>>>>>
> >>>>>>>>>> static void bus_remove(struct device *_dev)
> >>>>>>>>>> {
> >>>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> >>>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
> >>>>>>>>>>
> >>>>>>>>>> if (drv->remove)
> >>>>>>>>>> drv->remove(dev);
> >>>>>>>>>> dev_pm_domain_detach(_dev);
> >>>>>>>>>> }
> >>>>>>>>>>
> >>>>>>>>>> When the driver's probe function uses devres-managed resources that depend
> >>>>>>>>>> on the power domain state, those resources are released later during
> >>>>>>>>>> device_unbind_cleanup().
> >>>>>>>>>>
> >>>>>>>>>> Releasing devres-managed resources that depend on the power domain state
> >>>>>>>>>> after detaching the device from its PM domain can cause failures.
> >>>>>>>>>>
> >>>>>>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
> >>>>>>>>>> function, and the device's clocks are managed by the PM domain, then
> >>>>>>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
> >>>>>>>>>> the clocks have been removed from the PM domain. It may happen that the
> >>>>>>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
> >>>>>>>>>
> >>>>>>>>> Don't use devm_pm_runtime_enable() then.
> >>>>>>>>
> >>>>>>>> What about other devm_ APIs? Are you suggesting that platform drivers
> >>>>>>>> should not be using devm_clk*(), devm_regulator_*(),
> >>>>>>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
> >>>>>>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
> >>>>>>>> off the device too early, before cleanup code has a chance to execute
> >>>>>>>> proper cleanup.
> >>>>>>>>
> >>>>>>>> The issue is not limited to runtime PM.
> >>>>>>>>
> >>>>>>>>>
> >>>>>>>>>> If the driver specific runtime PM APIs access registers directly, this
> >>>>>>>>>> will lead to accessing device registers without clocks being enabled.
> >>>>>>>>>> Similar issues may occur with other devres actions that access device
> >>>>>>>>>> registers.
> >>>>>>>>>>
> >>>>>>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
> >>>>>>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
> >>>>>>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
> >>>>>>>>>> after all driver's devres-managed resources have been release.
> >>>>>>>>>>
> >>>>>>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
> >>>>>>>>>> arguments, one for the domain state on attach, one for the domain state on
> >>>>>>>>>> detach.
> >>>>>>>>>
> >>>>>>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
> >>>>>>>>
> >>>>>>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
> >>>>>>>
> >>>>>>> Yes, among other things.
> >>>>>>
> >>>>>> Maybe naming could make abuse at least obvious to spot? e.g.
> >>>>>> pm_domain_attach_with_devm_release()
> >>>>>
> >>>>> If I'm not mistaken, it is not even necessary to use devres for this.
> >>>>>
> >>>>> You might as well add a dev_pm_domain_detach() call to
> >>>>> device_unbind_cleanup() after devres_release_all(). There is a slight
> >>>>> complication related to the second argument of it, but I suppose that
> >>>>> this can be determined at the attach time and stored in a new device
> >>>>> PM flag, or similar.
> >>>>>
> >>>>
> >>>> I looked into this solution. I've tested it for all my failure cases and
> >>>> went good.
> >>>
> >>> OK
> >>>
> >>>>> Note that dev->pm_domain is expected to be cleared by ->detach(), so
> >>>>> this should not cause the domain to be detached twice in a row from
> >>>>> the same device, but that needs to be double-checked.
> >>>>
> >>>> The genpd_dev_pm_detach() calls genpd_remove_device() ->
> >>>> dev_pm_domain_set(dev, NULL) which sets the dev->pm_domain = NULL. I can't
> >>>> find any other detach function in the current code base.
> >>>
> >>> There is also acpi_dev_pm_detach() which can be somewhat hard to find,
> >>> but it calls dev_pm_domain_set(dev, NULL) either.
> >>>
> >>>> The code I've tested for this solution is this one:
> >>>>
> >>>> diff --git a/drivers/base/dd.c b/drivers/base/dd.c
> >>>> index b526e0e0f52d..5e9750d007b4 100644
> >>>> --- a/drivers/base/dd.c
> >>>> +++ b/drivers/base/dd.c
> >>>> @@ -25,6 +25,7 @@
> >>>> #include <linux/kthread.h>
> >>>> #include <linux/wait.h>
> >>>> #include <linux/async.h>
> >>>> +#include <linux/pm_domain.h>
> >>>> #include <linux/pm_runtime.h>
> >>>> #include <linux/pinctrl/devinfo.h>
> >>>> #include <linux/slab.h>
> >>>> @@ -552,8 +553,11 @@ static void device_unbind_cleanup(struct device *dev)
> >>>> dev->dma_range_map = NULL;
> >>>> device_set_driver(dev, NULL);
> >>>> dev_set_drvdata(dev, NULL);
> >>>> - if (dev->pm_domain && dev->pm_domain->dismiss)
> >>>> - dev->pm_domain->dismiss(dev);
> >>>> + if (dev->pm_domain) {
> >>>> + if (dev->pm_domain->dismiss)
> >>>> + dev->pm_domain->dismiss(dev);
> >>>> + dev_pm_domain_detach(dev, dev->pm_domain->detach_power_off);
> >>>
> >>> I would do the "detach" before the "dismiss" to retain the current ordering.
> >>
> >> I applied on my local development branch all your suggestions except this
> >> one because genpd_dev_pm_detach() as well as acpi_dev_pm_detach() set
> >> dev->pm_domain = NULL.
> >>
> >> Due to this I would call first ->dismiss() then ->detach(), as initially
> >> proposed. Please let me know if you consider it otherwise.
> >
> > This is a matter of adding one more dev->pm_domain check AFAICS, but OK.
>
> I don't know all the subtleties around this, my concern with adding one
> more check on dev->pm_domain was that the
> dev->pm_domain->dismiss() will never be called if the ->detach() function
> will be called before ->dismiss() and it will set dev->pm_domain = NULL (as
> it does today (though genpd_dev_pm_detach() and acpi_dev_pm_detach())).
>
> For platform drivers that used to call dev_pm_domain_detach() in platform
> bus remove function, if I'm not wrong, the dev->pm_domain->dismiss() was
> never called previously. If that is a valid scenario, the code proposed in
> this thread will change the behavior for devices that have ->dismiss()
> implemented.
->dismiss() and ->detach() are supposed to be mutually exclusive, so
this should not be a problem either way and in practice so far the
only user of ->dismiss() has been acpi_lpss_pm_domain which doesn't do
->detach() at all.
So this is your call.
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-16 11:47 ` Rafael J. Wysocki
@ 2025-06-19 12:21 ` Ulf Hansson
2025-06-21 11:15 ` Rafael J. Wysocki
0 siblings, 1 reply; 22+ messages in thread
From: Ulf Hansson @ 2025-06-19 12:21 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Claudiu Beznea, Jonathan Cameron, Dmitry Torokhov, gregkh, dakr,
len.brown, pavel, daniel.lezcano, linux-kernel, linux-pm,
bhelgaas, geert, linux-iio, linux-renesas-soc, fabrizio.castro.jz,
Claudiu Beznea
On Mon, 16 Jun 2025 at 13:47, Rafael J. Wysocki <rafael@kernel.org> wrote:
>
> On Mon, Jun 16, 2025 at 1:37 PM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
> >
> >
> >
> > On 16.06.2025 14:18, Rafael J. Wysocki wrote:
> > > On Mon, Jun 16, 2025 at 11:37 AM Claudiu Beznea
> > > <claudiu.beznea@tuxon.dev> wrote:
> > >>
> > >> Hi, Rafael,
> > >>
> > >> On 13.06.2025 13:02, Rafael J. Wysocki wrote:
> > >>> On Fri, Jun 13, 2025 at 9:39 AM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
> > >>>>
> > >>>> Hi, Rafael,
> > >>>>
> > >>>> On 09.06.2025 22:59, Rafael J. Wysocki wrote:
> > >>>>> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
> > >>>>>>
> > >>>>>> On Fri, 6 Jun 2025 22:01:52 +0200
> > >>>>>> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
> > >>>>>>
> > >>>>>> Hi Rafael,
> > >>>>>>
> > >>>>>>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
> > >>>>>>> <dmitry.torokhov@gmail.com> wrote:
> > >>>>>>>>
> > >>>>>>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
> > >>>>>>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
> > >>>>>>>>>>
> > >>>>>>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
> > >>>>>>>>>>
> > >>>>>>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
> > >>>>>>>>>> dev_pm_domain_detach(), often following patterns like:
> > >>>>>>>>>>
> > >>>>>>>>>> static int bus_probe(struct device *_dev)
> > >>>>>>>>>> {
> > >>>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> > >>>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
> > >>>>>>>>>> int ret;
> > >>>>>>>>>>
> > >>>>>>>>>> // ...
> > >>>>>>>>>>
> > >>>>>>>>>> ret = dev_pm_domain_attach(_dev, true);
> > >>>>>>>>>> if (ret)
> > >>>>>>>>>> return ret;
> > >>>>>>>>>>
> > >>>>>>>>>> if (drv->probe)
> > >>>>>>>>>> ret = drv->probe(dev);
> > >>>>>>>>>>
> > >>>>>>>>>> // ...
> > >>>>>>>>>> }
> > >>>>>>>>>>
> > >>>>>>>>>> static void bus_remove(struct device *_dev)
> > >>>>>>>>>> {
> > >>>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> > >>>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
> > >>>>>>>>>>
> > >>>>>>>>>> if (drv->remove)
> > >>>>>>>>>> drv->remove(dev);
> > >>>>>>>>>> dev_pm_domain_detach(_dev);
> > >>>>>>>>>> }
> > >>>>>>>>>>
> > >>>>>>>>>> When the driver's probe function uses devres-managed resources that depend
> > >>>>>>>>>> on the power domain state, those resources are released later during
> > >>>>>>>>>> device_unbind_cleanup().
> > >>>>>>>>>>
> > >>>>>>>>>> Releasing devres-managed resources that depend on the power domain state
> > >>>>>>>>>> after detaching the device from its PM domain can cause failures.
> > >>>>>>>>>>
> > >>>>>>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
> > >>>>>>>>>> function, and the device's clocks are managed by the PM domain, then
> > >>>>>>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
> > >>>>>>>>>> the clocks have been removed from the PM domain. It may happen that the
> > >>>>>>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
> > >>>>>>>>>
> > >>>>>>>>> Don't use devm_pm_runtime_enable() then.
> > >>>>>>>>
> > >>>>>>>> What about other devm_ APIs? Are you suggesting that platform drivers
> > >>>>>>>> should not be using devm_clk*(), devm_regulator_*(),
> > >>>>>>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
> > >>>>>>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
> > >>>>>>>> off the device too early, before cleanup code has a chance to execute
> > >>>>>>>> proper cleanup.
> > >>>>>>>>
> > >>>>>>>> The issue is not limited to runtime PM.
> > >>>>>>>>
> > >>>>>>>>>
> > >>>>>>>>>> If the driver specific runtime PM APIs access registers directly, this
> > >>>>>>>>>> will lead to accessing device registers without clocks being enabled.
> > >>>>>>>>>> Similar issues may occur with other devres actions that access device
> > >>>>>>>>>> registers.
> > >>>>>>>>>>
> > >>>>>>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
> > >>>>>>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
> > >>>>>>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
> > >>>>>>>>>> after all driver's devres-managed resources have been release.
> > >>>>>>>>>>
> > >>>>>>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
> > >>>>>>>>>> arguments, one for the domain state on attach, one for the domain state on
> > >>>>>>>>>> detach.
> > >>>>>>>>>
> > >>>>>>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
> > >>>>>>>>
> > >>>>>>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
> > >>>>>>>
> > >>>>>>> Yes, among other things.
> > >>>>>>
> > >>>>>> Maybe naming could make abuse at least obvious to spot? e.g.
> > >>>>>> pm_domain_attach_with_devm_release()
> > >>>>>
> > >>>>> If I'm not mistaken, it is not even necessary to use devres for this.
> > >>>>>
> > >>>>> You might as well add a dev_pm_domain_detach() call to
> > >>>>> device_unbind_cleanup() after devres_release_all(). There is a slight
> > >>>>> complication related to the second argument of it, but I suppose that
> > >>>>> this can be determined at the attach time and stored in a new device
> > >>>>> PM flag, or similar.
> > >>>>>
> > >>>>
> > >>>> I looked into this solution. I've tested it for all my failure cases and
> > >>>> went good.
> > >>>
> > >>> OK
> > >>>
> > >>>>> Note that dev->pm_domain is expected to be cleared by ->detach(), so
> > >>>>> this should not cause the domain to be detached twice in a row from
> > >>>>> the same device, but that needs to be double-checked.
> > >>>>
> > >>>> The genpd_dev_pm_detach() calls genpd_remove_device() ->
> > >>>> dev_pm_domain_set(dev, NULL) which sets the dev->pm_domain = NULL. I can't
> > >>>> find any other detach function in the current code base.
> > >>>
> > >>> There is also acpi_dev_pm_detach() which can be somewhat hard to find,
> > >>> but it calls dev_pm_domain_set(dev, NULL) either.
> > >>>
> > >>>> The code I've tested for this solution is this one:
> > >>>>
> > >>>> diff --git a/drivers/base/dd.c b/drivers/base/dd.c
> > >>>> index b526e0e0f52d..5e9750d007b4 100644
> > >>>> --- a/drivers/base/dd.c
> > >>>> +++ b/drivers/base/dd.c
> > >>>> @@ -25,6 +25,7 @@
> > >>>> #include <linux/kthread.h>
> > >>>> #include <linux/wait.h>
> > >>>> #include <linux/async.h>
> > >>>> +#include <linux/pm_domain.h>
> > >>>> #include <linux/pm_runtime.h>
> > >>>> #include <linux/pinctrl/devinfo.h>
> > >>>> #include <linux/slab.h>
> > >>>> @@ -552,8 +553,11 @@ static void device_unbind_cleanup(struct device *dev)
> > >>>> dev->dma_range_map = NULL;
> > >>>> device_set_driver(dev, NULL);
> > >>>> dev_set_drvdata(dev, NULL);
> > >>>> - if (dev->pm_domain && dev->pm_domain->dismiss)
> > >>>> - dev->pm_domain->dismiss(dev);
> > >>>> + if (dev->pm_domain) {
> > >>>> + if (dev->pm_domain->dismiss)
> > >>>> + dev->pm_domain->dismiss(dev);
> > >>>> + dev_pm_domain_detach(dev, dev->pm_domain->detach_power_off);
> > >>>
> > >>> I would do the "detach" before the "dismiss" to retain the current ordering.
> > >>
> > >> I applied on my local development branch all your suggestions except this
> > >> one because genpd_dev_pm_detach() as well as acpi_dev_pm_detach() set
> > >> dev->pm_domain = NULL.
> > >>
> > >> Due to this I would call first ->dismiss() then ->detach(), as initially
> > >> proposed. Please let me know if you consider it otherwise.
> > >
> > > This is a matter of adding one more dev->pm_domain check AFAICS, but OK.
> >
> > I don't know all the subtleties around this, my concern with adding one
> > more check on dev->pm_domain was that the
> > dev->pm_domain->dismiss() will never be called if the ->detach() function
> > will be called before ->dismiss() and it will set dev->pm_domain = NULL (as
> > it does today (though genpd_dev_pm_detach() and acpi_dev_pm_detach())).
> >
> > For platform drivers that used to call dev_pm_domain_detach() in platform
> > bus remove function, if I'm not wrong, the dev->pm_domain->dismiss() was
> > never called previously. If that is a valid scenario, the code proposed in
> > this thread will change the behavior for devices that have ->dismiss()
> > implemented.
>
> ->dismiss() and ->detach() are supposed to be mutually exclusive, so
> this should not be a problem either way and in practice so far the
> only user of ->dismiss() has been acpi_lpss_pm_domain which doesn't do
> ->detach() at all.
May I ask if you know if there remains any real good reasons to keep
the ->dismiss() callback around?
Can't acpi_lpss_pm_domain() convert to use the ->detach() callback
instead? Just thinking that it would be easier, but maybe it doesn't
work.
Kind regards
Uffe
^ permalink raw reply [flat|nested] 22+ messages in thread
* Re: [PATCH v3 1/2] PM: domains: Add devres variant for dev_pm_domain_attach()
2025-06-19 12:21 ` Ulf Hansson
@ 2025-06-21 11:15 ` Rafael J. Wysocki
0 siblings, 0 replies; 22+ messages in thread
From: Rafael J. Wysocki @ 2025-06-21 11:15 UTC (permalink / raw)
To: Ulf Hansson
Cc: Rafael J. Wysocki, Claudiu Beznea, Jonathan Cameron,
Dmitry Torokhov, gregkh, dakr, len.brown, pavel, daniel.lezcano,
linux-kernel, linux-pm, bhelgaas, geert, linux-iio,
linux-renesas-soc, fabrizio.castro.jz, Claudiu Beznea
On Thu, Jun 19, 2025 at 2:21 PM Ulf Hansson <ulf.hansson@linaro.org> wrote:
>
> On Mon, 16 Jun 2025 at 13:47, Rafael J. Wysocki <rafael@kernel.org> wrote:
> >
> > On Mon, Jun 16, 2025 at 1:37 PM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
> > >
> > >
> > >
> > > On 16.06.2025 14:18, Rafael J. Wysocki wrote:
> > > > On Mon, Jun 16, 2025 at 11:37 AM Claudiu Beznea
> > > > <claudiu.beznea@tuxon.dev> wrote:
> > > >>
> > > >> Hi, Rafael,
> > > >>
> > > >> On 13.06.2025 13:02, Rafael J. Wysocki wrote:
> > > >>> On Fri, Jun 13, 2025 at 9:39 AM Claudiu Beznea <claudiu.beznea@tuxon.dev> wrote:
> > > >>>>
> > > >>>> Hi, Rafael,
> > > >>>>
> > > >>>> On 09.06.2025 22:59, Rafael J. Wysocki wrote:
> > > >>>>> On Sat, Jun 7, 2025 at 3:06 PM Jonathan Cameron <jic23@kernel.org> wrote:
> > > >>>>>>
> > > >>>>>> On Fri, 6 Jun 2025 22:01:52 +0200
> > > >>>>>> "Rafael J. Wysocki" <rafael@kernel.org> wrote:
> > > >>>>>>
> > > >>>>>> Hi Rafael,
> > > >>>>>>
> > > >>>>>>> On Fri, Jun 6, 2025 at 8:55 PM Dmitry Torokhov
> > > >>>>>>> <dmitry.torokhov@gmail.com> wrote:
> > > >>>>>>>>
> > > >>>>>>>> On Fri, Jun 06, 2025 at 06:00:34PM +0200, Rafael J. Wysocki wrote:
> > > >>>>>>>>> On Fri, Jun 6, 2025 at 1:18 PM Claudiu <claudiu.beznea@tuxon.dev> wrote:
> > > >>>>>>>>>>
> > > >>>>>>>>>> From: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
> > > >>>>>>>>>>
> > > >>>>>>>>>> The dev_pm_domain_attach() function is typically used in bus code alongside
> > > >>>>>>>>>> dev_pm_domain_detach(), often following patterns like:
> > > >>>>>>>>>>
> > > >>>>>>>>>> static int bus_probe(struct device *_dev)
> > > >>>>>>>>>> {
> > > >>>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> > > >>>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
> > > >>>>>>>>>> int ret;
> > > >>>>>>>>>>
> > > >>>>>>>>>> // ...
> > > >>>>>>>>>>
> > > >>>>>>>>>> ret = dev_pm_domain_attach(_dev, true);
> > > >>>>>>>>>> if (ret)
> > > >>>>>>>>>> return ret;
> > > >>>>>>>>>>
> > > >>>>>>>>>> if (drv->probe)
> > > >>>>>>>>>> ret = drv->probe(dev);
> > > >>>>>>>>>>
> > > >>>>>>>>>> // ...
> > > >>>>>>>>>> }
> > > >>>>>>>>>>
> > > >>>>>>>>>> static void bus_remove(struct device *_dev)
> > > >>>>>>>>>> {
> > > >>>>>>>>>> struct bus_driver *drv = to_bus_driver(dev->driver);
> > > >>>>>>>>>> struct bus_device *dev = to_bus_device(_dev);
> > > >>>>>>>>>>
> > > >>>>>>>>>> if (drv->remove)
> > > >>>>>>>>>> drv->remove(dev);
> > > >>>>>>>>>> dev_pm_domain_detach(_dev);
> > > >>>>>>>>>> }
> > > >>>>>>>>>>
> > > >>>>>>>>>> When the driver's probe function uses devres-managed resources that depend
> > > >>>>>>>>>> on the power domain state, those resources are released later during
> > > >>>>>>>>>> device_unbind_cleanup().
> > > >>>>>>>>>>
> > > >>>>>>>>>> Releasing devres-managed resources that depend on the power domain state
> > > >>>>>>>>>> after detaching the device from its PM domain can cause failures.
> > > >>>>>>>>>>
> > > >>>>>>>>>> For example, if the driver uses devm_pm_runtime_enable() in its probe
> > > >>>>>>>>>> function, and the device's clocks are managed by the PM domain, then
> > > >>>>>>>>>> during removal the runtime PM is disabled in device_unbind_cleanup() after
> > > >>>>>>>>>> the clocks have been removed from the PM domain. It may happen that the
> > > >>>>>>>>>> devm_pm_runtime_enable() action causes the device to be runtime-resumed.
> > > >>>>>>>>>
> > > >>>>>>>>> Don't use devm_pm_runtime_enable() then.
> > > >>>>>>>>
> > > >>>>>>>> What about other devm_ APIs? Are you suggesting that platform drivers
> > > >>>>>>>> should not be using devm_clk*(), devm_regulator_*(),
> > > >>>>>>>> devm_request_*_irq() and devm_add_action_or_reset()? Because again,
> > > >>>>>>>> dev_pm_domain_detach() that is called by platform bus_remove() may shut
> > > >>>>>>>> off the device too early, before cleanup code has a chance to execute
> > > >>>>>>>> proper cleanup.
> > > >>>>>>>>
> > > >>>>>>>> The issue is not limited to runtime PM.
> > > >>>>>>>>
> > > >>>>>>>>>
> > > >>>>>>>>>> If the driver specific runtime PM APIs access registers directly, this
> > > >>>>>>>>>> will lead to accessing device registers without clocks being enabled.
> > > >>>>>>>>>> Similar issues may occur with other devres actions that access device
> > > >>>>>>>>>> registers.
> > > >>>>>>>>>>
> > > >>>>>>>>>> Add devm_pm_domain_attach(). When replacing the dev_pm_domain_attach() and
> > > >>>>>>>>>> dev_pm_domain_detach() in bus probe and bus remove, it ensures that the
> > > >>>>>>>>>> device is detached from its PM domain in device_unbind_cleanup(), only
> > > >>>>>>>>>> after all driver's devres-managed resources have been release.
> > > >>>>>>>>>>
> > > >>>>>>>>>> For flexibility, the implemented devm_pm_domain_attach() has 2 state
> > > >>>>>>>>>> arguments, one for the domain state on attach, one for the domain state on
> > > >>>>>>>>>> detach.
> > > >>>>>>>>>
> > > >>>>>>>>> dev_pm_domain_attach() is not part driver API and I'm not convinced at
> > > >>>>>>>>
> > > >>>>>>>> Is the concern that devm_pm_domain_attach() will be [ab]used by drivers?
> > > >>>>>>>
> > > >>>>>>> Yes, among other things.
> > > >>>>>>
> > > >>>>>> Maybe naming could make abuse at least obvious to spot? e.g.
> > > >>>>>> pm_domain_attach_with_devm_release()
> > > >>>>>
> > > >>>>> If I'm not mistaken, it is not even necessary to use devres for this.
> > > >>>>>
> > > >>>>> You might as well add a dev_pm_domain_detach() call to
> > > >>>>> device_unbind_cleanup() after devres_release_all(). There is a slight
> > > >>>>> complication related to the second argument of it, but I suppose that
> > > >>>>> this can be determined at the attach time and stored in a new device
> > > >>>>> PM flag, or similar.
> > > >>>>>
> > > >>>>
> > > >>>> I looked into this solution. I've tested it for all my failure cases and
> > > >>>> went good.
> > > >>>
> > > >>> OK
> > > >>>
> > > >>>>> Note that dev->pm_domain is expected to be cleared by ->detach(), so
> > > >>>>> this should not cause the domain to be detached twice in a row from
> > > >>>>> the same device, but that needs to be double-checked.
> > > >>>>
> > > >>>> The genpd_dev_pm_detach() calls genpd_remove_device() ->
> > > >>>> dev_pm_domain_set(dev, NULL) which sets the dev->pm_domain = NULL. I can't
> > > >>>> find any other detach function in the current code base.
> > > >>>
> > > >>> There is also acpi_dev_pm_detach() which can be somewhat hard to find,
> > > >>> but it calls dev_pm_domain_set(dev, NULL) either.
> > > >>>
> > > >>>> The code I've tested for this solution is this one:
> > > >>>>
> > > >>>> diff --git a/drivers/base/dd.c b/drivers/base/dd.c
> > > >>>> index b526e0e0f52d..5e9750d007b4 100644
> > > >>>> --- a/drivers/base/dd.c
> > > >>>> +++ b/drivers/base/dd.c
> > > >>>> @@ -25,6 +25,7 @@
> > > >>>> #include <linux/kthread.h>
> > > >>>> #include <linux/wait.h>
> > > >>>> #include <linux/async.h>
> > > >>>> +#include <linux/pm_domain.h>
> > > >>>> #include <linux/pm_runtime.h>
> > > >>>> #include <linux/pinctrl/devinfo.h>
> > > >>>> #include <linux/slab.h>
> > > >>>> @@ -552,8 +553,11 @@ static void device_unbind_cleanup(struct device *dev)
> > > >>>> dev->dma_range_map = NULL;
> > > >>>> device_set_driver(dev, NULL);
> > > >>>> dev_set_drvdata(dev, NULL);
> > > >>>> - if (dev->pm_domain && dev->pm_domain->dismiss)
> > > >>>> - dev->pm_domain->dismiss(dev);
> > > >>>> + if (dev->pm_domain) {
> > > >>>> + if (dev->pm_domain->dismiss)
> > > >>>> + dev->pm_domain->dismiss(dev);
> > > >>>> + dev_pm_domain_detach(dev, dev->pm_domain->detach_power_off);
> > > >>>
> > > >>> I would do the "detach" before the "dismiss" to retain the current ordering.
> > > >>
> > > >> I applied on my local development branch all your suggestions except this
> > > >> one because genpd_dev_pm_detach() as well as acpi_dev_pm_detach() set
> > > >> dev->pm_domain = NULL.
> > > >>
> > > >> Due to this I would call first ->dismiss() then ->detach(), as initially
> > > >> proposed. Please let me know if you consider it otherwise.
> > > >
> > > > This is a matter of adding one more dev->pm_domain check AFAICS, but OK.
> > >
> > > I don't know all the subtleties around this, my concern with adding one
> > > more check on dev->pm_domain was that the
> > > dev->pm_domain->dismiss() will never be called if the ->detach() function
> > > will be called before ->dismiss() and it will set dev->pm_domain = NULL (as
> > > it does today (though genpd_dev_pm_detach() and acpi_dev_pm_detach())).
> > >
> > > For platform drivers that used to call dev_pm_domain_detach() in platform
> > > bus remove function, if I'm not wrong, the dev->pm_domain->dismiss() was
> > > never called previously. If that is a valid scenario, the code proposed in
> > > this thread will change the behavior for devices that have ->dismiss()
> > > implemented.
> >
> > ->dismiss() and ->detach() are supposed to be mutually exclusive, so
> > this should not be a problem either way and in practice so far the
> > only user of ->dismiss() has been acpi_lpss_pm_domain which doesn't do
> > ->detach() at all.
>
> May I ask if you know if there remains any real good reasons to keep
> the ->dismiss() callback around?
>
> Can't acpi_lpss_pm_domain() convert to use the ->detach() callback
> instead? Just thinking that it would be easier, but maybe it doesn't
> work.
It will, but let's just not make too many changes in one go.
^ permalink raw reply [flat|nested] 22+ messages in thread