public inbox for linux-pm@vger.kernel.org
 help / color / mirror / Atom feed
From: Brian Norris <briannorris@chromium.org>
To: "Rafael J. Wysocki" <rafael@kernel.org>
Cc: "Bjorn Helgaas" <bhelgaas@google.com>,
	linux-kernel@vger.kernel.org, linux-pci@vger.kernel.org,
	"Lukas Wunner" <lukas@wunner.de>,
	linux-pm@vger.kernel.org,
	"Marek Szyprowski" <m.szyprowski@samsung.com>,
	"Ilpo Järvinen" <ilpo.jarvinen@linux.intel.com>
Subject: Re: [PATCH v5] PCI/PM: Prevent runtime suspend until devices are fully initialized
Date: Thu, 22 Jan 2026 10:29:50 -0800	[thread overview]
Message-ID: <aXJsnh4HeiQ2oOBl@google.com> (raw)
In-Reply-To: <CAJZ5v0iBNOmMtqfqEbrYyuK2u+2J2+zZ-iQd1FvyCPjdvU2TJg@mail.gmail.com>

On Thu, Jan 22, 2026 at 07:17:21PM +0100, Rafael J. Wysocki wrote:
> On Thu, Jan 22, 2026 at 6:49 PM Brian Norris <briannorris@chromium.org> wrote:
> >
> > Previously, it was possible for a PCI device to be runtime-suspended before
> > it was fully initialized. When that happened, the suspend process could
> > save invalid device state, for example, before BAR assignment. Restoring
> > the invalid state during resume may leave the device non-functional.
> >
> > Prevent runtime suspend for PCI devices until they are fully initialized by
> > deferring pm_runtime_enable().
> >
> > More details on how exactly this may occur:
> >
> >   1. PCI device is created by pci_scan_slot() or similar
> >
> >   2. As part of pci_scan_slot(), pci_pm_init() puts the device in D0 and
> >      prevents runtime suspend prevented via pm_runtime_forbid()
> >
> >   3. pci_device_add() adds the underlying 'struct device' via device_add(),
> >      which means user space can allow runtime suspend, e.g.,
> >
> >        echo auto > /sys/bus/pci/devices/.../power/control
> >
> >   4. PCI device receives BAR configuration
> >      (pci_assign_unassigned_bus_resources(), etc.)
> >
> >   5. pci_bus_add_device() applies final fixups, saves device state, and
> >      tries to attach a driver
> >
> > The device may potentially be suspended between #3 and #5, so this is racy
> > with user space (udev or similar).
> >
> > Many PCI devices are enumerated at subsys_initcall time and so will not
> > race with user space, but devices created later by hotplug or modular
> > pwrctrl or host controller drivers are susceptible to this race.
> >
> > More runtime PM details at the first Link: below.
> >
> > Signed-off-by: Brian Norris <briannorris@chromium.org>
> > Tested-by: Marek Szyprowski <m.szyprowski@samsung.com>
> > Cc: stable@vger.kernel.org
> > Link: https://lore.kernel.org/all/20251016155335.1.I60a53c170a8596661883bd2b4ef475155c7aa72b@changeid/
> > Link: https://lore.kernel.org/all/0e35a4e1-894a-47c1-9528-fc5ffbafd9e2@samsung.com/
> >
> > Link: https://lore.kernel.org/all/20251016155335.1.I60a53c170a8596661883bd2b4ef475155c7aa72b@changeid/
> > Cc: <stable@vger.kernel.org>
> > ---
> >
> > Changes in v5:
> >  * Put pm_runtime_set_active() back where it was, to ensure our parent
> >    can't suspend before we're really ready. (See bug report in 2nd
> >    "Link:")
> >  * Add comments
> >  * Update commit description with Bjorn's rewrite
> >  * Add Marek's Tested-by
> >
> > Changes in v4:
> >  * Move pm_runtime_set_active() too
> >
> > Changes in v3:
> >  * Add Link to initial discussion
> >  * Add Rafael's Reviewed-by
> >  * Add lengthier footnotes about forbid vs allow vs sysfs
> >
> > Changes in v2:
> >  * Update CC list
> >  * Rework problem description
> >  * Update solution: defer pm_runtime_enable(), instead of trying to
> >    get()/put()
> >
> >  drivers/pci/bus.c | 7 +++++++
> >  drivers/pci/pci.c | 5 ++++-
> >  2 files changed, 11 insertions(+), 1 deletion(-)
> >
> > diff --git a/drivers/pci/bus.c b/drivers/pci/bus.c
> > index 4383a36fd6ca..90954f81962b 100644
> > --- a/drivers/pci/bus.c
> > +++ b/drivers/pci/bus.c
> > @@ -15,6 +15,7 @@
> >  #include <linux/of.h>
> >  #include <linux/of_platform.h>
> >  #include <linux/platform_device.h>
> > +#include <linux/pm_runtime.h>
> >  #include <linux/proc_fs.h>
> >  #include <linux/slab.h>
> >
> > @@ -379,6 +380,12 @@ void pci_bus_add_device(struct pci_dev *dev)
> >                 put_device(&pdev->dev);
> >         }
> >
> > +       /*
> > +        * Enable runtime PM (and potentially suspend) only after we've fully
> > +        * configured the PCI state.
> > +        */
> 
> I would make it a bit more precise, something like "Enable runtime PM,
> which potentially allows the device to suspend immediately, only after
> the PCI state has been configured completely."
> 
> Also, it is not particularly what "we" means in kernel code comments,
> so I generally avoid phrasing them this way.

Well, you may disagree with plenty of other people :)

$ git grep -ioh '\<we\>' ./drivers/pci | wc -w
943

Your suggestion also means using passive voice, which I sometimes
attempt to avoid.

But otherwise, looks fine to me.

> > +       pm_runtime_enable(&dev->dev);
> > +
> >         if (!dn || of_device_is_available(dn))
> >                 pci_dev_allow_binding(dev);
> >
> > diff --git a/drivers/pci/pci.c b/drivers/pci/pci.c
> > index 13dbb405dc31..07b0d029aa51 100644
> > --- a/drivers/pci/pci.c
> > +++ b/drivers/pci/pci.c
> > @@ -3196,8 +3196,11 @@ void pci_pm_init(struct pci_dev *dev)
> >  poweron:
> >         pci_pm_power_up_and_verify_state(dev);
> >         pm_runtime_forbid(&dev->dev);
> > +       /*
> > +        * Mark ourselves active now, to prevent our parent from suspending
> > +        * while we finish configuring the PCI device.
> > +        */
> 
> I would rephrase the comment this way:
> 
> "Runtime PM will be enabled for the device when it has been fully
> configured, but since its parent and suppliers may suspend in the
> meantime, prevent them from doing so by changing the device's runtime
> PM status to "active"."

Sure. That's going on the more verbose end, like I suggested a bit ago.
I'm fine with more verbose, if it helps people.

I see Bjorn already applied this, so it seems he's happy enough. He also
spent a lot of time on making earlier writing less verbose, so I'll
defer to his opinions. I can spin a v6, or Bjorn can squash in his own
update, or he can choose to leave it as-is.

Brian

> >         pm_runtime_set_active(&dev->dev);
> > -       pm_runtime_enable(&dev->dev);
> >  }
> >
> >  static unsigned long pci_ea_flags(struct pci_dev *dev, u8 prop)
> > --

  reply	other threads:[~2026-01-22 18:29 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-01-22 17:48 [PATCH v5] PCI/PM: Prevent runtime suspend until devices are fully initialized Brian Norris
2026-01-22 18:17 ` Rafael J. Wysocki
2026-01-22 18:29   ` Brian Norris [this message]
2026-01-22 18:59     ` Rafael J. Wysocki
2026-01-22 19:06     ` Bjorn Helgaas
2026-01-22 18:21 ` Bjorn Helgaas

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aXJsnh4HeiQ2oOBl@google.com \
    --to=briannorris@chromium.org \
    --cc=bhelgaas@google.com \
    --cc=ilpo.jarvinen@linux.intel.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-pci@vger.kernel.org \
    --cc=linux-pm@vger.kernel.org \
    --cc=lukas@wunner.de \
    --cc=m.szyprowski@samsung.com \
    --cc=rafael@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox