From: Huang Ying <ying.huang@intel.com>
To: Mike Qiu <qiudayu@linux.vnet.ibm.com>
Cc: "Rafael J. Wysocki" <rjw@rjwysocki.net>,
Alan Stern <stern@rowland.harvard.edu>,
Bjorn Helgaas <bhelgaas@google.com>,
"linux-pci@vger.kernel.org" <linux-pci@vger.kernel.org>,
linux-pm@vger.kernel.org
Subject: Re: A question about the patch: [PATCH] PCI/PM: Keep runtime PM enabled for unbound PCI devices
Date: Wed, 27 Nov 2013 14:31:32 +0800 [thread overview]
Message-ID: <1385533892.9224.35.camel@yhuang-dev> (raw)
In-Reply-To: <52958403.4040201@linux.vnet.ibm.com>
On Wed, 2013-11-27 at 13:32 +0800, Mike Qiu wrote:
> On 11/27/2013 04:32 AM, Rafael J. Wysocki wrote:
> > On Tuesday, November 26, 2013 01:41:13 PM Mike Qiu wrote:
> >> On 11/14/2013 04:54 PM, Huang Ying wrote:
> >>> On Thu, 2013-11-14 at 16:37 +0800, mike wrote:
> >>>> On 11/14/2013 04:25 PM, Huang Ying wrote:
> >>>>> On Thu, 2013-11-14 at 16:12 +0800, mike wrote:
> >>>>>> On 11/14/2013 03:53 PM, Huang Ying wrote:
> >>>>>>> On Thu, 2013-11-14 at 15:19 +0800, mike wrote:
> >>>>>>>> On 11/14/2013 01:59 PM, Huang Ying wrote:
> >>>>>>>>> On Thu, 2013-11-14 at 11:23 +0800, mike wrote:
> >>>>>>>>>> On 11/14/2013 03:20 AM, Alan Stern wrote:
> >>>>>>>>>>> On Wed, 13 Nov 2013, Bjorn Helgaas wrote:
> >>>>>>>>>>>
> >>>>>>>>>>>> [+cc Rafael, linux-pm]
> >>>>>>>>>>>>
> >>>>>>>>>>>> On Wed, Nov 13, 2013 at 6:09 AM, mike<qiudayu@linux.vnet.ibm.com> wrote:
> >>>>>>>>>>>>> Hi Huang Ying,
> >>>>>>>>>>>>>
> >>>>>>>>>>>>> I see you are the author of this patch, commit id is:
> >>>>>>>>>>>>> 967577b062417b4e4b8e27b711220f4124f5153a
> >>>>>>>>>>>>>
> >>>>>>>>>>>>> I have a question while I try to understand this patch,
> >>>>>>>>>>>>> So I would very grateful if you or others can give me some reply.....
> >>>>>>>>>>>>>
> >>>>>>>>>>>>> ............
> >>>>>>>>>>>>> - rc = ddi->drv->probe(ddi->dev, ddi->id);
> >>>>>>>>>>>>> + pm_runtime_get_sync(dev);
> >>>>>>>>>>>>> + pci_dev->driver = pci_drv;
> >>>>>>>>>>>>> ^^^^^^^^^^^^^^^^^^^^^^^^^^
> >>>>>>>>>>>>> I see here you make the driver to initialize before probe,
> >>>>>>>>>>>>> But I have no idea of why you do this change.....
> >>>>>>>>>>>>>
> >>>>>>>>>>>>> and I look inside the code, it may be pm_runtime relate??
> >>>>>>>>>>> Yes, it is related to runtime PM. In the PCI subsystem, runtime PM
> >>>>>>>>>>> doesn't do anything unless pci_dev->driver is set. You can see this at
> >>>>>>>>>>> the start of pci_pm_runtime_suspend().
> >>>>>>>>>>>
> >>>>>>>>>>> Since we want the driver's probe routine to be able to carry out
> >>>>>>>>>>> runtime PM operations, we have to set pci_dev->driver before the probe
> >>>>>>>>>>> routine runs.
> >>>>>>>>>> Is there any situations , like in probe state, pci_dev->driver
> >>>>>>>>>> has been set. the pci_pm_runtime_xxx() has passed
> >>>>>>>>>> pci_dev->driver NULL check, but at this point, probe fail
> >>>>>>>>>> occurs, and pci_dev->driver to be set to NULL.
> >>>>>>>>>>
> >>>>>>>>>> What will happen ? Or this situation will never happen?
> >>>>>>>>>> I'm confuse about this.
> >>>>>>>>> I think that will never happen. Before ->probe(), pm_runtime_get_sync()
> >>>>>>>>> is called, so pci_pm_runtime_xxx() will not be called until
> >>>>>>>>> pm_runtime_put_noidle() is called in ->probe(). And
> >>>>>>>>> should be done as one of the latest actions in
> >>>>>>>>> ->probe(), after the normal probe actions succeeded.
> >>>>>>>> OK, just as your description, it seems OK.
> >>>>>>>> But this is really a issue as I explained in last email.
> >>>>>>>>
> >>>>>>>> So I want to know if there are any side-effect of changing the code
> >>>>>>>> in pci_pm_runtime_xxx()
> >>>>>>>>
> >>>>>>>> if (!pci_dev->driver)
> >>>>>>>> return 0;
> >>>>>>>> to
> >>>>>>>>
> >>>>>>>> if (!dev->driver)
> >>>>>>>> return 0;
> >>>>>>>>
> >>>>>>> If you make this change, we can not put devices into low power state
> >>>>>>> (runtime suspend the device) in ->probe(). That is expected in some
> >>>>>> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
> >>>>>> This means dev->driver is NULL ?? but pci_dev->driver is set???
> >>>>>>
> >>>>>> Because if use pci_dev->driver can put into low power state, means
> >>>>>>
> >>>>>> pci_dev->driver is set, but in the situation, use dev->driver will can't,
> >>>>>>
> >>>>>> means dev->driver = null, but I have not find any case that
> >>>>>>
> >>>>>> dev->driver = null, but pci_dev->driver != null;
> >>>>> Sorry I make a mistake here. The dev->driver != null in
> >>>>> local_pci_probe(). We use pci_dev->driver instead of dev->driver in
> >>>>> pci_pm_runtime_xxx() because we want device to be kept in normal power
> >>>>> state (D0) and SUSPENDED state when unbound.The
> >>>>> pm_runtime_put/get_sync in pci_device_remove/local_pci_probe will not
> >>>>> change the power state of the device because of the check in
> >>>>> pci_pm_runtime_xxx().
> >>>> Yes, you are right, but what am I confuse is that, why check dev->driver
> >>>> in pci_pm_runtime_xxx() can't keep the device in normal power
> >>>> state (D0) and SUSPENDED state when unbound.
> >>>>
> >>>> May be logic issue ?
> >>> Because dev->driver is set before local_pci_probe() and cleared after
> >>> pci_device_remove(). But we need a flag to be changed in
> >>> local_pci_probe() and pci_device_remove().
> >> Hi Ying,
> >>
> >> I'm now face one bug, and the root cause is this logic has some problem.
> >>
> >> The other component calls the ops in driver during probe state, which a
> >> lot of critical data struct haven't been setup yet.
> >>
> >> This never happen in old logic, because dev->driver is unset in probe
> >> state, it can check dev->driver to see if the device diver can work. But
> >> for new logic it is really a big issue.
> > What is the other component and why is it doing that?
>
> Some component like EEH in Power arch, it need to check whether the
> driver is work or not.
>
> In old logic, if probed then dev->driver set, otherwise it will be NULL,
> it is safe to do so.
>
> But in new, it has problem, it can call the driver API, which is very
> dangerous in probe state, maybe a lot key data structure haven't been
> setup yet, this lead to the kernel down and machine reboot. Also this
> can be fixed in driver, like check the driver data it self, this
> solution needs all the driver fix this issue, It may be a huge program.
>
> So we need a new flag I think, or which old flag can we use to solve
> this issue ?
I think a flag is not safe for you. Driver may be removed when you
operate on it. Better to use device_lock() if possible, which will be
held during device probe and driver remove.
Best Regards,
Huang Ying
> Thanks
> Mike
> >
> > Checking dev->driver may not be a correct way to address this issue anyway.
>
> >
> > Thanks!
> >
>
next prev parent reply other threads:[~2013-11-27 6:31 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-11-13 13:09 A question about the patch: [PATCH] PCI/PM: Keep runtime PM enabled for unbound PCI devices mike
2013-11-13 16:47 ` Bjorn Helgaas
2013-11-13 19:20 ` Alan Stern
2013-11-14 3:23 ` mike
2013-11-14 5:59 ` Huang Ying
2013-11-14 7:19 ` mike
2013-11-14 7:53 ` Huang Ying
2013-11-14 8:12 ` mike
2013-11-14 8:25 ` Huang Ying
2013-11-14 8:37 ` mike
2013-11-14 8:54 ` Huang Ying
2013-11-26 5:41 ` Mike Qiu
2013-11-26 20:32 ` Rafael J. Wysocki
2013-11-27 5:32 ` Mike Qiu
2013-11-27 6:31 ` Huang Ying [this message]
2013-11-27 14:15 ` Rafael J. Wysocki
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1385533892.9224.35.camel@yhuang-dev \
--to=ying.huang@intel.com \
--cc=bhelgaas@google.com \
--cc=linux-pci@vger.kernel.org \
--cc=linux-pm@vger.kernel.org \
--cc=qiudayu@linux.vnet.ibm.com \
--cc=rjw@rjwysocki.net \
--cc=stern@rowland.harvard.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).