* [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support
@ 2011-10-05 11:34 Ajaykumar Hotchandani
2011-10-05 19:41 ` Rafael J. Wysocki
2011-10-06 10:39 ` Stefano Stabellini
0 siblings, 2 replies; 14+ messages in thread
From: Ajaykumar Hotchandani @ 2011-10-05 11:34 UTC (permalink / raw)
To: Jesse Barnes, linux-pci, linux-kernel
During test of one IB card with guest VM, found that, msi is not initialized properly.
It turns out __write_msi_msg will do nothing if device current_state is not PCI_D0.
And, that pci device does not have pm_cap in guest VM.
There is an error in setting of power state to PCI_D0 in pci_enable_device(), but error is not returned for this.
Following is code flow:
pci_enable_device() --> __pci_enable_device_flags() --> do_pci_enable_device() --> pci_set_power_state() --> __pci_start_power_transition()
We have following condition inside __pci_start_power_transition():
if (platform_pci_power_manageable(dev)) {
error = platform_pci_set_power_state(dev, state);
if (!error)
pci_update_current_state(dev, state);
} else {
error = -ENODEV;
/* Fall back to PCI_D0 if native PM is not supported */
if (!dev->pm_cap)
dev->current_state = PCI_D0;
}
Here, from platform_pci_set_power_state(), acpi_pci_set_power_state() is getting called and that is failing with ENODEV because of following condition:
if (!handle || ACPI_SUCCESS(acpi_get_handle(handle, "_EJ0",&tmp)))
return -ENODEV;
Because of that, pci_update_current_state() is not getting called.
With this patch, if device power state can not be set via platform_pci_set_power_state and that device does not have native pm support, then PCI device power state will be set to PCI_D0.
Signed-off-by: Ajaykumar Hotchandani<ajaykumar.hotchandani@oracle.com>
Signed-off-by: Yinghai Lu<yinghai.lu@oracle.com>
---
drivers/pci/pci.c | 3 +++
1 files changed, 3 insertions(+), 0 deletions(-)
diff --git a/drivers/pci/pci.c b/drivers/pci/pci.c
index e9651f0..ca8c82d 100644
--- a/drivers/pci/pci.c
+++ b/drivers/pci/pci.c
@@ -664,6 +664,9 @@ static int pci_platform_power_transition(struct pci_dev *dev, pci_power_t state)
error = platform_pci_set_power_state(dev, state);
if (!error)
pci_update_current_state(dev, state);
+ /* Fall back to PCI_D0 if native PM is not supported */
+ else if (!dev->pm_cap)
+ dev->current_state = PCI_D0;
} else {
error = -ENODEV;
/* Fall back to PCI_D0 if native PM is not supported */
--
1.7.5.1
^ permalink raw reply related [flat|nested] 14+ messages in thread* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-05 11:34 [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support Ajaykumar Hotchandani @ 2011-10-05 19:41 ` Rafael J. Wysocki 2011-10-06 16:10 ` Yinghai Lu 2011-10-06 10:39 ` Stefano Stabellini 1 sibling, 1 reply; 14+ messages in thread From: Rafael J. Wysocki @ 2011-10-05 19:41 UTC (permalink / raw) To: Ajaykumar Hotchandani; +Cc: Jesse Barnes, linux-pci, linux-kernel Hi, On Wednesday, October 05, 2011, Ajaykumar Hotchandani wrote: > During test of one IB card with guest VM, found that, msi is not initialized properly. > > It turns out __write_msi_msg will do nothing if device current_state is not PCI_D0. > And, that pci device does not have pm_cap in guest VM. > > There is an error in setting of power state to PCI_D0 in pci_enable_device(), but error is not returned for this. > Following is code flow: > pci_enable_device() --> __pci_enable_device_flags() --> do_pci_enable_device() --> pci_set_power_state() --> __pci_start_power_transition() > We have following condition inside __pci_start_power_transition(): > if (platform_pci_power_manageable(dev)) { > error = platform_pci_set_power_state(dev, state); > if (!error) > pci_update_current_state(dev, state); > } else { > error = -ENODEV; > /* Fall back to PCI_D0 if native PM is not supported */ > if (!dev->pm_cap) > dev->current_state = PCI_D0; > } > > Here, from platform_pci_set_power_state(), acpi_pci_set_power_state() is getting called and that is failing with ENODEV because of following condition: > if (!handle || ACPI_SUCCESS(acpi_get_handle(handle, "_EJ0",&tmp))) > return -ENODEV; > > Because of that, pci_update_current_state() is not getting called. > > With this patch, if device power state can not be set via platform_pci_set_power_state and that device does not have native pm support, then PCI device power state will be set to PCI_D0. > > Signed-off-by: Ajaykumar Hotchandani<ajaykumar.hotchandani@oracle.com> > Signed-off-by: Yinghai Lu<yinghai.lu@oracle.com> > --- > drivers/pci/pci.c | 3 +++ > 1 files changed, 3 insertions(+), 0 deletions(-) > > diff --git a/drivers/pci/pci.c b/drivers/pci/pci.c > index e9651f0..ca8c82d 100644 > --- a/drivers/pci/pci.c > +++ b/drivers/pci/pci.c > @@ -664,6 +664,9 @@ static int pci_platform_power_transition(struct pci_dev *dev, pci_power_t state) > error = platform_pci_set_power_state(dev, state); > if (!error) > pci_update_current_state(dev, state); > + /* Fall back to PCI_D0 if native PM is not supported */ > + else if (!dev->pm_cap) > + dev->current_state = PCI_D0; > } else { > error = -ENODEV; > /* Fall back to PCI_D0 if native PM is not supported */ > I have some vague memories that we tried that and it broke something. How thoroughly has it been tested? Rafael ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-05 19:41 ` Rafael J. Wysocki @ 2011-10-06 16:10 ` Yinghai Lu 2011-10-06 17:20 ` Rafael J. Wysocki 0 siblings, 1 reply; 14+ messages in thread From: Yinghai Lu @ 2011-10-06 16:10 UTC (permalink / raw) To: Rafael J. Wysocki Cc: Ajaykumar Hotchandani, Jesse Barnes, linux-pci, linux-kernel On Wed, Oct 5, 2011 at 12:41 PM, Rafael J. Wysocki <rjw@sisk.pl> wrote: > Hi, >> drivers/pci/pci.c | 3 +++ >> 1 files changed, 3 insertions(+), 0 deletions(-) >> >> diff --git a/drivers/pci/pci.c b/drivers/pci/pci.c >> index e9651f0..ca8c82d 100644 >> --- a/drivers/pci/pci.c >> +++ b/drivers/pci/pci.c >> @@ -664,6 +664,9 @@ static int pci_platform_power_transition(struct pci_dev *dev, pci_power_t state) >> error = platform_pci_set_power_state(dev, state); >> if (!error) >> pci_update_current_state(dev, state); >> + /* Fall back to PCI_D0 if native PM is not supported */ >> + else if (!dev->pm_cap) >> + dev->current_state = PCI_D0; >> } else { >> error = -ENODEV; >> /* Fall back to PCI_D0 if native PM is not supported */ >> > > I have some vague memories that we tried that and it broke something. Do have any hint for us to dig out the configuration for failing? > > How thoroughly has it been tested? tested in setups with intel cpus from Nehalem. Yinghai ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-06 16:10 ` Yinghai Lu @ 2011-10-06 17:20 ` Rafael J. Wysocki 0 siblings, 0 replies; 14+ messages in thread From: Rafael J. Wysocki @ 2011-10-06 17:20 UTC (permalink / raw) To: Yinghai Lu; +Cc: Ajaykumar Hotchandani, Jesse Barnes, linux-pci, linux-kernel On Thursday, October 06, 2011, Yinghai Lu wrote: > On Wed, Oct 5, 2011 at 12:41 PM, Rafael J. Wysocki <rjw@sisk.pl> wrote: > > Hi, > >> drivers/pci/pci.c | 3 +++ > >> 1 files changed, 3 insertions(+), 0 deletions(-) > >> > >> diff --git a/drivers/pci/pci.c b/drivers/pci/pci.c > >> index e9651f0..ca8c82d 100644 > >> --- a/drivers/pci/pci.c > >> +++ b/drivers/pci/pci.c > >> @@ -664,6 +664,9 @@ static int pci_platform_power_transition(struct pci_dev *dev, pci_power_t state) > >> error = platform_pci_set_power_state(dev, state); > >> if (!error) > >> pci_update_current_state(dev, state); > >> + /* Fall back to PCI_D0 if native PM is not supported */ > >> + else if (!dev->pm_cap) > >> + dev->current_state = PCI_D0; > >> } else { > >> error = -ENODEV; > >> /* Fall back to PCI_D0 if native PM is not supported */ > >> > > > > I have some vague memories that we tried that and it broke something. > > Do have any hint for us to dig out the configuration for failing? > > > > > How thoroughly has it been tested? > > tested in setups with intel cpus from Nehalem. Can you test it on older hardware too, please? Rafael ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-05 11:34 [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support Ajaykumar Hotchandani 2011-10-05 19:41 ` Rafael J. Wysocki @ 2011-10-06 10:39 ` Stefano Stabellini 2011-10-06 16:05 ` Yinghai Lu 1 sibling, 1 reply; 14+ messages in thread From: Stefano Stabellini @ 2011-10-06 10:39 UTC (permalink / raw) To: Ajaykumar Hotchandani Cc: Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org On Wed, 5 Oct 2011, Ajaykumar Hotchandani wrote: > During test of one IB card with guest VM, found that, msi is not initialized properly. > > It turns out __write_msi_msg will do nothing if device current_state is not PCI_D0. > And, that pci device does not have pm_cap in guest VM. > > There is an error in setting of power state to PCI_D0 in pci_enable_device(), but error is not returned for this. > Following is code flow: > pci_enable_device() --> __pci_enable_device_flags() --> do_pci_enable_device() --> pci_set_power_state() --> __pci_start_power_transition() > We have following condition inside __pci_start_power_transition(): > if (platform_pci_power_manageable(dev)) { > error = platform_pci_set_power_state(dev, state); > if (!error) > pci_update_current_state(dev, state); > } else { > error = -ENODEV; > /* Fall back to PCI_D0 if native PM is not supported */ > if (!dev->pm_cap) > dev->current_state = PCI_D0; > } > > Here, from platform_pci_set_power_state(), acpi_pci_set_power_state() is getting called and that is failing with ENODEV because of following condition: > if (!handle || ACPI_SUCCESS(acpi_get_handle(handle, "_EJ0",&tmp))) > return -ENODEV; > > Because of that, pci_update_current_state() is not getting called. > > With this patch, if device power state can not be set via platform_pci_set_power_state and that device does not have native pm support, then PCI device power state will be set to PCI_D0. > > Signed-off-by: Ajaykumar Hotchandani<ajaykumar.hotchandani@oracle.com> > Signed-off-by: Yinghai Lu<yinghai.lu@oracle.com> I had the same issue and sent a patch a while ago to fix it, adding current_state = PCI_D0 in acpiphp_glue.c:register_slot it is strange that this does not work for you: http://marc.info/?l=linux-kernel&m=129891002722845&w=2 ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-06 10:39 ` Stefano Stabellini @ 2011-10-06 16:05 ` Yinghai Lu 2011-10-06 16:17 ` Stefano Stabellini 0 siblings, 1 reply; 14+ messages in thread From: Yinghai Lu @ 2011-10-06 16:05 UTC (permalink / raw) To: Stefano Stabellini Cc: Ajaykumar Hotchandani, Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org On Thu, Oct 6, 2011 at 3:39 AM, Stefano Stabellini <stefano.stabellini@eu.citrix.com> wrote: > > I had the same issue and sent a patch a while ago to fix it, adding > > current_state = PCI_D0 in acpiphp_glue.c:register_slot > > it is strange that this does not work for you: > > http://marc.info/?l=linux-kernel&m=129891002722845&w=2 So guest os has to load acpiphp instead of pciehp? Yinghai ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-06 16:05 ` Yinghai Lu @ 2011-10-06 16:17 ` Stefano Stabellini 2011-10-10 12:02 ` Ajaykumar Hotchandani 0 siblings, 1 reply; 14+ messages in thread From: Stefano Stabellini @ 2011-10-06 16:17 UTC (permalink / raw) To: Yinghai Lu Cc: Stefano Stabellini, Ajaykumar Hotchandani, Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org On Thu, 6 Oct 2011, Yinghai Lu wrote: > On Thu, Oct 6, 2011 at 3:39 AM, Stefano Stabellini > <stefano.stabellini@eu.citrix.com> wrote: > > > > I had the same issue and sent a patch a while ago to fix it, adding > > > > current_state = PCI_D0 in acpiphp_glue.c:register_slot > > > > it is strange that this does not work for you: > > > > http://marc.info/?l=linux-kernel&m=129891002722845&w=2 > > So guest os has to load acpiphp instead of pciehp? maybe pciehp needs to make sure that current_state = D0 in pciehp_enable_slot, like acpiphp does ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-06 16:17 ` Stefano Stabellini @ 2011-10-10 12:02 ` Ajaykumar Hotchandani 2011-10-13 14:30 ` Stefano Stabellini 0 siblings, 1 reply; 14+ messages in thread From: Ajaykumar Hotchandani @ 2011-10-10 12:02 UTC (permalink / raw) To: Stefano Stabellini Cc: Yinghai Lu, Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org On 10/06/2011 09:47 PM, Stefano Stabellini wrote: > On Thu, 6 Oct 2011, Yinghai Lu wrote: >> On Thu, Oct 6, 2011 at 3:39 AM, Stefano Stabellini >> <stefano.stabellini@eu.citrix.com> wrote: >>> I had the same issue and sent a patch a while ago to fix it, adding >>> >>> current_state = PCI_D0 in acpiphp_glue.c:register_slot >>> >>> it is strange that this does not work for you: >>> >>> http://marc.info/?l=linux-kernel&m=129891002722845&w=2 >> So guest os has to load acpiphp instead of pciehp? > > maybe pciehp needs to make sure that current_state = D0 in > pciehp_enable_slot, like acpiphp does Here, acpi hotplugging is involved. With your change in register_slot(), device will have proper power state when module is being loaded for the first time after booting. However, while unload of pci module; following is in pci_device_remove(): if (pci_dev->current_state == PCI_D0) pci_dev->current_state = PCI_UNKNOWN; So, device power state state will remain PCI_UNKNOWN while module is loaded again. Subsequently, MSI write will do nothing. Thanks, Ajay ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-10 12:02 ` Ajaykumar Hotchandani @ 2011-10-13 14:30 ` Stefano Stabellini 2011-10-18 13:52 ` Ajaykumar Hotchandani 0 siblings, 1 reply; 14+ messages in thread From: Stefano Stabellini @ 2011-10-13 14:30 UTC (permalink / raw) To: Ajaykumar Hotchandani Cc: Stefano Stabellini, Yinghai Lu, Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org On Mon, 10 Oct 2011, Ajaykumar Hotchandani wrote: > On 10/06/2011 09:47 PM, Stefano Stabellini wrote: > > On Thu, 6 Oct 2011, Yinghai Lu wrote: > >> On Thu, Oct 6, 2011 at 3:39 AM, Stefano Stabellini > >> <stefano.stabellini@eu.citrix.com> wrote: > >>> I had the same issue and sent a patch a while ago to fix it, adding > >>> > >>> current_state = PCI_D0 in acpiphp_glue.c:register_slot > >>> > >>> it is strange that this does not work for you: > >>> > >>> http://marc.info/?l=linux-kernel&m=129891002722845&w=2 > >> So guest os has to load acpiphp instead of pciehp? > > > > maybe pciehp needs to make sure that current_state = D0 in > > pciehp_enable_slot, like acpiphp does > Here, acpi hotplugging is involved. > With your change in register_slot(), device will have proper power state when module is being loaded for the first time after booting. > However, while unload of pci module; following is in pci_device_remove(): > if (pci_dev->current_state == PCI_D0) > pci_dev->current_state = PCI_UNKNOWN; > > So, device power state state will remain PCI_UNKNOWN while module is loaded again. Subsequently, MSI write will do nothing. Does this mean that this bug would actually trigger even with devices that do support _EJ0 and power management? Because acpi_pci_set_power_state won't set current_state to PCI_D0 because the "hotplug driver will take care of _PSx" (see 10b3dcae0f275e2546e55303d64ddbb58cec7599) but the hotplug driver is not actually invoked when loading again a driver module of an existing pci device (the example you mention above)? ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-13 14:30 ` Stefano Stabellini @ 2011-10-18 13:52 ` Ajaykumar Hotchandani 2011-10-27 13:14 ` Stefano Stabellini 0 siblings, 1 reply; 14+ messages in thread From: Ajaykumar Hotchandani @ 2011-10-18 13:52 UTC (permalink / raw) To: Stefano Stabellini Cc: Yinghai Lu, Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org On 10/13/2011 08:00 PM, Stefano Stabellini wrote: > On Mon, 10 Oct 2011, Ajaykumar Hotchandani wrote: >> On 10/06/2011 09:47 PM, Stefano Stabellini wrote: >>> On Thu, 6 Oct 2011, Yinghai Lu wrote: >>>> On Thu, Oct 6, 2011 at 3:39 AM, Stefano Stabellini >>>> <stefano.stabellini@eu.citrix.com> wrote: >>>>> I had the same issue and sent a patch a while ago to fix it, adding >>>>> >>>>> current_state = PCI_D0 in acpiphp_glue.c:register_slot >>>>> >>>>> it is strange that this does not work for you: >>>>> >>>>> http://marc.info/?l=linux-kernel&m=129891002722845&w=2 >>>> So guest os has to load acpiphp instead of pciehp? >>> maybe pciehp needs to make sure that current_state = D0 in >>> pciehp_enable_slot, like acpiphp does >> Here, acpi hotplugging is involved. >> With your change in register_slot(), device will have proper power state when module is being loaded for the first time after booting. >> However, while unload of pci module; following is in pci_device_remove(): >> if (pci_dev->current_state == PCI_D0) >> pci_dev->current_state = PCI_UNKNOWN; >> >> So, device power state state will remain PCI_UNKNOWN while module is loaded again. Subsequently, MSI write will do nothing. > Does this mean that this bug would actually trigger even with devices that > do support _EJ0 and power management? Nope. I don't have system to verify. But, for the scenario you mentioned (device is hot pluggable, acpi bus power manageable and device with pm_cap supported; if I understand it correctly), following is what I think should happen: - Inside pci_platform_power_transition(), platform_pci_power_manageable() will be successful. However, platform_pci_set_power_state() will fail (as device supports EJ0) and subsequently pci_update_current_state() will not get called. - So, current power state of device will not be read and power state of device will remain PCI_UNKNOWN. - Now, pci_raw_set_power_state() will get called. Here, as current power state is PCI_UNKNOWN, pci_write_config_word() will be called with pmcsr value as 0. As, last two bits of pmcsr is 0, written power state of device will be PCI_D0 now. And subsequently, dev->current_state will be assigned as PCI_D0 (by using pci_read_config_word() ). > Because acpi_pci_set_power_state won't set current_state to PCI_D0 > because the "hotplug driver will take care of _PSx" (see > 10b3dcae0f275e2546e55303d64ddbb58cec7599) but the hotplug driver is not > actually invoked when loading again a driver module of an existing pci > device (the example you mention above)? Can you please elaborate more on this? acpi_bus_set_power() will update state of acpi_dev (ACPI_STATE_XX) and not pci_dev (PCI_XX). And here, issue lies with current_state of pci_dev. Thanks, Ajay ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-18 13:52 ` Ajaykumar Hotchandani @ 2011-10-27 13:14 ` Stefano Stabellini 2011-10-27 14:34 ` Ajaykumar Hotchandani 0 siblings, 1 reply; 14+ messages in thread From: Stefano Stabellini @ 2011-10-27 13:14 UTC (permalink / raw) To: Ajaykumar Hotchandani Cc: Stefano Stabellini, Yinghai Lu, Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org On Tue, 18 Oct 2011, Ajaykumar Hotchandani wrote: > On 10/13/2011 08:00 PM, Stefano Stabellini wrote: > > On Mon, 10 Oct 2011, Ajaykumar Hotchandani wrote: > >> On 10/06/2011 09:47 PM, Stefano Stabellini wrote: > >>> On Thu, 6 Oct 2011, Yinghai Lu wrote: > >>>> On Thu, Oct 6, 2011 at 3:39 AM, Stefano Stabellini > >>>> <stefano.stabellini@eu.citrix.com> wrote: > >>>>> I had the same issue and sent a patch a while ago to fix it, adding > >>>>> > >>>>> current_state = PCI_D0 in acpiphp_glue.c:register_slot > >>>>> > >>>>> it is strange that this does not work for you: > >>>>> > >>>>> http://marc.info/?l=linux-kernel&m=129891002722845&w=2 > >>>> So guest os has to load acpiphp instead of pciehp? > >>> maybe pciehp needs to make sure that current_state = D0 in > >>> pciehp_enable_slot, like acpiphp does > >> Here, acpi hotplugging is involved. > >> With your change in register_slot(), device will have proper power state when module is being loaded for the first time after booting. > >> However, while unload of pci module; following is in pci_device_remove(): > >> if (pci_dev->current_state == PCI_D0) > >> pci_dev->current_state = PCI_UNKNOWN; > >> > >> So, device power state state will remain PCI_UNKNOWN while module is loaded again. Subsequently, MSI write will do nothing. > > Does this mean that this bug would actually trigger even with devices that > > do support _EJ0 and power management? > Nope. I don't have system to verify. But, for the scenario you mentioned > (device is hot pluggable, acpi bus power manageable and device with > pm_cap supported; if I understand it correctly), following is what I > think should happen: > - Inside pci_platform_power_transition(), > platform_pci_power_manageable() will be successful. However, > platform_pci_set_power_state() will fail (as device supports EJ0) and > subsequently pci_update_current_state() will not get called. > - So, current power state of device will not be read and power state of > device will remain PCI_UNKNOWN. > - Now, pci_raw_set_power_state() will get called. Here, as current power > state is PCI_UNKNOWN, pci_write_config_word() will be called with pmcsr > value as 0. As, last two bits of pmcsr is 0, written power state of > device will be PCI_D0 now. And subsequently, dev->current_state will be > assigned as PCI_D0 (by using pci_read_config_word() ). OK, I understand. > > Because acpi_pci_set_power_state won't set current_state to PCI_D0 > > because the "hotplug driver will take care of _PSx" (see > > 10b3dcae0f275e2546e55303d64ddbb58cec7599) but the hotplug driver is not > > actually invoked when loading again a driver module of an existing pci > > device (the example you mention above)? > Can you please elaborate more on this? > acpi_bus_set_power() will update state of acpi_dev (ACPI_STATE_XX) and > not pci_dev (PCI_XX). > And here, issue lies with current_state of pci_dev. I agree, I was merely stating again what the problem is. BTW I am OK with your patch. ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-27 13:14 ` Stefano Stabellini @ 2011-10-27 14:34 ` Ajaykumar Hotchandani 2011-10-27 15:01 ` Stefano Stabellini 0 siblings, 1 reply; 14+ messages in thread From: Ajaykumar Hotchandani @ 2011-10-27 14:34 UTC (permalink / raw) To: Stefano Stabellini Cc: Yinghai Lu, Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org On 10/27/2011 06:44 PM, Stefano Stabellini wrote: > On Tue, 18 Oct 2011, Ajaykumar Hotchandani wrote: >> On 10/13/2011 08:00 PM, Stefano Stabellini wrote: >>> On Mon, 10 Oct 2011, Ajaykumar Hotchandani wrote: >>>> On 10/06/2011 09:47 PM, Stefano Stabellini wrote: >>>>> On Thu, 6 Oct 2011, Yinghai Lu wrote: >>>>>> On Thu, Oct 6, 2011 at 3:39 AM, Stefano Stabellini >>>>>> <stefano.stabellini@eu.citrix.com> wrote: >>>>>>> I had the same issue and sent a patch a while ago to fix it, adding >>>>>>> >>>>>>> current_state = PCI_D0 in acpiphp_glue.c:register_slot >>>>>>> >>>>>>> it is strange that this does not work for you: >>>>>>> >>>>>>> http://marc.info/?l=linux-kernel&m=129891002722845&w=2 >>>>>> So guest os has to load acpiphp instead of pciehp? >>>>> maybe pciehp needs to make sure that current_state = D0 in >>>>> pciehp_enable_slot, like acpiphp does >>>> Here, acpi hotplugging is involved. >>>> With your change in register_slot(), device will have proper power state when module is being loaded for the first time after booting. >>>> However, while unload of pci module; following is in pci_device_remove(): >>>> if (pci_dev->current_state == PCI_D0) >>>> pci_dev->current_state = PCI_UNKNOWN; >>>> >>>> So, device power state state will remain PCI_UNKNOWN while module is loaded again. Subsequently, MSI write will do nothing. >>> Does this mean that this bug would actually trigger even with devices that >>> do support _EJ0 and power management? >> Nope. I don't have system to verify. But, for the scenario you mentioned >> (device is hot pluggable, acpi bus power manageable and device with >> pm_cap supported; if I understand it correctly), following is what I >> think should happen: >> - Inside pci_platform_power_transition(), >> platform_pci_power_manageable() will be successful. However, >> platform_pci_set_power_state() will fail (as device supports EJ0) and >> subsequently pci_update_current_state() will not get called. >> - So, current power state of device will not be read and power state of >> device will remain PCI_UNKNOWN. >> - Now, pci_raw_set_power_state() will get called. Here, as current power >> state is PCI_UNKNOWN, pci_write_config_word() will be called with pmcsr >> value as 0. As, last two bits of pmcsr is 0, written power state of >> device will be PCI_D0 now. And subsequently, dev->current_state will be >> assigned as PCI_D0 (by using pci_read_config_word() ). > OK, I understand. > > >>> Because acpi_pci_set_power_state won't set current_state to PCI_D0 >>> because the "hotplug driver will take care of _PSx" (see >>> 10b3dcae0f275e2546e55303d64ddbb58cec7599) but the hotplug driver is not >>> actually invoked when loading again a driver module of an existing pci >>> device (the example you mention above)? >> Can you please elaborate more on this? >> acpi_bus_set_power() will update state of acpi_dev (ACPI_STATE_XX) and >> not pci_dev (PCI_XX). >> And here, issue lies with current_state of pci_dev. > I agree, I was merely stating again what the problem is. > > BTW I am OK with your patch. Should we revert 47e9037ac16637cd7f12b8790ea7ce6680e42168 (your changes in register_slot() ) I think, after this change, it's not needed. Or, am I missing some scenario? ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-27 14:34 ` Ajaykumar Hotchandani @ 2011-10-27 15:01 ` Stefano Stabellini 2011-10-28 12:06 ` Ajaykumar Hotchandani 0 siblings, 1 reply; 14+ messages in thread From: Stefano Stabellini @ 2011-10-27 15:01 UTC (permalink / raw) To: Ajaykumar Hotchandani Cc: Stefano Stabellini, Yinghai Lu, Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org On Thu, 27 Oct 2011, Ajaykumar Hotchandani wrote: > > BTW I am OK with your patch. > > Should we revert 47e9037ac16637cd7f12b8790ea7ce6680e42168 (your changes > in register_slot() ) > I think, after this change, it's not needed. Or, am I missing some scenario? > I think you are right, 47e9037ac16637cd7f12b8790ea7ce6680e42168 is not needed anymore, so I am OK with reverting it. ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support 2011-10-27 15:01 ` Stefano Stabellini @ 2011-10-28 12:06 ` Ajaykumar Hotchandani 0 siblings, 0 replies; 14+ messages in thread From: Ajaykumar Hotchandani @ 2011-10-28 12:06 UTC (permalink / raw) To: Jesse Barnes, linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org Cc: Stefano Stabellini, Yinghai Lu On 10/27/2011 08:31 PM, Stefano Stabellini wrote: > On Thu, 27 Oct 2011, Ajaykumar Hotchandani wrote: >>> BTW I am OK with your patch. >> Should we revert 47e9037ac16637cd7f12b8790ea7ce6680e42168 (your changes >> in register_slot() ) >> I think, after this change, it's not needed. Or, am I missing some scenario? >> > I think you are right, 47e9037ac16637cd7f12b8790ea7ce6680e42168 is not > needed anymore, so I am OK with reverting it. Thanks, following is updated patch. [PATCH -v2] PCI: Set device power state to PCI_D0 for device without native PM support During test of one IB card with guest VM, found that, msi is not initialized properly. It turns out __write_msi_msg will do nothing if device current_state is not PCI_D0. And, that pci device does not have pm_cap in guest VM. There is an error in setting of power state to PCI_D0 in pci_enable_device(), but error is not returned for this. Following is code flow: pci_enable_device() --> __pci_enable_device_flags() --> do_pci_enable_device() --> pci_set_power_state() --> __pci_start_power_transition() We have following condition inside __pci_start_power_transition(): if (platform_pci_power_manageable(dev)) { error = platform_pci_set_power_state(dev, state); if (!error) pci_update_current_state(dev, state); } else { error = -ENODEV; /* Fall back to PCI_D0 if native PM is not supported */ if (!dev->pm_cap) dev->current_state = PCI_D0; } Here, from platform_pci_set_power_state(), acpi_pci_set_power_state() is getting called and that is failing with ENODEV because of following condition: if (!handle || ACPI_SUCCESS(acpi_get_handle(handle, "_EJ0",&tmp))) return -ENODEV; Because of that, pci_update_current_state() is not getting called. With this patch, if device power state can not be set via platform_pci_set_power_state and that device does not have native pm support, then PCI device power state will be set to PCI_D0. -v2: This also reverts 47e9037ac16637cd7f12b8790ea7ce6680e42168, as it's not needed after this change. Signed-off-by: Ajaykumar Hotchandani <ajaykumar.hotchandani@oracle.com> Signed-off-by: Yinghai Lu <yinghai.lu@oracle.com> --- drivers/pci/hotplug/acpiphp_glue.c | 1 - drivers/pci/pci.c | 3 +++ 2 files changed, 3 insertions(+), 1 deletions(-) diff --git a/drivers/pci/hotplug/acpiphp_glue.c b/drivers/pci/hotplug/acpiphp_glue.c index 2202857..11431c1 100644 --- a/drivers/pci/hotplug/acpiphp_glue.c +++ b/drivers/pci/hotplug/acpiphp_glue.c @@ -212,7 +212,6 @@ register_slot(acpi_handle handle, u32 lvl, void *context, void **rv) pdev = pci_get_slot(pbus, PCI_DEVFN(device, function)); if (pdev) { - pdev->current_state = PCI_D0; slot->flags |= (SLOT_ENABLED | SLOT_POWEREDON); pci_dev_put(pdev); } diff --git a/drivers/pci/pci.c b/drivers/pci/pci.c index e9651f0..6866937 100644 --- a/drivers/pci/pci.c +++ b/drivers/pci/pci.c @@ -664,6 +664,9 @@ static int pci_platform_power_transition(struct pci_dev *dev, pci_power_t state) error = platform_pci_set_power_state(dev, state); if (!error) pci_update_current_state(dev, state); + /* Fall back to PCI_D0 if native PM is not supported */ + if (!dev->pm_cap) + dev->current_state = PCI_D0; } else { error = -ENODEV; /* Fall back to PCI_D0 if native PM is not supported */ -- 1.7.5.1 ^ permalink raw reply related [flat|nested] 14+ messages in thread
end of thread, other threads:[~2011-10-28 12:07 UTC | newest] Thread overview: 14+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2011-10-05 11:34 [PATCH] PCI: Set device power state to PCI_D0 for device without native PM support Ajaykumar Hotchandani 2011-10-05 19:41 ` Rafael J. Wysocki 2011-10-06 16:10 ` Yinghai Lu 2011-10-06 17:20 ` Rafael J. Wysocki 2011-10-06 10:39 ` Stefano Stabellini 2011-10-06 16:05 ` Yinghai Lu 2011-10-06 16:17 ` Stefano Stabellini 2011-10-10 12:02 ` Ajaykumar Hotchandani 2011-10-13 14:30 ` Stefano Stabellini 2011-10-18 13:52 ` Ajaykumar Hotchandani 2011-10-27 13:14 ` Stefano Stabellini 2011-10-27 14:34 ` Ajaykumar Hotchandani 2011-10-27 15:01 ` Stefano Stabellini 2011-10-28 12:06 ` Ajaykumar Hotchandani
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox