* RE: [PATCH v1 1/2] PCI: pciehp: Fix presence detect change interrupt handling
@ 2016-08-23 8:58 Patel, Mayurkumar
2016-09-08 19:59 ` Bjorn Helgaas
0 siblings, 1 reply; 3+ messages in thread
From: Patel, Mayurkumar @ 2016-08-23 8:58 UTC
To: 'Bjorn Helgaas'
Cc: 'Rajat Jain', 'bhelgaas@google.com',
'linux-pci@vger.kernel.org', Wysocki, Rafael J,
'mika.westerberg@linux.intel.com', Busch, Keith,
Tarazona-Duarte, Luis Antonio, 'Rajat Jain',
'Andy Shevchenko', Patel, Mayurkumar
Currently, if a hotplug removal is followed very quickly by an insertion,
for example

[ 608.823412] pciehp 0000:00:1c.1:pcie04: Card not present on Slot(1)
[ 608.835249] pciehp 0000:00:1c.1:pcie04: Card present on Slot(1)

the following happens.

On removal:
pcie_isr() -> pciehp_queue_interrupt_event() -> queue_work().
The work item runs interrupt_event_handler(), which takes the
INT_PRESENCE_OFF case and calls handle_surprise_event().

handle_surprise_event() calls pciehp_get_adapter_status() again and
re-reads the slot status, which may already have changed because the
PCI_EXP_SLTSTA_PDC event for the insertion has happened. It therefore
queues ENABLE_REQ for both the removal and the insertion interrupt,
based on the latest slot status.

As a result, the PCIe device cannot be hot-added again: it was never
removed, so it cannot be enabled.

Remove handle_surprise_event() and call pciehp_queue_power_work()
directly from the INT_PRESENCE_ON and INT_PRESENCE_OFF cases of the
switch in interrupt_event_handler().

This ensures that pciehp_queue_power_work() processes presence detect
changes for removal and insertion correctly.

Signed-off-by: Mayurkumar Patel <mayurkumar.patel@intel.com>
Acked-by: Rajat Jain <rajatxjain@gmail.com>
---
Resending this patch together with a second patch that proposes
corrected event handling in pcie_isr().
drivers/pci/hotplug/pciehp_ctrl.c | 18 ++----------------
1 file changed, 2 insertions(+), 16 deletions(-)
diff --git a/drivers/pci/hotplug/pciehp_ctrl.c b/drivers/pci/hotplug/pciehp_ctrl.c
index 880978b..87c5bea 100644
--- a/drivers/pci/hotplug/pciehp_ctrl.c
+++ b/drivers/pci/hotplug/pciehp_ctrl.c
@@ -301,20 +301,6 @@ static void handle_button_press_event(struct slot *p_slot)
/*
* Note: This function must be called with slot->lock held
*/
-static void handle_surprise_event(struct slot *p_slot)
-{
- u8 getstatus;
-
- pciehp_get_adapter_status(p_slot, &getstatus);
- if (!getstatus)
- pciehp_queue_power_work(p_slot, DISABLE_REQ);
- else
- pciehp_queue_power_work(p_slot, ENABLE_REQ);
-}
-
-/*
- * Note: This function must be called with slot->lock held
- */
static void handle_link_event(struct slot *p_slot, u32 event)
{
struct controller *ctrl = p_slot->ctrl;
@@ -377,14 +363,14 @@ static void interrupt_event_handler(struct work_struct *work)
pciehp_green_led_off(p_slot);
break;
case INT_PRESENCE_ON:
- handle_surprise_event(p_slot);
+ pciehp_queue_power_work(p_slot, ENABLE_REQ);
break;
case INT_PRESENCE_OFF:
/*
* Regardless of surprise capability, we need to
* definitely remove a card that has been pulled out!
*/
- handle_surprise_event(p_slot);
+ pciehp_queue_power_work(p_slot, DISABLE_REQ);
break;
case INT_LINK_UP:
case INT_LINK_DOWN:
--
1.7.9.5
Intel Deutschland GmbH
Registered Address: Am Campeon 10-12, 85579 Neubiberg, Germany
Tel: +49 89 99 8853-0, www.intel.de
Managing Directors: Christin Eisenschmid, Christian Lamprechter
Chairperson of the Supervisory Board: Nicole Lau
Registered Office: Munich
Commercial Register: Amtsgericht Muenchen HRB 186928
* Re: [PATCH v1 1/2] PCI: pciehp: Fix presence detect change interrupt handling
2016-08-23 8:58 [PATCH v1 1/2] PCI: pciehp: Fix presence detect change interrupt handling Patel, Mayurkumar
@ 2016-09-08 19:59 ` Bjorn Helgaas
0 siblings, 0 replies; 3+ messages in thread
From: Bjorn Helgaas @ 2016-09-08 19:59 UTC
To: Patel, Mayurkumar
Cc: 'Rajat Jain', 'bhelgaas@google.com',
'linux-pci@vger.kernel.org', Wysocki, Rafael J,
'mika.westerberg@linux.intel.com', Busch, Keith,
Tarazona-Duarte, Luis Antonio, 'Rajat Jain',
'Andy Shevchenko'
On Tue, Aug 23, 2016 at 08:58:51AM +0000, Patel, Mayurkumar wrote:
> Currently, if very fast hotplug removal and insertion event comes
> as following
>
> [ 608.823412] pciehp 0000:00:1c.1:pcie04: Card not present on Slot(1)
> [ 608.835249] pciehp 0000:00:1c.1:pcie04: Card present on Slot(1)
>
> In this case following scenario happens,
>
> While removal:
> pcie_isr() -> pciehp_queue_interrupt_event() -> triggers queue_work().
> work invokes interrupt_event_handler() -> case INT_PRESENCE_OFF
> and calls handle_surprise_event().
>
> handle_surprise_event() again calls pciehp_get_adapter_status()
> and reads slot status which might have been changed
> already due to PCI_EXP_SLTSTA_PDC event for hotplug insertion
> has happened. So it queues, ENABLE_REQ for both removal
> and insertion interrupt based on latest slot status.
>
> In this case, PCIe device can not be hot-add again because
> it was never removed due to which device can not get enabled.
>
> handle_surprise_event() can be removed and pciehp_queue_power_work()
> can be directly triggered based on INT_PRESENCE_ON and INT_PRESENCE_OFF
> from the switch case exist in interrupt_event_handler().
>
> The patch ensures the pciehp_queue_power_work() processes
> presence detect change for removal and insertion correctly.
>
> Signed-off-by: Mayurkumar Patel <mayurkumar.patel@intel.com>
> Acked-by: Rajat Jain <rajatxjain@gmail.com>

I applied this to pci/hotplug for v4.9, with the following changelog.

PCI: pciehp: Fix presence detect change interrupt handling

When a hotplug insertion happens immediately after a hotplug removal, we
may not handle the removal correctly, which may cause the insertion to
fail.

If Presence Detect State (PCI_EXP_SLTSTA_PDS) has changed from "card
present" to "empty", we must remove the kernel pci_dev, even if a device
is inserted again. With the previous code, that might not happen if the
insertion happens soon after the removal. Consider this path:

  # hotplug removal causes interrupt and clears PCI_EXP_SLTSTA_PDS
  pcie_isr
    pcie_capability_read_word(pdev, PCI_EXP_SLTSTA, &status)
    present = status & PCI_EXP_SLTSTA_PDS     # FALSE
    pciehp_queue_interrupt_event(INT_PRESENCE_OFF)
      queue_work(...)                         # interrupt_event_handler

  # hotplug insertion sets PCI_EXP_SLTSTA_PDS

  interrupt_event_handler
    handle_surprise_event
      pciehp_get_adapter_status
        pcie_capability_read_word(pdev, PCI_EXP_SLTSTA, &status)
        present = status & PCI_EXP_SLTSTA_PDS # TRUE   <----
      pciehp_queue_power_work(ENABLE_REQ)

The first PCI_EXP_SLTSTA read sees that the slot was empty, so we queue up
handle_surprise_event(). But handle_surprise_event() reads PCI_EXP_SLTSTA
again, and by that time, the slot has a card in it again, so it tries to
turn on the power and scan the slot. The scan fails because we still have
the old pci_dev for the device that was removed.

In interrupt_event_handler(), we already know the event type
(INT_PRESENCE_ON or INT_PRESENCE_OFF), so there's no need to read
PCI_EXP_SLTSTA again in handle_surprise_event(). Remove
handle_surprise_event() and queue the power work directly.

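The race described in this changelog can be replayed in a small user-space
model. This is only a sketch under stated assumptions: slot_present,
request_by_reread(), request_by_event() and replay_fast_reinsert() are
hypothetical names invented for illustration, the enums are local stand-ins
rather than the kernel definitions, and no real register access or workqueue
is involved.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-ins for the driver's constants; not the kernel definitions. */
enum event_type { INT_PRESENCE_ON, INT_PRESENCE_OFF };
enum power_req { ENABLE_REQ, DISABLE_REQ };

/* Simulated Presence Detect State bit; the "hardware" may change it at any time. */
static bool slot_present;

/* Old behaviour: ignore the queued event type and re-read the live state. */
enum power_req request_by_reread(enum event_type ev)
{
	(void)ev;
	return slot_present ? ENABLE_REQ : DISABLE_REQ;
}

/* Patched behaviour: trust the event type recorded at interrupt time. */
enum power_req request_by_event(enum event_type ev)
{
	assert(ev == INT_PRESENCE_ON || ev == INT_PRESENCE_OFF);
	return ev == INT_PRESENCE_ON ? ENABLE_REQ : DISABLE_REQ;
}

/*
 * Replay the race: the card is pulled (event queued), then re-inserted
 * before the work item runs. Returns the request the handler would queue.
 */
enum power_req replay_fast_reinsert(bool patched)
{
	enum event_type queued;

	slot_present = false;		/* removal: ISR sees an empty slot */
	queued = INT_PRESENCE_OFF;	/* ...and queues the removal event */
	slot_present = true;		/* re-insertion before the work runs */

	return patched ? request_by_event(queued) : request_by_reread(queued);
}
```

In this model, replay_fast_reinsert(false) yields ENABLE_REQ for the queued
removal, which is the failure described above, while
replay_fast_reinsert(true) yields DISABLE_REQ.
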
> ---
> Resending the patch with another patch which has pcie_isr() correct
> event handling proposal
>
> drivers/pci/hotplug/pciehp_ctrl.c | 18 ++----------------
> 1 file changed, 2 insertions(+), 16 deletions(-)
>
> diff --git a/drivers/pci/hotplug/pciehp_ctrl.c b/drivers/pci/hotplug/pciehp_ctrl.c
> index 880978b..87c5bea 100644
> --- a/drivers/pci/hotplug/pciehp_ctrl.c
> +++ b/drivers/pci/hotplug/pciehp_ctrl.c
> @@ -301,20 +301,6 @@ static void handle_button_press_event(struct slot *p_slot)
> /*
> * Note: This function must be called with slot->lock held
> */
> -static void handle_surprise_event(struct slot *p_slot)
> -{
> - u8 getstatus;
> -
> - pciehp_get_adapter_status(p_slot, &getstatus);
> - if (!getstatus)
> - pciehp_queue_power_work(p_slot, DISABLE_REQ);
> - else
> - pciehp_queue_power_work(p_slot, ENABLE_REQ);
> -}
> -
> -/*
> - * Note: This function must be called with slot->lock held
> - */
> static void handle_link_event(struct slot *p_slot, u32 event)
> {
> struct controller *ctrl = p_slot->ctrl;
> @@ -377,14 +363,14 @@ static void interrupt_event_handler(struct work_struct *work)
> pciehp_green_led_off(p_slot);
> break;
> case INT_PRESENCE_ON:
> - handle_surprise_event(p_slot);
> + pciehp_queue_power_work(p_slot, ENABLE_REQ);
> break;
> case INT_PRESENCE_OFF:
> /*
> * Regardless of surprise capability, we need to
> * definitely remove a card that has been pulled out!
> */
> - handle_surprise_event(p_slot);
> + pciehp_queue_power_work(p_slot, DISABLE_REQ);
> break;
> case INT_LINK_UP:
> case INT_LINK_DOWN:
> --
> 1.7.9.5
* RE: [PATCH v1] PCI: pciehp: Fix presence detect change interrupt handling
@ 2016-08-17 22:37 Patel, Mayurkumar
2016-08-18 21:07 ` [PATCH v1 1/2] " Mayurkumar Patel
0 siblings, 1 reply; 3+ messages in thread
From: Patel, Mayurkumar @ 2016-08-17 22:37 UTC
To: Bjorn Helgaas, Rajat Jain
Cc: bhelgaas@google.com, linux-pci@vger.kernel.org, Wysocki, Rafael J,
mika.westerberg@linux.intel.com, Shevchenko, Andriy, Busch, Keith,
Tarazona-Duarte, Luis Antonio, Rajat Jain
Hi Bjorn and Rajat
Thanks for replying.
> Hi Rajat, thanks for chiming in!
>
> On Wed, Aug 17, 2016 at 10:54:12AM -0700, Rajat Jain wrote:
> > On Wed, Aug 17, 2016 at 10:12 AM, Bjorn Helgaas <helgaas@kernel.org> wrote:
> > >
> > > Hi Mayurkumar,
> > >
> > > On Wed, Aug 17, 2016 at 01:42:18PM +0000, Patel, Mayurkumar wrote:
> > > > Currently, if very fast hotplug removal and insertion event comes
> > > > as following
> > > >
> > > > [ 608.823412] pciehp 0000:00:1c.1:pcie04: Card not present on Slot(1)
> > > > [ 608.835249] pciehp 0000:00:1c.1:pcie04: Card present on Slot(1)
> > > >
> > > > In this case following scenario happens,
> > > >
> > > > While removal:
> > > > pcie_isr() -> pciehp_queue_interrupt_event() -> triggers queue_work().
> > > > work invokes interrupt_event_handler() -> case INT_PRESENCE_OFF
> > > > and calls handle_surprise_event().
> > > >
> > > > handle_surprise_event() again calls pciehp_get_adapter_status()
> > > > and reads slot status which might have been changed
> > > > already due to PCI_EXP_SLTSTA_PDC event for hotplug insertion
> > > > has happened. So it queues, ENABLE_REQ for both removal
> > > > and insertion interrupt based on latest slot status.
> > > >
> > > > In this case, PCIe device can not be hot-add again because
> > > > it was never removed due to which device can not get enabled.
> > > >
> > > > handle_surprise_event() can be removed and pciehp_queue_power_work()
> > > > can be directly triggered based on INT_PRESENCE_ON and INT_PRESENCE_OFF
> > > > from the switch case exist in interrupt_event_handler().
> > > >
> > > > The patch ensures the pciehp_queue_power_work() processes
> > > > presence detect change for removal and insertion correctly.
> > > >
> > > > Signed-off-by: Mayurkumar Patel <mayurkumar.patel@intel.com>
> >
> > Acked-by: Rajat Jain <rajatxjain@gmail.com>
> >
> > >
> > > > ---
> > > > Resending the patch addressing to PCI Maintainer Bjorn Helgaas.
> > > >
> > > > drivers/pci/hotplug/pciehp_ctrl.c | 18 ++----------------
> > > > 1 file changed, 2 insertions(+), 16 deletions(-)
> > > >
> > > > diff --git a/drivers/pci/hotplug/pciehp_ctrl.c b/drivers/pci/hotplug/pciehp_ctrl.c
> > > > index 880978b..87c5bea 100644
> > > > --- a/drivers/pci/hotplug/pciehp_ctrl.c
> > > > +++ b/drivers/pci/hotplug/pciehp_ctrl.c
> > > > @@ -301,20 +301,6 @@ static void handle_button_press_event(struct slot *p_slot)
> > > > /*
> > > > * Note: This function must be called with slot->lock held
> > > > */
> > > > -static void handle_surprise_event(struct slot *p_slot)
> > > > -{
> > > > - u8 getstatus;
> > > > -
> > > > - pciehp_get_adapter_status(p_slot, &getstatus);
> > > > - if (!getstatus)
> > > > - pciehp_queue_power_work(p_slot, DISABLE_REQ);
> > > > - else
> > > > - pciehp_queue_power_work(p_slot, ENABLE_REQ);
> > > > -}
> > > > -
> > > > -/*
> > > > - * Note: This function must be called with slot->lock held
> > > > - */
> > > > static void handle_link_event(struct slot *p_slot, u32 event)
> > > > {
> > > > struct controller *ctrl = p_slot->ctrl;
> > > > @@ -377,14 +363,14 @@ static void interrupt_event_handler(struct work_struct *work)
> > > > pciehp_green_led_off(p_slot);
> > > > break;
> > > > case INT_PRESENCE_ON:
> > > > - handle_surprise_event(p_slot);
> > > > + pciehp_queue_power_work(p_slot, ENABLE_REQ);
> > > > break;
> > > > case INT_PRESENCE_OFF:
> > > > /*
> > > > * Regardless of surprise capability, we need to
> > > > * definitely remove a card that has been pulled out!
> > > > */
> > > > - handle_surprise_event(p_slot);
> > > > + pciehp_queue_power_work(p_slot, DISABLE_REQ);
> > > > break;
> > > > case INT_LINK_UP:
> > > > case INT_LINK_DOWN:
> > >
> > > Thanks a lot for this. I think other people have seen the same issue.
> > >
> > > Even with this fix, don't we have essentially the same problem one
> > > layer back? The first thing pcie_isr() does is read PCI_EXP_SLTSTA,
> > > then few lines down, we call pciehp_get_adapter_status(), which reads
> > > PCI_EXP_SLTSTA *again*. So I think the window is smaller but still
> > > there.
> > >
> > > I think what we really should do is read the status registers
> > > (PCI_EXP_SLTSTA and probably also PCI_EXP_LNKSTA) *once* in
> > > pcie_isr(), before we write PCI_EXP_SLTSTA to clear the RW1C bits
> > > there, and then queue up events based on those values, without
> > > re-reading the registers.
> > >
> > > What do you think?
> >
> >
> > Yes, I agree.
Yes indeed that should be done too.
> > We need to do something about that *in addition * to the
> > above patch to cover the
> > whole story. However I think there still will be a room for some
> > interrupt misses because we are
> > collecting the interrupts in intr_loc, and theoretically we could be
> > in a situation where in the pcie_isr, the
> >
> > do {
> > ...
> > } while(detected)
> >
> > loop gets a removal->insertion->removal all while in the same
> > invocation of pcie_isr().
> > If this happens, the intr_loc will have recorded a single insertion
> > and a single removal, and
> > the final result will depend on the order in which we decide to
> > process the events in intr_loc.
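The ordering concern above can be illustrated with a toy model of a bitmask
accumulator. This is a hedged sketch, not the driver's code: collect_as_mask()
and the two bit macros are hypothetical names, and the input array simply
lists the presence states observed during one ISR invocation.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical event bits, standing in for entries collected in intr_loc. */
#define PRESENCE_ON_BIT  0x1u
#define PRESENCE_OFF_BIT 0x2u

/*
 * OR every observed presence change into a single mask. The mask records
 * which kinds of event occurred, but not how many times or in what order.
 */
unsigned int collect_as_mask(const int *present, size_t n)
{
	unsigned int mask = 0;
	size_t i;

	assert(present != NULL);
	for (i = 0; i < n; i++)
		mask |= present[i] ? PRESENCE_ON_BIT : PRESENCE_OFF_BIT;
	return mask;
}
```

With this model, a removal->insertion->removal sequence and a plain
insertion->removal sequence produce the same mask, so the code that later
walks the mask cannot tell which state came last.
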
>
> I don't quite understand how that "do { .. } while (detected)" loop
> works or why it's done that way. Collecting interrupt status bits in
> an ISR is obviously a very common task; it seems like there should be
> a standard, idiomatic way of doing it, but I don't know it.
>
> > Or, may be we can make the calls to pciehp_queue_interrupt_event()
> > before clearing the
> > RW1C in the slot status register (in the loop)?
>
> Yeah, it seems like we should read PCI_EXP_SLTSTA once, queue up any
> events related to it, then clear the relevant SLTSTA bits.
>
Do you mean removing the do {...} while loop and just reading
PCI_EXP_SLTSTA once in the ISR, queuing the work, and then clearing the
interrupts?
> Bjorn
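The "read once, queue from that value, then clear" idea discussed in this
message might look roughly like the following user-space model. It is a
sketch under assumptions, not a proposed pcie_isr(): the register is a plain
variable, read_sltsta(), write_sltsta(), isr_once() and hw_set_presence() are
hypothetical helpers, and EV_NONE is an invented placeholder. Only the two bit
values follow the PCIe Slot Status layout (PDC = 0x0008, PDS = 0x0040).

```c
#include <assert.h>
#include <stdint.h>

#define SLTSTA_PDC 0x0008u	/* Presence Detect Changed (RW1C) */
#define SLTSTA_PDS 0x0040u	/* Presence Detect State (read-only) */

enum event_type { EV_NONE, INT_PRESENCE_ON, INT_PRESENCE_OFF };

/* Simulated Slot Status register. */
static uint16_t sltsta;

uint16_t read_sltsta(void)
{
	return sltsta;
}

/* RW1C semantics: writing 1 to the "changed" bit clears it; PDS is untouched. */
void write_sltsta(uint16_t val)
{
	sltsta &= (uint16_t)~(val & SLTSTA_PDC);
}

/*
 * One pass of a hypothetical ISR: take a single snapshot, derive the event
 * from that snapshot only, then clear the bits that were actually seen.
 */
enum event_type isr_once(void)
{
	uint16_t status = read_sltsta();
	enum event_type ev = EV_NONE;

	if (status & SLTSTA_PDC)
		ev = (status & SLTSTA_PDS) ? INT_PRESENCE_ON : INT_PRESENCE_OFF;

	write_sltsta(status & SLTSTA_PDC);
	/* In this single-threaded model nothing can set the bit again here. */
	assert(!(read_sltsta() & SLTSTA_PDC));
	return ev;
}

/* Model helper: the "hardware" reports a presence change. */
void hw_set_presence(int present)
{
	if (present)
		sltsta |= SLTSTA_PDS;
	else
		sltsta &= (uint16_t)~SLTSTA_PDS;
	sltsta |= SLTSTA_PDC;
}
```

The event type is decided entirely from the one snapshot, so a later change
to the live register cannot alter an event that has already been queued.
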
* [PATCH v1 1/2] PCI: pciehp: Fix presence detect change interrupt handling
2016-08-17 22:37 [PATCH v1] " Patel, Mayurkumar
@ 2016-08-18 21:07 ` Mayurkumar Patel
0 siblings, 0 replies; 3+ messages in thread
From: Mayurkumar Patel @ 2016-08-18 21:07 UTC
To: helgaas, bhelgaas
Cc: rajatja, linux-pci, andriy.shevchenko, mika.westerberg,
rafael.j.wysocki, luis.antonio.tarazona-duarte, keith.busch,
mayurkumar.patel
Currently, if a hotplug removal is followed very quickly by an insertion,
for example

[ 608.823412] pciehp 0000:00:1c.1:pcie04: Card not present on Slot(1)
[ 608.835249] pciehp 0000:00:1c.1:pcie04: Card present on Slot(1)

the following happens.

On removal:
pcie_isr() -> pciehp_queue_interrupt_event() -> queue_work().
The work item runs interrupt_event_handler(), which takes the
INT_PRESENCE_OFF case and calls handle_surprise_event().

handle_surprise_event() calls pciehp_get_adapter_status() again and
re-reads the slot status, which may already have changed because the
PCI_EXP_SLTSTA_PDC event for the insertion has happened. It therefore
queues ENABLE_REQ for both the removal and the insertion interrupt,
based on the latest slot status.

As a result, the PCIe device cannot be hot-added again: it was never
removed, so it cannot be enabled.

Remove handle_surprise_event() and call pciehp_queue_power_work()
directly from the INT_PRESENCE_ON and INT_PRESENCE_OFF cases of the
switch in interrupt_event_handler().

This ensures that pciehp_queue_power_work() processes presence detect
changes for removal and insertion correctly.

Signed-off-by: Mayurkumar Patel <mayurkumar.patel@intel.com>
Acked-by: Rajat Jain <rajatxjain@gmail.com>
---
drivers/pci/hotplug/pciehp_ctrl.c | 18 ++----------------
1 file changed, 2 insertions(+), 16 deletions(-)
diff --git a/drivers/pci/hotplug/pciehp_ctrl.c b/drivers/pci/hotplug/pciehp_ctrl.c
index 880978b..87c5bea 100644
--- a/drivers/pci/hotplug/pciehp_ctrl.c
+++ b/drivers/pci/hotplug/pciehp_ctrl.c
@@ -301,20 +301,6 @@ static void handle_button_press_event(struct slot *p_slot)
/*
* Note: This function must be called with slot->lock held
*/
-static void handle_surprise_event(struct slot *p_slot)
-{
- u8 getstatus;
-
- pciehp_get_adapter_status(p_slot, &getstatus);
- if (!getstatus)
- pciehp_queue_power_work(p_slot, DISABLE_REQ);
- else
- pciehp_queue_power_work(p_slot, ENABLE_REQ);
-}
-
-/*
- * Note: This function must be called with slot->lock held
- */
static void handle_link_event(struct slot *p_slot, u32 event)
{
struct controller *ctrl = p_slot->ctrl;
@@ -377,14 +363,14 @@ static void interrupt_event_handler(struct work_struct *work)
pciehp_green_led_off(p_slot);
break;
case INT_PRESENCE_ON:
- handle_surprise_event(p_slot);
+ pciehp_queue_power_work(p_slot, ENABLE_REQ);
break;
case INT_PRESENCE_OFF:
/*
* Regardless of surprise capability, we need to
* definitely remove a card that has been pulled out!
*/
- handle_surprise_event(p_slot);
+ pciehp_queue_power_work(p_slot, DISABLE_REQ);
break;
case INT_LINK_UP:
case INT_LINK_DOWN:
--
1.7.9.5