From: Sathyanarayanan Kuppuswamy <sathyanarayanan.kuppuswamy@linux.intel.com>
To: Lukas Wunner <lukas@wunner.de>, Bjorn Helgaas <helgaas@kernel.org>
Cc: Keith Busch <kbusch@kernel.org>,
Yicong Yang <yangyicong@hisilicon.com>,
linux-pci@vger.kernel.org,
Stuart Hayes <stuart.w.hayes@gmail.com>,
Mika Westerberg <mika.westerberg@linux.intel.com>,
Ilpo Jarvinen <ilpo.jarvinen@linux.intel.com>,
Joel Mathew Thomas <proxy0@tutamail.com>,
Russ Weight <russ.weight@linux.dev>,
Matthew Gerlach <matthew.gerlach@altera.com>,
Yilun Xu <yilun.xu@intel.com>,
linux-fpga@vger.kernel.org, Moshe Shemesh <moshe@nvidia.com>,
Shay Drory <shayd@nvidia.com>, Saeed Mahameed <saeedm@nvidia.com>,
Alex Williamson <alex.williamson@redhat.com>
Subject: Re: [PATCH 1/2] PCI: pciehp: Ignore Presence Detect Changed caused by DPC
Date: Thu, 10 Apr 2025 19:34:41 -0700
Message-ID: <6b8cf94f-4264-46c5-bf08-77e77796c3ac@linux.intel.com>
In-Reply-To: <fa264ff71952915c4e35a53c89eb0cde8455a5c5.1744298239.git.lukas@wunner.de>

On 4/10/25 8:27 AM, Lukas Wunner wrote:
> Commit a97396c6eb13 ("PCI: pciehp: Ignore Link Down/Up caused by DPC")
> amended PCIe hotplug to not bring down the slot upon Data Link Layer State
> Changed events caused by Downstream Port Containment.
>
> However Keith reports off-list that if the slot uses in-band presence
> detect (i.e. Presence Detect State is derived from Data Link Layer Link
> Active), DPC also causes a spurious Presence Detect Changed event.
>
> This needs to be ignored as well.
>
> Unfortunately there's no register indicating that in-band presence detect
> is used. PCIe r5.0 sec 7.5.3.10 introduced the In-Band PD Disable bit in
> the Slot Control Register. The PCIe hotplug driver sets this bit on
> ports supporting it. But older ports may still use in-band presence
> detect.
>
> If in-band presence detect can be disabled, Presence Detect Changed events

It should be "in-band presence detect is disabled", right?

> occurring during DPC must not be ignored because they signal device
> replacement. On all other ports, device replacement cannot be detected
> reliably because the Presence Detect Changed event could be a side effect
> of DPC. On those (older) ports, perform a best-effort device replacement
> check by comparing the Vendor ID, Device ID and other data in Config Space
> with the values cached in struct pci_dev. Use the existing helper
> pciehp_device_replaced() to accomplish this. It is currently #ifdef'ed to
> CONFIG_PM_SLEEP in pciehp_core.c, so move it to pciehp_hpc.c where most
> other functions accessing config space reside.
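
For anyone following along, the best-effort check described above comes down to re-reading a few identity dwords and comparing them with cached values. Below is a minimal userspace sketch, not the kernel code: struct cached_id and id_mismatch() are hypothetical stand-ins for struct pci_dev and for the first comparison in pciehp_device_replaced(), and the dwords are passed in as plain integers instead of being read with pci_read_config_dword(). The subsystem ID and DSN comparisons are left out.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the identity fields cached in struct pci_dev. */
struct cached_id {
	uint16_t vendor;
	uint16_t device;
	uint8_t  revision;
	uint32_t class_code;	/* 24-bit class code */
};

/*
 * Return true if the dwords read back from config space no longer match
 * the cached identity.
 *
 * id_dword:    dword at offset 0x00, Vendor ID in bits 15:0,
 *              Device ID in bits 31:16
 * class_dword: dword at offset 0x08, Revision ID in bits 7:0,
 *              Class Code in bits 31:8
 */
static bool id_mismatch(const struct cached_id *c,
			uint32_t id_dword, uint32_t class_dword)
{
	uint32_t want_id = c->vendor | ((uint32_t)c->device << 16);
	uint32_t want_class = c->revision | (c->class_code << 8);

	return id_dword != want_id || class_dword != want_class;
}
```

With made-up cached values vendor 0x8086, device 0x1234, revision 0x02 and class code 0x010802, the dwords 0x12348086 and 0x01080202 count as "same device"; any other value in either dword counts as a replacement.
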
>
> Reported-by: Keith Busch <kbusch@kernel.org>
> Signed-off-by: Lukas Wunner <lukas@wunner.de>
> ---

Code looks fine to me.

Reviewed-by: Kuppuswamy Sathyanarayanan <sathyanarayanan.kuppuswamy@linux.intel.com>

> drivers/pci/hotplug/pciehp.h | 1 +
> drivers/pci/hotplug/pciehp_core.c | 29 -----------------------
> drivers/pci/hotplug/pciehp_hpc.c | 49 +++++++++++++++++++++++++++++++++------
> 3 files changed, 43 insertions(+), 36 deletions(-)
>
> diff --git a/drivers/pci/hotplug/pciehp.h b/drivers/pci/hotplug/pciehp.h
> index 273dd8c..debc79b0 100644
> --- a/drivers/pci/hotplug/pciehp.h
> +++ b/drivers/pci/hotplug/pciehp.h
> @@ -187,6 +187,7 @@ struct controller {
> int pciehp_card_present_or_link_active(struct controller *ctrl);
> int pciehp_check_link_status(struct controller *ctrl);
> int pciehp_check_link_active(struct controller *ctrl);
> +bool pciehp_device_replaced(struct controller *ctrl);
> void pciehp_release_ctrl(struct controller *ctrl);
>
> int pciehp_sysfs_enable_slot(struct hotplug_slot *hotplug_slot);
> diff --git a/drivers/pci/hotplug/pciehp_core.c b/drivers/pci/hotplug/pciehp_core.c
> index 997841c..f59baa9 100644
> --- a/drivers/pci/hotplug/pciehp_core.c
> +++ b/drivers/pci/hotplug/pciehp_core.c
> @@ -284,35 +284,6 @@ static int pciehp_suspend(struct pcie_device *dev)
> return 0;
> }
>
> -static bool pciehp_device_replaced(struct controller *ctrl)
> -{
> - struct pci_dev *pdev __free(pci_dev_put) = NULL;
> - u32 reg;
> -
> - if (pci_dev_is_disconnected(ctrl->pcie->port))
> - return false;
> -
> - pdev = pci_get_slot(ctrl->pcie->port->subordinate, PCI_DEVFN(0, 0));
> - if (!pdev)
> - return true;
> -
> - if (pci_read_config_dword(pdev, PCI_VENDOR_ID, &reg) ||
> - reg != (pdev->vendor | (pdev->device << 16)) ||
> - pci_read_config_dword(pdev, PCI_CLASS_REVISION, &reg) ||
> - reg != (pdev->revision | (pdev->class << 8)))
> - return true;
> -
> - if (pdev->hdr_type == PCI_HEADER_TYPE_NORMAL &&
> - (pci_read_config_dword(pdev, PCI_SUBSYSTEM_VENDOR_ID, &reg) ||
> - reg != (pdev->subsystem_vendor | (pdev->subsystem_device << 16))))
> - return true;
> -
> - if (pci_get_dsn(pdev) != ctrl->dsn)
> - return true;
> -
> - return false;
> -}
> -
> static int pciehp_resume_noirq(struct pcie_device *dev)
> {
> struct controller *ctrl = get_service_data(dev);
> diff --git a/drivers/pci/hotplug/pciehp_hpc.c b/drivers/pci/hotplug/pciehp_hpc.c
> index 8a09fb6..388fbed 100644
> --- a/drivers/pci/hotplug/pciehp_hpc.c
> +++ b/drivers/pci/hotplug/pciehp_hpc.c
> @@ -563,18 +563,48 @@ void pciehp_power_off_slot(struct controller *ctrl)
> PCI_EXP_SLTCTL_PWR_OFF);
> }
>
> +bool pciehp_device_replaced(struct controller *ctrl)
> +{
> + struct pci_dev *pdev __free(pci_dev_put) = NULL;
> + u32 reg;
> +
> + if (pci_dev_is_disconnected(ctrl->pcie->port))
> + return false;
> +
> + pdev = pci_get_slot(ctrl->pcie->port->subordinate, PCI_DEVFN(0, 0));
> + if (!pdev)
> + return true;
> +
> + if (pci_read_config_dword(pdev, PCI_VENDOR_ID, &reg) ||
> + reg != (pdev->vendor | (pdev->device << 16)) ||
> + pci_read_config_dword(pdev, PCI_CLASS_REVISION, &reg) ||
> + reg != (pdev->revision | (pdev->class << 8)))
> + return true;
> +
> + if (pdev->hdr_type == PCI_HEADER_TYPE_NORMAL &&
> + (pci_read_config_dword(pdev, PCI_SUBSYSTEM_VENDOR_ID, &reg) ||
> + reg != (pdev->subsystem_vendor | (pdev->subsystem_device << 16))))
> + return true;
> +
> + if (pci_get_dsn(pdev) != ctrl->dsn)
> + return true;
> +
> + return false;
> +}
> +
> static void pciehp_ignore_dpc_link_change(struct controller *ctrl,
> - struct pci_dev *pdev, int irq)
> + struct pci_dev *pdev, int irq,
> + u16 ignored_events)
> {
> /*
> * Ignore link changes which occurred while waiting for DPC recovery.
> * Could be several if DPC triggered multiple times consecutively.
> */
> synchronize_hardirq(irq);
> - atomic_and(~PCI_EXP_SLTSTA_DLLSC, &ctrl->pending_events);
> + atomic_and(~ignored_events, &ctrl->pending_events);
> if (pciehp_poll_mode)
> pcie_capability_write_word(pdev, PCI_EXP_SLTSTA,
> - PCI_EXP_SLTSTA_DLLSC);
> + ignored_events);
> ctrl_info(ctrl, "Slot(%s): Link Down/Up ignored (recovered by DPC)\n",
> slot_name(ctrl));
>
> @@ -584,8 +614,8 @@ static void pciehp_ignore_dpc_link_change(struct controller *ctrl,
> * Synthesize it to ensure that it is acted on.
> */
> down_read_nested(&ctrl->reset_lock, ctrl->depth);
> - if (!pciehp_check_link_active(ctrl))
> - pciehp_request(ctrl, PCI_EXP_SLTSTA_DLLSC);
> + if (!pciehp_check_link_active(ctrl) || pciehp_device_replaced(ctrl))
> + pciehp_request(ctrl, ignored_events);
> up_read(&ctrl->reset_lock);
> }
>
> @@ -736,8 +766,13 @@ static irqreturn_t pciehp_ist(int irq, void *dev_id)
> */
> if ((events & PCI_EXP_SLTSTA_DLLSC) && pci_dpc_recovered(pdev) &&
> ctrl->state == ON_STATE) {
> - events &= ~PCI_EXP_SLTSTA_DLLSC;
> - pciehp_ignore_dpc_link_change(ctrl, pdev, irq);
> + u16 ignored_events = PCI_EXP_SLTSTA_DLLSC;
> +
> + if (!ctrl->inband_presence_disabled)
> + ignored_events |= events & PCI_EXP_SLTSTA_PDC;
> +
> + events &= ~ignored_events;
> + pciehp_ignore_dpc_link_change(ctrl, pdev, irq, ignored_events);
> }
>
> /*
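
To spell out the mask handling in the pciehp_ist() hunk above, here is a small standalone sketch of how the set of ignored events is built. The function name dpc_ignored_events() is hypothetical; the two bit values are the ones pci_regs.h defines for PCI_EXP_SLTSTA_PDC and PCI_EXP_SLTSTA_DLLSC. It only models the mask arithmetic, not the IRQ thread or the later device replacement check.

```c
#include <stdbool.h>
#include <stdint.h>

/* Slot Status bits, same values as PCI_EXP_SLTSTA_PDC / PCI_EXP_SLTSTA_DLLSC. */
#define SLTSTA_PDC	0x0008	/* Presence Detect Changed */
#define SLTSTA_DLLSC	0x0100	/* Data Link Layer State Changed */

/*
 * Hypothetical helper: given the pending events after a DPC recovery,
 * return the events to ignore. DLLSC is always ignored. PDC is ignored
 * only if it is actually pending and in-band presence detect has not
 * been disabled, because only then can it be a side effect of DPC.
 */
static uint16_t dpc_ignored_events(uint16_t events,
				   bool inband_presence_disabled)
{
	uint16_t ignored = SLTSTA_DLLSC;

	if (!inband_presence_disabled)
		ignored |= events & SLTSTA_PDC;

	return ignored;
}
```

The caller would then clear the returned bits with events &= ~ignored, as the patch does. Masking with events & SLTSTA_PDC rather than OR-ing in the bit unconditionally means a PDC that never fired is not synthesized later by pciehp_request().
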
--
Sathyanarayanan Kuppuswamy
Linux Kernel Developer