Linux PCI subsystem development
 help / color / mirror / Atom feed
* ixgbe probe failure on Proxmox 8
@ 2024-02-13 23:52 Bjorn Helgaas
  2024-02-14 21:35 ` Brandeburg, Jesse
  0 siblings, 1 reply; 2+ messages in thread
From: Bjorn Helgaas @ 2024-02-13 23:52 UTC (permalink / raw)
  To: Jesse Brandeburg, Tony Nguyen
  Cc: Wang, Qingshun, intel-wired-lan, netdev, linux-pci

Just a heads-up about an ixgbe probe failure seen with Proxmox 8.  I
suspect this is a PCI core problem, probably not an ixgbe problem.

The ixgbe device logs an Advisory Non-Fatal Error and it seems like
subsequent reads from the device return ~0:

  pcieport 0000:00:03.1: AER: Corrected error received: 0000:05:00.0
  pci 0000:05:00.0: PCIe Bus Error: severity=Corrected, type=Transaction Layer, (Receiver ID)
  pci 0000:05:00.0:   device [8086:1563] error status/mask=00002000/00000000
  pci 0000:05:00.0:    [13] NonFatalErr

  ixgbe 0000:05:00.0: enabling device (0000 -> 0002)
  ixgbe 0000:05:00.0: Adapter removed

The user report is at
https://forum.proxmox.com/threads/proxmox-8-kernel-6-2-16-4-pve-ixgbe-driver-fails-to-load-due-to-pci-device-probing-failure.131203/post-633851. 

I opened a bugzilla with complete dmesg log at
https://bugzilla.kernel.org/show_bug.cgi?id=218491 with some
speculation about what might have caused this, e.g., an ACS
configuration error or something.  It's lame, I know, so this is just
a shot in the dark.

Bjorn

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: ixgbe probe failure on Proxmox 8
  2024-02-13 23:52 ixgbe probe failure on Proxmox 8 Bjorn Helgaas
@ 2024-02-14 21:35 ` Brandeburg, Jesse
  0 siblings, 0 replies; 2+ messages in thread
From: Brandeburg, Jesse @ 2024-02-14 21:35 UTC (permalink / raw)
  To: Bjorn Helgaas
  Cc: Nguyen, Anthony L, Wang, Qingshun,
	intel-wired-lan@lists.osuosl.org, netdev@vger.kernel.org,
	linux-pci@vger.kernel.org


--
Jesse Brandeburg


> On Feb 13, 2024, at 3:52 PM, Bjorn Helgaas <helgaas@kernel.org> wrote:
> 
> Just a heads-up about an ixgbe probe failure seen with Proxmox 8.  I
> suspect this is a PCI core problem, probably not an ixgbe problem.
> 
> The ixgbe device logs an Advisory Non-Fatal Error and it seems like
> subsequent reads from the device return ~0:
> 
>  pcieport 0000:00:03.1: AER: Corrected error received: 0000:05:00.0

Why does the user or bios configure corrected errors as fatal or requiring a reset? 

Seems like a self inflicted wound. 

>  pci 0000:05:00.0: PCIe Bus Error: severity=Corrected, type=Transaction Layer, (Receiver ID)
>  pci 0000:05:00.0:   device [8086:1563] error status/mask=00002000/00000000
>  pci 0000:05:00.0:    [13] NonFatalErr
> 
>  ixgbe 0000:05:00.0: enabling device (0000 -> 0002)
>  ixgbe 0000:05:00.0: Adapter removed
> 
> The user report is at
> https://forum.proxmox.com/threads/proxmox-8-kernel-6-2-16-4-pve-ixgbe-driver-fails-to-load-due-to-pci-device-probing-failure.131203/post-633851.
> 
> I opened a bugzilla with complete dmesg log at
> https://bugzilla.kernel.org/show_bug.cgi?id=218491 with some
> speculation about what might have caused this, e.g., an ACS
> configuration error or something.  It's lame, I know, so this is just
> a shot in the dark.

I’ll look a little more at this tomorrow. I remember lots of inconsistent behaviors around this stuff with early ixgbe and when this PCI capabilities was first enabled. Most of these issues have been resolved since and I haven’t heard of one for a long time. 


> 
> Bjorn
> 

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2024-02-14 21:35 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-02-13 23:52 ixgbe probe failure on Proxmox 8 Bjorn Helgaas
2024-02-14 21:35 ` Brandeburg, Jesse

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox