* [Bug 218795] USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors)
2024-04-30 11:41 [Bug 218795] New: USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors) bugzilla-daemon
@ 2024-11-05 17:17 ` bugzilla-daemon
2024-11-05 17:36 ` bugzilla-daemon
` (7 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon @ 2024-11-05 17:17 UTC (permalink / raw)
To: linux-usb
https://bugzilla.kernel.org/show_bug.cgi?id=218795
Eduard Kachur (glite60@gmail.com) changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |glite60@gmail.com
--- Comment #1 from Eduard Kachur (glite60@gmail.com) ---
Created attachment 307144
--> https://bugzilla.kernel.org/attachment.cgi?id=307144&action=edit
tbtrace dump during connection and crash
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 10+ messages in thread* [Bug 218795] USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors)
2024-04-30 11:41 [Bug 218795] New: USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors) bugzilla-daemon
2024-11-05 17:17 ` [Bug 218795] " bugzilla-daemon
@ 2024-11-05 17:36 ` bugzilla-daemon
2024-11-06 10:29 ` bugzilla-daemon
` (6 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon @ 2024-11-05 17:36 UTC (permalink / raw)
To: linux-usb
https://bugzilla.kernel.org/show_bug.cgi?id=218795
--- Comment #2 from Eduard Kachur (glite60@gmail.com) ---
Created attachment 307145
--> https://bugzilla.kernel.org/attachment.cgi?id=307145&action=edit
trace log with thunderbolt and pci
I have similar case with eGPU and VFIO passtrough into Windows VM, which
crashes.
Laptop specs
HP Zbook Firefly G10 A
Ryzen 7 7840 HS
Wikingoo Q1L box with JHL6340, also bought and tried Wikingoo P1-60W-M with
JHL7440 told by manufacturer, but lspci names it JHL7540.
Nvidia Quadro P1000
Ubuntu 24.10 Kernel 6.11
System gives lots of:
[ 6323.581954] pcieport 0000:00:04.1: AER: Correctable error message received
from 0000:64:01.0
[ 6323.581966] pcieport 0000:64:01.0: PCIe Bus Error: severity=Correctable,
type=Data Link Layer, (Receiver ID)
[ 6323.581969] pcieport 0000:64:01.0: device [8086:15da] error
status/mask=00000080/00002000
[ 6323.581973] pcieport 0000:64:01.0: [ 7] BadDLLP
And eventually crashes VM with:
[ 6360.466620] pcieport 0000:00:04.1: AER: Multiple Uncorrectable (Non-Fatal)
error message received from 0000:65:00.0
[ 6360.466648] vfio-pci 0000:65:00.0: PCIe Bus Error: severity=Uncorrectable
(Non-Fatal), type=Transaction Layer, (Requester ID)
[ 6360.466652] vfio-pci 0000:65:00.0: device [10de:1cb1] error
status/mask=00004000/00000000
[ 6360.466655] vfio-pci 0000:65:00.0: [14] CmpltTO (First)
Box with newer JHL7440 doesn't have so many BadDLLP errors, but also crashes
with CmpltTO.
Without passtrough and Nvidia driver on host system there are still lots of
BadDLLP errors, but I haven't seen a crash.
I tried pcie_aspm=off with those boxes, but they are not initialized in that
case with hotplug and in coldboot case, Intel based system has same behaviour.
pcie_aspm=force causes some additional errors on PCIe bus.
Possible workaround for me to get a stable system with passtrough is to use
pci=nommconf, but this causes graphical glitches on host GPU in 3D rendering
case.
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 10+ messages in thread* [Bug 218795] USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors)
2024-04-30 11:41 [Bug 218795] New: USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors) bugzilla-daemon
2024-11-05 17:17 ` [Bug 218795] " bugzilla-daemon
2024-11-05 17:36 ` bugzilla-daemon
@ 2024-11-06 10:29 ` bugzilla-daemon
2024-11-07 17:30 ` bugzilla-daemon
` (5 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon @ 2024-11-06 10:29 UTC (permalink / raw)
To: linux-usb
https://bugzilla.kernel.org/show_bug.cgi?id=218795
--- Comment #3 from Eduard Kachur (glite60@gmail.com) ---
I guess person here is in the same boat:
https://askubuntu.com/questions/1531087/pcie-bus-error-thunderbolt-4-bridge
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 10+ messages in thread* [Bug 218795] USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors)
2024-04-30 11:41 [Bug 218795] New: USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors) bugzilla-daemon
` (2 preceding siblings ...)
2024-11-06 10:29 ` bugzilla-daemon
@ 2024-11-07 17:30 ` bugzilla-daemon
2024-11-07 17:45 ` bugzilla-daemon
` (4 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon @ 2024-11-07 17:30 UTC (permalink / raw)
To: linux-usb
https://bugzilla.kernel.org/show_bug.cgi?id=218795
--- Comment #4 from Mario Limonciello (AMD) (mario.limonciello@amd.com) ---
> pci 0000:35:00.0: 2.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s
> PCIe x1 link at 0000:00:04.1 (capable of 31.504 Gb/s with 8.0 GT/s PCIe x4
> link)
It's worth mentioning that this message is meaningless in the context of USB4.
There were various discussions on the mailing lists about changing this, but it
never landed anywhere.
https://lore.kernel.org/linux-usb/20231103190758.82911-1-mario.limonciello@amd.com/
See specifically patch 8 for more context and the specs that indicate why it
behaves this way.
At least with AMD dGPUs put in eGPU enclosures this was causing problems for
amdgpu because if used pcie_bandwidth_available(). We've changed this in
amdgpu to look at the link partner to exclude this causing issues.
https://github.com/torvalds/linux/blob/v6.12-rc5/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c#L5903
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 10+ messages in thread* [Bug 218795] USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors)
2024-04-30 11:41 [Bug 218795] New: USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors) bugzilla-daemon
` (3 preceding siblings ...)
2024-11-07 17:30 ` bugzilla-daemon
@ 2024-11-07 17:45 ` bugzilla-daemon
2024-11-08 8:43 ` bugzilla-daemon
` (3 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon @ 2024-11-07 17:45 UTC (permalink / raw)
To: linux-usb
https://bugzilla.kernel.org/show_bug.cgi?id=218795
--- Comment #5 from Eduard Kachur (glite60@gmail.com) ---
Just in case I also ordered ADT-UT3G and will keep you in touch when I will be
able to verify errors and crashes with it.
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 10+ messages in thread* [Bug 218795] USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors)
2024-04-30 11:41 [Bug 218795] New: USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors) bugzilla-daemon
` (4 preceding siblings ...)
2024-11-07 17:45 ` bugzilla-daemon
@ 2024-11-08 8:43 ` bugzilla-daemon
2024-11-08 15:41 ` bugzilla-daemon
` (2 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon @ 2024-11-08 8:43 UTC (permalink / raw)
To: linux-usb
https://bugzilla.kernel.org/show_bug.cgi?id=218795
--- Comment #6 from Eduard Kachur (glite60@gmail.com) ---
Anyway, is there anything that can be done for PCIe errors except pci=nommconf?
GPU driver inside VM seems to be crashing periodically.
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 10+ messages in thread* [Bug 218795] USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors)
2024-04-30 11:41 [Bug 218795] New: USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors) bugzilla-daemon
` (5 preceding siblings ...)
2024-11-08 8:43 ` bugzilla-daemon
@ 2024-11-08 15:41 ` bugzilla-daemon
2024-11-08 19:19 ` bugzilla-daemon
2024-11-20 11:11 ` bugzilla-daemon
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon @ 2024-11-08 15:41 UTC (permalink / raw)
To: linux-usb
https://bugzilla.kernel.org/show_bug.cgi?id=218795
--- Comment #7 from Mario Limonciello (AMD) (mario.limonciello@amd.com) ---
> Anyway, is there anything that can be done for PCIe errors except
> pci=nommconf?
If you want to ignore the errors you can use "pci=noaer".
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 10+ messages in thread* [Bug 218795] USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors)
2024-04-30 11:41 [Bug 218795] New: USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors) bugzilla-daemon
` (6 preceding siblings ...)
2024-11-08 15:41 ` bugzilla-daemon
@ 2024-11-08 19:19 ` bugzilla-daemon
2024-11-20 11:11 ` bugzilla-daemon
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon @ 2024-11-08 19:19 UTC (permalink / raw)
To: linux-usb
https://bugzilla.kernel.org/show_bug.cgi?id=218795
--- Comment #8 from Eduard Kachur (glite60@gmail.com) ---
(In reply to Mario Limonciello (AMD) from comment #7)
> If you want to ignore the errors you can use "pci=noaer".
Previously it didn't work with the newer box, VM was silently crashing without
any PCIe errors in console (which is expected), so I didn't bother to try it
with older one, but surprisingly it works well here.
Thanks!
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 10+ messages in thread* [Bug 218795] USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors)
2024-04-30 11:41 [Bug 218795] New: USB4 / Thunderbolt + AMD: unstable and slow link (many uncorrectable errors) bugzilla-daemon
` (7 preceding siblings ...)
2024-11-08 19:19 ` bugzilla-daemon
@ 2024-11-20 11:11 ` bugzilla-daemon
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon @ 2024-11-20 11:11 UTC (permalink / raw)
To: linux-usb
https://bugzilla.kernel.org/show_bug.cgi?id=218795
--- Comment #9 from Eduard Kachur (glite60@gmail.com) ---
So, I've got ADT-UT3G, no errors, no crashes. Is there anything I can help with
debugging older boxes?
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 10+ messages in thread