* [REGRESSION 00/04] Crash during resume of pcie bridge
@ 2025-10-06 12:09 Bert Karwatzki
From: Bert Karwatzki @ 2025-10-06 12:09 UTC
To: linux-kernel
Cc: Bert Karwatzki, linux-next, linux-stable, regressions, linux-pci,
linux-acpi, Mario Limonciello, Christian König,
Rafael J . Wysocki
Since Linux v6.15 I have been experiencing random crashes on my MSI Alpha 15 laptop
running Debian trixie (amd64). The first such crash happened around mid-June,
and as there were no useful log messages, not even via netconsole, I suspected
faulty hardware. So I ran memtest86+, found a faulty address line and replaced
the memory (unfortunately going from 64G down to 16G). But the crashes occurred
again, so I did a thorough investigation.
The crashes occur after 30min to 33h (yes, hours) of uptime and consist of a
sudden reboot after which the PCI bridge at 00:02.4 and the nvme device
connected to it are missing. If there's sound running during the crash then the
first sign of the crash is the sound looping like a broken record for about 2s,
after which the reboot happens. With the missing nvme device the reboot drops to
a rescue shell. Using "shutdown -h now" from that shell and starting the laptop
with the power button restores the missing PCI bridge and nvme device.
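Whether the bridge and the nvme device behind it are still enumerated can be checked from sysfs. The following is only a minimal sketch and not part of the original investigation: the helper name pci_present and its optional second argument are made up for illustration, and the addresses in the usage note are taken from the lspci listing in this mail.

```shell
#!/bin/bash
# Report whether a PCI function is currently enumerated.
#   $1 = PCI address including domain, e.g. 0000:00:02.4
#   $2 = directory holding the PCI device entries (hypothetical override,
#        defaults to the real sysfs location)
pci_present() {
    local addr=$1
    local root=${2:-/sys/bus/pci/devices}
    if [ -e "$root/$addr" ]; then
        echo "$addr present"
    else
        echo "$addr MISSING"
        return 1
    fi
}
```

On this laptop, "pci_present 0000:00:02.4" and "pci_present 0000:07:00.0" would be the two checks of interest in the rescue shell after a crash.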
The hardware is the following (it's a dual GPU laptop where the GUI
runs on the built-in GPU):
$ cat /proc/cpuinfo
processor : 0
vendor_id : AuthenticAMD
cpu family : 25
model : 80
model name : AMD Ryzen 7 5800H with Radeon Graphics
stepping : 0
microcode : 0xa50000c
cpu MHz : 3394.238
cache size : 512 KB
physical id : 0
siblings : 16
core id : 0
cpu cores : 8
apicid : 0
initial apicid : 0
fpu : yes
fpu_exception : yes
cpuid level : 16
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba ibrs ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd cppc arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl umip pku ospke vaes vpclmulqdq rdpid overflow_recov succor smca fsrm debug_swap
bugs : sysret_ss_attrs spectre_v1 spectre_v2 spec_store_bypass srso ibpb_no_ret
bogomips : 6388.57
TLB size : 2560 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 48 bits physical, 48 bits virtual
power management: ts ttp tm hwpstate cpb eff_freq_ro [13] [14]
$ lspci -nn
00:00.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne Root Complex [1022:1630]
00:00.2 IOMMU [0806]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne IOMMU [1022:1631]
00:01.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge [1022:1632]
00:01.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe GPP Bridge [1022:1633]
00:02.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge [1022:1632]
00:02.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge [1022:1634]
00:02.2 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge [1022:1634]
00:02.3 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge [1022:1634]
00:02.4 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge [1022:1634]
00:08.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge [1022:1632]
00:08.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir Internal PCIe GPP Bridge to Bus [1022:1635]
00:14.0 SMBus [0c05]: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller [1022:790b] (rev 51)
00:14.3 ISA bridge [0601]: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge [1022:790e] (rev 51)
00:18.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 0 [1022:166a]
00:18.1 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 1 [1022:166b]
00:18.2 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 2 [1022:166c]
00:18.3 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 3 [1022:166d]
00:18.4 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 4 [1022:166e]
00:18.5 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 5 [1022:166f]
00:18.6 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 6 [1022:1670]
00:18.7 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 7 [1022:1671]
01:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch [1002:1478] (rev c3)
02:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch [1002:1479]
03:00.0 Display controller [0380]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] [1002:73ff] (rev c3)
03:00.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 21/23 HDMI/DP Audio Controller [1002:ab28]
04:00.0 Network controller [0280]: MEDIATEK Corp. MT7921K (RZ608) Wi-Fi 6E 80MHz [14c3:0608]
05:00.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet Controller [10ec:8168] (rev 15)
06:00.0 Non-Volatile memory controller [0108]: Kingston Technology Company, Inc. KC3000/FURY Renegade NVMe SSD [E18] [2646:5013] (rev 01)
07:00.0 Non-Volatile memory controller [0108]: Micron/Crucial Technology P1 NVMe PCIe SSD[Frampton] [c0a9:2263] (rev 03)
08:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Cezanne [Radeon Vega Series / Radeon Vega Mobile Series] [1002:1638] (rev c5)
08:00.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Renoir Radeon High Definition Audio Controller [1002:1637]
08:00.2 Encryption controller [1080]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor [1022:15df]
08:00.3 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1 [1022:1639]
08:00.4 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1 [1022:1639]
08:00.5 Multimedia controller [0480]: Advanced Micro Devices, Inc. [AMD] Audio Coprocessor [1022:15e2] (rev 01)
08:00.6 Audio device [0403]: Advanced Micro Devices, Inc. [AMD] Family 17h/19h/1ah HD Audio Controller [1022:15e3]
08:00.7 Signal processing controller [1180]: Advanced Micro Devices, Inc. [AMD] Sensor Fusion Hub [1022:15e4]
These devices are attached to the PCI bus like this:
$ lspci -t
-[0000:00]-+-00.0
+-00.2
+-01.0
+-01.1-[01-03]----00.0-[02-03]----00.0-[03]--+-00.0 // This is the bridge which causes the crash
| \-00.1
+-02.0
+-02.1-[04]----00.0
+-02.2-[05]----00.0
+-02.3-[06]----00.0
+-02.4-[07]----00.0 // These are the bridge and nvme device which disappear after the crash.
+-08.0
+-08.1-[08]--+-00.0
| +-00.1
| +-00.2
| +-00.3
| +-00.4
| +-00.5
| +-00.6
| \-00.7
+-14.0
+-14.3
+-18.0
+-18.1
+-18.2
+-18.3
+-18.4
+-18.5
+-18.6
\-18.7
I tried to bisect this between v6.14 and v6.15, but because the time it takes to
trigger the bug varies so wildly, the bisections were not successful. Nevertheless
they gave lots of data about affected and non-affected versions of the Linux kernel,
and it's quite likely that v6.14 is indeed free of the bug.
Here's an almost complete list of tested versions:
(Somewhat) sorted (by kernel version, 6.14.0-rc* kernels are from attempted bisections
between v6.14 and v6.15)
v6.14.0 no crash after 16h
v6.14.11 no crash after 7.5h
6.14.0-rc1-bisect-00003-g541ddf31e300 booted 12:24, 22.8.2025, no crash after {48h, 17h}
6.14.0-rc1-mystery-00134-gcc28c0e5e725 booted 11:42, 5.8.2025, no crash after 10.5h
6.14.0-rc1-mystery-00198-gd7f6f07ecec9 booted 22:27, 5.8.2025, no crash after 12h
6.14.0-rc4-mystery-01022-gab498828fad7 booted 21:04, 3.8.2025, no crash after {14h, 24h}
6.14.0-rc4-mystery-01427-g7547510d4a91 booted 11:11, 4.8.2025, no crash after {13h, 23h}
6.14.0-rc6-mystery-01641-g0f04462874e1 booted 00:26, 5.8.2025, no crash after {11h, 24h}
6.14.0-mystery-00826-g327ecdbc0fda no crash after {16h, 17h, 6.5h}
############## here the crashes start (time to each crash, crashes do not always occur) ########
6.14.0-bisect-01053-gebfb94d87b35 booted 10:15, 20.8.2025 crash after ~33h
6.14.0-mystery-09584-g7d06015d936c crash 20.44 3.8.2025 after 7h
6.14.0-mystery-11703-geb0ece16027f crash 13.22 3.8.2025 after 1.75h
6.15.0 crashed around 15-17.6.2025, unknown uptime (This is the first crash!)
6.15.0-nort crash after 6.75h
6.16-rc4 (next-20250627) crash after ~4h
6.16-rc4 (next-20250630) crash after ~5h
6.16-rc4 (next-20250703) crash after ~2.5h (sound buffer repeated for ~1s before restarting)
6.16-rc6 (next-20250718) crash after {2h, 2h}
6.16-rc7 (next-20250721) crash after {~30min, 2h, 5.5h}
6.16.0-nortlockdep crash after 4h
6.17.0-rc4-next-20250902-master booted 8:36, 3.9.2025, crash after ~3.5h
6.17.0-rc5-next-20250908-master booted 10:25, 9.9.2025, crash after {~6.5h, 14h}
6.17.0-rc6-next-20250917-acpidebug booted 12:41, 20.9.2025, crash 15:22 20.9.2025 (~3h, 647 GPP notifies)
The versions below contain additional debugging printk()s and dev_info()s.
The details of these debugging statements are explained below.
6.17.0-rc6-next-20250917-gpudebug-00018-g7a38b625a003 booted 12:58, 26.9.2025, crash 12:01, 27.9.2025 (~23h, 1500 GPP notifies)
6.17.0-rc6-next-20250917-gpudebug-00021-gab98d880e3c8 booted 23:52, 28.9.2025, crash 2:25, 30.9.2025 (26.5h, 1504 GPP0, 889 GPP2)
6.17.0-rc6-next-20250917-gpudebug-00024-g5c6b49b810db booted 9:10, 2.10.2025, 60h 3093 GPP0 notifies without crash (too many printk()s?)
6.17.0-rc6-next-20250917-gpudebug-00028-gf99cf81b1da7 booted 21:21, 4.10.2025 first try stopped after 77min due to hung tasks
6.17.0-rc6-next-20250917-gpudebug-00028-gf99cf81b1da7 booted 23:37, 4.10.2025 crash 4:52, 6.10.2025 (~27.5h)
6.17.0-rc6-next-20250917-gpudebug-00029-ge797f42363d1 booted 13:00, 6.10.2025 currently testing
As the bisections were not successful I tried to monitor the crash using
netconsole with CONFIG_ACPI_DEBUG enabled and "acpi.debug_layer=0xf acpi.debug_level=0x107"
as command line parameters. With this the last message on netconsole before
the crash is usually:
[21465.639279] [ T251] evmisc-0132 ev_queue_notify_reques: Dispatching Notify on [GPP0] (Device) Value 0x00 (Bus Check) Node 00000000f81f36b8
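Notify counts like the ones quoted in the version list can be obtained from a saved netconsole log by counting lines of this form. This is a minimal sketch under the assumption that the netconsole output was written to a file; the function name count_notifies and the log file name in the usage note are hypothetical.

```shell
#!/bin/bash
# Count "Dispatching Notify" messages for one ACPI object in a saved log.
#   $1 = log file containing the netconsole output
#   $2 = ACPI object name as printed in the message, e.g. GPP0
count_notifies() {
    local log=$1 name=$2
    # -F treats the brackets literally, -c prints the number of matching lines
    grep -c -F "Dispatching Notify on [$name]" "$log"
}
```

For example, "count_notifies netconsole.log GPP0" prints the number of GPP0 notifies seen so far.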
GPP0 is the ACPI name of this PCI bridge (at least that's my best guess):
00:01.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe GPP Bridge [1022:1633]
to which the discrete GPU is connected
03:00.0 Display controller [0380]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] [1002:73ff] (rev c3)
via the pci express switch
01:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch [1002:1478] (rev c3)
02:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch [1002:1479]
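The guess that GPP0 corresponds to 00:01.1 could be cross-checked through the ACPI sysfs attributes "path" and "physical_node". The following is a rough sketch only: the function name acpi_to_physical and its optional directory argument are invented for illustration, and it assumes the ACPI path of the bridge ends in ".GPP0".

```shell
#!/bin/bash
# Print the physical device bound to every ACPI object whose path ends
# in the given name.
#   $1 = ACPI object name, e.g. GPP0
#   $2 = directory holding the ACPI device entries (hypothetical override,
#        defaults to the real sysfs location)
acpi_to_physical() {
    local name=$1
    local root=${2:-/sys/bus/acpi/devices}
    local d
    for d in "$root"/*; do
        [ -r "$d/path" ] || continue
        case "$(cat "$d/path")" in
            *."$name")
                # physical_node is a symlink to the bound device, if any
                if [ -e "$d/physical_node" ]; then
                    basename "$(readlink -f "$d/physical_node")"
                fi
                ;;
        esac
    done
}
```

If the guess is right, "acpi_to_physical GPP0" should print 0000:00:01.1 on this machine.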
While the GUI (Xfce on Xorg) on my laptop runs on the built-in GPU, the discrete
GPU wakes up quite often, e.g. when a window is opened or when scrolling down on YouTube.
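How often the discrete GPU actually wakes up can be observed through its runtime PM state in sysfs. This is a minimal sketch, not something used in the original report: the function names and the polling interval are assumptions, and 0000:03:00.0 in the usage note is the address of the Navi 23 GPU from the lspci listing.

```shell
#!/bin/bash
# Print the runtime PM state (e.g. "active" or "suspended") of a PCI device.
#   $1 = PCI address including domain
#   $2 = directory holding the PCI device entries (hypothetical override)
runtime_status() {
    local addr=$1
    local root=${2:-/sys/bus/pci/devices}
    cat "$root/$addr/power/runtime_status"
}

# Poll the state and log every change with a timestamp.
#   $1 = PCI address, $2 = polling interval in seconds (default 1)
watch_runtime_status() {
    local addr=$1 interval=${2:-1} last="" cur
    while sleep "$interval"; do
        cur=$(runtime_status "$addr") || break
        if [ "$cur" != "$last" ]; then
            echo "$(date +%T) $addr $cur"
            last=$cur
        fi
    done
}
```

Running "watch_runtime_status 0000:03:00.0" in a terminal while opening windows would then show each suspend and resume of the discrete GPU. Polling once per second can miss very short wake-ups.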
A somewhat reliable method to generate GPP0 notifies is putting on a YouTube
video and then periodically starting evolution with this script:
#!/bin/bash
# Start evolution once a minute and kill it again after 5 seconds.
for i in {0..1000}
do
    echo "$i"
    evolution &
    sleep 5
    killall evolution
    sleep 55
done
This is also the method I used to test the debug kernel in the following mails.
Bert Karwatzki
* [REGRESSION 01/04] Crash during resume of pcie bridge
@ 2025-10-06 12:09 Bert Karwatzki
From: Bert Karwatzki @ 2025-10-06 12:09 UTC
To: linux-kernel
Cc: Bert Karwatzki, linux-next, linux-stable, regressions, linux-pci,
linux-acpi, Mario Limonciello, Christian König,
Rafael J . Wysocki
To further debug the issue I inserted calls to dev_info() and printk() into
the amdgpu suspend/resume code, and the acpi and pcie hotplug resume code.
This is the patch used in kernel version
6.17.0-rc6-next-20250917-gpudebug-00021-gab98d880e3c8 (see list in previous mail)
(on top of next-20250917)
diff --git a/drivers/acpi/bus.c b/drivers/acpi/bus.c
index a984ccd4a2a0..bc365c0dbe2f 100644
--- a/drivers/acpi/bus.c
+++ b/drivers/acpi/bus.c
@@ -514,46 +514,60 @@ static void acpi_bus_notify(acpi_handle handle, u32 type, void *data)
 	switch (type) {
 	case ACPI_NOTIFY_BUS_CHECK:
+		printk(KERN_INFO "%s %d: ACPI_NOTIFY_BUS_CHECK\n", __func__, __LINE__);
 		acpi_handle_debug(handle, "ACPI_NOTIFY_BUS_CHECK event\n");
 		break;
 	case ACPI_NOTIFY_DEVICE_CHECK:
+		printk(KERN_INFO "%s %d: ACPI_NOTIFY_DEVICE_CHECK\n", __func__, __LINE__);
 		acpi_handle_debug(handle, "ACPI_NOTIFY_DEVICE_CHECK event\n");
 		break;
 	case ACPI_NOTIFY_DEVICE_WAKE:
+		printk(KERN_INFO "%s %d: ACPI_NOTIFY_DEVICE_WAKE\n", __func__, __LINE__);
 		acpi_handle_debug(handle, "ACPI_NOTIFY_DEVICE_WAKE event\n");
 		return;
 	case ACPI_NOTIFY_EJECT_REQUEST:
+		printk(KERN_INFO "%s %d: ACPI_NOTIFY_EJECT_REQUEST\n", __func__, __LINE__);
 		acpi_handle_debug(handle, "ACPI_NOTIFY_EJECT_REQUEST event\n");
 		break;
 	case ACPI_NOTIFY_DEVICE_CHECK_LIGHT:
+		printk(KERN_INFO "%s %d: ACPI_NOTIFY_DEVICE_CHECK_LIGHT\n", __func__, __LINE__);
 		acpi_handle_debug(handle,
 				  "ACPI_NOTIFY_DEVICE_CHECK_LIGHT event\n");
 		/* TBD: Exactly what does 'light' mean? */
 		return;
 	case ACPI_NOTIFY_FREQUENCY_MISMATCH:
+		printk(KERN_INFO "%s %d: ACPI_NOTIFY_FREQUENCY_MISMATCH\n", __func__, __LINE__);
 		acpi_handle_err(handle, "Device cannot be configured due "
 				"to a frequency mismatch\n");
 		return;
 	case ACPI_NOTIFY_BUS_MODE_MISMATCH:
+		printk(KERN_INFO "%s %d: ACPI_NOTIFY_BUS_MODE_MISMATCH\n", __func__, __LINE__);
 		acpi_handle_err(handle, "Device cannot be configured due "
 				"to a bus mode mismatch\n");
 		return;
 	case ACPI_NOTIFY_POWER_FAULT:
+		printk(KERN_INFO "%s %d: ACPI_NOTIFY_POWER_FAULT\n", __func__, __LINE__);
 		acpi_handle_err(handle, "Device has suffered a power fault\n");
 		return;
 	default:
+		printk(KERN_INFO "%s %d: acpi unknown event type\n", __func__, __LINE__);
 		acpi_handle_debug(handle, "Unknown event type 0x%x\n", type);
 		return;
 	}
 	adev = acpi_get_acpi_dev(handle);
+	if (adev)
+		dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
+	else
+		printk(KERN_INFO "%s %d: adev = NULL\n", __func__, __LINE__);
+
 	if (adev && ACPI_SUCCESS(acpi_hotplug_schedule(adev, type)))
 		return;
diff --git a/drivers/acpi/device_pm.c b/drivers/acpi/device_pm.c
index 4e0583274b8f..9a7dc432b50d 100644
--- a/drivers/acpi/device_pm.c
+++ b/drivers/acpi/device_pm.c
@@ -539,6 +539,7 @@ static void acpi_pm_notify_handler(acpi_handle handle, u32 val, void *not_used)
 	if (!adev)
 		return;
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	mutex_lock(&acpi_pm_notifier_lock);
 	if (adev->wakeup.flags.notifier_present) {
diff --git a/drivers/acpi/osl.c b/drivers/acpi/osl.c
index 5ff343096ece..0f6a16856119 100644
--- a/drivers/acpi/osl.c
+++ b/drivers/acpi/osl.c
@@ -1167,6 +1167,7 @@ void acpi_os_wait_events_complete(void)
 	 * Make sure the GPE handler or the fixed event handler is not used
 	 * on another CPU after removal.
 	 */
+	printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 	if (acpi_sci_irq_valid())
 		synchronize_hardirq(acpi_sci_irq);
 	flush_workqueue(kacpid_wq);
@@ -1184,6 +1185,7 @@ static void acpi_hotplug_work_fn(struct work_struct *work)
 {
 	struct acpi_hp_work *hpw = container_of(work, struct acpi_hp_work, work);
+	printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 	acpi_os_wait_events_complete();
 	acpi_device_hotplug(hpw->adev, hpw->src);
 	kfree(hpw);
@@ -1192,6 +1194,7 @@ static void acpi_hotplug_work_fn(struct work_struct *work)
 acpi_status acpi_hotplug_schedule(struct acpi_device *adev, u32 src)
 {
 	struct acpi_hp_work *hpw;
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	acpi_handle_debug(adev->handle,
 			  "Scheduling hotplug event %u for deferred handling\n",
diff --git a/drivers/acpi/scan.c b/drivers/acpi/scan.c
index 065abe56f440..d53be7e0388d 100644
--- a/drivers/acpi/scan.c
+++ b/drivers/acpi/scan.c
@@ -251,6 +251,7 @@ static int acpi_scan_check_and_detach(struct acpi_device *adev, void *p)
 {
 	struct acpi_scan_handler *handler = adev->handler;
 	uintptr_t flags = (uintptr_t)p;
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	acpi_dev_for_each_child_reverse(adev, acpi_scan_check_and_detach, p);
@@ -314,6 +315,7 @@ static void acpi_scan_check_subtree(struct acpi_device *adev)
 {
 	uintptr_t flags = ACPI_SCAN_CHECK_FLAG_STATUS;
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	acpi_scan_check_and_detach(adev, (void *)flags);
 }
@@ -369,6 +371,7 @@ static int acpi_scan_rescan_bus(struct acpi_device *adev)
 {
 	struct acpi_scan_handler *handler = adev->handler;
 	int ret;
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	if (handler && handler->hotplug.scan_dependent)
 		ret = handler->hotplug.scan_dependent(adev);
@@ -385,6 +388,7 @@ static int acpi_scan_device_check(struct acpi_device *adev)
 {
 	struct acpi_device *parent;
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	acpi_scan_check_subtree(adev);
 	if (!acpi_device_is_present(adev))
@@ -412,19 +416,24 @@ static int acpi_scan_device_check(struct acpi_device *adev)
 static int acpi_scan_bus_check(struct acpi_device *adev)
 {
 	acpi_scan_check_subtree(adev);
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	return acpi_scan_rescan_bus(adev);
 }
 static int acpi_generic_hotplug_event(struct acpi_device *adev, u32 type)
 {
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	switch (type) {
 	case ACPI_NOTIFY_BUS_CHECK:
+		dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 		return acpi_scan_bus_check(adev);
 	case ACPI_NOTIFY_DEVICE_CHECK:
+		dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 		return acpi_scan_device_check(adev);
 	case ACPI_NOTIFY_EJECT_REQUEST:
 	case ACPI_OST_EC_OSPM_EJECT:
+		dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 		if (adev->handler && !adev->handler->hotplug.enabled) {
 			dev_info(&adev->dev, "Eject disabled\n");
 			return -EPERM;
@@ -441,6 +450,7 @@ void acpi_device_hotplug(struct acpi_device *adev, u32 src)
 	u32 ost_code = ACPI_OST_SC_NON_SPECIFIC_FAILURE;
 	int error = -ENODEV;
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	lock_device_hotplug();
 	mutex_lock(&acpi_scan_lock);
@@ -466,9 +476,10 @@ void acpi_device_hotplug(struct acpi_device *adev, u32 src)
 		 * There may be additional notify handlers for device objects
 		 * without the .event() callback, so ignore them here.
 		 */
-		if (notify)
+		if (notify) {
+			dev_info(&adev->dev, "%s %d: calling notify = %px\n", __func__, __LINE__, (void *) notify);
 			error = notify(adev, src);
-		else
+		} else
 			goto out;
 	}
 	switch (error) {
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_acpi.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_acpi.c
index 6c62e27b9800..4f00e15e7759 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_acpi.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_acpi.c
@@ -168,6 +168,7 @@ static union acpi_object *amdgpu_atif_call(struct amdgpu_atif *atif,
 		atif_arg_elements[1].integer.value = 0;
 	}
+	printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 	status = acpi_evaluate_object(atif->handle, NULL, &atif_arg, &buffer);
 	obj = (union acpi_object *)buffer.pointer;
@@ -559,6 +560,7 @@ static union acpi_object *amdgpu_atcs_call(struct amdgpu_atcs *atcs,
 		atcs_arg_elements[1].integer.value = 0;
 	}
+	printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 	status = acpi_evaluate_object(atcs->handle, NULL, &atcs_arg, &buffer);
 	/* Fail only if calling the method fails and ATIF is supported */
@@ -608,6 +610,7 @@ static int amdgpu_atcs_verify_interface(struct amdgpu_atcs *atcs)
 	size_t size;
 	int err = 0;
+	printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 	info = amdgpu_atcs_call(atcs, ATCS_FUNCTION_VERIFY_INTERFACE, NULL);
 	if (!info)
 		return -EIO;
@@ -682,6 +685,7 @@ int amdgpu_acpi_pcie_notify_device_ready(struct amdgpu_device *adev)
 	if (!atcs->functions.pcie_dev_rdy)
 		return -EINVAL;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	info = amdgpu_atcs_call(atcs, ATCS_FUNCTION_PCIE_DEVICE_READY_NOTIFICATION, NULL);
 	if (!info)
 		return -EIO;
@@ -733,6 +737,7 @@ int amdgpu_acpi_pcie_performance_request(struct amdgpu_device *adev,
 	params.pointer = &atcs_input;
 	while (retry--) {
+		dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 		info = amdgpu_atcs_call(atcs, ATCS_FUNCTION_PCIE_PERFORMANCE_REQUEST, &params);
 		if (!info)
 			return -EIO;
@@ -798,6 +803,7 @@ int amdgpu_acpi_power_shift_control(struct amdgpu_device *adev,
 	params.length = sizeof(struct atcs_pwr_shift_input);
 	params.pointer = &atcs_input;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	info = amdgpu_atcs_call(atcs, ATCS_FUNCTION_POWER_SHIFT_CONTROL, &params);
 	if (!info) {
 		DRM_ERROR("ATCS PSC update failed\n");
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_atpx_handler.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_atpx_handler.c
index 3893e6fc2f03..ed3063f09007 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_atpx_handler.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_atpx_handler.c
@@ -123,6 +123,7 @@ static union acpi_object *amdgpu_atpx_call(acpi_handle handle, int function,
 		atpx_arg_elements[1].integer.value = 0;
 	}
+	printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 	status = acpi_evaluate_object(handle, NULL, &atpx_arg, &buffer);
 	/* Fail only if calling the method fails and ATPX is supported */
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_bios.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_bios.c
index 00e96419fcda..542d039cfd42 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_bios.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_bios.c
@@ -272,6 +272,7 @@ static int amdgpu_atrm_call(acpi_handle atrm_handle, uint8_t *bios,
 	atrm_arg_elements[1].type = ACPI_TYPE_INTEGER;
 	atrm_arg_elements[1].integer.value = len;
+	printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 	status = acpi_evaluate_object(atrm_handle, NULL, &atrm_arg, &buffer);
 	if (ACPI_FAILURE(status)) {
 		DRM_ERROR("failed to evaluate ATRM got %s\n", acpi_format_exception(status));
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
index 0fdfde3dcb9f..bab504d1d24d 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
@@ -5194,6 +5194,7 @@ int amdgpu_device_suspend(struct drm_device *dev, bool notify_clients)
 	struct amdgpu_device *adev = drm_to_adev(dev);
 	int r = 0;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	if (dev->switch_power_state == DRM_SWITCH_POWER_OFF)
 		return 0;
@@ -5208,6 +5209,7 @@ int amdgpu_device_suspend(struct drm_device *dev, bool notify_clients)
 		return r;
 	}
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	if (amdgpu_acpi_smart_shift_update(adev, AMDGPU_SS_DEV_D3))
 		dev_warn(adev->dev, "smart shift update failed\n");
@@ -5286,6 +5288,7 @@ int amdgpu_device_resume(struct drm_device *dev, bool notify_clients)
 	struct amdgpu_device *adev = drm_to_adev(dev);
 	int r = 0;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	if (amdgpu_sriov_vf(adev)) {
 		r = amdgpu_virt_request_full_gpu(adev, true);
 		if (r)
@@ -5379,6 +5382,7 @@ int amdgpu_device_resume(struct drm_device *dev, bool notify_clients)
 	amdgpu_vram_mgr_clear_reset_blocks(adev);
 	adev->in_suspend = false;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	if (amdgpu_acpi_smart_shift_update(adev, AMDGPU_SS_DEV_D0))
 		dev_warn(adev->dev, "smart shift update failed\n");
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
index ece251cbe8c3..165bd79fce82 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
@@ -2795,6 +2795,7 @@ static int amdgpu_pmops_runtime_suspend(struct device *dev)
 	struct drm_device *drm_dev = pci_get_drvdata(pdev);
 	struct amdgpu_device *adev = drm_to_adev(drm_dev);
 	int ret, i;
+	dev_info(dev, "%s %d\n", __func__, __LINE__);
 	if (adev->pm.rpm_mode == AMDGPU_RUNPM_NONE) {
 		pm_runtime_forbid(dev);
@@ -2874,6 +2875,7 @@ static int amdgpu_pmops_runtime_resume(struct device *dev)
 	struct drm_device *drm_dev = pci_get_drvdata(pdev);
 	struct amdgpu_device *adev = drm_to_adev(drm_dev);
 	int ret;
+	dev_info(dev, "%s %d\n", __func__, __LINE__);
 	if (adev->pm.rpm_mode == AMDGPU_RUNPM_NONE)
 		return -EINVAL;
diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c b/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c
index 8841d7213de4..576ff827d80c 100644
--- a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c
+++ b/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c
@@ -7475,6 +7475,7 @@ static int gfx_v10_0_hw_init(struct amdgpu_ip_block *ip_block)
 {
 	int r;
 	struct amdgpu_device *adev = ip_block->adev;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	if (!amdgpu_emu_mode)
 		gfx_v10_0_init_golden_registers(adev);
@@ -7529,6 +7530,7 @@ static int gfx_v10_0_hw_init(struct amdgpu_ip_block *ip_block)
 static int gfx_v10_0_hw_fini(struct amdgpu_ip_block *ip_block)
 {
 	struct amdgpu_device *adev = ip_block->adev;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	cancel_delayed_work_sync(&adev->gfx.idle_work);
diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v10_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v10_0.c
index d7499be8c4bf..fd4062e97e11 100644
--- a/drivers/gpu/drm/amd/amdgpu/gmc_v10_0.c
+++ b/drivers/gpu/drm/amd/amdgpu/gmc_v10_0.c
@@ -983,6 +983,7 @@ static int gmc_v10_0_hw_init(struct amdgpu_ip_block *ip_block)
 {
 	struct amdgpu_device *adev = ip_block->adev;
 	int r;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	adev->gmc.flush_pasid_uses_kiq = !amdgpu_emu_mode;
@@ -1029,6 +1030,7 @@ static void gmc_v10_0_gart_disable(struct amdgpu_device *adev)
 static int gmc_v10_0_hw_fini(struct amdgpu_ip_block *ip_block)
 {
 	struct amdgpu_device *adev = ip_block->adev;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	gmc_v10_0_gart_disable(adev);
diff --git a/drivers/gpu/drm/amd/amdgpu/jpeg_v3_0.c b/drivers/gpu/drm/amd/amdgpu/jpeg_v3_0.c
index d1a011c40ba2..a181c9965282 100644
--- a/drivers/gpu/drm/amd/amdgpu/jpeg_v3_0.c
+++ b/drivers/gpu/drm/amd/amdgpu/jpeg_v3_0.c
@@ -174,6 +174,7 @@ static int jpeg_v3_0_hw_init(struct amdgpu_ip_block *ip_block)
 {
 	struct amdgpu_device *adev = ip_block->adev;
 	struct amdgpu_ring *ring = adev->jpeg.inst->ring_dec;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	adev->nbio.funcs->vcn_doorbell_range(adev, ring->use_doorbell, (adev->doorbell_index.vcn.vcn_ring0_1 << 1), 0);
@@ -212,6 +213,7 @@ static int jpeg_v3_0_suspend(struct amdgpu_ip_block *ip_block)
 {
 	int r;
+	dev_info(ip_block->adev->dev, "%s %d\n", __func__, __LINE__);
 	r = jpeg_v3_0_hw_fini(ip_block);
 	if (r)
 		return r;
@@ -232,6 +234,7 @@ static int jpeg_v3_0_resume(struct amdgpu_ip_block *ip_block)
 {
 	int r;
+	dev_info(ip_block->adev->dev, "%s %d\n", __func__, __LINE__);
 	r = amdgpu_jpeg_resume(ip_block->adev);
 	if (r)
 		return r;
diff --git a/drivers/gpu/drm/amd/amdgpu/navi10_ih.c b/drivers/gpu/drm/amd/amdgpu/navi10_ih.c
index 4cd325149b63..f33f5e2e6e53 100644
--- a/drivers/gpu/drm/amd/amdgpu/navi10_ih.c
+++ b/drivers/gpu/drm/amd/amdgpu/navi10_ih.c
@@ -320,6 +320,7 @@ static int navi10_ih_irq_init(struct amdgpu_device *adev)
 	u32 ih_chicken;
 	int ret;
 	int i;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	/* disable irqs */
 	ret = navi10_ih_toggle_interrupts(adev, false);
@@ -385,6 +386,7 @@ static int navi10_ih_irq_init(struct amdgpu_device *adev)
  */
 static void navi10_ih_irq_disable(struct amdgpu_device *adev)
 {
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	force_update_wptr_for_self_int(adev, 0, 8, false);
 	navi10_ih_toggle_interrupts(adev, false);
diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v5_2.c b/drivers/gpu/drm/amd/amdgpu/sdma_v5_2.c
index 3bd44c24f692..78f60da4f498 100644
--- a/drivers/gpu/drm/amd/amdgpu/sdma_v5_2.c
+++ b/drivers/gpu/drm/amd/amdgpu/sdma_v5_2.c
@@ -697,6 +697,7 @@ static int sdma_v5_2_gfx_resume(struct amdgpu_device *adev)
 {
 	int i, r;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	for (i = 0; i < adev->sdma.num_instances; i++) {
 		r = sdma_v5_2_gfx_resume_instance(adev, i, false);
 		if (r)
@@ -819,6 +820,7 @@ static int sdma_v5_2_start(struct amdgpu_device *adev)
 	int r = 0;
 	struct amdgpu_ip_block *ip_block;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	if (amdgpu_sriov_vf(adev)) {
 		sdma_v5_2_ctx_switch_enable(adev, false);
 		sdma_v5_2_enable(adev, false);
@@ -1404,6 +1406,7 @@ static int sdma_v5_2_hw_fini(struct amdgpu_ip_block *ip_block)
 	if (amdgpu_sriov_vf(adev))
 		return 0;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	sdma_v5_2_ctx_switch_enable(adev, false);
 	sdma_v5_2_enable(adev, false);
diff --git a/drivers/gpu/drm/amd/amdgpu/vcn_v3_0.c b/drivers/gpu/drm/amd/amdgpu/vcn_v3_0.c
index d9cf8f0feeb3..b31062f212b5 100644
--- a/drivers/gpu/drm/amd/amdgpu/vcn_v3_0.c
+++ b/drivers/gpu/drm/amd/amdgpu/vcn_v3_0.c
@@ -367,6 +367,7 @@ static int vcn_v3_0_hw_init(struct amdgpu_ip_block *ip_block)
 	struct amdgpu_device *adev = ip_block->adev;
 	struct amdgpu_ring *ring;
 	int i, j, r;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	if (amdgpu_sriov_vf(adev)) {
 		r = vcn_v3_0_start_sriov(adev);
@@ -441,6 +442,7 @@ static int vcn_v3_0_hw_fini(struct amdgpu_ip_block *ip_block)
 {
 	struct amdgpu_device *adev = ip_block->adev;
 	int i;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	for (i = 0; i < adev->vcn.num_vcn_inst; ++i) {
 		struct amdgpu_vcn_inst *vinst = &adev->vcn.inst[i];
@@ -474,6 +476,7 @@ static int vcn_v3_0_suspend(struct amdgpu_ip_block *ip_block)
 	struct amdgpu_device *adev = ip_block->adev;
 	int r, i;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	r = vcn_v3_0_hw_fini(ip_block);
 	if (r)
 		return r;
@@ -498,6 +501,7 @@ static int vcn_v3_0_resume(struct amdgpu_ip_block *ip_block)
 {
 	struct amdgpu_device *adev = ip_block->adev;
 	int r, i;
+	dev_info(adev->dev, "%s %d\n", __func__, __LINE__);
 	for (i = 0; i < adev->vcn.num_vcn_inst; i++) {
 		r = amdgpu_vcn_resume(ip_block->adev, i);
diff --git a/drivers/pci/hotplug/acpiphp_glue.c b/drivers/pci/hotplug/acpiphp_glue.c
index 5b1f271c6034..e56ab308da20 100644
--- a/drivers/pci/hotplug/acpiphp_glue.c
+++ b/drivers/pci/hotplug/acpiphp_glue.c
@@ -484,6 +484,7 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge)
 	struct pci_dev *dev;
 	struct pci_bus *bus = slot->bus;
 	struct acpiphp_func *func;
+	printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 	if (bridge && bus->self && hotplug_is_native(bus->self)) {
 		/*
@@ -494,10 +495,14 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge)
 		 * as a Thunderbolt host controller.
 		 */
 		for_each_pci_bridge(dev, bus) {
-			if (PCI_SLOT(dev->devfn) == slot->device)
+			dev_info(&dev->dev, "%s %d\n", __func__, __LINE__);
+			if (PCI_SLOT(dev->devfn) == slot->device) {
+				dev_info(&dev->dev, "%s %d\n", __func__, __LINE__);
 				acpiphp_native_scan_bridge(dev);
+			}
 		}
 	} else {
+		printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 		LIST_HEAD(add_list);
 		int max, pass;
@@ -505,11 +510,15 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge)
 		max = acpiphp_max_busnr(bus);
 		for (pass = 0; pass < 2; pass++) {
 			for_each_pci_bridge(dev, bus) {
-				if (PCI_SLOT(dev->devfn) != slot->device)
+				dev_info(&dev->dev, "%s %d\n", __func__, __LINE__);
+				if (PCI_SLOT(dev->devfn) != slot->device) {
+					printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 					continue;
+				}
 				max = pci_scan_bridge(bus, dev, max, pass);
 				if (pass && dev->subordinate) {
+					dev_info(&dev->dev, "%s %d\n", __func__, __LINE__);
 					check_hotplug_bridge(slot, dev);
 					pcibios_resource_survey_bus(dev->subordinate);
 					__pci_bus_size_bridges(dev->subordinate,
@@ -526,6 +535,7 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge)
 	list_for_each_entry(dev, &bus->devices, bus_list) {
 		/* Assume that newly added devices are powered on already. */
+		dev_info(&dev->dev, "%s %d\n", __func__, __LINE__);
 		if (!pci_dev_is_added(dev))
 			dev->current_state = PCI_D0;
 	}
@@ -544,6 +554,7 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge)
 		}
 		pci_dev_put(dev);
 	}
+	printk(KERN_INFO "%s %d\n", __func__, __LINE__);
 }
 /**
@@ -702,31 +713,43 @@ static void acpiphp_check_bridge(struct acpiphp_bridge *bridge)
 	if (bridge->is_going_away)
 		return;
-	if (bridge->pci_dev)
+	if (bridge->pci_dev) {
+		dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__);
 		pm_runtime_get_sync(&bridge->pci_dev->dev);
+	}
+	dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__);
 	list_for_each_entry(slot, &bridge->slots, node) {
 		struct pci_bus *bus = slot->bus;
 		struct pci_dev *dev, *tmp;
+		dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__);
 		if (slot_no_hotplug(slot)) {
-			; /* do nothing */
+			/* do nothing */
+			dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__);
 		} else if (device_status_valid(get_slot_status(slot))) {
 			/* remove stale devices if any */
 			list_for_each_entry_safe_reverse(dev, tmp,
-							 &bus->devices, bus_list)
+							 &bus->devices, bus_list) {
+				dev_info(&dev->dev, "%s %d\n", __func__, __LINE__);
 				if (PCI_SLOT(dev->devfn) == slot->device)
 					trim_stale_devices(dev);
+			}
 			/* configure all functions */
+			dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__);
 			enable_slot(slot, true);
 		} else {
+			dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__);
 			disable_slot(slot);
 		}
 	}
-	if (bridge->pci_dev)
+	if (bridge->pci_dev) {
 		pm_runtime_put(&bridge->pci_dev->dev);
+		dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__);
+	}
+	dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__);
 }
 /*
@@ -760,6 +783,7 @@ static void acpiphp_sanitize_bus(struct pci_bus *bus)
 void acpiphp_check_host_bridge(struct acpi_device *adev)
 {
 	struct acpiphp_bridge *bridge = NULL;
+	dev_info(&adev->dev, "%s %d\n", __func__, __LINE__);
 	acpi_lock_hp_context();
 	if (adev->hp) {
@@ -799,6 +823,7 @@ static void
hotplug_event(u32 type, struct acpiphp_context *context) switch (type) { case ACPI_NOTIFY_BUS_CHECK: /* bus re-enumerate */ + printk(KERN_INFO "%s %d: ACPI_NOTIFY_BUS_CHECK\n", __func__, __LINE__); acpi_handle_debug(handle, "Bus check in %s()\n", __func__); if (bridge) acpiphp_check_bridge(bridge); @@ -809,6 +834,7 @@ static void hotplug_event(u32 type, struct acpiphp_context *context) case ACPI_NOTIFY_DEVICE_CHECK: /* device check */ + printk(KERN_INFO "%s %d: ACPI_NOTIFY_DEVICE_CHECK\n", __func__, __LINE__); acpi_handle_debug(handle, "Device check in %s()\n", __func__); if (bridge) { acpiphp_check_bridge(bridge); @@ -824,19 +850,23 @@ static void hotplug_event(u32 type, struct acpiphp_context *context) case ACPI_NOTIFY_EJECT_REQUEST: /* request device eject */ + printk(KERN_INFO "%s %d: ACPI_NOTIFY_EJECT_REQUEST\n", __func__, __LINE__); acpi_handle_debug(handle, "Eject request in %s()\n", __func__); acpiphp_disable_and_eject_slot(slot); break; } pci_unlock_rescan_remove(); + printk(KERN_INFO "%s %d:\n", __func__, __LINE__); if (bridge) put_bridge(bridge); + printk(KERN_INFO "%s %d:\n", __func__, __LINE__); } static int acpiphp_hotplug_notify(struct acpi_device *adev, u32 type) { struct acpiphp_context *context; + dev_info(&adev->dev, "%s %d: %s = %px\n", __func__, __LINE__, __func__, (void *) acpiphp_hotplug_notify); context = acpiphp_grab_context(adev); if (!context) diff --git a/drivers/pci/hotplug/pciehp_core.c b/drivers/pci/hotplug/pciehp_core.c index f59baa912970..8f90f91c0a07 100644 --- a/drivers/pci/hotplug/pciehp_core.c +++ b/drivers/pci/hotplug/pciehp_core.c @@ -266,6 +266,7 @@ static void pciehp_disable_interrupt(struct pcie_device *dev) * Disable hotplug interrupt so that it does not trigger * immediately when the downstream link goes down. 
 	 */
+	dev_info(&dev->device, "%s %d\n", __func__, __LINE__);
 	if (pme_is_native(dev))
 		pcie_disable_interrupt(get_service_data(dev));
 }
@@ -273,6 +274,7 @@ static void pciehp_disable_interrupt(struct pcie_device *dev)
 #ifdef CONFIG_PM_SLEEP
 static int pciehp_suspend(struct pcie_device *dev)
 {
+	dev_info(&dev->device, "%s %d\n", __func__, __LINE__);
 	/*
 	 * If the port is already runtime suspended we can keep it that
 	 * way.
@@ -287,6 +289,7 @@ static int pciehp_suspend(struct pcie_device *dev)
 static int pciehp_resume_noirq(struct pcie_device *dev)
 {
 	struct controller *ctrl = get_service_data(dev);
+	dev_info(&dev->device, "%s %d\n", __func__, __LINE__);
 
 	/* pci_restore_state() just wrote to the Slot Control register */
 	ctrl->cmd_started = jiffies;
@@ -317,6 +320,7 @@ static int pciehp_resume_noirq(struct pcie_device *dev)
 static int pciehp_resume(struct pcie_device *dev)
 {
 	struct controller *ctrl = get_service_data(dev);
+	dev_info(&dev->device, "%s %d\n", __func__, __LINE__);
 
 	if (pme_is_native(dev))
 		pcie_enable_interrupt(ctrl);
@@ -328,6 +332,7 @@ static int pciehp_resume(struct pcie_device *dev)
 
 static int pciehp_runtime_suspend(struct pcie_device *dev)
 {
+	dev_info(&dev->device, "%s %d\n", __func__, __LINE__);
 	pciehp_disable_interrupt(dev);
 	return 0;
 }
@@ -335,6 +340,7 @@ static int pciehp_runtime_suspend(struct pcie_device *dev)
 static int pciehp_runtime_resume(struct pcie_device *dev)
 {
 	struct controller *ctrl = get_service_data(dev);
+	dev_info(&dev->device, "%s %d\n", __func__, __LINE__);
 
 	/* pci_restore_state() just wrote to the Slot Control register */
 	ctrl->cmd_started = jiffies;

This gives the following output when crashing (only the last few lines, which
do not appear in /var/log/kern.log but are captured with netconsole).
The processes involved here are the following:
T254: [irq/40-ACPI:Event] (this is a threaded interrupt handler for ACPI events)
The other two processes (T61442 and T77816) are [kworker/mm_percpu_wq] workqueue threads.

2025-09-30T02:25:57.704378+02:00 [T254]evmisc-0132 ev_queue_notify_reques: Dispatching Notify on [GPP0] (Device) Value 0x00 (Bus Check) Node 0000000017caa1c9
2025-09-30T02:25:57.704378+02:00 [T61442]acpi_bus_notify 517: ACPI_NOTIFY_BUS_CHECK
2025-09-30T02:25:57.704378+02:00 [T61442]acpi device:00: acpi_bus_notify 567#012 SUBSYSTEM=acpi#012 DEVICE=+acpi:device:00
2025-09-30T02:25:57.704378+02:00 [T61442]acpi device:00: acpi_hotplug_schedule 1197#012 SUBSYSTEM=acpi#012 DEVICE=+acpi:device:00
2025-09-30T02:25:57.704378+02:00 [T77816]acpi_hotplug_work_fn 1188
2025-09-30T02:25:57.704378+02:00 [T77816]acpi_os_wait_events_complete 1170
2025-09-30T02:25:57.704378+02:00 [T77816]acpi device:00: acpi_device_hotplug 453#012 SUBSYSTEM=acpi#012 DEVICE=+acpi:device:00
2025-09-30T02:25:57.704378+02:00 [T77816]acpi device:00: acpi_device_hotplug 480: calling notify = ffffffffb8a24fc0#012 SUBSYSTEM=acpi#012 DEVICE=+acpi:device:00
2025-09-30T02:25:57.704378+02:00 [T77816]acpi device:00: acpiphp_hotplug_notify 869: acpiphp_hotplug_notify = ffffffffb8a24fc0#012 SUBSYSTEM=acpi#012 DEVICE=+acpi:device:00
2025-09-30T02:25:57.704378+02:00 [T77816]hotplug_event 826: ACPI_NOTIFY_BUS_CHECK
2025-09-30T02:25:57.704378+02:00 [T77816]pcieport 0000:00:01.1: acpiphp_check_bridge 717#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1

So the problem appears to be happening inside of acpiphp_check_bridge():

static void acpiphp_check_bridge(struct acpiphp_bridge *bridge)
{
	struct acpiphp_slot *slot;

	/* Bail out if the bridge is going away. */
	if (bridge->is_going_away)
		return;

	if (bridge->pci_dev) {
		dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__); // This is the last reported line.
		pm_runtime_get_sync(&bridge->pci_dev->dev);
	}
	dev_info(&bridge->pci_dev->dev, "%s %d\n", __func__, __LINE__); // This line is not reported during a crash.

Bert Karwatzki
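A side note on reading the netconsole capture above: the "#012" sequences look like the receiving syslog daemon's octal escape for an embedded newline (octal 012), which separates the message text from the SUBSYSTEM=/DEVICE= fields. That interpretation is an assumption about the logging setup, not something stated in the report. A minimal userspace sketch for decoding such escapes when post-processing the log, with the hypothetical helper name unescape_syslog() (not an existing tool or kernel function):

```c
#include <assert.h>
#include <stddef.h>

/*
 * Decode "#ooo" octal escapes (for example "#012" for '\n') in one log line.
 * Hypothetical helper for offline log clean-up, not a kernel or syslog API.
 * Writes at most out_sz - 1 bytes plus a terminating NUL and returns the
 * number of bytes written, not counting the NUL.
 */
static size_t unescape_syslog(const char *in, char *out, size_t out_sz)
{
	size_t n = 0;

	assert(in && out && out_sz > 0);

	while (*in && n + 1 < out_sz) {
		/* '#' followed by three octal digits, first digit 0..3 so the value fits a byte */
		if (in[0] == '#' &&
		    in[1] >= '0' && in[1] <= '3' &&
		    in[2] >= '0' && in[2] <= '7' &&
		    in[3] >= '0' && in[3] <= '7') {
			out[n++] = (char)(((in[1] - '0') << 6) |
					  ((in[2] - '0') << 3) |
					  (in[3] - '0'));
			in += 4;
		} else {
			/* anything else is copied through unchanged */
			out[n++] = *in++;
		}
	}
	out[n] = '\0';
	return n;
}
```

Feeding each captured line through this helper should put the message text and the SUBSYSTEM=/DEVICE= fields on separate lines, which makes the last marker before the crash easier to spot.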
* [REGRESSION 02/04] Crash during resume of pcie bridge
2025-10-06 12:09 [REGRESSION 00/04] Crash during resume of pcie bridge Bert Karwatzki
2025-10-06 12:09 ` [REGRESSION 01/04] " Bert Karwatzki
@ 2025-10-06 12:09 ` Bert Karwatzki
2025-10-06 12:09 ` [REGRESSION 03/04] " Bert Karwatzki
` (3 subsequent siblings)
5 siblings, 0 replies; 31+ messages in thread
From: Bert Karwatzki @ 2025-10-06 12:09 UTC (permalink / raw)
To: linux-kernel
Cc: Bert Karwatzki, linux-next, linux-stable, regressions, linux-pci,
linux-acpi, Mario Limonciello, Christian König,
Rafael J . Wysocki

The next step is to monitor pm_runtime_get_sync(), rpm_resume() and
__pm_runtime_resume(). Here we need to use conditional debugging output,
or else we get messages at a rate of about a million lines per minute.
This is the additional debugging used in
6.17.0-rc6-next-20250917-gpudebug-00024-g5c6b49b810db:

diff --git a/drivers/base/power/runtime.c b/drivers/base/power/runtime.c
index 7420b9851fe0..895898c3cd56 100644
--- a/drivers/base/power/runtime.c
+++ b/drivers/base/power/runtime.c
@@ -787,12 +787,18 @@ static int rpm_resume(struct device *dev, int rpmflags)
 	struct device *parent = NULL;
 	int retval = 0;
 
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	trace_rpm_resume(dev, rpmflags);
 
  repeat:
 	if (dev->power.runtime_error) {
+		if (!strcmp(dev_name(dev), "0000:00:01.1"))
+			dev_info(dev, "%s %d\n", __func__, __LINE__);
 		retval = -EINVAL;
 	} else if (dev->power.disable_depth > 0) {
+		if (!strcmp(dev_name(dev), "0000:00:01.1"))
+			dev_info(dev, "%s %d\n", __func__, __LINE__);
 		if (dev->power.runtime_status == RPM_ACTIVE &&
 		    dev->power.last_status == RPM_ACTIVE)
 			retval = 1;
@@ -808,31 +814,45 @@ static int rpm_resume(struct device *dev, int rpmflags)
 	 * rather than cancelling it now only to restart it again in the near
 	 * future.
*/ + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); dev->power.request = RPM_REQ_NONE; if (!dev->power.timer_autosuspends) pm_runtime_deactivate_timer(dev); if (dev->power.runtime_status == RPM_ACTIVE) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); retval = 1; goto out; } if (dev->power.runtime_status == RPM_RESUMING || dev->power.runtime_status == RPM_SUSPENDING) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); DEFINE_WAIT(wait); if (rpmflags & (RPM_ASYNC | RPM_NOWAIT)) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); if (dev->power.runtime_status == RPM_SUSPENDING) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); dev->power.deferred_resume = true; if (rpmflags & RPM_NOWAIT) retval = -EINPROGRESS; } else { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); retval = -EINPROGRESS; } goto out; } if (dev->power.irq_safe) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); spin_unlock(&dev->power.lock); cpu_relax(); @@ -856,6 +876,8 @@ static int rpm_resume(struct device *dev, int rpmflags) spin_lock_irq(&dev->power.lock); } finish_wait(&dev->power.wait_queue, &wait); + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); goto repeat; } @@ -865,22 +887,32 @@ static int rpm_resume(struct device *dev, int rpmflags) * the resume will actually succeed. 
*/ if (dev->power.no_callbacks && !parent && dev->parent) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); spin_lock_nested(&dev->parent->power.lock, SINGLE_DEPTH_NESTING); if (dev->parent->power.disable_depth > 0 || dev->parent->power.ignore_children || dev->parent->power.runtime_status == RPM_ACTIVE) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); atomic_inc(&dev->parent->power.child_count); spin_unlock(&dev->parent->power.lock); retval = 1; goto no_callback; /* Assume success. */ } spin_unlock(&dev->parent->power.lock); + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); } /* Carry out an asynchronous or a synchronous resume. */ if (rpmflags & RPM_ASYNC) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); dev->power.request = RPM_REQ_RESUME; if (!dev->power.request_pending) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); dev->power.request_pending = true; queue_work(pm_wq, &dev->power.work); } @@ -894,6 +926,8 @@ static int rpm_resume(struct device *dev, int rpmflags) * necessary. Not needed if dev is irq-safe; then the * parent is permanently resumed. 
*/ + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); parent = dev->parent; if (dev->power.irq_safe) goto skip_parent; @@ -909,6 +943,8 @@ static int rpm_resume(struct device *dev, int rpmflags) */ if (!parent->power.disable_depth && !parent->power.ignore_children) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); rpm_resume(parent, 0); if (parent->power.runtime_status != RPM_ACTIVE) retval = -EBUSY; @@ -919,10 +955,14 @@ static int rpm_resume(struct device *dev, int rpmflags) if (retval) goto out; + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); goto repeat; } skip_parent: + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); if (dev->power.no_callbacks) goto no_callback; /* Assume success. */ @@ -933,11 +973,15 @@ static int rpm_resume(struct device *dev, int rpmflags) dev_pm_disable_wake_irq_check(dev, false); retval = rpm_callback(callback, dev); if (retval) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); __update_runtime_status(dev, RPM_SUSPENDED); pm_runtime_cancel_pending(dev); dev_pm_enable_wake_irq_check(dev, false); } else { no_callback: + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); __update_runtime_status(dev, RPM_ACTIVE); pm_runtime_mark_last_busy(dev); if (parent) @@ -949,7 +993,11 @@ static int rpm_resume(struct device *dev, int rpmflags) rpm_idle(dev, RPM_ASYNC); out: + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); if (parent && !dev->power.irq_safe) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); spin_unlock_irq(&dev->power.lock); pm_runtime_put(parent); @@ -959,6 +1007,8 @@ static int rpm_resume(struct device *dev, int rpmflags) trace_rpm_return_int(dev, _THIS_IP_, retval); + if 
(!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	return retval;
 }
@@ -1181,17 +1231,27 @@ int __pm_runtime_resume(struct device *dev, int rpmflags)
 {
 	unsigned long flags;
 	int retval;
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 
 	might_sleep_if(!(rpmflags & RPM_ASYNC) && !dev->power.irq_safe &&
 			dev->power.runtime_status != RPM_ACTIVE);
 
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	if (rpmflags & RPM_GET_PUT)
 		atomic_inc(&dev->power.usage_count);
 
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	spin_lock_irqsave(&dev->power.lock, flags);
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	retval = rpm_resume(dev, rpmflags);
 	spin_unlock_irqrestore(&dev->power.lock, flags);
 
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	return retval;
 }
 EXPORT_SYMBOL_GPL(__pm_runtime_resume);
diff --git a/include/linux/pm_runtime.h b/include/linux/pm_runtime.h
index d88d6b6ccf5b..0888b0d5ec73 100644
--- a/include/linux/pm_runtime.h
+++ b/include/linux/pm_runtime.h
@@ -508,6 +508,8 @@ static inline int pm_runtime_get(struct device *dev)
  */
 static inline int pm_runtime_get_sync(struct device *dev)
 {
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s\n", __func__);
 	return __pm_runtime_resume(dev, RPM_GET_PUT);
 }
 

With this there is no crash after 60h of uptime and ~3093 GPP0 notifies.
Probably the printk()s are mitigating the crash in some way (i.e. there's a
race and the printk()s are slowing down only one side ...). It would be nice
if we could get a crash while all the printk()s are in place, but I'm not sure
if we can ...

Stopped 6.17.0-rc6-next-20250917-gpudebug-00024-g5c6b49b810db after 60h and
3093 GPP0 notifies without a crash.

Bert Karwatzki
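The conditional output in the patch above repeats the same two-line strcmp()/dev_info() pattern at every marker. As a sketch only, the pattern could be folded into a single macro. The version below is a userspace mock so it can be compiled and tried outside the kernel: struct device is reduced to a name, fprintf() stands in for dev_info(), and dev_is_watched(), dbg_marker() and WATCHED_DEV are hypothetical names, not kernel APIs.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Mock of the kernel's struct device, reduced to what this sketch needs. */
struct device {
	const char *name;
};

/* Userspace stand-in for the kernel's dev_name(). */
static const char *dev_name(const struct device *dev)
{
	return dev->name;
}

/* The one device whose runtime PM path should be traced. */
#define WATCHED_DEV "0000:00:01.1"

static int dev_is_watched(const struct device *dev)
{
	assert(dev && dev->name);
	return strcmp(dev_name(dev), WATCHED_DEV) == 0;
}

/*
 * Print a "function line" marker only for the watched device.
 * Evaluates to the number of characters printed, or 0 if the device is
 * not the watched one. Note that dev is evaluated more than once.
 */
#define dbg_marker(dev)							\
	(dev_is_watched(dev) ?						\
	 fprintf(stderr, "%s: %s %d\n", dev_name(dev), __func__, __LINE__) : 0)
```

In a kernel build the fprintf() call would be replaced by dev_info(dev, "%s %d\n", __func__, __LINE__). This only shortens the patch; it does not reduce the timing impact of the printk()s discussed above.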
* [REGRESSION 03/04] Crash during resume of pcie bridge 2025-10-06 12:09 [REGRESSION 00/04] Crash during resume of pcie bridge Bert Karwatzki 2025-10-06 12:09 ` [REGRESSION 01/04] " Bert Karwatzki 2025-10-06 12:09 ` [REGRESSION 02/04] " Bert Karwatzki @ 2025-10-06 12:09 ` Bert Karwatzki 2025-10-06 12:09 ` [REGRESSION 04/04] " Bert Karwatzki ` (2 subsequent siblings) 5 siblings, 0 replies; 31+ messages in thread From: Bert Karwatzki @ 2025-10-06 12:09 UTC (permalink / raw) To: linux-kernel Cc: Bert Karwatzki, linux-next, linux-stable, regressions, linux-pci, linux-acpi, Mario Limonciello, Christian König, Rafael J . Wysocki In order to get a working crash I removed some of the monitoring again: diff --git a/drivers/acpi/bus.c b/drivers/acpi/bus.c index bc365c0dbe2f..a984ccd4a2a0 100644 --- a/drivers/acpi/bus.c +++ b/drivers/acpi/bus.c @@ -514,60 +514,46 @@ static void acpi_bus_notify(acpi_handle handle, u32 type, void *data) switch (type) { case ACPI_NOTIFY_BUS_CHECK: - printk(KERN_INFO "%s %d: ACPI_NOTIFY_BUS_CHECK\n", __func__, __LINE__); acpi_handle_debug(handle, "ACPI_NOTIFY_BUS_CHECK event\n"); break; case ACPI_NOTIFY_DEVICE_CHECK: - printk(KERN_INFO "%s %d: ACPI_NOTIFY_DEVICE_CHECK\n", __func__, __LINE__); acpi_handle_debug(handle, "ACPI_NOTIFY_DEVICE_CHECK event\n"); break; case ACPI_NOTIFY_DEVICE_WAKE: - printk(KERN_INFO "%s %d: ACPI_NOTIFY_DEVICE_WAKE\n", __func__, __LINE__); acpi_handle_debug(handle, "ACPI_NOTIFY_DEVICE_WAKE event\n"); return; case ACPI_NOTIFY_EJECT_REQUEST: - printk(KERN_INFO "%s %d: ACPI_NOTIFY_EJECT_REQUEST\n", __func__, __LINE__); acpi_handle_debug(handle, "ACPI_NOTIFY_EJECT_REQUEST event\n"); break; case ACPI_NOTIFY_DEVICE_CHECK_LIGHT: - printk(KERN_INFO "%s %d: ACPI_NOTIFY_DEVICE_CHECK_LIGHT\n", __func__, __LINE__); acpi_handle_debug(handle, "ACPI_NOTIFY_DEVICE_CHECK_LIGHT event\n"); /* TBD: Exactly what does 'light' mean? 
*/ return; case ACPI_NOTIFY_FREQUENCY_MISMATCH: - printk(KERN_INFO "%s %d: ACPI_NOTIFY_FREQUENCY_MISMATCH\n", __func__, __LINE__); acpi_handle_err(handle, "Device cannot be configured due " "to a frequency mismatch\n"); return; case ACPI_NOTIFY_BUS_MODE_MISMATCH: - printk(KERN_INFO "%s %d: ACPI_NOTIFY_BUS_MODE_MISMATCH\n", __func__, __LINE__); acpi_handle_err(handle, "Device cannot be configured due " "to a bus mode mismatch\n"); return; case ACPI_NOTIFY_POWER_FAULT: - printk(KERN_INFO "%s %d: ACPI_NOTIFY_POWER_FAULT\n", __func__, __LINE__); acpi_handle_err(handle, "Device has suffered a power fault\n"); return; default: - printk(KERN_INFO "%s %d: acpi unknown event type\n", __func__, __LINE__); acpi_handle_debug(handle, "Unknown event type 0x%x\n", type); return; } adev = acpi_get_acpi_dev(handle); - if (adev) - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); - else - printk(KERN_INFO "%s %d: adev = NULL\n", __func__, __LINE__); - if (adev && ACPI_SUCCESS(acpi_hotplug_schedule(adev, type))) return; diff --git a/drivers/acpi/device_pm.c b/drivers/acpi/device_pm.c index 9a7dc432b50d..4e0583274b8f 100644 --- a/drivers/acpi/device_pm.c +++ b/drivers/acpi/device_pm.c @@ -539,7 +539,6 @@ static void acpi_pm_notify_handler(acpi_handle handle, u32 val, void *not_used) if (!adev) return; - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); mutex_lock(&acpi_pm_notifier_lock); if (adev->wakeup.flags.notifier_present) { diff --git a/drivers/acpi/osl.c b/drivers/acpi/osl.c index 0f6a16856119..5ff343096ece 100644 --- a/drivers/acpi/osl.c +++ b/drivers/acpi/osl.c @@ -1167,7 +1167,6 @@ void acpi_os_wait_events_complete(void) * Make sure the GPE handler or the fixed event handler is not used * on another CPU after removal. 
*/ - printk(KERN_INFO "%s %d\n", __func__, __LINE__); if (acpi_sci_irq_valid()) synchronize_hardirq(acpi_sci_irq); flush_workqueue(kacpid_wq); @@ -1185,7 +1184,6 @@ static void acpi_hotplug_work_fn(struct work_struct *work) { struct acpi_hp_work *hpw = container_of(work, struct acpi_hp_work, work); - printk(KERN_INFO "%s %d\n", __func__, __LINE__); acpi_os_wait_events_complete(); acpi_device_hotplug(hpw->adev, hpw->src); kfree(hpw); @@ -1194,7 +1192,6 @@ static void acpi_hotplug_work_fn(struct work_struct *work) acpi_status acpi_hotplug_schedule(struct acpi_device *adev, u32 src) { struct acpi_hp_work *hpw; - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); acpi_handle_debug(adev->handle, "Scheduling hotplug event %u for deferred handling\n", diff --git a/drivers/acpi/scan.c b/drivers/acpi/scan.c index d53be7e0388d..065abe56f440 100644 --- a/drivers/acpi/scan.c +++ b/drivers/acpi/scan.c @@ -251,7 +251,6 @@ static int acpi_scan_check_and_detach(struct acpi_device *adev, void *p) { struct acpi_scan_handler *handler = adev->handler; uintptr_t flags = (uintptr_t)p; - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); acpi_dev_for_each_child_reverse(adev, acpi_scan_check_and_detach, p); @@ -315,7 +314,6 @@ static void acpi_scan_check_subtree(struct acpi_device *adev) { uintptr_t flags = ACPI_SCAN_CHECK_FLAG_STATUS; - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); acpi_scan_check_and_detach(adev, (void *)flags); } @@ -371,7 +369,6 @@ static int acpi_scan_rescan_bus(struct acpi_device *adev) { struct acpi_scan_handler *handler = adev->handler; int ret; - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); if (handler && handler->hotplug.scan_dependent) ret = handler->hotplug.scan_dependent(adev); @@ -388,7 +385,6 @@ static int acpi_scan_device_check(struct acpi_device *adev) { struct acpi_device *parent; - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); acpi_scan_check_subtree(adev); if (!acpi_device_is_present(adev)) @@ -416,24 +412,19 @@ static int 
acpi_scan_device_check(struct acpi_device *adev) static int acpi_scan_bus_check(struct acpi_device *adev) { acpi_scan_check_subtree(adev); - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); return acpi_scan_rescan_bus(adev); } static int acpi_generic_hotplug_event(struct acpi_device *adev, u32 type) { - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); switch (type) { case ACPI_NOTIFY_BUS_CHECK: - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); return acpi_scan_bus_check(adev); case ACPI_NOTIFY_DEVICE_CHECK: - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); return acpi_scan_device_check(adev); case ACPI_NOTIFY_EJECT_REQUEST: case ACPI_OST_EC_OSPM_EJECT: - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); if (adev->handler && !adev->handler->hotplug.enabled) { dev_info(&adev->dev, "Eject disabled\n"); return -EPERM; @@ -450,7 +441,6 @@ void acpi_device_hotplug(struct acpi_device *adev, u32 src) u32 ost_code = ACPI_OST_SC_NON_SPECIFIC_FAILURE; int error = -ENODEV; - dev_info(&adev->dev, "%s %d\n", __func__, __LINE__); lock_device_hotplug(); mutex_lock(&acpi_scan_lock); @@ -476,10 +466,9 @@ void acpi_device_hotplug(struct acpi_device *adev, u32 src) * There may be additional notify handlers for device objects * without the .event() callback, so ignore them here. 
*/ - if (notify) { - dev_info(&adev->dev, "%s %d: calling notify = %px\n", __func__, __LINE__, (void *) notify); + if (notify) error = notify(adev, src); - } else + else goto out; } switch (error) { diff --git a/drivers/base/power/runtime.c b/drivers/base/power/runtime.c index 895898c3cd56..27cce7f1b1d3 100644 --- a/drivers/base/power/runtime.c +++ b/drivers/base/power/runtime.c @@ -142,6 +142,8 @@ EXPORT_SYMBOL_GPL(pm_runtime_suspended_time); */ static void pm_runtime_deactivate_timer(struct device *dev) { + if (!strcmp(dev_name(dev), "0000:00:01.1")) + dev_info(dev, "%s %d\n", __func__, __LINE__); if (dev->power.timer_expires > 0) { hrtimer_try_to_cancel(&dev->power.suspend_timer); dev->power.timer_expires = 0; @@ -787,8 +789,6 @@ static int rpm_resume(struct device *dev, int rpmflags) struct device *parent = NULL; int retval = 0; - if (!strcmp(dev_name(dev), "0000:00:01.1")) - dev_info(dev, "%s %d\n", __func__, __LINE__); trace_rpm_resume(dev, rpmflags); repeat: @@ -815,7 +815,7 @@ static int rpm_resume(struct device *dev, int rpmflags) * future. 
*/ if (!strcmp(dev_name(dev), "0000:00:01.1")) - dev_info(dev, "%s %d\n", __func__, __LINE__); + dev_info(dev, "%s %d dev = %px\n", __func__, __LINE__, dev); dev->power.request = RPM_REQ_NONE; if (!dev->power.timer_autosuspends) pm_runtime_deactivate_timer(dev); @@ -1231,22 +1231,16 @@ int __pm_runtime_resume(struct device *dev, int rpmflags) { unsigned long flags; int retval; - if (!strcmp(dev_name(dev), "0000:00:01.1")) - dev_info(dev, "%s %d\n", __func__, __LINE__); might_sleep_if(!(rpmflags & RPM_ASYNC) && !dev->power.irq_safe && dev->power.runtime_status != RPM_ACTIVE); - if (!strcmp(dev_name(dev), "0000:00:01.1")) - dev_info(dev, "%s %d\n", __func__, __LINE__); if (rpmflags & RPM_GET_PUT) atomic_inc(&dev->power.usage_count); - if (!strcmp(dev_name(dev), "0000:00:01.1")) - dev_info(dev, "%s %d\n", __func__, __LINE__); spin_lock_irqsave(&dev->power.lock, flags); if (!strcmp(dev_name(dev), "0000:00:01.1")) - dev_info(dev, "%s %d\n", __func__, __LINE__); + dev_info(dev, "%s %d dev = %px\n", __func__, __LINE__, dev); retval = rpm_resume(dev, rpmflags); spin_unlock_irqrestore(&dev->power.lock, flags); diff --git a/drivers/pci/hotplug/acpiphp_glue.c b/drivers/pci/hotplug/acpiphp_glue.c index e56ab308da20..e21255b97251 100644 --- a/drivers/pci/hotplug/acpiphp_glue.c +++ b/drivers/pci/hotplug/acpiphp_glue.c @@ -484,7 +484,6 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge) struct pci_dev *dev; struct pci_bus *bus = slot->bus; struct acpiphp_func *func; - printk(KERN_INFO "%s %d\n", __func__, __LINE__); if (bridge && bus->self && hotplug_is_native(bus->self)) { /* @@ -495,14 +494,11 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge) * as a Thunderbolt host controller. 
*/ for_each_pci_bridge(dev, bus) { - dev_info(&dev->dev, "%s %d\n", __func__, __LINE__); if (PCI_SLOT(dev->devfn) == slot->device) { - dev_info(&dev->dev, "%s %d\n", __func__, __LINE__); acpiphp_native_scan_bridge(dev); } } } else { - printk(KERN_INFO "%s %d\n", __func__, __LINE__); LIST_HEAD(add_list); int max, pass; @@ -510,15 +506,12 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge) max = acpiphp_max_busnr(bus); for (pass = 0; pass < 2; pass++) { for_each_pci_bridge(dev, bus) { - dev_info(&dev->dev, "%s %d\n", __func__, __LINE__); if (PCI_SLOT(dev->devfn) != slot->device) { - printk(KERN_INFO "%s %d\n", __func__, __LINE__); continue; } max = pci_scan_bridge(bus, dev, max, pass); if (pass && dev->subordinate) { - dev_info(&dev->dev, "%s %d\n", __func__, __LINE__); check_hotplug_bridge(slot, dev); pcibios_resource_survey_bus(dev->subordinate); __pci_bus_size_bridges(dev->subordinate, @@ -535,7 +528,6 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge) list_for_each_entry(dev, &bus->devices, bus_list) { /* Assume that newly added devices are powered on already. 
*/ - dev_info(&dev->dev, "%s %d\n", __func__, __LINE__); if (!pci_dev_is_added(dev)) dev->current_state = PCI_D0; } @@ -554,7 +546,6 @@ static void enable_slot(struct acpiphp_slot *slot, bool bridge) } pci_dev_put(dev); } - printk(KERN_INFO "%s %d\n", __func__, __LINE__); } /** @@ -823,7 +814,6 @@ static void hotplug_event(u32 type, struct acpiphp_context *context) switch (type) { case ACPI_NOTIFY_BUS_CHECK: /* bus re-enumerate */ - printk(KERN_INFO "%s %d: ACPI_NOTIFY_BUS_CHECK\n", __func__, __LINE__); acpi_handle_debug(handle, "Bus check in %s()\n", __func__); if (bridge) acpiphp_check_bridge(bridge); @@ -834,7 +824,6 @@ static void hotplug_event(u32 type, struct acpiphp_context *context) case ACPI_NOTIFY_DEVICE_CHECK: /* device check */ - printk(KERN_INFO "%s %d: ACPI_NOTIFY_DEVICE_CHECK\n", __func__, __LINE__); acpi_handle_debug(handle, "Device check in %s()\n", __func__); if (bridge) { acpiphp_check_bridge(bridge); @@ -850,23 +839,19 @@ static void hotplug_event(u32 type, struct acpiphp_context *context) case ACPI_NOTIFY_EJECT_REQUEST: /* request device eject */ - printk(KERN_INFO "%s %d: ACPI_NOTIFY_EJECT_REQUEST\n", __func__, __LINE__); acpi_handle_debug(handle, "Eject request in %s()\n", __func__); acpiphp_disable_and_eject_slot(slot); break; } pci_unlock_rescan_remove(); - printk(KERN_INFO "%s %d:\n", __func__, __LINE__); if (bridge) put_bridge(bridge); - printk(KERN_INFO "%s %d:\n", __func__, __LINE__); } static int acpiphp_hotplug_notify(struct acpi_device *adev, u32 type) { struct acpiphp_context *context; - dev_info(&adev->dev, "%s %d: %s = %px\n", __func__, __LINE__, __func__, (void *) acpiphp_hotplug_notify); context = acpiphp_grab_context(adev); if (!context) diff --git a/include/linux/pm_runtime.h b/include/linux/pm_runtime.h index 0888b0d5ec73..d88d6b6ccf5b 100644 --- a/include/linux/pm_runtime.h +++ b/include/linux/pm_runtime.h @@ -508,8 +508,6 @@ static inline int pm_runtime_get(struct device *dev) */ static inline int pm_runtime_get_sync(struct 
device *dev)
 {
-	if (!strcmp(dev_name(dev), "0000:00:01.1"))
-		dev_info(dev, "%s\n", __func__);
 	return __pm_runtime_resume(dev, RPM_GET_PUT);
 }
 

This is the output from 6.17.0-rc6-next-20250917-gpudebug-00028-gf99cf81b1da7
when crashing, captured via netconsole:

2025-10-06T04:52:35.932429+02:00 [T248]evmisc-0132 ev_queue_notify_reques: Dispatching Notify on [GPP0] (Device) Value 0x00 (Bus Check) Node 0000000069c9623b
2025-10-06T04:52:35.932429+02:00 [T177395]pcieport 0000:00:01.1: acpiphp_check_bridge 708#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
2025-10-06T04:52:35.932429+02:00 [T177395]pcieport 0000:00:01.1: __pm_runtime_resume 1243 dev = ffff97c001c930c8#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
2025-10-06T04:52:35.932429+02:00 [177395]pcieport 0000:00:01.1: rpm_resume 818 dev = ffff97c001c930c8#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
2025-10-06T04:52:35.932429+02:00 [177395]pcieport 0000:00:01.1: pm_runtime_deactivate_timer 146#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
2025-10-06T04:52:35.932429+02:00 [177395]pcieport 0000:00:01.1: rpm_resume 930#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
2025-10-06T04:52:35.932429+02:00 [177395]pcieport 0000:00:01.1: rpm_resume 959#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
2025-10-06T04:52:35.932429+02:00 [177395]pcieport 0000:00:01.1: rpm_resume 818 dev = ffff97c001c930c8#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
2025-10-06T04:52:35.932429+02:00 [177395]pcieport 0000:00:01.1: pm_runtime_deactivate_timer 146#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
2025-10-06T04:52:35.932429+02:00 [177395]pcieport 0000:00:01.1: rpm_resume 965#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1

So the crash seems to happen in this part of rpm_resume():

[...]
 skip_parent:
	if (!strcmp(dev_name(dev), "0000:00:01.1"))
		dev_info(dev, "%s %d\n", __func__, __LINE__); // this is the last reported line
	if (dev->power.no_callbacks)
		goto no_callback;	/* Assume success. */

	__update_runtime_status(dev, RPM_RESUMING);
	callback = RPM_GET_CALLBACK(dev, runtime_resume);
	dev_pm_disable_wake_irq_check(dev, false);
	retval = rpm_callback(callback, dev);
	if (retval) {
		if (!strcmp(dev_name(dev), "0000:00:01.1"))
			dev_info(dev, "%s %d\n", __func__, __LINE__);
		__update_runtime_status(dev, RPM_SUSPENDED);
		pm_runtime_cancel_pending(dev);
		dev_pm_enable_wake_irq_check(dev, false);
	} else {
 no_callback:
		if (!strcmp(dev_name(dev), "0000:00:01.1"))
			dev_info(dev, "%s %d\n", __func__, __LINE__);
[...]

Bert Karwatzki
* [REGRESSION 04/04] Crash during resume of pcie bridge
2025-10-06 12:09 [REGRESSION 00/04] Crash during resume of pcie bridge Bert Karwatzki
` (2 preceding siblings ...)
2025-10-06 12:09 ` [REGRESSION 03/04] " Bert Karwatzki
@ 2025-10-06 12:09 ` Bert Karwatzki
2025-10-06 12:39 ` [REGRESSION 00/04] " Christian König
2025-10-07 21:33 ` Mario Limonciello
5 siblings, 0 replies; 31+ messages in thread
From: Bert Karwatzki @ 2025-10-06 12:09 UTC (permalink / raw)
To: linux-kernel
Cc: Bert Karwatzki, linux-next, linux-stable, regressions, linux-pci,
linux-acpi, Mario Limonciello, Christian König,
Rafael J . Wysocki

To further close in on the crash we'll continue testing with
6.17.0-rc6-next-20250917-gpudebug-00029-ge797f42363d1, which adds more
dev_info()s to the critical part of rpm_resume() and removes some unneeded ones:

commit e797f42363d101b146971ec4d7e6c90bcc4064cd
Author: Bert Karwatzki <spasswolf@web.de>
Date:   Mon Oct 6 12:17:16 2025 +0200

    power: runtime: and more dev_info()s to rpm_resume()

    Signed-off-by: Bert Karwatzki <spasswolf@web.de>

diff --git a/drivers/base/power/runtime.c b/drivers/base/power/runtime.c
index 27cce7f1b1d3..c99dac998047 100644
--- a/drivers/base/power/runtime.c
+++ b/drivers/base/power/runtime.c
@@ -793,12 +793,8 @@ static int rpm_resume(struct device *dev, int rpmflags)
 
  repeat:
 	if (dev->power.runtime_error) {
-		if (!strcmp(dev_name(dev), "0000:00:01.1"))
-			dev_info(dev, "%s %d\n", __func__, __LINE__);
 		retval = -EINVAL;
 	} else if (dev->power.disable_depth > 0) {
-		if (!strcmp(dev_name(dev), "0000:00:01.1"))
-			dev_info(dev, "%s %d\n", __func__, __LINE__);
 		if (dev->power.runtime_status == RPM_ACTIVE &&
 		    dev->power.last_status == RPM_ACTIVE)
 			retval = 1;
@@ -887,32 +883,22 @@ static int rpm_resume(struct device *dev, int rpmflags)
 	 * the resume will actually succeed.
 	 */
 	if (dev->power.no_callbacks && !parent && dev->parent) {
-		if (!strcmp(dev_name(dev), "0000:00:01.1"))
-			dev_info(dev, "%s %d\n", __func__, __LINE__);
 		spin_lock_nested(&dev->parent->power.lock, SINGLE_DEPTH_NESTING);
 		if (dev->parent->power.disable_depth > 0 ||
 		    dev->parent->power.ignore_children ||
 		    dev->parent->power.runtime_status == RPM_ACTIVE) {
-			if (!strcmp(dev_name(dev), "0000:00:01.1"))
-				dev_info(dev, "%s %d\n", __func__, __LINE__);
 			atomic_inc(&dev->parent->power.child_count);
 			spin_unlock(&dev->parent->power.lock);
 			retval = 1;
 			goto no_callback;	/* Assume success. */
 		}
 		spin_unlock(&dev->parent->power.lock);
-		if (!strcmp(dev_name(dev), "0000:00:01.1"))
-			dev_info(dev, "%s %d\n", __func__, __LINE__);
 	}
 
 	/* Carry out an asynchronous or a synchronous resume. */
 	if (rpmflags & RPM_ASYNC) {
-		if (!strcmp(dev_name(dev), "0000:00:01.1"))
-			dev_info(dev, "%s %d\n", __func__, __LINE__);
 		dev->power.request = RPM_REQ_RESUME;
 		if (!dev->power.request_pending) {
-			if (!strcmp(dev_name(dev), "0000:00:01.1"))
-				dev_info(dev, "%s %d\n", __func__, __LINE__);
 			dev->power.request_pending = true;
 			queue_work(pm_wq, &dev->power.work);
 		}
@@ -929,8 +915,11 @@ static int rpm_resume(struct device *dev, int rpmflags)
 		if (!strcmp(dev_name(dev), "0000:00:01.1"))
 			dev_info(dev, "%s %d\n", __func__, __LINE__);
 		parent = dev->parent;
-		if (dev->power.irq_safe)
+		if (dev->power.irq_safe) {
+			if (!strcmp(dev_name(dev), "0000:00:01.1"))
+				dev_info(dev, "%s %d\n", __func__, __LINE__);
 			goto skip_parent;
+		}
 
 		spin_unlock(&dev->power.lock);
 
@@ -966,12 +955,22 @@ static int rpm_resume(struct device *dev, int rpmflags)
 	if (dev->power.no_callbacks)
 		goto no_callback;	/* Assume success. */
 
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	__update_runtime_status(dev, RPM_RESUMING);
 
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	callback = RPM_GET_CALLBACK(dev, runtime_resume);
 
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d callback = %0x\n", __func__, __LINE__, (void *) callback);
 	dev_pm_disable_wake_irq_check(dev, false);
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	retval = rpm_callback(callback, dev);
+	if (!strcmp(dev_name(dev), "0000:00:01.1"))
+		dev_info(dev, "%s %d\n", __func__, __LINE__);
 	if (retval) {
 		if (!strcmp(dev_name(dev), "0000:00:01.1"))
 			dev_info(dev, "%s %d\n", __func__, __LINE__);

This test is currently running (booted 13:05, 6.10.2025) and I expect a crash
only after at least 24h of runtime.

Bert Karwatzki

^ permalink raw reply related [flat|nested] 31+ messages in thread
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-10-06 12:09 [REGRESSION 00/04] Crash during resume of pcie bridge Bert Karwatzki
` (3 preceding siblings ...)
2025-10-06 12:09 ` [REGRESSION 04/04] " Bert Karwatzki
@ 2025-10-06 12:39 ` Christian König
2025-10-06 16:22 ` Bert Karwatzki
2025-10-07 21:33 ` Mario Limonciello
5 siblings, 1 reply; 31+ messages in thread
From: Christian König @ 2025-10-06 12:39 UTC (permalink / raw)
To: Bert Karwatzki, linux-kernel
Cc: linux-next, linux-stable, regressions, linux-pci, linux-acpi,
Mario Limonciello, Rafael J . Wysocki

On 06.10.25 14:09, Bert Karwatzki wrote:
> Since linux version v6.15 I experience random crashes on my MSI Alpha 15 Laptop
> running debian trixie (amd64). The first such crash happened around the middle
> of June, and as there were no useful log messages and even using netconsole
> gave no useful message I suspected faulty hardware. So I ran memtest86+ and
> found a faulty address line and replaced the memory (unfortunately 64G to 16G).
> But the crashes occurred again and so I did a thorough investigation.
>
> The crashes occur after 30min to 33h (yes, hours) of uptime and consist of a
> sudden reboot after which the PCI bridge at 00:02.4 and the nvme device
> connected to it are missing. If there's sound running during the crash then the
> first sign of the crash is the sound looping like a broken record for about 2s,
> after which the reboot happens. With the missing nvme device the reboot drops to
> a rescue shell. Using "shutdown -h now" from that shell and starting the laptop
> with the power button restores the missing PCI bridge and nvme device.

Oh well, it sounds like some PCIe device is dropping off the bus and taking its
upstream bridge with it.

> As the bisections were not successful I tried to monitor the crash using
> netconsole and CONFIG_ACPI_DEBUG and "acpi.debug_layer=0xf acpi.debug_level=0x107"
> as command line parameters. With this the last message on netconsole before
> the crash is usually:
>
> [21465.639279] [ T251] evmisc-0132 ev_queue_notify_reques: Dispatching Notify on [GPP0] (Device) Value 0x00 (Bus Check) Node 00000000f81f36b8

A full dump of that might be helpful. That sounds like the dGPU is powering up/down.

>
> GPP0 is the ACPI name of this PCI bridge (at least that's my best guess):
>
> 00:01.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe GPP Bridge [1022:1633]
>
> to which the discrete GPU is connected
>
> 03:00.0 Display controller [0380]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] [1002:73ff] (rev c3)
>
> via the pci express switch
>
> 01:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch [1002:1478] (rev c3)
> 02:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch [1002:1479]
>
> While the GUI (xfce on xorg) on my laptop runs on the built-in GPU the discrete
> GPU usually wakes up quite often, e.g. when a window is opened or when scrolling down on youtube.

Yeah, that is a known issue and we are working on it.

Basically an application enumerates the possible render or video decode devices
in the system and that wakes up the dGPU even when it isn't actually used.

> A somewhat reliable method to generate GPP0 notifies is putting on a youtube
> video and then periodically starting evolution with this script:
>
> #!/bin/bash
> for i in {0..1000}
> do
> 	echo $i
> 	evolution &
> 	sleep 5
> 	killall evolution
> 	sleep 55
> done
>
> This is also the method I used to test the debug kernel in the following mails.

To further narrow down the issue, please run your laptop with amdgpu.runpm=0 on
the kernel command line for a while and see if that is stable or not.

Thanks,
Christian.

>
> Bert Karwatzki

^ permalink raw reply [flat|nested] 31+ messages in thread
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-10-06 12:39 ` [REGRESSION 00/04] " Christian König
@ 2025-10-06 16:22 ` Bert Karwatzki
2025-10-07 6:50 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-10-06 16:22 UTC (permalink / raw)
To: Christian König, linux-kernel
Cc: linux-next, linux-stable, regressions, linux-pci, linux-acpi,
Mario Limonciello, Rafael J . Wysocki, spasswolf

On Monday, 06.10.2025 at 14:39 +0200, Christian König wrote:
> On 06.10.25 14:09, Bert Karwatzki wrote:
> > Since linux version v6.15 I experience random crashes on my MSI Alpha 15 Laptop
> > running debian trixie (amd64). The first such crash happened around the middle
> > of June, and as there were no useful log messages and even using netconsole
> > gave no useful message I suspected faulty hardware. So I ran memtest86+ and
> > found a faulty address line and replaced the memory (unfortunately 64G to 16G).
> > But the crashes occurred again and so I did a thorough investigation.
> >
> > The crashes occur after 30min to 33h (yes, hours) of uptime and consist of a
> > sudden reboot after which the PCI bridge at 00:02.4 and the nvme device
> > connected to it are missing. If there's sound running during the crash then the
> > first sign of the crash is the sound looping like a broken record for about 2s,
> > after which the reboot happens. With the missing nvme device the reboot drops to
> > a rescue shell. Using "shutdown -h now" from that shell and starting the laptop
> > with the power button restores the missing PCI bridge and nvme device.
>
> Oh well, it sounds like some PCIe device is dropping off the bus and taking its
> upstream bridge with it.
>
> > As the bisections were not successful I tried to monitor the crash using
> > netconsole and CONFIG_ACPI_DEBUG and "acpi.debug_layer=0xf acpi.debug_level=0x107"
> > as command line parameters. With this the last message on netconsole before
> > the crash is usually:
> >
> > [21465.639279] [ T251] evmisc-0132 ev_queue_notify_reques: Dispatching Notify on [GPP0] (Device) Value 0x00 (Bus Check) Node 00000000f81f36b8
>
> A full dump of that might be helpful. That sounds like the dGPU is powering up/down.

Yes, that's what's happening.

>
> >
> > GPP0 is the ACPI name of this PCI bridge (at least that's my best guess):
> >
> > 00:01.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe GPP Bridge [1022:1633]
> >
> > to which the discrete GPU is connected
> >
> > 03:00.0 Display controller [0380]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] [1002:73ff] (rev c3)
> >
> > via the pci express switch
> >
> > 01:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch [1002:1478] (rev c3)
> > 02:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch [1002:1479]
> >
> > While the GUI (xfce on xorg) on my laptop runs on the built-in GPU the discrete
> > GPU usually wakes up quite often, e.g. when a window is opened or when scrolling down on youtube.
>
> Yeah, that is a known issue and we are working on it.

Until linux v6.15 this didn't cause any harm.

>
> Basically an application enumerates the possible render or video decode devices
> in the system and that wakes up the dGPU even when it isn't actually used.
>
> > A somewhat reliable method to generate GPP0 notifies is putting on a youtube
> > video and then periodically starting evolution with this script:
> >
> > #!/bin/bash
> > for i in {0..1000}
> > do
> > 	echo $i
> > 	evolution &
> > 	sleep 5
> > 	killall evolution
> > 	sleep 55
> > done
> >
> > This is also the method I used to test the debug kernel in the following mails.
>
> To further narrow down the issue, please run your laptop with amdgpu.runpm=0 on
> the kernel command line for a while and see if that is stable or not.
>

Even versions that did crash can be stable for 24h of uptime so I think this
will take too long.
I think I've already chased down the crash to this part of rpm_resume()
(I'm currently doing a test run with more dev_info()s in this part):

 skip_parent:

	if (!strcmp(dev_name(dev), "0000:00:01.1"))
		dev_info(dev, "%s %d\n", __func__, __LINE__); // this is the last reported line in netconsole
	if (dev->power.no_callbacks)
		goto no_callback;	/* Assume success. */

	__update_runtime_status(dev, RPM_RESUMING);

	callback = RPM_GET_CALLBACK(dev, runtime_resume);

	dev_pm_disable_wake_irq_check(dev, false);
	retval = rpm_callback(callback, dev);
	if (retval) {
		__update_runtime_status(dev, RPM_SUSPENDED);
		pm_runtime_cancel_pending(dev);
		dev_pm_enable_wake_irq_check(dev, false);
	} else {
 no_callback:


Bert Karwatzki

^ permalink raw reply [flat|nested] 31+ messages in thread
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-10-06 16:22 ` Bert Karwatzki
@ 2025-10-07 6:50 ` Bert Karwatzki
0 siblings, 0 replies; 31+ messages in thread
From: Bert Karwatzki @ 2025-10-07 6:50 UTC (permalink / raw)
To: Christian König, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Mario Limonciello,
Rafael J . Wysocki, spasswolf

On Monday, 06.10.2025 at 18:22 +0200, Bert Karwatzki wrote:
> >
>
> Even versions that did crash can be stable for 24h of uptime so I think this
> will take too long.
> I think I've already chased down the crash to this part of rpm_resume()
> (I'm currently doing a test run with more dev_info()s in this part):
>
>  skip_parent:
>
> 	if (!strcmp(dev_name(dev), "0000:00:01.1"))
> 		dev_info(dev, "%s %d\n", __func__, __LINE__); // this is the last reported line in netconsole
> 	if (dev->power.no_callbacks)
> 		goto no_callback;	/* Assume success. */
>
> 	__update_runtime_status(dev, RPM_RESUMING);
>
> 	callback = RPM_GET_CALLBACK(dev, runtime_resume);
>
> 	dev_pm_disable_wake_irq_check(dev, false);
> 	retval = rpm_callback(callback, dev);
> 	if (retval) {
> 		__update_runtime_status(dev, RPM_SUSPENDED);
> 		pm_runtime_cancel_pending(dev);
> 		dev_pm_enable_wake_irq_check(dev, false);
> 	} else {
>  no_callback:
>
>
> Bert Karwatzki

The test run has already finished: the crash occurred after 10h and ~700 GPP0
notifies. The part of rpm_resume() above was monitored like this:

 skip_parent:
	if (!strcmp(dev_name(dev), "0000:00:01.1"))
		dev_info(dev, "%s %d\n", __func__, __LINE__);
	if (dev->power.no_callbacks)
		goto no_callback;	/* Assume success. */

	if (!strcmp(dev_name(dev), "0000:00:01.1"))
		dev_info(dev, "%s %d\n", __func__, __LINE__);
	__update_runtime_status(dev, RPM_RESUMING);

	if (!strcmp(dev_name(dev), "0000:00:01.1"))
		dev_info(dev, "%s %d\n", __func__, __LINE__);
	callback = RPM_GET_CALLBACK(dev, runtime_resume);

	if (!strcmp(dev_name(dev), "0000:00:01.1"))
		dev_info(dev, "%s %d callback = %px\n", __func__, __LINE__, (void *) callback);
	dev_pm_disable_wake_irq_check(dev, false);
	if (!strcmp(dev_name(dev), "0000:00:01.1"))
		dev_info(dev, "%s %d\n", __func__, __LINE__); // This is the last reported line!
	retval = rpm_callback(callback, dev);
	if (!strcmp(dev_name(dev), "0000:00:01.1"))
		dev_info(dev, "%s %d\n", __func__, __LINE__);
	if (retval) {
		if (!strcmp(dev_name(dev), "0000:00:01.1"))
			dev_info(dev, "%s %d\n", __func__, __LINE__);
		__update_runtime_status(dev, RPM_SUSPENDED);
		pm_runtime_cancel_pending(dev);
		dev_pm_enable_wake_irq_check(dev, false);
	} else {
 no_callback:

The result is that, in the crashing case, rpm_callback() did not return, so I'll
continue the investigation inside rpm_callback(). The whole call trace is:

acpiphp_check_bridge()->pm_runtime_get_sync()->__pm_runtime_resume()->rpm_resume()->rpm_callback()

Bert Karwatzki

^ permalink raw reply [flat|nested] 31+ messages in thread
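
The per-device trace statements used above repeat the same strcmp()/dev_info()
pair at every step. A minimal userspace sketch of the same "last printed line"
technique follows. It assumes a mock device type and a TRACE_DEV() helper macro,
both hypothetical and not part of the kernel or of the posted patches; printf()
stands in for dev_info().

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for the kernel's struct device; only the name is modelled. */
struct mock_device {
	const char *name;
};

/* Bookkeeping so a caller can tell how many trace points fired and which was last. */
static int trace_count;
static int trace_last_line;

/*
 * Emit "<name>: <function> <line>" only for the one device under
 * investigation, mirroring the strcmp()/dev_info() pairs in the thread.
 */
#define TRACE_DEV(dev, target)						\
	do {								\
		if (!strcmp((dev)->name, (target))) {			\
			trace_count++;					\
			trace_last_line = __LINE__;			\
			printf("%s: %s %d\n", (dev)->name,		\
			       __func__, __LINE__);			\
		}							\
	} while (0)

/* Mock resume path: one trace point before and one after the callback. */
static int mock_resume(struct mock_device *dev,
		       int (*callback)(struct mock_device *))
{
	int retval;

	TRACE_DEV(dev, "0000:00:01.1");
	retval = callback(dev);
	/* If this second line never shows up in the log, the callback did not return. */
	TRACE_DEV(dev, "0000:00:01.1");
	return retval;
}

/* Trivial callback that always succeeds. */
static int mock_callback_ok(struct mock_device *dev)
{
	(void)dev;
	return 0;
}
```

Calling mock_resume() with a device named "0000:00:01.1" prints two lines, while
any other device name prints nothing, which keeps the log volume (and the timing
disturbance from printing) limited to the bridge of interest.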
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge 2025-10-06 12:09 [REGRESSION 00/04] Crash during resume of pcie bridge Bert Karwatzki ` (4 preceding siblings ...) 2025-10-06 12:39 ` [REGRESSION 00/04] " Christian König @ 2025-10-07 21:33 ` Mario Limonciello 2025-10-13 16:29 ` Bert Karwatzki 5 siblings, 1 reply; 31+ messages in thread From: Mario Limonciello @ 2025-10-07 21:33 UTC (permalink / raw) To: Bert Karwatzki, linux-kernel Cc: linux-next, linux-stable, regressions, linux-pci, linux-acpi, Christian König, Rafael J . Wysocki On 10/6/25 7:09 AM, Bert Karwatzki wrote: > Since linux version v6.15 I experience random crashes on my MSI Alpha 15 Laptop > running debian trixie (amd64). The first such crash happened about in the midth > of june, and as there were no useful log messages and even using netconsole > gave no useful message I suspected faulty hardware. So I ran memtest86+ and > found a faulty address line and replaced the memory (unfortunately 64G to 16G). > But the crashes occured again and so I did a thorough investigation. > > The crashes occur after 30min to 33h (yes, hours) of uptime and consist of a > sudden reboot after which the PCI bridge at 00:02.4 and the nvme device > connected to it are missing. If there's sound running during the crash then the > first sign of the crash is the sound looping like a broken record for about 2s, > after which the reboot happens. With the missing nvme device the reboot drops to > a rescue shell. Using "shutdown -h now" from that shell and starting the laptop > with the power button restores the missing PCI bridge and nvme device. 
> > The hardware is the following (it's a dual GPU laptop where the GUI > runs on the built-in GPU): > > $ cat /proc/cpuinfo > processor : 0 > vendor_id : AuthenticAMD > cpu family : 25 > model : 80 > model name : AMD Ryzen 7 5800H with Radeon Graphics > stepping : 0 > microcode : 0xa50000c > cpu MHz : 3394.238 > cache size : 512 KB > physical id : 0 > siblings : 16 > core id : 0 > cpu cores : 8 > apicid : 0 > initial apicid : 0 > fpu : yes > fpu_exception : yes > cpuid level : 16 > wp : yes > flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba ibrs ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd cppc arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl umip pku ospke vaes vpclmulqdq rdpid overflow_recov succor smca fsrm debug_swap > bugs : sysret_ss_attrs spectre_v1 spectre_v2 spec_store_bypass srso ibpb_no_ret > bogomips : 6388.57 > TLB size : 2560 4K pages > clflush size : 64 > cache_alignment : 64 > address sizes : 48 bits physical, 48 bits virtual > power management: ts ttp tm hwpstate cpb eff_freq_ro [13] [14] > > $ lspci -nn > 00:00.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne Root Complex [1022:1630] > 00:00.2 IOMMU [0806]: Advanced Micro Devices, Inc. 
[AMD] Renoir/Cezanne IOMMU [1022:1631] > 00:01.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge [1022:1632] > 00:01.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe GPP Bridge [1022:1633] > 00:02.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge [1022:1632] > 00:02.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge [1022:1634] > 00:02.2 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge [1022:1634] > 00:02.3 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge [1022:1634] > 00:02.4 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge [1022:1634] > 00:08.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge [1022:1632] > 00:08.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir Internal PCIe GPP Bridge to Bus [1022:1635] > 00:14.0 SMBus [0c05]: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller [1022:790b] (rev 51) > 00:14.3 ISA bridge [0601]: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge [1022:790e] (rev 51) > 00:18.0 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 0 [1022:166a] > 00:18.1 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 1 [1022:166b] > 00:18.2 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 2 [1022:166c] > 00:18.3 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 3 [1022:166d] > 00:18.4 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 4 [1022:166e] > 00:18.5 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 5 [1022:166f] > 00:18.6 Host bridge [0600]: Advanced Micro Devices, Inc. 
[AMD] Cezanne Data Fabric; Function 6 [1022:1670] > 00:18.7 Host bridge [0600]: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 7 [1022:1671] > 01:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch [1002:1478] (rev c3) > 02:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch [1002:1479] > 03:00.0 Display controller [0380]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] [1002:73ff] (rev c3) > 03:00.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 21/23 HDMI/DP Audio Controller [1002:ab28] > 04:00.0 Network controller [0280]: MEDIATEK Corp. MT7921K (RZ608) Wi-Fi 6E 80MHz [14c3:0608] > 05:00.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet Controller [10ec:8168] (rev 15) > 06:00.0 Non-Volatile memory controller [0108]: Kingston Technology Company, Inc. KC3000/FURY Renegade NVMe SSD [E18] [2646:5013] (rev 01) > 07:00.0 Non-Volatile memory controller [0108]: Micron/Crucial Technology P1 NVMe PCIe SSD[Frampton] [c0a9:2263] (rev 03) > 08:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Cezanne [Radeon Vega Series / Radeon Vega Mobile Series] [1002:1638] (rev c5) > 08:00.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Renoir Radeon High Definition Audio Controller [1002:1637] > 08:00.2 Encryption controller [1080]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor [1022:15df] > 08:00.3 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1 [1022:1639] > 08:00.4 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1 [1022:1639] > 08:00.5 Multimedia controller [0480]: Advanced Micro Devices, Inc. 
[AMD] Audio Coprocessor [1022:15e2] (rev 01) > 08:00.6 Audio device [0403]: Advanced Micro Devices, Inc. [AMD] Family 17h/19h/1ah HD Audio Controller [1022:15e3] > 08:00.7 Signal processing controller [1180]: Advanced Micro Devices, Inc. [AMD] Sensor Fusion Hub [1022:15e4] > > These devices are attached to the PCI bus like this: > > $ lspci -t > -[0000:00]-+-00.0 > +-00.2 > +-01.0 > +-01.1-[01-03]----00.0-[02-03]----00.0-[03]--+-00.0 // This is the bridge which causes the crash > | \-00.1 > +-02.0 > +-02.1-[04]----00.0 > +-02.2-[05]----00.0 > +-02.3-[06]----00.0 > +-02.4-[07]----00.0 // These are the bridge and nvme device which disappear after the crash. > +-08.0 > +-08.1-[08]--+-00.0 > | +-00.1 > | +-00.2 > | +-00.3 > | +-00.4 > | +-00.5 > | +-00.6 > | \-00.7 > +-14.0 > +-14.3 > +-18.0 > +-18.1 > +-18.2 > +-18.3 > +-18.4 > +-18.5 > +-18.6 > \-18.7 > > I tried to bisect this between v6.14 and v6.15 but due to the wildly varying time > it takes to trigger the bug the bisections were not successful. Nevertheless they > gave lots of data about affected and non-affected version of the linux kernel, > and it's quite likely that version v6.14 is indeed free of the bug. 
> > Here's an almost complete list of tested versions: > (Somewhat) sorted (by kernel version, 6.14.0-rc* kernels are from attempted bisections > between v6.14 and v6.15) > v6.14.0 no crash after 16h > v6.14.11 no crash after 7.5h > 6.14.0-rc1-bisect-00003-g541ddf31e300 booted 12:24, 22.8.2025, no crash after {48h, 17h} > 6.14.0-rc1-mystery-00134-gcc28c0e5e725 booted 11:42, 5.8.2025, no crash after 10.5h > 6.14.0-rc1-mystery-00198-gd7f6f07ecec9 booted 22:27, 5.8.2025, no crash after 12h > 6.14.0-rc4-mystery-01022-gab498828fad7 booted 21:04, 3.8.2025, no crash after {14h, 24h} > 6.14.0-rc4-mystery-01427-g7547510d4a91 booted 11:11, 4.8.2025, no crash after {13h, 23h} > 6.14.0-rc6-mystery-01641-g0f04462874e1 booted 00:26, 5.8.2025, no crash after {11h, 24h} > 6.14.0-mystery-00826-g327ecdbc0fda no crash after {16h, 17h, 6.5h} > ############## here the crashes start (time to each crash, crashes do not always occur) ######## > 6.14.0-bisect-01053-gebfb94d87b35 booted 10:15, 20.8.2025 crash after ~33h > 6.14.0-mystery-09584-g7d06015d936c crash 20.44 3.8.2025 after 7h > 6.14.0-mystery-11703-geb0ece16027f crash 13.22 3.8.2025 after 1.75h > 6.15.0 crashed around 15-17.6.2025, unknown uptime (This is the first crash!) > 6.15.0-nort crash after 6.75h > 6.16-rc4 (next-20250627) crash after ~4h > 6.16-rc4 (next-20250630) crash after ~5h > 6.16-rc4 (next-20250703) crash after ~2.5h (sound buffer repeated for ~1s before restarting) > 6.16-rc6 (next-20250718) crash after {2h, 2h} > 6.16-rc7 (next-20250721) crash after {~30min, 2h, 5.5h} > 6.16.0-nortlockdep crash after 4h > 6.17.0-rc4-next-20250902-master booted 8:36, 3.9.2025, crash after ~3.5h > 6.17.0-rc5-next-20250908-master booted 10:25, 9.9.2025, crash after {~6.5h, 14h} > 6.17.0-rc6-next-20250917-acpidebug booted 12:41, 20.9.2025, crash 15:22 20.8.2025 (~3h, 647 GPP notifies) > The versions below contain additional debugging printk()s and dev_info()s. > The details of these debugging statements are explained below. 
> 6.17.0-rc6-next-20250917-gpudebug-00018-g7a38b625a003 booted 12:58, 26.9.2025, crash 12:01, 27.9.2025 (~23h, 1500 GPP notifies) > 6.17.0-rc6-next-20250917-gpudebug-00021-gab98d880e3c8 booted 23:52, 28.9.2025, crash 2:25, 30.9.2025 (26.5h, 1504GPP0, 889GPP2) > 6.17.0-rc6-next-20250917-gpudebug-00024-g5c6b49b810db booted 9:10, 2.10.2025, 60h 3093 GPP0 notifies without crash (too many printk()s?) > 6.17.0-rc6-next-20250917-gpudebug-00028-gf99cf81b1da7 booted 21:21, 4.10.2025 first try stopped after 77min due to hung tasks > 6.17.0-rc6-next-20250917-gpudebug-00028-gf99cf81b1da7 booted 23:37, 4.10.2025 crash 4:52, 6.10.2025 (~27.5h) > 6.17.0-rc6-next-20250917-gpudebug-00029-ge797f42363d1 booted 13:00, 6.10.2025 currently testing > > As the bisections were not succesfull I tried to monitor the crash using > netconsole and CONFIG_ACPI_DEBUG and "acpi.debug_layer=0xf acpi.debug_level=0x107" > as command line parameters. With this the last message on netconsole before > the crash is usually: > > [21465.639279] [ T251] evmisc-0132 ev_queue_notify_reques: Dispatching Notify on [GPP0] (Device) Value 0x00 (Bus Check) Node 00000000f81f36b8 > > GPP0 is the ACPI name of this PCI bridge (at least that's my best guess): > > 00:01.1 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD] Renoir PCIe GPP Bridge [1022:1633] > > to which the discrete GPU is connected > > 03:00.0 Display controller [0380]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] [1002:73ff] (rev c3) > > via the pci express switch > > 01:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch [1002:1478] (rev c3) > 02:00.0 PCI bridge [0604]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch [1002:1479] > > While the GUI (xfce on xorg) on my laptop runs on the built-in GPU the discrete > GPU usually wakes up quite often, e.g. when a window is opened or when scrolling down on youtube. 
>
> A somewhat reliable method to generate GPP0 notifies is putting on a youtube
> video and then periodically starting evolution with this script:
>
> #!/bin/bash
> for i in {0..1000}
> do
> 	echo $i
> 	evolution &
> 	sleep 5
> 	killall evolution
> 	sleep 55
> done
>
> This is also the method I used to test the debug kernel in the following mails.
>
> Bert Karwatzki

Given that the perpetrator and the victim here don't share a common upstream
root port (the only thing they have in common is the root complex), I wonder if
this is actually an issue with something non-obvious like the IOMMU.

Can you still reproduce with amd_iommu=off?

^ permalink raw reply [flat|nested] 31+ messages in thread
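
When testing with parameters such as amd_iommu=off or amdgpu.runpm=0, it can be
worth confirming that the running kernel was really booted with them before
starting a test run that may last a day or more. A minimal sketch of such a
check follows. The helper name cmdline_has_param() is hypothetical, and the
caller is assumed to have read the contents of /proc/cmdline into a string.

```c
#include <stdbool.h>
#include <string.h>

/*
 * Return true if `param` appears as a whole whitespace-separated token
 * in `cmdline` (e.g. the contents of /proc/cmdline).
 */
static bool cmdline_has_param(const char *cmdline, const char *param)
{
	size_t plen = strlen(param);
	const char *p = cmdline;

	if (plen == 0)
		return false;

	while (*p) {
		size_t tlen;

		/* Skip separators between tokens. */
		while (*p == ' ' || *p == '\t' || *p == '\n')
			p++;
		/* Measure the current token and compare it as a whole. */
		tlen = strcspn(p, " \t\n");
		if (tlen == plen && strncmp(p, param, plen) == 0)
			return true;
		p += tlen;
	}
	return false;
}
```

Matching whole tokens rather than substrings avoids treating a different value
with the same prefix as a match.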
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-10-07 21:33 ` Mario Limonciello
@ 2025-10-13 16:29 ` Bert Karwatzki
2025-10-13 18:51 ` Mario Limonciello
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-10-13 16:29 UTC (permalink / raw)
To: Mario Limonciello, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Christian König,
Rafael J . Wysocki, spasswolf

On Tuesday, 07.10.2025 at 16:33 -0500, Mario Limonciello wrote:
>
> Can you still reproduce with amd_iommu=off?

Reproducing this at all is very difficult, so I'll try to find the exact spot
where things break (i.e. when the pci bus breaks and no more messages are
transmitted via netconsole) first. The current state of this search is that the
crash occurs in pci_pm_runtime_resume(), before pci_fixup_device() is called:

static int pci_pm_runtime_resume(struct device *dev)
{
	struct pci_dev *pci_dev = to_pci_dev(dev);
	const struct dev_pm_ops *pm = dev->driver ? dev->driver->pm : NULL;
	pci_power_t prev_state = pci_dev->current_state;
	int error = 0;
	// dev_info(dev, "%s = %px\n", __func__, (void *) pci_pm_runtime_resume); // remove this so we don't get too much delay
	// This was still printed in the case of a crash
	// so the crash must happen below

	/*
	 * Restoring config space is necessary even if the device is not bound
	 * to a driver because although we left it in D0, it may have gone to
	 * D3cold when the bridge above it runtime suspended.
	 */
	pci_pm_default_resume_early(pci_dev);
	if (!strcmp(dev_name(dev), "0000:00:01.1")) // This is the current test.
		dev_info(dev, "%s %d\n", __func__, __LINE__);
	pci_resume_ptm(pci_dev);

	if (!pci_dev->driver)
		return 0;

	//if (!strcmp(dev_name(dev), "0000:00:01.1")) // This was not printed when 6.17.0-rc6-next-20250917-gpudebug-00036-g4f7b4067c9ce
	//	dev_info(dev, "%s %d\n", __func__, __LINE__); // crashed, so the crash must happen above
	pci_fixup_device(pci_fixup_resume_early, pci_dev);
	pci_pm_default_resume(pci_dev);

	if (prev_state == PCI_D3cold)
		pci_pm_bridge_power_up_actions(pci_dev);

	if (pm && pm->runtime_resume)
		error = pm->runtime_resume(dev);

	return error;
}


Bert Karwatzki

^ permalink raw reply [flat|nested] 31+ messages in thread
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-10-13 16:29 ` Bert Karwatzki
@ 2025-10-13 18:51 ` Mario Limonciello
2025-10-14 10:50 ` Christian König
0 siblings, 1 reply; 31+ messages in thread
From: Mario Limonciello @ 2025-10-13 18:51 UTC (permalink / raw)
To: Bert Karwatzki, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Christian König,
Rafael J . Wysocki

On 10/13/25 11:29 AM, Bert Karwatzki wrote:
> On Tuesday, 07.10.2025 at 16:33 -0500, Mario Limonciello wrote:
>>
>> Can you still reproduce with amd_iommu=off?
>
> Reproducing this at all is very difficult, so I'll try to find the exact spot
> where things break (i.e. when the pci bus breaks and no more messages are
> transmitted via netconsole) first. The current state of this search is that the
> crash occurs in pci_pm_runtime_resume(), before pci_fixup_device() is called:
>

One other (unfortunate) possibility is that the timing of this crash occurring
is not deterministic.

As an idea for debugging this issue, do you think maybe using kdumpst [1] might
be helpful to get more information on the state during the crash?

Since NVME is missing you might need to boot off of USB or SD though so that
kdumpst is able to save the vmcore out of RAM.

Link: https://blogs.igalia.com/gpiccoli/2024/07/presenting-kdumpst-or-how-to-collect-kernel-crash-logs-on-arch-linux/ [1]

> static int pci_pm_runtime_resume(struct device *dev)
> {
> 	struct pci_dev *pci_dev = to_pci_dev(dev);
> 	const struct dev_pm_ops *pm = dev->driver ? dev->driver->pm : NULL;
> 	pci_power_t prev_state = pci_dev->current_state;
> 	int error = 0;
> 	// dev_info(dev, "%s = %px\n", __func__, (void *) pci_pm_runtime_resume); // remove this so we don't get too much delay
> 	// This was still printed in the case of a crash
> 	// so the crash must happen below
>
> 	/*
> 	 * Restoring config space is necessary even if the device is not bound
> 	 * to a driver because although we left it in D0, it may have gone to
> 	 * D3cold when the bridge above it runtime suspended.
> 	 */
> 	pci_pm_default_resume_early(pci_dev);
> 	if (!strcmp(dev_name(dev), "0000:00:01.1")) // This is the current test.
> 		dev_info(dev, "%s %d\n", __func__, __LINE__);
> 	pci_resume_ptm(pci_dev);
>
> 	if (!pci_dev->driver)
> 		return 0;
>
> 	//if (!strcmp(dev_name(dev), "0000:00:01.1")) // This was not printed when 6.17.0-rc6-next-20250917-gpudebug-00036-g4f7b4067c9ce
> 	//	dev_info(dev, "%s %d\n", __func__, __LINE__); // crashed, so the crash must happen above
> 	pci_fixup_device(pci_fixup_resume_early, pci_dev);
> 	pci_pm_default_resume(pci_dev);
>
> 	if (prev_state == PCI_D3cold)
> 		pci_pm_bridge_power_up_actions(pci_dev);
>
> 	if (pm && pm->runtime_resume)
> 		error = pm->runtime_resume(dev);
>
> 	return error;
> }
>
>
> Bert Karwatzki

^ permalink raw reply [flat|nested] 31+ messages in thread
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-10-13 18:51 ` Mario Limonciello
@ 2025-10-14 10:50 ` Christian König
[not found] ` <1853e2af7f70cf726df278137b6d2d89d9d9dc82.camel@web.de>
0 siblings, 1 reply; 31+ messages in thread
From: Christian König @ 2025-10-14 10:50 UTC (permalink / raw)
To: Mario Limonciello, Bert Karwatzki, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Rafael J . Wysocki

On 13.10.25 20:51, Mario Limonciello wrote:
> On 10/13/25 11:29 AM, Bert Karwatzki wrote:
>> On Tuesday, 2025-10-07 at 16:33 -0500, Mario Limonciello wrote:
>>>
>>> Can you still reproduce with amd_iommu=off?
>>
>> Reproducing this at all is very difficult, so I'll try to find the exact spot
>> where things break (i.e. when the pci bus breaks and no more messages are
>> transmitted via netconsole) first. The current state of this search is that the
>> crash occurs in pci_pm_runtime_resume(), before pci_fixup_device() is called:
>>
>
> One other (unfortunate) possibility is that the timing of this crash occurring is not deterministic.

Yeah, completely agree. The exact spot where things break is actually pretty uninteresting I think.

Background is that it is most likely not the spot which caused the issue. Instead what happens is that something in the HW times out and you see a spontaneous reboot because of this.

I would rather try to narrow down which operation or combination of things is causing the issue.

Maybe also double check if runtime pm is actually working on the good kernel or if the issue might be that somebody fixed runtime pm and you are now seeing issues because you happen to have problematic HW which we need to add to the blacklist.

Regards,
Christian.

>
> As an idea for debugging this issue, do you think maybe using kdumpst [1] might be helpful to get more information on the state during the crash?
>
> Since NVME is missing you might need to boot off of USB or SD though so that kdumpst is able to save the vmcore out of RAM.
>
> Link: https://blogs.igalia.com/gpiccoli/2024/07/presenting-kdumpst-or-how-to-collect-kernel-crash-logs-on-arch-linux/ [1]
>> static int pci_pm_runtime_resume(struct device *dev)
>> {
>>     struct pci_dev *pci_dev = to_pci_dev(dev);
>>     const struct dev_pm_ops *pm = dev->driver ? dev->driver->pm : NULL;
>>     pci_power_t prev_state = pci_dev->current_state;
>>     int error = 0;
>>     // dev_info(dev, "%s = %px\n", __func__, (void *) pci_pm_runtime_resume); // remove this so we don't get too much delay
>>     // This was still printed in the case of a crash
>>     // so the crash must happen below
>>
>>     /*
>>      * Restoring config space is necessary even if the device is not bound
>>      * to a driver because although we left it in D0, it may have gone to
>>      * D3cold when the bridge above it runtime suspended.
>>      */
>>     pci_pm_default_resume_early(pci_dev);
>>     if (!strcmp(dev_name(dev), "0000:00:01.1")) // This is the current test.
>>         dev_info(dev, "%s %d\n", __func__, __LINE__);
>>     pci_resume_ptm(pci_dev);
>>
>>     if (!pci_dev->driver)
>>         return 0;
>>
>>     //if (!strcmp(dev_name(dev), "0000:00:01.1")) // This was not printed when 6.17.0-rc6-next-20250917-gpudebug-00036-g4f7b4067c9ce
>>     //    dev_info(dev, "%s %d\n", __func__, __LINE__); // crashed, so the crash must happen above
>>     pci_fixup_device(pci_fixup_resume_early, pci_dev);
>>     pci_pm_default_resume(pci_dev);
>>
>>     if (prev_state == PCI_D3cold)
>>         pci_pm_bridge_power_up_actions(pci_dev);
>>
>>     if (pm && pm->runtime_resume)
>>         error = pm->runtime_resume(dev);
>>
>>     return error;
>> }
>>
>>
>> Bert Karwatzki
>

^ permalink raw reply [flat|nested] 31+ messages in thread
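Editorial sketch, not part of the original thread: the suggestion above to double check whether runtime PM is actually working on the good kernel could be checked from userspace through the standard sysfs power attributes of a PCI device. This is a minimal sketch, not a definitive procedure: the helper name runtime_pm_state and the sysfs_root parameter are hypothetical, and the two device addresses are the bridges named in this thread.

```python
from pathlib import Path


def runtime_pm_state(bdf, sysfs_root="/sys/bus/pci/devices"):
    """Read the runtime PM attributes of one PCI device from sysfs.

    Returns a dict mapping attribute name to its stripped contents,
    or to None when the attribute cannot be read (for example because
    the device has dropped off the bus).
    """
    power = Path(sysfs_root) / bdf / "power"
    state = {}
    for attr in ("control", "runtime_status",
                 "runtime_active_time", "runtime_suspended_time"):
        try:
            state[attr] = (power / attr).read_text().strip()
        except OSError:
            state[attr] = None  # attribute missing or device inaccessible
    return state


if __name__ == "__main__":
    # Bridges mentioned in the thread; adjust for other machines.
    for bdf in ("0000:00:01.1", "0000:00:02.4"):
        print(bdf, runtime_pm_state(bdf))
```

Run once on the good kernel and once on the bad one: a control value of "on" rather than "auto", or a runtime_suspended_time that never increases, would suggest the bridge was never runtime suspended on that kernel.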
[parent not found: <1853e2af7f70cf726df278137b6d2d89d9d9dc82.camel@web.de>]
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
[not found] ` <1853e2af7f70cf726df278137b6d2d89d9d9dc82.camel@web.de>
@ 2025-10-31 13:38 ` Bert Karwatzki
2025-10-31 13:47 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-10-31 13:38 UTC (permalink / raw)
To: Christian König, Mario Limonciello, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
spasswolf

I'm currently trying to bisect this issue (again ...), and during a test run with
a kernel built from commit 74adf9e35384 (in linux-next) I temporarily lost the
discrete GPU. This loss did not result in a crash, and the discrete GPU was not
lost permanently: using it (e.g. DRI_PRIME=1 glxgears) works again.
I'm not sure if this is related to the crashes.

Error message:

[76466.462660] [ T179286] pci_bus 0000:03: Allocating resources
[76466.463156] [ T179892] [drm] PCIE GART of 512M enabled (table at 0x00000081FEB00000).
[76466.463193] [ T179892] amdgpu 0000:03:00.0: amdgpu: PSP is resuming...
[76466.639416] [ T179892] amdgpu 0000:03:00.0: amdgpu: reserve 0xa00000 from 0x81fd000000 for PSP TMR
[76466.721071] [ T179892] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
[76466.732309] [ T179892] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[76466.732319] [ T179892] amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
[76466.732326] [ T179892] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0)
[76466.732333] [ T179892] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
[76466.806897] [ T179892] amdgpu 0000:03:00.0: amdgpu: SMU is resumed successfully!
[76466.808829] [ T179892] [drm] kiq ring mec 2 pipe 1 q 0
[76466.815229] [ T179892] [drm] DMUB hardware initialized: version=0x02020020
[76466.834132] [ T179892] amdgpu 0000:03:00.0: [drm] Cannot find any crtc or sizes
[76466.834153] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[76466.834158] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
[76466.834163] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
[76466.834167] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
[76466.834172] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[76466.834176] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[76466.834180] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[76466.834184] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[76466.834188] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[76466.834193] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[76466.834197] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 12 on hub 0
[76466.834201] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
[76466.834205] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring sdma1 uses VM inv eng 14 on hub 0
[76466.834209] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8
[76466.834214] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 8
[76466.834218] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 8
[76466.834222] [ T179892] amdgpu 0000:03:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8
[76466.838133] [ T179892] amdgpu 0000:03:00.0: [drm] Cannot find any crtc or sizes
[76619.622697] [ T179316] pci_bus 0000:03: Allocating resources
[76619.622938] [ T179987] [drm] PCIE GART of 512M enabled (table at 0x00000081FEB00000).
[76619.622971] [ T179987] amdgpu 0000:03:00.0: amdgpu: PSP is resuming...
[76619.798737] [ T179987] amdgpu 0000:03:00.0: amdgpu: reserve 0xa00000 from 0x81fd000000 for PSP TMR
[76619.882296] [ T179987] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
[76619.893732] [ T179987] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[76619.893751] [ T179987] amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
[76619.893760] [ T179987] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0)
[76619.893769] [ T179987] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
[76620.141719] [ T179987] amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:6 param:0x00000000 message:EnableAllSmuFeatures?
[76620.141736] [ T179987] amdgpu 0000:03:00.0: amdgpu: Failed to enable requested dpm features!
[76620.141742] [ T179987] amdgpu 0000:03:00.0: amdgpu: Failed to setup smc hw!
[76620.141747] [ T179987] amdgpu 0000:03:00.0: amdgpu: resume of IP block <smu> failed -121
[76620.141754] [ T179987] amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_resume failed (-121).
[76621.007575] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Link Down
[76621.007592] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Card not present
[76621.007651] [ T175001] pcieport 0000:02:00.0: Unable to change power state from D0 to D3hot, device inaccessible
[76621.007736] [ T177848] pcieport 0000:01:00.0: Unable to change power state from D0 to D3hot, device inaccessible
[76621.085815] [ T140] amdgpu 0000:03:00.0: amdgpu: amdgpu: finishing device.
[76621.140891] [ T140] ------------[ cut here ]------------ [76621.140904] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.141042] [ T140] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [76621.141122] [ T140] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [76621.141177] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Not tainted 6.14.0-mystery-00198-g74adf9e35384 #36 [76621.141183] [ T140] Hardware name: Micro-Star International Co., Ltd. 
Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [76621.141187] [ T140] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.141312] [ T140] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 a7 99 65 c7 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 e5 9d 5d 00 eb bd 0f 1f [76621.141317] [ T140] RSP: 0018:ffffb97f00743bd0 EFLAGS: 00010202 [76621.141323] [ T140] RAX: 0000000000000000 RBX: ffff9f050a4bb890 RCX: 0000000080000000 [76621.141332] [ T140] RDX: ffff9f050a4bb8e0 RSI: ffff9f050a4bb8e8 RDI: ffff9f050a4bb890 [76621.141336] [ T140] RBP: ffff9f050a4bb8e0 R08: 0000000000000000 R09: 0000000000000014 [76621.141340] [ T140] R10: 0000000000000001 R11: 0000000000000000 R12: ffff9f050a4bb8e8 [76621.141344] [ T140] R13: ffff9f0512d7f400 R14: ffff9f050a48ef80 R15: ffffb97f00743d6e [76621.141348] [ T140] FS: 0000000000000000(0000) GS:ffff9f07ba780000(0000) knlGS:0000000000000000 [76621.141353] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [76621.141357] [ T140] CR2: 00007fe612af0900 CR3: 0000000120cbc000 CR4: 0000000000750ef0 [76621.141361] [ T140] PKRU: 55555554 [76621.141365] [ T140] Call Trace: [76621.141370] [ T140] <TASK> [76621.141381] [ T140] ? __warn.cold+0x90/0x9e [76621.141388] [ T140] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.141507] [ T140] ? report_bug+0xfa/0x140 [76621.141513] [ T140] ? handle_bug+0x53/0x90 [76621.141518] [ T140] ? exc_invalid_op+0x17/0x70 [76621.141523] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [76621.141528] [ T140] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.141637] [ T140] psp_v11_0_ring_destroy+0x2e/0x50 [amdgpu] [76621.141760] [ T140] psp_hw_fini+0x126/0x380 [amdgpu] [76621.141876] [ T140] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] [76621.142041] [ T140] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] [76621.142195] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] [76621.142299] [ T140] pci_device_remove+0x3d/0xb0 [76621.142305] [ T140] device_release_driver_internal+0x197/0x200 [76621.142311] [ T140] pci_stop_bus_device+0x68/0x80 [76621.142317] [ T140] pci_stop_bus_device+0x38/0x80 [76621.142322] [ T140] pci_stop_bus_device+0x27/0x80 [76621.142327] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 [76621.142332] [ T140] pciehp_unconfigure_device+0x93/0x180 [76621.142337] [ T140] pciehp_disable_slot+0x62/0x100 [76621.142343] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 [76621.142348] [ T140] pciehp_ist+0x13b/0x180 [76621.142353] [ T140] irq_thread_fn+0x1e/0x60 [76621.142359] [ T140] irq_thread+0x114/0x1e0 [76621.142364] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [76621.142369] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [76621.142374] [ T140] ? irq_affinity_notify+0xd0/0xd0 [76621.142379] [ T140] kthread+0xea/0x1e0 [76621.142385] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [76621.142390] [ T140] ret_from_fork+0x2f/0x50 [76621.142395] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [76621.142400] [ T140] ret_from_fork_asm+0x11/0x20 [76621.142408] [ T140] </TASK> [76621.142412] [ T140] ---[ end trace 0000000000000000 ]--- [76621.143507] [ T140] ------------[ cut here ]------------ [76621.143511] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [76621.143608] [ T140] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [76621.143688] [ T140] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [76621.143739] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Tainted: G W 
6.14.0-mystery-00198-g74adf9e35384 #36 [76621.143744] [ T140] Tainted: [W]=WARN [76621.143748] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [76621.143752] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [76621.143855] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 7f d0 5c c7 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 6e d0 5c c7 b8 ea ff ff ff e9 64 d0 5c c7 [76621.143859] [ T140] RSP: 0018:ffffb97f00743c30 EFLAGS: 00010246 [76621.143864] [ T140] RAX: ffff9f0500bf3b60 RBX: ffff9f050a480000 RCX: 0000000000000000 [76621.143868] [ T140] RDX: 0000000000000000 RSI: ffff9f050a480c78 RDI: ffff9f050a480000 [76621.143872] [ T140] RBP: 0000000000000001 R08: 0000000000000001 R09: 0000000000000000 [76621.143876] [ T140] R10: 000000000020001f R11: 0000000000000000 R12: ffff9f050a4c6e48 [76621.143880] [ T140] R13: ffffffffc0cac1a8 R14: ffffffffc0cac1a8 R15: ffffb97f00743d6e [76621.143884] [ T140] FS: 0000000000000000(0000) GS:ffff9f07ba780000(0000) knlGS:0000000000000000 [76621.143888] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [76621.143892] [ T140] CR2: 00007fe612af0900 CR3: 0000000120cbc000 CR4: 0000000000750ef0 [76621.143896] [ T140] PKRU: 55555554 [76621.143899] [ T140] Call Trace: [76621.143903] [ T140] <TASK> [76621.143907] [ T140] ? __warn.cold+0x90/0x9e [76621.143913] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [76621.144016] [ T140] ? report_bug+0xfa/0x140 [76621.144022] [ T140] ? handle_bug+0x53/0x90 [76621.144026] [ T140] ? exc_invalid_op+0x17/0x70 [76621.144031] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [76621.144037] [ T140] ? 
amdgpu_irq_put+0x41/0x70 [amdgpu] [76621.144140] [ T140] gmc_v10_0_hw_fini+0x52/0xb0 [amdgpu] [76621.144243] [ T140] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] [76621.144387] [ T140] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] [76621.144533] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] [76621.144629] [ T140] pci_device_remove+0x3d/0xb0 [76621.144635] [ T140] device_release_driver_internal+0x197/0x200 [76621.144640] [ T140] pci_stop_bus_device+0x68/0x80 [76621.144646] [ T140] pci_stop_bus_device+0x38/0x80 [76621.144650] [ T140] pci_stop_bus_device+0x27/0x80 [76621.144655] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 [76621.144660] [ T140] pciehp_unconfigure_device+0x93/0x180 [76621.144666] [ T140] pciehp_disable_slot+0x62/0x100 [76621.144671] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 [76621.144676] [ T140] pciehp_ist+0x13b/0x180 [76621.144681] [ T140] irq_thread_fn+0x1e/0x60 [76621.144687] [ T140] irq_thread+0x114/0x1e0 [76621.144692] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [76621.144697] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [76621.144702] [ T140] ? irq_affinity_notify+0xd0/0xd0 [76621.144707] [ T140] kthread+0xea/0x1e0 [76621.144712] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [76621.144717] [ T140] ret_from_fork+0x2f/0x50 [76621.144723] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [76621.144728] [ T140] ret_from_fork_asm+0x11/0x20 [76621.144735] [ T140] </TASK> [76621.144738] [ T140] ---[ end trace 0000000000000000 ]--- [76621.144792] [ T140] ------------[ cut here ]------------ [76621.144796] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.144894] [ T140] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [76621.144972] [ T140] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [76621.145021] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Tainted: G W 
6.14.0-mystery-00198-g74adf9e35384 #36 [76621.145026] [ T140] Tainted: [W]=WARN [76621.145030] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [76621.145034] [ T140] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.145115] [ T140] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 a7 99 65 c7 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 e5 9d 5d 00 eb bd 0f 1f [76621.145119] [ T140] RSP: 0018:ffffb97f00743c00 EFLAGS: 00010202 [76621.145124] [ T140] RAX: ffff9f07ba7a5d80 RBX: ffff9f050a494b60 RCX: 000000000000016f [76621.145127] [ T140] RDX: ffff9f050a494b68 RSI: ffff9f050a494b70 RDI: ffff9f050a494b60 [76621.145131] [ T140] RBP: ffff9f050a494b68 R08: 00000000000056ee R09: 0000000000000009 [76621.145135] [ T140] R10: 0000000000000056 R11: 0000000000000012 R12: ffff9f050a494b70 [76621.145139] [ T140] R13: ffff9f0500c18000 R14: ffff9f050a48ef80 R15: ffffb97f00743d6e [76621.145143] [ T140] FS: 0000000000000000(0000) GS:ffff9f07ba780000(0000) knlGS:0000000000000000 [76621.145147] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [76621.145151] [ T140] CR2: 00007fe612af0900 CR3: 0000000120cbc000 CR4: 0000000000750ef0 [76621.145155] [ T140] PKRU: 55555554 [76621.145159] [ T140] Call Trace: [76621.145163] [ T140] <TASK> [76621.145167] [ T140] ? __warn.cold+0x90/0x9e [76621.145172] [ T140] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.145253] [ T140] ? report_bug+0xfa/0x140 [76621.145258] [ T140] ? handle_bug+0x53/0x90 [76621.145263] [ T140] ? exc_invalid_op+0x17/0x70 [76621.145267] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [76621.145273] [ T140] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.145354] [ T140] amdgpu_ih_ring_fini+0x4f/0x80 [amdgpu] [76621.145446] [ T140] amdgpu_irq_fini_hw+0x2f/0x80 [amdgpu] [76621.145540] [ T140] amdgpu_device_fini_hw+0x231/0x2ad [amdgpu] [76621.145665] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] [76621.145745] [ T140] pci_device_remove+0x3d/0xb0 [76621.145751] [ T140] device_release_driver_internal+0x197/0x200 [76621.145756] [ T140] pci_stop_bus_device+0x68/0x80 [76621.145761] [ T140] pci_stop_bus_device+0x38/0x80 [76621.145766] [ T140] pci_stop_bus_device+0x27/0x80 [76621.145771] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 [76621.145776] [ T140] pciehp_unconfigure_device+0x93/0x180 [76621.145781] [ T140] pciehp_disable_slot+0x62/0x100 [76621.145786] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 [76621.145791] [ T140] pciehp_ist+0x13b/0x180 [76621.145796] [ T140] irq_thread_fn+0x1e/0x60 [76621.145802] [ T140] irq_thread+0x114/0x1e0 [76621.145806] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [76621.145812] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [76621.145817] [ T140] ? irq_affinity_notify+0xd0/0xd0 [76621.145822] [ T140] kthread+0xea/0x1e0 [76621.145827] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [76621.145833] [ T140] ret_from_fork+0x2f/0x50 [76621.145838] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [76621.145843] [ T140] ret_from_fork_asm+0x11/0x20 [76621.145850] [ T140] </TASK> [76621.145854] [ T140] ---[ end trace 0000000000000000 ]--- [76621.146685] [ T140] ------------[ cut here ]------------ [76621.146691] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.146775] [ T140] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [76621.146853] [ T140] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [76621.146904] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Tainted: G W 
6.14.0-mystery-00198-g74adf9e35384 #36 [76621.146909] [ T140] Tainted: [W]=WARN [76621.146913] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [76621.146917] [ T140] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.146998] [ T140] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 a7 99 65 c7 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 e5 9d 5d 00 eb bd 0f 1f [76621.147003] [ T140] RSP: 0018:ffffb97f00743c48 EFLAGS: 00010202 [76621.147007] [ T140] RAX: 0000000000000000 RBX: ffff9f050a480a20 RCX: 0000000000000000 [76621.147011] [ T140] RDX: ffff9f050a480a28 RSI: 0000000000000000 RDI: ffff9f050a480a20 [76621.147015] [ T140] RBP: ffff9f050a480a28 R08: ffff9f05723cf5d8 R09: 0000000000000000 [76621.147019] [ T140] R10: 00007fe80b74e000 R11: 000000000000014d R12: 0000000000000000 [76621.147023] [ T140] R13: ffff9f0500c1d400 R14: ffff9f050a48ef80 R15: ffffb97f00743d6e [76621.147027] [ T140] FS: 0000000000000000(0000) GS:ffff9f07ba780000(0000) knlGS:0000000000000000 [76621.147031] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [76621.147035] [ T140] CR2: 00007fe612af0900 CR3: 0000000120cbc000 CR4: 0000000000750ef0 [76621.147039] [ T140] PKRU: 55555554 [76621.147042] [ T140] Call Trace: [76621.147046] [ T140] <TASK> [76621.147050] [ T140] ? __warn.cold+0x90/0x9e [76621.147056] [ T140] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.147138] [ T140] ? report_bug+0xfa/0x140 [76621.147143] [ T140] ? handle_bug+0x53/0x90 [76621.147147] [ T140] ? exc_invalid_op+0x17/0x70 [76621.147152] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [76621.147158] [ T140] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [76621.147239] [ T140] amdgpu_device_unmap_mmio+0x25/0x90 [amdgpu] [76621.147319] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] [76621.147398] [ T140] pci_device_remove+0x3d/0xb0 [76621.147403] [ T140] device_release_driver_internal+0x197/0x200 [76621.147409] [ T140] pci_stop_bus_device+0x68/0x80 [76621.147414] [ T140] pci_stop_bus_device+0x38/0x80 [76621.147419] [ T140] pci_stop_bus_device+0x27/0x80 [76621.147424] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 [76621.147429] [ T140] pciehp_unconfigure_device+0x93/0x180 [76621.147434] [ T140] pciehp_disable_slot+0x62/0x100 [76621.147439] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 [76621.147444] [ T140] pciehp_ist+0x13b/0x180 [76621.147449] [ T140] irq_thread_fn+0x1e/0x60 [76621.147455] [ T140] irq_thread+0x114/0x1e0 [76621.147460] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [76621.147465] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [76621.147471] [ T140] ? irq_affinity_notify+0xd0/0xd0 [76621.147483] [ T140] kthread+0xea/0x1e0 [76621.147489] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [76621.147494] [ T140] ret_from_fork+0x2f/0x50 [76621.147500] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [76621.147505] [ T140] ret_from_fork_asm+0x11/0x20 [76621.147512] [ T140] </TASK> [76621.147515] [ T140] ---[ end trace 0000000000000000 ]--- [76621.870884] [ T140] pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible [76621.870977] [ T140] pcieport 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible [76621.876006] [ T140] pci_bus 0000:03: busn_res: [bus 03] is released [76621.878237] [ T140] pci_bus 0000:02: busn_res: [bus 02-03] is released [76621.879867] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Card present [76621.879873] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Link Up [76622.006565] [ T140] pci 0000:01:00.0: [1002:1478] type 01 class 0x060400 PCIe Switch Upstream Port [76622.006606] [ T140] pci 0000:01:00.0: BAR 0 [mem 0xfcc00000-0xfcc03fff] [76622.006616] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03] [76622.006630] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff] [76622.006644] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] [76622.006772] [ T140] pci 0000:01:00.0: PME# supported from D0 D3hot D3cold [76622.006874] [ T140] pci 0000:01:00.0: 16.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x8 link at 0000:00:01.1 (capable of 126.024 Gb/s with 16.0 GT/s PCIe x8 link) [76622.007064] [ T140] pci 0000:01:00.0: Adding to iommu group 12 [76622.007193] [ T140] pci 0000:02:00.0: [1002:1479] type 01 class 0x060400 PCIe Switch Downstream Port [76622.007227] [ T140] pci 0000:02:00.0: PCI bridge to [bus 03] [76622.007241] [ T140] pci 0000:02:00.0: bridge window [mem 0xfca00000-0xfcbfffff] [76622.007256] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] [76622.007366] [ T140] pci 0000:02:00.0: PME# supported from D0 D3hot D3cold [76622.008231] [ T140] pci 0000:02:00.0: Adding to iommu group 13 [76622.008285] [ T140] pci 0000:01:00.0: PCI bridge to [bus 
02-03] [76622.008428] [ T140] pci 0000:03:00.0: [1002:73ff] type 00 class 0x038000 PCIe Legacy Endpoint [76622.008493] [ T140] pci 0000:03:00.0: BAR 0 [mem 0xfc00000000-0xfdffffffff 64bit pref] [76622.008500] [ T140] pci 0000:03:00.0: BAR 2 [mem 0xfe00000000-0xfe0fffffff 64bit pref] [76622.008507] [ T140] pci 0000:03:00.0: BAR 5 [mem 0xfca00000-0xfcafffff] [76622.008512] [ T140] pci 0000:03:00.0: ROM [mem 0xfcb00000-0xfcb1ffff pref] [76622.008644] [ T140] pci 0000:03:00.0: PME# supported from D1 D2 D3hot D3cold [76622.008753] [ T140] pci 0000:03:00.0: 16.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x8 link at 0000:00:01.1 (capable of 252.048 Gb/s with 16.0 GT/s PCIe x16 link) [76622.009273] [ T140] pci 0000:03:00.0: Adding to iommu group 14 [76622.009323] [ T140] pci 0000:03:00.1: [1002:ab28] type 00 class 0x040300 PCIe Legacy Endpoint [76622.009368] [ T140] pci 0000:03:00.1: BAR 0 [mem 0x00000000-0x00003fff] [76622.009386] [ T140] pci 0000:03:00.1: Max Payload Size set to 256 (was 128, max 256) [76622.009467] [ T140] pci 0000:03:00.1: PME# supported from D1 D2 D3hot D3cold [76622.010655] [ T140] pci 0000:03:00.1: Adding to iommu group 15 [76622.010691] [ T140] pci 0000:02:00.0: ASPM: current common clock configuration is inconsistent, reconfiguring [76622.010737] [ T140] pci 0000:02:00.0: PCI bridge to [bus 03] [76622.010771] [ T140] pcieport 0000:00:01.1: Assigned bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 01-03] cannot fit 0x300000000 required for 0000:02:00.0 bridging to [bus 03] [76622.010777] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 03] requires relaxed alignment rules [76622.010783] [ T140] pcieport 0000:00:01.1: Assigned bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 01-03] cannot fit 0x400000000 required for 0000:01:00.0 bridging to [bus 02-03] [76622.010788] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 
02-03] requires relaxed alignment rules [76622.010797] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]: assigned [76622.010802] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff]: assigned [76622.010806] [ T140] pci 0000:01:00.0: BAR 0 [mem 0xfcc00000-0xfcc03fff]: assigned [76622.010813] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]: assigned [76622.010818] [ T140] pci 0000:02:00.0: bridge window [mem 0xfca00000-0xfcbfffff]: assigned [76622.010824] [ T140] pci 0000:03:00.0: BAR 0 [mem 0xfc00000000-0xfdffffffff 64bit pref]: assigned [76622.010836] [ T140] pci 0000:03:00.0: BAR 2 [mem 0xfe00000000-0xfe0fffffff 64bit pref]: assigned [76622.010849] [ T140] pci 0000:03:00.0: BAR 5 [mem 0xfca00000-0xfcafffff]: assigned [76622.010855] [ T140] pci 0000:03:00.0: ROM [mem 0xfcb00000-0xfcb1ffff pref]: assigned [76622.010860] [ T140] pci 0000:03:00.1: BAR 0 [mem 0xfcb20000-0xfcb23fff]: assigned [76622.010867] [ T140] pci 0000:02:00.0: PCI bridge to [bus 03] [76622.010874] [ T140] pci 0000:02:00.0: bridge window [mem 0xfca00000-0xfcbfffff] [76622.010881] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] [76622.010890] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03] [76622.010897] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff] [76622.010904] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] [76622.010913] [ T140] pcieport 0000:00:01.1: PCI bridge to [bus 01-03] [76622.010917] [ T140] pcieport 0000:00:01.1: bridge window [io 0x1000-0x1fff] [76622.010924] [ T140] pcieport 0000:00:01.1: bridge window [mem 0xfca00000-0xfccfffff] [76622.010929] [ T140] pcieport 0000:00:01.1: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] [76622.011839] [ T140] [drm] initializing kernel modesetting (DIMGREY_CAVEFISH 0x1002:0x73FF 0x1462:0x1313 0xC3). 
[76622.012233] [ T140] [drm] register mmio base: 0xFCA00000 [76622.012240] [ T140] [drm] register mmio size: 1048576 [76624.031458] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 0 <nv_common> [76624.031471] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 1 <gmc_v10_0> [76624.031482] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 2 <navi10_ih> [76624.031487] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 3 <psp> [76624.031491] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 4 <smu> [76624.031495] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 5 <dm> [76624.031499] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 6 <gfx_v10_0> [76624.031503] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 7 <sdma_v5_2> [76624.031508] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 8 <vcn_v3_0> [76624.031512] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 9 <jpeg_v3_0> [76624.031533] [ T140] amdgpu 0000:03:00.0: amdgpu: ACPI VFCT table present but broken (too short #2),skipping [76624.040650] [ T140] amdgpu 0000:03:00.0: amdgpu: Fetched VBIOS from ROM BAR [76624.040661] [ T140] amdgpu: ATOM BIOS: SWBRT77181.001 [76624.047812] [ T140] amdgpu 0000:03:00.0: amdgpu: Trusted Memory Zone (TMZ) feature disabled as experimental (default) [76624.047826] [ T140] amdgpu 0000:03:00.0: amdgpu: MODE1 reset [76624.047832] [ T140] amdgpu 0000:03:00.0: amdgpu: GPU mode1 reset [76624.047925] [ T140] amdgpu 0000:03:00.0: amdgpu: GPU smu mode1 reset [76624.550931] [ T140] [drm] GPU posting now... 
[76624.551021] [ T140] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit [76624.551041] [ T140] amdgpu 0000:03:00.0: amdgpu: VRAM: 8176M 0x0000008000000000 - 0x00000081FEFFFFFF (8176M used) [76624.551051] [ T140] amdgpu 0000:03:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF [76624.551083] [ T140] [drm] Detected VRAM RAM=8176M, BAR=8192M [76624.551090] [ T140] [drm] RAM width 128bits GDDR6 [76624.551292] [ T140] [drm] amdgpu: 8176M of VRAM memory ready [76624.551306] [ T140] [drm] amdgpu: 6895M of GTT memory ready. [76624.551341] [ T140] [drm] GART: num cpu pages 131072, num gpu pages 131072 [76624.551558] [ T140] [drm] PCIE GART of 512M enabled (table at 0x00000081FEB00000). [76627.137571] [ T140] amdgpu 0000:03:00.0: amdgpu: STB initialized to 2048 entries [76627.137664] [ T140] [drm] Loading DMUB firmware via PSP: version=0x02020020 [76627.138008] [ T140] [drm] use_doorbell being set to: [true] [76627.138031] [ T140] [drm] use_doorbell being set to: [true] [76627.138048] [ T140] [drm] Found VCN firmware Version ENC: 1.33 DEC: 4 VEP: 0 Revision: 6 [76627.301177] [ T140] amdgpu 0000:03:00.0: amdgpu: reserve 0xa00000 from 0x81fd000000 for PSP TMR [76627.382840] [ T140] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available [76627.394206] [ T140] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available [76627.394236] [ T140] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0) [76627.394244] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched [76627.394283] [ T140] amdgpu 0000:03:00.0: amdgpu: use vbios provided pptable [76627.470860] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is initialized successfully! 
[76627.471678] [ T140] [drm] Display Core v3.2.316 initialized on DCN 3.0.2 [76627.471686] [ T140] [drm] DP-HDMI FRL PCON supported [76627.473002] [ T140] [drm] DMUB hardware initialized: version=0x02020020 [76627.505987] [ T140] [drm] kiq ring mec 2 pipe 1 q 0 [76627.514661] [ T140] kfd kfd: amdgpu: Allocated 3969056 bytes on gart [76627.514688] [ T140] kfd kfd: amdgpu: Total number of KFD nodes to be created: 1 [76627.514812] [ T140] amdgpu: Virtual CRAT table created for GPU [76627.515356] [ T140] amdgpu: Topology: Add dGPU node [0x73ff:0x1002] [76627.515361] [ T140] kfd kfd: amdgpu: added device 1002:73ff [76627.515385] [ T140] amdgpu 0000:03:00.0: amdgpu: SE 2, SH per SE 2, CU per SH 8, active_cu_number 28 [76627.515394] [ T140] amdgpu 0000:03:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0 [76627.515400] [ T140] amdgpu 0000:03:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0 [76627.515406] [ T140] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0 [76627.515411] [ T140] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0 [76627.515415] [ T140] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0 [76627.515419] [ T140] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0 [76627.515423] [ T140] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0 [76627.515428] [ T140] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0 [76627.515431] [ T140] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0 [76627.515436] [ T140] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0 [76627.515440] [ T140] amdgpu 0000:03:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 12 on hub 0 [76627.515444] [ T140] amdgpu 0000:03:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0 [76627.515448] [ T140] amdgpu 0000:03:00.0: amdgpu: ring sdma1 uses VM inv eng 14 on hub 0 [76627.515452] [ T140] amdgpu 0000:03:00.0: 
amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8 [76627.515456] [ T140] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 8 [76627.515460] [ T140] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 8 [76627.515464] [ T140] amdgpu 0000:03:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8 [76627.517023] [ T140] amdgpu 0000:03:00.0: amdgpu: Using BOCO for runtime pm [76627.525799] [ T140] [drm] Initialized amdgpu 3.61.0 for 0000:03:00.0 on minor 2 [76627.529014] [ T140] amdgpu 0000:03:00.0: [drm] Cannot find any crtc or sizes [76627.529208] [ T140] pci 0000:03:00.1: D0 power state depends on 0000:03:00.0 [76627.529246] [ T140] snd_hda_intel 0000:03:00.1: enabling device (0000 -> 0002) [76627.529333] [ T140] snd_hda_intel 0000:03:00.1: Handle vga_switcheroo audio client [76627.529342] [ T140] snd_hda_intel 0000:03:00.1: Force to non-snoop mode [76627.535481] [ T178264] snd_hda_intel 0000:03:00.1: bound 0000:03:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu]) [76627.536870] [ T178264] input: HDA ATI HDMI HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:01.1/0000:01:00.0/0000:02:00.0/0000:03:00.1/sound/card1/input32 [76627.537043] [ T178264] input: HDA ATI HDMI HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:01.1/0000:01:00.0/0000:02:00.0/0000:03:00.1/sound/card1/input33 [76627.537142] [ T178264] input: HDA ATI HDMI HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:01.1/0000:01:00.0/0000:02:00.0/0000:03:00.1/sound/card1/input34 [76627.537213] [ T178264] input: HDA ATI HDMI HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:01.1/0000:01:00.0/0000:02:00.0/0000:03:00.1/sound/card1/input35 [76627.537293] [ T178264] input: HDA ATI HDMI HDMI/DP,pcm=10 as /devices/pci0000:00/0000:00:01.1/0000:01:00.0/0000:02:00.0/0000:03:00.1/sound/card1/input36 Bert Karwatzki ^ permalink raw reply [flat|nested] 31+ messages in thread
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-10-31 13:38 ` Bert Karwatzki
@ 2025-10-31 13:47 ` Bert Karwatzki
2025-10-31 18:35 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-10-31 13:47 UTC (permalink / raw)
To: Christian König, Mario Limonciello, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi,
Rafael J . Wysocki, spasswolf

Upon closer inspection I noticed that the PCIe bandwidth has been reduced:

>
> [76621.870884] [ T140] pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible
> [76621.870977] [ T140] pcieport 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
> [76621.876006] [ T140] pci_bus 0000:03: busn_res: [bus 03] is released
> [76621.878237] [ T140] pci_bus 0000:02: busn_res: [bus 02-03] is released
> [76621.879867] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Card present
> [76621.879873] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Link Up
> [76622.006565] [ T140] pci 0000:01:00.0: [1002:1478] type 01 class 0x060400 PCIe Switch Upstream Port
> [76622.006606] [ T140] pci 0000:01:00.0: BAR 0 [mem 0xfcc00000-0xfcc03fff]
> [76622.006616] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03]
> [76622.006630] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
> [76622.006644] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
> [76622.006772] [ T140] pci 0000:01:00.0: PME# supported from D0 D3hot D3cold

The PCIe bandwidth seems to have been reduced to PCIe 1.0 (2.5 GT/s):

> [76622.006874] [ T140] pci 0000:01:00.0: 16.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x8 link at 0000:00:01.1 (capable of 126.024 Gb/s with 16.0 GT/s PCIe x8 link)
>
> Bert Karwatzki

This is the same message from system startup (here the link runs at PCIe 3.0 (8.0 GT/s), which is the PCIe version supported by the CPU (AMD Ryzen 7 5800H with Radeon Graphics)):

[ 0.289221] [ T1] pci 0000:01:00.0: 63.008 Gb/s available PCIe bandwidth, limited by 8.0 GT/s PCIe x8 link at 0000:00:01.1 (capable of 126.024 Gb/s with 16.0 GT/s PCIe x8 link)

Bert Karwatzki
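As a cross-check of the bandwidth figures in the two log lines above, here is a minimal sketch of the arithmetic. It assumes the usual PCIe line encodings (8b/10b at 2.5 and 5.0 GT/s, 128b/130b at 8.0 and 16.0 GT/s) and integer truncation of the per-lane rate in Mb/s. The helper names lane_mbps and link_gbps are made up for illustration and are not kernel functions. Under these assumptions the 16.000, 63.008 and 126.024 Gb/s values come out as printed in the log:

```python
# Sketch only: reproduces the "available PCIe bandwidth" numbers from the log.
# Assumption: payload/total bit ratio per transfer rate follows the PCIe line
# encoding (8b/10b for Gen1/Gen2, 128b/130b for Gen3/Gen4).
ENCODING = {
    2.5: (8, 10),
    5.0: (8, 10),
    8.0: (128, 130),
    16.0: (128, 130),
}


def lane_mbps(gt_per_s):
    """Usable rate of one lane in Mb/s, truncated to an integer (hypothetical helper)."""
    payload, total = ENCODING[gt_per_s]
    return int(gt_per_s * 1000) * payload // total


def link_gbps(gt_per_s, width):
    """Usable rate of a link with `width` lanes in Gb/s (hypothetical helper)."""
    return lane_mbps(gt_per_s) * width / 1000


# (speed in GT/s, lane count) pairs taken from the log messages
for speed, width in [(2.5, 8), (8.0, 8), (16.0, 8), (16.0, 16)]:
    print(f"{speed} GT/s x{width}: {link_gbps(speed, width):.3f} Gb/s")
```

So after the failed resume the x8 link at 0000:00:01.1 offers roughly a quarter of the bandwidth it had at boot (16.000 vs. 63.008 Gb/s).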
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-10-31 13:47 ` Bert Karwatzki
@ 2025-10-31 18:35 ` Bert Karwatzki
2025-11-05 11:44 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-10-31 18:35 UTC (permalink / raw)
To: Christian König, Mario Limonciello, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi,
Rafael J . Wysocki

On Friday, 31.10.2025 at 14:47 +0100, Bert Karwatzki wrote:
> Upon closer inspection I noticed that the PCIe bandwidth has been reduced:
>
> >
> > [76621.870884] [ T140] pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible
> > [76621.870977] [ T140] pcieport 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
> > [76621.876006] [ T140] pci_bus 0000:03: busn_res: [bus 03] is released
> > [76621.878237] [ T140] pci_bus 0000:02: busn_res: [bus 02-03] is released
> > [76621.879867] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Card present
> > [76621.879873] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Link Up
> > [76622.006565] [ T140] pci 0000:01:00.0: [1002:1478] type 01 class 0x060400 PCIe Switch Upstream Port
> > [76622.006606] [ T140] pci 0000:01:00.0: BAR 0 [mem 0xfcc00000-0xfcc03fff]
> > [76622.006616] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03]
> > [76622.006630] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
> > [76622.006644] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
> > [76622.006772] [ T140] pci 0000:01:00.0: PME# supported from D0 D3hot D3cold
>
> The PCIe bandwidth seems to have been reduced to PCIe 1.0 (2.5 GT/s):
>
> > [76622.006874] [ T140] pci 0000:01:00.0: 16.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x8 link at 0000:00:01.1 (capable of 126.024 Gb/s with 16.0 GT/s PCIe x8 link)
> >
> > Bert Karwatzki
>
> This is the same message from system startup (here the link runs at PCIe 3.0 (8.0 GT/s), which is the PCIe version supported by the CPU (AMD Ryzen 7 5800H with Radeon Graphics)):
> [ 0.289221] [ T1] pci 0000:01:00.0: 63.008 Gb/s available PCIe bandwidth, limited by 8.0 GT/s PCIe x8 link at 0000:00:01.1 (capable of 126.024 Gb/s with 16.0 GT/s PCIe x8 link)
>
> Bert Karwatzki

Five hours later the following happened. This time the discrete GPU is no longer present on the PCI bus:

[94794.664620] [ T246165] pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible
[94794.927090] [ T249440] [drm] PCIE GART of 512M enabled (table at 0x00000081FEB00000).
[94794.927137] [ T249440] amdgpu 0000:03:00.0: amdgpu: PSP is resuming...
[94795.103071] [ T249440] amdgpu 0000:03:00.0: amdgpu: reserve 0xa00000 from 0x81fd000000 for PSP TMR
[94795.184954] [ T249440] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
[94795.196436] [ T249440] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[94795.196443] [ T249440] amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
[94795.196450] [ T249440] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0)
[94795.196456] [ T249440] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
[94795.270584] [ T249440] amdgpu 0000:03:00.0: amdgpu: SMU is resumed successfully!
[94795.272500] [ T249440] [drm] kiq ring mec 2 pipe 1 q 0 [94795.279250] [ T249440] [drm] DMUB hardware initialized: version=0x02020020 [94795.298642] [ T249440] amdgpu 0000:03:00.0: [drm] Cannot find any crtc or sizes [94795.298659] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0 [94795.298664] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0 [94795.298668] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0 [94795.298672] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0 [94795.298676] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0 [94795.298680] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0 [94795.298683] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0 [94795.298687] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0 [94795.298691] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0 [94795.298695] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0 [94795.298699] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 12 on hub 0 [94795.298703] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0 [94795.298707] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring sdma1 uses VM inv eng 14 on hub 0 [94795.298711] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8 [94795.298715] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 8 [94795.298719] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 8 [94795.298722] [ T249440] amdgpu 0000:03:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8 [94795.303424] [ T249440] amdgpu 0000:03:00.0: [drm] Cannot find any crtc or sizes [94858.150608] [ T255287] pcieport 
0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible [94858.278571] [ T255287] amdgpu 0000:03:00.0: Unable to change power state from D3cold to D0, device inaccessible [94869.026346] [ T255287] [drm:gmc_v10_0_flush_gpu_tlb [amdgpu]] *ERROR* Timeout waiting for sem acquire in VM flush! [94869.188171] [ T255287] amdgpu 0000:03:00.0: amdgpu: Timeout waiting for VM flush hub: 8! [94869.347241] [ T255287] amdgpu 0000:03:00.0: amdgpu: Timeout waiting for VM flush hub: 0! [94869.347265] [ T255287] [drm] PCIE GART of 512M enabled (table at 0x00000081FEB00000). [94869.347283] [ T255287] amdgpu 0000:03:00.0: amdgpu: PSP is resuming... [94869.387560] [ T255287] amdgpu 0000:03:00.0: amdgpu: reserve 0xa00000 from 0x81fd000000 for PSP TMR [94869.387688] [ T255287] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available [94869.387832] [ T255287] ------------[ cut here ]------------ [94869.387838] [ T255287] WARNING: CPU: 7 PID: 255287 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94869.388000] [ T255287] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common 
joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [94869.388093] [ T255287] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [94869.388156] [ T255287] CPU: 7 UID: 1000 PID: 255287 Comm: evolution Tainted: G W 6.14.0-mystery-00198-g74adf9e35384 #36 [94869.388163] [ T255287] Tainted: [W]=WARN [94869.388167] [ T255287] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [94869.388172] [ T255287] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94869.388328] [ T255287] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 a7 99 65 c7 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 e5 9d 5d 00 eb bd 0f 1f [94869.388334] [ T255287] RSP: 0018:ffffb97f1035f940 EFLAGS: 00010202 [94869.388340] [ T255287] RAX: ffff9f05b0b69f80 RBX: ffff9f056be3c6e8 RCX: 00000081fec03000 [94869.388345] [ T255287] RDX: ffff9f056be3c6f8 RSI: ffff9f056be3c6f0 RDI: ffff9f056be3c6e8 [94869.388349] [ T255287] RBP: ffff9f056be3c6f8 R08: ffff9f056be3c728 R09: ffffb97f11391000 [94869.388353] [ T255287] R10: ffff9f07e02fffa8 R11: 0000000000000003 R12: ffff9f056be3c6f0 [94869.388358] [ T255287] R13: ffff9f05b0b67400 R14: ffff9f056be0ef80 R15: ffff9f05059a10c8 [94869.388362] [ T255287] FS: 00007f2cae63acc0(0000) GS:ffff9f07ba7c0000(0000) knlGS:0000000000000000 [94869.388367] [ T255287] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [94869.388372] [ T255287] CR2: 00007f6d36cc2b9c CR3: 000000018c0d4000 CR4: 
0000000000750ef0 [94869.388376] [ T255287] PKRU: 55555554 [94869.388381] [ T255287] Call Trace: [94869.388387] [ T255287] <TASK> [94869.388392] [ T255287] ? __warn.cold+0x90/0x9e [94869.388400] [ T255287] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94869.388556] [ T255287] ? report_bug+0xfa/0x140 [94869.388564] [ T255287] ? handle_bug+0x53/0x90 [94869.388570] [ T255287] ? exc_invalid_op+0x17/0x70 [94869.388576] [ T255287] ? asm_exc_invalid_op+0x1a/0x20 [94869.388584] [ T255287] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94869.388771] [ T255287] psp_rap_initialize+0x19f/0x1d0 [amdgpu] [94869.388962] [ T255287] psp_resume+0x19a/0x201 [amdgpu] [94869.389186] [ T255287] amdgpu_ip_block_resume+0x22/0x40 [amdgpu] [94869.389339] [ T255287] ? srso_alias_return_thunk+0x5/0xfbef5 [94869.389347] [ T255287] amdgpu_device_fw_loading+0x109/0x140 [amdgpu] [94869.389510] [ T255287] ? srso_alias_return_thunk+0x5/0xfbef5 [94869.389517] [ T255287] ? amdgpu_device_ip_resume_phase1+0x50/0x90 [amdgpu] [94869.389651] [ T255287] amdgpu_device_resume+0x65/0x2b0 [amdgpu] [94869.389793] [ T255287] amdgpu_pmops_runtime_resume+0x56/0x100 [amdgpu] [94869.389927] [ T255287] ? pci_pm_restore_noirq+0xc0/0xc0 [94869.389934] [ T255287] ? pci_pm_restore_noirq+0xc0/0xc0 [94869.389940] [ T255287] __rpm_callback+0x3f/0x160 [94869.389947] [ T255287] rpm_callback+0x50/0x60 [94869.389953] [ T255287] ? pci_pm_restore_noirq+0xc0/0xc0 [94869.389959] [ T255287] rpm_resume+0x50a/0x770 [94869.389968] [ T255287] ? srso_alias_return_thunk+0x5/0xfbef5 [94869.389974] [ T255287] ? lock_timer_base+0x68/0x90 [94869.389982] [ T255287] __pm_runtime_resume+0x46/0x80 [94869.389989] [ T255287] amdgpu_driver_open_kms+0x4b/0x250 [amdgpu] [94869.390124] [ T255287] drm_file_alloc+0x1cb/0x270 [94869.390133] [ T255287] drm_open_helper+0x80/0x130 [94869.390140] [ T255287] ? 
srso_alias_return_thunk+0x5/0xfbef5 [94869.390146] [ T255287] drm_open+0x6e/0x100 [94869.390153] [ T255287] drm_stub_open+0x99/0xd0 [94869.390160] [ T255287] chrdev_open+0xae/0x210 [94869.390168] [ T255287] ? __unregister_chrdev+0x40/0x40 [94869.390174] [ T255287] do_dentry_open+0x16b/0x580 [94869.390181] [ T255287] vfs_open+0x29/0xe0 [94869.390188] [ T255287] path_openat+0x832/0x12b0 [94869.390195] [ T255287] ? vsnprintf+0x4cb/0x5c0 [94869.390203] [ T255287] do_filp_open+0xc2/0x170 [94869.390211] [ T255287] ? srso_alias_return_thunk+0x5/0xfbef5 [94869.390217] [ T255287] ? current_time+0x2a/0x110 [94869.390224] [ T255287] ? srso_alias_return_thunk+0x5/0xfbef5 [94869.390230] [ T255287] ? __check_object_size+0x1f0/0x220 [94869.390236] [ T255287] ? srso_alias_return_thunk+0x5/0xfbef5 [94869.390242] [ T255287] do_sys_openat2+0x6c/0xd0 [94869.390250] [ T255287] __x64_sys_openat+0x50/0xa0 [94869.390256] [ T255287] do_syscall_64+0x5f/0x170 [94869.390263] [ T255287] entry_SYSCALL_64_after_hwframe+0x55/0x5d [94869.390270] [ T255287] RIP: 0033:0x7f2cb9dea9ee [94869.390276] [ T255287] Code: 08 0f 85 f5 4b ff ff 49 89 fb 48 89 f0 48 89 d7 48 89 ce 4c 89 c2 4d 89 ca 4c 8b 44 24 08 4c 8b 4c 24 10 4c 89 5c 24 08 0f 05 <c3> 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 80 00 00 00 00 48 83 ec 08 [94869.390282] [ T255287] RSP: 002b:00007ffcba63cc48 EFLAGS: 00000246 ORIG_RAX: 0000000000000101 [94869.390289] [ T255287] RAX: ffffffffffffffda RBX: 00007f2cae63acc0 RCX: 00007f2cb9dea9ee [94869.390295] [ T255287] RDX: 0000000000080002 RSI: 00005569c704b818 RDI: ffffffffffffff9c [94869.390300] [ T255287] RBP: 0000000000000001 R08: 0000000000000000 R09: 0000000000000000 [94869.390306] [ T255287] R10: 0000000000000000 R11: 0000000000000246 R12: 00005569c64c0d90 [94869.390311] [ T255287] R13: 00007f2ca4010600 R14: 00007f2ca4044410 R15: 0000000000000000 [94869.390318] [ T255287] </TASK> [94869.390323] [ T255287] ---[ end trace 0000000000000000 ]--- [94869.390335] [ T255287] amdgpu 0000:03:00.0: amdgpu: 
RAP TA initialize fail (0) status -1. [94869.390341] [ T255287] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available [94869.390349] [ T255287] amdgpu 0000:03:00.0: amdgpu: SMU is resuming... [94869.390361] [ T255287] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0) [94869.390369] [ T255287] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched [94869.390375] [ T255287] amdgpu 0000:03:00.0: amdgpu: dpm has been enabled [94869.390381] [ T255287] amdgpu 0000:03:00.0: amdgpu: SMU is resumed successfully! [94869.550359] [ T255287] amdgpu 0000:03:00.0: amdgpu: rlc autoload: gc ucode autoload timeout [94869.550372] [ T255287] amdgpu 0000:03:00.0: amdgpu: resume of IP block <gfx_v10_0> failed -110 [94869.550378] [ T255287] amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). [94877.422621] [ T255166] pcieport 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible [94877.492483] [ T249695] amdgpu 0000:03:00.0: amdgpu: amdgpu: finishing device. 
[94877.492701] [ T249695] ------------[ cut here ]------------ [94877.492706] [ T249695] WARNING: CPU: 13 PID: 249695 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [94877.492833] [ T249695] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [94877.492924] [ T249695] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [94877.492993] [ T249695] CPU: 13 UID: 0 PID: 249695 Comm: kworker/u64:4 Tainted: G W 6.14.0-mystery-00198-g74adf9e35384 #36 [94877.493002] [ T249695] Tainted: [W]=WARN [94877.493009] [ T249695] Hardware name: Micro-Star International Co., Ltd. 
Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [94877.493016] [ T249695] Workqueue: kacpi_hotplug acpi_hotplug_work_fn [94877.493026] [ T249695] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [94877.493121] [ T249695] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 7f d0 5c c7 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 6e d0 5c c7 b8 ea ff ff ff e9 64 d0 5c c7 [94877.493132] [ T249695] RSP: 0018:ffffb97f105f7bd0 EFLAGS: 00010246 [94877.493142] [ T249695] RAX: ffff9f0638b5e490 RBX: ffff9f05059a0000 RCX: 0000000000000000 [94877.493150] [ T249695] RDX: 0000000000000000 RSI: ffff9f05059a0008 RDI: ffff9f056be00000 [94877.493159] [ T249695] RBP: ffff9f05059a0000 R08: 0000000000000001 R09: 0000000000000000 [94877.493170] [ T249695] R10: 000000000040003f R11: 0000000000000000 R12: ffff9f056be00000 [94877.493175] [ T249695] R13: ffffffffc0cac1a8 R14: ffffffffc0cac1a8 R15: ffff9f0501370500 [94877.493186] [ T249695] FS: 0000000000000000(0000) GS:ffff9f07ba940000(0000) knlGS:0000000000000000 [94877.493190] [ T249695] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [94877.493197] [ T249695] CR2: 00007fe7f6aa2000 CR3: 00000001055a0000 CR4: 0000000000750ef0 [94877.493201] [ T249695] PKRU: 55555554 [94877.493208] [ T249695] Call Trace: [94877.493213] [ T249695] <TASK> [94877.493221] [ T249695] ? __warn.cold+0x90/0x9e [94877.493229] [ T249695] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [94877.493324] [ T249695] ? report_bug+0xfa/0x140 [94877.493334] [ T249695] ? handle_bug+0x53/0x90 [94877.493343] [ T249695] ? exc_invalid_op+0x17/0x70 [94877.493352] [ T249695] ? asm_exc_invalid_op+0x1a/0x20 [94877.493363] [ T249695] ? 
amdgpu_irq_put+0x41/0x70 [amdgpu] [94877.493460] [ T249695] smu_smc_hw_cleanup+0x5e/0x3e0 [amdgpu] [94877.493596] [ T249695] smu_hw_fini+0xfb/0x1a0 [amdgpu] [94877.493709] [ T249695] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] [94877.493842] [ T249695] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] [94877.493965] [ T249695] amdgpu_pci_remove+0x40/0x70 [amdgpu] [94877.494048] [ T249695] pci_device_remove+0x3d/0xb0 [94877.494056] [ T249695] device_release_driver_internal+0x197/0x200 [94877.494065] [ T249695] pci_stop_bus_device+0x68/0x80 [94877.494073] [ T249695] pci_stop_bus_device+0x38/0x80 [94877.494081] [ T249695] pci_stop_and_remove_bus_device+0xd/0x20 [94877.494089] [ T249695] trim_stale_devices+0x147/0x1a0 [94877.494097] [ T249695] trim_stale_devices+0xa1/0x1a0 [94877.494105] [ T249695] acpiphp_check_bridge.part.0+0x126/0x170 [94877.494113] [ T249695] acpiphp_hotplug_notify+0xc1/0x260 [94877.494121] [ T249695] ? acpiphp_post_dock_fixup+0xe0/0xe0 [94877.494128] [ T249695] acpi_device_hotplug+0xc1/0x450 [94877.494136] [ T249695] acpi_hotplug_work_fn+0x19/0x30 [94877.494144] [ T249695] process_one_work+0x161/0x270 [94877.494152] [ T249695] worker_thread+0x30a/0x440 [94877.494161] [ T249695] ? rescuer_thread+0x500/0x500 [94877.494168] [ T249695] kthread+0xea/0x1e0 [94877.494176] [ T249695] ? kthreads_online_cpu+0xf0/0xf0 [94877.494184] [ T249695] ret_from_fork+0x2f/0x50 [94877.494192] [ T249695] ? kthreads_online_cpu+0xf0/0xf0 [94877.494200] [ T249695] ret_from_fork_asm+0x11/0x20 [94877.494209] [ T249695] </TASK> [94877.494216] [ T249695] ---[ end trace 0000000000000000 ]--- [94877.494239] [ T249695] amdgpu 0000:03:00.0: amdgpu: Fail to disable thermal alert! 
[94877.514281] [ T249695] ------------[ cut here ]------------ [94877.514294] [ T249695] WARNING: CPU: 10 PID: 249695 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.514409] [ T249695] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [94877.514518] [ T249695] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [94877.514584] [ T249695] CPU: 10 UID: 0 PID: 249695 Comm: kworker/u64:4 Tainted: G W 6.14.0-mystery-00198-g74adf9e35384 #36 [94877.514592] [ T249695] Tainted: [W]=WARN [94877.514597] [ T249695] Hardware name: Micro-Star International Co., Ltd. 
Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [94877.514602] [ T249695] Workqueue: kacpi_hotplug acpi_hotplug_work_fn [94877.514609] [ T249695] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.514718] [ T249695] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 a7 99 65 c7 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 e5 9d 5d 00 eb bd 0f 1f [94877.514724] [ T249695] RSP: 0018:ffffb97f105f7bb0 EFLAGS: 00010202 [94877.514730] [ T249695] RAX: 0000000000000000 RBX: ffff9f056be3b890 RCX: 0000000080000000 [94877.514736] [ T249695] RDX: ffff9f056be3b8e0 RSI: ffff9f056be3b8e8 RDI: ffff9f056be3b890 [94877.514741] [ T249695] RBP: ffff9f056be3b8e0 R08: 0000000000000000 R09: 00000000ffffffea [94877.514745] [ T249695] R10: ffff9f07e02fffa8 R11: 0000000000000003 R12: ffff9f056be3b8e8 [94877.514750] [ T249695] R13: ffff9f05b0b61000 R14: ffff9f056be0ef80 R15: ffff9f0501370500 [94877.514756] [ T249695] FS: 0000000000000000(0000) GS:ffff9f07ba880000(0000) knlGS:0000000000000000 [94877.514761] [ T249695] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [94877.514766] [ T249695] CR2: 000055e0b0eb1fe8 CR3: 0000000386c18000 CR4: 0000000000750ef0 [94877.514771] [ T249695] PKRU: 55555554 [94877.514776] [ T249695] Call Trace: [94877.514782] [ T249695] <TASK> [94877.514787] [ T249695] ? __warn.cold+0x90/0x9e [94877.514794] [ T249695] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.514901] [ T249695] ? report_bug+0xfa/0x140 [94877.514908] [ T249695] ? handle_bug+0x53/0x90 [94877.514914] [ T249695] ? exc_invalid_op+0x17/0x70 [94877.514920] [ T249695] ? asm_exc_invalid_op+0x1a/0x20 [94877.514928] [ T249695] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.515031] [ T249695] psp_v11_0_ring_destroy+0x2e/0x50 [amdgpu] [94877.515123] [ T249695] psp_hw_fini+0x126/0x380 [amdgpu] [94877.515208] [ T249695] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] [94877.515337] [ T249695] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] [94877.515451] [ T249695] amdgpu_pci_remove+0x40/0x70 [amdgpu] [94877.515533] [ T249695] pci_device_remove+0x3d/0xb0 [94877.515538] [ T249695] device_release_driver_internal+0x197/0x200 [94877.515544] [ T249695] pci_stop_bus_device+0x68/0x80 [94877.515549] [ T249695] pci_stop_bus_device+0x38/0x80 [94877.515554] [ T249695] pci_stop_and_remove_bus_device+0xd/0x20 [94877.515559] [ T249695] trim_stale_devices+0x147/0x1a0 [94877.515564] [ T249695] trim_stale_devices+0xa1/0x1a0 [94877.515570] [ T249695] acpiphp_check_bridge.part.0+0x126/0x170 [94877.515575] [ T249695] acpiphp_hotplug_notify+0xc1/0x260 [94877.515580] [ T249695] ? acpiphp_post_dock_fixup+0xe0/0xe0 [94877.515585] [ T249695] acpi_device_hotplug+0xc1/0x450 [94877.515590] [ T249695] acpi_hotplug_work_fn+0x19/0x30 [94877.515595] [ T249695] process_one_work+0x161/0x270 [94877.515599] [ T249695] worker_thread+0x30a/0x440 [94877.515604] [ T249695] ? rescuer_thread+0x500/0x500 [94877.515609] [ T249695] kthread+0xea/0x1e0 [94877.515614] [ T249695] ? kthreads_online_cpu+0xf0/0xf0 [94877.515619] [ T249695] ret_from_fork+0x2f/0x50 [94877.515624] [ T249695] ? 
kthreads_online_cpu+0xf0/0xf0 [94877.515629] [ T249695] ret_from_fork_asm+0x11/0x20 [94877.515636] [ T249695] </TASK> [94877.515639] [ T249695] ---[ end trace 0000000000000000 ]--- [94877.516672] [ T249695] ------------[ cut here ]------------ [94877.516676] [ T249695] WARNING: CPU: 10 PID: 249695 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [94877.516767] [ T249695] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [94877.516839] [ T249695] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [94877.516884] [ T249695] CPU: 10 UID: 0 PID: 249695 Comm: kworker/u64:4 
Tainted: G W 6.14.0-mystery-00198-g74adf9e35384 #36 [94877.516889] [ T249695] Tainted: [W]=WARN [94877.516893] [ T249695] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [94877.516896] [ T249695] Workqueue: kacpi_hotplug acpi_hotplug_work_fn [94877.516901] [ T249695] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [94877.516987] [ T249695] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 7f d0 5c c7 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 6e d0 5c c7 b8 ea ff ff ff e9 64 d0 5c c7 [94877.516991] [ T249695] RSP: 0018:ffffb97f105f7c10 EFLAGS: 00010246 [94877.516996] [ T249695] RAX: ffff9f0638b5efa8 RBX: ffff9f056be00000 RCX: 0000000000000000 [94877.516999] [ T249695] RDX: 0000000000000000 RSI: ffff9f056be00c78 RDI: ffff9f056be00000 [94877.517003] [ T249695] RBP: 0000000000000001 R08: 0000000000000001 R09: 0000000000000000 [94877.517007] [ T249695] R10: 000000000020001f R11: 0000000000000000 R12: ffff9f056be46e48 [94877.517010] [ T249695] R13: ffffffffc0cac1a8 R14: ffffffffc0cac1a8 R15: ffff9f0501370500 [94877.517014] [ T249695] FS: 0000000000000000(0000) GS:ffff9f07ba880000(0000) knlGS:0000000000000000 [94877.517018] [ T249695] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [94877.517021] [ T249695] CR2: 000055e0b0eb1fe8 CR3: 0000000386c18000 CR4: 0000000000750ef0 [94877.517025] [ T249695] PKRU: 55555554 [94877.517029] [ T249695] Call Trace: [94877.517032] [ T249695] <TASK> [94877.517037] [ T249695] ? __warn.cold+0x90/0x9e [94877.517043] [ T249695] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [94877.517147] [ T249695] ? report_bug+0xfa/0x140 [94877.517153] [ T249695] ? handle_bug+0x53/0x90 [94877.517159] [ T249695] ? exc_invalid_op+0x17/0x70 [94877.517164] [ T249695] ? asm_exc_invalid_op+0x1a/0x20 [94877.517171] [ T249695] ? 
amdgpu_irq_put+0x41/0x70 [amdgpu] [94877.517284] [ T249695] gmc_v10_0_hw_fini+0x52/0xb0 [amdgpu] [94877.517389] [ T249695] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] [94877.517550] [ T249695] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] [94877.517695] [ T249695] amdgpu_pci_remove+0x40/0x70 [amdgpu] [94877.517792] [ T249695] pci_device_remove+0x3d/0xb0 [94877.517799] [ T249695] device_release_driver_internal+0x197/0x200 [94877.517804] [ T249695] pci_stop_bus_device+0x68/0x80 [94877.517811] [ T249695] pci_stop_bus_device+0x38/0x80 [94877.517816] [ T249695] pci_stop_and_remove_bus_device+0xd/0x20 [94877.517822] [ T249695] trim_stale_devices+0x147/0x1a0 [94877.517828] [ T249695] trim_stale_devices+0xa1/0x1a0 [94877.517834] [ T249695] acpiphp_check_bridge.part.0+0x126/0x170 [94877.517840] [ T249695] acpiphp_hotplug_notify+0xc1/0x260 [94877.517846] [ T249695] ? acpiphp_post_dock_fixup+0xe0/0xe0 [94877.517852] [ T249695] acpi_device_hotplug+0xc1/0x450 [94877.517859] [ T249695] acpi_hotplug_work_fn+0x19/0x30 [94877.517864] [ T249695] process_one_work+0x161/0x270 [94877.517870] [ T249695] worker_thread+0x30a/0x440 [94877.517876] [ T249695] ? rescuer_thread+0x500/0x500 [94877.517881] [ T249695] kthread+0xea/0x1e0 [94877.517887] [ T249695] ? kthreads_online_cpu+0xf0/0xf0 [94877.517894] [ T249695] ret_from_fork+0x2f/0x50 [94877.517900] [ T249695] ? 
kthreads_online_cpu+0xf0/0xf0 [94877.517906] [ T249695] ret_from_fork_asm+0x11/0x20 [94877.517913] [ T249695] </TASK> [94877.517918] [ T249695] ---[ end trace 0000000000000000 ]--- [94877.517978] [ T249695] ------------[ cut here ]------------ [94877.517983] [ T249695] WARNING: CPU: 10 PID: 249695 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.518067] [ T249695] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [94877.518139] [ T249695] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [94877.518184] [ T249695] CPU: 10 UID: 0 PID: 249695 Comm: 
kworker/u64:4 Tainted: G W 6.14.0-mystery-00198-g74adf9e35384 #36 [94877.518188] [ T249695] Tainted: [W]=WARN [94877.518192] [ T249695] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [94877.518196] [ T249695] Workqueue: kacpi_hotplug acpi_hotplug_work_fn [94877.518201] [ T249695] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.518279] [ T249695] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 a7 99 65 c7 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 e5 9d 5d 00 eb bd 0f 1f [94877.518284] [ T249695] RSP: 0018:ffffb97f105f7be0 EFLAGS: 00010202 [94877.518288] [ T249695] RAX: ffff9f07ba8a5d80 RBX: ffff9f056be14b60 RCX: 000000000000016f [94877.518292] [ T249695] RDX: ffff9f056be14b68 RSI: ffff9f056be14b70 RDI: ffff9f056be14b60 [94877.518295] [ T249695] RBP: ffff9f056be14b68 R08: 00000000000056ee R09: 0000000000000009 [94877.518299] [ T249695] R10: 0000000000000008 R11: 0000000000000139 R12: ffff9f056be14b70 [94877.518303] [ T249695] R13: ffff9f0519e18800 R14: ffff9f056be0ef80 R15: ffff9f0501370500 [94877.518306] [ T249695] FS: 0000000000000000(0000) GS:ffff9f07ba880000(0000) knlGS:0000000000000000 [94877.518310] [ T249695] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [94877.518314] [ T249695] CR2: 000055e0b0eb1fe8 CR3: 0000000386c18000 CR4: 0000000000750ef0 [94877.518318] [ T249695] PKRU: 55555554 [94877.518321] [ T249695] Call Trace: [94877.518325] [ T249695] <TASK> [94877.518329] [ T249695] ? __warn.cold+0x90/0x9e [94877.518333] [ T249695] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.518412] [ T249695] ? report_bug+0xfa/0x140 [94877.518417] [ T249695] ? handle_bug+0x53/0x90 [94877.518421] [ T249695] ? exc_invalid_op+0x17/0x70 [94877.518425] [ T249695] ? asm_exc_invalid_op+0x1a/0x20 [94877.518431] [ T249695] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.518518] [ T249695] amdgpu_ih_ring_fini+0x4f/0x80 [amdgpu] [94877.518608] [ T249695] amdgpu_irq_fini_hw+0x2f/0x80 [amdgpu] [94877.518693] [ T249695] amdgpu_device_fini_hw+0x231/0x2ad [amdgpu] [94877.518821] [ T249695] amdgpu_pci_remove+0x40/0x70 [amdgpu] [94877.518898] [ T249695] pci_device_remove+0x3d/0xb0 [94877.518903] [ T249695] device_release_driver_internal+0x197/0x200 [94877.518908] [ T249695] pci_stop_bus_device+0x68/0x80 [94877.518913] [ T249695] pci_stop_bus_device+0x38/0x80 [94877.518918] [ T249695] pci_stop_and_remove_bus_device+0xd/0x20 [94877.518922] [ T249695] trim_stale_devices+0x147/0x1a0 [94877.518928] [ T249695] trim_stale_devices+0xa1/0x1a0 [94877.518932] [ T249695] acpiphp_check_bridge.part.0+0x126/0x170 [94877.518937] [ T249695] acpiphp_hotplug_notify+0xc1/0x260 [94877.518942] [ T249695] ? acpiphp_post_dock_fixup+0xe0/0xe0 [94877.518947] [ T249695] acpi_device_hotplug+0xc1/0x450 [94877.518952] [ T249695] acpi_hotplug_work_fn+0x19/0x30 [94877.518956] [ T249695] process_one_work+0x161/0x270 [94877.518960] [ T249695] worker_thread+0x30a/0x440 [94877.518965] [ T249695] ? rescuer_thread+0x500/0x500 [94877.518969] [ T249695] kthread+0xea/0x1e0 [94877.518974] [ T249695] ? kthreads_online_cpu+0xf0/0xf0 [94877.518979] [ T249695] ret_from_fork+0x2f/0x50 [94877.518984] [ T249695] ? 
kthreads_online_cpu+0xf0/0xf0 [94877.518989] [ T249695] ret_from_fork_asm+0x11/0x20 [94877.518996] [ T249695] </TASK> [94877.518999] [ T249695] ---[ end trace 0000000000000000 ]--- [94877.519192] [ T249695] ------------[ cut here ]------------ [94877.519201] [ T249695] WARNING: CPU: 10 PID: 249695 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.519280] [ T249695] Modules linked in: ec_sys netconsole sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic btusb btrtl btintel snd_hda_codec_hdmi btbcm btmtk snd_hda_intel uvcvideo snd_intel_dspcfg videobuf2_vmalloc videobuf2_memops snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn uvc bluetooth snd_hda_codec videobuf2_v4l2 snd_soc_core snd_hwdep snd_hda_core videodev snd_pcm_oss snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_acp_config snd_soc_acpi msi_wmi ecdh_generic ecc sparse_keymap snd_timer wmi_bmof mc snd ccp soundcore k10temp snd_pci_acp3x battery ac button hid_sensor_prox hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf industrialio amd_pmc hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse [94877.519352] [ T249695] nvme_fabrics efi_pstore configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp xhci_hcd drm_buddy hid_multitouch hid_sensor_hub gpu_sched mfd_core hid_generic i2c_hid_acpi drm_display_helper psmouse usbcore amd_sfh i2c_hid nvme hid drm_kms_helper serio_raw nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [94877.519397] [ T249695] CPU: 10 UID: 0 PID: 249695 Comm: 
kworker/u64:4 Tainted: G W 6.14.0-mystery-00198-g74adf9e35384 #36 [94877.519402] [ T249695] Tainted: [W]=WARN [94877.519405] [ T249695] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [94877.519409] [ T249695] Workqueue: kacpi_hotplug acpi_hotplug_work_fn [94877.519414] [ T249695] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.519500] [ T249695] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 a7 99 65 c7 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 e5 9d 5d 00 eb bd 0f 1f [94877.519505] [ T249695] RSP: 0018:ffffb97f105f7c28 EFLAGS: 00010202 [94877.519509] [ T249695] RAX: 0000000000000000 RBX: ffff9f056be00a20 RCX: 0000000000000001 [94877.519513] [ T249695] RDX: ffff9f056be00a28 RSI: 0000000000000000 RDI: ffff9f056be00a20 [94877.519517] [ T249695] RBP: ffff9f056be00a28 R08: 0000000000000020 R09: ffffffff891e8340 [94877.519520] [ T249695] R10: 0000000000000020 R11: 000000000000016d R12: 0000000000000000 [94877.519524] [ T249695] R13: ffff9f0519e1b800 R14: ffff9f056be0ef80 R15: ffff9f0501370500 [94877.519528] [ T249695] FS: 0000000000000000(0000) GS:ffff9f07ba880000(0000) knlGS:0000000000000000 [94877.519532] [ T249695] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [94877.519535] [ T249695] CR2: 000055e0b0eb1fe8 CR3: 0000000386c18000 CR4: 0000000000750ef0 [94877.519539] [ T249695] PKRU: 55555554 [94877.519543] [ T249695] Call Trace: [94877.519546] [ T249695] <TASK> [94877.519550] [ T249695] ? __warn.cold+0x90/0x9e [94877.519555] [ T249695] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [94877.519633] [ T249695] ? report_bug+0xfa/0x140 [94877.519638] [ T249695] ? handle_bug+0x53/0x90 [94877.519642] [ T249695] ? exc_invalid_op+0x17/0x70 [94877.519647] [ T249695] ? asm_exc_invalid_op+0x1a/0x20 [94877.519652] [ T249695] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu]
[94877.519731] [ T249695] amdgpu_device_unmap_mmio+0x25/0x90 [amdgpu]
[94877.519808] [ T249695] amdgpu_pci_remove+0x40/0x70 [amdgpu]
[94877.519885] [ T249695] pci_device_remove+0x3d/0xb0
[94877.519889] [ T249695] device_release_driver_internal+0x197/0x200
[94877.519894] [ T249695] pci_stop_bus_device+0x68/0x80
[94877.519899] [ T249695] pci_stop_bus_device+0x38/0x80
[94877.519904] [ T249695] pci_stop_and_remove_bus_device+0xd/0x20
[94877.519908] [ T249695] trim_stale_devices+0x147/0x1a0
[94877.519913] [ T249695] trim_stale_devices+0xa1/0x1a0
[94877.519918] [ T249695] acpiphp_check_bridge.part.0+0x126/0x170
[94877.519923] [ T249695] acpiphp_hotplug_notify+0xc1/0x260
[94877.519928] [ T249695] ? acpiphp_post_dock_fixup+0xe0/0xe0
[94877.519933] [ T249695] acpi_device_hotplug+0xc1/0x450
[94877.519938] [ T249695] acpi_hotplug_work_fn+0x19/0x30
[94877.519942] [ T249695] process_one_work+0x161/0x270
[94877.519947] [ T249695] worker_thread+0x30a/0x440
[94877.519951] [ T249695] ? rescuer_thread+0x500/0x500
[94877.519956] [ T249695] kthread+0xea/0x1e0
[94877.519961] [ T249695] ? kthreads_online_cpu+0xf0/0xf0
[94877.519965] [ T249695] ret_from_fork+0x2f/0x50
[94877.519970] [ T249695] ? kthreads_online_cpu+0xf0/0xf0
[94877.519975] [ T249695] ret_from_fork_asm+0x11/0x20
[94877.519981] [ T249695] </TASK>
[94877.519985] [ T249695] ---[ end trace 0000000000000000 ]---
[94878.445496] [ T255166] pcieport 0000:02:00.0: Data Link Layer Link Active not set in 1000 msec
[94878.446706] [ T249695] pci_bus 0000:03: busn_res: [bus 03] is released

Bert Karwatzki
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-10-31 18:35 ` Bert Karwatzki
@ 2025-11-05 11:44 ` Bert Karwatzki
2025-11-05 21:31 ` Mario Limonciello (AMD) (kernel.org)
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-11-05 11:44 UTC (permalink / raw)
To: Christian König, Mario Limonciello, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
spasswolf

I finally got a result from kdump regarding this bug. As I mentioned, I'm
currently trying to bisect this (again ...) between v6.14 and v6.15. My test
setup for overnight tests is to play a long YouTube video and then simulate
some interactivity by running this script:

#!/bin/bash
for i in {0..10000}
do
    echo $i
    evolution &
    sleep 3
    killall evolution
    sleep 27
done

I'm not done with the bisection yet, but last night I got a result from kdump
showing a NULL pointer dereference after a loss of the discrete GPU (see the
log below). This may be a different bug, though, as it did not result in a
reboot but in a hang instead.

faddr2line gives this regarding the NULL pointer:

$ scripts/faddr2line drivers/gpu/drm/ttm/ttm_resource.o ttm_resource_move_to_lru_tail+0xc1/0xe0
ttm_resource_move_to_lru_tail+0xc1/0xe0:
list_add_tail at /mnt/data/linux-forest/mystery_shutdown/./include/linux/list.h:183
(inlined by) list_move_tail at /mnt/data/linux-forest/mystery_shutdown/./include/linux/list.h:311
(inlined by) ttm_resource_move_to_lru_tail at /mnt/data/linux-forest/mystery_shutdown/drivers/gpu/drm/ttm/ttm_resource.c:291

So I probably should use CONFIG_DEBUG_LIST from now on.

[13600.900669] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Link Down
[13600.900678] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Card not present
[13600.971642] [ T53331] amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:7 param:0x00000000 message:DisableAllSmuFeatures?
[13600.971649] [ T53331] amdgpu 0000:03:00.0: amdgpu: Failed to disable smu features.
[13600.971653] [ T53331] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13600.971656] [ T53331] amdgpu 0000:03:00.0: amdgpu: [PrepareMp1] Failed!
[13600.971658] [ T53331] [drm:amdgpu_device_ip_suspend_phase2 [amdgpu]] *ERROR* SMC failed to set mp1 state 2, -121
[13600.971779] [ T53331] amdgpu 0000:03:00.0: Unable to change power state from D0 to D3hot, device inaccessible
[13600.971809] [ T140] amdgpu 0000:03:00.0: Unable to change power state from D3cold to D0, device inaccessible
[13611.504805] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
[13611.504924] [ T140] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0)
[13611.504930] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
[13611.504933] [ T140] amdgpu 0000:03:00.0: amdgpu: dpm has been enabled
[13611.504936] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is resumed successfully!
[13611.664765] [ T140] amdgpu 0000:03:00.0: amdgpu: rlc autoload: gc ucode autoload timeout
[13611.664771] [ T140] amdgpu 0000:03:00.0: amdgpu: resume of IP block <gfx_v10_0> failed -110
[13611.664775] [ T140] amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_resume failed (-110).
[13611.665372] [ T32730] pcieport 0000:02:00.0: Unable to change power state from D0 to D3hot, device inaccessible
[13611.666216] [ T32730] pcieport 0000:01:00.0: Unable to change power state from D0 to D3hot, device inaccessible
[13611.763659] [ T140] amdgpu 0000:03:00.0: amdgpu: amdgpu: finishing device.
[13611.763798] [ T140] ------------[ cut here ]------------ [13611.763801] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13611.763924] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13611.763993] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13611.764031] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Not tainted 6.14.0-rc1-mystery-00004-g822c11592522 #43 [13611.764034] [ T140] Hardware name: Micro-Star International Co., Ltd. 
Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13611.764036] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13611.764129] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13611.764132] [ T140] RSP: 0018:ffff9ac3c0743bf0 EFLAGS: 00010246 [13611.764135] [ T140] RAX: ffff8c3200c40280 RBX: ffff8c3200dba000 RCX: 0000000000000000 [13611.764137] [ T140] RDX: 0000000000000000 RSI: ffff8c3200dba008 RDI: ffff8c320a600000 [13611.764139] [ T140] RBP: ffff8c3200dba000 R08: 0000000000000001 R09: 0000000000000000 [13611.764141] [ T140] R10: 000000000040003f R11: 0000000000000000 R12: ffff8c320a600000 [13611.764142] [ T140] R13: ffffffffc0be01a8 R14: ffffffffc0be01a8 R15: ffff9ac3c0743d6e [13611.764144] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13611.764146] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13611.764148] [ T140] CR2: 000055fcf3dd7e44 CR3: 0000000271a64000 CR4: 0000000000750ef0 [13611.764150] [ T140] PKRU: 55555554 [13611.764152] [ T140] Call Trace: [13611.764155] [ T140] <TASK> [13611.764157] [ T140] ? __warn.cold+0x90/0x9e [13611.764162] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13611.764248] [ T140] ? report_bug+0xfa/0x140 [13611.764253] [ T140] ? handle_bug+0x53/0x90 [13611.764256] [ T140] ? exc_invalid_op+0x17/0x70 [13611.764259] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13611.764263] [ T140] ? 
amdgpu_irq_put+0x41/0x70 [amdgpu] [13611.764346] [ T140] smu_smc_hw_cleanup+0x5e/0x3e0 [amdgpu] [13611.764460] [ T140] smu_hw_fini+0xfb/0x1a0 [amdgpu] [13611.764573] [ T140] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] [13611.764699] [ T140] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] [13611.764815] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] [13611.764893] [ T140] pci_device_remove+0x3d/0xb0 [13611.764897] [ T140] device_release_driver_internal+0x197/0x200 [13611.764900] [ T140] pci_stop_bus_device+0x68/0x80 [13611.764904] [ T140] pci_stop_bus_device+0x38/0x80 [13611.764907] [ T140] pci_stop_bus_device+0x27/0x80 [13611.764909] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 [13611.764912] [ T140] pciehp_unconfigure_device+0x93/0x160 [13611.764916] [ T140] pciehp_disable_slot+0x62/0x100 [13611.764919] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 [13611.764922] [ T140] pciehp_ist+0x13b/0x180 [13611.764925] [ T140] irq_thread_fn+0x1e/0x60 [13611.764929] [ T140] irq_thread+0x114/0x1e0 [13611.764932] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13611.764935] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13611.764938] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13611.764941] [ T140] kthread+0xea/0x1e0 [13611.764945] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13611.764948] [ T140] ret_from_fork+0x2f/0x50 [13611.764951] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13611.764954] [ T140] ret_from_fork_asm+0x11/0x20 [13611.764959] [ T140] </TASK> [13611.764960] [ T140] ---[ end trace 0000000000000000 ]--- [13611.764963] [ T140] amdgpu 0000:03:00.0: amdgpu: Fail to disable thermal alert! 
[13611.785004] [ T140] ------------[ cut here ]------------ [13611.785008] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.785128] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13611.785216] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13611.785262] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 [13611.785266] [ T140] Tainted: [W]=WARN [13611.785268] [ T140] Hardware name: Micro-Star International Co., Ltd. 
Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13611.785271] [ T140] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.785377] [ T140] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 e7 49 b2 c5 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 85 97 5d 00 eb bd 0f 1f [13611.785380] [ T140] RSP: 0018:ffff9ac3c0743bd0 EFLAGS: 00010202 [13611.785384] [ T140] RAX: 0000000000000000 RBX: ffff8c320a63b830 RCX: 0000000080000000 [13611.785386] [ T140] RDX: ffff8c320a63b880 RSI: ffff8c320a63b888 RDI: ffff8c320a63b830 [13611.785389] [ T140] RBP: ffff8c320a63b880 R08: 0000000000000000 R09: 00000000ffffffea [13611.785391] [ T140] R10: ffff8c34e02fffa8 R11: 0000000000000003 R12: ffff8c320a63b888 [13611.785393] [ T140] R13: ffff8c3205b49800 R14: ffff8c320a60ef80 R15: ffff9ac3c0743d6e [13611.785395] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13611.785398] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13611.785400] [ T140] CR2: 000055fcf3dd7e44 CR3: 0000000271a64000 CR4: 0000000000750ef0 [13611.785403] [ T140] PKRU: 55555554 [13611.785405] [ T140] Call Trace: [13611.785408] [ T140] <TASK> [13611.785410] [ T140] ? __warn.cold+0x90/0x9e [13611.785415] [ T140] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.785530] [ T140] ? report_bug+0xfa/0x140 [13611.785535] [ T140] ? handle_bug+0x53/0x90 [13611.785540] [ T140] ? exc_invalid_op+0x17/0x70 [13611.785543] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13611.785548] [ T140] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.785651] [ T140] psp_v11_0_ring_destroy+0x2e/0x50 [amdgpu] [13611.785771] [ T140] psp_hw_fini+0x126/0x380 [amdgpu] [13611.785856] [ T140] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] [13611.785974] [ T140] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] [13611.786084] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] [13611.786160] [ T140] pci_device_remove+0x3d/0xb0 [13611.786164] [ T140] device_release_driver_internal+0x197/0x200 [13611.786167] [ T140] pci_stop_bus_device+0x68/0x80 [13611.786170] [ T140] pci_stop_bus_device+0x38/0x80 [13611.786173] [ T140] pci_stop_bus_device+0x27/0x80 [13611.786175] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 [13611.786178] [ T140] pciehp_unconfigure_device+0x93/0x160 [13611.786181] [ T140] pciehp_disable_slot+0x62/0x100 [13611.786184] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 [13611.786201] [ T140] pciehp_ist+0x13b/0x180 [13611.786204] [ T140] irq_thread_fn+0x1e/0x60 [13611.786208] [ T140] irq_thread+0x114/0x1e0 [13611.786211] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13611.786213] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13611.786216] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13611.786219] [ T140] kthread+0xea/0x1e0 [13611.786223] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13611.786226] [ T140] ret_from_fork+0x2f/0x50 [13611.786229] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13611.786232] [ T140] ret_from_fork_asm+0x11/0x20 [13611.786236] [ T140] </TASK> [13611.786238] [ T140] ---[ end trace 0000000000000000 ]--- [13611.787301] [ T140] ------------[ cut here ]------------ [13611.787303] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.787382] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13611.787447] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13611.787494] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13611.787497] [ T140] Tainted: [W]=WARN [13611.787499] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13611.787501] [ T140] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.787578] [ T140] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 e7 49 b2 c5 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 85 97 5d 00 eb bd 0f 1f [13611.787580] [ T140] RSP: 0018:ffff9ac3c0743c00 EFLAGS: 00010202 [13611.787583] [ T140] RAX: ffff8c34ba7a5d80 RBX: ffff8c320a614b60 RCX: 000000000000016f [13611.787585] [ T140] RDX: ffff8c320a614b68 RSI: ffff8c320a614b70 RDI: ffff8c320a614b60 [13611.787586] [ T140] RBP: ffff8c320a614b68 R08: 00000000000056ee R09: 0000000000000009 [13611.787588] [ T140] R10: 00000000000000b2 R11: 000000000000000a R12: ffff8c320a614b70 [13611.787590] [ T140] R13: ffff8c320a6cb400 R14: ffff8c320a60ef80 R15: ffff9ac3c0743d6e [13611.787592] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13611.787594] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13611.787596] [ T140] CR2: 000055fcf3dd7e44 CR3: 0000000271a64000 CR4: 0000000000750ef0 [13611.787597] [ T140] PKRU: 55555554 [13611.787599] [ T140] Call Trace: [13611.787601] [ T140] <TASK> [13611.787603] [ T140] ? __warn.cold+0x90/0x9e [13611.787606] [ T140] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.787691] [ T140] ? report_bug+0xfa/0x140 [13611.787695] [ T140] ? handle_bug+0x53/0x90 [13611.787699] [ T140] ? exc_invalid_op+0x17/0x70 [13611.787702] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13611.787707] [ T140] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.787801] [ T140] amdgpu_ih_ring_fini+0x4f/0x80 [amdgpu] [13611.787909] [ T140] amdgpu_irq_fini_hw+0x2f/0x80 [amdgpu] [13611.788011] [ T140] amdgpu_device_fini_hw+0x231/0x2ad [amdgpu] [13611.788156] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] [13611.788249] [ T140] pci_device_remove+0x3d/0xb0 [13611.788253] [ T140] device_release_driver_internal+0x197/0x200 [13611.788257] [ T140] pci_stop_bus_device+0x68/0x80 [13611.788261] [ T140] pci_stop_bus_device+0x38/0x80 [13611.788264] [ T140] pci_stop_bus_device+0x27/0x80 [13611.788267] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 [13611.788270] [ T140] pciehp_unconfigure_device+0x93/0x160 [13611.788274] [ T140] pciehp_disable_slot+0x62/0x100 [13611.788277] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 [13611.788281] [ T140] pciehp_ist+0x13b/0x180 [13611.788284] [ T140] irq_thread_fn+0x1e/0x60 [13611.788288] [ T140] irq_thread+0x114/0x1e0 [13611.788291] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13611.788295] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13611.788298] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13611.788302] [ T140] kthread+0xea/0x1e0 [13611.788306] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13611.788309] [ T140] ret_from_fork+0x2f/0x50 [13611.788313] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13611.788317] [ T140] ret_from_fork_asm+0x11/0x20 [13611.788322] [ T140] </TASK> [13611.788324] [ T140] ---[ end trace 0000000000000000 ]--- [13611.789149] [ T140] ------------[ cut here ]------------ [13611.789151] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.789230] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13611.789294] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13611.789328] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13611.789331] [ T140] Tainted: [W]=WARN [13611.789333] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13611.789334] [ T140] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.789411] [ T140] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 e7 49 b2 c5 <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 85 97 5d 00 eb bd 0f 1f [13611.789413] [ T140] RSP: 0018:ffff9ac3c0743c48 EFLAGS: 00010202 [13611.789416] [ T140] RAX: 0000000000000000 RBX: ffff8c320a600a20 RCX: 0000000000000000 [13611.789418] [ T140] RDX: ffff8c320a600a28 RSI: 0000000000000000 RDI: ffff8c320a600a20 [13611.789420] [ T140] RBP: ffff8c320a600a28 R08: ffff8c3221bd1c18 R09: 00007f363c49e000 [13611.789421] [ T140] R10: 0000000000000020 R11: 000000000000009d R12: 0000000000000000 [13611.789423] [ T140] R13: ffff8c320a6cb000 R14: ffff8c320a60ef80 R15: ffff9ac3c0743d6e [13611.789425] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13611.789427] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13611.789429] [ T140] CR2: 000055fcf3dd7e44 CR3: 0000000271a64000 CR4: 0000000000750ef0 [13611.789430] [ T140] PKRU: 55555554 [13611.789432] [ T140] Call Trace: [13611.789434] [ T140] <TASK> [13611.789436] [ T140] ? __warn.cold+0x90/0x9e [13611.789439] [ T140] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.789524] [ T140] ? report_bug+0xfa/0x140 [13611.789528] [ T140] ? handle_bug+0x53/0x90 [13611.789531] [ T140] ? exc_invalid_op+0x17/0x70 [13611.789533] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13611.789537] [ T140] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] [13611.789614] [ T140] amdgpu_device_unmap_mmio+0x25/0x90 [amdgpu] [13611.789689] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] [13611.789765] [ T140] pci_device_remove+0x3d/0xb0 [13611.789768] [ T140] device_release_driver_internal+0x197/0x200 [13611.789771] [ T140] pci_stop_bus_device+0x68/0x80 [13611.789774] [ T140] pci_stop_bus_device+0x38/0x80 [13611.789776] [ T140] pci_stop_bus_device+0x27/0x80 [13611.789779] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 [13611.789782] [ T140] pciehp_unconfigure_device+0x93/0x160 [13611.789785] [ T140] pciehp_disable_slot+0x62/0x100 [13611.789787] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 [13611.789790] [ T140] pciehp_ist+0x13b/0x180 [13611.789793] [ T140] irq_thread_fn+0x1e/0x60 [13611.789796] [ T140] irq_thread+0x114/0x1e0 [13611.789799] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13611.789801] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13611.789805] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13611.789807] [ T140] kthread+0xea/0x1e0 [13611.789810] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13611.789814] [ T140] ret_from_fork+0x2f/0x50 [13611.789817] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0
[13611.789820] [ T140] ret_from_fork_asm+0x11/0x20
[13611.789824] [ T140] </TASK>
[13611.789826] [ T140] ---[ end trace 0000000000000000 ]---
[13612.510583] [ T140] pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible
[13612.510746] [ T140] pcieport 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
[13612.515993] [ T140] pci_bus 0000:03: busn_res: [bus 03] is released
[13612.517813] [ T140] pci_bus 0000:02: busn_res: [bus 02-03] is released
[13612.517957] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Card present
[13612.517960] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Link Up
[13612.646970] [ T140] pci 0000:01:00.0: [1002:1478] type 01 class 0x060400 PCIe Switch Upstream Port
[13612.647337] [ T140] pci 0000:01:00.0: BAR 0 [mem 0xfcc00000-0xfcc03fff]
[13612.647459] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03]
[13612.648148] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
[13612.649293] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
[13612.651803] [ T140] pci 0000:01:00.0: PME# supported from D0 D3hot D3cold
[13612.655075] [ T140] pci 0000:01:00.0: 16.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x8 link at 0000:00:01.1 (capable of 126.024 Gb/s with 16.0 GT/s PCIe x8 link)
[13612.655993] [ T140] pci 0000:01:00.0: Adding to iommu group 12
[13612.657710] [ T140] pci 0000:02:00.0: [1002:1479] type 01 class 0x060400 PCIe Switch Downstream Port
[13612.658068] [ T140] pci 0000:02:00.0: PCI bridge to [bus 03]
[13612.658078] [ T140] pci 0000:02:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
[13612.658315] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
[13612.661595] [ T140] pci 0000:02:00.0: PME# supported from D0 D3hot D3cold
[13612.667373] [ T140] pci 0000:02:00.0: Adding to iommu group 13
[13612.667877] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03]
[13612.668858] [ T140] pci 0000:03:00.0: [1002:73ff] type 00 class 0x038000 PCIe Legacy Endpoint
[13612.669236] [ T140] pci 0000:03:00.0: BAR 0 [mem 0xfc00000000-0xfdffffffff 64bit pref]
[13612.669241] [ T140] pci 0000:03:00.0: BAR 2 [mem 0xfe00000000-0xfe0fffffff 64bit pref]
[13612.669480] [ T140] pci 0000:03:00.0: BAR 5 [mem 0xfca00000-0xfcafffff]
[13612.669484] [ T140] pci 0000:03:00.0: ROM [mem 0xfcb00000-0xfcb1ffff pref]
[13612.672312] [ T140] pci 0000:03:00.0: PME# supported from D1 D2 D3hot D3cold
[13612.673659] [ T140] pci 0000:03:00.0: 16.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x8 link at 0000:00:01.1 (capable of 252.048 Gb/s with 16.0 GT/s PCIe x16 link)
[13612.675633] [ T140] pci 0000:03:00.0: Adding to iommu group 14
[13612.676122] [ T140] pci 0000:03:00.1: [1002:ab28] type 00 class 0x040300 PCIe Legacy Endpoint
[13612.677265] [ T140] pci 0000:03:00.1: BAR 0 [mem 0xfcb20000-0xfcb23fff]
[13612.678612] [ T140] pci 0000:03:00.1: PME# supported from D1 D2 D3hot D3cold
[13612.679722] [ T140] pci 0000:03:00.1: Adding to iommu group 15
[13612.680123] [ T140] pci 0000:02:00.0: PCI bridge to [bus 03]
[13612.680834] [ T140] pcieport 0000:00:01.1: Assigned bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 01-03] cannot fit 0x300000000 required for 0000:02:00.0 bridging to [bus 03]
[13612.680838] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 03] requires relaxed alignment rules
[13612.680842] [ T140] pcieport 0000:00:01.1: Assigned bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 01-03] cannot fit 0x400000000 required for 0000:01:00.0 bridging to [bus 02-03]
[13612.680845] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 02-03] requires relaxed alignment rules
[13612.680851] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]: assigned
[13612.680853] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff]: assigned
[13612.680856] [ T140] pci 0000:01:00.0: BAR 0 [mem 0xfcc00000-0xfcc03fff]: assigned
[13612.680861] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]: assigned
[13612.680864] [ T140] pci 0000:02:00.0: bridge window [mem 0xfca00000-0xfcbfffff]: assigned
[13612.680867] [ T140] pci 0000:03:00.0: BAR 0 [mem 0xfc00000000-0xfdffffffff 64bit pref]: assigned
[13612.680877] [ T140] pci 0000:03:00.0: BAR 2 [mem 0xfe00000000-0xfe0fffffff 64bit pref]: assigned
[13612.680888] [ T140] pci 0000:03:00.0: BAR 5 [mem 0xfca00000-0xfcafffff]: assigned
[13612.681010] [ T140] pci 0000:03:00.0: ROM [mem 0xfcb00000-0xfcb1ffff pref]: assigned
[13612.681012] [ T140] pci 0000:03:00.1: BAR 0 [mem 0xfcb20000-0xfcb23fff]: assigned
[13612.681233] [ T140] pci 0000:02:00.0: PCI bridge to [bus 03]
[13612.681589] [ T140] pci 0000:02:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
[13612.681707] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
[13612.681714] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03]
[13612.681833] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
[13612.681960] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
[13612.681967] [ T140] pcieport 0000:00:01.1: PCI bridge to [bus 01-03]
[13612.681970] [ T140] pcieport 0000:00:01.1: bridge window [io 0x1000-0x1fff]
[13612.681974] [ T140] pcieport 0000:00:01.1: bridge window [mem 0xfca00000-0xfccfffff]
[13612.681977] [ T140] pcieport 0000:00:01.1: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
[13612.684765] [ T140] [drm] initializing kernel modesetting (DIMGREY_CAVEFISH 0x1002:0x73FF 0x1462:0x1313 0xC3).
[13612.685143] [ T140] [drm] register mmio base: 0xFCA00000
[13612.685145] [ T140] [drm] register mmio size: 1048576
[13614.899803] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 0 <nv_common>
[13614.899811] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 1 <gmc_v10_0>
[13614.899815] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 2 <navi10_ih>
[13614.899818] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 3 <psp>
[13614.899821] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 4 <smu>
[13614.899824] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 5 <dm>
[13614.899827] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 6 <gfx_v10_0>
[13614.899831] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 7 <sdma_v5_2>
[13614.899834] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 8 <vcn_v3_0>
[13614.899837] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 9 <jpeg_v3_0>
[13614.899982] [ T140] amdgpu 0000:03:00.0: amdgpu: ACPI VFCT table present but broken (too short #2),skipping
[13615.118633] [ T140] amdgpu 0000:03:00.0: amdgpu: Fetched VBIOS from ROM BAR
[13615.118640] [ T140] amdgpu: ATOM BIOS: SWBRT77181.001
[13615.126013] [ T140] amdgpu 0000:03:00.0: amdgpu: Trusted Memory Zone (TMZ) feature disabled as experimental (default)
[13615.126355] [ T140] amdgpu 0000:03:00.0: amdgpu: MODE1 reset
[13615.126359] [ T140] amdgpu 0000:03:00.0: amdgpu: GPU mode1 reset
[13615.128943] [ T140] amdgpu 0000:03:00.0: amdgpu: GPU smu mode1 reset
[13615.633347] [ T140] [drm] GPU posting now...
[13615.633384] [ T140] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
[13615.633393] [ T140] amdgpu 0000:03:00.0: amdgpu: VRAM: 8176M 0x0000008000000000 - 0x00000081FEFFFFFF (8176M used)
[13615.633397] [ T140] amdgpu 0000:03:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
[13615.633408] [ T140] [drm] Detected VRAM RAM=8176M, BAR=8192M
[13615.633410] [ T140] [drm] RAM width 128bits GDDR6
[13615.633554] [ T140] [drm] amdgpu: 8176M of VRAM memory ready
[13615.633557] [ T140] [drm] amdgpu: 6895M of GTT memory ready.
[13615.633574] [ T140] [drm] GART: num cpu pages 131072, num gpu pages 131072
[13615.635159] [ T140] [drm] PCIE GART of 512M enabled (table at 0x00000081FEB00000).
[13630.506743] [ T140] amdgpu 0000:03:00.0: amdgpu: STB initialized to 2048 entries
[13630.506836] [ T140] [drm] Loading DMUB firmware via PSP: version=0x02020020
[13630.510341] [ T140] [drm] use_doorbell being set to: [true]
[13630.511712] [ T140] [drm] use_doorbell being set to: [true]
[13630.511733] [ T140] [drm] Found VCN firmware Version ENC: 1.33 DEC: 4 VEP: 0 Revision: 6
[13630.685530] [ T140] amdgpu 0000:03:00.0: amdgpu: reserve 0xa00000 from 0x81fd000000 for PSP TMR
[13631.015392] [ T140] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
[13631.046278] [ T140] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[13631.046644] [ T140] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0)
[13631.046649] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
[13631.046909] [ T140] amdgpu 0000:03:00.0: amdgpu: use vbios provided pptable
[13631.153391] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:12 param:0x00000000 message:GetEnabledSmuFeaturesLow?
[13631.153396] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153399] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153401] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153404] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153406] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153408] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153411] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153413] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153415] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153417] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153419] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153421] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153423] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153425] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153427] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153429] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153431] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153433] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153435] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153437] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153438] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153440] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153442] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153444] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153446] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153448] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153451] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
[13631.153453] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
[13631.153455] [ T140] amdgpu 0000:03:00.0: amdgpu: Attempt to override pcie params failed!
[13631.153457] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to setup smc hw!
[13631.153459] [ T140] [drm:amdgpu_device_init.cold [amdgpu]] *ERROR* hw_init of IP block <smu> failed -121
[13631.153638] [ T140] amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_init failed
[13631.153641] [ T140] amdgpu 0000:03:00.0: amdgpu: Fatal error during GPU init
[13631.161507] [ T140] amdgpu 0000:03:00.0: amdgpu: amdgpu: finishing device.
[13631.161633] [ T140] ------------[ cut here ]------------ [13631.161635] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.161751] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.161831] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.161876] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.161881] [ T140] Tainted: [W]=WARN [13631.161882] [ T140] Hardware name: Micro-Star International Co., Ltd. 
Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.161885] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.161990] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.161992] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.161996] [ T140] RAX: ffff8c336755a3c0 RBX: ffff8c32a5898890 RCX: 0000000000000000 [13631.161998] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 [13631.162000] [ T140] RBP: ffff8c32a5890250 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.162001] [ T140] R10: 0000000000000082 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.162003] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.162005] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.162007] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.162009] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.162011] [ T140] PKRU: 55555554 [13631.162013] [ T140] Call Trace: [13631.162016] [ T140] <TASK> [13631.162019] [ T140] ? __warn.cold+0x90/0x9e [13631.162025] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.162140] [ T140] ? report_bug+0xfa/0x140 [13631.162146] [ T140] ? handle_bug+0x53/0x90 [13631.162149] [ T140] ? exc_invalid_op+0x17/0x70 [13631.162152] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.162157] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.162260] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5 [13631.162263] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] [13631.162392] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] [13631.162536] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] [13631.162657] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] [13631.162749] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.162753] [ T140] ? 
driver_probe_device+0xa0/0xa0 [13631.162756] [ T140] pci_device_probe+0xc0/0x180 [13631.162760] [ T140] really_probe+0xd9/0x340 [13631.162764] [ T140] ? pm_runtime_barrier+0x4f/0x90 [13631.162769] [ T140] __driver_probe_device+0x73/0x110 [13631.162773] [ T140] driver_probe_device+0x1a/0xa0 [13631.162776] [ T140] __device_attach_driver+0x84/0x110 [13631.162780] [ T140] bus_for_each_drv+0x82/0xe0 [13631.162783] [ T140] __device_attach+0xab/0x1b0 [13631.162787] [ T140] pci_bus_add_device+0x53/0x80 [13631.162790] [ T140] pci_bus_add_devices+0x2b/0x70 [13631.162792] [ T140] pci_bus_add_devices+0x56/0x70 [13631.162795] [ T140] pci_bus_add_devices+0x56/0x70 [13631.162797] [ T140] pciehp_configure_device+0xaa/0x160 [13631.162800] [ T140] ? pcie_capability_read_word+0x7a/0x90 [13631.162803] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 [13631.162806] [ T140] pciehp_ist+0x13b/0x180 [13631.162809] [ T140] irq_thread_fn+0x1e/0x60 [13631.162813] [ T140] irq_thread+0x114/0x1e0 [13631.162815] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13631.162818] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13631.162822] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13631.162824] [ T140] kthread+0xea/0x1e0 [13631.162828] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13631.162831] [ T140] ret_from_fork+0x2f/0x50 [13631.162835] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13631.162838] [ T140] ret_from_fork_asm+0x11/0x20 [13631.162843] [ T140] </TASK> [13631.162844] [ T140] ---[ end trace 0000000000000000 ]--- [13631.162857] [ T140] ------------[ cut here ]------------ [13631.162858] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.162948] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.163013] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.163048] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.163051] [ T140] Tainted: [W]=WARN [13631.163053] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.163054] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.163139] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.163141] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.163143] [ T140] RAX: ffff8c336755a3c4 RBX: ffff8c32a5898ba8 RCX: 0000000000000000 [13631.163145] [ T140] RDX: 0000000000000001 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 [13631.163147] [ T140] RBP: ffff8c32a5890258 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.163149] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.163150] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.163152] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.163154] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.163156] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.163158] [ T140] PKRU: 55555554 [13631.163159] [ T140] Call Trace: [13631.163162] [ T140] <TASK> [13631.163163] [ T140] ? __warn.cold+0x90/0x9e [13631.163167] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.163250] [ T140] ? report_bug+0xfa/0x140 [13631.163254] [ T140] ? handle_bug+0x53/0x90 [13631.163257] [ T140] ? exc_invalid_op+0x17/0x70 [13631.163259] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.163263] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.163345] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 [13631.163348] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] [13631.163427] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] [13631.163557] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] [13631.163669] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] [13631.163746] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.163750] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.163753] [ T140] pci_device_probe+0xc0/0x180 [13631.163756] [ T140] really_probe+0xd9/0x340 [13631.163759] [ T140] ? pm_runtime_barrier+0x4f/0x90 [13631.163763] [ T140] __driver_probe_device+0x73/0x110 [13631.163766] [ T140] driver_probe_device+0x1a/0xa0 [13631.163770] [ T140] __device_attach_driver+0x84/0x110 [13631.163773] [ T140] bus_for_each_drv+0x82/0xe0 [13631.163777] [ T140] __device_attach+0xab/0x1b0 [13631.163781] [ T140] pci_bus_add_device+0x53/0x80 [13631.163785] [ T140] pci_bus_add_devices+0x2b/0x70 [13631.163787] [ T140] pci_bus_add_devices+0x56/0x70 [13631.163790] [ T140] pci_bus_add_devices+0x56/0x70 [13631.163792] [ T140] pciehp_configure_device+0xaa/0x160 [13631.163795] [ T140] ? pcie_capability_read_word+0x7a/0x90 [13631.163798] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 [13631.163800] [ T140] pciehp_ist+0x13b/0x180 [13631.163803] [ T140] irq_thread_fn+0x1e/0x60 [13631.163807] [ T140] irq_thread+0x114/0x1e0 [13631.163810] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13631.163813] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13631.163816] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13631.163818] [ T140] kthread+0xea/0x1e0 [13631.163822] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13631.163825] [ T140] ret_from_fork+0x2f/0x50 [13631.163828] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13631.163831] [ T140] ret_from_fork_asm+0x11/0x20 [13631.163835] [ T140] </TASK> [13631.163837] [ T140] ---[ end trace 0000000000000000 ]--- [13631.163847] [ T140] ------------[ cut here ]------------ [13631.163849] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.163937] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.164001] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.164034] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.164037] [ T140] Tainted: [W]=WARN [13631.164039] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.164040] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.164124] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.164126] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.164129] [ T140] RAX: ffff8c336755a3c8 RBX: ffff8c32a5898ec8 RCX: 0000000000000000 [13631.164131] [ T140] RDX: 0000000000000002 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 [13631.164133] [ T140] RBP: ffff8c32a5890260 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.164135] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.164137] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.164139] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.164141] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.164143] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.164144] [ T140] PKRU: 55555554 [13631.164146] [ T140] Call Trace: [13631.164148] [ T140] <TASK> [13631.164149] [ T140] ? __warn.cold+0x90/0x9e [13631.164152] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.164235] [ T140] ? report_bug+0xfa/0x140 [13631.164239] [ T140] ? handle_bug+0x53/0x90 [13631.164242] [ T140] ? exc_invalid_op+0x17/0x70 [13631.164244] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.164248] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.164330] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 [13631.164333] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] [13631.164411] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] [13631.164538] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] [13631.164650] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] [13631.164758] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.164762] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.164765] [ T140] pci_device_probe+0xc0/0x180 [13631.164768] [ T140] really_probe+0xd9/0x340 [13631.164771] [ T140] ? pm_runtime_barrier+0x4f/0x90 [13631.164774] [ T140] __driver_probe_device+0x73/0x110 [13631.164778] [ T140] driver_probe_device+0x1a/0xa0 [13631.164781] [ T140] __device_attach_driver+0x84/0x110 [13631.164784] [ T140] bus_for_each_drv+0x82/0xe0 [13631.164788] [ T140] __device_attach+0xab/0x1b0 [13631.164791] [ T140] pci_bus_add_device+0x53/0x80 [13631.164794] [ T140] pci_bus_add_devices+0x2b/0x70 [13631.164797] [ T140] pci_bus_add_devices+0x56/0x70 [13631.164799] [ T140] pci_bus_add_devices+0x56/0x70 [13631.164801] [ T140] pciehp_configure_device+0xaa/0x160 [13631.164809] [ T140] ? pcie_capability_read_word+0x7a/0x90 [13631.164811] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 [13631.164814] [ T140] pciehp_ist+0x13b/0x180 [13631.164817] [ T140] irq_thread_fn+0x1e/0x60 [13631.164821] [ T140] irq_thread+0x114/0x1e0 [13631.164823] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13631.164826] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13631.164829] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13631.164832] [ T140] kthread+0xea/0x1e0 [13631.164836] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13631.164839] [ T140] ret_from_fork+0x2f/0x50 [13631.164843] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13631.164846] [ T140] ret_from_fork_asm+0x11/0x20 [13631.164851] [ T140] </TASK> [13631.164852] [ T140] ---[ end trace 0000000000000000 ]--- [13631.164861] [ T140] ------------[ cut here ]------------ [13631.164863] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.164951] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.165014] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.165048] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.165050] [ T140] Tainted: [W]=WARN [13631.165052] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.165054] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.165151] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.165153] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.165155] [ T140] RAX: ffff8c336755a3cc RBX: ffff8c32a58991e0 RCX: 0000000000000000 [13631.165157] [ T140] RDX: 0000000000000003 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 [13631.165158] [ T140] RBP: ffff8c32a5890268 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.165160] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.165162] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.165164] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.165166] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.165167] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.165169] [ T140] PKRU: 55555554 [13631.165171] [ T140] Call Trace: [13631.165172] [ T140] <TASK> [13631.165174] [ T140] ? __warn.cold+0x90/0x9e [13631.165177] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.165259] [ T140] ? report_bug+0xfa/0x140 [13631.165263] [ T140] ? handle_bug+0x53/0x90 [13631.165267] [ T140] ? exc_invalid_op+0x17/0x70 [13631.165269] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.165272] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.165355] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 [13631.165358] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] [13631.165436] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] [13631.165560] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] [13631.165671] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] [13631.165748] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.165752] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.165755] [ T140] pci_device_probe+0xc0/0x180 [13631.165758] [ T140] really_probe+0xd9/0x340 [13631.165761] [ T140] ? pm_runtime_barrier+0x4f/0x90 [13631.165764] [ T140] __driver_probe_device+0x73/0x110 [13631.165768] [ T140] driver_probe_device+0x1a/0xa0 [13631.165771] [ T140] __device_attach_driver+0x84/0x110 [13631.165774] [ T140] bus_for_each_drv+0x82/0xe0 [13631.165778] [ T140] __device_attach+0xab/0x1b0 [13631.165781] [ T140] pci_bus_add_device+0x53/0x80 [13631.165784] [ T140] pci_bus_add_devices+0x2b/0x70 [13631.165786] [ T140] pci_bus_add_devices+0x56/0x70 [13631.165789] [ T140] pci_bus_add_devices+0x56/0x70 [13631.165791] [ T140] pciehp_configure_device+0xaa/0x160 [13631.165794] [ T140] ? pcie_capability_read_word+0x7a/0x90 [13631.165796] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 [13631.165799] [ T140] pciehp_ist+0x13b/0x180 [13631.165802] [ T140] irq_thread_fn+0x1e/0x60 [13631.165805] [ T140] irq_thread+0x114/0x1e0 [13631.165808] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13631.165810] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13631.165814] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13631.165816] [ T140] kthread+0xea/0x1e0 [13631.165819] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13631.165823] [ T140] ret_from_fork+0x2f/0x50 [13631.165826] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13631.165829] [ T140] ret_from_fork_asm+0x11/0x20 [13631.165833] [ T140] </TASK> [13631.165835] [ T140] ---[ end trace 0000000000000000 ]--- [13631.165843] [ T140] ------------[ cut here ]------------ [13631.165845] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.165933] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.165996] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.166030] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.166032] [ T140] Tainted: [W]=WARN [13631.166034] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.166035] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.166127] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.166129] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.166132] [ T140] RAX: ffff8c336755a3d0 RBX: ffff8c32a58994f8 RCX: 0000000000000000 [13631.166133] [ T140] RDX: 0000000000000004 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 [13631.166135] [ T140] RBP: ffff8c32a5890270 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.166137] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.166139] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.166140] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.166142] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.166144] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.166146] [ T140] PKRU: 55555554 [13631.166147] [ T140] Call Trace: [13631.166149] [ T140] <TASK> [13631.166151] [ T140] ? __warn.cold+0x90/0x9e [13631.166154] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.166236] [ T140] ? report_bug+0xfa/0x140 [13631.166240] [ T140] ? handle_bug+0x53/0x90 [13631.166243] [ T140] ? exc_invalid_op+0x17/0x70 [13631.166245] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.166249] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.166331] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 [13631.166334] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] [13631.166413] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] [13631.166543] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] [13631.166654] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] [13631.166731] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.166735] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.166738] [ T140] pci_device_probe+0xc0/0x180 [13631.166741] [ T140] really_probe+0xd9/0x340 [13631.166744] [ T140] ? pm_runtime_barrier+0x4f/0x90 [13631.166747] [ T140] __driver_probe_device+0x73/0x110 [13631.166750] [ T140] driver_probe_device+0x1a/0xa0 [13631.166754] [ T140] __device_attach_driver+0x84/0x110 [13631.166757] [ T140] bus_for_each_drv+0x82/0xe0 [13631.166760] [ T140] __device_attach+0xab/0x1b0 [13631.166764] [ T140] pci_bus_add_device+0x53/0x80 [13631.166767] [ T140] pci_bus_add_devices+0x2b/0x70 [13631.166769] [ T140] pci_bus_add_devices+0x56/0x70 [13631.166772] [ T140] pci_bus_add_devices+0x56/0x70 [13631.166774] [ T140] pciehp_configure_device+0xaa/0x160 [13631.166776] [ T140] ? pcie_capability_read_word+0x7a/0x90 [13631.166779] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 [13631.166782] [ T140] pciehp_ist+0x13b/0x180 [13631.166784] [ T140] irq_thread_fn+0x1e/0x60 [13631.166788] [ T140] irq_thread+0x114/0x1e0 [13631.166790] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13631.166793] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13631.166796] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13631.166799] [ T140] kthread+0xea/0x1e0 [13631.166802] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13631.166805] [ T140] ret_from_fork+0x2f/0x50 [13631.166808] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13631.166811] [ T140] ret_from_fork_asm+0x11/0x20 [13631.166815] [ T140] </TASK> [13631.166817] [ T140] ---[ end trace 0000000000000000 ]--- [13631.166825] [ T140] ------------[ cut here ]------------ [13631.166826] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.166915] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.166978] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.167010] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.167013] [ T140] Tainted: [W]=WARN [13631.167014] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.167016] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.167100] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.167102] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.167104] [ T140] RAX: ffff8c336755a3d4 RBX: ffff8c32a5899810 RCX: 0000000000000000 [13631.167106] [ T140] RDX: 0000000000000005 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 [13631.167108] [ T140] RBP: ffff8c32a5890278 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.167109] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.167111] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.167113] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.167115] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.167116] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.167118] [ T140] PKRU: 55555554 [13631.167120] [ T140] Call Trace: [13631.167121] [ T140] <TASK> [13631.167123] [ T140] ? __warn.cold+0x90/0x9e [13631.167126] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.167209] [ T140] ? report_bug+0xfa/0x140 [13631.167212] [ T140] ? handle_bug+0x53/0x90 [13631.167215] [ T140] ? exc_invalid_op+0x17/0x70 [13631.167217] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.167221] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.167309] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 [13631.167312] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] [13631.167390] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] [13631.167515] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] [13631.167627] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] [13631.167704] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.167708] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.167711] [ T140] pci_device_probe+0xc0/0x180 [13631.167714] [ T140] really_probe+0xd9/0x340 [13631.167717] [ T140] ? pm_runtime_barrier+0x4f/0x90 [13631.167720] [ T140] __driver_probe_device+0x73/0x110 [13631.167723] [ T140] driver_probe_device+0x1a/0xa0 [13631.167726] [ T140] __device_attach_driver+0x84/0x110 [13631.167730] [ T140] bus_for_each_drv+0x82/0xe0 [13631.167733] [ T140] __device_attach+0xab/0x1b0 [13631.167737] [ T140] pci_bus_add_device+0x53/0x80 [13631.167739] [ T140] pci_bus_add_devices+0x2b/0x70 [13631.167742] [ T140] pci_bus_add_devices+0x56/0x70 [13631.167744] [ T140] pci_bus_add_devices+0x56/0x70 [13631.167747] [ T140] pciehp_configure_device+0xaa/0x160 [13631.167749] [ T140] ? pcie_capability_read_word+0x7a/0x90 [13631.167752] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 [13631.167754] [ T140] pciehp_ist+0x13b/0x180 [13631.167757] [ T140] irq_thread_fn+0x1e/0x60 [13631.167760] [ T140] irq_thread+0x114/0x1e0 [13631.167763] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13631.167766] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13631.167769] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13631.167772] [ T140] kthread+0xea/0x1e0 [13631.167775] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13631.167778] [ T140] ret_from_fork+0x2f/0x50 [13631.167781] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13631.167784] [ T140] ret_from_fork_asm+0x11/0x20 [13631.167788] [ T140] </TASK> [13631.167789] [ T140] ---[ end trace 0000000000000000 ]--- [13631.167798] [ T140] ------------[ cut here ]------------ [13631.167799] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.167887] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.167955] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.167988] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.167991] [ T140] Tainted: [W]=WARN [13631.167992] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.167994] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.168077] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.168079] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.168082] [ T140] RAX: ffff8c336755a3c8 RBX: ffff8c32a5899b28 RCX: 0000000000000000 [13631.168083] [ T140] RDX: 0000000000000002 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 [13631.168085] [ T140] RBP: ffff8c32a5890280 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.168087] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.168089] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.168090] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.168092] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.168094] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.168096] [ T140] PKRU: 55555554 [13631.168097] [ T140] Call Trace: [13631.168099] [ T140] <TASK> [13631.168101] [ T140] ? __warn.cold+0x90/0x9e [13631.168104] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.168187] [ T140] ? report_bug+0xfa/0x140 [13631.168190] [ T140] ? handle_bug+0x53/0x90 [13631.168193] [ T140] ? exc_invalid_op+0x17/0x70 [13631.168195] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.168199] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.168281] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 [13631.168284] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] [13631.168362] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] [13631.168491] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] [13631.168601] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] [13631.168678] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.168682] [ T140] ? driver_probe_device+0xa0/0xa0 [13631.168685] [ T140] pci_device_probe+0xc0/0x180 [13631.168688] [ T140] really_probe+0xd9/0x340 [13631.168691] [ T140] ? pm_runtime_barrier+0x4f/0x90 [13631.168694] [ T140] __driver_probe_device+0x73/0x110 [13631.168698] [ T140] driver_probe_device+0x1a/0xa0 [13631.168701] [ T140] __device_attach_driver+0x84/0x110 [13631.168704] [ T140] bus_for_each_drv+0x82/0xe0 [13631.168708] [ T140] __device_attach+0xab/0x1b0 [13631.168711] [ T140] pci_bus_add_device+0x53/0x80 [13631.168714] [ T140] pci_bus_add_devices+0x2b/0x70 [13631.168716] [ T140] pci_bus_add_devices+0x56/0x70 [13631.168719] [ T140] pci_bus_add_devices+0x56/0x70 [13631.168721] [ T140] pciehp_configure_device+0xaa/0x160 [13631.168724] [ T140] ? pcie_capability_read_word+0x7a/0x90 [13631.168726] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 [13631.168729] [ T140] pciehp_ist+0x13b/0x180 [13631.168732] [ T140] irq_thread_fn+0x1e/0x60 [13631.168735] [ T140] irq_thread+0x114/0x1e0 [13631.168738] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13631.168741] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13631.168744] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13631.168747] [ T140] kthread+0xea/0x1e0 [13631.168755] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13631.168758] [ T140] ret_from_fork+0x2f/0x50 [13631.168762] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13631.168765] [ T140] ret_from_fork_asm+0x11/0x20 [13631.168769] [ T140] </TASK> [13631.168771] [ T140] ---[ end trace 0000000000000000 ]--- [13631.168783] [ T140] ------------[ cut here ]------------ [13631.168784] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.168878] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.168941] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.168974] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43
[13631.168977] [ T140] Tainted: [W]=WARN
[13631.168978] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024
[13631.168980] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.169064] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5
[13631.169066] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246
[13631.169069] [ T140] RAX: ffff8c336755a3cc RBX: ffff8c32a5899e40 RCX: 0000000000000000
[13631.169071] [ T140] RDX: 0000000000000003 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000
[13631.169072] [ T140] RBP: ffff8c32a5890288 R08: 0000000000000002 R09: ffff8c34ba798f40
[13631.169074] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630
[13631.169076] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14
[13631.169078] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000
[13631.169080] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[13631.169081] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0
[13631.169083] [ T140] PKRU: 55555554
[13631.169085] [ T140] Call Trace:
[13631.169086] [ T140] <TASK>
[13631.169094] [ T140] ? __warn.cold+0x90/0x9e
[13631.169097] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.169180] [ T140] ? report_bug+0xfa/0x140
[13631.169183] [ T140] ? handle_bug+0x53/0x90
[13631.169186] [ T140] ? exc_invalid_op+0x17/0x70
[13631.169188] [ T140] ? asm_exc_invalid_op+0x1a/0x20
[13631.169192] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.169275] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5
[13631.169278] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.169357] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.169490] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.169601] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.169678] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.169682] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.169685] [ T140] pci_device_probe+0xc0/0x180
[13631.169688] [ T140] really_probe+0xd9/0x340
[13631.169691] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.169694] [ T140] __driver_probe_device+0x73/0x110
[13631.169697] [ T140] driver_probe_device+0x1a/0xa0
[13631.169701] [ T140] __device_attach_driver+0x84/0x110
[13631.169704] [ T140] bus_for_each_drv+0x82/0xe0
[13631.169707] [ T140] __device_attach+0xab/0x1b0
[13631.169711] [ T140] pci_bus_add_device+0x53/0x80
[13631.169713] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.169716] [ T140] pci_bus_add_devices+0x56/0x70
[13631.169718] [ T140] pci_bus_add_devices+0x56/0x70
[13631.169721] [ T140] pciehp_configure_device+0xaa/0x160
[13631.169723] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.169726] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.169729] [ T140] pciehp_ist+0x13b/0x180
[13631.169731] [ T140] irq_thread_fn+0x1e/0x60
[13631.169734] [ T140] irq_thread+0x114/0x1e0
[13631.169737] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.169740] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.169743] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.169746] [ T140] kthread+0xea/0x1e0
[13631.169749] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.169752] [ T140] ret_from_fork+0x2f/0x50
[13631.169755] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.169758] [ T140] ret_from_fork_asm+0x11/0x20
[13631.169762] [ T140] </TASK>
[13631.169764] [ T140] ---[ end trace 0000000000000000 ]---
[13631.169773] [ T140] ------------[ cut here ]------------
[13631.169774] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.169863] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore
[13631.169925] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core
[13631.169958] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W
6.14.0-rc1-mystery-00004-g822c11592522 #43
[13631.169961] [ T140] Tainted: [W]=WARN
[13631.169962] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024
[13631.169964] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.170048] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5
[13631.170050] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246
[13631.170053] [ T140] RAX: ffff8c336755a3d0 RBX: ffff8c32a589a158 RCX: 0000000000000000
[13631.170054] [ T140] RDX: 0000000000000004 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000
[13631.170056] [ T140] RBP: ffff8c32a5890290 R08: 0000000000000002 R09: ffff8c34ba798f40
[13631.170058] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630
[13631.170059] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14
[13631.170061] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000
[13631.170063] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[13631.170065] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0
[13631.170067] [ T140] PKRU: 55555554
[13631.170068] [ T140] Call Trace:
[13631.170070] [ T140] <TASK>
[13631.170072] [ T140] ? __warn.cold+0x90/0x9e
[13631.170074] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.170159] [ T140] ? report_bug+0xfa/0x140
[13631.170162] [ T140] ? handle_bug+0x53/0x90
[13631.170165] [ T140] ? exc_invalid_op+0x17/0x70
[13631.170167] [ T140] ? asm_exc_invalid_op+0x1a/0x20
[13631.170171] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.170259] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5
[13631.170262] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.170345] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.170464] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.170582] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.170664] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.170668] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.170671] [ T140] pci_device_probe+0xc0/0x180
[13631.170674] [ T140] really_probe+0xd9/0x340
[13631.170677] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.170680] [ T140] __driver_probe_device+0x73/0x110
[13631.170683] [ T140] driver_probe_device+0x1a/0xa0
[13631.170686] [ T140] __device_attach_driver+0x84/0x110
[13631.170690] [ T140] bus_for_each_drv+0x82/0xe0
[13631.170693] [ T140] __device_attach+0xab/0x1b0
[13631.170697] [ T140] pci_bus_add_device+0x53/0x80
[13631.170699] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.170702] [ T140] pci_bus_add_devices+0x56/0x70
[13631.170704] [ T140] pci_bus_add_devices+0x56/0x70
[13631.170707] [ T140] pciehp_configure_device+0xaa/0x160
[13631.170709] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.170712] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.170714] [ T140] pciehp_ist+0x13b/0x180
[13631.170717] [ T140] irq_thread_fn+0x1e/0x60
[13631.170720] [ T140] irq_thread+0x114/0x1e0
[13631.170723] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.170726] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.170729] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.170731] [ T140] kthread+0xea/0x1e0
[13631.170735] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.170738] [ T140] ret_from_fork+0x2f/0x50
[13631.170740] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.170743] [ T140] ret_from_fork_asm+0x11/0x20
[13631.170748] [ T140] </TASK>
[13631.170749] [ T140] ---[ end trace 0000000000000000 ]---
[13631.170757] [ T140] ------------[ cut here ]------------
[13631.170758] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.170852] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore
[13631.170914] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core
[13631.170946] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W
6.14.0-rc1-mystery-00004-g822c11592522 #43
[13631.170949] [ T140] Tainted: [W]=WARN
[13631.170950] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024
[13631.170952] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.171041] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5
[13631.171043] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246
[13631.171045] [ T140] RAX: ffff8c336755a3d4 RBX: ffff8c32a589a470 RCX: 0000000000000000
[13631.171047] [ T140] RDX: 0000000000000005 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000
[13631.171049] [ T140] RBP: ffff8c32a5890298 R08: 0000000000000002 R09: ffff8c34ba798f40
[13631.171051] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630
[13631.171052] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14
[13631.171054] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000
[13631.171056] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[13631.171058] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0
[13631.171059] [ T140] PKRU: 55555554
[13631.171061] [ T140] Call Trace:
[13631.171063] [ T140] <TASK>
[13631.171064] [ T140] ? __warn.cold+0x90/0x9e
[13631.171068] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.171155] [ T140] ? report_bug+0xfa/0x140
[13631.171159] [ T140] ? handle_bug+0x53/0x90
[13631.171162] [ T140] ? exc_invalid_op+0x17/0x70
[13631.171164] [ T140] ? asm_exc_invalid_op+0x1a/0x20
[13631.171168] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.171255] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5
[13631.171258] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.171338] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.171454] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.171574] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.171674] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.171677] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.171680] [ T140] pci_device_probe+0xc0/0x180
[13631.171683] [ T140] really_probe+0xd9/0x340
[13631.171686] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.171690] [ T140] __driver_probe_device+0x73/0x110
[13631.171693] [ T140] driver_probe_device+0x1a/0xa0
[13631.171696] [ T140] __device_attach_driver+0x84/0x110
[13631.171699] [ T140] bus_for_each_drv+0x82/0xe0
[13631.171703] [ T140] __device_attach+0xab/0x1b0
[13631.171706] [ T140] pci_bus_add_device+0x53/0x80
[13631.171709] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.171711] [ T140] pci_bus_add_devices+0x56/0x70
[13631.171714] [ T140] pci_bus_add_devices+0x56/0x70
[13631.171716] [ T140] pciehp_configure_device+0xaa/0x160
[13631.171718] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.171721] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.171724] [ T140] pciehp_ist+0x13b/0x180
[13631.171726] [ T140] irq_thread_fn+0x1e/0x60
[13631.171729] [ T140] irq_thread+0x114/0x1e0
[13631.171732] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.171735] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.171738] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.171741] [ T140] kthread+0xea/0x1e0
[13631.171744] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.171747] [ T140] ret_from_fork+0x2f/0x50
[13631.171750] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.171753] [ T140] ret_from_fork_asm+0x11/0x20
[13631.171757] [ T140] </TASK>
[13631.171758] [ T140] ---[ end trace 0000000000000000 ]---
[13631.171767] [ T140] ------------[ cut here ]------------
[13631.171768] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.171856] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore
[13631.171918] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core
[13631.171951] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W
6.14.0-rc1-mystery-00004-g822c11592522 #43
[13631.171953] [ T140] Tainted: [W]=WARN
[13631.171955] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024
[13631.171956] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.172039] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5
[13631.172041] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246
[13631.172044] [ T140] RAX: ffff8c3203e26ba8 RBX: ffff8c32a5896d20 RCX: 0000000000000000
[13631.172045] [ T140] RDX: 0000000000000000 RSI: ffff8c32a5897038 RDI: ffff8c32a5880000
[13631.172047] [ T140] RBP: ffff8c32a58902a0 R08: 0000000000000002 R09: ffff8c34ba798f40
[13631.172049] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630
[13631.172051] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14
[13631.172052] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000
[13631.172054] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[13631.172056] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0
[13631.172058] [ T140] PKRU: 55555554
[13631.172059] [ T140] Call Trace:
[13631.172061] [ T140] <TASK>
[13631.172063] [ T140] ? __warn.cold+0x90/0x9e
[13631.172066] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.172148] [ T140] ? report_bug+0xfa/0x140
[13631.172151] [ T140] ? handle_bug+0x53/0x90
[13631.172154] [ T140] ? exc_invalid_op+0x17/0x70
[13631.172156] [ T140] ? asm_exc_invalid_op+0x1a/0x20
[13631.172160] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.172242] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5
[13631.172245] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.172323] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.172440] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.172558] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.172635] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.172638] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.172641] [ T140] pci_device_probe+0xc0/0x180
[13631.172644] [ T140] really_probe+0xd9/0x340
[13631.172647] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.172650] [ T140] __driver_probe_device+0x73/0x110
[13631.172654] [ T140] driver_probe_device+0x1a/0xa0
[13631.172657] [ T140] __device_attach_driver+0x84/0x110
[13631.172660] [ T140] bus_for_each_drv+0x82/0xe0
[13631.172664] [ T140] __device_attach+0xab/0x1b0
[13631.172667] [ T140] pci_bus_add_device+0x53/0x80
[13631.172670] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.172672] [ T140] pci_bus_add_devices+0x56/0x70
[13631.172675] [ T140] pci_bus_add_devices+0x56/0x70
[13631.172677] [ T140] pciehp_configure_device+0xaa/0x160
[13631.172679] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.172682] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.172685] [ T140] pciehp_ist+0x13b/0x180
[13631.172688] [ T140] irq_thread_fn+0x1e/0x60
[13631.172691] [ T140] irq_thread+0x114/0x1e0
[13631.172693] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.172696] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.172699] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.172702] [ T140] kthread+0xea/0x1e0
[13631.172705] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.172708] [ T140] ret_from_fork+0x2f/0x50
[13631.172711] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.172714] [ T140] ret_from_fork_asm+0x11/0x20
[13631.172718] [ T140] </TASK>
[13631.172720] [ T140] ---[ end trace 0000000000000000 ]---
[13631.172728] [ T140] ------------[ cut here ]------------
[13631.172730] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.172818] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore
[13631.172880] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core
[13631.172913] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W
6.14.0-rc1-mystery-00004-g822c11592522 #43
[13631.172915] [ T140] Tainted: [W]=WARN
[13631.172917] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024
[13631.172919] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.173002] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5
[13631.173004] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246
[13631.173006] [ T140] RAX: ffff8c320135da78 RBX: ffff8c32a58a6460 RCX: 0000000000000000
[13631.173008] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58aca50 RDI: ffff8c32a5880000
[13631.173010] [ T140] RBP: ffff8c32a58902a8 R08: 0000000000000002 R09: ffff8c34ba798f40
[13631.173011] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630
[13631.173013] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14
[13631.173015] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000
[13631.173017] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[13631.173019] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0
[13631.173020] [ T140] PKRU: 55555554
[13631.173022] [ T140] Call Trace:
[13631.173024] [ T140] <TASK>
[13631.173025] [ T140] ? __warn.cold+0x90/0x9e
[13631.173028] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.173111] [ T140] ? report_bug+0xfa/0x140
[13631.173114] [ T140] ? handle_bug+0x53/0x90
[13631.173117] [ T140] ? exc_invalid_op+0x17/0x70
[13631.173119] [ T140] ? asm_exc_invalid_op+0x1a/0x20
[13631.173123] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.173205] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5
[13631.173208] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.173286] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.173404] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.173524] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.173600] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.173604] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.173607] [ T140] pci_device_probe+0xc0/0x180
[13631.173610] [ T140] really_probe+0xd9/0x340
[13631.173613] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.173616] [ T140] __driver_probe_device+0x73/0x110
[13631.173619] [ T140] driver_probe_device+0x1a/0xa0
[13631.173623] [ T140] __device_attach_driver+0x84/0x110
[13631.173626] [ T140] bus_for_each_drv+0x82/0xe0
[13631.173629] [ T140] __device_attach+0xab/0x1b0
[13631.173633] [ T140] pci_bus_add_device+0x53/0x80
[13631.173635] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.173638] [ T140] pci_bus_add_devices+0x56/0x70
[13631.173640] [ T140] pci_bus_add_devices+0x56/0x70
[13631.173643] [ T140] pciehp_configure_device+0xaa/0x160
[13631.173645] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.173648] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.173650] [ T140] pciehp_ist+0x13b/0x180
[13631.173653] [ T140] irq_thread_fn+0x1e/0x60
[13631.173656] [ T140] irq_thread+0x114/0x1e0
[13631.173659] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.173662] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.173665] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.173668] [ T140] kthread+0xea/0x1e0
[13631.173671] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.173674] [ T140] ret_from_fork+0x2f/0x50
[13631.173677] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.173680] [ T140] ret_from_fork_asm+0x11/0x20
[13631.173684] [ T140] </TASK>
[13631.173685] [ T140] ---[ end trace 0000000000000000 ]---
[13631.173693] [ T140] ------------[ cut here ]------------
[13631.173695] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.173782] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore
[13631.173844] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core
[13631.173877] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W
6.14.0-rc1-mystery-00004-g822c11592522 #43
[13631.173879] [ T140] Tainted: [W]=WARN
[13631.173881] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024
[13631.173882] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.173965] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5
[13631.173967] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246
[13631.173970] [ T140] RAX: ffff8c320135da7c RBX: ffff8c32a58a6ac0 RCX: 0000000000000000
[13631.173971] [ T140] RDX: 0000000000000001 RSI: ffff8c32a58aca50 RDI: ffff8c32a5880000
[13631.173973] [ T140] RBP: ffff8c32a58902b0 R08: 0000000000000002 R09: ffff8c34ba798f40
[13631.173975] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630
[13631.173976] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14
[13631.173978] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000
[13631.173980] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[13631.173982] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0
[13631.173984] [ T140] PKRU: 55555554
[13631.173985] [ T140] Call Trace:
[13631.173987] [ T140] <TASK>
[13631.173989] [ T140] ? __warn.cold+0x90/0x9e
[13631.173992] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.174074] [ T140] ? report_bug+0xfa/0x140
[13631.174077] [ T140] ? handle_bug+0x53/0x90
[13631.174080] [ T140] ? exc_invalid_op+0x17/0x70
[13631.174082] [ T140] ? asm_exc_invalid_op+0x1a/0x20
[13631.174086] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.174168] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5
[13631.174171] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.174249] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.174366] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.174485] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.174562] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.174566] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.174569] [ T140] pci_device_probe+0xc0/0x180
[13631.174572] [ T140] really_probe+0xd9/0x340
[13631.174575] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.174578] [ T140] __driver_probe_device+0x73/0x110
[13631.174582] [ T140] driver_probe_device+0x1a/0xa0
[13631.174585] [ T140] __device_attach_driver+0x84/0x110
[13631.174588] [ T140] bus_for_each_drv+0x82/0xe0
[13631.174592] [ T140] __device_attach+0xab/0x1b0
[13631.174595] [ T140] pci_bus_add_device+0x53/0x80
[13631.174598] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.174600] [ T140] pci_bus_add_devices+0x56/0x70
[13631.174603] [ T140] pci_bus_add_devices+0x56/0x70
[13631.174605] [ T140] pciehp_configure_device+0xaa/0x160
[13631.174608] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.174610] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.174613] [ T140] pciehp_ist+0x13b/0x180
[13631.174616] [ T140] irq_thread_fn+0x1e/0x60
[13631.174619] [ T140] irq_thread+0x114/0x1e0
[13631.174622] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.174624] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.174628] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.174630] [ T140] kthread+0xea/0x1e0
[13631.174633] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.174637] [ T140] ret_from_fork+0x2f/0x50
[13631.174639] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.174642] [ T140] ret_from_fork_asm+0x11/0x20
[13631.174647] [ T140] </TASK>
[13631.174648] [ T140] ---[ end trace 0000000000000000 ]---
[13631.174656] [ T140] ------------[ cut here ]------------
[13631.174658] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.174746] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore
[13631.174809] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core
[13631.174842] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W
6.14.0-rc1-mystery-00004-g822c11592522 #43
[13631.174844] [ T140] Tainted: [W]=WARN
[13631.174846] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024
[13631.174847] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.174932] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5
[13631.174934] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246
[13631.174936] [ T140] RAX: ffff8c320beb3600 RBX: ffff8c32a58aee20 RCX: 0000000000000000
[13631.174938] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58afa88 RDI: ffff8c32a5880000
[13631.174939] [ T140] RBP: ffff8c32a58902b8 R08: 0000000000000002 R09: ffff8c34ba798f40
[13631.174941] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630
[13631.174943] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14
[13631.174945] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000
[13631.174947] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[13631.174948] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0
[13631.174950] [ T140] PKRU: 55555554
[13631.174952] [ T140] Call Trace:
[13631.174953] [ T140] <TASK>
[13631.174955] [ T140] ? __warn.cold+0x90/0x9e
[13631.174958] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.175041] [ T140] ? report_bug+0xfa/0x140
[13631.175045] [ T140] ? handle_bug+0x53/0x90
[13631.175048] [ T140] ? exc_invalid_op+0x17/0x70
[13631.175050] [ T140] ? asm_exc_invalid_op+0x1a/0x20
[13631.175054] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.175136] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5
[13631.175139] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.175218] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.175335] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.175446] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.175529] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.175533] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.175536] [ T140] pci_device_probe+0xc0/0x180
[13631.175539] [ T140] really_probe+0xd9/0x340
[13631.175542] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.175545] [ T140] __driver_probe_device+0x73/0x110
[13631.175548] [ T140] driver_probe_device+0x1a/0xa0
[13631.175552] [ T140] __device_attach_driver+0x84/0x110
[13631.175555] [ T140] bus_for_each_drv+0x82/0xe0
[13631.175558] [ T140] __device_attach+0xab/0x1b0
[13631.175562] [ T140] pci_bus_add_device+0x53/0x80
[13631.175564] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.175567] [ T140] pci_bus_add_devices+0x56/0x70
[13631.175569] [ T140] pci_bus_add_devices+0x56/0x70
[13631.175572] [ T140] pciehp_configure_device+0xaa/0x160
[13631.175574] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.175577] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.175580] [ T140] pciehp_ist+0x13b/0x180
[13631.175582] [ T140] irq_thread_fn+0x1e/0x60
[13631.175585] [ T140] irq_thread+0x114/0x1e0
[13631.175588] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.175591] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.175594] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.175597] [ T140] kthread+0xea/0x1e0
[13631.175600] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.175603] [ T140] ret_from_fork+0x2f/0x50
[13631.175606] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.175609] [ T140] ret_from_fork_asm+0x11/0x20
[13631.175613] [ T140] </TASK>
[13631.175615] [ T140] ---[ end trace 0000000000000000 ]---
[13631.175622] [ T140] ------------[ cut here ]------------
[13631.175624] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.175712] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore
[13631.175774] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core
[13631.175807] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.175809] [ T140] Tainted: [W]=WARN [13631.175811] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.175812] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.175896] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.175898] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.175900] [ T140] RAX: ffff8c320beb3600 RBX: ffff8c32a58af138 RCX: 0000000000000000 [13631.175902] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58afa88 RDI: ffff8c32a5880000 [13631.175904] [ T140] RBP: ffff8c32a58902c0 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.175906] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.175907] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.175909] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.175911] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.175913] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.175915] [ T140] PKRU: 55555554 [13631.175916] [ T140] Call Trace: [13631.175918] [ T140] <TASK> [13631.175920] [ T140] ? __warn.cold+0x90/0x9e [13631.175922] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.176005] [ T140] ? report_bug+0xfa/0x140 [13631.176009] [ T140] ? handle_bug+0x53/0x90 [13631.176012] [ T140] ? exc_invalid_op+0x17/0x70 [13631.176014] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.176018] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.176113] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5
[13631.176116] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.176195] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.176312] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.176424] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.176509] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.176513] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.176516] [ T140] pci_device_probe+0xc0/0x180
[13631.176519] [ T140] really_probe+0xd9/0x340
[13631.176522] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.176525] [ T140] __driver_probe_device+0x73/0x110
[13631.176528] [ T140] driver_probe_device+0x1a/0xa0
[13631.176532] [ T140] __device_attach_driver+0x84/0x110
[13631.176535] [ T140] bus_for_each_drv+0x82/0xe0
[13631.176538] [ T140] __device_attach+0xab/0x1b0
[13631.176542] [ T140] pci_bus_add_device+0x53/0x80
[13631.176544] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.176547] [ T140] pci_bus_add_devices+0x56/0x70
[13631.176549] [ T140] pci_bus_add_devices+0x56/0x70
[13631.176552] [ T140] pciehp_configure_device+0xaa/0x160
[13631.176554] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.176557] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.176559] [ T140] pciehp_ist+0x13b/0x180
[13631.176562] [ T140] irq_thread_fn+0x1e/0x60
[13631.176565] [ T140] irq_thread+0x114/0x1e0
[13631.176568] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.176571] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.176574] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.176577] [ T140] kthread+0xea/0x1e0
[13631.176580] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.176583] [ T140] ret_from_fork+0x2f/0x50
[13631.176586] [ T140] ?
kthreads_online_cpu+0xf0/0xf0 [13631.176588] [ T140] ret_from_fork_asm+0x11/0x20 [13631.176593] [ T140] </TASK> [13631.176594] [ T140] ---[ end trace 0000000000000000 ]--- [13631.176602] [ T140] ------------[ cut here ]------------ [13631.176603] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.176691] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.176754] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.176786] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.176789] [ T140] Tainted: [W]=WARN [13631.176790] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.176792] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.176875] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.176877] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.176880] [ T140] RAX: ffff8c320beb3600 RBX: ffff8c32a58af450 RCX: 0000000000000000 [13631.176881] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58afa88 RDI: ffff8c32a5880000 [13631.176883] [ T140] RBP: ffff8c32a58902c8 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.176885] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.176886] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.176888] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.176890] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.176892] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.176894] [ T140] PKRU: 55555554 [13631.176895] [ T140] Call Trace: [13631.176897] [ T140] <TASK> [13631.176899] [ T140] ? __warn.cold+0x90/0x9e [13631.176902] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.176984] [ T140] ? report_bug+0xfa/0x140 [13631.176988] [ T140] ? handle_bug+0x53/0x90 [13631.176991] [ T140] ? exc_invalid_op+0x17/0x70 [13631.176993] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.176996] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.177078] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5
[13631.177081] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.177160] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.177278] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.177388] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.177465] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.177479] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.177482] [ T140] pci_device_probe+0xc0/0x180
[13631.177485] [ T140] really_probe+0xd9/0x340
[13631.177488] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.177492] [ T140] __driver_probe_device+0x73/0x110
[13631.177495] [ T140] driver_probe_device+0x1a/0xa0
[13631.177498] [ T140] __device_attach_driver+0x84/0x110
[13631.177502] [ T140] bus_for_each_drv+0x82/0xe0
[13631.177505] [ T140] __device_attach+0xab/0x1b0
[13631.177508] [ T140] pci_bus_add_device+0x53/0x80
[13631.177511] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.177514] [ T140] pci_bus_add_devices+0x56/0x70
[13631.177516] [ T140] pci_bus_add_devices+0x56/0x70
[13631.177518] [ T140] pciehp_configure_device+0xaa/0x160
[13631.177521] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.177523] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.177526] [ T140] pciehp_ist+0x13b/0x180
[13631.177529] [ T140] irq_thread_fn+0x1e/0x60
[13631.177532] [ T140] irq_thread+0x114/0x1e0
[13631.177534] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.177537] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.177541] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.177543] [ T140] kthread+0xea/0x1e0
[13631.177546] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.177549] [ T140] ret_from_fork+0x2f/0x50
[13631.177552] [ T140] ?
kthreads_online_cpu+0xf0/0xf0 [13631.177555] [ T140] ret_from_fork_asm+0x11/0x20 [13631.177560] [ T140] </TASK> [13631.177561] [ T140] ---[ end trace 0000000000000000 ]--- [13631.177569] [ T140] ------------[ cut here ]------------ [13631.177571] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.177659] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.177722] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.177754] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.177757] [ T140] Tainted: [W]=WARN [13631.177758] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.177760] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.177844] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.177846] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 [13631.177848] [ T140] RAX: ffff8c320135dd00 RBX: ffff8c32a58b23d0 RCX: 0000000000000000 [13631.177850] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58b42c0 RDI: ffff8c32a5880000 [13631.177852] [ T140] RBP: ffff8c32a58902d0 R08: 0000000000000002 R09: ffff8c34ba798f40 [13631.177853] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 [13631.177855] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 [13631.177857] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.177859] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.177861] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 [13631.177862] [ T140] PKRU: 55555554 [13631.177864] [ T140] Call Trace: [13631.177866] [ T140] <TASK> [13631.177867] [ T140] ? __warn.cold+0x90/0x9e [13631.177870] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.177953] [ T140] ? report_bug+0xfa/0x140 [13631.177956] [ T140] ? handle_bug+0x53/0x90 [13631.177959] [ T140] ? exc_invalid_op+0x17/0x70 [13631.177961] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.177965] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.178047] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5
[13631.178050] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu]
[13631.178128] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu]
[13631.178245] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.178356] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.178432] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.178436] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.178439] [ T140] pci_device_probe+0xc0/0x180
[13631.178442] [ T140] really_probe+0xd9/0x340
[13631.178445] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.178448] [ T140] __driver_probe_device+0x73/0x110
[13631.178451] [ T140] driver_probe_device+0x1a/0xa0
[13631.178455] [ T140] __device_attach_driver+0x84/0x110
[13631.178458] [ T140] bus_for_each_drv+0x82/0xe0
[13631.178461] [ T140] __device_attach+0xab/0x1b0
[13631.178465] [ T140] pci_bus_add_device+0x53/0x80
[13631.178479] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.178481] [ T140] pci_bus_add_devices+0x56/0x70
[13631.178484] [ T140] pci_bus_add_devices+0x56/0x70
[13631.178486] [ T140] pciehp_configure_device+0xaa/0x160
[13631.178489] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.178491] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.178494] [ T140] pciehp_ist+0x13b/0x180
[13631.178497] [ T140] irq_thread_fn+0x1e/0x60
[13631.178500] [ T140] irq_thread+0x114/0x1e0
[13631.178503] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.178506] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.178509] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.178511] [ T140] kthread+0xea/0x1e0
[13631.178515] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.178518] [ T140] ret_from_fork+0x2f/0x50
[13631.178521] [ T140] ?
kthreads_online_cpu+0xf0/0xf0 [13631.178523] [ T140] ret_from_fork_asm+0x11/0x20 [13631.178528] [ T140] </TASK> [13631.178529] [ T140] ---[ end trace 0000000000000000 ]--- [13631.342117] [ T140] ------------[ cut here ]------------ [13631.342123] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.342257] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13631.342342] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13631.342389] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13631.342394] [ T140] Tainted: [W]=WARN [13631.342396] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13631.342399] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.342528] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 [13631.342532] [ T140] RSP: 0018:ffff9ac3c0743b30 EFLAGS: 00010246 [13631.342536] [ T140] RAX: ffff8c3203e262a0 RBX: ffff8c32a5880000 RCX: 0000000000000000 [13631.342538] [ T140] RDX: 0000000000000000 RSI: ffff8c32a5880c78 RDI: ffff8c32a5880000 [13631.342541] [ T140] RBP: 0000000000000001 R08: 0000000000000000 R09: 0000000000000000 [13631.342543] [ T140] R10: ffff8c34ba79de60 R11: 0000000000000000 R12: ffff8c32a58c6de8 [13631.342546] [ T140] R13: 0000000000000021 R14: ffff8c32a5880000 R15: ffff8c32a5880010 [13631.342548] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13631.342551] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13631.342553] [ T140] CR2: 00007fd810faa000 CR3: 000000010f62a000 CR4: 0000000000750ef0 [13631.342555] [ T140] PKRU: 55555554 [13631.342557] [ T140] Call Trace: [13631.342559] [ T140] <TASK> [13631.342562] [ T140] ? __warn.cold+0x90/0x9e [13631.342566] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] [13631.342652] [ T140] ? report_bug+0xfa/0x140 [13631.342656] [ T140] ? handle_bug+0x53/0x90 [13631.342660] [ T140] ? exc_invalid_op+0x17/0x70 [13631.342662] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13631.342666] [ T140] ? 
amdgpu_irq_put+0x41/0x70 [amdgpu]
[13631.342746] [ T140] gmc_v10_0_hw_fini+0x52/0xb0 [amdgpu]
[13631.342838] [ T140] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu]
[13631.342961] [ T140] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu]
[13631.343073] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu]
[13631.343180] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu]
[13631.343255] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.343259] [ T140] ? driver_probe_device+0xa0/0xa0
[13631.343262] [ T140] pci_device_probe+0xc0/0x180
[13631.343266] [ T140] really_probe+0xd9/0x340
[13631.343269] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13631.343272] [ T140] __driver_probe_device+0x73/0x110
[13631.343275] [ T140] driver_probe_device+0x1a/0xa0
[13631.343279] [ T140] __device_attach_driver+0x84/0x110
[13631.343282] [ T140] bus_for_each_drv+0x82/0xe0
[13631.343285] [ T140] __device_attach+0xab/0x1b0
[13631.343289] [ T140] pci_bus_add_device+0x53/0x80
[13631.343292] [ T140] pci_bus_add_devices+0x2b/0x70
[13631.343294] [ T140] pci_bus_add_devices+0x56/0x70
[13631.343297] [ T140] pci_bus_add_devices+0x56/0x70
[13631.343299] [ T140] pciehp_configure_device+0xaa/0x160
[13631.343302] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13631.343304] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13631.343307] [ T140] pciehp_ist+0x13b/0x180
[13631.343310] [ T140] irq_thread_fn+0x1e/0x60
[13631.343314] [ T140] irq_thread+0x114/0x1e0
[13631.343316] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13631.343319] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13631.343323] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13631.343325] [ T140] kthread+0xea/0x1e0
[13631.343329] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13631.343332] [ T140] ret_from_fork+0x2f/0x50
[13631.343336] [ T140] ?
kthreads_online_cpu+0xf0/0xf0 [13631.343338] [ T140] ret_from_fork_asm+0x11/0x20 [13631.343343] [ T140] </TASK> [13631.343345] [ T140] ---[ end trace 0000000000000000 ]--- [13631.351179] [ T140] amdgpu 0000:03:00.0: probe with driver amdgpu failed with error -121 [13632.005054] [ T140] ------------[ cut here ]------------ [13632.005063] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/drm_buddy.c:337 drm_buddy_fini+0xa8/0xb0 [drm_buddy] [13632.005073] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13632.005147] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13632.005189] [ T140] CPU: 6 UID: 0 
PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 [13632.005194] [ T140] Tainted: [W]=WARN [13632.005196] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13632.005199] [ T140] RIP: 0010:drm_buddy_fini+0xa8/0xb0 [drm_buddy] [13632.005202] [ T140] Code: 44 3b 6d 10 72 a3 4c 8b 65 20 4c 39 65 28 75 1e 48 8b 7d 08 e8 79 1d f1 c5 48 8b 7d 00 5b 5d 41 5c 41 5d 41 5e e9 68 1d f1 c5 <0f> 0b eb b3 0f 0b eb de f3 0f 1e fa 48 8b 0e 89 c8 25 00 0c 00 00 [13632.005205] [ T140] RSP: 0018:ffff9ac3c0743a90 EFLAGS: 00010206 [13632.005208] [ T140] RAX: 0000000000000c00 RBX: 000000000000000c RCX: 00000001feacbfff [13632.005210] [ T140] RDX: ffff8c3203757ea0 RSI: ffff8c3221b8f750 RDI: ffff8c3205edda00 [13632.005212] [ T140] RBP: ffff8c32a588fa50 R08: 0000000000000001 R09: 0000000000000000 [13632.005214] [ T140] R10: ffff8c3205edda00 R11: 00000001feaca000 R12: 0000000001000000 [13632.005216] [ T140] R13: 0000000000000008 R14: 00000000ffffffff R15: ffff8c32a588fa50 [13632.005218] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13632.005220] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13632.005222] [ T140] CR2: 00007fd810faa000 CR3: 000000010f62a000 CR4: 0000000000750ef0 [13632.005224] [ T140] PKRU: 55555554 [13632.005226] [ T140] Call Trace: [13632.005229] [ T140] <TASK> [13632.005232] [ T140] ? __warn.cold+0x90/0x9e [13632.005238] [ T140] ? drm_buddy_fini+0xa8/0xb0 [drm_buddy] [13632.005242] [ T140] ? report_bug+0xfa/0x140 [13632.005247] [ T140] ? handle_bug+0x53/0x90 [13632.005252] [ T140] ? exc_invalid_op+0x17/0x70 [13632.005255] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13632.005260] [ T140] ? 
drm_buddy_fini+0xa8/0xb0 [drm_buddy]
[13632.005264] [ T140] amdgpu_vram_mgr_fini+0x17a/0x1b0 [amdgpu]
[13632.005422] [ T140] amdgpu_ttm_fini+0x14b/0x210 [amdgpu]
[13632.005540] [ T140] amdgpu_bo_fini+0x1f/0x90 [amdgpu]
[13632.005649] [ T140] gmc_v10_0_sw_fini+0x29/0x40 [amdgpu]
[13632.005772] [ T140] amdgpu_device_fini_sw+0xc8/0x3c0 [amdgpu]
[13632.005879] [ T140] amdgpu_driver_release_kms+0x11/0x30 [amdgpu]
[13632.005990] [ T140] drm_dev_put.part.0+0x37/0x60
[13632.005993] [ T140] devres_release_all+0xa6/0xf0
[13632.005998] [ T140] ? driver_probe_device+0xa0/0xa0
[13632.006001] [ T140] device_unbind_cleanup+0x9/0x70
[13632.006004] [ T140] really_probe+0x21c/0x340
[13632.006008] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13632.006012] [ T140] __driver_probe_device+0x73/0x110
[13632.006016] [ T140] driver_probe_device+0x1a/0xa0
[13632.006019] [ T140] __device_attach_driver+0x84/0x110
[13632.006022] [ T140] bus_for_each_drv+0x82/0xe0
[13632.006026] [ T140] __device_attach+0xab/0x1b0
[13632.006030] [ T140] pci_bus_add_device+0x53/0x80
[13632.006033] [ T140] pci_bus_add_devices+0x2b/0x70
[13632.006036] [ T140] pci_bus_add_devices+0x56/0x70
[13632.006038] [ T140] pci_bus_add_devices+0x56/0x70
[13632.006041] [ T140] pciehp_configure_device+0xaa/0x160
[13632.006044] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13632.006047] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13632.006050] [ T140] pciehp_ist+0x13b/0x180
[13632.006053] [ T140] irq_thread_fn+0x1e/0x60
[13632.006056] [ T140] irq_thread+0x114/0x1e0
[13632.006059] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13632.006062] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13632.006065] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13632.006068] [ T140] kthread+0xea/0x1e0
[13632.006072] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13632.006075] [ T140] ret_from_fork+0x2f/0x50
[13632.006079] [ T140] ?
kthreads_online_cpu+0xf0/0xf0 [13632.006082] [ T140] ret_from_fork_asm+0x11/0x20 [13632.006087] [ T140] </TASK> [13632.006088] [ T140] ---[ end trace 0000000000000000 ]--- [13632.006100] [ T140] ------------[ cut here ]------------ [13632.006102] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/drm_buddy.c:344 drm_buddy_fini+0xac/0xb0 [drm_buddy] [13632.006105] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13632.006169] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13632.006203] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 [13632.006206] [ T140] Tainted: [W]=WARN [13632.006208] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13632.006210] [ T140] RIP: 0010:drm_buddy_fini+0xac/0xb0 [drm_buddy] [13632.006212] [ T140] Code: 72 a3 4c 8b 65 20 4c 39 65 28 75 1e 48 8b 7d 08 e8 79 1d f1 c5 48 8b 7d 00 5b 5d 41 5c 41 5d 41 5e e9 68 1d f1 c5 0f 0b eb b3 <0f> 0b eb de f3 0f 1e fa 48 8b 0e 89 c8 25 00 0c 00 00 3d 00 04 00 [13632.006214] [ T140] RSP: 0018:ffff9ac3c0743a90 EFLAGS: 00010287 [13632.006217] [ T140] RAX: 0000000001000000 RBX: 000000000000000c RCX: 000000000000000c [13632.006219] [ T140] RDX: 0000000000001000 RSI: ffff8c3221b8f750 RDI: 0000000000380022 [13632.006221] [ T140] RBP: ffff8c32a588fa50 R08: 0000000000000001 R09: 0000000000000000 [13632.006222] [ T140] R10: 0000000000380022 R11: 0000000000000000 R12: 00000001ff000000 [13632.006224] [ T140] R13: 0000000000000009 R14: 00000000ffffffff R15: ffff8c32a588fa50 [13632.006226] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13632.006228] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13632.006230] [ T140] CR2: 00007fd810faa000 CR3: 000000010f62a000 CR4: 0000000000750ef0 [13632.006232] [ T140] PKRU: 55555554 [13632.006233] [ T140] Call Trace: [13632.006235] [ T140] <TASK> [13632.006237] [ T140] ? __warn.cold+0x90/0x9e [13632.006240] [ T140] ? drm_buddy_fini+0xac/0xb0 [drm_buddy] [13632.006242] [ T140] ? report_bug+0xfa/0x140 [13632.006246] [ T140] ? handle_bug+0x53/0x90 [13632.006249] [ T140] ? exc_invalid_op+0x17/0x70 [13632.006252] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13632.006255] [ T140] ? 
drm_buddy_fini+0xac/0xb0 [drm_buddy]
[13632.006258] [ T140] amdgpu_vram_mgr_fini+0x17a/0x1b0 [amdgpu]
[13632.006385] [ T140] amdgpu_ttm_fini+0x14b/0x210 [amdgpu]
[13632.006511] [ T140] amdgpu_bo_fini+0x1f/0x90 [amdgpu]
[13632.006632] [ T140] gmc_v10_0_sw_fini+0x29/0x40 [amdgpu]
[13632.006763] [ T140] amdgpu_device_fini_sw+0xc8/0x3c0 [amdgpu]
[13632.006883] [ T140] amdgpu_driver_release_kms+0x11/0x30 [amdgpu]
[13632.006964] [ T140] drm_dev_put.part.0+0x37/0x60
[13632.006966] [ T140] devres_release_all+0xa6/0xf0
[13632.006970] [ T140] ? driver_probe_device+0xa0/0xa0
[13632.006973] [ T140] device_unbind_cleanup+0x9/0x70
[13632.006976] [ T140] really_probe+0x21c/0x340
[13632.006979] [ T140] ? pm_runtime_barrier+0x4f/0x90
[13632.006983] [ T140] __driver_probe_device+0x73/0x110
[13632.006986] [ T140] driver_probe_device+0x1a/0xa0
[13632.006989] [ T140] __device_attach_driver+0x84/0x110
[13632.006993] [ T140] bus_for_each_drv+0x82/0xe0
[13632.006996] [ T140] __device_attach+0xab/0x1b0
[13632.007000] [ T140] pci_bus_add_device+0x53/0x80
[13632.007003] [ T140] pci_bus_add_devices+0x2b/0x70
[13632.007005] [ T140] pci_bus_add_devices+0x56/0x70
[13632.007008] [ T140] pci_bus_add_devices+0x56/0x70
[13632.007010] [ T140] pciehp_configure_device+0xaa/0x160
[13632.007013] [ T140] ? pcie_capability_read_word+0x7a/0x90
[13632.007015] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350
[13632.007018] [ T140] pciehp_ist+0x13b/0x180
[13632.007021] [ T140] irq_thread_fn+0x1e/0x60
[13632.007024] [ T140] irq_thread+0x114/0x1e0
[13632.007027] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0
[13632.007030] [ T140] ? irq_set_affinity_notifier+0x120/0x120
[13632.007033] [ T140] ? irq_affinity_notify+0xd0/0xd0
[13632.007036] [ T140] kthread+0xea/0x1e0
[13632.007039] [ T140] ? kthreads_online_cpu+0xf0/0xf0
[13632.007042] [ T140] ret_from_fork+0x2f/0x50
[13632.007045] [ T140] ?
kthreads_online_cpu+0xf0/0xf0 [13632.007048] [ T140] ret_from_fork_asm+0x11/0x20 [13632.007053] [ T140] </TASK> [13632.007054] [ T140] ---[ end trace 0000000000000000 ]--- [13632.007059] [ T140] ------------[ cut here ]------------ [13632.007061] [ T140] Memory manager not clean during takedown. [13632.007066] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/drm_mm.c:964 drm_mm_takedown+0x22/0x30 [13632.007069] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13632.007133] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13632.007167] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: 
loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 [13632.007170] [ T140] Tainted: [W]=WARN [13632.007171] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13632.007173] [ T140] RIP: 0010:drm_mm_takedown+0x22/0x30 [13632.007175] [ T140] Code: 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 48 8b 47 38 48 83 c7 38 48 39 f8 75 05 e9 55 74 8e ff 48 c7 c7 f0 ad e7 86 e8 be c1 a7 ff <0f> 0b e9 42 74 8e ff 0f 1f 80 00 00 00 00 f3 0f 1e fa 41 57 49 89 [13632.007177] [ T140] RSP: 0018:ffff9ac3c0743ac8 EFLAGS: 00010282 [13632.007180] [ T140] RAX: 0000000000000000 RBX: 0000000000000007 RCX: 0000000000000027 [13632.007181] [ T140] RDX: ffff8c34ba797808 RSI: 0000000000000001 RDI: ffff8c34ba797800 [13632.007183] [ T140] RBP: ffff8c3205edfe00 R08: 0000000000000000 R09: ffff9ac3c0743950 [13632.007185] [ T140] R10: ffff8c34e02fffa8 R11: 0000000000000003 R12: ffff8c32a588ef80 [13632.007187] [ T140] R13: ffff8c3205edff70 R14: 0000000000000000 R15: ffff8c3276051358 [13632.007188] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13632.007190] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13632.007192] [ T140] CR2: 00007fd810faa000 CR3: 000000010f62a000 CR4: 0000000000750ef0 [13632.007194] [ T140] PKRU: 55555554 [13632.007195] [ T140] Call Trace: [13632.007197] [ T140] <TASK> [13632.007199] [ T140] ? __warn.cold+0x90/0x9e [13632.007202] [ T140] ? drm_mm_takedown+0x22/0x30 [13632.007204] [ T140] ? report_bug+0xfa/0x140 [13632.007207] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5 [13632.007211] [ T140] ? handle_bug+0x53/0x90 [13632.007214] [ T140] ? exc_invalid_op+0x17/0x70 [13632.007216] [ T140] ? asm_exc_invalid_op+0x1a/0x20 [13632.007220] [ T140] ? drm_mm_takedown+0x22/0x30 [13632.007222] [ T140] ? 
drm_mm_takedown+0x22/0x30 [13632.007224] [ T140] ttm_range_man_fini_nocheck+0x86/0x100 [ttm] [13632.007230] [ T140] amdgpu_ttm_fini+0x18f/0x210 [amdgpu] [13632.007310] [ T140] amdgpu_bo_fini+0x1f/0x90 [amdgpu] [13632.007390] [ T140] gmc_v10_0_sw_fini+0x29/0x40 [amdgpu] [13632.007484] [ T140] amdgpu_device_fini_sw+0xc8/0x3c0 [amdgpu] [13632.007563] [ T140] amdgpu_driver_release_kms+0x11/0x30 [amdgpu] [13632.007641] [ T140] drm_dev_put.part.0+0x37/0x60 [13632.007644] [ T140] devres_release_all+0xa6/0xf0 [13632.007648] [ T140] ? driver_probe_device+0xa0/0xa0 [13632.007651] [ T140] device_unbind_cleanup+0x9/0x70 [13632.007654] [ T140] really_probe+0x21c/0x340 [13632.007657] [ T140] ? pm_runtime_barrier+0x4f/0x90 [13632.007660] [ T140] __driver_probe_device+0x73/0x110 [13632.007663] [ T140] driver_probe_device+0x1a/0xa0 [13632.007666] [ T140] __device_attach_driver+0x84/0x110 [13632.007670] [ T140] bus_for_each_drv+0x82/0xe0 [13632.007673] [ T140] __device_attach+0xab/0x1b0 [13632.007677] [ T140] pci_bus_add_device+0x53/0x80 [13632.007680] [ T140] pci_bus_add_devices+0x2b/0x70 [13632.007682] [ T140] pci_bus_add_devices+0x56/0x70 [13632.007685] [ T140] pci_bus_add_devices+0x56/0x70 [13632.007687] [ T140] pciehp_configure_device+0xaa/0x160 [13632.007690] [ T140] ? pcie_capability_read_word+0x7a/0x90 [13632.007692] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 [13632.007695] [ T140] pciehp_ist+0x13b/0x180 [13632.007698] [ T140] irq_thread_fn+0x1e/0x60 [13632.007701] [ T140] irq_thread+0x114/0x1e0 [13632.007704] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13632.007707] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13632.007710] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13632.007713] [ T140] kthread+0xea/0x1e0 [13632.007716] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13632.007719] [ T140] ret_from_fork+0x2f/0x50 [13632.007722] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13632.007725] [ T140] ret_from_fork_asm+0x11/0x20 [13632.007729] [ T140] </TASK> [13632.007731] [ T140] ---[ end trace 0000000000000000 ]--- [13632.007752] [ T140] [drm] amdgpu: ttm finalized [13632.007775] [ T140] BUG: kernel NULL pointer dereference, address: 0000000000000058 [13632.007777] [ T140] #PF: supervisor read access in kernel mode [13632.007779] [ T140] #PF: error_code(0x0000) - not-present page [13632.007781] [ T140] PGD 175454067 P4D 175454067 PUD 0 [13632.007786] [ T140] Oops: Oops: 0000 [#1] PREEMPT SMP NOPTI [13632.007788] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 [13632.007791] [ T140] Tainted: [W]=WARN [13632.007793] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 [13632.007795] [ T140] RIP: 0010:ttm_resource_move_to_lru_tail+0xc1/0xe0 [ttm] [13632.007799] [ T140] Code: 46 40 48 8b 94 ca 98 00 00 00 48 8b 4e 48 48 89 4f 08 48 89 39 48 89 c1 48 83 c0 03 48 c1 e1 04 48 c1 e0 04 48 01 d1 48 01 c2 <48> 8b 79 38 4c 89 41 38 48 89 56 40 48 89 7e 48 4c 89 07 e9 42 0d [13632.007801] [ T140] RSP: 0018:ffff9ac3c0743af0 EFLAGS: 00010206 [13632.007803] [ T140] RAX: 0000000000000050 RBX: ffff8c3276008848 RCX: 0000000000000020 [13632.007805] [ T140] RDX: 0000000000000050 RSI: ffff8c332918e100 RDI: ffff8c332918e140 [13632.007807] [ T140] RBP: ffff8c32a588ef80 R08: ffff8c332918e140 R09: 0000000000000000 [13632.007809] [ T140] R10: 0000000000400032 R11: 0000000000000000 R12: 0000000000000000 [13632.007811] [ T140] R13: ffff8c3276008800 R14: ffff8c32a588ef80 R15: ffff8c3276051358 [13632.007812] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 [13632.007814] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [13632.007816] [ T140] CR2: 0000000000000058 CR3: 000000010f62a000 CR4: 0000000000750ef0 [13632.007818] [ T140] PKRU: 55555554 [13632.007820] [ T140] 
Call Trace: [13632.007821] [ T140] <TASK> [13632.007823] [ T140] ? __die+0x51/0x92 [13632.007826] [ T140] ? page_fault_oops+0x99/0x220 [13632.007831] [ T140] ? exc_page_fault+0x32e/0x600 [13632.007834] [ T140] ? asm_exc_page_fault+0x26/0x30 [13632.007837] [ T140] ? ttm_resource_move_to_lru_tail+0xc1/0xe0 [ttm] [13632.007841] [ T140] ttm_bo_unpin+0x58/0x80 [ttm] [13632.007845] [ T140] amdgpu_bo_unpin+0x19/0x90 [amdgpu] [13632.007926] [ T140] amdgpu_bo_free_kernel+0x77/0x100 [amdgpu] [13632.008006] [ T140] amdgpu_device_fini_sw+0x339/0x3c0 [amdgpu] [13632.008085] [ T140] amdgpu_driver_release_kms+0x11/0x30 [amdgpu] [13632.008163] [ T140] drm_dev_put.part.0+0x37/0x60 [13632.008166] [ T140] devres_release_all+0xa6/0xf0 [13632.008169] [ T140] ? driver_probe_device+0xa0/0xa0 [13632.008173] [ T140] device_unbind_cleanup+0x9/0x70 [13632.008176] [ T140] really_probe+0x21c/0x340 [13632.008179] [ T140] ? pm_runtime_barrier+0x4f/0x90 [13632.008182] [ T140] __driver_probe_device+0x73/0x110 [13632.008185] [ T140] driver_probe_device+0x1a/0xa0 [13632.008188] [ T140] __device_attach_driver+0x84/0x110 [13632.008192] [ T140] bus_for_each_drv+0x82/0xe0 [13632.008195] [ T140] __device_attach+0xab/0x1b0 [13632.008199] [ T140] pci_bus_add_device+0x53/0x80 [13632.008201] [ T140] pci_bus_add_devices+0x2b/0x70 [13632.008204] [ T140] pci_bus_add_devices+0x56/0x70 [13632.008206] [ T140] pci_bus_add_devices+0x56/0x70 [13632.008209] [ T140] pciehp_configure_device+0xaa/0x160 [13632.008211] [ T140] ? pcie_capability_read_word+0x7a/0x90 [13632.008214] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 [13632.008217] [ T140] pciehp_ist+0x13b/0x180 [13632.008220] [ T140] irq_thread_fn+0x1e/0x60 [13632.008223] [ T140] irq_thread+0x114/0x1e0 [13632.008225] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 [13632.008228] [ T140] ? irq_set_affinity_notifier+0x120/0x120 [13632.008232] [ T140] ? irq_affinity_notify+0xd0/0xd0 [13632.008235] [ T140] kthread+0xea/0x1e0 [13632.008238] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 [13632.008241] [ T140] ret_from_fork+0x2f/0x50 [13632.008244] [ T140] ? kthreads_online_cpu+0xf0/0xf0 [13632.008247] [ T140] ret_from_fork_asm+0x11/0x20 [13632.008251] [ T140] </TASK> [13632.008253] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore [13632.008316] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform i2c_designware_core [13632.008349] [ T140] CR2: 0000000000000058 Bert Karwatzki ^ permalink raw reply [flat|nested] 31+ messages in thread
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-11-05 11:44 ` Bert Karwatzki
@ 2025-11-05 21:31 ` Mario Limonciello (AMD) (kernel.org)
2025-11-07 13:09 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Mario Limonciello (AMD) (kernel.org) @ 2025-11-05 21:31 UTC (permalink / raw)
To: Bert Karwatzki, Christian König, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Rafael J . Wysocki

On 11/5/2025 5:44 AM, Bert Karwatzki wrote:
> I finally got a result from kdump regarding this bug. As I told I'm currently
> trying to bisect this (again ...) between v6.14 and v6.15. My test setup during
> overnight tests is to put on a long youtube video and then simulate some
> interactivity by running this script:
>
> #!/bin/bash
> for i in {0..10000}
> do
>     echo $i
>     evolution &
>     sleep 3
>     killall evolution
>     sleep 27
> done
>
>
> I'm not done with the bisection, yet, but this night I got a result from kdump
> showing a NULL pointer dereference after a loss of the discrete GPU:
> (This may be a different bug though, as this did not result in a reboot
> but hang instead)

I think this is a different problem, in how we handle cleanup after a GPU
has disappeared. FWIW we do have a lot of fixups past 6.14 in this area.
I have also very recently written some unwind code for failed suspend
that's going into 6.19 and might help here. I'm not sure if it's in
drm-next yet; if it's not, it will be soon!

Once you're done with your bisect I'd be really interested to know whether
you can still reproduce the splats and the NULL pointer dereference on the
recovery path using amd-staging-drm-next.

>
> faddr2line gives this regarding the NULL pointer:
> $ scripts/faddr2line drivers/gpu/drm/ttm/ttm_resource.o ttm_resource_move_to_lru_tail+0xc1/0xe0
> ttm_resource_move_to_lru_tail+0xc1/0xe0:
> list_add_tail at /mnt/data/linux-forest/mystery_shutdown/./include/linux/list.h:183
> (inlined by) list_move_tail at /mnt/data/linux-forest/mystery_shutdown/./include/linux/list.h:311
> (inlined by) ttm_resource_move_to_lru_tail at /mnt/data/linux-forest/mystery_shutdown/drivers/gpu/drm/ttm/ttm_resource.c:291
>
> So I probably should use CONFIG_DEBUG_LIST from now on.
>
>
> [13600.900669] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Link Down
> [13600.900678] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Card not present
> [13600.971642] [ T53331] amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:7 param:0x00000000 message:DisableAllSmuFeatures?
> [13600.971649] [ T53331] amdgpu 0000:03:00.0: amdgpu: Failed to disable smu features.
> [13600.971653] [ T53331] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13600.971656] [ T53331] amdgpu 0000:03:00.0: amdgpu: [PrepareMp1] Failed!
> [13600.971658] [ T53331] [drm:amdgpu_device_ip_suspend_phase2 [amdgpu]] *ERROR* SMC failed to set mp1 state 2, -121
> [13600.971779] [ T53331] amdgpu 0000:03:00.0: Unable to change power state from D0 to D3hot, device inaccessible
> [13600.971809] [ T140] amdgpu 0000:03:00.0: Unable to change power state from D3cold to D0, device inaccessible
> [13611.504805] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
> [13611.504924] [ T140] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version =
> 0x003b3100 (59.49.0)
> [13611.504930] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
> [13611.504933] [ T140] amdgpu 0000:03:00.0: amdgpu: dpm has been enabled
> [13611.504936] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is resumed successfully!
> [13611.664765] [ T140] amdgpu 0000:03:00.0: amdgpu: rlc autoload: gc ucode autoload timeout > [13611.664771] [ T140] amdgpu 0000:03:00.0: amdgpu: resume of IP block <gfx_v10_0> failed -110 > [13611.664775] [ T140] amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_resume failed (-110). > [13611.665372] [ T32730] pcieport 0000:02:00.0: Unable to change power state from D0 to D3hot, device inaccessible > [13611.666216] [ T32730] pcieport 0000:01:00.0: Unable to change power state from D0 to D3hot, device inaccessible > [13611.763659] [ T140] amdgpu 0000:03:00.0: amdgpu: amdgpu: finishing device. > [13611.763798] [ T140] ------------[ cut here ]------------ > [13611.763801] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13611.763924] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13611.763993] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm 
drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13611.764031] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Not tainted 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13611.764034] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13611.764036] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13611.764129] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13611.764132] [ T140] RSP: 0018:ffff9ac3c0743bf0 EFLAGS: 00010246 > [13611.764135] [ T140] RAX: ffff8c3200c40280 RBX: ffff8c3200dba000 RCX: 0000000000000000 > [13611.764137] [ T140] RDX: 0000000000000000 RSI: ffff8c3200dba008 RDI: ffff8c320a600000 > [13611.764139] [ T140] RBP: ffff8c3200dba000 R08: 0000000000000001 R09: 0000000000000000 > [13611.764141] [ T140] R10: 000000000040003f R11: 0000000000000000 R12: ffff8c320a600000 > [13611.764142] [ T140] R13: ffffffffc0be01a8 R14: ffffffffc0be01a8 R15: ffff9ac3c0743d6e > [13611.764144] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13611.764146] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13611.764148] [ T140] CR2: 000055fcf3dd7e44 CR3: 0000000271a64000 CR4: 0000000000750ef0 > [13611.764150] [ T140] PKRU: 55555554 > [13611.764152] [ T140] Call Trace: > [13611.764155] [ T140] <TASK> > [13611.764157] [ T140] ? __warn.cold+0x90/0x9e > [13611.764162] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13611.764248] [ T140] ? 
report_bug+0xfa/0x140 > [13611.764253] [ T140] ? handle_bug+0x53/0x90 > [13611.764256] [ T140] ? exc_invalid_op+0x17/0x70 > [13611.764259] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13611.764263] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13611.764346] [ T140] smu_smc_hw_cleanup+0x5e/0x3e0 [amdgpu] > [13611.764460] [ T140] smu_hw_fini+0xfb/0x1a0 [amdgpu] > [13611.764573] [ T140] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] > [13611.764699] [ T140] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] > [13611.764815] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] > [13611.764893] [ T140] pci_device_remove+0x3d/0xb0 > [13611.764897] [ T140] device_release_driver_internal+0x197/0x200 > [13611.764900] [ T140] pci_stop_bus_device+0x68/0x80 > [13611.764904] [ T140] pci_stop_bus_device+0x38/0x80 > [13611.764907] [ T140] pci_stop_bus_device+0x27/0x80 > [13611.764909] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 > [13611.764912] [ T140] pciehp_unconfigure_device+0x93/0x160 > [13611.764916] [ T140] pciehp_disable_slot+0x62/0x100 > [13611.764919] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 > [13611.764922] [ T140] pciehp_ist+0x13b/0x180 > [13611.764925] [ T140] irq_thread_fn+0x1e/0x60 > [13611.764929] [ T140] irq_thread+0x114/0x1e0 > [13611.764932] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13611.764935] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13611.764938] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13611.764941] [ T140] kthread+0xea/0x1e0 > [13611.764945] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13611.764948] [ T140] ret_from_fork+0x2f/0x50 > [13611.764951] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13611.764954] [ T140] ret_from_fork_asm+0x11/0x20 > [13611.764959] [ T140] </TASK> > [13611.764960] [ T140] ---[ end trace 0000000000000000 ]--- > [13611.764963] [ T140] amdgpu 0000:03:00.0: amdgpu: Fail to disable thermal alert! 
> [13611.785004] [ T140] ------------[ cut here ]------------ > [13611.785008] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.785128] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13611.785216] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13611.785262] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13611.785266] [ T140] Tainted: [W]=WARN > [13611.785268] [ T140] Hardware name: Micro-Star International Co., 
Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13611.785271] [ T140] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.785377] [ T140] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 e7 49 b2 c5 > <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 85 97 5d 00 eb bd 0f 1f > [13611.785380] [ T140] RSP: 0018:ffff9ac3c0743bd0 EFLAGS: 00010202 > [13611.785384] [ T140] RAX: 0000000000000000 RBX: ffff8c320a63b830 RCX: 0000000080000000 > [13611.785386] [ T140] RDX: ffff8c320a63b880 RSI: ffff8c320a63b888 RDI: ffff8c320a63b830 > [13611.785389] [ T140] RBP: ffff8c320a63b880 R08: 0000000000000000 R09: 00000000ffffffea > [13611.785391] [ T140] R10: ffff8c34e02fffa8 R11: 0000000000000003 R12: ffff8c320a63b888 > [13611.785393] [ T140] R13: ffff8c3205b49800 R14: ffff8c320a60ef80 R15: ffff9ac3c0743d6e > [13611.785395] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13611.785398] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13611.785400] [ T140] CR2: 000055fcf3dd7e44 CR3: 0000000271a64000 CR4: 0000000000750ef0 > [13611.785403] [ T140] PKRU: 55555554 > [13611.785405] [ T140] Call Trace: > [13611.785408] [ T140] <TASK> > [13611.785410] [ T140] ? __warn.cold+0x90/0x9e > [13611.785415] [ T140] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.785530] [ T140] ? report_bug+0xfa/0x140 > [13611.785535] [ T140] ? handle_bug+0x53/0x90 > [13611.785540] [ T140] ? exc_invalid_op+0x17/0x70 > [13611.785543] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13611.785548] [ T140] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.785651] [ T140] psp_v11_0_ring_destroy+0x2e/0x50 [amdgpu] > [13611.785771] [ T140] psp_hw_fini+0x126/0x380 [amdgpu] > [13611.785856] [ T140] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] > [13611.785974] [ T140] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] > [13611.786084] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] > [13611.786160] [ T140] pci_device_remove+0x3d/0xb0 > [13611.786164] [ T140] device_release_driver_internal+0x197/0x200 > [13611.786167] [ T140] pci_stop_bus_device+0x68/0x80 > [13611.786170] [ T140] pci_stop_bus_device+0x38/0x80 > [13611.786173] [ T140] pci_stop_bus_device+0x27/0x80 > [13611.786175] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 > [13611.786178] [ T140] pciehp_unconfigure_device+0x93/0x160 > [13611.786181] [ T140] pciehp_disable_slot+0x62/0x100 > [13611.786184] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 > [13611.786201] [ T140] pciehp_ist+0x13b/0x180 > [13611.786204] [ T140] irq_thread_fn+0x1e/0x60 > [13611.786208] [ T140] irq_thread+0x114/0x1e0 > [13611.786211] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13611.786213] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13611.786216] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13611.786219] [ T140] kthread+0xea/0x1e0 > [13611.786223] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13611.786226] [ T140] ret_from_fork+0x2f/0x50 > [13611.786229] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13611.786232] [ T140] ret_from_fork_asm+0x11/0x20 > [13611.786236] [ T140] </TASK> > [13611.786238] [ T140] ---[ end trace 0000000000000000 ]--- > [13611.787301] [ T140] ------------[ cut here ]------------ > [13611.787303] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.787382] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13611.787447] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13611.787494] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: 
loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13611.787497] [ T140] Tainted: [W]=WARN > [13611.787499] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13611.787501] [ T140] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.787578] [ T140] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 e7 49 b2 c5 > <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 85 97 5d 00 eb bd 0f 1f > [13611.787580] [ T140] RSP: 0018:ffff9ac3c0743c00 EFLAGS: 00010202 > [13611.787583] [ T140] RAX: ffff8c34ba7a5d80 RBX: ffff8c320a614b60 RCX: 000000000000016f > [13611.787585] [ T140] RDX: ffff8c320a614b68 RSI: ffff8c320a614b70 RDI: ffff8c320a614b60 > [13611.787586] [ T140] RBP: ffff8c320a614b68 R08: 00000000000056ee R09: 0000000000000009 > [13611.787588] [ T140] R10: 00000000000000b2 R11: 000000000000000a R12: ffff8c320a614b70 > [13611.787590] [ T140] R13: ffff8c320a6cb400 R14: ffff8c320a60ef80 R15: ffff9ac3c0743d6e > [13611.787592] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13611.787594] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13611.787596] [ T140] CR2: 000055fcf3dd7e44 CR3: 0000000271a64000 CR4: 0000000000750ef0 > [13611.787597] [ T140] PKRU: 55555554 > [13611.787599] [ T140] Call Trace: > [13611.787601] [ T140] <TASK> > [13611.787603] [ T140] ? __warn.cold+0x90/0x9e > [13611.787606] [ T140] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.787691] [ T140] ? report_bug+0xfa/0x140 > [13611.787695] [ T140] ? handle_bug+0x53/0x90 > [13611.787699] [ T140] ? exc_invalid_op+0x17/0x70 > [13611.787702] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13611.787707] [ T140] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.787801] [ T140] amdgpu_ih_ring_fini+0x4f/0x80 [amdgpu] > [13611.787909] [ T140] amdgpu_irq_fini_hw+0x2f/0x80 [amdgpu] > [13611.788011] [ T140] amdgpu_device_fini_hw+0x231/0x2ad [amdgpu] > [13611.788156] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] > [13611.788249] [ T140] pci_device_remove+0x3d/0xb0 > [13611.788253] [ T140] device_release_driver_internal+0x197/0x200 > [13611.788257] [ T140] pci_stop_bus_device+0x68/0x80 > [13611.788261] [ T140] pci_stop_bus_device+0x38/0x80 > [13611.788264] [ T140] pci_stop_bus_device+0x27/0x80 > [13611.788267] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 > [13611.788270] [ T140] pciehp_unconfigure_device+0x93/0x160 > [13611.788274] [ T140] pciehp_disable_slot+0x62/0x100 > [13611.788277] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 > [13611.788281] [ T140] pciehp_ist+0x13b/0x180 > [13611.788284] [ T140] irq_thread_fn+0x1e/0x60 > [13611.788288] [ T140] irq_thread+0x114/0x1e0 > [13611.788291] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13611.788295] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13611.788298] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13611.788302] [ T140] kthread+0xea/0x1e0 > [13611.788306] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13611.788309] [ T140] ret_from_fork+0x2f/0x50 > [13611.788313] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13611.788317] [ T140] ret_from_fork_asm+0x11/0x20 > [13611.788322] [ T140] </TASK> > [13611.788324] [ T140] ---[ end trace 0000000000000000 ]--- > [13611.789149] [ T140] ------------[ cut here ]------------ > [13611.789151] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_object.c:510 amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.789230] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13611.789294] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13611.789328] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: 
loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13611.789331] [ T140] Tainted: [W]=WARN > [13611.789333] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13611.789334] [ T140] RIP: 0010:amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.789411] [ T140] Code: f6 ff ff 4d 85 e4 74 08 49 c7 04 24 00 00 00 00 48 85 ed 74 08 48 c7 45 00 00 00 00 00 5b 5d 41 5c 41 5d 41 5e e9 e7 49 b2 c5 > <0f> 0b e9 4b ff ff ff 3d 00 fe ff ff 0f 85 85 97 5d 00 eb bd 0f 1f > [13611.789413] [ T140] RSP: 0018:ffff9ac3c0743c48 EFLAGS: 00010202 > [13611.789416] [ T140] RAX: 0000000000000000 RBX: ffff8c320a600a20 RCX: 0000000000000000 > [13611.789418] [ T140] RDX: ffff8c320a600a28 RSI: 0000000000000000 RDI: ffff8c320a600a20 > [13611.789420] [ T140] RBP: ffff8c320a600a28 R08: ffff8c3221bd1c18 R09: 00007f363c49e000 > [13611.789421] [ T140] R10: 0000000000000020 R11: 000000000000009d R12: 0000000000000000 > [13611.789423] [ T140] R13: ffff8c320a6cb000 R14: ffff8c320a60ef80 R15: ffff9ac3c0743d6e > [13611.789425] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13611.789427] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13611.789429] [ T140] CR2: 000055fcf3dd7e44 CR3: 0000000271a64000 CR4: 0000000000750ef0 > [13611.789430] [ T140] PKRU: 55555554 > [13611.789432] [ T140] Call Trace: > [13611.789434] [ T140] <TASK> > [13611.789436] [ T140] ? __warn.cold+0x90/0x9e > [13611.789439] [ T140] ? amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.789524] [ T140] ? report_bug+0xfa/0x140 > [13611.789528] [ T140] ? handle_bug+0x53/0x90 > [13611.789531] [ T140] ? exc_invalid_op+0x17/0x70 > [13611.789533] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13611.789537] [ T140] ? 
amdgpu_bo_free_kernel+0xe4/0x100 [amdgpu] > [13611.789614] [ T140] amdgpu_device_unmap_mmio+0x25/0x90 [amdgpu] > [13611.789689] [ T140] amdgpu_pci_remove+0x40/0x70 [amdgpu] > [13611.789765] [ T140] pci_device_remove+0x3d/0xb0 > [13611.789768] [ T140] device_release_driver_internal+0x197/0x200 > [13611.789771] [ T140] pci_stop_bus_device+0x68/0x80 > [13611.789774] [ T140] pci_stop_bus_device+0x38/0x80 > [13611.789776] [ T140] pci_stop_bus_device+0x27/0x80 > [13611.789779] [ T140] pci_stop_and_remove_bus_device+0xd/0x20 > [13611.789782] [ T140] pciehp_unconfigure_device+0x93/0x160 > [13611.789785] [ T140] pciehp_disable_slot+0x62/0x100 > [13611.789787] [ T140] pciehp_handle_presence_or_link_change+0x72/0x350 > [13611.789790] [ T140] pciehp_ist+0x13b/0x180 > [13611.789793] [ T140] irq_thread_fn+0x1e/0x60 > [13611.789796] [ T140] irq_thread+0x114/0x1e0 > [13611.789799] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13611.789801] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13611.789805] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13611.789807] [ T140] kthread+0xea/0x1e0 > [13611.789810] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13611.789814] [ T140] ret_from_fork+0x2f/0x50 > [13611.789817] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0
> [13611.789820] [ T140] ret_from_fork_asm+0x11/0x20
> [13611.789824] [ T140] </TASK>
> [13611.789826] [ T140] ---[ end trace 0000000000000000 ]---
> [13612.510583] [ T140] pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible
> [13612.510746] [ T140] pcieport 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
> [13612.515993] [ T140] pci_bus 0000:03: busn_res: [bus 03] is released
> [13612.517813] [ T140] pci_bus 0000:02: busn_res: [bus 02-03] is released
> [13612.517957] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Card present
> [13612.517960] [ T140] pcieport 0000:00:01.1: pciehp: Slot(0): Link Up
> [13612.646970] [ T140] pci 0000:01:00.0: [1002:1478] type 01 class 0x060400 PCIe Switch Upstream Port
> [13612.647337] [ T140] pci 0000:01:00.0: BAR 0 [mem 0xfcc00000-0xfcc03fff]
> [13612.647459] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03]
> [13612.648148] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
> [13612.649293] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
> [13612.651803] [ T140] pci 0000:01:00.0: PME# supported from D0 D3hot D3cold
> [13612.655075] [ T140] pci 0000:01:00.0: 16.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x8 link at 0000:00:01.1 (capable of 126.024 Gb/s with 16.0 GT/s PCIe x8 link)
> [13612.655993] [ T140] pci 0000:01:00.0: Adding to iommu group 12
> [13612.657710] [ T140] pci 0000:02:00.0: [1002:1479] type 01 class 0x060400 PCIe Switch Downstream Port
> [13612.658068] [ T140] pci 0000:02:00.0: PCI bridge to [bus 03]
> [13612.658078] [ T140] pci 0000:02:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
> [13612.658315] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
> [13612.661595] [ T140] pci 0000:02:00.0: PME# supported from D0 D3hot D3cold
> [13612.667373] [ T140] pci 0000:02:00.0: Adding to iommu group 13
> [13612.667877] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03]
> [13612.668858] [ T140] pci 0000:03:00.0: [1002:73ff] type 00 class 0x038000 PCIe Legacy Endpoint
> [13612.669236] [ T140] pci 0000:03:00.0: BAR 0 [mem 0xfc00000000-0xfdffffffff 64bit pref]
> [13612.669241] [ T140] pci 0000:03:00.0: BAR 2 [mem 0xfe00000000-0xfe0fffffff 64bit pref]
> [13612.669480] [ T140] pci 0000:03:00.0: BAR 5 [mem 0xfca00000-0xfcafffff]
> [13612.669484] [ T140] pci 0000:03:00.0: ROM [mem 0xfcb00000-0xfcb1ffff pref]
> [13612.672312] [ T140] pci 0000:03:00.0: PME# supported from D1 D2 D3hot D3cold
> [13612.673659] [ T140] pci 0000:03:00.0: 16.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x8 link at 0000:00:01.1 (capable of 252.048 Gb/s with 16.0 GT/s PCIe x16 link)
> [13612.675633] [ T140] pci 0000:03:00.0: Adding to iommu group 14
> [13612.676122] [ T140] pci 0000:03:00.1: [1002:ab28] type 00 class 0x040300 PCIe Legacy Endpoint
> [13612.677265] [ T140] pci 0000:03:00.1: BAR 0 [mem 0xfcb20000-0xfcb23fff]
> [13612.678612] [ T140] pci 0000:03:00.1: PME# supported from D1 D2 D3hot D3cold
> [13612.679722] [ T140] pci 0000:03:00.1: Adding to iommu group 15
> [13612.680123] [ T140] pci 0000:02:00.0: PCI bridge to [bus 03]
> [13612.680834] [ T140] pcieport 0000:00:01.1: Assigned bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 01-03] cannot fit 0x300000000 required for 0000:02:00.0 bridging to [bus 03]
> [13612.680838] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 03] requires relaxed alignment rules
> [13612.680842] [ T140] pcieport 0000:00:01.1: Assigned bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 01-03] cannot fit 0x400000000 required for 0000:01:00.0 bridging to [bus 02-03]
> [13612.680845] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref] to [bus 02-03] requires relaxed alignment rules
> [13612.680851] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]: assigned
> [13612.680853] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff]: assigned
> [13612.680856] [ T140] pci 0000:01:00.0: BAR 0 [mem 0xfcc00000-0xfcc03fff]: assigned
> [13612.680861] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]: assigned
> [13612.680864] [ T140] pci 0000:02:00.0: bridge window [mem 0xfca00000-0xfcbfffff]: assigned
> [13612.680867] [ T140] pci 0000:03:00.0: BAR 0 [mem 0xfc00000000-0xfdffffffff 64bit pref]: assigned
> [13612.680877] [ T140] pci 0000:03:00.0: BAR 2 [mem 0xfe00000000-0xfe0fffffff 64bit pref]: assigned
> [13612.680888] [ T140] pci 0000:03:00.0: BAR 5 [mem 0xfca00000-0xfcafffff]: assigned
> [13612.681010] [ T140] pci 0000:03:00.0: ROM [mem 0xfcb00000-0xfcb1ffff pref]: assigned
> [13612.681012] [ T140] pci 0000:03:00.1: BAR 0 [mem 0xfcb20000-0xfcb23fff]: assigned
> [13612.681233] [ T140] pci 0000:02:00.0: PCI bridge to [bus 03]
> [13612.681589] [ T140] pci 0000:02:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
> [13612.681707] [ T140] pci 0000:02:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
> [13612.681714] [ T140] pci 0000:01:00.0: PCI bridge to [bus 02-03]
> [13612.681833] [ T140] pci 0000:01:00.0: bridge window [mem 0xfca00000-0xfcbfffff]
> [13612.681960] [ T140] pci 0000:01:00.0: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
> [13612.681967] [ T140] pcieport 0000:00:01.1: PCI bridge to [bus 01-03]
> [13612.681970] [ T140] pcieport 0000:00:01.1: bridge window [io 0x1000-0x1fff]
> [13612.681974] [ T140] pcieport 0000:00:01.1: bridge window [mem 0xfca00000-0xfccfffff]
> [13612.681977] [ T140] pcieport 0000:00:01.1: bridge window [mem 0xfc00000000-0xfe0fffffff 64bit pref]
> [13612.684765] [ T140] [drm] initializing kernel modesetting (DIMGREY_CAVEFISH 0x1002:0x73FF 0x1462:0x1313 0xC3).
> [13612.685143] [ T140] [drm] register mmio base: 0xFCA00000
> [13612.685145] [ T140] [drm] register mmio size: 1048576
> [13614.899803] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 0 <nv_common>
> [13614.899811] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 1 <gmc_v10_0>
> [13614.899815] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 2 <navi10_ih>
> [13614.899818] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 3 <psp>
> [13614.899821] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 4 <smu>
> [13614.899824] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 5 <dm>
> [13614.899827] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 6 <gfx_v10_0>
> [13614.899831] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 7 <sdma_v5_2>
> [13614.899834] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 8 <vcn_v3_0>
> [13614.899837] [ T140] amdgpu 0000:03:00.0: amdgpu: detected ip block number 9 <jpeg_v3_0>
> [13614.899982] [ T140] amdgpu 0000:03:00.0: amdgpu: ACPI VFCT table present but broken (too short #2),skipping
> [13615.118633] [ T140] amdgpu 0000:03:00.0: amdgpu: Fetched VBIOS from ROM BAR
> [13615.118640] [ T140] amdgpu: ATOM BIOS: SWBRT77181.001
> [13615.126013] [ T140] amdgpu 0000:03:00.0: amdgpu: Trusted Memory Zone (TMZ) feature disabled as experimental (default)
> [13615.126355] [ T140] amdgpu 0000:03:00.0: amdgpu: MODE1 reset
> [13615.126359] [ T140] amdgpu 0000:03:00.0: amdgpu: GPU mode1 reset
> [13615.128943] [ T140] amdgpu 0000:03:00.0: amdgpu: GPU smu mode1 reset
> [13615.633347] [ T140] [drm] GPU posting now...
> [13615.633384] [ T140] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
> [13615.633393] [ T140] amdgpu 0000:03:00.0: amdgpu: VRAM: 8176M 0x0000008000000000 - 0x00000081FEFFFFFF (8176M used)
> [13615.633397] [ T140] amdgpu 0000:03:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
> [13615.633408] [ T140] [drm] Detected VRAM RAM=8176M, BAR=8192M
> [13615.633410] [ T140] [drm] RAM width 128bits GDDR6
> [13615.633554] [ T140] [drm] amdgpu: 8176M of VRAM memory ready
> [13615.633557] [ T140] [drm] amdgpu: 6895M of GTT memory ready.
> [13615.633574] [ T140] [drm] GART: num cpu pages 131072, num gpu pages 131072
> [13615.635159] [ T140] [drm] PCIE GART of 512M enabled (table at 0x00000081FEB00000).
> [13630.506743] [ T140] amdgpu 0000:03:00.0: amdgpu: STB initialized to 2048 entries
> [13630.506836] [ T140] [drm] Loading DMUB firmware via PSP: version=0x02020020
> [13630.510341] [ T140] [drm] use_doorbell being set to: [true]
> [13630.511712] [ T140] [drm] use_doorbell being set to: [true]
> [13630.511733] [ T140] [drm] Found VCN firmware Version ENC: 1.33 DEC: 4 VEP: 0 Revision: 6
> [13630.685530] [ T140] amdgpu 0000:03:00.0: amdgpu: reserve 0xa00000 from 0x81fd000000 for PSP TMR
> [13631.015392] [ T140] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
> [13631.046278] [ T140] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
> [13631.046644] [ T140] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0)
> [13631.046649] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
> [13631.046909] [ T140] amdgpu 0000:03:00.0: amdgpu: use vbios provided pptable
> [13631.153391] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:12 param:0x00000000 message:GetEnabledSmuFeaturesLow?
> [13631.153396] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153399] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153401] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153404] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153406] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153408] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153411] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153413] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153415] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153417] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153419] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153421] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153423] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153425] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153427] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153429] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153431] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153433] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153435] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153437] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153438] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153440] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153442] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153444] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153446] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153448] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153451] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to retrieve enabled ppfeatures!
> [13631.153453] [ T140] amdgpu 0000:03:00.0: amdgpu: SMU is in hanged state, failed to send smu message!
> [13631.153455] [ T140] amdgpu 0000:03:00.0: amdgpu: Attempt to override pcie params failed!
> [13631.153457] [ T140] amdgpu 0000:03:00.0: amdgpu: Failed to setup smc hw!
> [13631.153459] [ T140] [drm:amdgpu_device_init.cold [amdgpu]] *ERROR* hw_init of IP block <smu> failed -121
> [13631.153638] [ T140] amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_init failed
> [13631.153641] [ T140] amdgpu 0000:03:00.0: amdgpu: Fatal error during GPU init
> [13631.161507] [ T140] amdgpu 0000:03:00.0: amdgpu: amdgpu: finishing device.
> [13631.161633] [ T140] ------------[ cut here ]------------ > [13631.161635] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.161751] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.161831] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.161876] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.161881] [ T140] Tainted: [W]=WARN > [13631.161882] [ T140] Hardware name: Micro-Star International Co., Ltd. 
Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.161885] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.161990] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.161992] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.161996] [ T140] RAX: ffff8c336755a3c0 RBX: ffff8c32a5898890 RCX: 0000000000000000 > [13631.161998] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.162000] [ T140] RBP: ffff8c32a5890250 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.162001] [ T140] R10: 0000000000000082 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.162003] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.162005] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.162007] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.162009] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.162011] [ T140] PKRU: 55555554 > [13631.162013] [ T140] Call Trace: > [13631.162016] [ T140] <TASK> > [13631.162019] [ T140] ? __warn.cold+0x90/0x9e > [13631.162025] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.162140] [ T140] ? report_bug+0xfa/0x140 > [13631.162146] [ T140] ? handle_bug+0x53/0x90 > [13631.162149] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.162152] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.162157] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.162260] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5 > [13631.162263] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.162392] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.162536] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.162657] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.162749] [ T140] ? 
driver_probe_device+0xa0/0xa0 > [13631.162753] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.162756] [ T140] pci_device_probe+0xc0/0x180 > [13631.162760] [ T140] really_probe+0xd9/0x340 > [13631.162764] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.162769] [ T140] __driver_probe_device+0x73/0x110 > [13631.162773] [ T140] driver_probe_device+0x1a/0xa0 > [13631.162776] [ T140] __device_attach_driver+0x84/0x110 > [13631.162780] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.162783] [ T140] __device_attach+0xab/0x1b0 > [13631.162787] [ T140] pci_bus_add_device+0x53/0x80 > [13631.162790] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.162792] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.162795] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.162797] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.162800] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.162803] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.162806] [ T140] pciehp_ist+0x13b/0x180 > [13631.162809] [ T140] irq_thread_fn+0x1e/0x60 > [13631.162813] [ T140] irq_thread+0x114/0x1e0 > [13631.162815] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.162818] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.162822] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.162824] [ T140] kthread+0xea/0x1e0 > [13631.162828] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.162831] [ T140] ret_from_fork+0x2f/0x50 > [13631.162835] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.162838] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.162843] [ T140] </TASK> > [13631.162844] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.162857] [ T140] ------------[ cut here ]------------ > [13631.162858] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.162948] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.163013] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.163048] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.163051] [ T140] Tainted: [W]=WARN > [13631.163053] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.163054] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.163139] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.163141] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.163143] [ T140] RAX: ffff8c336755a3c4 RBX: ffff8c32a5898ba8 RCX: 0000000000000000 > [13631.163145] [ T140] RDX: 0000000000000001 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.163147] [ T140] RBP: ffff8c32a5890258 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.163149] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.163150] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.163152] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.163154] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.163156] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.163158] [ T140] PKRU: 55555554 > [13631.163159] [ T140] Call Trace: > [13631.163162] [ T140] <TASK> > [13631.163163] [ T140] ? __warn.cold+0x90/0x9e > [13631.163167] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.163250] [ T140] ? report_bug+0xfa/0x140 > [13631.163254] [ T140] ? handle_bug+0x53/0x90 > [13631.163257] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.163259] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.163263] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.163345] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.163348] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.163427] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.163557] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.163669] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.163746] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.163750] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.163753] [ T140] pci_device_probe+0xc0/0x180 > [13631.163756] [ T140] really_probe+0xd9/0x340 > [13631.163759] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.163763] [ T140] __driver_probe_device+0x73/0x110 > [13631.163766] [ T140] driver_probe_device+0x1a/0xa0 > [13631.163770] [ T140] __device_attach_driver+0x84/0x110 > [13631.163773] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.163777] [ T140] __device_attach+0xab/0x1b0 > [13631.163781] [ T140] pci_bus_add_device+0x53/0x80 > [13631.163785] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.163787] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.163790] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.163792] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.163795] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.163798] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.163800] [ T140] pciehp_ist+0x13b/0x180 > [13631.163803] [ T140] irq_thread_fn+0x1e/0x60 > [13631.163807] [ T140] irq_thread+0x114/0x1e0 > [13631.163810] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.163813] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.163816] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.163818] [ T140] kthread+0xea/0x1e0 > [13631.163822] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.163825] [ T140] ret_from_fork+0x2f/0x50 > [13631.163828] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.163831] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.163835] [ T140] </TASK> > [13631.163837] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.163847] [ T140] ------------[ cut here ]------------ > [13631.163849] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.163937] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.164001] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.164034] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.164037] [ T140] Tainted: [W]=WARN > [13631.164039] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.164040] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.164124] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.164126] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.164129] [ T140] RAX: ffff8c336755a3c8 RBX: ffff8c32a5898ec8 RCX: 0000000000000000 > [13631.164131] [ T140] RDX: 0000000000000002 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.164133] [ T140] RBP: ffff8c32a5890260 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.164135] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.164137] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.164139] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.164141] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.164143] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.164144] [ T140] PKRU: 55555554 > [13631.164146] [ T140] Call Trace: > [13631.164148] [ T140] <TASK> > [13631.164149] [ T140] ? __warn.cold+0x90/0x9e > [13631.164152] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.164235] [ T140] ? report_bug+0xfa/0x140 > [13631.164239] [ T140] ? handle_bug+0x53/0x90 > [13631.164242] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.164244] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.164248] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.164330] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.164333] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.164411] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.164538] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.164650] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.164758] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.164762] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.164765] [ T140] pci_device_probe+0xc0/0x180 > [13631.164768] [ T140] really_probe+0xd9/0x340 > [13631.164771] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.164774] [ T140] __driver_probe_device+0x73/0x110 > [13631.164778] [ T140] driver_probe_device+0x1a/0xa0 > [13631.164781] [ T140] __device_attach_driver+0x84/0x110 > [13631.164784] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.164788] [ T140] __device_attach+0xab/0x1b0 > [13631.164791] [ T140] pci_bus_add_device+0x53/0x80 > [13631.164794] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.164797] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.164799] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.164801] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.164809] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.164811] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.164814] [ T140] pciehp_ist+0x13b/0x180 > [13631.164817] [ T140] irq_thread_fn+0x1e/0x60 > [13631.164821] [ T140] irq_thread+0x114/0x1e0 > [13631.164823] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.164826] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.164829] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.164832] [ T140] kthread+0xea/0x1e0 > [13631.164836] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.164839] [ T140] ret_from_fork+0x2f/0x50 > [13631.164843] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.164846] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.164851] [ T140] </TASK> > [13631.164852] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.164861] [ T140] ------------[ cut here ]------------ > [13631.164863] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.164951] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.165014] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.165048] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.165050] [ T140] Tainted: [W]=WARN > [13631.165052] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.165054] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.165151] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.165153] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.165155] [ T140] RAX: ffff8c336755a3cc RBX: ffff8c32a58991e0 RCX: 0000000000000000 > [13631.165157] [ T140] RDX: 0000000000000003 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.165158] [ T140] RBP: ffff8c32a5890268 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.165160] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.165162] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.165164] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.165166] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.165167] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.165169] [ T140] PKRU: 55555554 > [13631.165171] [ T140] Call Trace: > [13631.165172] [ T140] <TASK> > [13631.165174] [ T140] ? __warn.cold+0x90/0x9e > [13631.165177] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.165259] [ T140] ? report_bug+0xfa/0x140 > [13631.165263] [ T140] ? handle_bug+0x53/0x90 > [13631.165267] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.165269] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.165272] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.165355] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.165358] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.165436] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.165560] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.165671] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.165748] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.165752] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.165755] [ T140] pci_device_probe+0xc0/0x180 > [13631.165758] [ T140] really_probe+0xd9/0x340 > [13631.165761] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.165764] [ T140] __driver_probe_device+0x73/0x110 > [13631.165768] [ T140] driver_probe_device+0x1a/0xa0 > [13631.165771] [ T140] __device_attach_driver+0x84/0x110 > [13631.165774] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.165778] [ T140] __device_attach+0xab/0x1b0 > [13631.165781] [ T140] pci_bus_add_device+0x53/0x80 > [13631.165784] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.165786] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.165789] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.165791] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.165794] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.165796] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.165799] [ T140] pciehp_ist+0x13b/0x180 > [13631.165802] [ T140] irq_thread_fn+0x1e/0x60 > [13631.165805] [ T140] irq_thread+0x114/0x1e0 > [13631.165808] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.165810] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.165814] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.165816] [ T140] kthread+0xea/0x1e0 > [13631.165819] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.165823] [ T140] ret_from_fork+0x2f/0x50 > [13631.165826] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.165829] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.165833] [ T140] </TASK> > [13631.165835] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.165843] [ T140] ------------[ cut here ]------------ > [13631.165845] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.165933] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.165996] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.166030] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.166032] [ T140] Tainted: [W]=WARN > [13631.166034] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.166035] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.166127] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.166129] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.166132] [ T140] RAX: ffff8c336755a3d0 RBX: ffff8c32a58994f8 RCX: 0000000000000000 > [13631.166133] [ T140] RDX: 0000000000000004 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.166135] [ T140] RBP: ffff8c32a5890270 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.166137] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.166139] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.166140] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.166142] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.166144] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.166146] [ T140] PKRU: 55555554 > [13631.166147] [ T140] Call Trace: > [13631.166149] [ T140] <TASK> > [13631.166151] [ T140] ? __warn.cold+0x90/0x9e > [13631.166154] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.166236] [ T140] ? report_bug+0xfa/0x140 > [13631.166240] [ T140] ? handle_bug+0x53/0x90 > [13631.166243] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.166245] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.166249] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.166331] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.166334] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.166413] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.166543] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.166654] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.166731] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.166735] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.166738] [ T140] pci_device_probe+0xc0/0x180 > [13631.166741] [ T140] really_probe+0xd9/0x340 > [13631.166744] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.166747] [ T140] __driver_probe_device+0x73/0x110 > [13631.166750] [ T140] driver_probe_device+0x1a/0xa0 > [13631.166754] [ T140] __device_attach_driver+0x84/0x110 > [13631.166757] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.166760] [ T140] __device_attach+0xab/0x1b0 > [13631.166764] [ T140] pci_bus_add_device+0x53/0x80 > [13631.166767] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.166769] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.166772] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.166774] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.166776] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.166779] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.166782] [ T140] pciehp_ist+0x13b/0x180 > [13631.166784] [ T140] irq_thread_fn+0x1e/0x60 > [13631.166788] [ T140] irq_thread+0x114/0x1e0 > [13631.166790] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.166793] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.166796] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.166799] [ T140] kthread+0xea/0x1e0 > [13631.166802] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.166805] [ T140] ret_from_fork+0x2f/0x50 > [13631.166808] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.166811] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.166815] [ T140] </TASK> > [13631.166817] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.166825] [ T140] ------------[ cut here ]------------ > [13631.166826] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.166915] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.166978] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.167010] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.167013] [ T140] Tainted: [W]=WARN > [13631.167014] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.167016] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.167100] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.167102] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.167104] [ T140] RAX: ffff8c336755a3d4 RBX: ffff8c32a5899810 RCX: 0000000000000000 > [13631.167106] [ T140] RDX: 0000000000000005 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.167108] [ T140] RBP: ffff8c32a5890278 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.167109] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.167111] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.167113] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.167115] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.167116] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.167118] [ T140] PKRU: 55555554 > [13631.167120] [ T140] Call Trace: > [13631.167121] [ T140] <TASK> > [13631.167123] [ T140] ? __warn.cold+0x90/0x9e > [13631.167126] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.167209] [ T140] ? report_bug+0xfa/0x140 > [13631.167212] [ T140] ? handle_bug+0x53/0x90 > [13631.167215] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.167217] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.167221] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.167309] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.167312] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.167390] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.167515] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.167627] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.167704] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.167708] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.167711] [ T140] pci_device_probe+0xc0/0x180 > [13631.167714] [ T140] really_probe+0xd9/0x340 > [13631.167717] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.167720] [ T140] __driver_probe_device+0x73/0x110 > [13631.167723] [ T140] driver_probe_device+0x1a/0xa0 > [13631.167726] [ T140] __device_attach_driver+0x84/0x110 > [13631.167730] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.167733] [ T140] __device_attach+0xab/0x1b0 > [13631.167737] [ T140] pci_bus_add_device+0x53/0x80 > [13631.167739] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.167742] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.167744] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.167747] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.167749] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.167752] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.167754] [ T140] pciehp_ist+0x13b/0x180 > [13631.167757] [ T140] irq_thread_fn+0x1e/0x60 > [13631.167760] [ T140] irq_thread+0x114/0x1e0 > [13631.167763] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.167766] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.167769] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.167772] [ T140] kthread+0xea/0x1e0 > [13631.167775] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.167778] [ T140] ret_from_fork+0x2f/0x50 > [13631.167781] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.167784] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.167788] [ T140] </TASK> > [13631.167789] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.167798] [ T140] ------------[ cut here ]------------ > [13631.167799] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.167887] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.167955] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.167988] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.167991] [ T140] Tainted: [W]=WARN > [13631.167992] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.167994] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.168077] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.168079] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.168082] [ T140] RAX: ffff8c336755a3c8 RBX: ffff8c32a5899b28 RCX: 0000000000000000 > [13631.168083] [ T140] RDX: 0000000000000002 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.168085] [ T140] RBP: ffff8c32a5890280 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.168087] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.168089] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.168090] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.168092] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.168094] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.168096] [ T140] PKRU: 55555554 > [13631.168097] [ T140] Call Trace: > [13631.168099] [ T140] <TASK> > [13631.168101] [ T140] ? __warn.cold+0x90/0x9e > [13631.168104] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.168187] [ T140] ? report_bug+0xfa/0x140 > [13631.168190] [ T140] ? handle_bug+0x53/0x90 > [13631.168193] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.168195] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.168199] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.168281] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.168284] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.168362] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.168491] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.168601] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.168678] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.168682] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.168685] [ T140] pci_device_probe+0xc0/0x180 > [13631.168688] [ T140] really_probe+0xd9/0x340 > [13631.168691] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.168694] [ T140] __driver_probe_device+0x73/0x110 > [13631.168698] [ T140] driver_probe_device+0x1a/0xa0 > [13631.168701] [ T140] __device_attach_driver+0x84/0x110 > [13631.168704] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.168708] [ T140] __device_attach+0xab/0x1b0 > [13631.168711] [ T140] pci_bus_add_device+0x53/0x80 > [13631.168714] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.168716] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.168719] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.168721] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.168724] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.168726] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.168729] [ T140] pciehp_ist+0x13b/0x180 > [13631.168732] [ T140] irq_thread_fn+0x1e/0x60 > [13631.168735] [ T140] irq_thread+0x114/0x1e0 > [13631.168738] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.168741] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.168744] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.168747] [ T140] kthread+0xea/0x1e0 > [13631.168755] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.168758] [ T140] ret_from_fork+0x2f/0x50 > [13631.168762] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.168765] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.168769] [ T140] </TASK> > [13631.168771] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.168783] [ T140] ------------[ cut here ]------------ > [13631.168784] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.168878] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.168941] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.168974] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.168977] [ T140] Tainted: [W]=WARN > [13631.168978] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.168980] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.169064] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.169066] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.169069] [ T140] RAX: ffff8c336755a3cc RBX: ffff8c32a5899e40 RCX: 0000000000000000 > [13631.169071] [ T140] RDX: 0000000000000003 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.169072] [ T140] RBP: ffff8c32a5890288 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.169074] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.169076] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.169078] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.169080] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.169081] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.169083] [ T140] PKRU: 55555554 > [13631.169085] [ T140] Call Trace: > [13631.169086] [ T140] <TASK> > [13631.169094] [ T140] ? __warn.cold+0x90/0x9e > [13631.169097] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.169180] [ T140] ? report_bug+0xfa/0x140 > [13631.169183] [ T140] ? handle_bug+0x53/0x90 > [13631.169186] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.169188] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.169192] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.169275] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.169278] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.169357] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.169490] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.169601] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.169678] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.169682] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.169685] [ T140] pci_device_probe+0xc0/0x180 > [13631.169688] [ T140] really_probe+0xd9/0x340 > [13631.169691] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.169694] [ T140] __driver_probe_device+0x73/0x110 > [13631.169697] [ T140] driver_probe_device+0x1a/0xa0 > [13631.169701] [ T140] __device_attach_driver+0x84/0x110 > [13631.169704] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.169707] [ T140] __device_attach+0xab/0x1b0 > [13631.169711] [ T140] pci_bus_add_device+0x53/0x80 > [13631.169713] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.169716] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.169718] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.169721] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.169723] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.169726] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.169729] [ T140] pciehp_ist+0x13b/0x180 > [13631.169731] [ T140] irq_thread_fn+0x1e/0x60 > [13631.169734] [ T140] irq_thread+0x114/0x1e0 > [13631.169737] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.169740] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.169743] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.169746] [ T140] kthread+0xea/0x1e0 > [13631.169749] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.169752] [ T140] ret_from_fork+0x2f/0x50 > [13631.169755] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.169758] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.169762] [ T140] </TASK> > [13631.169764] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.169773] [ T140] ------------[ cut here ]------------ > [13631.169774] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.169863] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.169925] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.169958] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.169961] [ T140] Tainted: [W]=WARN > [13631.169962] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.169964] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.170048] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.170050] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.170053] [ T140] RAX: ffff8c336755a3d0 RBX: ffff8c32a589a158 RCX: 0000000000000000 > [13631.170054] [ T140] RDX: 0000000000000004 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.170056] [ T140] RBP: ffff8c32a5890290 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.170058] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.170059] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.170061] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.170063] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.170065] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.170067] [ T140] PKRU: 55555554 > [13631.170068] [ T140] Call Trace: > [13631.170070] [ T140] <TASK> > [13631.170072] [ T140] ? __warn.cold+0x90/0x9e > [13631.170074] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.170159] [ T140] ? report_bug+0xfa/0x140 > [13631.170162] [ T140] ? handle_bug+0x53/0x90 > [13631.170165] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.170167] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.170171] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.170259] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.170262] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.170345] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.170464] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.170582] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.170664] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.170668] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.170671] [ T140] pci_device_probe+0xc0/0x180 > [13631.170674] [ T140] really_probe+0xd9/0x340 > [13631.170677] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.170680] [ T140] __driver_probe_device+0x73/0x110 > [13631.170683] [ T140] driver_probe_device+0x1a/0xa0 > [13631.170686] [ T140] __device_attach_driver+0x84/0x110 > [13631.170690] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.170693] [ T140] __device_attach+0xab/0x1b0 > [13631.170697] [ T140] pci_bus_add_device+0x53/0x80 > [13631.170699] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.170702] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.170704] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.170707] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.170709] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.170712] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.170714] [ T140] pciehp_ist+0x13b/0x180 > [13631.170717] [ T140] irq_thread_fn+0x1e/0x60 > [13631.170720] [ T140] irq_thread+0x114/0x1e0 > [13631.170723] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.170726] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.170729] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.170731] [ T140] kthread+0xea/0x1e0 > [13631.170735] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.170738] [ T140] ret_from_fork+0x2f/0x50 > [13631.170740] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.170743] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.170748] [ T140] </TASK> > [13631.170749] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.170757] [ T140] ------------[ cut here ]------------ > [13631.170758] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.170852] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.170914] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.170946] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.170949] [ T140] Tainted: [W]=WARN > [13631.170950] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.170952] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.171041] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.171043] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.171045] [ T140] RAX: ffff8c336755a3d4 RBX: ffff8c32a589a470 RCX: 0000000000000000 > [13631.171047] [ T140] RDX: 0000000000000005 RSI: ffff8c32a58a54d0 RDI: ffff8c32a5880000 > [13631.171049] [ T140] RBP: ffff8c32a5890298 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.171051] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.171052] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.171054] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.171056] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.171058] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.171059] [ T140] PKRU: 55555554 > [13631.171061] [ T140] Call Trace: > [13631.171063] [ T140] <TASK> > [13631.171064] [ T140] ? __warn.cold+0x90/0x9e > [13631.171068] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.171155] [ T140] ? report_bug+0xfa/0x140 > [13631.171159] [ T140] ? handle_bug+0x53/0x90 > [13631.171162] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.171164] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.171168] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.171255] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.171258] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.171338] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.171454] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.171574] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.171674] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.171677] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.171680] [ T140] pci_device_probe+0xc0/0x180 > [13631.171683] [ T140] really_probe+0xd9/0x340 > [13631.171686] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.171690] [ T140] __driver_probe_device+0x73/0x110 > [13631.171693] [ T140] driver_probe_device+0x1a/0xa0 > [13631.171696] [ T140] __device_attach_driver+0x84/0x110 > [13631.171699] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.171703] [ T140] __device_attach+0xab/0x1b0 > [13631.171706] [ T140] pci_bus_add_device+0x53/0x80 > [13631.171709] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.171711] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.171714] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.171716] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.171718] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.171721] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.171724] [ T140] pciehp_ist+0x13b/0x180 > [13631.171726] [ T140] irq_thread_fn+0x1e/0x60 > [13631.171729] [ T140] irq_thread+0x114/0x1e0 > [13631.171732] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.171735] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.171738] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.171741] [ T140] kthread+0xea/0x1e0 > [13631.171744] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.171747] [ T140] ret_from_fork+0x2f/0x50 > [13631.171750] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.171753] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.171757] [ T140] </TASK> > [13631.171758] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.171767] [ T140] ------------[ cut here ]------------ > [13631.171768] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.171856] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.171918] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.171951] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.171953] [ T140] Tainted: [W]=WARN > [13631.171955] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.171956] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.172039] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.172041] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.172044] [ T140] RAX: ffff8c3203e26ba8 RBX: ffff8c32a5896d20 RCX: 0000000000000000 > [13631.172045] [ T140] RDX: 0000000000000000 RSI: ffff8c32a5897038 RDI: ffff8c32a5880000 > [13631.172047] [ T140] RBP: ffff8c32a58902a0 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.172049] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.172051] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.172052] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.172054] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.172056] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.172058] [ T140] PKRU: 55555554 > [13631.172059] [ T140] Call Trace: > [13631.172061] [ T140] <TASK> > [13631.172063] [ T140] ? __warn.cold+0x90/0x9e > [13631.172066] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.172148] [ T140] ? report_bug+0xfa/0x140 > [13631.172151] [ T140] ? handle_bug+0x53/0x90 > [13631.172154] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.172156] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.172160] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.172242] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.172245] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.172323] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.172440] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.172558] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.172635] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.172638] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.172641] [ T140] pci_device_probe+0xc0/0x180 > [13631.172644] [ T140] really_probe+0xd9/0x340 > [13631.172647] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.172650] [ T140] __driver_probe_device+0x73/0x110 > [13631.172654] [ T140] driver_probe_device+0x1a/0xa0 > [13631.172657] [ T140] __device_attach_driver+0x84/0x110 > [13631.172660] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.172664] [ T140] __device_attach+0xab/0x1b0 > [13631.172667] [ T140] pci_bus_add_device+0x53/0x80 > [13631.172670] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.172672] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.172675] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.172677] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.172679] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.172682] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.172685] [ T140] pciehp_ist+0x13b/0x180 > [13631.172688] [ T140] irq_thread_fn+0x1e/0x60 > [13631.172691] [ T140] irq_thread+0x114/0x1e0 > [13631.172693] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.172696] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.172699] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.172702] [ T140] kthread+0xea/0x1e0 > [13631.172705] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.172708] [ T140] ret_from_fork+0x2f/0x50 > [13631.172711] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.172714] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.172718] [ T140] </TASK> > [13631.172720] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.172728] [ T140] ------------[ cut here ]------------ > [13631.172730] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.172818] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.172880] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.172913] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.172915] [ T140] Tainted: [W]=WARN > [13631.172917] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.172919] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.173002] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.173004] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.173006] [ T140] RAX: ffff8c320135da78 RBX: ffff8c32a58a6460 RCX: 0000000000000000 > [13631.173008] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58aca50 RDI: ffff8c32a5880000 > [13631.173010] [ T140] RBP: ffff8c32a58902a8 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.173011] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.173013] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.173015] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.173017] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.173019] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.173020] [ T140] PKRU: 55555554 > [13631.173022] [ T140] Call Trace: > [13631.173024] [ T140] <TASK> > [13631.173025] [ T140] ? __warn.cold+0x90/0x9e > [13631.173028] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.173111] [ T140] ? report_bug+0xfa/0x140 > [13631.173114] [ T140] ? handle_bug+0x53/0x90 > [13631.173117] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.173119] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.173123] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.173205] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.173208] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.173286] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.173404] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.173524] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.173600] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.173604] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.173607] [ T140] pci_device_probe+0xc0/0x180 > [13631.173610] [ T140] really_probe+0xd9/0x340 > [13631.173613] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.173616] [ T140] __driver_probe_device+0x73/0x110 > [13631.173619] [ T140] driver_probe_device+0x1a/0xa0 > [13631.173623] [ T140] __device_attach_driver+0x84/0x110 > [13631.173626] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.173629] [ T140] __device_attach+0xab/0x1b0 > [13631.173633] [ T140] pci_bus_add_device+0x53/0x80 > [13631.173635] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.173638] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.173640] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.173643] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.173645] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.173648] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.173650] [ T140] pciehp_ist+0x13b/0x180 > [13631.173653] [ T140] irq_thread_fn+0x1e/0x60 > [13631.173656] [ T140] irq_thread+0x114/0x1e0 > [13631.173659] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.173662] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.173665] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.173668] [ T140] kthread+0xea/0x1e0 > [13631.173671] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.173674] [ T140] ret_from_fork+0x2f/0x50 > [13631.173677] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.173680] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.173684] [ T140] </TASK> > [13631.173685] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.173693] [ T140] ------------[ cut here ]------------ > [13631.173695] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.173782] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.173844] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.173877] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.173879] [ T140] Tainted: [W]=WARN > [13631.173881] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.173882] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.173965] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.173967] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.173970] [ T140] RAX: ffff8c320135da7c RBX: ffff8c32a58a6ac0 RCX: 0000000000000000 > [13631.173971] [ T140] RDX: 0000000000000001 RSI: ffff8c32a58aca50 RDI: ffff8c32a5880000 > [13631.173973] [ T140] RBP: ffff8c32a58902b0 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.173975] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.173976] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.173978] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.173980] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.173982] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.173984] [ T140] PKRU: 55555554 > [13631.173985] [ T140] Call Trace: > [13631.173987] [ T140] <TASK> > [13631.173989] [ T140] ? __warn.cold+0x90/0x9e > [13631.173992] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.174074] [ T140] ? report_bug+0xfa/0x140 > [13631.174077] [ T140] ? handle_bug+0x53/0x90 > [13631.174080] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.174082] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.174086] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.174168] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.174171] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.174249] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.174366] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.174485] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.174562] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.174566] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.174569] [ T140] pci_device_probe+0xc0/0x180 > [13631.174572] [ T140] really_probe+0xd9/0x340 > [13631.174575] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.174578] [ T140] __driver_probe_device+0x73/0x110 > [13631.174582] [ T140] driver_probe_device+0x1a/0xa0 > [13631.174585] [ T140] __device_attach_driver+0x84/0x110 > [13631.174588] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.174592] [ T140] __device_attach+0xab/0x1b0 > [13631.174595] [ T140] pci_bus_add_device+0x53/0x80 > [13631.174598] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.174600] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.174603] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.174605] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.174608] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.174610] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.174613] [ T140] pciehp_ist+0x13b/0x180 > [13631.174616] [ T140] irq_thread_fn+0x1e/0x60 > [13631.174619] [ T140] irq_thread+0x114/0x1e0 > [13631.174622] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.174624] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.174628] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.174630] [ T140] kthread+0xea/0x1e0 > [13631.174633] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.174637] [ T140] ret_from_fork+0x2f/0x50 > [13631.174639] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.174642] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.174647] [ T140] </TASK> > [13631.174648] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.174656] [ T140] ------------[ cut here ]------------ > [13631.174658] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.174746] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.174809] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.174842] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.174844] [ T140] Tainted: [W]=WARN > [13631.174846] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.174847] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.174932] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.174934] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.174936] [ T140] RAX: ffff8c320beb3600 RBX: ffff8c32a58aee20 RCX: 0000000000000000 > [13631.174938] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58afa88 RDI: ffff8c32a5880000 > [13631.174939] [ T140] RBP: ffff8c32a58902b8 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.174941] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.174943] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.174945] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.174947] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.174948] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.174950] [ T140] PKRU: 55555554 > [13631.174952] [ T140] Call Trace: > [13631.174953] [ T140] <TASK> > [13631.174955] [ T140] ? __warn.cold+0x90/0x9e > [13631.174958] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.175041] [ T140] ? report_bug+0xfa/0x140 > [13631.175045] [ T140] ? handle_bug+0x53/0x90 > [13631.175048] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.175050] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.175054] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.175136] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.175139] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.175218] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.175335] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.175446] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.175529] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.175533] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.175536] [ T140] pci_device_probe+0xc0/0x180 > [13631.175539] [ T140] really_probe+0xd9/0x340 > [13631.175542] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.175545] [ T140] __driver_probe_device+0x73/0x110 > [13631.175548] [ T140] driver_probe_device+0x1a/0xa0 > [13631.175552] [ T140] __device_attach_driver+0x84/0x110 > [13631.175555] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.175558] [ T140] __device_attach+0xab/0x1b0 > [13631.175562] [ T140] pci_bus_add_device+0x53/0x80 > [13631.175564] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.175567] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.175569] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.175572] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.175574] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.175577] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.175580] [ T140] pciehp_ist+0x13b/0x180 > [13631.175582] [ T140] irq_thread_fn+0x1e/0x60 > [13631.175585] [ T140] irq_thread+0x114/0x1e0 > [13631.175588] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.175591] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.175594] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.175597] [ T140] kthread+0xea/0x1e0 > [13631.175600] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.175603] [ T140] ret_from_fork+0x2f/0x50 > [13631.175606] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.175609] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.175613] [ T140] </TASK> > [13631.175615] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.175622] [ T140] ------------[ cut here ]------------ > [13631.175624] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.175712] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.175774] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.175807] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.175809] [ T140] Tainted: [W]=WARN > [13631.175811] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.175812] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.175896] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.175898] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.175900] [ T140] RAX: ffff8c320beb3600 RBX: ffff8c32a58af138 RCX: 0000000000000000 > [13631.175902] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58afa88 RDI: ffff8c32a5880000 > [13631.175904] [ T140] RBP: ffff8c32a58902c0 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.175906] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.175907] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.175909] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.175911] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.175913] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.175915] [ T140] PKRU: 55555554 > [13631.175916] [ T140] Call Trace: > [13631.175918] [ T140] <TASK> > [13631.175920] [ T140] ? __warn.cold+0x90/0x9e > [13631.175922] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.176005] [ T140] ? report_bug+0xfa/0x140 > [13631.176009] [ T140] ? handle_bug+0x53/0x90 > [13631.176012] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.176014] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.176018] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.176113] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.176116] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.176195] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.176312] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.176424] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.176509] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.176513] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.176516] [ T140] pci_device_probe+0xc0/0x180 > [13631.176519] [ T140] really_probe+0xd9/0x340 > [13631.176522] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.176525] [ T140] __driver_probe_device+0x73/0x110 > [13631.176528] [ T140] driver_probe_device+0x1a/0xa0 > [13631.176532] [ T140] __device_attach_driver+0x84/0x110 > [13631.176535] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.176538] [ T140] __device_attach+0xab/0x1b0 > [13631.176542] [ T140] pci_bus_add_device+0x53/0x80 > [13631.176544] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.176547] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.176549] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.176552] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.176554] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.176557] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.176559] [ T140] pciehp_ist+0x13b/0x180 > [13631.176562] [ T140] irq_thread_fn+0x1e/0x60 > [13631.176565] [ T140] irq_thread+0x114/0x1e0 > [13631.176568] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.176571] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.176574] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.176577] [ T140] kthread+0xea/0x1e0 > [13631.176580] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.176583] [ T140] ret_from_fork+0x2f/0x50 > [13631.176586] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.176588] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.176593] [ T140] </TASK> > [13631.176594] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.176602] [ T140] ------------[ cut here ]------------ > [13631.176603] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.176691] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.176754] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.176786] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.176789] [ T140] Tainted: [W]=WARN > [13631.176790] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.176792] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.176875] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.176877] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.176880] [ T140] RAX: ffff8c320beb3600 RBX: ffff8c32a58af450 RCX: 0000000000000000 > [13631.176881] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58afa88 RDI: ffff8c32a5880000 > [13631.176883] [ T140] RBP: ffff8c32a58902c8 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.176885] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.176886] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.176888] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.176890] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.176892] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.176894] [ T140] PKRU: 55555554 > [13631.176895] [ T140] Call Trace: > [13631.176897] [ T140] <TASK> > [13631.176899] [ T140] ? __warn.cold+0x90/0x9e > [13631.176902] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.176984] [ T140] ? report_bug+0xfa/0x140 > [13631.176988] [ T140] ? handle_bug+0x53/0x90 > [13631.176991] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.176993] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.176996] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.177078] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.177081] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.177160] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.177278] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.177388] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.177465] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.177479] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.177482] [ T140] pci_device_probe+0xc0/0x180 > [13631.177485] [ T140] really_probe+0xd9/0x340 > [13631.177488] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.177492] [ T140] __driver_probe_device+0x73/0x110 > [13631.177495] [ T140] driver_probe_device+0x1a/0xa0 > [13631.177498] [ T140] __device_attach_driver+0x84/0x110 > [13631.177502] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.177505] [ T140] __device_attach+0xab/0x1b0 > [13631.177508] [ T140] pci_bus_add_device+0x53/0x80 > [13631.177511] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.177514] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.177516] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.177518] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.177521] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.177523] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.177526] [ T140] pciehp_ist+0x13b/0x180 > [13631.177529] [ T140] irq_thread_fn+0x1e/0x60 > [13631.177532] [ T140] irq_thread+0x114/0x1e0 > [13631.177534] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.177537] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.177541] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.177543] [ T140] kthread+0xea/0x1e0 > [13631.177546] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.177549] [ T140] ret_from_fork+0x2f/0x50 > [13631.177552] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.177555] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.177560] [ T140] </TASK> > [13631.177561] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.177569] [ T140] ------------[ cut here ]------------ > [13631.177571] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.177659] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.177722] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.177754] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.177757] [ T140] Tainted: [W]=WARN > [13631.177758] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.177760] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.177844] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.177846] [ T140] RSP: 0018:ffff9ac3c0743b08 EFLAGS: 00010246 > [13631.177848] [ T140] RAX: ffff8c320135dd00 RBX: ffff8c32a58b23d0 RCX: 0000000000000000 > [13631.177850] [ T140] RDX: 0000000000000000 RSI: ffff8c32a58b42c0 RDI: ffff8c32a5880000 > [13631.177852] [ T140] RBP: ffff8c32a58902d0 R08: 0000000000000002 R09: ffff8c34ba798f40 > [13631.177853] [ T140] R10: 0000000000000282 R11: 0000000000000003 R12: ffff8c32a5890630 > [13631.177855] [ T140] R13: ffff8c32a5880010 R14: ffff8c32a5880000 R15: ffff9ac3c0743b14 > [13631.177857] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.177859] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.177861] [ T140] CR2: 00007fd810faa000 CR3: 000000010c6ee000 CR4: 0000000000750ef0 > [13631.177862] [ T140] PKRU: 55555554 > [13631.177864] [ T140] Call Trace: > [13631.177866] [ T140] <TASK> > [13631.177867] [ T140] ? __warn.cold+0x90/0x9e > [13631.177870] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.177953] [ T140] ? report_bug+0xfa/0x140 > [13631.177956] [ T140] ? handle_bug+0x53/0x90 > [13631.177959] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.177961] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.177965] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.178047] [ T140] ? 
srso_alias_return_thunk+0x5/0xfbef5 > [13631.178050] [ T140] amdgpu_fence_driver_hw_fini+0xf2/0x120 [amdgpu] > [13631.178128] [ T140] amdgpu_device_fini_hw+0xad/0x2ad [amdgpu] > [13631.178245] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.178356] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.178432] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.178436] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.178439] [ T140] pci_device_probe+0xc0/0x180 > [13631.178442] [ T140] really_probe+0xd9/0x340 > [13631.178445] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.178448] [ T140] __driver_probe_device+0x73/0x110 > [13631.178451] [ T140] driver_probe_device+0x1a/0xa0 > [13631.178455] [ T140] __device_attach_driver+0x84/0x110 > [13631.178458] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.178461] [ T140] __device_attach+0xab/0x1b0 > [13631.178465] [ T140] pci_bus_add_device+0x53/0x80 > [13631.178479] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.178481] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.178484] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.178486] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.178489] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.178491] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.178494] [ T140] pciehp_ist+0x13b/0x180 > [13631.178497] [ T140] irq_thread_fn+0x1e/0x60 > [13631.178500] [ T140] irq_thread+0x114/0x1e0 > [13631.178503] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.178506] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.178509] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.178511] [ T140] kthread+0xea/0x1e0 > [13631.178515] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.178518] [ T140] ret_from_fork+0x2f/0x50 > [13631.178521] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.178523] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.178528] [ T140] </TASK> > [13631.178529] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.342117] [ T140] ------------[ cut here ]------------ > [13631.342123] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:631 amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.342257] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13631.342342] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13631.342389] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded 
Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13631.342394] [ T140] Tainted: [W]=WARN > [13631.342396] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13631.342399] [ T140] RIP: 0010:amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.342528] [ T140] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf 80 a9 c5 e9 9f fd ff ff > <0f> 0b b8 ea ff ff ff e9 ae 80 a9 c5 b8 ea ff ff ff e9 a4 80 a9 c5 > [13631.342532] [ T140] RSP: 0018:ffff9ac3c0743b30 EFLAGS: 00010246 > [13631.342536] [ T140] RAX: ffff8c3203e262a0 RBX: ffff8c32a5880000 RCX: 0000000000000000 > [13631.342538] [ T140] RDX: 0000000000000000 RSI: ffff8c32a5880c78 RDI: ffff8c32a5880000 > [13631.342541] [ T140] RBP: 0000000000000001 R08: 0000000000000000 R09: 0000000000000000 > [13631.342543] [ T140] R10: ffff8c34ba79de60 R11: 0000000000000000 R12: ffff8c32a58c6de8 > [13631.342546] [ T140] R13: 0000000000000021 R14: ffff8c32a5880000 R15: ffff8c32a5880010 > [13631.342548] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13631.342551] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13631.342553] [ T140] CR2: 00007fd810faa000 CR3: 000000010f62a000 CR4: 0000000000750ef0 > [13631.342555] [ T140] PKRU: 55555554 > [13631.342557] [ T140] Call Trace: > [13631.342559] [ T140] <TASK> > [13631.342562] [ T140] ? __warn.cold+0x90/0x9e > [13631.342566] [ T140] ? amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.342652] [ T140] ? report_bug+0xfa/0x140 > [13631.342656] [ T140] ? handle_bug+0x53/0x90 > [13631.342660] [ T140] ? exc_invalid_op+0x17/0x70 > [13631.342662] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13631.342666] [ T140] ? 
amdgpu_irq_put+0x41/0x70 [amdgpu] > [13631.342746] [ T140] gmc_v10_0_hw_fini+0x52/0xb0 [amdgpu] > [13631.342838] [ T140] amdgpu_ip_block_hw_fini+0x2b/0x59 [amdgpu] > [13631.342961] [ T140] amdgpu_device_fini_hw+0x1fe/0x2ad [amdgpu] > [13631.343073] [ T140] amdgpu_driver_load_kms.cold+0x18/0x2e [amdgpu] > [13631.343180] [ T140] amdgpu_pci_probe+0x167/0x3e0 [amdgpu] > [13631.343255] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.343259] [ T140] ? driver_probe_device+0xa0/0xa0 > [13631.343262] [ T140] pci_device_probe+0xc0/0x180 > [13631.343266] [ T140] really_probe+0xd9/0x340 > [13631.343269] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13631.343272] [ T140] __driver_probe_device+0x73/0x110 > [13631.343275] [ T140] driver_probe_device+0x1a/0xa0 > [13631.343279] [ T140] __device_attach_driver+0x84/0x110 > [13631.343282] [ T140] bus_for_each_drv+0x82/0xe0 > [13631.343285] [ T140] __device_attach+0xab/0x1b0 > [13631.343289] [ T140] pci_bus_add_device+0x53/0x80 > [13631.343292] [ T140] pci_bus_add_devices+0x2b/0x70 > [13631.343294] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.343297] [ T140] pci_bus_add_devices+0x56/0x70 > [13631.343299] [ T140] pciehp_configure_device+0xaa/0x160 > [13631.343302] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13631.343304] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13631.343307] [ T140] pciehp_ist+0x13b/0x180 > [13631.343310] [ T140] irq_thread_fn+0x1e/0x60 > [13631.343314] [ T140] irq_thread+0x114/0x1e0 > [13631.343316] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13631.343319] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13631.343323] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13631.343325] [ T140] kthread+0xea/0x1e0 > [13631.343329] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13631.343332] [ T140] ret_from_fork+0x2f/0x50 > [13631.343336] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13631.343338] [ T140] ret_from_fork_asm+0x11/0x20 > [13631.343343] [ T140] </TASK> > [13631.343345] [ T140] ---[ end trace 0000000000000000 ]--- > [13631.351179] [ T140] amdgpu 0000:03:00.0: probe with driver amdgpu failed with error -121 > [13632.005054] [ T140] ------------[ cut here ]------------ > [13632.005063] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/drm_buddy.c:337 drm_buddy_fini+0xa8/0xb0 [drm_buddy] > [13632.005073] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13632.005147] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > 
[13632.005189] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13632.005194] [ T140] Tainted: [W]=WARN > [13632.005196] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13632.005199] [ T140] RIP: 0010:drm_buddy_fini+0xa8/0xb0 [drm_buddy] > [13632.005202] [ T140] Code: 44 3b 6d 10 72 a3 4c 8b 65 20 4c 39 65 28 75 1e 48 8b 7d 08 e8 79 1d f1 c5 48 8b 7d 00 5b 5d 41 5c 41 5d 41 5e e9 68 1d f1 c5 > <0f> 0b eb b3 0f 0b eb de f3 0f 1e fa 48 8b 0e 89 c8 25 00 0c 00 00 > [13632.005205] [ T140] RSP: 0018:ffff9ac3c0743a90 EFLAGS: 00010206 > [13632.005208] [ T140] RAX: 0000000000000c00 RBX: 000000000000000c RCX: 00000001feacbfff > [13632.005210] [ T140] RDX: ffff8c3203757ea0 RSI: ffff8c3221b8f750 RDI: ffff8c3205edda00 > [13632.005212] [ T140] RBP: ffff8c32a588fa50 R08: 0000000000000001 R09: 0000000000000000 > [13632.005214] [ T140] R10: ffff8c3205edda00 R11: 00000001feaca000 R12: 0000000001000000 > [13632.005216] [ T140] R13: 0000000000000008 R14: 00000000ffffffff R15: ffff8c32a588fa50 > [13632.005218] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13632.005220] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13632.005222] [ T140] CR2: 00007fd810faa000 CR3: 000000010f62a000 CR4: 0000000000750ef0 > [13632.005224] [ T140] PKRU: 55555554 > [13632.005226] [ T140] Call Trace: > [13632.005229] [ T140] <TASK> > [13632.005232] [ T140] ? __warn.cold+0x90/0x9e > [13632.005238] [ T140] ? drm_buddy_fini+0xa8/0xb0 [drm_buddy] > [13632.005242] [ T140] ? report_bug+0xfa/0x140 > [13632.005247] [ T140] ? handle_bug+0x53/0x90 > [13632.005252] [ T140] ? exc_invalid_op+0x17/0x70 > [13632.005255] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13632.005260] [ T140] ? 
drm_buddy_fini+0xa8/0xb0 [drm_buddy] > [13632.005264] [ T140] amdgpu_vram_mgr_fini+0x17a/0x1b0 [amdgpu] > [13632.005422] [ T140] amdgpu_ttm_fini+0x14b/0x210 [amdgpu] > [13632.005540] [ T140] amdgpu_bo_fini+0x1f/0x90 [amdgpu] > [13632.005649] [ T140] gmc_v10_0_sw_fini+0x29/0x40 [amdgpu] > [13632.005772] [ T140] amdgpu_device_fini_sw+0xc8/0x3c0 [amdgpu] > [13632.005879] [ T140] amdgpu_driver_release_kms+0x11/0x30 [amdgpu] > [13632.005990] [ T140] drm_dev_put.part.0+0x37/0x60 > [13632.005993] [ T140] devres_release_all+0xa6/0xf0 > [13632.005998] [ T140] ? driver_probe_device+0xa0/0xa0 > [13632.006001] [ T140] device_unbind_cleanup+0x9/0x70 > [13632.006004] [ T140] really_probe+0x21c/0x340 > [13632.006008] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13632.006012] [ T140] __driver_probe_device+0x73/0x110 > [13632.006016] [ T140] driver_probe_device+0x1a/0xa0 > [13632.006019] [ T140] __device_attach_driver+0x84/0x110 > [13632.006022] [ T140] bus_for_each_drv+0x82/0xe0 > [13632.006026] [ T140] __device_attach+0xab/0x1b0 > [13632.006030] [ T140] pci_bus_add_device+0x53/0x80 > [13632.006033] [ T140] pci_bus_add_devices+0x2b/0x70 > [13632.006036] [ T140] pci_bus_add_devices+0x56/0x70 > [13632.006038] [ T140] pci_bus_add_devices+0x56/0x70 > [13632.006041] [ T140] pciehp_configure_device+0xaa/0x160 > [13632.006044] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13632.006047] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13632.006050] [ T140] pciehp_ist+0x13b/0x180 > [13632.006053] [ T140] irq_thread_fn+0x1e/0x60 > [13632.006056] [ T140] irq_thread+0x114/0x1e0 > [13632.006059] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13632.006062] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13632.006065] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13632.006068] [ T140] kthread+0xea/0x1e0 > [13632.006072] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13632.006075] [ T140] ret_from_fork+0x2f/0x50 > [13632.006079] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13632.006082] [ T140] ret_from_fork_asm+0x11/0x20 > [13632.006087] [ T140] </TASK> > [13632.006088] [ T140] ---[ end trace 0000000000000000 ]--- > [13632.006100] [ T140] ------------[ cut here ]------------ > [13632.006102] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/drm_buddy.c:344 drm_buddy_fini+0xac/0xb0 [drm_buddy] > [13632.006105] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13632.006169] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13632.006203] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 
6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13632.006206] [ T140] Tainted: [W]=WARN > [13632.006208] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13632.006210] [ T140] RIP: 0010:drm_buddy_fini+0xac/0xb0 [drm_buddy] > [13632.006212] [ T140] Code: 72 a3 4c 8b 65 20 4c 39 65 28 75 1e 48 8b 7d 08 e8 79 1d f1 c5 48 8b 7d 00 5b 5d 41 5c 41 5d 41 5e e9 68 1d f1 c5 0f 0b eb b3 > <0f> 0b eb de f3 0f 1e fa 48 8b 0e 89 c8 25 00 0c 00 00 3d 00 04 00 > [13632.006214] [ T140] RSP: 0018:ffff9ac3c0743a90 EFLAGS: 00010287 > [13632.006217] [ T140] RAX: 0000000001000000 RBX: 000000000000000c RCX: 000000000000000c > [13632.006219] [ T140] RDX: 0000000000001000 RSI: ffff8c3221b8f750 RDI: 0000000000380022 > [13632.006221] [ T140] RBP: ffff8c32a588fa50 R08: 0000000000000001 R09: 0000000000000000 > [13632.006222] [ T140] R10: 0000000000380022 R11: 0000000000000000 R12: 00000001ff000000 > [13632.006224] [ T140] R13: 0000000000000009 R14: 00000000ffffffff R15: ffff8c32a588fa50 > [13632.006226] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13632.006228] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13632.006230] [ T140] CR2: 00007fd810faa000 CR3: 000000010f62a000 CR4: 0000000000750ef0 > [13632.006232] [ T140] PKRU: 55555554 > [13632.006233] [ T140] Call Trace: > [13632.006235] [ T140] <TASK> > [13632.006237] [ T140] ? __warn.cold+0x90/0x9e > [13632.006240] [ T140] ? drm_buddy_fini+0xac/0xb0 [drm_buddy] > [13632.006242] [ T140] ? report_bug+0xfa/0x140 > [13632.006246] [ T140] ? handle_bug+0x53/0x90 > [13632.006249] [ T140] ? exc_invalid_op+0x17/0x70 > [13632.006252] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13632.006255] [ T140] ? 
drm_buddy_fini+0xac/0xb0 [drm_buddy] > [13632.006258] [ T140] amdgpu_vram_mgr_fini+0x17a/0x1b0 [amdgpu] > [13632.006385] [ T140] amdgpu_ttm_fini+0x14b/0x210 [amdgpu] > [13632.006511] [ T140] amdgpu_bo_fini+0x1f/0x90 [amdgpu] > [13632.006632] [ T140] gmc_v10_0_sw_fini+0x29/0x40 [amdgpu] > [13632.006763] [ T140] amdgpu_device_fini_sw+0xc8/0x3c0 [amdgpu] > [13632.006883] [ T140] amdgpu_driver_release_kms+0x11/0x30 [amdgpu] > [13632.006964] [ T140] drm_dev_put.part.0+0x37/0x60 > [13632.006966] [ T140] devres_release_all+0xa6/0xf0 > [13632.006970] [ T140] ? driver_probe_device+0xa0/0xa0 > [13632.006973] [ T140] device_unbind_cleanup+0x9/0x70 > [13632.006976] [ T140] really_probe+0x21c/0x340 > [13632.006979] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13632.006983] [ T140] __driver_probe_device+0x73/0x110 > [13632.006986] [ T140] driver_probe_device+0x1a/0xa0 > [13632.006989] [ T140] __device_attach_driver+0x84/0x110 > [13632.006993] [ T140] bus_for_each_drv+0x82/0xe0 > [13632.006996] [ T140] __device_attach+0xab/0x1b0 > [13632.007000] [ T140] pci_bus_add_device+0x53/0x80 > [13632.007003] [ T140] pci_bus_add_devices+0x2b/0x70 > [13632.007005] [ T140] pci_bus_add_devices+0x56/0x70 > [13632.007008] [ T140] pci_bus_add_devices+0x56/0x70 > [13632.007010] [ T140] pciehp_configure_device+0xaa/0x160 > [13632.007013] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13632.007015] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13632.007018] [ T140] pciehp_ist+0x13b/0x180 > [13632.007021] [ T140] irq_thread_fn+0x1e/0x60 > [13632.007024] [ T140] irq_thread+0x114/0x1e0 > [13632.007027] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13632.007030] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13632.007033] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13632.007036] [ T140] kthread+0xea/0x1e0 > [13632.007039] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13632.007042] [ T140] ret_from_fork+0x2f/0x50 > [13632.007045] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13632.007048] [ T140] ret_from_fork_asm+0x11/0x20 > [13632.007053] [ T140] </TASK> > [13632.007054] [ T140] ---[ end trace 0000000000000000 ]--- > [13632.007059] [ T140] ------------[ cut here ]------------ > [13632.007061] [ T140] Memory manager not clean during takedown. > [13632.007066] [ T140] WARNING: CPU: 6 PID: 140 at drivers/gpu/drm/drm_mm.c:964 drm_mm_takedown+0x22/0x30 > [13632.007069] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13632.007133] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13632.007167] [ T140] CPU: 6 UID: 0 PID: 
140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13632.007170] [ T140] Tainted: [W]=WARN > [13632.007171] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13632.007173] [ T140] RIP: 0010:drm_mm_takedown+0x22/0x30 > [13632.007175] [ T140] Code: 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 48 8b 47 38 48 83 c7 38 48 39 f8 75 05 e9 55 74 8e ff 48 c7 c7 f0 ad e7 86 e8 be c1 a7 ff > <0f> 0b e9 42 74 8e ff 0f 1f 80 00 00 00 00 f3 0f 1e fa 41 57 49 89 > [13632.007177] [ T140] RSP: 0018:ffff9ac3c0743ac8 EFLAGS: 00010282 > [13632.007180] [ T140] RAX: 0000000000000000 RBX: 0000000000000007 RCX: 0000000000000027 > [13632.007181] [ T140] RDX: ffff8c34ba797808 RSI: 0000000000000001 RDI: ffff8c34ba797800 > [13632.007183] [ T140] RBP: ffff8c3205edfe00 R08: 0000000000000000 R09: ffff9ac3c0743950 > [13632.007185] [ T140] R10: ffff8c34e02fffa8 R11: 0000000000000003 R12: ffff8c32a588ef80 > [13632.007187] [ T140] R13: ffff8c3205edff70 R14: 0000000000000000 R15: ffff8c3276051358 > [13632.007188] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13632.007190] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13632.007192] [ T140] CR2: 00007fd810faa000 CR3: 000000010f62a000 CR4: 0000000000750ef0 > [13632.007194] [ T140] PKRU: 55555554 > [13632.007195] [ T140] Call Trace: > [13632.007197] [ T140] <TASK> > [13632.007199] [ T140] ? __warn.cold+0x90/0x9e > [13632.007202] [ T140] ? drm_mm_takedown+0x22/0x30 > [13632.007204] [ T140] ? report_bug+0xfa/0x140 > [13632.007207] [ T140] ? srso_alias_return_thunk+0x5/0xfbef5 > [13632.007211] [ T140] ? handle_bug+0x53/0x90 > [13632.007214] [ T140] ? exc_invalid_op+0x17/0x70 > [13632.007216] [ T140] ? asm_exc_invalid_op+0x1a/0x20 > [13632.007220] [ T140] ? drm_mm_takedown+0x22/0x30 > [13632.007222] [ T140] ? 
drm_mm_takedown+0x22/0x30 > [13632.007224] [ T140] ttm_range_man_fini_nocheck+0x86/0x100 [ttm] > [13632.007230] [ T140] amdgpu_ttm_fini+0x18f/0x210 [amdgpu] > [13632.007310] [ T140] amdgpu_bo_fini+0x1f/0x90 [amdgpu] > [13632.007390] [ T140] gmc_v10_0_sw_fini+0x29/0x40 [amdgpu] > [13632.007484] [ T140] amdgpu_device_fini_sw+0xc8/0x3c0 [amdgpu] > [13632.007563] [ T140] amdgpu_driver_release_kms+0x11/0x30 [amdgpu] > [13632.007641] [ T140] drm_dev_put.part.0+0x37/0x60 > [13632.007644] [ T140] devres_release_all+0xa6/0xf0 > [13632.007648] [ T140] ? driver_probe_device+0xa0/0xa0 > [13632.007651] [ T140] device_unbind_cleanup+0x9/0x70 > [13632.007654] [ T140] really_probe+0x21c/0x340 > [13632.007657] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13632.007660] [ T140] __driver_probe_device+0x73/0x110 > [13632.007663] [ T140] driver_probe_device+0x1a/0xa0 > [13632.007666] [ T140] __device_attach_driver+0x84/0x110 > [13632.007670] [ T140] bus_for_each_drv+0x82/0xe0 > [13632.007673] [ T140] __device_attach+0xab/0x1b0 > [13632.007677] [ T140] pci_bus_add_device+0x53/0x80 > [13632.007680] [ T140] pci_bus_add_devices+0x2b/0x70 > [13632.007682] [ T140] pci_bus_add_devices+0x56/0x70 > [13632.007685] [ T140] pci_bus_add_devices+0x56/0x70 > [13632.007687] [ T140] pciehp_configure_device+0xaa/0x160 > [13632.007690] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13632.007692] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13632.007695] [ T140] pciehp_ist+0x13b/0x180 > [13632.007698] [ T140] irq_thread_fn+0x1e/0x60 > [13632.007701] [ T140] irq_thread+0x114/0x1e0 > [13632.007704] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13632.007707] [ T140] ? irq_set_affinity_notifier+0x120/0x120 > [13632.007710] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13632.007713] [ T140] kthread+0xea/0x1e0 > [13632.007716] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13632.007719] [ T140] ret_from_fork+0x2f/0x50 > [13632.007722] [ T140] ? 
kthreads_online_cpu+0xf0/0xf0 > [13632.007725] [ T140] ret_from_fork_asm+0x11/0x20 > [13632.007729] [ T140] </TASK> > [13632.007731] [ T140] ---[ end trace 0000000000000000 ]--- > [13632.007752] [ T140] [drm] amdgpu: ttm finalized > [13632.007775] [ T140] BUG: kernel NULL pointer dereference, address: 0000000000000058 > [13632.007777] [ T140] #PF: supervisor read access in kernel mode > [13632.007779] [ T140] #PF: error_code(0x0000) - not-present page > [13632.007781] [ T140] PGD 175454067 P4D 175454067 PUD 0 > [13632.007786] [ T140] Oops: Oops: 0000 [#1] PREEMPT SMP NOPTI > [13632.007788] [ T140] CPU: 6 UID: 0 PID: 140 Comm: irq/43-pciehp Kdump: loaded Tainted: G W 6.14.0-rc1-mystery-00004-g822c11592522 #43 > [13632.007791] [ T140] Tainted: [W]=WARN > [13632.007793] [ T140] Hardware name: Micro-Star International Co., Ltd. Alpha 15 B5EEK/MS-158L, BIOS E158LAMS.10F 11/11/2024 > [13632.007795] [ T140] RIP: 0010:ttm_resource_move_to_lru_tail+0xc1/0xe0 [ttm] > [13632.007799] [ T140] Code: 46 40 48 8b 94 ca 98 00 00 00 48 8b 4e 48 48 89 4f 08 48 89 39 48 89 c1 48 83 c0 03 48 c1 e1 04 48 c1 e0 04 48 01 d1 48 01 c2 > <48> 8b 79 38 4c 89 41 38 48 89 56 40 48 89 7e 48 4c 89 07 e9 42 0d > [13632.007801] [ T140] RSP: 0018:ffff9ac3c0743af0 EFLAGS: 00010206 > [13632.007803] [ T140] RAX: 0000000000000050 RBX: ffff8c3276008848 RCX: 0000000000000020 > [13632.007805] [ T140] RDX: 0000000000000050 RSI: ffff8c332918e100 RDI: ffff8c332918e140 > [13632.007807] [ T140] RBP: ffff8c32a588ef80 R08: ffff8c332918e140 R09: 0000000000000000 > [13632.007809] [ T140] R10: 0000000000400032 R11: 0000000000000000 R12: 0000000000000000 > [13632.007811] [ T140] R13: ffff8c3276008800 R14: ffff8c32a588ef80 R15: ffff8c3276051358 > [13632.007812] [ T140] FS: 0000000000000000(0000) GS:ffff8c34ba780000(0000) knlGS:0000000000000000 > [13632.007814] [ T140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [13632.007816] [ T140] CR2: 0000000000000058 CR3: 000000010f62a000 CR4: 0000000000750ef0 > 
[13632.007818] [ T140] PKRU: 55555554 > [13632.007820] [ T140] Call Trace: > [13632.007821] [ T140] <TASK> > [13632.007823] [ T140] ? __die+0x51/0x92 > [13632.007826] [ T140] ? page_fault_oops+0x99/0x220 > [13632.007831] [ T140] ? exc_page_fault+0x32e/0x600 > [13632.007834] [ T140] ? asm_exc_page_fault+0x26/0x30 > [13632.007837] [ T140] ? ttm_resource_move_to_lru_tail+0xc1/0xe0 [ttm] > [13632.007841] [ T140] ttm_bo_unpin+0x58/0x80 [ttm] > [13632.007845] [ T140] amdgpu_bo_unpin+0x19/0x90 [amdgpu] > [13632.007926] [ T140] amdgpu_bo_free_kernel+0x77/0x100 [amdgpu] > [13632.008006] [ T140] amdgpu_device_fini_sw+0x339/0x3c0 [amdgpu] > [13632.008085] [ T140] amdgpu_driver_release_kms+0x11/0x30 [amdgpu] > [13632.008163] [ T140] drm_dev_put.part.0+0x37/0x60 > [13632.008166] [ T140] devres_release_all+0xa6/0xf0 > [13632.008169] [ T140] ? driver_probe_device+0xa0/0xa0 > [13632.008173] [ T140] device_unbind_cleanup+0x9/0x70 > [13632.008176] [ T140] really_probe+0x21c/0x340 > [13632.008179] [ T140] ? pm_runtime_barrier+0x4f/0x90 > [13632.008182] [ T140] __driver_probe_device+0x73/0x110 > [13632.008185] [ T140] driver_probe_device+0x1a/0xa0 > [13632.008188] [ T140] __device_attach_driver+0x84/0x110 > [13632.008192] [ T140] bus_for_each_drv+0x82/0xe0 > [13632.008195] [ T140] __device_attach+0xab/0x1b0 > [13632.008199] [ T140] pci_bus_add_device+0x53/0x80 > [13632.008201] [ T140] pci_bus_add_devices+0x2b/0x70 > [13632.008204] [ T140] pci_bus_add_devices+0x56/0x70 > [13632.008206] [ T140] pci_bus_add_devices+0x56/0x70 > [13632.008209] [ T140] pciehp_configure_device+0xaa/0x160 > [13632.008211] [ T140] ? pcie_capability_read_word+0x7a/0x90 > [13632.008214] [ T140] pciehp_handle_presence_or_link_change+0x1b2/0x350 > [13632.008217] [ T140] pciehp_ist+0x13b/0x180 > [13632.008220] [ T140] irq_thread_fn+0x1e/0x60 > [13632.008223] [ T140] irq_thread+0x114/0x1e0 > [13632.008225] [ T140] ? irq_finalize_oneshot.part.0+0xc0/0xc0 > [13632.008228] [ T140] ? 
irq_set_affinity_notifier+0x120/0x120 > [13632.008232] [ T140] ? irq_affinity_notify+0xd0/0xd0 > [13632.008235] [ T140] kthread+0xea/0x1e0 > [13632.008238] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13632.008241] [ T140] ret_from_fork+0x2f/0x50 > [13632.008244] [ T140] ? kthreads_online_cpu+0xf0/0xf0 > [13632.008247] [ T140] ret_from_fork_asm+0x11/0x20 > [13632.008251] [ T140] </TASK> > [13632.008253] [ T140] Modules linked in: sd_mod scsi_mod scsi_common ccm snd_seq_dummy snd_hrtimer snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq > snd_seq_device rfcomm bnep nls_ascii nls_cp437 vfat fat snd_hda_codec_generic snd_hda_codec_hdmi btusb btrtl snd_hda_intel btintel snd_intel_dspcfg btbcm > uvcvideo snd_hda_codec btmtk videobuf2_vmalloc snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn videobuf2_memops snd_hwdep uvc bluetooth snd_soc_core snd_hda_core > videobuf2_v4l2 snd_pcm_oss videodev snd_mixer_oss snd_pcm snd_rn_pci_acp3x edac_mce_amd videobuf2_common snd_timer snd_acp_config msi_wmi snd_soc_acpi > ecdh_generic ecc mc wmi_bmof sparse_keymap snd k10temp snd_pci_acp3x soundcore ccp ac battery button hid_sensor_accel_3d hid_sensor_magn_3d hid_sensor_gyro_3d > hid_sensor_prox hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer kfifo_buf amd_pmc industrialio hid_sensor_iio_common joydev evdev mt7921e > mt7921_common mt792x_lib mt76_connac_lib mt76 mac80211 libarc4 cfg80211 rfkill msr fuse nvme_fabrics efi_pstore > [13632.008316] [ T140] configfs efivarfs autofs4 ext4 mbcache jbd2 usbhid amdgpu drm_client_lib i2c_algo_bit drm_ttm_helper ttm drm_panel_backlight_quirks > drm_exec drm_suballoc_helper cec xhci_pci amdxcp drm_buddy xhci_hcd gpu_sched hid_sensor_hub mfd_core hid_multitouch hid_generic drm_display_helper usbcore > psmouse i2c_hid_acpi amd_sfh nvme i2c_hid hid serio_raw drm_kms_helper nvme_core r8169 i2c_piix4 i2c_smbus usb_common crc16 i2c_designware_platform > i2c_designware_core > [13632.008349] [ T140] CR2: 0000000000000058 > > > Bert Karwatzki ^ 
permalink raw reply [flat|nested] 31+ messages in thread
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-11-05 21:31 ` Mario Limonciello (AMD) (kernel.org)
@ 2025-11-07 13:09 ` Bert Karwatzki
2025-11-07 17:09 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-11-07 13:09 UTC (permalink / raw)
To: Mario Limonciello (AMD) (kernel.org), Christian König, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
spasswolf

On Wednesday, 05.11.2025 at 15:31 -0600, Mario Limonciello (AMD) (kernel.org) wrote:
>
> Once you're done with your bisect I'd be really interested if you can
> still reproduce the splats and NULL pointer on the recovery path using
> amd-staging-drm-next.
>

There is good news and bad news on this:

The good news:
I found out that one can generate a large number of ACPI GPP0 events and
resumes by scrolling through a large PDF (1305 pages - Gravitation by Wheeler,
Misner and Thorne) using the arrow keys. This can generate these crashes quite fast.

The bad news:
Using the method above I could generate these crashes in v6.13 and v6.14, so all the
previous bisecting was completely useless. Version v6.12 has not (yet, ...) crashed, so
I might be able to bisect between v6.12 and v6.13.

Here's a short log of the recent tests and time to crash (with number of GPP0 wakeup events and GPU resumes):

Retest:
6.14.0-stable booted 18:11:24, 6.11.2025, crashed 18:45:30 (~34min, 588 GPP0 events, 210 resumes)
Retest:
6.14.11-stable booted 19:09:33, 6.11.2025, crashed 19:17:42 (~8min (new record!), 122 GPP0 events, 44 resumes)
Testing (this was tested by the old method of starting evolution by script):
v6.13 booted 23:46:21, 6.11.2025, GPU lost 4:38, 7.11.2025 (~5h, 760 GPP0 events, 807 resumes) no crash
Retest:
v6.13 booted 9:12, 7.11.2025 crashed 11:25, 7.11.2025 (~1.25h, 351 GPP0 events, 330 resumes)
Testing:
v6.12.52 booted 11:27, 7.11.2025 no crash after 1h, 735 GPP0 events, 301 resumes
Testing:
v6.12 booted 13:00, 7.11.2025 no crash after 1h, 890 GPP0 events, 287 resumes


Bert Karwatzki
^ permalink raw reply [flat|nested] 31+ messages in thread
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-11-07 13:09 ` Bert Karwatzki
@ 2025-11-07 17:09 ` Bert Karwatzki
2025-11-10 13:33 ` Christian König
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-11-07 17:09 UTC (permalink / raw)
To: Mario Limonciello (AMD) (kernel.org), Christian König, linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
spasswolf

On Friday, 07.11.2025 at 14:09 +0100, Bert Karwatzki wrote:
>
> Testing:
> v6.12 booted 13:00, 7.11.2025 no crash after 1h, 890 GPP0 events, 287 resumes
>
>
> Bert Karwatzki

v6.12 crashed after 2h, 946 GPP0 events and 499 resumes. So there's no basis
for a bisection.

But the crash from v6.14.11 gave this error in netconsole:

2025-11-06T19:17:34.967439+01:00 T370;[drm] PCIE GART of 512M enabled (table at 0x00000081FEB00000).
2025-11-06T19:17:34.967439+01:00 T370;amdgpu 0000:03:00.0: amdgpu: PSP is resuming...#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:34.967588+01:00 T12;pci_bus 0000:03: Allocating resources#012 SUBSYSTEM=pci_bus#012 DEVICE=+pci_bus:0000:03
2025-11-06T19:17:35.143353+01:00 T370;amdgpu 0000:03:00.0: amdgpu: reserve 0xa00000 from 0x81fd000000 for PSP TMR#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.226021+01:00 T370;amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.237386+01:00 T370;amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.237386+01:00 T370;amdgpu 0000:03:00.0: amdgpu: SMU is resuming...#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.237386+01:00 T370;amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0)#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.237386+01:00 T370;amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:6 param:0x00000000 message:EnableAllSmuFeatures?#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: Failed to enable requested dpm features!#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: Failed to setup smc hw!#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: resume of IP block <smu> failed -121#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_resume failed (-121).#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-11-06T19:17:36.114889+01:00 C8;INFO: NMI handler (perf_event_nmi_handler) took too long to run: 35.314 msecs
2025-11-06T19:17:36.114889+01:00 C8;perf: interrupt took too long (275880 > 2500), lowering kernel.perf_event_max_sample_rate to 1000
2025-11-06T19:17:37.930799+01:00 C4;INFO: NMI handler (perf_event_nmi_handler) took too long to run: 152.914 msecs
2025-11-06T19:17:37.930799+01:00 C4;perf: interrupt took too long (1194640 > 344850), lowering kernel.perf_event_max_sample_rate to 1000
2025-11-06T19:17:38.939845+01:00 C14;INFO: NMI handler (perf_event_nmi_handler) took too long to run: 197.312 msecs
2025-11-06T19:17:38.939845+01:00 C14;perf: interrupt took too long (1541521 > 1493300), lowering kernel.perf_event_max_sample_rate to 1000

These 4 lines have not been recorded previously, so perhaps I have to look
for a NULL pointer dereference in an error path:

2025-11-06T19:17:42.571252+01:00 T1896;ACPI Error: AE_TIME, Returned by Handler for [EmbeddedControl] (20240827/evregion-301)
2025-11-06T19:17:42.571252+01:00 T1896;ACPI Error: Timeout from EC hardware or EC device driver (20240827/evregion-311)
2025-11-06T19:17:42.571252+01:00 T1896;ACPI Error: Aborting method \x5c_SB.PCI0.SBRG.EC.BAT1.UPBS due to previous error (AE_TIME) (20240827/psparse-529)
2025-11-06T19:17:42.571252+01:00 T1896;ACPI Error: Aborting method \x5c_SB.PCI0.SBRG.EC.BAT1._BST due to previous error (AE_TIME) (20240827/psparse-529)


Bert Karwatzki
^ permalink raw reply [flat|nested] 31+ messages in thread
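The netconsole records above carry their structured fields (SUBSYSTEM, DEVICE) inline, separated by "#012". A minimal sketch for splitting such a record follows. It assumes that "#012" is the logging daemon's octal escape for a newline, which is an assumption about the capture setup and not something stated in the thread; the function name split_netconsole_record is hypothetical.

```python
from typing import Dict, Tuple


def split_netconsole_record(line: str) -> Tuple[str, Dict[str, str]]:
    """Split one captured record into its message and KEY=VALUE fields.

    Assumption: the capture escaped embedded newlines as "#012", so every
    part after the first is a " KEY=VALUE" continuation of the record.
    """
    parts = line.split("#012")
    message = parts[0].rstrip()
    fields: Dict[str, str] = {}
    for part in parts[1:]:
        key, sep, value = part.strip().partition("=")
        if sep:  # ignore continuation parts that are not KEY=VALUE
            fields[key] = value
    return message, fields


record = ("T370;amdgpu 0000:03:00.0: amdgpu: SMU is resuming..."
          "#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0")
message, fields = split_netconsole_record(record)
print(message)
print(fields)
```

Records without any "#012" part, such as the perf and ACPI Error lines, come back with an empty field dictionary.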
* Re: [REGRESSION 00/04] Crash during resume of pcie bridge
2025-11-07 17:09 ` Bert Karwatzki
@ 2025-11-10 13:33 ` Christian König
2025-11-16 21:08 ` Crash during resume of pcie bridge due to infinite loop in ACPICA Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Christian König @ 2025-11-10 13:33 UTC (permalink / raw)
To: Bert Karwatzki, Mario Limonciello (AMD) (kernel.org), linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Rafael J . Wysocki

Hi Bert,

well, sorry to say that, but from your dumps it looks more and more like you just have faulty HW.

An SMU response of 0xFFFFFFFF means that the device has spontaneously fallen off the bus while trying to resume it.

My educated guess is that this is caused by faulty power management, but basically it could be anything.

Regards,
Christian.

On 11/7/25 18:09, Bert Karwatzki wrote:
> [...]
> But the crash from v6.14.11 gave this error in netconsole:
>
> [...]
> 2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:6 param:0x00000000 message:EnableAllSmuFeatures?#012
> SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
> 2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: Failed to enable requested dpm features!#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
> 2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: Failed to setup smc hw!#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
> 2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: resume of IP block <smu> failed -121#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
> 2025-11-06T19:17:35.509600+01:00 T370;amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_resume failed (-121).#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
> [...]
^ permalink raw reply [flat|nested] 31+ messages in thread
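The all-ones value in the SMU response is the pattern a register read commonly returns when a PCI device no longer answers on the bus. A minimal sketch of that check follows; the function name looks_like_device_gone and the default 32-bit width are illustrative assumptions and are not taken from amdgpu or the PCI core.

```python
def looks_like_device_gone(value: int, width_bits: int = 32) -> bool:
    """Return True if a register read came back with every bit set.

    A device that has dropped off the bus typically reads as all ones,
    so this is only a hint: a register may also legitimately hold it.
    """
    all_ones = (1 << width_bits) - 1
    return value == all_ones


# Values taken from the netconsole log quoted above.
smu_response = 0xFFFFFFFF    # response to EnableAllSmuFeatures
smu_fw_version = 0x003B3100  # firmware version read just before
print(looks_like_device_gone(smu_response))
print(looks_like_device_gone(smu_fw_version))
```

The check says nothing about why the device disappeared; it only separates a plausible register value from the bus-level error pattern.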
* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-11-10 13:33 ` Christian König
@ 2025-11-16 21:08 ` Bert Karwatzki
2025-11-17 16:40 ` Rafael J. Wysocki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-11-16 21:08 UTC (permalink / raw)
To: Christian König, Mario Limonciello (AMD) (kernel.org), linux-kernel
Cc: linux-next, regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
spasswolf, acpica-devel, Robert Moore

On Monday, 10.11.2025 at 14:33 +0100, Christian König wrote:
> Hi Bert,
>
> well sorry to say that but from your dumps it looks more and more like you just have faulty HW.
>
> An SMU response of 0xFFFFFFFF means that the device has spontaneously fallen off the bus while trying to resume it.
>
> My educated guess is that this is caused by a faulty power management, but basically it could be anything.
>
> Regards,
> Christian.

I think there may be more than one error here. The loss of the GPU (with the SMU response log message) may be caused
by faulty hardware but does not cause "the" crash (i.e. the crash which showed no log messages and was so hard that
one of my nvme devices was missing temporarily afterward, and which caused me to investigate this in the first place ...).

As bisection of the crash is impossible I went back to inserting printk()s into acpi_power_transition() and the
functions called by it. To reduce log spam I created _debug suffixed copies of the original functions.
The code is found here in branch amdgpu_suspend_resume:
https://gitlab.freedesktop.org/spasswolf/linux-stable/-/commits/amdgpu_suspend_resume?ref_type=heads
(Should I post the debug patches to the list?)

The last two commits finally cleared up what happens (but I've yet to find out why this happens).

6.14.0-debug-00014-g2e933c56f3b6 booted 20:17, 15.11.2025 crashed 0:50, 16.11.2025
(~4.5h, 518 GPP0 events, 393 GPU resumes)

The interesting part of the instrumented code is this:

acpi_status acpi_ps_parse_aml_debug(struct acpi_walk_state *walk_state)
{
	[...]
	printk(KERN_INFO "%s: before walk loop\n", __func__);
	while (walk_state) {
		if (ACPI_SUCCESS(status)) {
			/*
			 * The parse_loop executes AML until the method terminates
			 * or calls another method.
			 */
			status = acpi_ps_parse_loop(walk_state);
		}
		[...]
	}
	printk(KERN_INFO "%s: after walk loop\n", __func__);
	[...]
}

This gives the following messages in netconsole:

1. No crash:
2025-11-16T00:50:35.634745+01:00 10.0.0.1 6,21514,16419759755,-,caller=T59901;acpi_ps_execute_method_debug 329
2025-11-16T00:50:35.634745+01:00 10.0.0.1 6,21515,16419759781,-,caller=T59901;acpi_ps_parse_aml_debug: before walk loop
2025-11-16T00:50:35.921210+01:00 10.0.0.1 6,21516,16420046219,-,caller=T59901;acpi_ps_parse_aml_debug: after walk loop
2025-11-16T00:50:35.921210+01:00 10.0.0.1 6,21517,16420046231,-,caller=T59901;acpi_ps_execute_method_debug 331
2025-11-16T00:50:35.921210+01:00 10.0.0.1 6,21518,16420046235,-,caller=T59901;acpi_ns_evaluate_debug 475 METHOD
2025-11-16T00:50:35.921210+01:00 10.0.0.1 6,21519,16420046240,-,caller=T59901;acpi_evaluate_object_debug 255
2025-11-16T00:50:35.921210+01:00 10.0.0.1 6,21520,16420046244,-,caller=T59901;__acpi_power_on_debug 369
2025-11-16T00:50:35.921210+01:00 10.0.0.1 6,21521,16420046248,-,caller=T59901;acpi_power_on_unlocked_debug 446
2025-11-16T00:50:35.921210+01:00 10.0.0.1 6,21522,16420046251,-,caller=T59901;acpi_power_on_debug 471
2025-11-16T00:50:35.921210+01:00 10.0.0.1 6,21523,16420046255,-,caller=T59901;acpi_power_on_list_debug 642: result = 0
Resume successful, normal messages from resuming GPU follow.

2. Crash:
2025-11-16T00:50:46.483555+01:00 10.0.0.1 6,21566,16430609060,-,caller=T59702;acpi_ps_execute_method_debug 329
2025-11-16T00:50:46.483555+01:00 10.0.0.1 6,21567,16430609083,-,caller=T59702;acpi_ps_parse_aml_debug: before walk loop
No more messages via netconsole due to crash.

So here we can already say that the main loop in acpi_ps_parse_aml_debug() is not finishing properly.

The next step is to put monitoring inside the loop:

6.14.0-debug-00015-gc09fd8dd0492 booted 12:09, 16.11.2025 crashed 19:55, 16.11.2025
(~8h, 1539 GPP0 events, 587 GPU resumes) "infinite" walk loop

The interesting part of the instrumented code is this:

acpi_status acpi_ps_parse_aml_debug(struct acpi_walk_state *walk_state)
{
	[...]
	printk(KERN_INFO "%s: before walk loop\n", __func__);
	while (walk_state) {
		if (ACPI_SUCCESS(status)) {
			/*
			 * The parse_loop executes AML until the method terminates
			 * or calls another method.
			 */
			printk(KERN_INFO "%s: before parse loop\n", __func__);
			status = acpi_ps_parse_loop(walk_state);
			printk(KERN_INFO "%s: after parse loop\n", __func__);
		}
		[...]
	}
	printk(KERN_INFO "%s: after walk loop\n", __func__);
	[...]
}

This gives the following messages in netconsole:

1. No crash:
2025-11-16T19:55:54.203765+01:00 localhost 6,5479352,28054924877,-,caller=T5967;acpi_ps_execute_method_debug 329
2025-11-16T19:55:54.203765+01:00 localhost 6,5479353,28054924889,-,caller=T5967;acpi_ps_parse_aml_debug: before walk loop
The next two lines are repeated 1500-1700 times (it varies a little ...):
2025-11-16T19:55:54.203765+01:00 localhost 6,5479354,28054924894,-,caller=T5967;acpi_ps_parse_aml_debug: before parse loop
2025-11-16T19:55:54.203765+01:00 localhost 6,5479355,28054924908,-,caller=T5967;acpi_ps_parse_aml_debug: after parse loop

2025-11-16T19:55:54.498216+01:00 localhost 6,5482288,28055219778,-,caller=T5967;acpi_ps_parse_aml_debug: after walk loop
2025-11-16T19:55:54.498216+01:00 localhost 6,5482289,28055219782,-,caller=T5967;acpi_ps_execute_method_debug 331
2025-11-16T19:55:54.498233+01:00 localhost 6,5482290,28055219786,-,caller=T5967;acpi_ns_evaluate_debug 475 METHOD
2025-11-16T19:55:54.498233+01:00 localhost 6,5482291,28055219791,-,caller=T5967;acpi_evaluate_object_debug 255
2025-11-16T19:55:54.498233+01:00 localhost 6,5482292,28055219795,-,caller=T5967;__acpi_power_on_debug 369
2025-11-16T19:55:54.498247+01:00 localhost 6,5482293,28055219799,-,caller=T5967;acpi_power_on_unlocked_debug 446
2025-11-16T19:55:54.498247+01:00 localhost 6,5482294,28055219802,-,caller=T5967;acpi_power_on_debug 471
2025-11-16T19:55:54.498247+01:00 localhost 6,5482295,28055219806,-,caller=T5967;acpi_power_on_list_debug 642: result = 0
Resume successful, normal messages from resuming GPU follow.

2. Crash:
2025-11-16T19:56:24.213495+01:00 localhost 6,5483042,28084932950,-,caller=T5967;acpi_ps_execute_method_debug 329
2025-11-16T19:56:24.213495+01:00 localhost 6,5483043,28084932965,-,caller=T5967;acpi_ps_parse_aml_debug: before walk loop
The next two lines are repeated more than 30000 times, then the transmission stops due to the crash:
2025-11-16T19:56:24.213495+01:00 localhost 6,5483044,28084932971,-,caller=T5967;acpi_ps_parse_aml_debug: before parse loop
2025-11-16T19:56:24.213495+01:00 localhost 6,5483045,28084932991,-,caller=T5967;acpi_ps_parse_aml_debug: after parse loop
No more messages via netconsole due to crash.

So there is some kind of infinite recursion happening inside the loop in acpi_ps_parse_aml(). Even if there is some kind
of error in the hardware this shouldn't happen, I think.

This bug is present in every kernel version I've tested so far, that is 6.12.x, 6.13.x, 6.14.x,
6.15.x, 6.16.x, 6.17.x (here I only tested the release candidates). 6.18 has not been tested yet.

Getting to this result took several months of 24/7 test runs; I hope resolving this will
be faster.

Bert Karwatzki
^ permalink raw reply [flat|nested] 31+ messages in thread
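The comparison between the successful and the crashing run can be automated over a saved netconsole capture. The following is a minimal sketch, assuming the capture contains the exact marker strings printed by the instrumented acpi_ps_parse_aml_debug() quoted in the message above and that only one thread is inside the walk loop at a time. The function name count_parse_loops is hypothetical.

```python
from typing import Iterable, List, Tuple

# Marker strings as printed by the instrumented function in the thread.
BEFORE_WALK = "acpi_ps_parse_aml_debug: before walk loop"
AFTER_WALK = "acpi_ps_parse_aml_debug: after walk loop"
BEFORE_PARSE = "acpi_ps_parse_aml_debug: before parse loop"


def count_parse_loops(lines: Iterable[str]) -> List[Tuple[int, bool]]:
    """Return (iterations, finished) for every walk loop in the capture.

    finished is False when the capture ends (or a new walk loop starts)
    before the "after walk loop" marker was seen, which is what the
    crashing run looks like.
    """
    results: List[Tuple[int, bool]] = []
    count = None  # None means: currently not inside a walk loop
    for line in lines:
        if BEFORE_WALK in line:
            if count is not None:
                results.append((count, False))
            count = 0
        elif BEFORE_PARSE in line and count is not None:
            count += 1
        elif AFTER_WALK in line and count is not None:
            results.append((count, True))
            count = None
    if count is not None:
        results.append((count, False))
    return results


sample = [
    "caller=T5967;acpi_ps_parse_aml_debug: before walk loop",
    "caller=T5967;acpi_ps_parse_aml_debug: before parse loop",
    "caller=T5967;acpi_ps_parse_aml_debug: after parse loop",
    "caller=T5967;acpi_ps_parse_aml_debug: after walk loop",
    "caller=T5967;acpi_ps_parse_aml_debug: before walk loop",
    "caller=T5967;acpi_ps_parse_aml_debug: before parse loop",
]
for iterations, finished in count_parse_loops(sample):
    print(iterations, "finished" if finished else "never finished")
```

On a real capture, a walk loop that never finishes and has an iteration count far above the 1500-1700 seen in successful resumes would correspond to the crashing case described above.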
* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-11-16 21:08 ` Crash during resume of pcie bridge due to infinite loop in ACPICA Bert Karwatzki
@ 2025-11-17 16:40 ` Rafael J. Wysocki
2025-11-24 22:34 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Rafael J. Wysocki @ 2025-11-17 16:40 UTC (permalink / raw)
To: Bert Karwatzki
Cc: Christian König, Mario Limonciello (AMD) (kernel.org), linux-kernel,
linux-next, regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
acpica-devel, Robert Moore, Saket Dumbre

+Saket

On Sun, Nov 16, 2025 at 10:09 PM Bert Karwatzki <spasswolf@web.de> wrote:
> [...]
> So there is some kind of infinite recursion happening inside the loop in acpi_ps_parse_aml(). Even if there is some kind
> of error in the hardware this shouldn't happen, I think.
>
> This bug is present in every kernel version I've tested so far, that is 6.12.x, 6.13.x, 6.14.x,
> 6.15.x, 6.16.x, 6.17.x (here I only tested the release candidates). 6.18 has not been tested, yet.
>
> To get to this result took several months of 24/7 test runs, I hope resolving this will
> be faster.

Well, what you have found appears to be an issue in the AML bytecode
interpreter which may be one of two things: (1) a bug in the
interpreter itself or (2) a bytecode issue that causes the interpreter
to crash (eventually) and the latter is quite a bit more likely.

I'd suggest opening a new issue at
https://github.com/acpica/acpica/issues and attaching the acpidump
output from the affected system, to start with.
^ permalink raw reply [flat|nested] 31+ messages in thread
* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-11-17 16:40 ` Rafael J. Wysocki
@ 2025-11-24 22:34 ` Bert Karwatzki
2025-11-25 19:46 ` Rafael J. Wysocki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-11-24 22:34 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Christian König, Mario Limonciello (AMD) (kernel.org),
linux-kernel, linux-next, regressions, linux-pci, linux-acpi,
Rafael J . Wysocki, acpica-devel, Robert Moore, Saket Dumbre,
spasswolf

On Monday, 17.11.2025 at 17:40 +0100, Rafael J. Wysocki wrote:
>
> Well, what you have found appears to be an issue in the AML bytecode
> interpreter which may be one of two things: (1) a bug in the
> interpreter itself or (2) a bytecode issue that causes the interpreter
> to crash (eventually) and the latter is quite a bit more likely.
>
> I'd suggest opening a new issue at
> https://github.com/acpica/acpica/issues and attaching the acpidump
> output from the affected system, to start with.

I've reported the bug on the ACPICA GitHub:
https://github.com/acpica/acpica/issues/1060

There's no "infinite" loop, but a loop running for 5051 (0x13BB) iterations until its timeout
counter reaches zero (most likely because the hardware is unresponsive). Soon afterwards (only a
handful of iterations of the walk loop in acpi_ps_parse_aml()) the crash happens. I think
the crash actually occurs inside acpi_ps_parse_loop(), so I wouldn't rule out an interpreter
bug just yet.
The crash also always happens (if it happens ...) in the 30592nd iteration of the walk loop,
so I'm now monitoring the internals of acpi_ps_parse_loop() only in this iteration of the walk
loop. (I've tried to monitor the parse loop before, but that only led to excessive memory
consumption and an activated OOM killer.) The debugging code can be found here:
https://gitlab.freedesktop.org/spasswolf/linux-stable/-/commits/amdgpu_suspend_resume?ref_type=heads

So far I've had no crash with this.

Bert Karwatzki

* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-11-24 22:34 ` Bert Karwatzki
@ 2025-11-25 19:46 ` Rafael J. Wysocki
2025-11-27 0:08 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Rafael J. Wysocki @ 2025-11-25 19:46 UTC (permalink / raw)
To: Bert Karwatzki
Cc: Rafael J. Wysocki, Christian König,
Mario Limonciello (AMD) (kernel.org), linux-kernel, linux-next,
regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
acpica-devel, Robert Moore, Saket Dumbre

On Mon, Nov 24, 2025 at 11:34 PM Bert Karwatzki <spasswolf@web.de> wrote:
>
> On Monday, 17.11.2025 at 17:40 +0100, Rafael J. Wysocki wrote:
> >
> > Well, what you have found appears to be an issue in the AML bytecode
> > interpreter which may be one of two things: (1) a bug in the
> > interpreter itself or (2) a bytecode issue that causes the interpreter
> > to crash (eventually) and the latter is quite a bit more likely.
> >
> > I'd suggest opening a new issue at
> > https://github.com/acpica/acpica/issues and attaching the acpidump
> > output from the affected system, to start with.
>
> I've reported the bug on the ACPICA GitHub:
> https://github.com/acpica/acpica/issues/1060

I've seen your report, thanks for filing it.

> There's no "infinite" loop, but a loop running for 5051 (0x13BB) iterations until its timeout
> counter reaches zero (most likely because the hardware is unresponsive). Soon afterwards (only a
> handful of iterations of the walk loop in acpi_ps_parse_aml()) the crash happens. I think
> the crash actually occurs inside acpi_ps_parse_loop(), so I wouldn't rule out an interpreter
> bug just yet.
> The crash also always happens (if it happens ...) in the 30592nd iteration of the walk loop,
> so I'm now monitoring the internals of acpi_ps_parse_loop() only in this iteration of the walk
> loop. (I've tried to monitor the parse loop before, but that only led to excessive memory
> consumption and an activated OOM killer.) The debugging code can be found here:
> https://gitlab.freedesktop.org/spasswolf/linux-stable/-/commits/amdgpu_suspend_resume?ref_type=heads
>
> So far I've had no crash with this.

What may be happening, but this is just a theory, is that the
interpreter aborts the evaluation of a method due to an internal
timeout, essentially the control_state->control.loop_timeout check in
acpi_ds_exec_end_control_op() and that leads to a subsequent hard
failure like a deadlock.

This may be tested by increasing the ACPI_MAX_LOOP_TIMEOUT value, but
I'm not sure it's practical to try that.

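For readers following the theory above, the kind of guard being described can be modelled in user space roughly as follows. This is a simplified sketch and not ACPICA code: `struct loop_control_model`, `loop_begin()` and `loop_timed_out()` are hypothetical names, the timer value is passed in by the caller, and the 100 ns tick unit is an assumption of the model. Only the idea comes from the message: a deadline is recorded when a While loop starts, and the method is aborted once the timer passes it.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Assumption of this model: the timer counts in 100 ns ticks. */
#define MODEL_TICKS_PER_SEC 10000000ULL
/* Assumption: a 30 second limit, used here only as an example value. */
#define MODEL_MAX_LOOP_TIMEOUT_SEC 30ULL

/* Hypothetical stand-in for the per-loop control state. */
struct loop_control_model {
	uint64_t loop_timeout;	/* absolute deadline in timer ticks */
};

/* Record the deadline when a While loop is entered at time "now". */
static inline void loop_begin(struct loop_control_model *cs, uint64_t now)
{
	assert(cs != NULL);
	cs->loop_timeout = now + MODEL_MAX_LOOP_TIMEOUT_SEC * MODEL_TICKS_PER_SEC;
}

/* Return true once the deadline has passed and the loop must be aborted. */
static inline bool loop_timed_out(const struct loop_control_model *cs,
				  uint64_t now)
{
	assert(cs != NULL);
	return now > cs->loop_timeout;
}
```

In this model a loop that has been running for only a couple of seconds is far from its deadline, so whether the theory applies depends on how long the loop actually runs before the failure.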
* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-11-25 19:46 ` Rafael J. Wysocki
@ 2025-11-27 0:08 ` Bert Karwatzki
2025-11-27 13:02 ` Rafael J. Wysocki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-11-27 0:08 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Christian König, Mario Limonciello (AMD) (kernel.org),
linux-kernel, linux-next, regressions, linux-pci, linux-acpi,
Rafael J . Wysocki, acpica-devel, Robert Moore, Saket Dumbre,
spasswolf

On Tuesday, 25.11.2025 at 20:46 +0100, Rafael J. Wysocki wrote:
>
>
> What may be happening, but this is just a theory, is that the
> interpreter aborts the evaluation of a method due to an internal
> timeout, essentially the control_state->control.loop_timeout check in
> acpi_ds_exec_end_control_op() and that leads to a subsequent hard
> failure like a deadlock.
>
> This may be tested by increasing the ACPI_MAX_LOOP_TIMEOUT value, but
> I'm not sure it's practical to try that.

I don't think this is the case here, because ACPI_MAX_LOOP_TIMEOUT defaults to
30s and the walk loop only runs for ~2s before the crash.

Bert Karwatzki

* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-11-27 0:08 ` Bert Karwatzki
@ 2025-11-27 13:02 ` Rafael J. Wysocki
2025-11-28 20:47 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Rafael J. Wysocki @ 2025-11-27 13:02 UTC (permalink / raw)
To: Bert Karwatzki
Cc: Rafael J. Wysocki, Christian König,
Mario Limonciello (AMD) (kernel.org), linux-kernel, linux-next,
regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
acpica-devel, Robert Moore, Saket Dumbre

On Thu, Nov 27, 2025 at 1:08 AM Bert Karwatzki <spasswolf@web.de> wrote:
>
> On Tuesday, 25.11.2025 at 20:46 +0100, Rafael J. Wysocki wrote:
> >
> >
> > What may be happening, but this is just a theory, is that the
> > interpreter aborts the evaluation of a method due to an internal
> > timeout, essentially the control_state->control.loop_timeout check in
> > acpi_ds_exec_end_control_op() and that leads to a subsequent hard
> > failure like a deadlock.
> >
> > This may be tested by increasing the ACPI_MAX_LOOP_TIMEOUT value, but
> > I'm not sure it's practical to try that.
>
> I don't think this is the case here, because ACPI_MAX_LOOP_TIMEOUT defaults to
> 30s and the walk loop only runs for ~2s before the crash.

I see, thanks!

* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-11-27 13:02 ` Rafael J. Wysocki
@ 2025-11-28 20:47 ` Bert Karwatzki
2025-12-02 18:59 ` Rafael J. Wysocki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-11-28 20:47 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Christian König, Mario Limonciello (AMD) (kernel.org),
linux-kernel, linux-next, regressions, linux-pci, linux-acpi,
Rafael J . Wysocki, acpica-devel, Robert Moore, Saket Dumbre,
spasswolf

This is not an ACPICA problem after all:

I did some more monitoring:
https://gitlab.freedesktop.org/spasswolf/linux-stable/-/commits/amdgpu_suspend_resume?ref_type=heads
and I still get a crash, but, perhaps because of the delays caused by the printk()s, I now get a helpful error message in netconsole:

T5971;ACPI BIOS Error (bug): Could not resolve symbol [\x5cM013.VARR], AE_NOT_FOUND (20240827/psargs-332)
T5971;acpi_ps_complete_op returned 0x5
T5971;acpi_ps_parse_aml_debug: parse loop returned = 0x5
T5971;ACPI Error: Aborting method \x5cM013 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
T5971;ACPI Error: Aborting method \x5cM017 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
T5971;ACPI Error: Aborting method \x5cM019 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M439 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M241 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M237._ON due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
T5971;acpi_ps_parse_aml_debug: after walk loop
T5971;acpi_ps_execute_method_debug 331
T5971;acpi_ns_evaluate_debug 475 METHOD
T5971;acpi_evaluate_object_debug 255
T5971;__acpi_power_on_debug 369
T5971;acpi_power_on_unlocked_debug 442
T5971;acpi_power_on_unlocked_debug 446
T5971;acpi_power_on_debug 471
T5971;acpi_power_on_list_debug 649: result = -19
T5971;pcieport 0000:00:01.1: pci_pm_default_resume_early 568#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
T5971;pcieport 0000:00:01.1: broken device, retraining non-functional downstream link at 2.5GT/s#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
T5971;pcieport 0000:00:01.1: retraining failed#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
T5971;pcieport 0000:00:01.1: Data Link Layer Link Active not set in 1000 msec#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
T5971;pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:01:00.0

This shows that there seems to be no problem with ACPICA: acpi_power_on_list(_debug)() returns -ENODEV (-19),
and the crash occurs later.

This leaves two questions:
1. Is this crash avoidable by different error handling in the pci code?
2. If the crash is not avoidable, can we at least modify the error handling in such a way that
we get an error message through netconsole by default? (perhaps a little delay will suffice)

Bert Karwatzki

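The two numeric codes in the log above (0x5 from the AML parser and -19 from the power-resource code) can be related with a small user-space sketch. It is only a model under stated assumptions: `model_power_on()` and the `MODEL_` constants are hypothetical names, and the rule "any failed `_ON` evaluation is reported as -ENODEV" is inferred from the log rather than copied from the kernel source. The value 0x5 for AE_NOT_FOUND is the one printed in the log.

```c
#include <assert.h>
#include <errno.h>

/* Hypothetical model of an ACPI status value. */
typedef unsigned int model_acpi_status;

#define MODEL_AE_OK		0x0000u
#define MODEL_AE_NOT_FOUND	0x0005u	/* printed as 0x5 in the log */

/*
 * Model of the power-on step: the status of the _ON evaluation is
 * collapsed into a single errno-style result (assumption of the sketch).
 */
static inline int model_power_on(model_acpi_status on_status)
{
	if (on_status != MODEL_AE_OK)
		return -ENODEV;	/* -19 on Linux, as seen in "result = -19" */

	return 0;
}
```

Under this model, callers further up the stack only see -ENODEV; the more specific AE_NOT_FOUND reason is visible only in the ACPI error messages.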
* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-11-28 20:47 ` Bert Karwatzki
@ 2025-12-02 18:59 ` Rafael J. Wysocki
2025-12-02 19:53 ` Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Rafael J. Wysocki @ 2025-12-02 18:59 UTC (permalink / raw)
To: Bert Karwatzki
Cc: Rafael J. Wysocki, Christian König,
Mario Limonciello (AMD) (kernel.org), linux-kernel, linux-next,
regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
acpica-devel, Robert Moore, Saket Dumbre

[-- Attachment #1: Type: text/plain, Size: 2828 bytes --]

On Fri, Nov 28, 2025 at 9:47 PM Bert Karwatzki <spasswolf@web.de> wrote:
>
> This is not an ACPICA problem after all:
>
> I did some more monitoring:
> https://gitlab.freedesktop.org/spasswolf/linux-stable/-/commits/amdgpu_suspend_resume?ref_type=heads
> and I still get a crash, but, perhaps because of the delays caused by the printk()s, I now get a helpful error message in netconsole:
>
> T5971;ACPI BIOS Error (bug): Could not resolve symbol [\x5cM013.VARR], AE_NOT_FOUND (20240827/psargs-332)
> T5971;acpi_ps_complete_op returned 0x5
> T5971;acpi_ps_parse_aml_debug: parse loop returned = 0x5
> T5971;ACPI Error: Aborting method \x5cM013 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> T5971;ACPI Error: Aborting method \x5cM017 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> T5971;ACPI Error: Aborting method \x5cM019 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M439 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M241 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M237._ON due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> T5971;acpi_ps_parse_aml_debug: after walk loop
> T5971;acpi_ps_execute_method_debug 331
> T5971;acpi_ns_evaluate_debug 475 METHOD
> T5971;acpi_evaluate_object_debug 255
> T5971;__acpi_power_on_debug 369
> T5971;acpi_power_on_unlocked_debug 442
> T5971;acpi_power_on_unlocked_debug 446
> T5971;acpi_power_on_debug 471
> T5971;acpi_power_on_list_debug 649: result = -19
> T5971;pcieport 0000:00:01.1: pci_pm_default_resume_early 568#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> T5971;pcieport 0000:00:01.1: broken device, retraining non-functional downstream link at 2.5GT/s#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> T5971;pcieport 0000:00:01.1: retraining failed#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> T5971;pcieport 0000:00:01.1: Data Link Layer Link Active not set in 1000 msec#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> T5971;pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:01:00.0
>
> This shows that there seems to be no problem with ACPICA: acpi_power_on_list(_debug)() returns -ENODEV (-19),
> and the crash occurs later.
>
> This leaves two questions:
> 1. Is this crash avoidable by different error handling in the pci code?
> 2. If the crash is not avoidable, can we at least modify the error handling in such a way that
> we get an error message through netconsole by default? (perhaps a little delay will suffice)

I'm not sure how far this is going to get you, but you may try the
attached patch.

[-- Attachment #2: pci-pm-default-resume-early.patch --]
[-- Type: text/x-patch, Size: 2289 bytes --]

---
 drivers/pci/pci-driver.c |   27 +++++++++++++++++++++------
 1 file changed, 21 insertions(+), 6 deletions(-)

--- a/drivers/pci/pci-driver.c
+++ b/drivers/pci/pci-driver.c
@@ -555,11 +555,16 @@ static void pci_pm_default_resume(struct
 	pci_enable_wake(pci_dev, PCI_D0, false);
 }
 
-static void pci_pm_default_resume_early(struct pci_dev *pci_dev)
+static int pci_pm_default_resume_early(struct pci_dev *pci_dev)
 {
 	pci_pm_power_up_and_verify_state(pci_dev);
+	/* Bail out if the device is not accessible. */
+	if (pci_dev->current_state == PCI_D3cold)
+		return -ENODEV;
+
 	pci_restore_state(pci_dev);
 	pci_pme_restore(pci_dev);
+	return 0;
 }
 
 static void pci_pm_bridge_power_up_actions(struct pci_dev *pci_dev)
@@ -958,8 +963,11 @@ static int pci_pm_resume_noirq(struct de
 	 * configuration here and attempting to put them into D0 again is
 	 * pointless, so avoid doing that.
 	 */
-	if (!(skip_bus_pm && pm_suspend_no_platform()))
-		pci_pm_default_resume_early(pci_dev);
+	if (!(skip_bus_pm && pm_suspend_no_platform())) {
+		int error = pci_pm_default_resume_early(pci_dev);
+		if (error)
+			return error;
+	}
 
 	pci_fixup_device(pci_fixup_resume_early, pci_dev);
 	pcie_pme_root_status_cleanup(pci_dev);
@@ -1221,8 +1229,12 @@ static int pci_pm_restore_noirq(struct d
 {
 	struct pci_dev *pci_dev = to_pci_dev(dev);
 	const struct dev_pm_ops *pm = dev->driver ? dev->driver->pm : NULL;
+	int error;
+
+	error = pci_pm_default_resume_early(pci_dev);
+	if (error)
+		return error;
 
-	pci_pm_default_resume_early(pci_dev);
 	pci_fixup_device(pci_fixup_resume_early, pci_dev);
 
 	if (pci_has_legacy_pm_support(pci_dev))
@@ -1339,14 +1351,17 @@ static int pci_pm_runtime_resume(struct
 	struct pci_dev *pci_dev = to_pci_dev(dev);
 	const struct dev_pm_ops *pm = dev->driver ? dev->driver->pm : NULL;
 	pci_power_t prev_state = pci_dev->current_state;
-	int error = 0;
+	int error;
 
 	/*
 	 * Restoring config space is necessary even if the device is not bound
 	 * to a driver because although we left it in D0, it may have gone to
 	 * D3cold when the bridge above it runtime suspended.
 	 */
-	pci_pm_default_resume_early(pci_dev);
+	error = pci_pm_default_resume_early(pci_dev);
+	if (error)
+		return error;
+
 	pci_resume_ptm(pci_dev);
 
 	if (!pci_dev->driver)

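The pattern the patch introduces (turn a void helper into one that reports failure, and make each caller stop on error) can be shown in isolation with a user-space model. This is a sketch for illustration only: all `model_` names are hypothetical, the power-up result is injected as a parameter, and the real kernel functions do considerably more than this.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

enum model_pstate { MODEL_PCI_D0, MODEL_PCI_D3COLD };

struct model_pci_dev {
	enum model_pstate current_state;
	bool state_restored;	/* set when a config space restore was attempted */
};

/* Models the patched helper: report failure instead of restoring blindly. */
static inline int model_resume_early(struct model_pci_dev *dev, bool power_up_ok)
{
	assert(dev != NULL);
	dev->current_state = power_up_ok ? MODEL_PCI_D0 : MODEL_PCI_D3COLD;

	/* Bail out if the device is not accessible. */
	if (dev->current_state == MODEL_PCI_D3COLD)
		return -ENODEV;

	dev->state_restored = true;
	return 0;
}

/* Models a caller such as a runtime-resume path: stop at the first error. */
static inline int model_runtime_resume(struct model_pci_dev *dev, bool power_up_ok)
{
	int error = model_resume_early(dev, power_up_ok);

	if (error)
		return error;

	/* Further resume steps would run here in a real implementation. */
	return 0;
}
```

The point of the model is that no restore is attempted on a device that stayed in D3cold, and the caller sees a negative return value instead of continuing as if the resume had succeeded.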
* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-12-02 18:59 ` Rafael J. Wysocki
@ 2025-12-02 19:53 ` Bert Karwatzki
2025-12-02 20:01 ` Rafael J. Wysocki
0 siblings, 1 reply; 31+ messages in thread
From: Bert Karwatzki @ 2025-12-02 19:53 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Christian König, Mario Limonciello (AMD) (kernel.org),
linux-kernel, linux-next, regressions, linux-pci, linux-acpi,
Rafael J . Wysocki, acpica-devel, Robert Moore, Saket Dumbre,
spasswolf

On Tuesday, 02.12.2025 at 19:59 +0100, Rafael J. Wysocki wrote:
> On Fri, Nov 28, 2025 at 9:47 PM Bert Karwatzki <spasswolf@web.de> wrote:
> >
> > This is not an ACPICA problem after all:
> >
> > I did some more monitoring:
> > https://gitlab.freedesktop.org/spasswolf/linux-stable/-/commits/amdgpu_suspend_resume?ref_type=heads
> > and I still get a crash, but, perhaps because of the delays caused by the printk()s, I now get a helpful error message in netconsole:
> >
> > T5971;ACPI BIOS Error (bug): Could not resolve symbol [\x5cM013.VARR], AE_NOT_FOUND (20240827/psargs-332)
> > T5971;acpi_ps_complete_op returned 0x5
> > T5971;acpi_ps_parse_aml_debug: parse loop returned = 0x5
> > T5971;ACPI Error: Aborting method \x5cM013 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > T5971;ACPI Error: Aborting method \x5cM017 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > T5971;ACPI Error: Aborting method \x5cM019 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M439 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M241 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M237._ON due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > T5971;acpi_ps_parse_aml_debug: after walk loop
> > T5971;acpi_ps_execute_method_debug 331
> > T5971;acpi_ns_evaluate_debug 475 METHOD
> > T5971;acpi_evaluate_object_debug 255
> > T5971;__acpi_power_on_debug 369
> > T5971;acpi_power_on_unlocked_debug 442
> > T5971;acpi_power_on_unlocked_debug 446
> > T5971;acpi_power_on_debug 471
> > T5971;acpi_power_on_list_debug 649: result = -19
> > T5971;pcieport 0000:00:01.1: pci_pm_default_resume_early 568#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> > T5971;pcieport 0000:00:01.1: broken device, retraining non-functional downstream link at 2.5GT/s#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> > T5971;pcieport 0000:00:01.1: retraining failed#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> > T5971;pcieport 0000:00:01.1: Data Link Layer Link Active not set in 1000 msec#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> > T5971;pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:01:00.0
> >
> > This shows that there seems to be no problem with ACPICA: acpi_power_on_list(_debug)() returns -ENODEV (-19),
> > and the crash occurs later.
> >
> > This leaves two questions:
> > 1. Is this crash avoidable by different error handling in the pci code?
> > 2. If the crash is not avoidable, can we at least modify the error handling in such a way that
> > we get an error message through netconsole by default? (perhaps a little delay will suffice)
>
> I'm not sure how far this is going to get you, but you may try the
> attached patch.

This looks worth trying, I'll try it once my current test run has crashed.

Currently I'm trying to figure out why this line is there:

pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:01:00.0

This line comes from this part of pci_power_up():

	pci_read_config_word(dev, dev->pm_cap + PCI_PM_CTRL, &pmcsr);
	if (PCI_POSSIBLE_ERROR(pmcsr)) {
		pci_err(dev, "Unable to change power state from %s to D0, device inaccessible\n",
			pci_power_name(dev->current_state));
		WARN(1, "Who is calling %s?\n", __func__); // My debug statement. (No result, yet.)
		dev->current_state = PCI_D3cold;
		return -EIO;
	}

The interesting thing here is that the pci device 0000:01:00.0 has already been disconnected
(with pci_dev_set_disconnected()) when the resume of the bridge at 0000:00:01.1 failed
(in the failure path of pci_pm_bridge_power_up_actions()) (I know for sure
because I put printk()s there, too).
I'm not sure if pci_power_up() should be called in this case.

Bert Karwatzki

* Re: Crash during resume of pcie bridge due to infinite loop in ACPICA
2025-12-02 19:53 ` Bert Karwatzki
@ 2025-12-02 20:01 ` Rafael J. Wysocki
2025-12-05 10:05 ` Crash during resume of pcie bridge due to incorrect error handling Bert Karwatzki
0 siblings, 1 reply; 31+ messages in thread
From: Rafael J. Wysocki @ 2025-12-02 20:01 UTC (permalink / raw)
To: Bert Karwatzki
Cc: Rafael J. Wysocki, Christian König,
Mario Limonciello (AMD) (kernel.org), linux-kernel, linux-next,
regressions, linux-pci, linux-acpi, Rafael J . Wysocki,
acpica-devel, Robert Moore, Saket Dumbre

On Tue, Dec 2, 2025 at 8:54 PM Bert Karwatzki <spasswolf@web.de> wrote:
>
> On Tuesday, 02.12.2025 at 19:59 +0100, Rafael J. Wysocki wrote:
> > On Fri, Nov 28, 2025 at 9:47 PM Bert Karwatzki <spasswolf@web.de> wrote:
> > >
> > > This is not an ACPICA problem after all:
> > >
> > > I did some more monitoring:
> > > https://gitlab.freedesktop.org/spasswolf/linux-stable/-/commits/amdgpu_suspend_resume?ref_type=heads
> > > and I still get a crash, but, perhaps because of the delays caused by the printk()s, I now get a helpful error message in netconsole:
> > >
> > > T5971;ACPI BIOS Error (bug): Could not resolve symbol [\x5cM013.VARR], AE_NOT_FOUND (20240827/psargs-332)
> > > T5971;acpi_ps_complete_op returned 0x5
> > > T5971;acpi_ps_parse_aml_debug: parse loop returned = 0x5
> > > T5971;ACPI Error: Aborting method \x5cM013 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > > T5971;ACPI Error: Aborting method \x5cM017 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > > T5971;ACPI Error: Aborting method \x5cM019 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > > T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M439 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > > T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M241 due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > > T5971;ACPI Error: Aborting method \x5c_SB.PCI0.GPP0.M237._ON due to previous error (AE_NOT_FOUND) (20240827/psparse-935)
> > > T5971;acpi_ps_parse_aml_debug: after walk loop
> > > T5971;acpi_ps_execute_method_debug 331
> > > T5971;acpi_ns_evaluate_debug 475 METHOD
> > > T5971;acpi_evaluate_object_debug 255
> > > T5971;__acpi_power_on_debug 369
> > > T5971;acpi_power_on_unlocked_debug 442
> > > T5971;acpi_power_on_unlocked_debug 446
> > > T5971;acpi_power_on_debug 471
> > > T5971;acpi_power_on_list_debug 649: result = -19
> > > T5971;pcieport 0000:00:01.1: pci_pm_default_resume_early 568#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> > > T5971;pcieport 0000:00:01.1: broken device, retraining non-functional downstream link at 2.5GT/s#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> > > T5971;pcieport 0000:00:01.1: retraining failed#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> > > T5971;pcieport 0000:00:01.1: Data Link Layer Link Active not set in 1000 msec#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:00:01.1
> > > T5971;pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:01:00.0
> > >
> > > This shows that there seems to be no problem with ACPICA: acpi_power_on_list(_debug)() returns -ENODEV (-19),
> > > and the crash occurs later.
> > >
> > > This leaves two questions:
> > > 1. Is this crash avoidable by different error handling in the pci code?
> > > 2. If the crash is not avoidable, can we at least modify the error handling in such a way that
> > > we get an error message through netconsole by default? (perhaps a little delay will suffice)
> >
> > I'm not sure how far this is going to get you, but you may try the
> > attached patch.
>
> This looks worth trying, I'll try it once my current test run has crashed.
>
> Currently I'm trying to figure out why this line is there:
>
> pcieport 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:01:00.0
>
> This line comes from this part of pci_power_up():
>
> 	pci_read_config_word(dev, dev->pm_cap + PCI_PM_CTRL, &pmcsr);
> 	if (PCI_POSSIBLE_ERROR(pmcsr)) {
> 		pci_err(dev, "Unable to change power state from %s to D0, device inaccessible\n",
> 			pci_power_name(dev->current_state));
> 		WARN(1, "Who is calling %s?\n", __func__); // My debug statement. (No result, yet.)
> 		dev->current_state = PCI_D3cold;
> 		return -EIO;
> 	}
>
> The interesting thing here is that the pci device 0000:01:00.0 has already been disconnected
> (with pci_dev_set_disconnected()) when the resume of the bridge at 0000:00:01.1 failed
> (in the failure path of pci_pm_bridge_power_up_actions()) (I know for sure
> because I put printk()s there, too).

I would expect the pci_dev_is_disconnected() check in pci_power_up()
to trigger then.

> I'm not sure if pci_power_up() should be called in this case.

This is a failing case anyway, I don't think that avoiding the call
would help much.

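The expectation stated in the reply above can be illustrated with a small user-space model of a power-up routine that checks a disconnected flag before touching config space. This is a hedged sketch and not the kernel's pci_power_up(): every `model_` name is hypothetical, the config read is faked to return all ones (the value an inaccessible device typically yields), and the counter exists only so the two paths can be told apart.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

enum model_state { MODEL_D0, MODEL_D3COLD };

struct model_dev {
	bool disconnected;	/* set by the parent bridge's failure path */
	enum model_state state;
	unsigned int cfg_reads;	/* number of attempted config space reads */
};

/* Fake config read: an inaccessible device returns all ones. */
static inline uint16_t model_read_pmcsr(struct model_dev *dev)
{
	dev->cfg_reads++;
	return 0xffff;
}

static inline int model_power_up(struct model_dev *dev)
{
	uint16_t pmcsr;

	assert(dev != NULL);

	/* Early exit: no config access for a device already marked as gone. */
	if (dev->disconnected) {
		dev->state = MODEL_D3COLD;
		return -EIO;
	}

	pmcsr = model_read_pmcsr(dev);
	if (pmcsr == 0xffff) {
		/* A "device inaccessible" message would be printed on this path. */
		dev->state = MODEL_D3COLD;
		return -EIO;
	}

	dev->state = MODEL_D0;
	return 0;
}
```

In the model both paths end in the same state and return value; the only difference is whether a config read is attempted and the message path is reached. Seeing the message in the log would then mean that the flag was not yet observed as set when the read happened, which is one possible reading of the log, not a confirmed diagnosis.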
* Re: Crash during resume of pcie bridge due to incorrect error handling
2025-12-02 20:01 ` Rafael J. Wysocki
@ 2025-12-05 10:05 ` Bert Karwatzki
0 siblings, 0 replies; 31+ messages in thread
From: Bert Karwatzki @ 2025-12-05 10:05 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Christian König, Mario Limonciello (AMD) (kernel.org),
linux-kernel, linux-next, regressions, linux-pci, linux-acpi,
Rafael J . Wysocki, acpica-devel, Robert Moore, Saket Dumbre,
spasswolf, Bjorn Helgaas

I've got good and bad news on this. The good news first: I was able to get through a failed
resume without a crash with the following changes to the pci resume process:

1. pci_pm_bridge_power_up_actions() returns -ENODEV when pci_bridge_wait_for_secondary_bus()
returns an error (pci_walk_bus_debug() is just pci_walk_bus() with added printk()s), and also
sets the state to PCI_D3cold on failure:

static int pci_pm_bridge_power_up_actions(struct pci_dev *pci_dev)
{
	int ret;

	dev_info(&pci_dev->dev, "%s %d\n", __func__, __LINE__);
	ret = pci_bridge_wait_for_secondary_bus(pci_dev, "resume");
	dev_info(&pci_dev->dev, "%s %d: ret = %d\n", __func__, __LINE__, ret);
	if (ret) {
		/*
		 * The downstream link failed to come up, so mark the
		 * devices below as disconnected to make sure we don't
		 * attempt to resume them.
		 */
		pci_walk_bus_debug(pci_dev->subordinate, pci_dev_set_disconnected,
				   NULL);
		pci_update_current_state(pci_dev, PCI_D3cold);
		dev_info(&pci_dev->dev, "%s: bridge failed to power up\n", __func__);
		return -ENODEV;
	}

	/*
	 * When powering on a bridge from D3cold, the whole hierarchy may be
	 * powered on into D0uninitialized state, resume them to give them a
	 * chance to suspend again
	 */
	pci_resume_bus(pci_dev->subordinate);
	return 0;
}

2. pci_pm_runtime_resume() has an early exit if pci_pm_bridge_power_up_actions() fails:

static int pci_pm_runtime_resume(struct device *dev)
{
	struct pci_dev *pci_dev = to_pci_dev(dev);
	const struct dev_pm_ops *pm = dev->driver ? dev->driver->pm : NULL;
	pci_power_t prev_state = pci_dev->current_state;
	int error = 0;

	/*
	 * Restoring config space is necessary even if the device is not bound
	 * to a driver because although we left it in D0, it may have gone to
	 * D3cold when the bridge above it runtime suspended.
	 */
	pci_pm_default_resume_early(pci_dev);
	pci_resume_ptm(pci_dev);

	if (!pci_dev->driver)
		return 0;

	pci_fixup_device(pci_fixup_resume_early, pci_dev);
	pci_pm_default_resume(pci_dev);

	if (prev_state == PCI_D3cold) {
		error = pci_pm_bridge_power_up_actions(pci_dev);
		if (error)
			return error;
	}
	[...]
}

3. acpiphp_check_bridge() has an early exit if pm_runtime_get_sync() returns an error
(pm_runtime_get_sync() basically returns the result of pci_pm_runtime_resume(), which is
called as a callback in rpm_resume()):

static void acpiphp_check_bridge(struct acpiphp_bridge *bridge)
{
	struct acpiphp_slot *slot;
	int ret;

	/* Bail out if the bridge is going away. */
	if (bridge->is_going_away)
		return;

	if (bridge->pci_dev) {
		ret = pm_runtime_get_sync(&bridge->pci_dev->dev);
		if (ret < 0) {
			dev_info(&bridge->pci_dev->dev, "%s: pm_runtime_get_sync() failed with ret = %d\n", __func__, ret);
			return;
		}
	}
	[...]
}

With these changes I get the following messages in dmesg when acpi_power_on_list() fails for the pci bridge:

2025-12-05T01:58:53.260991+01:00 lisa kernel: [ T1772] acpi_power_on_list_debug 649: result = -19
2025-12-05T01:58:53.260993+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: pci_pm_default_resume_early 567
2025-12-05T01:58:53.260994+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: pci_pm_bridge_power_up_actions 576
2025-12-05T01:58:54.282032+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: broken device, retraining non-functional downstream link at 2.5GT/s
2025-12-05T01:58:55.282033+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: retraining failed
2025-12-05T01:58:55.282053+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: Data Link Layer Link Active not set in 1000 msec
2025-12-05T01:58:55.282055+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: pci_pm_bridge_power_up_actions 578: ret = -25
2025-12-05T01:58:55.282057+01:00 lisa kernel: [ T1772] pcieport 0000:01:00.0: __pci_walk_bus_debug 0
2025-12-05T01:58:55.282058+01:00 lisa kernel: [ T1772] pcieport 0000:01:00.0: pci_dev_set_disconnected: 0
2025-12-05T01:58:55.282060+01:00 lisa kernel: [ T1772] pcieport 0000:01:00.0: pci_dev_set_disconnected: 1
2025-12-05T01:58:55.282063+01:00 lisa kernel: [ T1772] pcieport 0000:01:00.0: __pci_walk_bus_debug 1: ret = 0
2025-12-05T01:58:55.282065+01:00 lisa kernel: [ T1772] pcieport 0000:01:00.0: __pci_walk_bus_debug 1.0
2025-12-05T01:58:55.282067+01:00 lisa kernel: [ T1772] pcieport 0000:02:00.0: __pci_walk_bus_debug 0
2025-12-05T01:58:55.282070+01:00 lisa kernel: [ T1772] pcieport 0000:02:00.0: pci_dev_set_disconnected: 0
2025-12-05T01:58:55.282073+01:00 lisa kernel: [ T1772] pcieport 0000:02:00.0: pci_dev_set_disconnected: 1
2025-12-05T01:58:55.282101+01:00 lisa kernel: [ T1772] pcieport 0000:02:00.0: __pci_walk_bus_debug 1: ret = 0
2025-12-05T01:58:55.282103+01:00 lisa kernel: [ T1772] pcieport 0000:02:00.0: __pci_walk_bus_debug 1.0
2025-12-05T01:58:55.282105+01:00 lisa kernel: [ T1772] amdgpu 0000:03:00.0: __pci_walk_bus_debug 0
2025-12-05T01:58:55.282107+01:00 lisa kernel: [ T1772] amdgpu 0000:03:00.0: pci_dev_set_disconnected: 0
2025-12-05T01:58:55.282109+01:00 lisa kernel: [ T1772] amdgpu 0000:03:00.0: pci_dev_set_disconnected: 1
2025-12-05T01:58:55.282111+01:00 lisa kernel: [ T1772] amdgpu 0000:03:00.0: __pci_walk_bus_debug 1: ret = 0
2025-12-05T01:58:55.282113+01:00 lisa kernel: [ T1772] snd_hda_intel 0000:03:00.1: __pci_walk_bus_debug 0
2025-12-05T01:58:55.282115+01:00 lisa kernel: [ T1772] snd_hda_intel 0000:03:00.1: pci_dev_set_disconnected: 0
2025-12-05T01:58:55.282116+01:00 lisa kernel: [ T1772] snd_hda_intel 0000:03:00.1: pci_dev_set_disconnected: 1
2025-12-05T01:58:55.282118+01:00 lisa kernel: [ T1772] snd_hda_intel 0000:03:00.1: __pci_walk_bus_debug 1: ret = 0
2025-12-05T01:58:55.282120+01:00 lisa kernel: [ T1772] __pci_walk_bus_debug: ret = 0
2025-12-05T01:58:55.282122+01:00 lisa kernel: [ T1772] pcieport 0000:02:00.0: __pci_walk_bus_debug 1.1: ret = 0
2025-12-05T01:58:55.282124+01:00 lisa kernel: [ T1772] __pci_walk_bus_debug: ret = 0
2025-12-05T01:58:55.282126+01:00 lisa kernel: [ T1772] pcieport 0000:01:00.0: __pci_walk_bus_debug 1.1: ret = 0
2025-12-05T01:58:55.282128+01:00 lisa kernel: [ T1772] __pci_walk_bus_debug: ret = 0
2025-12-05T01:58:55.282130+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: pci_pm_bridge_power_up_actions: bridge failed to power up
2025-12-05T01:58:55.282131+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 916 retval = -19
2025-12-05T01:58:55.282133+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 922
2025-12-05T01:58:55.282135+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 937
2025-12-05T01:58:55.282137+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 947
2025-12-05T01:58:55.282138+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 950
2025-12-05T01:58:55.282140+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -19
2025-12-05T01:58:55.282141+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T01:58:55.523974+01:00 lisa kernel: [T192257] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T01:59:21.896980+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T01:59:21.896986+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T01:59:22.141965+01:00 lisa kernel: [T192400] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T01:59:51.903964+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T01:59:51.903974+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T01:59:52.153973+01:00 lisa kernel: [T192554] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:00:21.915965+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:00:21.915971+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:00:22.161971+01:00 lisa kernel: [T192706] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:00:51.922981+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:00:51.922988+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:00:52.175975+01:00 lisa kernel: [T192899] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:00:52.931997+01:00 lisa kernel: [T180476] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:00:52.932008+01:00 lisa kernel: [T180476] pcieport 0000:00:01.1: acpiphp_check_bridge: pm_runtime_get_sync() failed with ret = -22
2025-12-05T02:01:21.933961+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:01:21.933965+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:01:22.172962+01:00 lisa kernel: [T193054] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:01:23.001998+01:00 lisa kernel: [T192396] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:01:23.002009+01:00 lisa kernel: [T192396] pcieport 0000:00:01.1: acpiphp_check_bridge: pm_runtime_get_sync() failed with ret = -22
2025-12-05T02:01:51.942980+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:01:51.942986+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:01:52.188969+01:00 lisa kernel: [T193209] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:01:52.958995+01:00 lisa kernel: [T188262] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:01:52.959009+01:00 lisa kernel: [T188262] pcieport 0000:00:01.1: acpiphp_check_bridge: pm_runtime_get_sync() failed with ret = -22
2025-12-05T02:02:21.951971+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:02:21.951980+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:02:22.205974+01:00 lisa kernel: [T193359] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:02:23.117988+01:00 lisa kernel: [T192396] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:02:23.118000+01:00 lisa kernel: [T192396] pcieport 0000:00:01.1: acpiphp_check_bridge: pm_runtime_get_sync() failed with ret = -22
2025-12-05T02:02:51.963970+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:02:51.963976+01:00 lisa kernel: [ T1772] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:02:52.209969+01:00 lisa kernel: [T193512] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:02:53.465055+01:00 lisa kernel: [T192396] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:02:53.465074+01:00 lisa kernel: [T192396] pcieport 0000:00:01.1: acpiphp_check_bridge: pm_runtime_get_sync() failed with ret = -22
2025-12-05T02:02:57.133996+01:00 lisa kernel: [T192396] pcieport 0000:00:01.1: rpm_resume 957: retval = -22
2025-12-05T02:02:57.134006+01:00 lisa kernel: [T192396] pcieport 0000:00:01.1: acpiphp_check_bridge: pm_runtime_get_sync() failed with ret = -22

These lines continue; all further resumes fail, but no crash occurs. The
complete debug code used is here (it's rather messy though):
https://gitlab.freedesktop.org/spasswolf/linux-stable/-/commits/amdgpu_suspend_resume?ref_type=heads

I have not yet tested whether the fix above also works on a pure v6.14 without
all the debug patches.

The bad news is I've encountered another version of the crash.
(at least 90% of these crashes occur after acpi_power_on() fails as above, but
at least 2 crashes happened on suspend)
Now a third version has appeared, which occurred after an SMU resume failure in
amdgpu (SMU failures have appeared before, but up to now they did not result in
a crash like this):

2025-12-04T12:52:19.589753+01:00 T39596;amdgpu 0000:03:00.0: amdgpu: SMU is resuming...#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-12-04T12:52:19.589753+01:00 T39596;amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000f, smu fw if version = 0x00000013, smu fw program = 0, version = 0x003b3100 (59.49.0)#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-12-04T12:52:19.589753+01:00 T39596;amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-12-04T12:52:19.861952+01:00 T39596;amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:6 param:0x00000000 message:EnableAllSmuFeatures?#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-12-04T12:52:19.861952+01:00 T39596;amdgpu 0000:03:00.0: amdgpu: Failed to enable requested dpm features!#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-12-04T12:52:19.861952+01:00 T39596;amdgpu 0000:03:00.0: amdgpu: Failed to setup smc hw!#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-12-04T12:52:19.861952+01:00 T39596;amdgpu 0000:03:00.0: amdgpu: resume of IP block <smu> failed -121#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-12-04T12:52:19.861952+01:00 T39596;amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_resume failed (-121).#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0
2025-12-04T12:52:19.861952+01:00 T39596;amdgpu 0000:03:00.0: pci_pm_runtime_resume 1380 error = -121 state = 0x0#012 SUBSYSTEM=pci#012 DEVICE=+pci:0000:03:00.0

From here on the system hangs for quite a long time (9s!) while these error
messages appear on netconsole:

2025-12-04T12:52:20.870639+01:00 C0;INFO: NMI handler (perf_event_nmi_handler) took too long to run: 122.609 msecs
2025-12-04T12:52:21.072400+01:00 C0;perf: interrupt took too long (958843 > 2500), lowering kernel.perf_event_max_sample_rate to 1000
2025-12-04T12:52:21.879308+01:00 C5;INFO: NMI handler (perf_event_nmi_handler) took too long to run: 138.016 msecs
2025-12-04T12:52:23.493066+01:00 C13;INFO: NMI handler (perf_event_nmi_handler) took too long to run: 162.351 msecs
2025-12-04T12:52:23.694900+01:00 C13;perf: interrupt took too long (1268455 > 1198553), lowering kernel.perf_event_max_sample_rate to 1000
2025-12-04T12:52:26.318173+01:00 C11;INFO: NMI handler (perf_event_nmi_handler) took too long to run: 182.852 msecs
2025-12-04T12:52:28.940844+01:00 C0;perf: interrupt took too long (2079549 > 1585568), lowering kernel.perf_event_max_sample_rate to 1000
[crash, no further messages]

So perhaps acpiphp_check_bridge() is not the only place where a returned error
is incorrectly ignored.

Bert Karwatzki

^ permalink raw reply [flat|nested] 31+ messages in thread
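A note on the return codes in the logs above: they are negative Linux errno
values (-19, -22, -25 and -121). The rpm_resume lines switch from
retval = -19 to retval = -22 after the first failure, which is consistent with
the documented runtime PM rule that a failed resume callback is treated as
fatal until the device status is reset explicitly. The following is only a
userspace sketch of that behaviour, not kernel code: struct model_dev,
model_errname(), model_failing_resume() and model_rpm_resume() are hypothetical
names made up for illustration, and the latch is a simplification of what the
PM core really does.

```c
/* Userspace model only, NOT kernel code. All names here are hypothetical. */
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Stand-in for the single piece of per-device state the model needs. */
struct model_dev {
	int runtime_error;	/* 0, or the negative errno of a failed callback */
};

/* Map the negative errno values seen in the logs to their Linux names. */
static const char *model_errname(int err)
{
	switch (-err) {
	case 0:		return "0";
	case ENODEV:	return "ENODEV";	/* -19 */
	case EINVAL:	return "EINVAL";	/* -22 */
	case ENOTTY:	return "ENOTTY";	/* -25 */
	case EREMOTEIO:	return "EREMOTEIO";	/* -121 */
	default:	return "unknown";
	}
}

/* A resume callback that always fails the way the bridge did in the log. */
static int model_failing_resume(struct model_dev *dev)
{
	(void)dev;
	return -ENODEV;
}

/*
 * Simplified resume path: once an error has been latched, further
 * attempts are refused with -EINVAL and the callback is not called again.
 */
static int model_rpm_resume(struct model_dev *dev,
			    int (*cb)(struct model_dev *))
{
	int ret;

	assert(dev != NULL && cb != NULL);

	if (dev->runtime_error)
		return -EINVAL;

	ret = cb(dev);
	if (ret)
		dev->runtime_error = ret;
	return ret;
}
```

In this model the first call to model_rpm_resume() with
model_failing_resume() returns -ENODEV and every later call returns -EINVAL,
matching the -19 followed by repeated -22 pattern above. Separately, and not
something the logs here can confirm either way: pm_runtime_get_sync()
increments the usage counter even when it fails, so an early return after it
would normally be paired with pm_runtime_put_noidle(), or the call replaced by
pm_runtime_resume_and_get().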
Thread overview: 31+ messages
2025-10-06 12:09 [REGRESSION 00/04] Crash during resume of pcie bridge Bert Karwatzki
2025-10-06 12:09 ` [REGRESSION 01/04] " Bert Karwatzki
2025-10-06 12:09 ` [REGRESSION 02/04] " Bert Karwatzki
2025-10-06 12:09 ` [REGRESSION 03/04] " Bert Karwatzki
2025-10-06 12:09 ` [REGRESSION 04/04] " Bert Karwatzki
2025-10-06 12:39 ` [REGRESSION 00/04] " Christian König
2025-10-06 16:22 ` Bert Karwatzki
2025-10-07 6:50 ` Bert Karwatzki
2025-10-07 21:33 ` Mario Limonciello
2025-10-13 16:29 ` Bert Karwatzki
2025-10-13 18:51 ` Mario Limonciello
2025-10-14 10:50 ` Christian König
[not found] ` <1853e2af7f70cf726df278137b6d2d89d9d9dc82.camel@web.de>
2025-10-31 13:38 ` Bert Karwatzki
2025-10-31 13:47 ` Bert Karwatzki
2025-10-31 18:35 ` Bert Karwatzki
2025-11-05 11:44 ` Bert Karwatzki
2025-11-05 21:31 ` Mario Limonciello (AMD) (kernel.org)
2025-11-07 13:09 ` Bert Karwatzki
2025-11-07 17:09 ` Bert Karwatzki
2025-11-10 13:33 ` Christian König
2025-11-16 21:08 ` Crash during resume of pcie bridge due to infinite loop in ACPICA Bert Karwatzki
2025-11-17 16:40 ` Rafael J. Wysocki
2025-11-24 22:34 ` Bert Karwatzki
2025-11-25 19:46 ` Rafael J. Wysocki
2025-11-27 0:08 ` Bert Karwatzki
2025-11-27 13:02 ` Rafael J. Wysocki
2025-11-28 20:47 ` Bert Karwatzki
2025-12-02 18:59 ` Rafael J. Wysocki
2025-12-02 19:53 ` Bert Karwatzki
2025-12-02 20:01 ` Rafael J. Wysocki
2025-12-05 10:05 ` Crash during resume of pcie bridge due to incorrect error handling Bert Karwatzki