Hi,

A commit in 6.18.19 has introduced a bug and a new warning when doing
amdgpu driver re-binding. In addition to the bug, the last line of the
output below is a new warning about the thermal alert.

This bug doesn't seem to cause any show-stopping problems, but it is a
bug and it persists into 6.18.20.

I can do a bisect if needed, but I'm hoping one of our AMD guys can more
quickly spot what's going on :)

amdgpu 0000:14:00.0: amdgpu: amdgpu: finishing device.
------------[ cut here ]------------
WARNING: CPU: 1 PID: 2773 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:639 amdgpu_irq_put+0xa4/0xc0 [amdgpu]
Modules linked in: iptable_nat nf_nat ipt_REJECT nf_reject_ipv4 xt_multiport xt_LOG nf_log_syslog xt_state xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xt_tcpudp iptable_filter ip_tables x_tables bridge stp llc ipv6 nct6775 tun sg pcspkr nct6775_core nct6683 hwmon_vid edac_mce_amd uas usb_storage onboard_usb_dev joydev hid_generic usbhid hid amdgpu tpm_crb amdxcp drm_panel_backlight_quirks gpu_sched drm_buddy drm_ttm_helper snd_hda_codec_alc882 snd_hda_codec_realtek_lib ttm drm_exec intel_rapl_msr amd_atl snd_hda_codec_generic drm_suballoc_helper drm_client_lib intel_rapl_common snd_hda_codec_atihdmi snd_hda_codec_hdmi drm_display_helper cec snd_hda_intel rc_core snd_hda_codec kvm_amd snd_hda_core drm_kms_helper ee1004 wmi_bmof r8169 snd_intel_dspcfg realtek snd_intel_sdw_acpi mdio_devres kvm snd_hwdep drm of_mdio snd_pcm agpgart polyval_clmulni i2c_designware_pci fixed_phy ghash_clmulni_intel snd_timer i2c_algo_bit i2c_piix4 fwnode_mdio i2c_designware_core rapl video i2c_smbus i2c_ccgx_ucsi snd libphy xhci_pci mfd_core soundcore ccp k10temp i2c_core mdio_bus igc xhci_hcd wmi gpio_amdpt tpm_tis gpio_generic tpm_tis_core evdev loop dm_snapshot dm_bufio vfio_pci vfio_pci_core vfio_iommu_type1 vfio iommufd irqbypass
CPU: 1 UID: 0 PID: 2773 Comm: bind-device.sh Not tainted 6.18.20 #1 PREEMPT(lazy)
Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./B550 Taichi, BIOS P2.00 08/05/2021
RIP: 0010:amdgpu_irq_put+0xa4/0xc0 [amdgpu]
Code: ea 48 8d 14 90 8b 12 85 d2 75 ae 5b b8 ea ff ff ff 5d 41 5c e9 a8 c9 e1 d1 89 ea 48 89 de 4c 89 e7 5b 5d 41 5c e9 9c fd ff ff <0f> 0b b8 ea ff ff ff eb a8 b8 fe ff ff ff eb a1 90 66 66 2e 0f 1f
RSP: 0018:ffffcf4a8777fce0 EFLAGS: 00010246
RAX: ffff8c634233a908 RBX: ffff8c6345564008 RCX: 0000000000000000
RDX: 0000000000000000 RSI: ffff8c6345564008 RDI: ffff8c6348900000
RBP: 0000000000000000 R08: 0000000000000001 R09: ffff8c63455648e4
R10: ffffffffc14935d0 R11: ffff8c6375dd8470 R12: ffff8c6348900000
R13: ffff8c6345564000 R14: ffff8c6348900000 R15: 0000000000000000
FS: 00007f0549119740(0000) GS:ffff8c8268c28000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000000002d3fa810 CR3: 0000000282aa9000 CR4: 0000000000f50ef0
PKRU: 55555554
Call Trace:
 smu_smc_hw_cleanup+0x61/0x490 [amdgpu]
 smu_hw_fini+0xef/0x180 [amdgpu]
 amdgpu_ip_block_hw_fini+0x37/0x41 [amdgpu]
 amdgpu_device_fini_hw+0x20d/0x284 [amdgpu]
 amdgpu_pci_remove+0x48/0x80 [amdgpu]
 pci_device_remove+0x46/0xb0
 device_release_driver_internal+0x19a/0x200
 unbind_store+0xa0/0xb0
 kernfs_fop_write_iter+0x149/0x200
 vfs_write+0x259/0x4b0
 ksys_write+0x6f/0xe0
 do_syscall_64+0x4c/0x1130
 entry_SYSCALL_64_after_hwframe+0x76/0x7e
RIP: 0033:0x7f0548e98bfa
Code: b8 04 00 00 00 48 8b 15 ec 71 16 00 64 89 02 48 c7 c2 ff ff ff ff 48 83 c4 18 48 89 d0 c3 66 90 49 89 ca 48 8b 44 24 20 0f 05 <48> 63 d0 3d 00 f0 ff ff 77 0c 48 89 d0 48 83 c4 18 c3 0f 1f 40 00
RSP: 002b:00007ffeeb12b470 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
RAX: ffffffffffffffda RBX: 00007f0549001780 RCX: 00007f0548e98bfa
RDX: 000000000000000d RSI: 000000002d3fa810 RDI: 0000000000000001
RBP: 000000000000000d R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000202 R12: 000000000000000d
R13: 000000002d3fa810 R14: 000000002d3fa810 R15: 0000000000000000
---[ end trace 0000000000000000 ]---
amdgpu 0000:14:00.0: amdgpu: Fail to disable thermal alert!

-- 
Cal Peake