All of lore.kernel.org
 help / color / mirror / Atom feed
* [Bug 221297] New: AMDGPU SMU driver interface version mismatch on R9700 - fan control broken under load
@ 2026-03-29 16:18 bugzilla-daemon
  2026-03-30  8:51 ` [Bug 221297] " bugzilla-daemon
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: bugzilla-daemon @ 2026-03-29 16:18 UTC (permalink / raw)
  To: dri-devel

https://bugzilla.kernel.org/show_bug.cgi?id=221297

            Bug ID: 221297
           Summary: AMDGPU SMU driver interface version mismatch on R9700
                    - fan control broken under load
           Product: Drivers
           Version: 2.5
          Hardware: All
                OS: Linux
            Status: NEW
          Severity: normal
          Priority: P3
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri@kernel-bugs.osdl.org
          Reporter: regbob.home@gmail.com
        Regression: No

Hardware: ASUS Turbo Radeon AI Pro R9700 32GB
vBIOS: 115-G287BP00-100
OS: Ubuntu 24.04
Kernel: 6.17.0-19
ROCm: 7.2.1
AMDGPU driver: 6.16.13

BUG:
The GPU fan does not spin up automatically under thermal load. During AI
training the GPU reached 109°C and thermally throttled with the fan physically
stationary throughout.

ROOT CAUSE - SMU interface version mismatch (from dmesg):

amdgpu 0000:2d:00.0: amdgpu: smu driver if version = 0x0000002e (46)
amdgpu 0000:2d:00.0: amdgpu: smu fw if version = 0x00000032 (50)
amdgpu 0000:2d:00.0: amdgpu: smu fw version = 0x00684b00 (104.75.0)
amdgpu 0000:2d:00.0: amdgpu: SMU driver if version not matched

The card firmware is 4 interface versions ahead of what the AMDGPU driver
supports. As a result all fan control registers are inaccessible:
- rocm-smi --setfan returns 'Not supported on this system'
- sysfs pwm1 node is READ-ONLY (-r--r--r--)
- fan1_enable returns 'Invalid argument' when read
- GPU enters runtime power suspend under load, further suppressing fan response

Confirmed on AMDGPU 6.16.6 and 6.16.13 under ROCm 7.2.1.

FIX REQUIRED:
The amdgpu kernel driver needs to be updated to support SMU interface version
50 (0x00000032) as shipped on the R9700 (gfx1201, RDNA 4).

Related ROCm issue: https://github.com/ROCm/ROCm/issues/5908

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2026-04-01  8:32 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-03-29 16:18 [Bug 221297] New: AMDGPU SMU driver interface version mismatch on R9700 - fan control broken under load bugzilla-daemon
2026-03-30  8:51 ` [Bug 221297] " bugzilla-daemon
2026-03-30 13:01 ` bugzilla-daemon
2026-04-01  8:32 ` bugzilla-daemon

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.