All of lore.kernel.org
 help / color / mirror / Atom feed
From: bugzilla-daemon@kernel.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 221297] New: AMDGPU SMU driver interface version mismatch on R9700 - fan control broken under load
Date: Sun, 29 Mar 2026 16:18:39 +0000	[thread overview]
Message-ID: <bug-221297-2300@https.bugzilla.kernel.org/> (raw)

https://bugzilla.kernel.org/show_bug.cgi?id=221297

            Bug ID: 221297
           Summary: AMDGPU SMU driver interface version mismatch on R9700
                    - fan control broken under load
           Product: Drivers
           Version: 2.5
          Hardware: All
                OS: Linux
            Status: NEW
          Severity: normal
          Priority: P3
         Component: Video(DRI - non Intel)
          Assignee: drivers_video-dri@kernel-bugs.osdl.org
          Reporter: regbob.home@gmail.com
        Regression: No

Hardware: ASUS Turbo Radeon AI Pro R9700 32GB
vBIOS: 115-G287BP00-100
OS: Ubuntu 24.04
Kernel: 6.17.0-19
ROCm: 7.2.1
AMDGPU driver: 6.16.13

BUG:
The GPU fan does not spin up automatically under thermal load. During AI
training the GPU reached 109°C and thermally throttled with the fan physically
stationary throughout.

ROOT CAUSE - SMU interface version mismatch (from dmesg):

amdgpu 0000:2d:00.0: amdgpu: smu driver if version = 0x0000002e (46)
amdgpu 0000:2d:00.0: amdgpu: smu fw if version = 0x00000032 (50)
amdgpu 0000:2d:00.0: amdgpu: smu fw version = 0x00684b00 (104.75.0)
amdgpu 0000:2d:00.0: amdgpu: SMU driver if version not matched

The card firmware is 4 interface versions ahead of what the AMDGPU driver
supports. As a result all fan control registers are inaccessible:
- rocm-smi --setfan returns 'Not supported on this system'
- sysfs pwm1 node is READ-ONLY (-r--r--r--)
- fan1_enable returns 'Invalid argument' when read
- GPU enters runtime power suspend under load, further suppressing fan response

Confirmed on AMDGPU 6.16.6 and 6.16.13 under ROCm 7.2.1.

FIX REQUIRED:
The amdgpu kernel driver needs to be updated to support SMU interface version
50 (0x00000032) as shipped on the R9700 (gfx1201, RDNA 4).

Related ROCm issue: https://github.com/ROCm/ROCm/issues/5908

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

             reply	other threads:[~2026-03-29 16:18 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-03-29 16:18 bugzilla-daemon [this message]
2026-03-30  8:51 ` [Bug 221297] AMDGPU SMU driver interface version mismatch on R9700 - fan control broken under load bugzilla-daemon
2026-03-30 13:01 ` bugzilla-daemon
2026-04-01  8:32 ` bugzilla-daemon

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bug-221297-2300@https.bugzilla.kernel.org/ \
    --to=bugzilla-daemon@kernel.org \
    --cc=dri-devel@lists.freedesktop.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.