From mboxrd@z Thu Jan  1 00:00:00 1970
From: Greg Kroah-Hartman
To: stable@vger.kernel.org
Cc:
 Greg Kroah-Hartman, patches@lists.linux.dev, Yang Wang, Alex Deucher,
 Sasha Levin
Subject: [PATCH 6.19 072/342] drm/amdgpu: fix gpu idle power consumption issue for gfx v12
Date: Tue, 31 Mar 2026 18:18:25 +0200
Message-ID: <20260331161801.543887498@linuxfoundation.org>
X-Mailer: git-send-email 2.53.0
In-Reply-To: <20260331161758.909578033@linuxfoundation.org>
References: <20260331161758.909578033@linuxfoundation.org>
User-Agent: quilt/0.69
X-stable: review
X-Patchwork-Hint: ignore
Precedence: bulk
X-Mailing-List: stable@vger.kernel.org
List-Id:
List-Subscribe:
List-Unsubscribe:
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit

6.19-stable review patch.  If anyone has any objections, please let me know.

------------------

From: Yang Wang

[ Upstream commit a6571045cf06c4aa749b4801382ae96650e2f0e1 ]

Older versions of the MES firmware may cause abnormal GPU power
consumption. When performing inference tasks on the GPU (e.g., with
Ollama using ROCm), the GPU may show abnormal power consumption in idle
state and incorrect GPU load information. This issue has been fixed in
firmware version 0x8b and newer.

Closes: https://github.com/ROCm/ROCm/issues/5706
Signed-off-by: Yang Wang
Acked-by: Alex Deucher
Signed-off-by: Alex Deucher
(cherry picked from commit 4e22a5fe6ea6e0b057e7f246df4ac3ff8bfbc46a)
Signed-off-by: Sasha Levin
---
 drivers/gpu/drm/amd/amdgpu/mes_v12_0.c | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/drivers/gpu/drm/amd/amdgpu/mes_v12_0.c b/drivers/gpu/drm/amd/amdgpu/mes_v12_0.c
index 231aba48d8d28..dcafbd7066c40 100644
--- a/drivers/gpu/drm/amd/amdgpu/mes_v12_0.c
+++ b/drivers/gpu/drm/amd/amdgpu/mes_v12_0.c
@@ -731,6 +731,9 @@ static int mes_v12_0_set_hw_resources(struct amdgpu_mes *mes, int pipe)
 	int i;
 	struct amdgpu_device *adev = mes->adev;
 	union MESAPI_SET_HW_RESOURCES mes_set_hw_res_pkt;
+	uint32_t mes_rev = (pipe == AMDGPU_MES_SCHED_PIPE) ?
+		(mes->sched_version & AMDGPU_MES_VERSION_MASK) :
+		(mes->kiq_version & AMDGPU_MES_VERSION_MASK);
 
 	memset(&mes_set_hw_res_pkt, 0, sizeof(mes_set_hw_res_pkt));
 
@@ -785,7 +788,7 @@ static int mes_v12_0_set_hw_resources(struct amdgpu_mes *mes, int pipe)
 	 * handling support, other queue will not use the oversubscribe timer.
 	 * handling mode - 0: disabled; 1: basic version; 2: basic+ version
 	 */
-	mes_set_hw_res_pkt.oversubscription_timer = 50;
+	mes_set_hw_res_pkt.oversubscription_timer = mes_rev < 0x8b ? 0 : 50;
 	mes_set_hw_res_pkt.unmapped_doorbell_handling = 1;
 
 	if (amdgpu_mes_log_enable) {
-- 
2.51.0
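
The version gate in the hunks above can be sketched outside the driver as
follows. This is only an illustration, not the driver code: the SKETCH_*
names and both helper functions are hypothetical, and the mask value
0x00000fff is an assumption standing in for AMDGPU_MES_VERSION_MASK, whose
real definition lives in the amdgpu headers. The threshold 0x8b and the
timer values 0 and 50 are taken from the patch.

```c
#include <stdint.h>

/* Assumed stand-in for AMDGPU_MES_VERSION_MASK (not taken from the patch). */
#define SKETCH_MES_VERSION_MASK 0x00000fffu
/* First firmware revision carrying the fix, per the commit message. */
#define SKETCH_MES_FIXED_REV    0x8bu

/* Extract the revision field from a raw MES firmware version word. */
static uint32_t sketch_mes_rev(uint32_t raw_version)
{
	return raw_version & SKETCH_MES_VERSION_MASK;
}

/*
 * Mirror the patched assignment: older firmware gets the oversubscription
 * timer disabled (0), fixed firmware keeps the previous value (50).
 */
static uint32_t sketch_oversubscription_timer(uint32_t raw_version)
{
	return sketch_mes_rev(raw_version) < SKETCH_MES_FIXED_REV ? 0 : 50;
}
```

Under these assumptions, a raw version whose masked revision is 0x8a yields
a timer of 0, while 0x8b or anything newer yields 50. In the patch itself
the raw value comes from mes->sched_version or mes->kiq_version, depending
on the pipe.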