From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@freedesktop.org
Subject: [Bug 110674] Crashes / Resets From AMDGPU / Radeon VII
Date: Sun, 19 May 2019 23:02:27 +0000
Message-ID:
References:
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="===============0179602056=="
Return-path:
Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 6B4B5891AF for ; Sun, 19 May 2019 23:02:27 +0000 (UTC)
In-Reply-To:
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: dri-devel-bounces@lists.freedesktop.org
Sender: "dri-devel"
To: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org

--===============0179602056==
Content-Type: multipart/alternative; boundary="15583069471.AD48.24795"
Content-Transfer-Encoding: 7bit

--15583069471.AD48.24795
Date: Sun, 19 May 2019 23:02:27 +0000
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated

https://bugs.freedesktop.org/show_bug.cgi?id=110674

--- Comment #25 from Tom B ---
On 5.1.3 (and presumably all 5.1 kernels) I am seeing a strange power profile.

Can everyone else run sensors (after sensors-detect if you don't have the
amdgpu device showing)?

I'm seeing this:

amdgpu-pci-4400
Adapter: PCI adapter
vddgfx:       +1.11 V
fan1:           0 RPM  (min =    0 RPM, max = 3850 RPM)
temp1:        +33.0°C  (crit = +118.0°C, hyst = -273.1°C)
power1:      135.00 W  (cap = 250.00 W)


Even at idle my GPU is running at 1100 mV (my default base voltage) and
constantly running at 135 W.


My output of cat /sys/kernel/debug/dri/0/amdgpu_pm_info shows the same thing:

Clock Gating Flags Mask: 0x36974f
        Graphics Medium Grain Clock Gating: On
        Graphics Medium Grain memory Light Sleep: On
        Graphics Coarse Grain Clock Gating: On
        Graphics Coarse Grain memory Light Sleep: On
        Graphics Coarse Grain Tree Shader Clock Gating: Off
        Graphics Coarse Grain Tree Shader Light Sleep: Off
        Graphics Command Processor Light Sleep: On
        Graphics Run List Controller Light Sleep: Off
        Graphics 3D Coarse Grain Clock Gating: On
        Graphics 3D Coarse Grain memory Light Sleep: On
        Memory Controller Light Sleep: On
        Memory Controller Medium Grain Clock Gating: On
        System Direct Memory Access Light Sleep: On
        System Direct Memory Access Medium Grain Clock Gating: Off
        Bus Interface Medium Grain Clock Gating: Off
        Bus Interface Light Sleep: On
        Unified Video Decoder Medium Grain Clock Gating: Off
        Video Compression Engine Medium Grain Clock Gating: Off
        Host Data Path Light Sleep: On
        Host Data Path Medium Grain Clock Gating: Off
        Digital Right Management Medium Grain Clock Gating: Off
        Digital Right Management Light Sleep: On
        Rom Medium Grain Clock Gating: On
        Data Fabric Medium Grain Clock Gating: Off

GFX Clocks and Power:
        351 MHz (MCLK)
        0 MHz (SCLK)
        1373 MHz (PSTATE_SCLK)
        1001 MHz (PSTATE_MCLK)
        1106 mV (VDDGFX)
        135.0 W (average GPU)

GPU Temperature: 33 C
GPU Load: 0 %

SMC Feature Mask: 0x0000000000c0c002
UVD: Disabled

VCE: Disabled


It's locked at 135 W and 1106 mV. Are you guys seeing similar? Apologies for the
multiple posts, but I'll post again in a second after running Unigine to see if it
tries to boost before it crashes.

-- 
You are receiving this mail because:
You are the assignee for the bug.

--15583069471.AD48.24795
Date: Sun, 19 May 2019 23:02:27 +0000
MIME-Version: 1.0
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated

Comment #25 on bug 110674 from Tom B
On 5.1.3 (and presumably all 5.1 kernels) I am seeing a strange power profile.

Can everyone else run sensors (after sensors-detect if you don't have the
amdgpu device showing)?

I'm seeing this:

amdgpu-pci-4400
Adapter: PCI adapter
vddgfx:       +1.11 V
fan1:           0 RPM  (min =    0 RPM, max = 3850 RPM)
temp1:        +33.0°C  (crit = +118.0°C, hyst = -273.1°C)
power1:      135.00 W  (cap = 250.00 W)


Even at idle my GPU is running at 1100 mV (my default base voltage) and
constantly running at 135 W.


My output of cat /sys/kernel/debug/dri/0/amdgpu_pm_info shows the same thing:

Clock Gating Flags Mask: 0x36974f
        Graphics Medium Grain Clock Gating: On
        Graphics Medium Grain memory Light Sleep: On
        Graphics Coarse Grain Clock Gating: On
        Graphics Coarse Grain memory Light Sleep: On
        Graphics Coarse Grain Tree Shader Clock Gating: Off
        Graphics Coarse Grain Tree Shader Light Sleep: Off
        Graphics Command Processor Light Sleep: On
        Graphics Run List Controller Light Sleep: Off
        Graphics 3D Coarse Grain Clock Gating: On
        Graphics 3D Coarse Grain memory Light Sleep: On
        Memory Controller Light Sleep: On
        Memory Controller Medium Grain Clock Gating: On
        System Direct Memory Access Light Sleep: On
        System Direct Memory Access Medium Grain Clock Gating: Off
        Bus Interface Medium Grain Clock Gating: Off
        Bus Interface Light Sleep: On
        Unified Video Decoder Medium Grain Clock Gating: Off
        Video Compression Engine Medium Grain Clock Gating: Off
        Host Data Path Light Sleep: On
        Host Data Path Medium Grain Clock Gating: Off
        Digital Right Management Medium Grain Clock Gating: Off
        Digital Right Management Light Sleep: On
        Rom Medium Grain Clock Gating: On
        Data Fabric Medium Grain Clock Gating: Off

GFX Clocks and Power:
        351 MHz (MCLK)
        0 MHz (SCLK)
        1373 MHz (PSTATE_SCLK)
        1001 MHz (PSTATE_MCLK)
        1106 mV (VDDGFX)
        135.0 W (average GPU)

GPU Temperature: 33 C
GPU Load: 0 %

SMC Feature Mask: 0x0000000000c0c002
UVD: Disabled

VCE: Disabled


It's locked at 135 W and 1106 mV. Are you guys seeing similar? Apologies for the
multiple posts, but I'll post again in a second after running Unigine to see if it
tries to boost before it crashes.
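
For anyone who wants to compare numbers quickly, here is a rough Python sketch that pulls the clock, voltage, and power readings out of text in the amdgpu_pm_info format pasted above and flags the "idle but drawing a lot of power" pattern. The function names and the idle thresholds (5 % load, 50 W) are my own assumptions for illustration, not anything defined by the driver, and the parser only relies on the line layout shown in this comment.

```python
import re

# Matches reading lines such as "1106 mV (VDDGFX)" or "135.0 W (average GPU)".
READING = re.compile(r"^\s*(\d+(?:\.\d+)?)\s+(MHz|mV|W)\s+\((.+?)\)\s*$")


def parse_pm_info(text):
    """Return {label: (value, unit)} for the readings found in the text."""
    readings = {}
    for line in text.splitlines():
        match = READING.match(line)
        if match:
            value, unit, label = match.groups()
            readings[label] = (float(value), unit)
    # "GPU Load: 0 %" uses a different layout, so handle it separately.
    load = re.search(r"^GPU Load:\s*(\d+)\s*%", text, re.MULTILINE)
    if load:
        readings["GPU Load"] = (float(load.group(1)), "%")
    return readings


def looks_stuck(readings, idle_load=5.0, idle_power_w=50.0):
    """Hypothetical check: low load but high average power draw."""
    load = readings.get("GPU Load", (None, "%"))[0]
    power = readings.get("average GPU", (None, "W"))[0]
    if load is None or power is None:
        return False  # not enough data to say anything
    return load <= idle_load and power >= idle_power_w


# Sample taken from the output pasted in this comment.
SAMPLE = """\
GFX Clocks and Power:
        351 MHz (MCLK)
        0 MHz (SCLK)
        1373 MHz (PSTATE_SCLK)
        1001 MHz (PSTATE_MCLK)
        1106 mV (VDDGFX)
        135.0 W (average GPU)

GPU Temperature: 33 C
GPU Load: 0 %
"""

if __name__ == "__main__":
    readings = parse_pm_info(SAMPLE)
    print(readings["VDDGFX"], readings["average GPU"], looks_stuck(readings))
```

To run it against a live system, read /sys/kernel/debug/dri/0/amdgpu_pm_info into a string and pass that to parse_pm_info instead of SAMPLE; reading debugfs typically needs root, and the card index may differ from 0 on other machines.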


You are receiving this mail because:
  • You are the assignee for the bug.

--15583069471.AD48.24795--

--===============0179602056==
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: base64
Content-Disposition: inline

X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs
IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz
dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVs

--===============0179602056==--