From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 110674] Crashes / Resets From AMDGPU / Radeon VII Date: Sat, 10 Aug 2019 19:00:17 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0548888994==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 832626E46C for ; Sat, 10 Aug 2019 19:00:17 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0548888994== Content-Type: multipart/alternative; boundary="15654636175.9eFFCA9.25912" Content-Transfer-Encoding: 7bit --15654636175.9eFFCA9.25912 Date: Sat, 10 Aug 2019 19:00:17 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D110674 --- Comment #68 from Tom B --- Apologies for the multiple replies/emails. I think I must just have got luc= ky. It worked several boots (in a row) and now only works very occasionally. I think it was just coincidence that it worked a few times after I installed = that kernel, sorry guys. During my tests with 5.2.7 I have noticed some interesting findings with the wattage though. It will indeed get stuck on a specific wattage, I've had 33, 24, 45, 133, 134 and on several wattages there is some fluctuation. e.g. 33-34. Higher wattages are significantly more stable, 133w lasts quite a while bef= ore it crashes, 33w crashes instantly. I'm assuming this is because the card ju= st doesn't have enough power to do what's required. When the wattage gets stuck, if you force the performance mode: # echo high > /sys/class/drm/card0/device/power_dpm_force_performance_level it confuses the driver and sensors then shows ERROR: Can't get value of subfeature power1_average: I/O error Despite working until manually setting the power state. There doesn't seem = to be a way to get it back to a state where sensors shows the wattage after it reaches this state, other than rebooting. The inconsistent nature of this bug and the fact that it sometimes doesn't appear suggests a race condition. I'd assume something else on the system happens before or after amdgpu is expecting. Is there any way to delay loading the amdgpu driver and manually loading it after everything else? --=20 You are receiving this mail because: You are the assignee for the bug.= --15654636175.9eFFCA9.25912 Date: Sat, 10 Aug 2019 19:00:17 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 68 on bug 11067= 4 from Tom = B
Apologies for the multiple replies/emails. I think I must just=
 have got lucky.
It worked several boots (in a row) and now only works very occasionally. I
think it was just coincidence that it worked a few times after I installed =
that
kernel, sorry guys.

During my tests with 5.2.7 I have noticed some interesting findings with the
wattage though. It will indeed get stuck on a specific wattage, I've had 33,
24, 45, 133, 134 and on several wattages there is some fluctuation.  e.g.
33-34.

Higher wattages are significantly more stable, 133w lasts quite a while bef=
ore
it crashes, 33w crashes instantly. I'm assuming this is because the card ju=
st
doesn't have enough power to do what's required.

When the wattage gets stuck, if you force the performance mode:

# echo high > /sys/class/drm/card0/device/power_dpm_force_performance_le=
vel

it confuses the driver and sensors then shows

ERROR: Can't get value of subfeature power1_average: I/O error

Despite working until manually setting the power state. There doesn't seem =
to
be a way to get it back to a state where sensors shows the wattage after it
reaches this state, other than rebooting.


The inconsistent nature of this bug and the fact that it sometimes doesn't
appear suggests a race condition. I'd assume something else on the system
happens before or after amdgpu is expecting.

Is there any way to delay loading the amdgpu driver and manually loading it
after everything else?


You are receiving this mail because:
  • You are the assignee for the bug.
= --15654636175.9eFFCA9.25912-- --===============0548888994== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVs --===============0548888994==--