From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@freedesktop.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 100964] RX-480 [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
Date: Mon, 08 May 2017 01:38:51 +0000

https://bugs.freedesktop.org/show_bug.cgi?id=100964

Bug ID: 100964
Summary: RX-480 [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
Product: DRI
Version: unspecified
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: NEW
Severity: major
Priority: medium
Component: DRM/AMDgpu
Assignee: dri-devel@lists.freedesktop.org
Reporter: suzaku.29a@gmail.com

Created attachment 131248
  --> https://bugs.freedesktop.org/attachment.cgi?id=131248&action=edit
dmesg output on amdgpu load failure

Inconsistent amdgpu driver loading for an RX-480:

Most of the time, the driver will fail to load starting with the error:
    [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)
The screen goes into stand-by and I get no display output. The rest of
the system still loads normally and I can ssh and look around.
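
For anyone checking their own logs for the same failure, here is a minimal sketch that pulls the ring number, scratch register and value out of lines like the one above. The function name find_ring_failures is made up for illustration, and the pattern only assumes the message format quoted in this report.

```python
import re

# Matches the amdgpu ring test failure message quoted in this report.
# The pattern is an assumption based on that single line of output.
RING_FAIL = re.compile(
    r"ring (?P<ring>\d+) test failed "
    r"\(scratch\((?P<reg>0x[0-9A-Fa-f]+)\)=(?P<value>0x[0-9A-Fa-f]+)\)"
)


def find_ring_failures(dmesg_text):
    """Return a (ring, register, value) tuple for each failure line found."""
    return [
        (int(m.group("ring")), m.group("reg"), m.group("value"))
        for m in RING_FAIL.finditer(dmesg_text)
    ]


# The error line from this report, used as sample input.
sample = (
    "[drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: "
    "ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)"
)
print(find_ring_failures(sample))  # prints [(0, '0xC040', '0xCAFEDEAD')]
```

An empty list means no matching line was found in the text that was passed in.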

------------
But it will, very seldom, load the driver normally (KMS enabled, display
still active).
------------

I can use nomodeset to prevent amdgpu from being initialized and the
display continues to work, but without 3D accel and just one monitor
working (out of two).

To confirm that the GPU actually works (both in general and in Linux
with amdgpu), I tested it on a friend's PC with the same distro I'm
using (Debian Testing/amd64). The driver loaded normally on the first
try and worked consistently across all the reboots we tried.

Since then I've read everything I could find on the issue, but I couldn't
find a solution. There are similar reports for other AMD video cards,
but none of them offers a fix for the inconsistent loading.

Things tried so far:
    Debian testing and unstable with most of the kernels released since
        4.7.0-1 up to their latest kernel 4.9.25-1 (4.9.0-3 in their
        versioning system).
    Gentoo with the genkernel 4.9.16.
    Gentoo with a custom kernel 4.9.16.
    Gentoo with kernel 4.11 from https://cgit.freedesktop.org/~agd5f/linux/
        (drm-fixes-4.11).
    Alternating ACPI and most, if not all, amdgpu parameters in all
        tested kernels.
    Using different outputs (DVI, HDMI, and both at once; I cannot test
        DisplayPort).

The Debian and Gentoo installs are fresh and on different drives. I can
test things on either one.

I'd be OK with my motherboard or CPU somehow being too old or incompatible, but
the few successful boots tell me that it can and *does* work on my PC.

I'm attaching two dmesg logs, one for the working boot and one for the
failing one. They were captured shortly after each other and nothing was
changed in between.
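
A minimal sketch of one way to compare a working and a failing dmesg log: strip the kernel timestamps so that identical messages line up, then print a unified diff. The helper names and the sample lines are invented for illustration, and the timestamp pattern assumes the usual "[   12.345678] " prefix.

```python
import difflib
import re

# Leading kernel timestamp such as "[    3.141592] " (assumed default format).
TIMESTAMP = re.compile(r"^\[\s*\d+\.\d+\]\s*")


def strip_timestamps(text):
    """Split a dmesg dump into lines with the timestamp prefix removed."""
    return [TIMESTAMP.sub("", line) for line in text.splitlines()]


def diff_dmesg(good_text, bad_text):
    """Return unified-diff lines between a working and a failing log."""
    return list(
        difflib.unified_diff(
            strip_timestamps(good_text),
            strip_timestamps(bad_text),
            fromfile="working",
            tofile="failing",
            lineterm="",
            n=0,  # no context lines, show only what differs
        )
    )


# Invented sample input, not taken from the attached logs.
good = "[    3.10] step a\n[    3.20] step b ok"
bad = "[    4.05] step a\n[    4.90] step b failed"
for line in diff_dmesg(good, bad):
    print(line)
```

With real logs, the two arguments would be the contents of the saved files, read with open(path).read().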

I'm attaching lspci output too.

Please let me know if you need any other info or clarification on the
error.

Regards.

HV

