From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@freedesktop.org
Subject: [Bug 105733] Amdgpu randomly hangs and only ssh works. Mouse cursor moves sometimes but does nothing. Keyboard stops working.
Date: Fri, 16 Nov 2018 14:28:16 +0000
To: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org
Errors-To: dri-devel-bounces@lists.freedesktop.org
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated

https://bugs.freedesktop.org/show_bug.cgi?id=105733

--- Comment #41 from Philipp ---
I can second much of what John W. says. The crashes have become less frequent
with recent firmware/kernel versions, but they still happen.
For me, too, the crashes only started on my Vega 64 when I threw out my ancient
Intel CPU and replaced it with an AMD Ryzen 5 1600 on a GR-AB350M-Gaming 3
board.
I've done stability tests on that other OS, so I don't think I've got faulty
hardware here.
One of my crash logs:

Nov 16 15:18:29 localhorst kernel: amdgpu 0000:08:00.0: [gfxhub] VMC page fault (src_id:0 ring:158 vmid:7 pasid:32776, for process RocketLeague pid 6347 thread RocketLeag:cs0 pid 6400)
Nov 16 15:18:29 localhorst kernel: amdgpu 0000:08:00.0:   at address 0x0000800319593000 from 27
Nov 16 15:18:29 localhorst kernel: amdgpu 0000:08:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x0070053C
Nov 16 15:18:30 localhorst kernel: amdgpu 0000:08:00.0: [gfxhub] VMC page fault (src_id:0 ring:220 vmid:7 pasid:32776, for process RocketLeague pid 6347 thread RocketLeag:cs0 pid 6400)
Nov 16 15:18:30 localhorst kernel: amdgpu 0000:08:00.0:   at address 0x00008201004e0000 from 27
Nov 16 15:18:30 localhorst kernel: amdgpu 0000:08:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x007013B8
Nov 16 15:18:40 localhorst kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=38153, emitted seq=38155
Nov 16 15:18:40 localhorst kernel: [drm] GPU recovery disabled.
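For anyone comparing several such logs, the fault fields can be pulled out of the kernel messages with a short script. What follows is only a minimal sketch and not part of the original report: the function name `parse_faults`, the regular expressions, and the dict keys (including `from_id` for the number after "from", whose meaning the log itself does not state) are assumptions made for illustration. It also assumes each kernel message sits on a single line, with the mail line wrapping undone.

```python
import re

# Patterns for the three lines that make up one page-fault report in the log.
# They are written against the messages quoted in this report only.
FAULT_RE = re.compile(
    r"\[(?P<hub>\w+)\] VMC page fault \(src_id:(?P<src_id>\d+) "
    r"ring:(?P<ring>\d+) vmid:(?P<vmid>\d+) pasid:(?P<pasid>\d+), "
    r"for process (?P<process>\S+) pid (?P<pid>\d+)"
)
ADDR_RE = re.compile(r"at address (?P<addr>0x[0-9a-fA-F]+) from (?P<from_id>\d+)")
STATUS_RE = re.compile(r"VM_L2_PROTECTION_FAULT_STATUS:(?P<status>0x[0-9a-fA-F]+)")

# The two fault reports from the log, one kernel message per line.
SAMPLE_LOG = """\
Nov 16 15:18:29 localhorst kernel: amdgpu 0000:08:00.0: [gfxhub] VMC page fault (src_id:0 ring:158 vmid:7 pasid:32776, for process RocketLeague pid 6347 thread RocketLeag:cs0 pid 6400)
Nov 16 15:18:29 localhorst kernel: amdgpu 0000:08:00.0:   at address 0x0000800319593000 from 27
Nov 16 15:18:29 localhorst kernel: amdgpu 0000:08:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x0070053C
Nov 16 15:18:30 localhorst kernel: amdgpu 0000:08:00.0: [gfxhub] VMC page fault (src_id:0 ring:220 vmid:7 pasid:32776, for process RocketLeague pid 6347 thread RocketLeag:cs0 pid 6400)
Nov 16 15:18:30 localhorst kernel: amdgpu 0000:08:00.0:   at address 0x00008201004e0000 from 27
Nov 16 15:18:30 localhorst kernel: amdgpu 0000:08:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x007013B8
Nov 16 15:18:40 localhorst kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=38153, emitted seq=38155
"""


def parse_faults(log_text):
    """Return one dict per page fault, merging its address and status lines."""
    faults = []
    for line in log_text.splitlines():
        match = FAULT_RE.search(line)
        if match:
            # A new fault report starts here.
            faults.append({
                "hub": match.group("hub"),
                "src_id": int(match.group("src_id")),
                "ring": int(match.group("ring")),
                "vmid": int(match.group("vmid")),
                "pasid": int(match.group("pasid")),
                "process": match.group("process"),
                "pid": int(match.group("pid")),
            })
            continue
        if not faults:
            # Ignore address/status lines that precede any fault header.
            continue
        match = ADDR_RE.search(line)
        if match:
            faults[-1]["address"] = int(match.group("addr"), 16)
            faults[-1]["from_id"] = int(match.group("from_id"))
            continue
        match = STATUS_RE.search(line)
        if match:
            faults[-1]["status"] = int(match.group("status"), 16)
    return faults


if __name__ == "__main__":
    for fault in parse_faults(SAMPLE_LOG):
        print(fault["process"], fault["ring"], hex(fault["address"]), hex(fault["status"]))
```

Lines that do not belong to a page-fault report, such as the ring timeout message, are skipped, so the same function can be run over a full journal dump.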
