From: bugzilla-daemon@freedesktop.org
To: dri-devel@lists.freedesktop.org
Subject: [Bug 110928] wx5100 gpu crash!!!!
Date: Mon, 17 Jun 2019 06:56:34 +0000

https://bugs.freedesktop.org/show_bug.cgi?id=110928

Bug ID: 110928
Summary: wx5100 gpu crash!!!!
Product: DRI
Version: DRI git
Hardware: All
OS: Linux (All)
Status: NEW
Severity: critical
Priority: medium
Component: DRM/AMDgpu
Assignee: dri-devel@lists.freedesktop.org
Reporter: baopeng88_com@163.com

When we used a WX5100 for rendering and encoding, we encountered some GPU
hangs. The situation is severe: once the GPU hangs, the only way to recover
is to reboot. The log information is as follows. Please help analyze, thank
you very much.

Situation 1:
2019-06-16T14:39:24.708544+08:00|err|kernel[-]|[398383.549799] amdgpu 0005:01:00.0: GPU fault detected: 146 0x04203d0c for process a.babycard.ssvs pid 330210 thread RenderThread pid 330511
2019-06-16T14:39:24.708703+08:00|err|kernel[-]|[398383.549803] amdgpu 0005:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00102184
2019-06-16T14:39:24.708812+08:00|err|kernel[-]|[398383.549805] amdgpu 0005:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0D03D014
2019-06-16T14:39:24.708908+08:00|err|kernel[-]|[398383.549809] amdgpu 0005:01:00.0: VM fault (0x14, vmid 6, pasid 33627) at page 1057156, write from 'SDM1' (0x53444d31) (61)
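
For anyone reading the fault lines above, the numbers can be cross-checked against each other. The following is only a sketch: the bit positions used to split VM_CONTEXT1_PROTECTION_FAULT_STATUS are an assumption, chosen because they reproduce the values the kernel itself printed (0x14, vmid 6, write, client 61), and the helper names `decode_status` and `tag_to_ascii` are hypothetical, not part of any driver API.

```python
# Values copied from the "situation 1" log lines.
FAULT_ADDR = 0x00102184      # VM_CONTEXT1_PROTECTION_FAULT_ADDR
FAULT_STATUS = 0x0D03D014    # VM_CONTEXT1_PROTECTION_FAULT_STATUS
CLIENT_TAG = 0x53444D31      # value printed next to 'SDM1'


def decode_status(status: int) -> dict:
    """Split the status word into fields.

    The bit positions are an assumption that happens to match the
    values shown in the "VM fault (...)" log line.
    """
    return {
        "protections": status & 0xFF,            # assumed bits 0-7
        "client_id": (status >> 12) & 0xFF,      # assumed bits 12-19
        "is_write": bool((status >> 24) & 0x1),  # assumed bit 24
        "vmid": (status >> 25) & 0xF,            # assumed bits 25-28
    }


def tag_to_ascii(tag: int) -> str:
    """Interpret a 32-bit tag as four big-endian ASCII characters."""
    return tag.to_bytes(4, "big").decode("ascii")


fields = decode_status(FAULT_STATUS)
print(FAULT_ADDR)                # same number as the "at page ..." value
print(fields)
print(tag_to_ascii(CLIENT_TAG))  # the client name in the log
```

Under these assumptions the register dump and the summary line agree: the fault address is the page number in hexadecimal, and the 32-bit tag is just the client name spelled in ASCII.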

After the GPU fault, about 17 seconds later:

2019-06-16T14:39:41.924400+08:00|err|kernel[-]|[398400.765123] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vce0 timeout, signaled seq=3868950, emitted seq=3868954
2019-06-16T14:39:41.924463+08:00|info|kernel[-]|[398400.765132] [drm] GPU recovery disabled.
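
The "about 17 seconds" gap and the distance between the signaled and emitted sequence numbers can be read straight off the log. A minimal sketch follows; the regular expressions and the helper names `kernel_seconds` and `parse_timeout` are hypothetical, written only against the line format shown in this report, and the log lines are abbreviated copies with the mail wrapping removed.

```python
import re

# The two kernel lines from situation 1, shortened to the kernel
# timestamp and message.
FAULT_LINE = (
    "[398383.549799] amdgpu 0005:01:00.0: GPU fault detected: 146 0x04203d0c"
)
TIMEOUT_LINE = (
    "[398400.765123] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vce0 "
    "timeout, signaled seq=3868950, emitted seq=3868954"
)

STAMP_RE = re.compile(r"^\[\s*(\d+\.\d+)\]")
TIMEOUT_RE = re.compile(
    r"ring (\S+) timeout, signaled seq=(\d+), emitted seq=(\d+)"
)


def kernel_seconds(line: str) -> float:
    """Return the kernel timestamp (seconds since boot) at the line start."""
    match = STAMP_RE.match(line)
    if match is None:
        raise ValueError("no kernel timestamp found")
    return float(match.group(1))


def parse_timeout(line: str) -> tuple:
    """Return (ring, signaled, emitted, emitted - signaled)."""
    match = TIMEOUT_RE.search(line)
    if match is None:
        raise ValueError("not a ring timeout line")
    ring = match.group(1)
    signaled, emitted = int(match.group(2)), int(match.group(3))
    return ring, signaled, emitted, emitted - signaled


delay = kernel_seconds(TIMEOUT_LINE) - kernel_seconds(FAULT_LINE)
print(round(delay, 3))
print(parse_timeout(TIMEOUT_LINE))
```

If the sequence numbers are consecutive, which is an assumption here, the last tuple element is the number of submissions on that ring that had been emitted but not yet signaled when the timeout fired.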

Situation 2:
[Thu Jun  6 22:00:14 2019] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=919191055, emitted seq=919191057
[Thu Jun  6 22:00:14 2019] [drm] GPU recovery disabled.
[Thu Jun  6 22:00:16 2019] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=101603699, emitted seq=101603701
[Thu Jun  6 22:00:16 2019] [drm] GPU recovery disabled.

Situation 3:
2019-06-16T14:59:05.248325+08:00|err|kernel[-]|[399194.411704] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=230984670, emitted seq=230984673
2019-06-16T14:59:05.248404+08:00|info|kernel[-]|[399194.411708] [drm] GPU recovery disabled.

Can you help me analyze these situations and solve these problems? Thank you.

