From: bugzilla-daemon@freedesktop.org
Subject: [Bug 110345] Unrecoverable GPU crash with DiRT 4
Date: Sat, 06 Apr 2019 06:27:31 +0000
To: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org

https://bugs.freedesktop.org/show_bug.cgi?id=110345

Bug ID: 110345
Summary: Unrecoverable GPU crash with DiRT 4
Product: Mesa
Version: unspecified
Hardware: Other
OS: All
Status: NEW
Severity: normal
Priority: medium
Component: Drivers/Gallium/radeonsi
Assignee: dri-devel@lists.freedesktop.org
Reporter: v10lator@myway.de
QA Contact: dri-devel@lists.freedesktop.org

At first I thought this was a kernel bug, but Alex Deucher told me it's most
likely a Mesa bug.

The game randomly freezes. When that happens the screen is frozen and input by
mouse or keyboard doesn't work (LEDs on the keyboard are frozen, too). It's
still possible to SSH to the PC to get some logs, but that's about it: even the
reboot command freezes.

Here's a log from Mesa 18.3.4 and kernel 5.0.4:

> [52700.498697] [drm:amdgpu_job_timedout] *ERROR* ring gfx timeout, signaled seq=1423558, emitted seq=1423560
> [52700.498702] [drm:amdgpu_job_timedout] *ERROR* Process information: process Dirt4 pid 10332 thread WebViewRenderer pid 10391
> [52700.498705] amdgpu 0000:01:00.0: GPU reset begin!
> [52710.728397] [drm:amdgpu_dm_atomic_check] *ERROR* [CRTC:47:crtc-0] hw_done or flip_done timed out
> [52873.699280] WARNING: CPU: 2 PID: 4034 at kernel/kthread.c:529 kthread_park+0x67/0x78
> [52873.699283] Modules linked in: nfsd
> [52873.699287] CPU: 2 PID: 4034 Comm: TaskSchedulerFo Not tainted 5.0.4 #1
> [52873.699288] Hardware name: To be filled by O.E.M. To be filled by O.E.M./SABERTOOTH 990FX R2.0, BIOS 2901 05/04/2016
> [52873.699290] RIP: 0010:kthread_park+0x67/0x78
> [52873.699291] Code: 18 e8 9d 78 aa 00 be 40 00 00 00 48 89 df e8 60 72 00 00 48 85 c0 74 1b 31 c0 5b 5d c3 0f 0b eb ae 0f 0b b8 da ff ff ff eb f0 <0f> 0b b8 f0 ff ff ff eb e7 0f 0b eb e3 0f 1f 40 00 f6 47 26 20 74
> [52873.699293] RSP: 0018:ffffa0144460fb78 EFLAGS: 00210202
> [52873.699294] RAX: 0000000000000004 RBX: ffff9155631210c0 RCX: 0000000000000000
> [52873.699295] RDX: ffff9155ef427428 RSI: ffff9155631210c0 RDI: ffff9155ef9bbfc0
> [52873.699296] RBP: ffff9155f013b8a0 R08: ffff9155f2a97480 R09: ffff9155f2a94a00
> [52873.699297] R10: 0000d46d0abbfe3a R11: 000033d8b581bc78 R12: ffff9155ef422790
> [52873.699298] R13: ffff9155a2f83c00 R14: 0000000000000202 R15: dead000000000100
> [52873.699299] FS:  00007fc756cff700(0000) GS:ffff9155f2a80000(0000) knlGS:0000000000000000
> [52873.699301] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [52873.699302] CR2: 00007fc7650b8070 CR3: 0000000322b86000 CR4: 00000000000406e0
> [52873.699302] Call Trace:
> [52873.699307]  drm_sched_entity_fini+0x32/0x180
> [52873.699309]  amdgpu_vm_fini+0xa8/0x520
> [52873.699311]  ? idr_destroy+0x78/0xc0
> [52873.699313]  amdgpu_driver_postclose_kms+0x14c/0x268
> [52873.699316]  drm_file_free.part.7+0x21a/0x2f8
> [52873.699318]  drm_release+0xa5/0x120
> [52873.699320]  __fput+0x9a/0x1c8
> [52873.699323]  task_work_run+0x8a/0xb0
> [52873.699325]  do_exit+0x2b5/0xb30
> [52873.699326]  do_group_exit+0x35/0x98
> [52873.699328]  get_signal+0xbd/0x690
> [52873.699331]  ? _raw_spin_unlock+0xd/0x20
> [52873.699333]  ? do_signal+0x2b/0x6b8
> [52873.699335]  ? __x64_sys_futex+0x137/0x178
> [52873.699337]  ? exit_to_usermode_loop+0x46/0xa0
> [52873.699338]  ? do_syscall_64+0x14c/0x178
> [52873.699339]  ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
> [52873.699341] ---[ end trace 1e1efc0508ef22df ]---
> [52875.619562] [drm] Skip scheduling IBs!
> [52875.625247] [drm:amdgpu_cs_ioctl] *ERROR* Failed to initialize parser -125!
> [52885.826983] [drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:47:crtc-0] flip_done timed out
> [52896.066581] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:47:crtc-0] flip_done timed out
> [52906.306280] [drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:45:plane-5] flip_done timed out

Mesa 19.0.1 / kernel 5.0.5:

> [178793.032358] [drm:amdgpu_job_timedout] *ERROR* ring gfx timeout, signaled seq=12332054, emitted seq=12332056
> [178793.032362] [drm:amdgpu_job_timedout] *ERROR* Process information: process Dirt4 pid 31348 thread WebViewRenderer pid 31422
> [178793.032365] amdgpu 0000:01:00.0: GPU reset begin!
> [178803.262008] [drm:amdgpu_dm_atomic_check] *ERROR* [CRTC:47:crtc-0] hw_done or flip_done timed out

Mesa git (26e161b1e9) / kernel 5.0.5:

> [ 7819.095648] [drm:amdgpu_job_timedout] *ERROR* ring gfx timeout, signaled seq=2652771, emitted seq=2652773
> [ 7819.095652] [drm:amdgpu_job_timedout] *ERROR* Process information: process Dirt4 pid 3075 thread WebViewRenderer pid 3152
> [ 7819.095655] amdgpu 0000:01:00.0: GPU reset begin!
> [ 7829.315220] [drm:amdgpu_dm_atomic_check] *ERROR* [CRTC:47:crtc-0] hw_done or flip_done timed out
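
To compare the timeout lines across the three runs, something like the following could be used. It is only a rough sketch: it assumes the messages are fed in as dmesg prints them (one message per line, plain `seq=` values), and the names `parse_ring_timeouts` and `SAMPLE` are made up for this example. The "pending" value is simply emitted minus signaled, i.e. how many fences had been emitted but not yet signaled when the timeout fired.

```python
import re

# Matches the amdgpu_job_timedout line in the format seen in this report.
# The format is an assumption; other kernel versions may print it differently.
TIMEOUT_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+\[drm:amdgpu_job_timedout\] \*ERROR\* "
    r"ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def parse_ring_timeouts(text):
    """Return one dict per ring timeout line found in a dmesg dump."""
    rows = []
    for match in TIMEOUT_RE.finditer(text):
        signaled = int(match.group("signaled"))
        emitted = int(match.group("emitted"))
        rows.append({
            "timestamp": float(match.group("ts")),
            "ring": match.group("ring"),
            "signaled": signaled,
            "emitted": emitted,
            # Fences emitted but not yet signaled at the time of the timeout.
            "pending": emitted - signaled,
        })
    return rows


# The three timeout lines from this report.
SAMPLE = """\
[52700.498697] [drm:amdgpu_job_timedout] *ERROR* ring gfx timeout, signaled seq=1423558, emitted seq=1423560
[178793.032358] [drm:amdgpu_job_timedout] *ERROR* ring gfx timeout, signaled seq=12332054, emitted seq=12332056
[ 7819.095648] [drm:amdgpu_job_timedout] *ERROR* ring gfx timeout, signaled seq=2652771, emitted seq=2652773
"""

if __name__ == "__main__":
    for row in parse_ring_timeouts(SAMPLE):
        print(row["timestamp"], row["ring"], "pending:", row["pending"])
```

For the three logs in this report the gap comes out as 2 each time, on the gfx ring.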

This is with a Radeon RX 580 (Sapphire NITRO+).

Link to the kernel bug report:
https://bugzilla.kernel.org/show_bug.cgi?id=203111


You are receiving this mail because:
  • You are the assignee for the bug.