From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 108710] Since 4.20 kernel Vega 56 hangs when I surf pages in steam client Date: Sun, 11 Nov 2018 11:46:17 +0000 Message-ID: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0155461745==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 3F897896F7 for ; Sun, 11 Nov 2018 11:46:17 +0000 (UTC) List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0155461745== Content-Type: multipart/alternative; boundary="15419367770.9EaECd2F6.9507" Content-Transfer-Encoding: 7bit --15419367770.9EaECd2F6.9507 Date: Sun, 11 Nov 2018 11:46:17 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D108710 Bug ID: 108710 Summary: Since 4.20 kernel Vega 56 hangs when I surf pages in steam client Product: DRI Version: XOrg git Hardware: Other OS: All Status: NEW Severity: normal Priority: medium Component: DRM/AMDgpu Assignee: dri-devel@lists.freedesktop.org Reporter: mikhail.v.gavrilov@gmail.com Created attachment 142434 --> https://bugs.freedesktop.org/attachment.cgi?id=3D142434&action=3Dedit dmesg $ inxi -bM System: Host: localhost.localdomain Kernel: 4.20.0-0.rc1.git4.1.fc30.x86= _64 x86_64 bits: 64 Desktop: Gnome 3.30.1=20 Distro: Fedora release 30 (Rawhide)=20 Machine: Type: Desktop Mobo: ASUSTeK model: ROG STRIX X470-I GAMING v: Rev 1.xx serial: =20 UEFI: American Megatrends v: 0901 date: 07/23/2018=20 CPU: 8-Core: AMD Ryzen 7 2700X type: MT MCP speed: 3427 MHz min/max: 2200/4000 MHz=20 Graphics: Device-1: Advanced Micro Devices [AMD/ATI] Vega 10 XL/XT [Radeon= RX Vega 56/64] driver: amdgpu v: kernel=20 Display: wayland server: Fedora Project X.org 1.20.3 driver: amd= gpu resolution: 3840x2160~60Hz=20 OpenGL: renderer: Radeon RX Vega (VEGA10 DRM 3.27.0 4.20.0-0.rc1.git4.1.fc30.x86_64 LLVM 7.0.0) v: 4.5 Mesa 18.2.4=20 Network: Device-1: Intel I211 Gigabit Network driver: igb=20 Device-2: Realtek RTL8822BE 802.11a/b/g/n/ac WiFi adapter driver: r8822be=20 Drives: Local Storage: total: 11.36 TiB used: 5.93 TiB (52.2%)=20 Info: Processes: 455 Uptime: 16m Memory: 31.30 GiB used: 15.99 GiB (51= .1%) Shell: bash inxi: 3.0.27=20 [ 3852.511166] gmc_v9_0_process_interrupt: 56 callbacks suppressed [ 3852.511182] amdgpu 0000:0b:00.0: [mmhub] VMC page fault (src_id:0 ring:1= 69 vmid:0 pasid:0, for process pid 0 thread pid 0) [ 3852.511184] amdgpu 0000:0b:00.0: in page starting at address 0x000000401080c000 from 18 [ 3852.511186] amdgpu 0000:0b:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00040152 [ 3862.673344] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeou= t, signaled seq=3D72072, emitted seq=3D72074 [ 3862.673356] [drm] GPU recovery disabled. [ 4044.170764] sysrq: SysRq : Show Blocked State [ 4044.170959] task PC stack pid father [ 4044.171026] kworker/u32:5 D10872 253 2 0x80000000 [ 4044.171060] Workqueue: events_unbound commit_work [drm_kms_helper] [ 4044.171063] Call Trace: [ 4044.171073] ? __schedule+0x2f3/0xb90 [ 4044.171077] ? __lock_acquire+0x279/0x1650 [ 4044.171085] ? dma_fence_default_wait+0x242/0x330 [ 4044.171089] schedule+0x2f/0x90 [ 4044.171092] schedule_timeout+0x31c/0x4f0 [ 4044.171096] ? find_held_lock+0x34/0xa0 [ 4044.171099] ? find_held_lock+0x34/0xa0 [ 4044.171104] ? mark_held_locks+0x57/0x80 [ 4044.171134] ? _raw_spin_unlock_irqrestore+0x4b/0x60 [ 4044.171140] ? dma_fence_default_wait+0x242/0x330 [ 4044.171143] dma_fence_default_wait+0x26e/0x330 [ 4044.171147] ? dma_fence_release+0x120/0x120 [ 4044.171153] dma_fence_wait_timeout+0x182/0x200 [ 4044.171160] reservation_object_wait_timeout_rcu+0x236/0x4e0 [ 4044.171263] amdgpu_dm_do_flip+0x112/0x380 [amdgpu] [ 4044.171378] amdgpu_dm_atomic_commit_tail+0x6d0/0xd30 [amdgpu] [ 4044.171386] ? _raw_spin_unlock_irq+0x29/0x40 [ 4044.171391] ? wait_for_completion_timeout+0x73/0x1a0 [ 4044.171408] commit_tail+0x3d/0x70 [drm_kms_helper] [ 4044.171413] process_one_work+0x27d/0x600 [ 4044.171423] worker_thread+0x3c/0x390 [ 4044.171428] ? drain_workqueue+0x180/0x180 [ 4044.171433] kthread+0x120/0x140 [ 4044.171437] ? kthread_park+0x80/0x80 [ 4044.171442] ret_from_fork+0x27/0x50 [ 4044.172479] (time-dir) D13944 15221 1 0x00000000 [ 4044.172487] Call Trace: [ 4044.172496] ? __schedule+0x2f3/0xb90 [ 4044.172501] ? prepare_to_wait_event+0xd2/0x180 [ 4044.172508] schedule+0x2f/0x90 [ 4044.172514] drm_sched_entity_flush+0x1df/0x1f0 [gpu_sched] [ 4044.172518] ? finish_wait+0x80/0x80 [ 4044.172580] amdgpu_ctx_mgr_entity_flush+0x7c/0xc0 [amdgpu] [ 4044.172637] amdgpu_flush+0x1f/0x30 [amdgpu] [ 4044.172640] filp_close+0x34/0x70 [ 4044.172645] __x64_sys_close+0x1e/0x50 [ 4044.172649] do_syscall_64+0x60/0x1f0 [ 4044.172653] entry_SYSCALL_64_after_hwframe+0x49/0xbe [ 4044.172656] RIP: 0033:0x7f5a96622ec7 [ 4044.172662] Code: Bad RIP value. [ 4044.172665] RSP: 002b:00007ffcce3d00e0 EFLAGS: 00000293 ORIG_RAX: 0000000000000003 [ 4044.172668] RAX: ffffffffffffffda RBX: 000000000000007c RCX: 00007f5a96622ec7 [ 4044.172671] RDX: 0000000000000000 RSI: 00007ffcce3d0180 RDI: 000000000000007c [ 4044.172673] RBP: 000055d29a73aa60 R08: 000055d29a73b676 R09: 0000000000000000 [ 4044.172675] R10: 00007f5a965bbae0 R11: 0000000000000293 R12: 00007f5a95939750 [ 4044.172677] R13: 0000000000000000 R14: 0000000000000001 R15: 00007ffcce3d0180 [ 4057.229953] INFO: task kworker/u32:5:253 blocked for more than 120 secon= ds. [ 4057.229957] Tainted: G WC=20=20=20=20=20=20=20 4.20.0-0.rc1.git4.1.fc30.x86_64 #1 [ 4057.229959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables = this message. [ 4057.229962] kworker/u32:5 D10872 253 2 0x80000000 [ 4057.229979] Workqueue: events_unbound commit_work [drm_kms_helper] [ 4057.229982] Call Trace: [ 4057.229994] ? __schedule+0x2f3/0xb90 [ 4057.229998] ? __lock_acquire+0x279/0x1650 [ 4057.230006] ? dma_fence_default_wait+0x242/0x330 [ 4057.230010] schedule+0x2f/0x90 [ 4057.230013] schedule_timeout+0x31c/0x4f0 [ 4057.230017] ? find_held_lock+0x34/0xa0 [ 4057.230020] ? find_held_lock+0x34/0xa0 [ 4057.230025] ? mark_held_locks+0x57/0x80 [ 4057.230028] ? _raw_spin_unlock_irqrestore+0x4b/0x60 [ 4057.230034] ? dma_fence_default_wait+0x242/0x330 [ 4057.230037] dma_fence_default_wait+0x26e/0x330 [ 4057.230041] ? dma_fence_release+0x120/0x120 [ 4057.230047] dma_fence_wait_timeout+0x182/0x200 [ 4057.230052] reservation_object_wait_timeout_rcu+0x236/0x4e0 [ 4057.230134] amdgpu_dm_do_flip+0x112/0x380 [amdgpu] [ 4057.230221] amdgpu_dm_atomic_commit_tail+0x6d0/0xd30 [amdgpu] [ 4057.230228] ? _raw_spin_unlock_irq+0x29/0x40 [ 4057.230232] ? wait_for_completion_timeout+0x73/0x1a0 [ 4057.230249] commit_tail+0x3d/0x70 [drm_kms_helper] [ 4057.230254] process_one_work+0x27d/0x600 [ 4057.230263] worker_thread+0x3c/0x390 [ 4057.230269] ? drain_workqueue+0x180/0x180 [ 4057.230272] kthread+0x120/0x140 [ 4057.230276] ? kthread_park+0x80/0x80 [ 4057.230281] ret_from_fork+0x27/0x50 [ 4057.230571]=20 Showing all locks held in the system: [ 4057.230581] 1 lock held by khungtaskd/94: [ 4057.230583] #0: 00000000a1fc4e6f (rcu_read_lock){....}, at: debug_show_all_locks+0x15/0x183 [ 4057.230596] 3 locks held by kworker/u32:5/253: [ 4057.230597] #0: 00000000156505f1 ((wq_completion)"events_unbound"){+.+.= }, at: process_one_work+0x1f3/0x600 [ 4057.230603] #1: 000000000d248f14 ((work_completion)(&state->commit_work)){+.+.}, at: process_one_work+0x1f3/0x600 [ 4057.230608] #2: 000000003df03870 (reservation_ww_class_mutex){+.+.}, at: amdgpu_dm_do_flip+0xd6/0x380 [amdgpu] [ 4057.230700] 2 locks held by gnome-shell/2152: [ 4057.230702] #0: 00000000a2cb2cbf (crtc_ww_class_acquire){+.+.}, at: drm_mode_cursor_common+0x95/0x220 [drm] [ 4057.230721] #1: 00000000e86bda0d (crtc_ww_class_mutex){+.+.}, at: drm_modeset_lock+0x101/0x120 [drm] [ 4057.230746] 5 locks held by Xwayland/2222: [ 4057.230784] 1 lock held by htop/3225: [ 4057.230848] 1 lock held by CPU 0/KVM/4333: [ 4057.230989] 1 lock held by (time-dir)/15221: [ 4057.230991] #0: 000000006ef8a6af (&mgr->lock){+.+.}, at: amdgpu_ctx_mgr_entity_flush+0x3c/0xc0 [amdgpu] [ 4057.231068] =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D --=20 You are receiving this mail because: You are the assignee for the bug.= --15419367770.9EaECd2F6.9507 Date: Sun, 11 Nov 2018 11:46:17 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated
Bug ID 108710
Summary Since 4.20 kernel Vega 56 hangs when I surf pages in steam cl= ient
Product DRI
Version XOrg git
Hardware Other
OS All
Status NEW
Severity normal
Priority medium
Component DRM/AMDgpu
Assignee dri-devel@lists.freedesktop.org
Reporter mikhail.v.gavrilov@gmail.com

Created attachment 142434 [details]<=
/span>
dmesg

$ inxi -bM
System:    Host: localhost.localdomain Kernel: 4.20.0-0.rc1.git4.1.fc30.x86=
_64
x86_64 bits: 64 Desktop: Gnome 3.30.1=20
           Distro: Fedora release 30 (Rawhide)=20
Machine:   Type: Desktop Mobo: ASUSTeK model: ROG STRIX X470-I GAMING v: Rev
1.xx serial: <root required>=20
           UEFI: American Megatrends v: 0901 date: 07/23/2018=20
CPU:       8-Core: AMD Ryzen 7 2700X type: MT MCP speed: 3427 MHz min/max:
2200/4000 MHz=20
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Vega 10 XL/XT [Radeon=
 RX
Vega 56/64] driver: amdgpu v: kernel=20
           Display: wayland server: Fedora Project X.org 1.20.3 driver: amd=
gpu
resolution: 3840x2160~60Hz=20
           OpenGL: renderer: Radeon RX Vega (VEGA10 DRM 3.27.0
4.20.0-0.rc1.git4.1.fc30.x86_64 LLVM 7.0.0) v: 4.5 Mesa 18.2.4=20
Network:   Device-1: Intel I211 Gigabit Network driver: igb=20
           Device-2: Realtek RTL8822BE 802.11a/b/g/n/ac WiFi adapter driver:
r8822be=20
Drives:    Local Storage: total: 11.36 TiB used: 5.93 TiB (52.2%)=20
Info:      Processes: 455 Uptime: 16m Memory: 31.30 GiB used: 15.99 GiB (51=
.1%)
Shell: bash inxi: 3.0.27=20


[ 3852.511166] gmc_v9_0_process_interrupt: 56 callbacks suppressed
[ 3852.511182] amdgpu 0000:0b:00.0: [mmhub] VMC page fault (src_id:0 ring:1=
69
vmid:0 pasid:0, for process  pid 0 thread  pid 0)
[ 3852.511184] amdgpu 0000:0b:00.0:   in page starting at address
0x000000401080c000 from 18
[ 3852.511186] amdgpu 0000:0b:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00040152
[ 3862.673344] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeou=
t,
signaled seq=3D72072, emitted seq=3D72074
[ 3862.673356] [drm] GPU recovery disabled.
[ 4044.170764] sysrq: SysRq : Show Blocked State
[ 4044.170959]   task                        PC stack   pid father
[ 4044.171026] kworker/u32:5   D10872   253      2 0x80000000
[ 4044.171060] Workqueue: events_unbound commit_work [drm_kms_helper]
[ 4044.171063] Call Trace:
[ 4044.171073]  ? __schedule+0x2f3/0xb90
[ 4044.171077]  ? __lock_acquire+0x279/0x1650
[ 4044.171085]  ? dma_fence_default_wait+0x242/0x330
[ 4044.171089]  schedule+0x2f/0x90
[ 4044.171092]  schedule_timeout+0x31c/0x4f0
[ 4044.171096]  ? find_held_lock+0x34/0xa0
[ 4044.171099]  ? find_held_lock+0x34/0xa0
[ 4044.171104]  ? mark_held_locks+0x57/0x80
[ 4044.171134]  ? _raw_spin_unlock_irqrestore+0x4b/0x60
[ 4044.171140]  ? dma_fence_default_wait+0x242/0x330
[ 4044.171143]  dma_fence_default_wait+0x26e/0x330
[ 4044.171147]  ? dma_fence_release+0x120/0x120
[ 4044.171153]  dma_fence_wait_timeout+0x182/0x200
[ 4044.171160]  reservation_object_wait_timeout_rcu+0x236/0x4e0
[ 4044.171263]  amdgpu_dm_do_flip+0x112/0x380 [amdgpu]
[ 4044.171378]  amdgpu_dm_atomic_commit_tail+0x6d0/0xd30 [amdgpu]
[ 4044.171386]  ? _raw_spin_unlock_irq+0x29/0x40
[ 4044.171391]  ? wait_for_completion_timeout+0x73/0x1a0
[ 4044.171408]  commit_tail+0x3d/0x70 [drm_kms_helper]
[ 4044.171413]  process_one_work+0x27d/0x600
[ 4044.171423]  worker_thread+0x3c/0x390
[ 4044.171428]  ? drain_workqueue+0x180/0x180
[ 4044.171433]  kthread+0x120/0x140
[ 4044.171437]  ? kthread_park+0x80/0x80
[ 4044.171442]  ret_from_fork+0x27/0x50
[ 4044.172479] (time-dir)      D13944 15221      1 0x00000000
[ 4044.172487] Call Trace:
[ 4044.172496]  ? __schedule+0x2f3/0xb90
[ 4044.172501]  ? prepare_to_wait_event+0xd2/0x180
[ 4044.172508]  schedule+0x2f/0x90
[ 4044.172514]  drm_sched_entity_flush+0x1df/0x1f0 [gpu_sched]
[ 4044.172518]  ? finish_wait+0x80/0x80
[ 4044.172580]  amdgpu_ctx_mgr_entity_flush+0x7c/0xc0 [amdgpu]
[ 4044.172637]  amdgpu_flush+0x1f/0x30 [amdgpu]
[ 4044.172640]  filp_close+0x34/0x70
[ 4044.172645]  __x64_sys_close+0x1e/0x50
[ 4044.172649]  do_syscall_64+0x60/0x1f0
[ 4044.172653]  entry_SYSCALL_64_after_hwframe+0x49/0xbe
[ 4044.172656] RIP: 0033:0x7f5a96622ec7
[ 4044.172662] Code: Bad RIP value.
[ 4044.172665] RSP: 002b:00007ffcce3d00e0 EFLAGS: 00000293 ORIG_RAX:
0000000000000003
[ 4044.172668] RAX: ffffffffffffffda RBX: 000000000000007c RCX:
00007f5a96622ec7
[ 4044.172671] RDX: 0000000000000000 RSI: 00007ffcce3d0180 RDI:
000000000000007c
[ 4044.172673] RBP: 000055d29a73aa60 R08: 000055d29a73b676 R09:
0000000000000000
[ 4044.172675] R10: 00007f5a965bbae0 R11: 0000000000000293 R12:
00007f5a95939750
[ 4044.172677] R13: 0000000000000000 R14: 0000000000000001 R15:
00007ffcce3d0180
[ 4057.229953] INFO: task kworker/u32:5:253 blocked for more than 120 secon=
ds.
[ 4057.229957]       Tainted: G        WC=20=20=20=20=20=20=20
4.20.0-0.rc1.git4.1.fc30.x86_64 #1
[ 4057.229959] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs&qu=
ot; disables this
message.
[ 4057.229962] kworker/u32:5   D10872   253      2 0x80000000
[ 4057.229979] Workqueue: events_unbound commit_work [drm_kms_helper]
[ 4057.229982] Call Trace:
[ 4057.229994]  ? __schedule+0x2f3/0xb90
[ 4057.229998]  ? __lock_acquire+0x279/0x1650
[ 4057.230006]  ? dma_fence_default_wait+0x242/0x330
[ 4057.230010]  schedule+0x2f/0x90
[ 4057.230013]  schedule_timeout+0x31c/0x4f0
[ 4057.230017]  ? find_held_lock+0x34/0xa0
[ 4057.230020]  ? find_held_lock+0x34/0xa0
[ 4057.230025]  ? mark_held_locks+0x57/0x80
[ 4057.230028]  ? _raw_spin_unlock_irqrestore+0x4b/0x60
[ 4057.230034]  ? dma_fence_default_wait+0x242/0x330
[ 4057.230037]  dma_fence_default_wait+0x26e/0x330
[ 4057.230041]  ? dma_fence_release+0x120/0x120
[ 4057.230047]  dma_fence_wait_timeout+0x182/0x200
[ 4057.230052]  reservation_object_wait_timeout_rcu+0x236/0x4e0
[ 4057.230134]  amdgpu_dm_do_flip+0x112/0x380 [amdgpu]
[ 4057.230221]  amdgpu_dm_atomic_commit_tail+0x6d0/0xd30 [amdgpu]
[ 4057.230228]  ? _raw_spin_unlock_irq+0x29/0x40
[ 4057.230232]  ? wait_for_completion_timeout+0x73/0x1a0
[ 4057.230249]  commit_tail+0x3d/0x70 [drm_kms_helper]
[ 4057.230254]  process_one_work+0x27d/0x600
[ 4057.230263]  worker_thread+0x3c/0x390
[ 4057.230269]  ? drain_workqueue+0x180/0x180
[ 4057.230272]  kthread+0x120/0x140
[ 4057.230276]  ? kthread_park+0x80/0x80
[ 4057.230281]  ret_from_fork+0x27/0x50
[ 4057.230571]=20
               Showing all locks held in the system:
[ 4057.230581] 1 lock held by khungtaskd/94:
[ 4057.230583]  #0: 00000000a1fc4e6f (rcu_read_lock){....}, at:
debug_show_all_locks+0x15/0x183
[ 4057.230596] 3 locks held by kworker/u32:5/253:
[ 4057.230597]  #0: 00000000156505f1 ((wq_completion)"events_unbound&q=
uot;){+.+.},
at: process_one_work+0x1f3/0x600
[ 4057.230603]  #1: 000000000d248f14
((work_completion)(&state->commit_work)){+.+.}, at:
process_one_work+0x1f3/0x600
[ 4057.230608]  #2: 000000003df03870 (reservation_ww_class_mutex){+.+.}, at:
amdgpu_dm_do_flip+0xd6/0x380 [amdgpu]
[ 4057.230700] 2 locks held by gnome-shell/2152:
[ 4057.230702]  #0: 00000000a2cb2cbf (crtc_ww_class_acquire){+.+.}, at:
drm_mode_cursor_common+0x95/0x220 [drm]
[ 4057.230721]  #1: 00000000e86bda0d (crtc_ww_class_mutex){+.+.}, at:
drm_modeset_lock+0x101/0x120 [drm]
[ 4057.230746] 5 locks held by Xwayland/2222:
[ 4057.230784] 1 lock held by htop/3225:
[ 4057.230848] 1 lock held by CPU 0/KVM/4333:
[ 4057.230989] 1 lock held by (time-dir)/15221:
[ 4057.230991]  #0: 000000006ef8a6af (&mgr->lock){+.+.}, at:
amdgpu_ctx_mgr_entity_flush+0x3c/0xc0 [amdgpu]

[ 4057.231068] =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=


You are receiving this mail because:
  • You are the assignee for the bug.
= --15419367770.9EaECd2F6.9507-- --===============0155461745== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0155461745==--