From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 105733] Amdgpu randomly hangs and only ssh works. Mouse cursor moves sometimes but does nothing. Keyboard stops working. Date: Sun, 25 Mar 2018 04:47:54 +0000 Message-ID: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0292459329==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [IPv6:2610:10:20:722:a800:ff:fe98:4b55]) by gabe.freedesktop.org (Postfix) with ESMTP id 56B7B6E0B2 for ; Sun, 25 Mar 2018 04:47:55 +0000 (UTC) List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0292459329== Content-Type: multipart/alternative; boundary="15219532750.99Fa.4871" Content-Transfer-Encoding: 7bit --15219532750.99Fa.4871 Date: Sun, 25 Mar 2018 04:47:55 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D105733 Bug ID: 105733 Summary: Amdgpu randomly hangs and only ssh works. Mouse cursor moves sometimes but does nothing. Keyboard stops working. Product: DRI Version: XOrg git Hardware: x86-64 (AMD64) OS: Linux (All) Status: NEW Severity: critical Priority: medium Component: DRM/AMDgpu Assignee: dri-devel@lists.freedesktop.org Reporter: allan4229@gmail.com Created attachment 138344 --> https://bugs.freedesktop.org/attachment.cgi?id=3D138344&action=3Dedit dmesg, killing pids, shutting down, unloading amdgpu, xorg log WHAT HAPPENS - Amdgpu hangs without any clear clue of what is happening. - The mouse cursor responds to movements when the system is not frozen, but also it does nothing as well. - The keyboard gets num lock frozen and even trying with a ps2 one does not work. - The video gets frozen. - Only ssh works, but only the times that the system is not frozen, of cour= se. - The most irritating part : the system can not be shutdown. No matter what= you do : -- If you press the power button from the case, it is the only answer that = you can get from the output display : it shows a console indicating that x-serv= er is trying to be turned off. But nothing else happens and the system can't be turned off. -- If you try anything from ssh : "init 0", "poweroff", "shutdown -P 0 -h", "reboot". It simply does not work. It keeps waiting for something that never happens. Then you have to press ctrl_c to get back to the ssh sessioon. In = an attempt it closed the ssh daemon but the shutdown itself never happened... = even after 30mins. -- It is IMPOSSIBLE to force unload amdgpu using "rmmod -f amdgpu". The task takes forever and never responds. It only hangs the ssh session. -- It is IMPOSSIBLE to kill some x-related pids properly. If you try to kil= l it either nothing will happen or the process will be in a defunct state. Not e= ven a "su -c 'kill -9 '" will work. TIPS - The crashes that allows ssh connection almost always happens when firefox= is openned and running a video (netflix, youtube) or whatsapp web. - The crashes that simply hangs the entire computer may occur at any time. OBSERVATIONS - I use a custom kernel (from 4.15). I've tried including the polaris binar= ies for my card, that showed an improvement (less freeze states) for a while. B= ut now it is the same again. - I use a nvidia io second pci-e slot for vfio. It is a must and I disable nouveau as well... It shoud not be a reason for failing. I tried also with another amd/none-card on second slot. The results were the same as I rememb= er. SYSTEM SPECS - Custom kernel compilation optimized for ryzen (https://wiki.gentoo.org/wiki/Ryzen) and using polaris binaries (https://wiki.gentoo.org/wiki/AMDGPU) - Chipset X370 (mobo) - RX480 in first slot - GTX 1070 on second slot. - Tried also with a RX 580 on second slot. - Tried also with nothing on second slot. - i3wm loading from startx command --=20 You are receiving this mail because: You are the assignee for the bug.= --15219532750.99Fa.4871 Date: Sun, 25 Mar 2018 04:47:55 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated
Bug ID 105733
Summary Amdgpu randomly hangs and only ssh works. Mouse cursor moves = sometimes but does nothing. Keyboard stops working.
Product DRI
Version XOrg git
Hardware x86-64 (AMD64)
OS Linux (All)
Status NEW
Severity critical
Priority medium
Component DRM/AMDgpu
Assignee dri-devel@lists.freedesktop.org
Reporter allan4229@gmail.com

Created attachment 138344 [details]
dmesg, killing pids, shutting down, unloading amdgpu, xorg log

WHAT HAPPENS
- Amdgpu hangs without any clear clue of what is happening.
- The mouse cursor responds to movements when the system is not frozen, but
also it does nothing as well.
- The keyboard gets num lock frozen and even trying with a ps2 one does not
work.
- The video gets frozen.
- Only ssh works, but only the times that the system is not frozen, of cour=
se.
- The most irritating part : the system can not be shutdown. No matter what=
 you
do :
-- If you press the power button from the case, it is the only answer that =
you
can get from the output display : it shows a console indicating that x-serv=
er
is trying to be turned off. But nothing else happens and the system can't be
turned off.
-- If you try anything from ssh : "init 0", "poweroff",=
 "shutdown -P 0 -h",
"reboot". It simply does not work. It keeps waiting for something=
 that never
happens. Then you have to press ctrl_c to get back to the ssh sessioon. In =
an
attempt it closed the ssh daemon but the shutdown itself never happened... =
even
after 30mins.
-- It is IMPOSSIBLE to force unload amdgpu using "rmmod -f amdgpu"=
;. The task
takes forever and never responds. It only hangs the ssh session.
-- It is IMPOSSIBLE to kill some x-related pids properly. If you try to kil=
l it
either nothing will happen or the process will be in a defunct state. Not e=
ven
a "su -c 'kill -9 <pid>'" will work.

TIPS
- The crashes that allows ssh connection almost always happens when firefox=
 is
openned and running a video (netflix, youtube) or whatsapp web.
- The crashes that simply hangs the entire computer may occur at any time.

OBSERVATIONS
- I use a custom kernel (from 4.15). I've tried including the polaris binar=
ies
for my card, that showed an improvement (less freeze states) for a while. B=
ut
now it is the same again.
- I use a nvidia io second pci-e slot for vfio. It is a must and I disable
nouveau as well... It shoud not be a reason for failing. I tried also with
another amd/none-card on second slot. The results were the same as I rememb=
er.

SYSTEM SPECS
- Custom kernel compilation optimized for ryzen
(https://wiki.gentoo.org/wik=
i/Ryzen) and using polaris binaries
(https://wiki.gentoo.org/wi=
ki/AMDGPU)
- Chipset X370 (mobo)
- RX480 in first slot
- GTX 1070 on second slot.
- Tried also with a RX 580 on second slot.
- Tried also with nothing on second slot.
- i3wm loading from startx command


You are receiving this mail because:
  • You are the assignee for the bug.
= --15219532750.99Fa.4871-- --===============0292459329== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0292459329==--