From mboxrd@z Thu Jan 1 00:00:00 1970
From: bugzilla-daemon@freedesktop.org
Subject: [Bug 105425] 3D & games produce periodic GPU crashes (Radeon R7 370)
Date: Tue, 10 Apr 2018 09:13:14 +0000
Message-ID:
References:
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="===============2133731450=="
Return-path:
Received: from culpepper.freedesktop.org (culpepper.freedesktop.org
[131.252.210.165])
by gabe.freedesktop.org (Postfix) with ESMTP id 0C21F89D77
for ; Tue, 10 Apr 2018 09:13:14 +0000 (UTC)
In-Reply-To:
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: dri-devel-bounces@lists.freedesktop.org
Sender: "dri-devel"
To: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org
--===============2133731450==
Content-Type: multipart/alternative; boundary="15233515930.A18254e10.22713"
Content-Transfer-Encoding: 7bit
--15233515930.A18254e10.22713
Date: Tue, 10 Apr 2018 09:13:13 +0000
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
https://bugs.freedesktop.org/show_bug.cgi?id=3D105425
--- Comment #20 from iive@yahoo.com ---
(In reply to MirceaKitsune from comment #16)
> I have moved on to testing the various kernel parameters available for my
> driver and card. As was pointed out by malcolmlewis on the openSUSE forum=
s,
> they can be listed with the following commands:
>=20
> modinfo amdgpu
> systool -vm amdgpu
>=20
> I tested nearly half of them today, almost none made any difference. There
> were however a few settings that appeared to influence the frequency of t=
he
> freeze. The most notable one of all seems to be the following:
>=20
> amdgpu.moverate=3D4
>=20
> With no parameters changed, the freeze now occurs roughly once per 30
> minutes in Xonotic. With that move rate limited to 4MB/s however, I
> seemingly reduced it to only 90 minutes! The FPS will constantly drop and
> recover, but that makes sense as this setting explicitly limits the buffer
> migration rate.
>=20
> I may test other variables in the days to come, but for now I'm hoping th=
is
> offers at least some clue to get things started. My feeling is that the
> video card may be slowly loaded with information until something fills up,
> or perhaps some events throw too much data in at once and it reaches a
> bottleneck?
You are making a progress.
I just want to give you few tips.
1. You are always using 3D acceleration. The glamor driver that is used by =
XOrg
for 2D (DDX) acceleration is using EGL and shaders for drawing. If you have
composite manager (kde has one), it might do more load on it.
You might try "AccelMethod" "None" in xorg.conf, just to check if it makes =
any
difference. I hope that won't disable OpenGL entirely...
2. My videocard is also Gigabyte. I had it replaced ones, because in the fi=
rst
month my initial card (same model) had major issues. Like not starting up at
boot after few hours of gameplay.
3. On my chip failure the pins affected were these controlling the internal
VideoRAM. If you have chip problems, it might affect other pins first, like=
the
PCIE ones. So HW problem is not ruled out.
4. PCIE standard allows using of less parallel lanes for data transfer. If
broken pins are suspected, moving to 4x slot might alleviate the issue.
BTW, I see that the card is on PCI_ID #3.00.1 , is it in the first slot?
Usually the first slow is 16x and has extra electric power.
5. If you suspect issue with filled RAM, you might try environment variable
"GALLIUM_HUD" it has some GTT displays.
6. In that manner of thinking. Make sure that kernel option for CMA is
disabled... that's been causing me problems every time I enable it. You mig=
ht
also have IOMMU enabled, try disabling it, just for tests.
Once again,=20
Keep digging and good luck.
--=20
You are receiving this mail because:
You are the assignee for the bug.=
--15233515930.A18254e10.22713
Date: Tue, 10 Apr 2018 09:13:13 +0000
MIME-Version: 1.0
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Bugzilla-URL: http://bugs.freedesktop.org/
Auto-Submitted: auto-generated
Comme=
nt # 20
on bug 10542=
5
from iive@yahoo.com
(In reply to MirceaKitsune from comment #16)
> I have moved on to testing the various kernel pa=
rameters available for my
> driver and card. As was pointed out by malcolmlewis on the openSUSE fo=
rums,
> they can be listed with the following commands:
>=20
> modinfo amdgpu
> systool -vm amdgpu
>=20
> I tested nearly half of them today, almost none made any difference. T=
here
> were however a few settings that appeared to influence the frequency o=
f the
> freeze. The most notable one of all seems to be the following:
>=20
> amdgpu.moverate=3D4
>=20
> With no parameters changed, the freeze now occurs roughly once per 30
> minutes in Xonotic. With that move rate limited to 4MB/s however, I
> seemingly reduced it to only 90 minutes! The FPS will constantly drop =
and
> recover, but that makes sense as this setting explicitly limits the bu=
ffer
> migration rate.
>=20
> I may test other variables in the days to come, but for now I'm hoping=
this
> offers at least some clue to get things started. My feeling is that the
> video card may be slowly loaded with information until something fills=
up,
> or perhaps some events throw too much data in at once and it reaches a
> bottleneck?
You are making a progress.
I just want to give you few tips.
1. You are always using 3D acceleration. The glamor driver that is used by =
XOrg
for 2D (DDX) acceleration is using EGL and shaders for drawing. If you have
composite manager (kde has one), it might do more load on it.
You might try "AccelMethod" "None" in xorg.conf, just t=
o check if it makes any
difference. I hope that won't disable OpenGL entirely...
2. My videocard is also Gigabyte. I had it replaced ones, because in the fi=
rst
month my initial card (same model) had major issues. Like not starting up at
boot after few hours of gameplay.
3. On my chip failure the pins affected were these controlling the internal
VideoRAM. If you have chip problems, it might affect other pins first, like=
the
PCIE ones. So HW problem is not ruled out.
4. PCIE standard allows using of less parallel lanes for data transfer. If
broken pins are suspected, moving to 4x slot might alleviate the issue.
BTW, I see that the card is on PCI_ID #3.00.1 , is it in the first slot?
Usually the first slow is 16x and has extra electric power.
5. If you suspect issue with filled RAM, you might try environment variable
"GALLIUM_HUD" it has some GTT displays.
6. In that manner of thinking. Make sure that kernel option for CMA is
disabled... that's been causing me problems every time I enable it. You mig=
ht
also have IOMMU enabled, try disabling it, just for tests.
Once again,=20
Keep digging and good luck.
You are receiving this mail because:
- You are the assignee for the bug.
=
--15233515930.A18254e10.22713--
--===============2133731450==
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: base64
Content-Disposition: inline
X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs
IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz
dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg==
--===============2133731450==--