From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 102322] System crashes after "[drm] IP block:gmc_v8_0 is hung!" / [drm] IP block:sdma_v3_0 is hung! Date: Wed, 08 Aug 2018 23:07:38 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1186252450==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 806086E614 for ; Wed, 8 Aug 2018 23:07:38 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1186252450== Content-Type: multipart/alternative; boundary="15337696582.f595f250.4642" Content-Transfer-Encoding: 7bit --15337696582.f595f250.4642 Date: Wed, 8 Aug 2018 23:07:38 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D102322 --- Comment #37 from dwagner --- In the related bug report (https://bugs.freedesktop.org/show_bug.cgi?id=3D1= 07152) I noticed that this bug can be triggered very reliably and quickly by playi= ng a video with a deliberately lowered frame rate: "mpv --no-correct-pts --fps=3D3 --ao=3Dnull some_arbitrary_video.webm" This led me to assume this bug might be caused by the dynamic power managem= ent, that often ramps performance up/down when a video is played at such a low f= rame rate. And indeed, I found this confirmed by many experiments: If I use a script l= ike > #!/bin/bash > cd /sys/class/drm/card0/device > echo manual >power_dpm_force_performance_level > # low > echo 0 >pp_dpm_mclk=20 > echo 0 >pp_dpm_sclk > # medium > #echo 1 >pp_dpm_mclk=20 > #echo 1 >pp_dpm_sclk > # high > #echo 1 >pp_dpm_mclk=20 > #echo 6 >pp_dpm_sclk to enforce just any performance level, then the crashes do not occur anymor= e - also with the "low frame rate video test". So it seems that the transition from one "dpm" performance level to another, with a certain probability, causes these crashes. And the more often the transitions occur, the sooner one will experience them. (BTW: For unknown reason, invoking "xrandr" or enabling a monitor after sle= ep causes the above settings to get lost, so one has to invoke above script again.) --=20 You are receiving this mail because: You are the assignee for the bug.= --15337696582.f595f250.4642 Date: Wed, 8 Aug 2018 23:07:38 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 37 on bug 10232= 2 from dwagner
In the related bug report (https://bugs.freedesktop.org/show_bug.=
cgi?id=3D107152)
I noticed that this bug can be triggered very reliably and quickly by playi=
ng a
video with a deliberately lowered frame rate:
 "mpv --no-correct-pts --fps=3D3 --ao=3Dnull some_arbitrary_video.webm=
"

This led me to assume this bug might be caused by the dynamic power managem=
ent,
that often ramps performance up/down when a video is played at such a low f=
rame
rate.

And indeed, I found this confirmed by many experiments: If I use a script l=
ike
> #!/bin/bash
> cd /sys/class/drm/card0/device
> echo manual >power_dpm_force_performance_level
> # low
> echo 0 >pp_dpm_mclk=20
> echo 0 >pp_dpm_sclk
> # medium
> #echo 1 >pp_dpm_mclk=20
> #echo 1 >pp_dpm_sclk
> # high
> #echo 1 >pp_dpm_mclk=20
> #echo 6 >pp_dpm_sclk
to enforce just any performance level, then the crashes do not occur anymor=
e -
also with the "low frame rate video test".

So it seems that the transition from one "dpm" performance level =
to another,
with a certain probability, causes these crashes. And the more often the
transitions occur, the sooner one will experience them.

(BTW: For unknown reason, invoking "xrandr" or enabling a monitor=
 after sleep
causes the above settings to get lost, so one has to invoke above script
again.)


You are receiving this mail because:
  • You are the assignee for the bug.
= --15337696582.f595f250.4642-- --===============1186252450== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1186252450==--