From: Daniel Vetter <daniel@ffwll.ch>
To: Eugeni Dodonov <eugeni@dodonov.net>
Cc: Daniel Vetter <daniel.vetter@ffwll.ch>,
Intel Graphics Development <intel-gfx@lists.freedesktop.org>
Subject: Re: [PATCH] drm/i915: add interface to simulate gpu hangs
Date: Sat, 5 May 2012 21:13:10 +0200 [thread overview]
Message-ID: <20120505191310.GB4985@phenom.ffwll.local> (raw)
In-Reply-To: <CAC7Lmns=YBtu1TG9Zu4v6SwFn9Vac4f3OqH-sk=pH+YuKy-MoA@mail.gmail.com>
On Thu, May 03, 2012 at 04:00:00PM -0300, Eugeni Dodonov wrote:
> On Thu, May 3, 2012 at 9:48 AM, Daniel Vetter <daniel.vetter@ffwll.ch>wrote:
>
> > gpu reset is a very important piece of our infrastructure.
> > Unfortunately we only really it test by actually hanging the gpu,
> > which often has bad side-effects for the entire system. And the gpu
> > hang handling code is one of the rather complicated pieces of code we
> > have, consisting of
> > - hang detection
> > - error capture
> > - actual gpu reset
> > - reset of all the gem bookkeeping
> > - reinitialition of the entire gpu
> >
> > This patch adds a debugfs to selectively stopping rings by ceasing to
> > update the hw tail pointer, which will result in the gpu no longer
> > updating it's head pointer and eventually to the hangcheck firing.
> > This way we can exercise the gpu hang code under controlled conditions
> > without a dying gpu taking down the entire systems.
> >
> > Patch motivated by me forgetting to properly reinitialize ppgtt after
> > a gpu reset.
> >
> > Usage:
> >
> > echo $((1 << $ringnum)) > i915_ring_stop # stops one ring
> >
> > echo 0xffffffff > i915_ring_stop # stops all, future-proof version
> >
> > then run whatever testload is desired. i915_ring_stop automatically
> > resets after a gpu hang is detected to avoid hanging the gpu to fast
> > and declaring it wedged.
> >
> > v2: Incorporate feedback from Chris Wilson.
> >
> > v3: Add the missing cleanup.
> >
> > v4: Fix up inconsistent size of ring_stop_read vs _write, noticed by
> > Eugeni Dodonov.
> >
> > Signed-Off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
> > Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
> >
>
> Reviewed-by: Eugeni Dodonov <eugeni.dodonov@intel.com>
I've slurped the hangman into -next, thanks for the review.
-Daniel
--
Daniel Vetter
Mail: daniel@ffwll.ch
Mobile: +41 (0)79 365 57 48
next prev parent reply other threads:[~2012-05-05 19:12 UTC|newest]
Thread overview: 35+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-04-27 13:17 [PATCH 01/10] drm/i915: add interface to simulate gpu hangs Daniel Vetter
2012-04-27 13:17 ` [PATCH 02/10] drm/i915: rework dev->first_error locking Daniel Vetter
2012-05-04 17:04 ` Eugeni Dodonov
2012-04-27 13:17 ` [PATCH 03/10] drm/i915: allow the existing error_state to be destroyed Daniel Vetter
2012-05-04 11:56 ` Daniel Vetter
2012-05-04 17:15 ` Eugeni Dodonov
2012-05-04 19:58 ` Daniel Vetter
2012-04-27 13:17 ` [PATCH 04/10] drm/i915: simplify i915_reset a bit Daniel Vetter
2012-05-04 16:47 ` Eugeni Dodonov
2012-04-27 13:17 ` [PATCH 05/10] drm/i915: extract intel_gpu_reset Daniel Vetter
2012-04-30 1:03 ` Ben Widawsky
2012-05-04 16:51 ` Eugeni Dodonov
2012-04-27 13:17 ` [PATCH 06/10] drm/i915: make gpu hangman more resilient Daniel Vetter
2012-05-04 16:47 ` Eugeni Dodonov
2012-04-27 13:17 ` [PATCH 07/10] drm/i915: kill flags parameter for reset functions Daniel Vetter
2012-05-04 16:47 ` Eugeni Dodonov
2012-04-27 13:17 ` [PATCH 08/10] drm/i915: also reset the media engine on gen4/5 Daniel Vetter
2012-04-27 13:17 ` [PATCH 09/10] drm/i915: remove modeset reset from i915_reset Daniel Vetter
2012-04-27 13:17 ` [PATCH 10/10] drm/i915: kill gen4 gpu reset code Daniel Vetter
2012-04-27 18:49 ` Eric Anholt
2012-04-27 19:17 ` Daniel Vetter
2012-04-27 23:26 ` Eric Anholt
2012-05-02 19:33 ` [PATCH] drm/i915: fix gen4 gpu reset Daniel Vetter
2012-05-02 19:54 ` Kenneth Graunke
2012-05-04 12:07 ` Daniel Vetter
2012-05-04 17:06 ` Eugeni Dodonov
2012-04-28 4:56 ` [PATCH 01/10] drm/i915: add interface to simulate gpu hangs Ben Widawsky
2012-05-02 15:23 ` Daniel Vetter
2012-05-03 12:48 ` [PATCH] " Daniel Vetter
2012-05-03 19:00 ` Eugeni Dodonov
2012-05-05 19:13 ` Daniel Vetter [this message]
-- strict thread matches above, loose matches on Subject: below --
2011-11-10 13:18 [PATCH 3/9] " Daniel Vetter
2011-11-10 16:34 ` [PATCH] " Daniel Vetter
2011-12-02 22:21 ` Daniel Vetter
2011-12-03 1:33 ` Chris Wilson
2011-12-05 23:20 ` Ben Widawsky
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20120505191310.GB4985@phenom.ffwll.local \
--to=daniel@ffwll.ch \
--cc=daniel.vetter@ffwll.ch \
--cc=eugeni@dodonov.net \
--cc=intel-gfx@lists.freedesktop.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox