All of lore.kernel.org
 help / color / mirror / Atom feed
From: Chris Wilson <chris@chris-wilson.co.uk>
To: Norbert Preining <preining@logic.at>, Dave Airlie <airlied@gmail.com>
Cc: Daniel Vetter <daniel.vetter@ffwll.ch>,
	linux-kernel@vger.kernel.org, dri-devel@lists.freedesktop.org
Subject: Re: drm i915 hangs on heavy io load
Date: Wed, 24 Oct 2012 09:11:03 +0100	[thread overview]
Message-ID: <b94cdc$72fm1r@fmsmga001.fm.intel.com> (raw)
In-Reply-To: <20121024003659.GA30962@gamma.logic.tuwien.ac.at>

On Wed, 24 Oct 2012 09:36:59 +0900, Norbert Preining <preining@logic.at> wrote:
> Hi Dave, hi Chris,
> 
> thanks for your answers.
> 
> On Di, 23 Okt 2012, Dave Airlie wrote:
> > Does booting with i915.i915_enable_rc6=0 help?
> 
> No,booted with that, it happened again on a completely idle
> system (well, I believe completely idle, I was doing the
> dishes ;-)
> 
> [12437.995026] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
> [12437.995034] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
> [12438.000213] [drm:init_ring_common] *ERROR* failed to set render ring head to zero ctl 00000000 head 5ee06f14 tail 00000000 start 00003000
> [12438.054894] [drm:init_ring_common] *ERROR* render ring initialization failed ctl 0001f001 head 5ee06f14 tail 00000000 start 00003000
> [12439.583064] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
> [12439.583176] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
> [12439.583182] [drm:i915_reset] *ERROR* Failed to reset chip.
> 
> New output see here:
> http://www.logic.at/people/preining/i915_error_state.gz

That has a very similar look to it, so reasonable to assume that is the
same issue.
 
> > http://cgit.freedesktop.org/~danvet/drm/commit/?h=ilk-wa-pile&id=0d5fed2de763b49bb1a90140758153481f043757
> > is the missing ingredient.
> 
> I am compiling a kernel with this patch based on current git now.
> Should I still use the above kernel cmd argument (i915...rc6=0)
> or try without it?

Without any rc6 parameter would be best. But if rc6=0 wasn't the
solution for you, then I may have identified the wrong w/a. Can I ask
you try the patches in that branch until you find one (or more perhaps)
that stabilise your system?
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre

WARNING: multiple messages have this Message-ID (diff)
From: Chris Wilson <chris@chris-wilson.co.uk>
To: Norbert Preining <preining@logic.at>, Dave Airlie <airlied@gmail.com>
Cc: linux-kernel@vger.kernel.org,
	Daniel Vetter <daniel.vetter@ffwll.ch>,
	dri-devel@lists.freedesktop.org
Subject: Re: drm i915 hangs on heavy io load
Date: Wed, 24 Oct 2012 09:11:03 +0100	[thread overview]
Message-ID: <b94cdc$72fm1r@fmsmga001.fm.intel.com> (raw)
In-Reply-To: <20121024003659.GA30962@gamma.logic.tuwien.ac.at>

On Wed, 24 Oct 2012 09:36:59 +0900, Norbert Preining <preining@logic.at> wrote:
> Hi Dave, hi Chris,
> 
> thanks for your answers.
> 
> On Di, 23 Okt 2012, Dave Airlie wrote:
> > Does booting with i915.i915_enable_rc6=0 help?
> 
> No,booted with that, it happened again on a completely idle
> system (well, I believe completely idle, I was doing the
> dishes ;-)
> 
> [12437.995026] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
> [12437.995034] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
> [12438.000213] [drm:init_ring_common] *ERROR* failed to set render ring head to zero ctl 00000000 head 5ee06f14 tail 00000000 start 00003000
> [12438.054894] [drm:init_ring_common] *ERROR* render ring initialization failed ctl 0001f001 head 5ee06f14 tail 00000000 start 00003000
> [12439.583064] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
> [12439.583176] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
> [12439.583182] [drm:i915_reset] *ERROR* Failed to reset chip.
> 
> New output see here:
> http://www.logic.at/people/preining/i915_error_state.gz

That has a very similar look to it, so reasonable to assume that is the
same issue.
 
> > http://cgit.freedesktop.org/~danvet/drm/commit/?h=ilk-wa-pile&id=0d5fed2de763b49bb1a90140758153481f043757
> > is the missing ingredient.
> 
> I am compiling a kernel with this patch based on current git now.
> Should I still use the above kernel cmd argument (i915...rc6=0)
> or try without it?

Without any rc6 parameter would be best. But if rc6=0 wasn't the
solution for you, then I may have identified the wrong w/a. Can I ask
you try the patches in that branch until you find one (or more perhaps)
that stabilise your system?
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre

  reply	other threads:[~2012-10-24  8:11 UTC|newest]

Thread overview: 28+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-10-23  5:38 drm i915 hangs on heavy io load Norbert Preining
2012-10-23  6:56 ` Dave Airlie
2012-10-23  6:56 ` Dave Airlie
2012-10-23  7:24   ` Norbert Preining
2012-10-24  0:36   ` Norbert Preining
2012-10-24  8:11     ` Chris Wilson [this message]
2012-10-24  8:11       ` Chris Wilson
2012-10-28  2:47       ` Norbert Preining
2012-10-28 11:10         ` Chris Wilson
2012-10-28 12:32           ` Norbert Preining
2012-10-29  7:17             ` Tino Keitel
2012-10-30  0:49               ` Norbert Preining
2012-10-30  0:55                 ` Dave Airlie
2012-10-30  1:01                   ` Norbert Preining
2012-10-30  1:37                     ` Ben Widawsky
2012-10-30  3:13                       ` Norbert Preining
2012-11-04  0:44                   ` Norbert Preining
2012-11-04  6:08                     ` Dave Airlie
2012-11-05  0:33                       ` Norbert Preining
2012-11-05  0:33                         ` Norbert Preining
2012-11-05 20:29                       ` [bisected] " Lekensteyn
2012-10-30  0:39           ` Norbert Preining
2012-10-30 10:02             ` Chris Wilson
2012-10-23  6:56 ` Dave Airlie
2012-10-23  9:17 ` Chris Wilson
  -- strict thread matches above, loose matches on Subject: below --
2012-10-23  5:38 Norbert Preining
2012-10-23  5:38 Norbert Preining
2012-10-23  5:38 Norbert Preining

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='b94cdc$72fm1r@fmsmga001.fm.intel.com' \
    --to=chris@chris-wilson.co.uk \
    --cc=airlied@gmail.com \
    --cc=daniel.vetter@ffwll.ch \
    --cc=dri-devel@lists.freedesktop.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=preining@logic.at \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.