public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* i915: GPU hang
@ 2012-12-17 22:36 Guennadi Liakhovetski
  2012-12-17 23:15 ` Heinz Diehl
  2012-12-17 23:15 ` Daniel Vetter
  0 siblings, 2 replies; 6+ messages in thread
From: Guennadi Liakhovetski @ 2012-12-17 22:36 UTC (permalink / raw)
  To: dri-devel; +Cc: Daniel Vetter, linux-kernel, David Airlie

Hi all

Sorry, not sure what information is most appropriate here. GPU hangs from 
time to time on this laptop, typically when running firefox on 
graphics-intensive sites. Error log at the bottom. Distro is Debian 6.0.6 
(squeeze), lspci

00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 
Integrated Graphics Controller (rev 03) (prog-if 00 [VGA controller])
	Subsystem: Acer Incorporated [ALI] Device 011f
	Flags: bus master, fast devsel, latency 0, IRQ 46
	Memory at fc000000 (64-bit, non-prefetchable) [size=1M]
	Memory at d0000000 (64-bit, prefetchable) [size=256M]
	I/O ports at 1800 [size=8]
	Expansion ROM at <unassigned> [disabled]
	Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit-
	Capabilities: [d0] Power Management version 3
	Kernel driver in use: i915

kernel version 3.5.2 (yes, I'll try to upgrade). Sometimes also the 
X-server freezes and restarts with no errors in dmesg. Is it a known 
problem?

Thanks
Guennadi
---
Guennadi Liakhovetski, Ph.D.
Freelance Open-Source Software Developer
http://www.open-technology.de/

[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[drm:i915_reset] *ERROR* Failed to reset chip.
------------[ cut here ]------------
WARNING: at drivers/gpu/drm/i915/intel_display.c:906 intel_enable_pipe+0x16c/0x1c0 [i915]()
Hardware name: Extensa 5220                   
PLL state assertion failure (expected on, current off)
Modules linked in: nls_iso8859_1 nls_cp437 vfat fat usb_storage tun crypto_blkcipher aes_generic mperf cpufreq_powersave cpufreq_userspace cpufreq_stats i915 drm_kms_helper bnep rfcomm bluetooth crc16 fuse cryptomgr aead arc4 crypto_algapi rt2800usb rt2800lib crc_ccitt rt2x00usb rt2x00lib mac80211 cfg80211 uhci_hcd sg ehci_hcd tg3 sr_mod tifm_7xx1 usbcore cdrom coretemp snd_hda_codec_realtek pcmcia psmouse tifm_core i2c_i801 yenta_socket pcmcia_rsrc snd_hda_intel hwmon libphy snd_hda_codec usb_common pcmcia_core sdhci_pci sdhci mmc_core
Pid: 1780, comm: Xorg Not tainted 3.5.2 #32
Call Trace:
[<c1029958>] ? warn_slowpath_common+0x78/0xb0
[<f83bb4bc>] ? intel_enable_pipe+0x16c/0x1c0 [i915]
[<f83bb4bc>] ? intel_enable_pipe+0x16c/0x1c0 [i915]
[<c1029a23>] ? warn_slowpath_fmt+0x33/0x40
[<f83bb4bc>] ? intel_enable_pipe+0x16c/0x1c0 [i915]
[<f83c0d32>] ? i9xx_crtc_mode_set+0x942/0x1070 [i915]
[<f83bd03b>] ? intel_crtc_mode_set+0x5b/0x90 [i915]
[<f8118cd8>] ? drm_crtc_helper_set_mode+0x358/0x4a0 [drm_kms_helper]
[<f83bf75c>] ? intel_get_load_detect_pipe+0x22c/0x3a0 [i915]
[<c1096bb3>] ? find_get_page+0x63/0xc0
[<c10f572a>] ? __find_get_block+0x9a/0x1e0
[<c10f522a>] ? __find_get_block_slow+0xfa/0x1b0
[<f83d7d9b>] ? intel_tv_detect+0x9b/0x460 [i915]
[<c10f588b>] ? __getblk+0x1b/0x320
[<c1215c01>] ? drm_get_connector_name+0x31/0x50
[<f8119cc8>] ? drm_helper_probe_single_connector_modes+0x258/0x310 [drm_kms_helper]
[<c1215acf>] ? drm_mode_getconnector+0x33f/0x370
[<c1206223>] ? drm_ioctl+0x263/0x4e0
[<c1215790>] ? drm_mode_getencoder+0xb0/0xb0
[<c1205fc0>] ? drm_version+0x90/0x90
[<c10dbecd>] ? do_vfs_ioctl+0x8d/0x600
[<c11032d6>] ? fsnotify+0x196/0x2c0
[<c10e69c0>] ? alloc_fd+0xc0/0x110
[<c10cc8c7>] ? vfs_write+0x107/0x150
[<c10dc47d>] ? sys_ioctl+0x3d/0x70
[<c133c92c>] ? sysenter_do_call+0x12/0x22
---[ end trace 81ff939dea1daea3 ]---

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: i915: GPU hang
  2012-12-17 22:36 i915: GPU hang Guennadi Liakhovetski
@ 2012-12-17 23:15 ` Heinz Diehl
  2012-12-17 23:17   ` Daniel Vetter
  2012-12-17 23:15 ` Daniel Vetter
  1 sibling, 1 reply; 6+ messages in thread
From: Heinz Diehl @ 2012-12-17 23:15 UTC (permalink / raw)
  To: linux-kernel; +Cc: dri-devel, Daniel Vetter, David Airlie

On 17.12.2012, Guennadi Liakhovetski wrote: 

> [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
> [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
> [drm:i915_reset] *ERROR* Failed to reset chip.

I have the same problem, are able to reproduce it and have bisected
it, but the commit which git --bisect identified seems not to be the
cause.

root@wildsau linux-git]# git bisect good
6c085a728cf000ac1865d66f8c9b52935558b328 is the first bad commit
commit 6c085a728cf000ac1865d66f8c9b52935558b328
Author: Chris Wilson <ch...@chris-wilson.co.uk>
Date:   Mon Aug 20 11:40:46 2012 +0200

    drm/i915: Track unbound pages


This is a quite nasty (3.7) regression. I have it on all of my three
machines and it drives me mad (3.6.x hangs my USB 3.0 port and 3.7 my
intel graphics).

Try to boot with "i915.i915_enable_rc6=0" and switch to SNA in your
Xorg.conf:

Section "Device"
   Identifier "Card0"
   Driver "intel"
   Option "AccelMethod" "SNA"
EndSection

There are tons of this "GPU hangcheck timer elapsed"
messages on the net...

Good luck!


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: i915: GPU hang
  2012-12-17 22:36 i915: GPU hang Guennadi Liakhovetski
  2012-12-17 23:15 ` Heinz Diehl
@ 2012-12-17 23:15 ` Daniel Vetter
  2012-12-30 20:53   ` Guennadi Liakhovetski
  1 sibling, 1 reply; 6+ messages in thread
From: Daniel Vetter @ 2012-12-17 23:15 UTC (permalink / raw)
  To: Guennadi Liakhovetski; +Cc: dri-devel, linux-kernel, David Airlie

On Mon, Dec 17, 2012 at 11:36 PM, Guennadi Liakhovetski
<g.liakhovetski@gmx.de> wrote:
> Sorry, not sure what information is most appropriate here. GPU hangs from
> time to time on this laptop, typically when running firefox on
> graphics-intensive sites. Error log at the bottom. Distro is Debian 6.0.6
> (squeeze), lspci
>
> 00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960
> Integrated Graphics Controller (rev 03) (prog-if 00 [VGA controller])
>         Subsystem: Acer Incorporated [ALI] Device 011f
>         Flags: bus master, fast devsel, latency 0, IRQ 46
>         Memory at fc000000 (64-bit, non-prefetchable) [size=1M]
>         Memory at d0000000 (64-bit, prefetchable) [size=256M]
>         I/O ports at 1800 [size=8]
>         Expansion ROM at <unassigned> [disabled]
>         Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit-
>         Capabilities: [d0] Power Management version 3
>         Kernel driver in use: i915
>
> kernel version 3.5.2 (yes, I'll try to upgrade). Sometimes also the
> X-server freezes and restarts with no errors in dmesg. Is it a known
> problem?

Upgrading to 3.7 highly recommended, if just to get the recent pile of
gpu reset fixes - those should increase the chances that the reset
works and your system survives. Also worth testing is upgrading
userspace gfx driver components to the latest versions (mesa +
xf86-video-intel). If you still have gpu hangs with all that, please
file a bug report against DRM -> DRI/Intel and please attach the
i915_error_state from debugfs after your gpu hung).

Yours, Daniel


-- 
Daniel Vetter
Software Engineer, Intel Corporation
+41 (0) 79 365 57 48 - http://blog.ffwll.ch

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: i915: GPU hang
  2012-12-17 23:15 ` Heinz Diehl
@ 2012-12-17 23:17   ` Daniel Vetter
  0 siblings, 0 replies; 6+ messages in thread
From: Daniel Vetter @ 2012-12-17 23:17 UTC (permalink / raw)
  To: Heinz Diehl; +Cc: linux-kernel, dri-devel, David Airlie

On Tue, Dec 18, 2012 at 12:15 AM, Heinz Diehl <htd@fancy-poultry.org> wrote:
>> [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
>> [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
>> [drm:i915_reset] *ERROR* Failed to reset chip.
>
> I have the same problem, are able to reproduce it and have bisected
> it, but the commit which git --bisect identified seems not to be the
> cause.
>
> root@wildsau linux-git]# git bisect good
> 6c085a728cf000ac1865d66f8c9b52935558b328 is the first bad commit
> commit 6c085a728cf000ac1865d66f8c9b52935558b328
> Author: Chris Wilson <ch...@chris-wilson.co.uk>
> Date:   Mon Aug 20 11:40:46 2012 +0200
>
>     drm/i915: Track unbound pages

Note that Guennadi is running kernel 3.5 which does not contain that
patch - generally there are billions of ways to hang your gpu, so
every gpu hang needs it's own bug (until we've analyzed the
error_state and triaged the bug taking other evidence into account).
Also, this is on a different gpu generation, so even more likely that
it's a different kind of hang.

Thanks, Daniel


-- 
Daniel Vetter
Software Engineer, Intel Corporation
+41 (0) 79 365 57 48 - http://blog.ffwll.ch

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: i915: GPU hang
  2012-12-17 23:15 ` Daniel Vetter
@ 2012-12-30 20:53   ` Guennadi Liakhovetski
  2012-12-30 22:53     ` Heinz Diehl
  0 siblings, 1 reply; 6+ messages in thread
From: Guennadi Liakhovetski @ 2012-12-30 20:53 UTC (permalink / raw)
  To: Daniel Vetter; +Cc: dri-devel, linux-kernel, David Airlie

Hi Daniel

Sorry for a late reply.

On Tue, 18 Dec 2012, Daniel Vetter wrote:

> On Mon, Dec 17, 2012 at 11:36 PM, Guennadi Liakhovetski
> <g.liakhovetski@gmx.de> wrote:
> > Sorry, not sure what information is most appropriate here. GPU hangs from
> > time to time on this laptop, typically when running firefox on
> > graphics-intensive sites. Error log at the bottom. Distro is Debian 6.0.6
> > (squeeze), lspci
> >
> > 00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960
> > Integrated Graphics Controller (rev 03) (prog-if 00 [VGA controller])
> >         Subsystem: Acer Incorporated [ALI] Device 011f
> >         Flags: bus master, fast devsel, latency 0, IRQ 46
> >         Memory at fc000000 (64-bit, non-prefetchable) [size=1M]
> >         Memory at d0000000 (64-bit, prefetchable) [size=256M]
> >         I/O ports at 1800 [size=8]
> >         Expansion ROM at <unassigned> [disabled]
> >         Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit-
> >         Capabilities: [d0] Power Management version 3
> >         Kernel driver in use: i915
> >
> > kernel version 3.5.2 (yes, I'll try to upgrade). Sometimes also the
> > X-server freezes and restarts with no errors in dmesg. Is it a known
> > problem?
> 
> Upgrading to 3.7 highly recommended, if just to get the recent pile of
> gpu reset fixes - those should increase the chances that the reset
> works and your system survives.

Did that and it did work for a while, longer than the average with 3.5. I 
was already about to write a success report, but then it hung again 
yesterday. I'm not using this laptop very intensively, so, it is hard to 
collect statistics.

> Also worth testing is upgrading
> userspace gfx driver components to the latest versions (mesa +
> xf86-video-intel).

Not sure I'll be bothered - recompiling user-space is too much of a hastle 
;-) Maybe I'll just upgrade my distro altogether. Thanks for the tips 
anyway!

> If you still have gpu hangs with all that, please
> file a bug report against DRM -> DRI/Intel and please attach the
> i915_error_state from debugfs after your gpu hung).
> 
> Yours, Daniel

Thanks
Guennadi
---
Guennadi Liakhovetski, Ph.D.
Freelance Open-Source Software Developer
http://www.open-technology.de/

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: i915: GPU hang
  2012-12-30 20:53   ` Guennadi Liakhovetski
@ 2012-12-30 22:53     ` Heinz Diehl
  0 siblings, 0 replies; 6+ messages in thread
From: Heinz Diehl @ 2012-12-30 22:53 UTC (permalink / raw)
  To: Guennadi Liakhovetski
  Cc: Daniel Vetter, dri-devel, linux-kernel, David Airlie

On 30.12.2012, Guennadi Liakhovetski wrote: 

> Did that and it did work for a while, longer than the average with 3.5. I 
> was already about to write a success report, but then it hung again 
> yesterday. I'm not using this laptop very intensively, so, it is hard to 
> collect statistics.

You could try to reproduce the error by writing a big file e.g.

 dd if=/dev/zero of=deleteme bs=1M count=80000

or similar and watching high definition video on Youtube (1080p) or running a few
instances of glxgears. That triggers a gpu hang in my case after
just a couple of seconds.

In my case, the hang doesn't occur when using SNA (or a kernel < 3.7,
which isn't the case with your bug). I have this in my
xorg.conf:

Section "Device"
   Identifier "Card0"
   Driver "intel"
   Option "AccelMethod" "SNA"
EndSection

Without this, every 3.7 kernel produces a gpu hang within max. 1 min.

 

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2012-12-30 22:53 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-12-17 22:36 i915: GPU hang Guennadi Liakhovetski
2012-12-17 23:15 ` Heinz Diehl
2012-12-17 23:17   ` Daniel Vetter
2012-12-17 23:15 ` Daniel Vetter
2012-12-30 20:53   ` Guennadi Liakhovetski
2012-12-30 22:53     ` Heinz Diehl

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox