linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* 3.18-rc regression: drm/nouveau: use shared fences for readable objects
@ 2014-11-19  6:43 Michael Marineau
  2014-11-19  8:10 ` Maarten Lankhorst
  0 siblings, 1 reply; 14+ messages in thread
From: Michael Marineau @ 2014-11-19  6:43 UTC (permalink / raw)
  To: dri-devel; +Cc: linux-kernel, Maarten Lankhorst, Ben Skeggs, David Airlie

On 3.18-rc kernel's I have been intermittently experiencing GPU
lockups shortly after startup, accompanied with one or both of the
following errors:

nouveau E[   PFIFO][0000:01:00.0] read fault at 0x000734a000 [PTE]
from PBDMA0/HOST_CPU on channel 0x007faa3000 [unknown]
nouveau E[     DRM] GPU lockup - switching to software fbcon

I was able to trace the issue with bisect to commit
809e9447b92ffe1346b2d6ec390e212d5307f61c "drm/nouveau: use shared
fences for readable objects". The lockups appear to have cleared up
since reverting that and a few related followup commits:

809e9447: "drm/nouveau: use shared fences for readable objects"
055dffdf: "drm/nouveau: bump driver patchlevel to 1.2.1"
e3be4c23: "drm/nouveau: specify if interruptible wait is desired in
nouveau_fence_sync"
15a996bb: "drm/nouveau: assign fence_chan->name correctly"

For reference here is what the driver reports about my hardware:
nouveau 0000:01:00.0: enabling device (0006 -> 0007)
nouveau  [  DEVICE][0000:01:00.0] BOOT0  : 0x0e7290a2
nouveau  [  DEVICE][0000:01:00.0] Chipset: GK107 (NVE7)
nouveau  [  DEVICE][0000:01:00.0] Family : NVE0
nouveau  [   VBIOS][0000:01:00.0] checking PRAMIN for image...
nouveau  [   VBIOS][0000:01:00.0] ... appears to be valid
nouveau  [   VBIOS][0000:01:00.0] using image from PRAMIN
nouveau  [   VBIOS][0000:01:00.0] BIT signature found
nouveau  [   VBIOS][0000:01:00.0] version 80.07.c7.04.01
nouveau 0000:01:00.0: irq 39 for MSI/MSI-X
nouveau  [     PMC][0000:01:00.0] MSI interrupts enabled
nouveau  [     PFB][0000:01:00.0] RAM type: GDDR5
nouveau  [     PFB][0000:01:00.0] RAM size: 2048 MiB
nouveau  [     PFB][0000:01:00.0]    ZCOMP: 0 tags
nouveau  [  PTHERM][0000:01:00.0] FAN control: none / external
nouveau  [  PTHERM][0000:01:00.0] fan management: automatic
nouveau  [  PTHERM][0000:01:00.0] internal sensor: yes
nouveau  [     CLK][0000:01:00.0] 07: core 270-405 MHz memory 838 MHz
nouveau  [     CLK][0000:01:00.0] 0a: core 270-925 MHz memory 1560 MHz
nouveau  [     CLK][0000:01:00.0] 0e: core 270-925 MHz memory 4000 MHz
nouveau  [     CLK][0000:01:00.0] 0f: core 270-925 MHz memory 5016 MHz
nouveau  [     CLK][0000:01:00.0] --: core 405 MHz memory 680 MHz
nouveau  [     DRM] VRAM: 2048 MiB
nouveau  [     DRM] GART: 1048576 MiB
nouveau  [     DRM] TMDS table version 2.0
nouveau  [     DRM] DCB version 4.0
nouveau  [     DRM] DCB outp 00: 04810fb6 0f230010
nouveau  [     DRM] DCB outp 01: 01821fd6 0f420020
nouveau  [     DRM] DCB outp 02: 01021f12 00020020
nouveau  [     DRM] DCB outp 03: 08832fc6 0f420010
nouveau  [     DRM] DCB outp 04: 08032f02 00020010
nouveau  [     DRM] DCB outp 05: 02843f62 00020010
nouveau  [     DRM] DCB conn 00: 00020047
nouveau  [     DRM] DCB conn 01: 02208146
nouveau  [     DRM] DCB conn 02: 01104246
nouveau  [     DRM] DCB conn 03: 00410361
nouveau  [     DRM] MM: using COPY for buffer copies
nouveau  [     DRM] allocated 2880x1800 fb: 0x80000, bo ffff88046b26f800

-- 
Michael Marineau

^ permalink raw reply	[flat|nested] 14+ messages in thread
* Re: 3.18-rc regression: drm/nouveau: use shared fences for readable objects
@ 2014-11-29  0:51 Ian Kumlien
  0 siblings, 0 replies; 14+ messages in thread
From: Ian Kumlien @ 2014-11-29  0:51 UTC (permalink / raw)
  To: maarten.lankhorst
  Cc: mike, tobias.johannes.klausmann, linux-kernel@vger.kernel.org,
	dri-devel

Hi,

Sorry to but in like this but I'm suffering from the same kind of
deadlocks with nouveau...

The really odd thing is that i could boot some -rc6+ kernel without
problems but it hung
while playing video and then it refused to start properly again.


Anyway, to quote Maarten:
Ok that most likely means the interrupt based wait is borked somehow,
so lets find \
out why..

I fear that this happens because of a race in the interface, so my
first attempt will \
rule out abuse of the nvif api by nouveau_fence.c

Can you test below patch with the default wait function?
---

I tried the patch, straight on Linus' master and it didn't change
anything for me :(

^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2014-11-30 21:10 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-11-19  6:43 3.18-rc regression: drm/nouveau: use shared fences for readable objects Michael Marineau
2014-11-19  8:10 ` Maarten Lankhorst
2014-11-19 15:18   ` Tobias Klausmann
2014-11-19 23:08   ` Tobias Klausmann
2014-11-20  8:41     ` Maarten Lankhorst
2014-11-20  4:06   ` Michael Marineau
2014-11-20  8:53     ` Maarten Lankhorst
     [not found]       ` <CAHW-aUcv3g7mGxmZawe-mfMMdFNnkKep-u-JyH9hhJqoaMM2Mw@mail.gmail.com>
2014-11-22 16:56         ` Maarten Lankhorst
     [not found]           ` <CAHW-aUe8SNbnPwCLsT9rqw3S5meA+feW85UQtWT6HogKsB971Q@mail.gmail.com>
     [not found]             ` <CAHW-aUdPKQf4yU_NnsRxXrFrW2Zi-v9QyajHaYU-exMzsXH4eg@mail.gmail.com>
2014-11-25  7:43               ` Maarten Lankhorst
2014-11-26 20:29                 ` Michael Marineau
2014-11-27  1:18                   ` Tobias Klausmann
2014-11-27  8:33                     ` Maarten Lankhorst
2014-11-30 21:10                       ` Michael Marineau
  -- strict thread matches above, loose matches on Subject: below --
2014-11-29  0:51 Ian Kumlien

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).