From mboxrd@z Thu Jan 1 00:00:00 1970 From: Chris Wilson Subject: Re: I've got the RC6 bug Date: Wed, 18 Jan 2012 11:17:52 +0000 Message-ID: References: <20120116163338.GA3627@phenom.ffwll.local> <20120118002426.GB4093@phenom.ffwll.local> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: Received: from mga09.intel.com (mga09.intel.com [134.134.136.24]) by gabe.freedesktop.org (Postfix) with ESMTP id BC84C9E908 for ; Wed, 18 Jan 2012 03:18:03 -0800 (PST) In-Reply-To: <20120118002426.GB4093@phenom.ffwll.local> List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: intel-gfx-bounces+gcfxdi-intel-gfx=m.gmane.org@lists.freedesktop.org Errors-To: intel-gfx-bounces+gcfxdi-intel-gfx=m.gmane.org@lists.freedesktop.org To: Daniel Vetter , CC Cc: intel-gfx@lists.freedesktop.org, Ben Widawsky List-Id: intel-gfx@lists.freedesktop.org On Wed, 18 Jan 2012 01:24:26 +0100, Daniel Vetter wrote: > On Wed, Jan 18, 2012 at 01:16:02AM +0100, CC wrote: > > I attached the error state. > > Nice one, your gpu seems to have simply disappeared. And the ringbuffer > contains a rather peculiar cmd sequence. Putting Chris (maybe he > recognizes the pattern) and Ben (he's got a patch in the works to dump a > debug register that might be interesting here) on cc. It's too late atm > for me to think about this some more. Not simply disappeared, someone clobbered it with an extremely large hammer. The GPU was killed by a stray write to address 0 which took out the render ring buffer and its hws page. So my first thought is a missing relocation, and i965g springs to mind. -Chris -- Chris Wilson, Intel Open Source Technology Centre