From mboxrd@z Thu Jan 1 00:00:00 1970
From: Jesse Barnes
Subject: Re: [Bug #15004] i915: *ERROR* Execbuf while wedged
Date: Mon, 8 Feb 2010 15:39:50 -0800
Message-ID: <20100208153950.73f4e9a5@jbarnes-piketon>
References: <20100205110100.57852c31@jbarnes-piketon>
 <20100205190916.GD2858@think>
 <20100205111747.6619b09b@jbarnes-piketon>
 <20100208172410.GA11311@think>
 <20100208093551.58a10643@jbarnes-piketon>
 <20100208200044.GA2677@think>
Mime-Version: 1.0
Content-Transfer-Encoding: 7bit
Return-path:
In-Reply-To: <20100208200044.GA2677@think>
Sender: kernel-testers-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
List-ID:
Content-Type: text/plain; charset="us-ascii"
To: Chris Mason
Cc: "Rafael J. Wysocki" , Linux Kernel Mailing List ,
 Kernel Testers List , Chris Wilson , tomas m

On Mon, 8 Feb 2010 15:00:44 -0500
Chris Mason wrote:

> On Mon, Feb 08, 2010 at 09:35:51AM -0800, Jesse Barnes wrote:
> > >
> > > Ok, updating to rc7 and updating my xf86 driver to 2.10 seems to
> > > have fixed up my suspend/resume problems. So, I should be able
> > > to trigger the execbuf problem again.
> >
> > I've heard some reports that the 2D driver introduces and fixes
> > hangs, so it's possible 2.10 will fix both issues for you.
> >
> > > Well, the rebooting isn't a huge deal, but if there's something I
> > > can track/kick or force to core dump, would it help?
> >
> > We do have a test in intel-gpu-tools that will instigate a hang,
> > but it would be best to figure out what's causing it in your
> > environment.
> >
>
> [63516.632060] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
> [63516.632069] render error detected, EIR: 0x00000000
> [63516.632092] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 2590581 at 2590579)
>
> So, this popped up while surfing in firefox, which is usually where I
> hit the execbuf errors. X was totally stuck afterwards, but I could
> switch to a vc and get the dmesg.
>
> If there are specific procs that I can try to get traces of, just let
> me know for next time.

Hm, EIR is clear so this may be a failure of our hangcheck timer.
Chris Wilson saw these recently too; hoping he has ideas.

-- 
Jesse Barnes, Intel Open Source Technology Center