From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1758193Ab0HJX6o (ORCPT ); Tue, 10 Aug 2010 19:58:44 -0400
Received: from 0122700014.0.fullrate.dk ([95.166.99.235]:38701 "EHLO
        kernel.dk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
        id S1755894Ab0HJX6i (ORCPT ); Tue, 10 Aug 2010 19:58:38 -0400
Message-ID: <4C61E7B1.1090607@kernel.dk>
Date: Tue, 10 Aug 2010 19:58:41 -0400
From: Jens Axboe
MIME-Version: 1.0
To: Jeff Layton
CC: Jeff Moyer, linux-kernel@vger.kernel.org
Subject: Re: cfq: oops in __call_for_each_cic
References: <20100810064045.6996f3b7@tlielax.poochiereds.net>
        <20100810102718.45bddc9d@tlielax.poochiereds.net>
        <4C6179DD.7010000@kernel.dk>
        <20100810123525.3112a382@tlielax.poochiereds.net>
In-Reply-To: <20100810123525.3112a382@tlielax.poochiereds.net>
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On 08/10/2010 12:35 PM, Jeff Layton wrote:
> On Tue, 10 Aug 2010 12:10:05 -0400
> Jens Axboe wrote:
>
>> On 08/10/2010 10:27 AM, Jeff Layton wrote:
>>> On Tue, 10 Aug 2010 10:22:41 -0400
>>> Jeff Moyer wrote:
>>>
>>>> Jeff Layton writes:
>>>>
>>>>> Saw this oops on my test machine this morning. I rebooted the machine
>>>>> last night and hadn't done anything on it other than log in this
>>>>> morning. The kernel here is based on Steve French's git tree, which is
>>>>> based on Linus' as of Sunday Aug 8th. Last non-cifs commit is:
>>>>
>>>> This looks a lot like this bug:
>>>> https://bugzilla.redhat.com/show_bug.cgi?id=577968
>>>>
>>>> See also:
>>>> http://kerneloops.org/guilty.php?guilty=cfq_free_io_context&version=2.6.34-rc&start=2228224&end=2260991&class=oops
>>>>
>>>> It's been around since 2.6.30.8 according to kerneloops.org. If you
>>>> find that you have a reliable way of reproducing the issue, that would
>>>> be great.
>>>>
>>>
>>> Ok, thanks -- no clear reproducer so far. This morning was the
>>> first time I've seen it and it was on the console of my rawhide
>>> machine. The last thing I did with it was reboot it last night. I
>>> suspect that the gzip process came from a cron job or something.
>>
>> What version did you hit it on?
>>
>
> It was a kernel built out of git, based on Steve French's git tree. The
> last commit from Linus in it was
> 45d7f32c7a43cbb9592886d38190e379e2eb2226. Everything else on top of
> that was patches that only touched cifs code. cifs.ko hadn't been
> plugged in since it was rebooted.

OK. That bug is pretty elusive, so far I haven't been able to figure
out what the heck is going on here and my attempts at reproducing
have all failed.

The reports so far seem to have the cron component in common. Does
fedora ionice some cron jobs or anything like that? Or use CLONE_IO?

-- 
Jens Axboe