From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: from mail138.messagelabs.com (mail138.messagelabs.com [216.82.249.35])
	by kanga.kvack.org (Postfix) with ESMTP id E117C8D0039
	for ; Mon, 17 Jan 2011 05:28:09 -0500 (EST)
Date: Mon, 17 Jan 2011 10:27:44 +0000
From: Mel Gorman
Subject: Re: hunting an IO hang
Message-ID: <20110117102744.GA27152@csn.ul.ie>
References: <1295225684-sup-7168@think> <4D339C87.30100@fusionio.com> <1295228148-sup-7379@think> <1295229722-sup-6494@think> <20110116183000.cc632557.akpm@linux-foundation.org> <1295231547-sup-8036@think>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-15
Content-Disposition: inline
In-Reply-To: <1295231547-sup-8036@think>
Sender: owner-linux-mm@kvack.org
To: Chris Mason
Cc: Andrew Morton, Linus Torvalds, Jens Axboe, linux-mm, KAMEZAWA Hiroyuki, Andrea Arcangeli
List-ID:

On Sun, Jan 16, 2011 at 09:41:41PM -0500, Chris Mason wrote:
> Excerpts from Andrew Morton's message of 2011-01-16 21:30:00 -0500:
> > (lots of cc's added)
> >
> > On Sun, 16 Jan 2011 21:07:40 -0500 Chris Mason wrote:
> >
> > > Excerpts from Linus Torvalds's message of 2011-01-16 20:53:04 -0500:
> > > > .. except I actually didn't add Andrew to the cc after all.
> > > >
> > > > NOW I did.
> > > >
> > > > Oh, and if you can repeat this and bisect it, it would obviously be
> > > > great. But that sounds rather painful.
> > >
> > > Ok, so I've got 3 different problems in 3 totally different areas.
> > > I'm running w/kvm, but this VM is very stable with 2.6.37. Running
> > > Linus' current git it goes boom in exotic ways, this time it was only on
> > > ext3, btrfs code never loaded.
> > >
> > > Linus, if you're planning on rc1 tonight I'll send my pull request out
> > > the door. Otherwise I'd prefer to fix this and send my pull after
> > > actually getting a long btrfs run on the current code.
> > >
> > > Next up, CONFIG_DEBUG*, always an adventure on rc1 kernels ;)
> > >
> > > WARNING: at lib/list_debug.c:57 list_del+0xc0/0xed()
> > > Hardware name: Bochs
> > > list_del corruption. next->prev should be ffffea000010cde0, but was ffff88007cff6bc8
> > > Modules linked in:
> > > Pid: 524, comm: kswapd0 Not tainted 2.6.37-josef+ #180
> > > Call Trace:
> > > [] ? warn_slowpath_common+0x85/0x9d
> > > [] ? warn_slowpath_fmt+0x46/0x48
> > > [] ? list_del+0xc0/0xed
> > > [] ? migrate_pages+0x26f/0x357
> > > [] ? compaction_alloc+0x0/0x2dc
> > > [] ? compact_zone+0x391/0x5c4
> > > [] ? compact_zone_order+0xc2/0xd1
> > > [] ? _raw_spin_unlock+0xe/0x10
> > > [] ? kswapd+0x5c8/0x88f
> > > [] ? kswapd+0x0/0x88f
> > > [] ? kthread+0x82/0x8a
> > > [] ? kernel_thread_helper+0x4/0x10
> > > [] ? kthread+0x0/0x8a
> > > [] ? kernel_thread_helper+0x0/0x10
> > > ---[ end trace 5c6b7933d16b301f ]---
> >
> > uh-oh. Does disabling CONFIG_COMPACTION make this go away (requires
> > disabling CONFIG_TRANSPARENT_HUGEPAGE first).
>
> We'll see. I gave THP this same run of tests back in November, it
> passed without any problems (after fixing the related btrfs migration
> bug). All of the crashes I've seen this weekend had this in the
> .config:
>

I can't find the rest of the thread on any mailing list and am trying to
reproduce the problem locally. What workload were you running?

-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab