From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-pb0-f48.google.com (mail-pb0-f48.google.com [209.85.160.48]) by kanga.kvack.org (Postfix) with ESMTP id 780E66B008A for ; Tue, 11 Mar 2014 08:56:00 -0400 (EDT) Received: by mail-pb0-f48.google.com with SMTP id md12so8728061pbc.7 for ; Tue, 11 Mar 2014 05:56:00 -0700 (PDT) Received: from userp1040.oracle.com (userp1040.oracle.com. [156.151.31.81]) by mx.google.com with ESMTPS id xe9si20157707pab.25.2014.03.11.05.55.59 for (version=TLSv1 cipher=RC4-SHA bits=128/128); Tue, 11 Mar 2014 05:55:59 -0700 (PDT) Message-ID: <531F07D4.5000108@oracle.com> Date: Tue, 11 Mar 2014 08:55:48 -0400 From: Sasha Levin MIME-Version: 1.0 Subject: Re: bad rss-counter message in 3.14rc5 References: <20140305174503.GA16335@redhat.com> <20140305175725.GB16335@redhat.com> <20140307002210.GA26603@redhat.com> <20140311024906.GA9191@redhat.com> <20140310201340.81994295.akpm@linux-foundation.org> <20140310214612.3b4de36a.akpm@linux-foundation.org> <20140311045109.GB12551@redhat.com> <20140310220158.7e8b7f2a.akpm@linux-foundation.org> <20140311053017.GB14329@redhat.com> In-Reply-To: <20140311053017.GB14329@redhat.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: owner-linux-mm@kvack.org List-ID: To: Dave Jones , Andrew Morton , Linux Kernel , linux-mm@kvack.org, Linus Torvalds , Cyrill Gorcunov , Joonsoo Kim , Bob Liu , Konstantin Khlebnikov On 03/11/2014 01:30 AM, Dave Jones wrote: > On Mon, Mar 10, 2014 at 10:01:58PM -0700, Andrew Morton wrote: > > On Tue, 11 Mar 2014 00:51:09 -0400 Dave Jones wrote: > > > > > On Mon, Mar 10, 2014 at 09:46:12PM -0700, Andrew Morton wrote: > > > > On Mon, 10 Mar 2014 20:13:40 -0700 Andrew Morton wrote: > > > > > > > > > > Anyone ? I'm hitting this trace on an almost daily basis, which is a pain > > > > > > while trying to reproduce a different bug.. > > > > > > > > > > Damn, I thought we'd fixed that but it seems not. Cc's added. > > > > > > > > > > Guys, what stops the migration target page from coming unlocked in > > > > > parallel with zap_pte_range()'s call to migration_entry_to_page()? > > > > > > > > page_table_lock, sort-of. At least, transitions of is_migration_entry() > > > > and page_locked() happen under ptl. > > > > > > > > I don't see any holes in regular migration. Do you know if this is > > > > reproducible with CONFIG_NUMA_BALANCING=n or CONFIG_NUMA=n? > > > > > > CONFIG_NUMA_BALANCING was n already btw, so I'll do a NUMA=n run. > > > > There probably isn't much point unless trinity is using > > sys_move_pages(). Is it? If so it would be interesting to disable > > trinity's move_pages calls and see if it still fails. > > Ok, with move_pages excluded it still oopses. FWIW, yes - I still see both of these issues happening. It's easy to ignore the bad rss-counter, and I've commented out the BUG at swapops.h so that I could keep on testing. There are quite a few issues within mm/ right now, I think there are more than 5 different BUG()s hittable using trinity at this point without a fix. Thanks, Sasha -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org