From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932471AbbCRByB (ORCPT ); Tue, 17 Mar 2015 21:54:01 -0400 Received: from mga02.intel.com ([134.134.136.20]:30301 "EHLO mga02.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752741AbbCRBx6 (ORCPT ); Tue, 17 Mar 2015 21:53:58 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.11,419,1422950400"; d="scan'208";a="700200993" Message-ID: <1426643634.5570.14.camel@intel.com> Subject: Re: [LKP] [mm] cc87317726f: WARNING: CPU: 0 PID: 1 at drivers/iommu/io-pgtable-arm.c:413 __arm_lpae_unmap+0x341/0x380() From: Huang Ying To: Johannes Weiner Cc: Linus Torvalds , Michal Hocko , Tetsuo Handa , David Rientjes , Andrew Morton , Dave Chinner , LKML , LKP ML , linux-mm Date: Wed, 18 Mar 2015 09:53:54 +0800 In-Reply-To: <20150317192413.GA7772@phnom.home.cmpxchg.org> References: <1426227621.6711.238.camel@intel.com> <20150317192413.GA7772@phnom.home.cmpxchg.org> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.12.9-1+b1 Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 2015-03-17 at 15:24 -0400, Johannes Weiner wrote: > On Tue, Mar 17, 2015 at 10:15:29AM -0700, Linus Torvalds wrote: > > Explicitly adding the emails of other people involved with that commit > > and the original oom thread to make sure people are aware, since this > > didn't get any response. > > > > Commit cc87317726f8 fixed some behavior, but also seems to have turned > > an oom situation into a complete hang. So presumably we shouldn't loop > > *forever*. Hmm? > > It seems we are between a rock and a hard place here, as we reverted > specifically to that endless looping on request of filesystem people. > They said[1] they rely on these allocations never returning NULL, or > they might fail inside a transactions and corrupt on-disk data. > > Huang, against which kernels did you first run this test on this exact > setup? Is there a chance you could try to run a kernel without/before > 9879de7373fc? I want to make sure I'm not missing something, but all > versions preceding this commit should also have the same hang. There > should only be a tiny window between 9879de7373fc and cc87317726f8 -- > v3.19 -- where these allocations are allowed to fail. I checked the test result of v3.19-rc6. It shows that boot will hang at the same position. BTW: the test is run on 32 bit system. Best Regards, Huang, Ying