From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933087AbYDOV1a (ORCPT ); Tue, 15 Apr 2008 17:27:30 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1758907AbYDOV1S (ORCPT ); Tue, 15 Apr 2008 17:27:18 -0400 Received: from relay1.sgi.com ([192.48.171.29]:59495 "EHLO relay.sgi.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1760318AbYDOV1R (ORCPT ); Tue, 15 Apr 2008 17:27:17 -0400 Message-ID: <48051DB1.8080102@sgi.com> Date: Tue, 15 Apr 2008 14:27:13 -0700 From: Mike Travis User-Agent: Thunderbird 2.0.0.6 (X11/20070801) MIME-Version: 1.0 To: Ingo Molnar CC: Christoph Lameter , Linus Torvalds , Pekka Enberg , linux-kernel@vger.kernel.org, Mel Gorman , Nick Piggin , Andrew Morton , "Rafael J. Wysocki" , Yinghai.Lu@sun.com, apw@shadowen.org, KAMEZAWA Hiroyuki Subject: Re: [bug] SLUB + mm/slab.c boot crash in -rc9 References: <20080415195430.GA23015@elte.hu> <20080415201734.GA25628@elte.hu> <20080415202805.GA26880@elte.hu> <20080415203405.GA27958@elte.hu> <20080415204208.GA30432@elte.hu> <20080415205820.GC31645@elte.hu> <20080415211953.GA4243@elte.hu> In-Reply-To: <20080415211953.GA4243@elte.hu> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Ingo Molnar wrote: > * Christoph Lameter wrote: > >> On Tue, 15 Apr 2008, Ingo Molnar wrote: >> >>> and the bug pattern seems to be memory corruption - not memory >>> exhaustion. >> SLUB does not do a memory allocation where it fails here but simply >> accesses per cpu information that is expected to be already zeroed. >> >>> i.e. we allocated RAM but it got corrupted after allocation. >> In some situations we are screwing up the per cpu data handling on 32 >> bit x86? Adding Mike. This looks like the per cpu area overlaps with >> something else? > > yep, that was my other theory - and i doubled CONFIG_NR_CPUS to reduce > that chance. > > in hindsight ... that wont save us from any overlap, right? > > what's the best way to artificially increase the size of the allocated > per cpu area? (say double it) > > Ingo I don't know that there is a boot option. If modules are defined it adds an extra 8k. The size is defined in include/linux/percpu.h (PERCPU_ENOUGH_ROOM). Otherwise define a really large per_cpu variable...? -Mike