From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail144.messagelabs.com (mail144.messagelabs.com [216.82.254.51]) by kanga.kvack.org (Postfix) with ESMTP id 751F96B004F for ; Mon, 15 Jun 2009 12:10:37 -0400 (EDT) Date: Mon, 15 Jun 2009 18:19:04 +0200 From: Andi Kleen Subject: Re: [PATCH 00/22] HWPOISON: Intro (v5) Message-ID: <20090615161904.GH31969@one.firstfloor.org> References: <20090615024520.786814520@intel.com> <4A35BD7A.9070208@linux.vnet.ibm.com> <20090615042753.GA20788@localhost> <20090615140019.4e405d37@lxorguk.ukuu.org.uk> <20090615132934.GE31969@one.firstfloor.org> <20090615154832.73c89733@lxorguk.ukuu.org.uk> <20090615152427.GF31969@one.firstfloor.org> <20090615162804.4cb75b30@lxorguk.ukuu.org.uk> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20090615162804.4cb75b30@lxorguk.ukuu.org.uk> Sender: owner-linux-mm@kvack.org To: Alan Cox Cc: Andi Kleen , Hugh Dickins , Wu Fengguang , Balbir Singh , Andrew Morton , LKML , Ingo Molnar , Mel Gorman , Thomas Gleixner , "H. Peter Anvin" , Peter Zijlstra , Nick Piggin , "riel@redhat.com" , "chris.mason@oracle.com" , "linux-mm@kvack.org" List-ID: On Mon, Jun 15, 2009 at 04:28:04PM +0100, Alan "zSeries" Cox wrote: > curse a lot > suspend to disk > remove dirt from fans, clean/replace RAM > resume from disk > > The very act of making the ECC error not take out the box creates the Ok so at least you agree now that handling these errors without panic is the right thing to do. That's at least some progress. > environment whereby the underlying hardware error (if there was one) can > be cured. These ECC errors are still somewhat rare (or rather if they become common you should definitely service the system). That is why losing a single page of memory for them isn't a big issue normally. Sure you could spend effort making unpoisioning work, but it would seem very dubious to me. After all it's just another 4K of memory for each error. The only reasonably good use case I heard for unpoisoning was if you have a lot of huge pages (you can't use a full huge page with one bad small page), but that's also still relatively exotic. -Andi [1] mostly you need a new special form of RCU I think -- ak@linux.intel.com -- Speaking for myself only. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org