From mboxrd@z Thu Jan 1 00:00:00 1970 From: Tony Luck Subject: Re: Hardware Error Kernel Mini-Summit Date: Mon, 24 May 2010 10:35:21 -0700 Message-ID: References: <4BF2392A.9040409@jp.fujitsu.com> <1274204560.17703.82.camel@Joe-Laptop.home> <20100518185305.GA23921@elte.hu> <987664A83D2D224EAE907B061CE93D53C61D1C57@orsmsx505.amr.corp.intel.com> <20100518191802.GG25224@aftab> <20100518222832.GJ22675@basil.fritz.box> <20100524155506.GA7145@sgi.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: QUOTED-PRINTABLE Return-path: In-Reply-To: <20100524155506.GA7145@sgi.com> Sender: linux-kernel-owner@vger.kernel.org To: Russ Anderson Cc: "Eric W. Biederman" , Andi Kleen , Borislav Petkov , Hidetoshi Seto , Mauro Carvalho Chehab , "Young, Brent" , Linux Kernel Mailing List , Ingo Molnar , Thomas Gleixner , Matt Domsch , Doug Thompson , Joe Perches , Ingo Molnar , "bluesmoke-devel@lists.sourceforge.net" , Linux Edac Mailing List List-Id: edac.vger.kernel.org On Mon, May 24, 2010 at 8:55 AM, Russ Anderson wrote: > ia64 had the Intel defined MCA Spec which defined the interaction > between SAL and the kernel. =A0x86 does not have a similar well > defined way of how errors should be handled. =A0It would be > good to agree on how the errors should be handled. X86 has machine check registers defined by the SDM. It also has some f/w <-> OS interactions defined by the APEI sections in the latest ACPI spec (chapter 17 of the 4.0a spec released last month - see http://acpi.info). Some parts look cleaner than the ia64 SAL spec. E.g. errors logged from before the current OS booted are presented in the Boot Error Record Table instead of just appearing among the stream of errors that SAL_GET_ERROR provides to the OS without any way to distinguish current errors from old ones. -Tony