From mboxrd@z Thu Jan 1 00:00:00 1970
Message-ID: <4E518FE9.8040502@gmail.com>
Date: Sun, 21 Aug 2011 16:08:25 -0700
From: "Justin P. Mattock"
To: Andi Kleen
CC: "linux-kernel@vger.kernel.org", tony.luck@intel.com
Subject: Re: using mce_inject I get: RIP 10: {ttm_bo_unref+0xf/0x45 [ttm]}
References: <4E506DEA.1070601@gmail.com> <20110821221602.GK25996@one.firstfloor.org>
In-Reply-To: <20110821221602.GK25996@one.firstfloor.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On 08/21/2011 03:16 PM, Andi Kleen wrote:
> On Sat, Aug 20, 2011 at 07:31:06PM -0700, Justin P. Mattock wrote:
>> not sure if I am running mce_test correctly, but during its routine of
>> testing things I do get a pause with everything, then the below shows up
>> in dmesg..:
>
> The message is expected, but there should be no noticeable
> pause.

Well, after running more of these tests I am still getting a noticeable
pause. It lasts for about 2-3 seconds, then everything goes back to normal.
(All of these occur whenever the tests do a timeout test.)

>
> -Andi
>
>>
>> http://fpaste.org/kMRd/
>>
>>
>> [ 1810.670434] Triggering MCE exception on CPU 1
>> [ 1810.670462] [Hardware Error]: CPU 1: Machine Check Exception: 6 Bank 4: b300000000000000
>> [ 1810.670467] [Hardware Error]: RIP 73:<0000000012343434>
>> [ 1810.670470] [Hardware Error]: TSC 38d1002c216
>> [ 1810.670474] [Hardware Error]: PROCESSOR 0:6f6 TIME 1313892803 SOCKET 0 APIC 1
>> [ 1810.670477] [Hardware Error]: Run the above through 'mcelog --ascii'
>> [ 1810.670481] [Hardware Error]: Machine check: Processor context corrupt
>> [ 1810.670483] [Hardware Error]: Fake kernel panic: Fatal Machine check
>> [ 1810.670495] MCE exception done on CPU 1
>> [ 1819.064721] Triggering MCE exception on CPU 1
>>
>> seems light of a pause, then everything resumes properly(music, etc..).
>> Is this something that needs attention, or are these tests as extreme as
>> can be, and should simply be ignored?
>> (Note: if there is a mce list somewhere let me know so I direct this to
>> the proper people)
>>
>> Justin P. Mattock
>>
>
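
For reference, the bank status value in the quoted log (b300000000000000) can be decoded by hand. The following is only a minimal sketch, assuming the standard x86 IA32_MCi_STATUS flag layout (bit 63 VAL, 62 OVER, 61 UC, 60 EN, 59 MISCV, 58 ADDRV, 57 PCC). The helper name decode_mci_status is hypothetical and is not part of mcelog or mce-inject.

```python
# Flag bits in the upper part of an x86 IA32_MCi_STATUS value.
# ASSUMPTION: standard architectural layout; model-specific bits are ignored.
MCI_STATUS_FLAGS = {
    "VAL": 63,    # register contains valid error information
    "OVER": 62,   # an earlier error was overwritten
    "UC": 61,     # uncorrected error
    "EN": 60,     # error reporting was enabled
    "MISCV": 59,  # MISC register holds valid data
    "ADDRV": 58,  # ADDR register holds valid data
    "PCC": 57,    # processor context corrupt
}


def decode_mci_status(status: int) -> dict:
    """Return a mapping of flag name -> bool for a 64-bit status value.

    Hypothetical helper for illustration only.
    """
    return {name: bool((status >> bit) & 1) for name, bit in MCI_STATUS_FLAGS.items()}


# Status value taken from the "Bank 4" line in the quoted dmesg output.
flags = decode_mci_status(0xB300000000000000)
for name, is_set in flags.items():
    print(f"{name:5s} {int(is_set)}")
```

Under that assumed layout, the PCC bit comes out set for this value, which agrees with the "Processor context corrupt" line in the log.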