public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* MCE hardware error, but no message
@ 2011-03-12 20:49 Jan Engelhardt
  2011-03-15  1:27 ` Hidetoshi Seto
  2011-03-16 18:31 ` Andi Kleen
  0 siblings, 2 replies; 3+ messages in thread
From: Jan Engelhardt @ 2011-03-12 20:49 UTC (permalink / raw)
  To: Linux Kernel Mailing List; +Cc: x86



Running Linux 2.6.37, I am getting these errors on one of a box:

[696782.810387] [Hardware Error]: No human readable MCE decoding support 
on this CPU type.
[696782.810470] [Hardware Error]: Run the message through 'mcelog 
--ascii' to decode.
[696783.585853] [Hardware Error]: No human readable MCE decoding support 
on this CPU type.
[696783.585937] [Hardware Error]: Run the message through 'mcelog 
--ascii' to decode.

Except that it never tells me the actual non-human readable form.
The error starts to show after 6-48 hours after a reboot (including 
warm reboots). A second machine of the exact same configuration shows no 
problems over the past 30 days. Environmental sensors of the problem box 
show normal parameters.

How would I get the messages to run through mcelog?

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: MCE hardware error, but no message
  2011-03-12 20:49 MCE hardware error, but no message Jan Engelhardt
@ 2011-03-15  1:27 ` Hidetoshi Seto
  2011-03-16 18:31 ` Andi Kleen
  1 sibling, 0 replies; 3+ messages in thread
From: Hidetoshi Seto @ 2011-03-15  1:27 UTC (permalink / raw)
  To: Jan Engelhardt; +Cc: Linux Kernel Mailing List, x86

(2011/03/13 5:49), Jan Engelhardt wrote:
> 
> 
> Running Linux 2.6.37, I am getting these errors on one of a box:
> 
> [696782.810387] [Hardware Error]: No human readable MCE decoding support 
> on this CPU type.
> [696782.810470] [Hardware Error]: Run the message through 'mcelog 
> --ascii' to decode.
> [696783.585853] [Hardware Error]: No human readable MCE decoding support 
> on this CPU type.
> [696783.585937] [Hardware Error]: Run the message through 'mcelog 
> --ascii' to decode.
> 
> Except that it never tells me the actual non-human readable form.
> The error starts to show after 6-48 hours after a reboot (including 
> warm reboots). A second machine of the exact same configuration shows no 
> problems over the past 30 days. Environmental sensors of the problem box 
> show normal parameters.
> 
> How would I get the messages to run through mcelog?

It looks like a kind of corrected error.

Let's try the latest mcelog:
  git://git.kernel.org/pub/scm/utils/cpu/mce/mcelog.git


Thanks,
H.Seto


^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: MCE hardware error, but no message
  2011-03-12 20:49 MCE hardware error, but no message Jan Engelhardt
  2011-03-15  1:27 ` Hidetoshi Seto
@ 2011-03-16 18:31 ` Andi Kleen
  1 sibling, 0 replies; 3+ messages in thread
From: Andi Kleen @ 2011-03-16 18:31 UTC (permalink / raw)
  To: Jan Engelhardt; +Cc: Linux Kernel Mailing List, x86

Jan Engelhardt <jengelh@medozas.de> writes:

> Running Linux 2.6.37, I am getting these errors on one of a box:
>
> [696782.810387] [Hardware Error]: No human readable MCE decoding support 
> on this CPU type.
> [696782.810470] [Hardware Error]: Run the message through 'mcelog 
> --ascii' to decode.
> [696783.585853] [Hardware Error]: No human readable MCE decoding support 
> on this CPU type.
> [696783.585937] [Hardware Error]: Run the message through 'mcelog 
> --ascii' to decode.
>
> Except that it never tells me the actual non-human readable form.

mcelog logs them. The kernel shouldn't be spewing these messages
at all, especially not for corrected errors (this is a still 
unfixed regression for Intel CPUs)

Here's an older fix:

http://git.kernel.org/?p=linux/kernel/git/ak/linux-mce-2.6.git;a=commit;h=6e3c7411d2b86bff210c59caa432e8e862037bfd

> How would I get the messages to run through mcelog?

They are already logged, no need to do anything further.

-Andi

-- 
ak@linux.intel.com -- Speaking for myself only

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2011-03-16 18:32 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-03-12 20:49 MCE hardware error, but no message Jan Engelhardt
2011-03-15  1:27 ` Hidetoshi Seto
2011-03-16 18:31 ` Andi Kleen

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox