public inbox for linux-kernel@vger.kernel.org
From: Borislav Petkov <bp@suse.de>
To: "Luck, Tony" <tony.luck@intel.com>
Cc: "Wang, Rui Y" <rui.y.wang@intel.com>,
	"Chen, Gong" <gong.chen@intel.com>,
	"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>
Subject: Re: MCE Bug?
Date: Thu, 18 Jun 2015 12:25:20 +0200
Message-ID: <20150618102520.GC1670@pd.tnic>
In-Reply-To: <3908561D78D1C84285E8C5FCA982C28F32A9E177@ORSMSX114.amr.corp.intel.com>

On Wed, Jun 17, 2015 at 11:53:53PM +0000, Luck, Tony wrote:
> > if you want to give those changes a run, I've uploaded them here:
> >
> > git://git.kernel.org/pub/scm/linux/kernel/git/ras/ras.git#tip-ras
> 
> Latest experiments show that sometimes checking keventd_up() before calling schedule_work()
> helps ... but mostly only when I fake some early logs from low numbered cpus.  I added some
> traces to the real case of a left-over fatal error and got this splat:

Hmm, and calling mce_log from __mcheck_cpu_init_generic() as you
suggested yesterday seems to work on this box here:

[    1.588713] smpboot: CPU0: Intel(R) Xeon(R) CPU E5-4620 0 @ 2.20GHz (fam: 06, model: 2d, stepping: 07)
[    1.592727] Performance Events: PEBS fmt1+, 16-deep LBR, SandyBridge events, full-w Broken BIOS detected, complain to your hardware vendor.
[    1.997344] [Firmware Bug]: the BIOS has corrupted hw-PMU resources (MSR 38d is 330)
[    2.000146] Intel PMU driver.
[    2.001376] ... version:                3
[    2.002919] ... bit width:              48
[    2.004626] ... generic registers:      4
[    2.006137] ... value mask:             0000ffffffffffff
[    2.008064] ... max period:             0000ffffffffffff
[    2.010010] ... fixed-purpose events:   3
[    2.011528] ... event mask:             000000070000000f
[    2.017257] x86: Booting SMP configuration:
[    2.019232] .... node  #0, CPUs:          #1
[    2.033848] microcode: CPU1 microcode updated early to revision 0x710, date = 2013-06-17
[    2.038730] mce: [Hardware Error]: Machine check events logged
[    2.050735]    #2
[    2.050735] microcode: CPU2 microcode updated early to revision 0x710, date = 2013-06-17
[    2.056163] mce: [Hardware Error]: Machine check events logged
[    2.068133]    #3
[    2.068140] microcode: CPU3 microcode updated early to revision 0x710, date = 2013-06-17
[    2.07412.324641] microcode: CPU4 microcode updated early to revision 0x710, date = 2013-06-17
[    2.479404]    #5

Stuff gets logged just fine, no splats later.

Hmmm, more staring...

-- 
Regards/Gruss,
    Boris.

ECO tip #101: Trim your mails when you reply.
--
