From: Ingo Molnar <mingo@elte.hu>
To: Aristeu Rozanski <aris@redhat.com>
Cc: linux-kernel@vger.kernel.org, dzickus@redhat.com,
	prarit@redhat.com, vgoyal@redhat.com
Subject: Re: [PATCH] NMI watchdog: when booting with reset_devices, clear the performance counters
Date: Mon, 22 Sep 2008 19:59:24 +0200
Message-ID: <20080922175924.GH10535@elte.hu>
In-Reply-To: <20080922175803.GK16840@redhat.com>


* Aristeu Rozanski <aris@redhat.com> wrote:

> > > P4s have a quirk that makes it necessary to clear the P4_CCCR_OVF bit in the CCCR
> > > every time the PMI is triggered. When booting the kernel with reset_devices
> > > (more specifically, the kdump case), the counters reach zero and the PMI will be
> > > generated. This is not a problem on other processors but on P4s, it'll
> > > continue to generate NMIs until that bit is cleared. Since there may be
> > > other users of the performance counters, clear and disable all of them
> > > when booting with reset_devices option.
> > > 
> > > We have a P4 box here that crashes because of this problem. Since the kdump
> > > kernel usually boots with only one processor active, the second logical
> > > unit won't be set up; therefore, MSR_P4_IQ_CCCR1 (and other performance
> > > counter registers) won't be cleared and P4_CCCR_OVF may still be set because
> > > the previous kernel was using this register. An NMI is triggered because of
> > > MSR_P4_IQ_CCCR1 right after NMI delivery is enabled, triggering the
> > > race fixed in my previous email.
> > > 
> > > Signed-off-by: Aristeu Rozanski <aris@redhat.com>
> > > Acked-by: Don Zickus <dzickus@redhat.com>
> > > Acked-by: Prarit Bhargava <prarit@redhat.com>
> > > Acked-by: Vivek Goyal <vgoyal@redhat.com>
> > 
> > applied to tip/x86/nmi-watchdog, thanks.
> > 
> > i'm wondering, is this fix a v2.6.27 candidate?
> I believe so. Even though the affected group is small (P4-based boxes using
> the NMI watchdog and kdump), the problem is fully reproducible.

is there a bugzilla.kernel.org entry for it? If not, could you please 
file it? Thanks,

	Ingo
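
For readers following the quoted description, here is a minimal user-space sketch of the idea: when reset_devices is given, walk every counter/CCCR pair, zero the counter, and clear the CCCR so that neither the enable bit nor a stale overflow flag survives from the previous kernel. This is not the patch itself. The struct, the sim_* names, the counter count, and the bit positions are all assumptions made for illustration, and the registers are modelled as a plain array rather than accessed through rdmsr/wrmsr.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed bit positions; verify against the kernel tree before relying on them. */
#define SIM_CCCR_ENABLE (1u << 12)
#define SIM_CCCR_OVF    (1u << 31)

/* Assumed number of counter/CCCR pairs on a P4. */
#define SIM_NR_COUNTERS 18

/* Hypothetical stand-in for the performance-counter MSRs of one CPU package. */
struct sim_p4_pmu {
	uint64_t perfctr[SIM_NR_COUNTERS];
	uint32_t cccr[SIM_NR_COUNTERS];
};

/* True if any CCCR still carries the overflow flag, i.e. would keep raising PMIs. */
bool sim_pmi_pending(const struct sim_p4_pmu *pmu)
{
	assert(pmu != NULL);
	for (size_t i = 0; i < SIM_NR_COUNTERS; i++) {
		if (pmu->cccr[i] & SIM_CCCR_OVF)
			return true;
	}
	return false;
}

/*
 * Clear and disable every counter, but only when reset_devices was given.
 * Returns how many CCCRs had a stale overflow flag before clearing.
 */
int sim_clear_stale_counters(struct sim_p4_pmu *pmu, bool reset_devices)
{
	int stale = 0;

	assert(pmu != NULL);
	if (!reset_devices)
		return 0;

	for (size_t i = 0; i < SIM_NR_COUNTERS; i++) {
		if (pmu->cccr[i] & SIM_CCCR_OVF)
			stale++;
		pmu->perfctr[i] = 0; /* drop the leftover count */
		pmu->cccr[i] = 0;    /* clears both the enable and overflow bits */
	}
	return stale;
}
```

The point of touching all pairs, rather than only the one the watchdog programs, is the case described above: a register owned by the second logical unit is never set up by a single-CPU kdump kernel, so nothing else would clear its overflow flag.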
