From: Aristeu Rozanski <aris@redhat.com>
To: Ingo Molnar <mingo@elte.hu>
Cc: linux-kernel@vger.kernel.org, dzickus@redhat.com,
	prarit@redhat.com, vgoyal@redhat.com
Subject: Re: [PATCH] NMI watchdog: when booting with reset_devices, clear the performance counters
Date: Mon, 22 Sep 2008 13:58:09 -0400
Message-ID: <20080922175803.GK16840@redhat.com>
In-Reply-To: <20080922174748.GC10535@elte.hu>

> > P4s have a quirk that makes it necessary to clear the P4_CCCR_OVF bit in
> > the CCCR every time the PMI is triggered. When booting the kernel with
> > reset_devices (more specifically, the kdump case), the counters reach zero
> > and the PMI is generated. This is not a problem on other processors, but
> > on P4s it will continue to generate NMIs until that bit is cleared. Since
> > there may be other users of the performance counters, clear and disable
> > all of them when booting with the reset_devices option.
> > 
> > We have a P4 box here that crashes because of this problem. Since the kdump
> > kernel usually boots with only one processor active, the second logical
> > unit won't be set up; therefore MSR_P4_IQ_CCCR1 (and other performance
> > counter registers) won't be cleared, and P4_CCCR_OVF may still be set because
> > the previous kernel was using this register. An NMI is triggered because of
> > MSR_P4_IQ_CCCR1 right after NMI delivery is enabled, triggering the
> > race fixed in my previous email.
> > 
> > Signed-off-by: Aristeu Rozanski <aris@redhat.com>
> > Acked-by: Don Zickus <dzickus@redhat.com>
> > Acked-by: Prarit Bhargava <prarit@redhat.com>
> > Acked-by: Vivek Goyal <vgoyal@redhat.com>
> 
> applied to tip/x86/nmi-watchdog, thanks.
> 
> i'm wondering, is this fix a v2.6.27 candidate?
I believe so. Even though the affected group is small (P4-based boxes using
the NMI watchdog and kdump), the problem is fully reproducible.
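
For readers without the patch at hand, here is a rough user-space sketch of
the idea described in the quoted text. It is not the applied patch. The
register array, NR_FAKE_CCCR, reset_devices_flag and the function names are
hypothetical stand-ins (real code would go through rdmsr/wrmsr), and the bit
positions are assumptions based on my recollection of the P4 watchdog code.

```c
#include <stddef.h>
#include <stdint.h>

/* Assumed bit positions; check against the kernel source before relying on them. */
#define P4_CCCR_ENABLE (1u << 12)
#define P4_CCCR_OVF    (1u << 31)

/* Hypothetical: a tiny fake register file standing in for the P4 CCCR MSRs. */
#define NR_FAKE_CCCR 4
static uint32_t fake_cccr[NR_FAKE_CCCR];

/* Hypothetical stand-in for the kernel's reset_devices boot flag. */
static int reset_devices_flag;

/* Disable one counter and clear its sticky overflow bit. */
static void cccr_quiesce(size_t idx)
{
	fake_cccr[idx] &= ~(P4_CCCR_ENABLE | P4_CCCR_OVF);
}

/*
 * When booted with reset_devices (e.g. a kdump kernel), quiesce every
 * counter, including ones this kernel never set up itself.
 */
static void clear_counters_on_reset(void)
{
	size_t i;

	if (!reset_devices_flag)
		return;
	for (i = 0; i < NR_FAKE_CCCR; i++)
		cccr_quiesce(i);
}

/* Returns 1 if any counter still has OVF set, i.e. would keep raising NMIs. */
static int overflow_pending(void)
{
	size_t i;

	for (i = 0; i < NR_FAKE_CCCR; i++)
		if (fake_cccr[i] & P4_CCCR_OVF)
			return 1;
	return 0;
}
```

The point of looping over all registers rather than only the one the
watchdog uses is the case from the description: a stale OVF bit left by the
previous kernel in a register the kdump kernel never touches.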

-- 
Aristeu
