linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Don Zickus <dzickus@redhat.com>
To: Cyrill Gorcunov <gorcunov@gmail.com>
Cc: Ingo Molnar <mingo@elte.hu>,
	Robert Richter <robert.richter@amd.com>,
	Peter Zijlstra <peterz@infradead.org>,
	Lin Ming <ming.m.lin@intel.com>,
	"fweisbec@gmail.com" <fweisbec@gmail.com>,
	"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
	"Huang, Ying" <ying.huang@intel.com>,
	Yinghai Lu <yinghai@kernel.org>, Andi Kleen <andi@firstfloor.org>
Subject: Re: [PATCH -v3] perf, x86: try to handle unknown nmis with running perfctrs
Date: Wed, 25 Aug 2010 17:20:37 -0400	[thread overview]
Message-ID: <20100825212037.GI4879@redhat.com> (raw)
In-Reply-To: <20100825202458.GE14874@lenovo>

On Thu, Aug 26, 2010 at 12:24:58AM +0400, Cyrill Gorcunov wrote:
> On Wed, Aug 25, 2010 at 04:11:06PM -0400, Don Zickus wrote:
> ...
> > >  Uhhuh. NMI received for unknown reason 00 on CPU 15.
> > >  Do you have a strange power saving mode enabled?
> > >  Dazed and confused, but trying to continue
> > 
> > So I found a Nehalem box that can reliably reproduce Ingo's problem using
> > something as simple 'perf top'.  But like above, I am noticing the
> > samething, an extra NMI(PMI??) that comes out of nowhere.
> > 
> > Looking at the data above the delta between nmis is very small compared to
> > the other nmis.  It almost suggests that this is an extra PMI.
> > Considering there is already two cpu errata discussing extra PMIs under
> > certain configurations, I wouldn't be surprised if this was a third.
> > 
> > Cheers,
> > Don
> > 
> 
> Oh. I'm not sure if it would be a good idea at all but maybe we could
> use kind of Robert's idea about "pmu nmi relaxing time" ie some time
> slice in which we treat nmi's as being from pmu, but not arbitrary number
> but equal to the number of PMI turned off. Say we handle NMI and found
> that 4 events are overflowed, we clear them, arm timer and wait for
> 3 unknow nmis to happen, if they are not happening during some time
> period we clear this waitqueue, if they happen or partially happen
> - we destroy the timer. Ie almost the same as Robert's idea but
> without tsc? Just a thought.

The only problem is only one counter is overflowing in these cases, so we
would have to do it all the time, which may not be hard.  But I was
thinking of something similar.

For now, I am trying to force counter0 off, seeing that most of the perf
errata on nehalem have been on counter0.  Or maybe I can get 'perf top' to
use something other than counter0 by running 'perf record' first?

Cheers,
Don

> 
> 	-- Cyrill

  reply	other threads:[~2010-08-25 21:21 UTC|newest]

Thread overview: 70+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-08-20 15:05 [PATCH -v3] perf, x86: try to handle unknown nmis with running perfctrs Don Zickus
2010-08-20 15:25 ` Ingo Molnar
2010-08-23  8:53   ` Ingo Molnar
2010-08-24 16:22     ` Cyrill Gorcunov
2010-08-24 17:09       ` Robert Richter
2010-08-24 17:20         ` Cyrill Gorcunov
2010-08-24 17:21           ` Cyrill Gorcunov
2010-08-24 17:15     ` Robert Richter
2010-08-24 17:28       ` Cyrill Gorcunov
2010-08-24 18:46         ` Don Zickus
2010-08-24 18:54           ` Cyrill Gorcunov
2010-08-24 19:52             ` Cyrill Gorcunov
2010-08-24 20:27               ` Don Zickus
2010-08-24 20:40                 ` Cyrill Gorcunov
2010-08-25 23:52                   ` Frederic Weisbecker
2010-08-26  9:11                     ` Cyrill Gorcunov
2010-08-25 10:20                 ` Robert Richter
2010-08-26 21:14     ` Don Zickus
2010-08-27  7:51       ` Robert Richter
2010-08-27 13:39         ` Don Zickus
2010-08-27  8:10       ` Robert Richter
2010-08-27 13:44         ` Don Zickus
2010-08-27 14:05           ` Robert Richter
2010-08-27 15:05             ` Don Zickus
2010-08-27 15:48               ` Robert Richter
2010-08-27 18:57         ` Don Zickus
2010-08-27 19:00           ` Yinghai Lu
2010-08-27 19:33           ` Robert Richter
2010-08-25  9:48   ` Robert Richter
2010-08-25 10:41     ` Ingo Molnar
2010-08-25 11:00       ` Ingo Molnar
2010-08-25 20:11         ` Don Zickus
2010-08-25 20:24           ` Cyrill Gorcunov
2010-08-25 21:20             ` Don Zickus [this message]
2010-08-25 21:36               ` Cyrill Gorcunov
2010-08-26  9:00           ` Robert Richter
2010-08-26  9:18             ` Cyrill Gorcunov
2010-08-26 14:31               ` Don Zickus
2010-08-26 15:22               ` Don Zickus
2010-08-26 15:34                 ` Cyrill Gorcunov
2010-08-26 16:40                   ` Don Zickus
2010-08-26 18:02                     ` Cyrill Gorcunov
2010-08-27  7:57                       ` Robert Richter
2010-08-27  8:11                         ` Peter Zijlstra
2010-08-27  8:31                           ` Robert Richter
2010-08-25 11:02       ` Robert Richter
2010-08-25 11:19         ` Ingo Molnar
2010-08-20 23:31 ` Don Zickus
  -- strict thread matches above, loose matches on Subject: below --
2010-08-04 15:18 A question of perf NMI handler Cyrill Gorcunov
2010-08-04 15:50 ` Don Zickus
2010-08-04 16:10   ` Cyrill Gorcunov
2010-08-04 16:20     ` Don Zickus
2010-08-04 16:39       ` Cyrill Gorcunov
2010-08-04 18:48         ` Robert Richter
2010-08-04 19:26           ` Cyrill Gorcunov
2010-08-06  6:52             ` Robert Richter
2010-08-06 14:21               ` Don Zickus
2010-08-09 19:48                 ` [PATCH] perf, x86: try to handle unknown nmis with running perfctrs Robert Richter
2010-08-17 15:22                   ` [PATCH -v3] " Robert Richter
2010-08-17 16:17                     ` Cyrill Gorcunov
2010-08-19 10:45                     ` Peter Zijlstra
2010-08-19 12:39                       ` Robert Richter
2010-08-19 14:12                       ` Don Zickus
2010-08-19 14:27                         ` Peter Zijlstra
2010-08-19 15:20                           ` Don Zickus
2010-08-19 17:43                           ` Cyrill Gorcunov
2010-08-19 17:53                             ` Peter Zijlstra
2010-08-19 21:58                           ` Don Zickus
2010-08-20  8:50                             ` Peter Zijlstra
2010-08-20  1:50                           ` Don Zickus
2010-08-20  8:16                             ` Ingo Molnar
2010-08-20 10:04                               ` Peter Zijlstra
2010-08-20 10:30                                 ` Cyrill Gorcunov
2010-08-20 12:39                                 ` Don Zickus
2010-08-20 13:27                                   ` Ingo Molnar
2010-08-20 13:51                                     ` Don Zickus
2010-08-20 14:17                                       ` Ingo Molnar
2010-08-20 20:45                                         ` Cyrill Gorcunov
2010-08-24 21:48                                         ` Don Zickus
2010-08-20  8:36                             ` Robert Richter

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20100825212037.GI4879@redhat.com \
    --to=dzickus@redhat.com \
    --cc=andi@firstfloor.org \
    --cc=fweisbec@gmail.com \
    --cc=gorcunov@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=ming.m.lin@intel.com \
    --cc=mingo@elte.hu \
    --cc=peterz@infradead.org \
    --cc=robert.richter@amd.com \
    --cc=ying.huang@intel.com \
    --cc=yinghai@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).