All of lore.kernel.org
 help / color / mirror / Atom feed
From: Don Zickus <dzickus@redhat.com>
To: Ingo Molnar <mingo@elte.hu>
Cc: fweisbec@gmail.com, Peter Zijlstra <peterz@infradead.org>,
	LKML <linux-kernel@vger.kernel.org>,
	akpm@linux-foundation.org, sergey.senozhatsky@gmail.com
Subject: Re: [PATCH v3] watchdog:  touch_nmi_watchdog should only touch local cpu not every one
Date: Wed, 10 Nov 2010 14:03:52 -0500	[thread overview]
Message-ID: <20101110190352.GA31794@redhat.com> (raw)
In-Reply-To: <20101110180711.GB22410@elte.hu>

On Wed, Nov 10, 2010 at 07:07:11PM +0100, Ingo Molnar wrote:
> 
> * Don Zickus <dzickus@redhat.com> wrote:
> 
> > > Hm, the flip side is that if a CPU is stuck spewing backtraces, we will now make 
> > > all the other CPUs a lot more noisy - which might only 'lock up' because this 
> > > CPU is stuck spewing oopses, right?
> > 
> > When you say the other CPUs will be a lot more noisy, is that because they are 
> > busy processing backtraces for the first cpu to spew?  I guess I don't understand 
> > how the other CPUs could have their interrupts off the whole time while the first 
> > cpu is spewing a backtrace (just trying to educate myself).
> 
> Say the kernel crashes on a CPU and keeps spewing new oopses, while write-holding 
> tasklist_lock.
> 
> Any other CPU that delivers a signal from IRQ context, trying to take the 
> tasklist_lock, will loop indefinitely until that crashing CPU releases the lock.
> 
> In that case the 'secondary' NMI warnings from all other CPUs (eventually every CPU 
> gets stuck in such a scenario) will start spewing NMI lockup messages.
> 
> Dunno. Maybe we should do your change - but also have an option to 'shut up' the 
> kernel after the first hard oops [not warning]. That would silence the secondary NMI 
> watchdog messages as well.

You mean inside the panic() routine? like a ratelimit?

Cheers,
Don

  reply	other threads:[~2010-11-10 19:04 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-11-08 18:28 [PATCH v3] watchdog: touch_nmi_watchdog should only touch local cpu not every one Don Zickus
2010-11-10  7:49 ` Ingo Molnar
2010-11-10 15:08   ` Andrew Morton
2010-11-10 16:05   ` Don Zickus
2010-11-10 18:07     ` Ingo Molnar
2010-11-10 19:03       ` Don Zickus [this message]
2010-11-10 19:14         ` Ingo Molnar
2010-11-15 18:23           ` Don Zickus

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20101110190352.GA31794@redhat.com \
    --to=dzickus@redhat.com \
    --cc=akpm@linux-foundation.org \
    --cc=fweisbec@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@elte.hu \
    --cc=peterz@infradead.org \
    --cc=sergey.senozhatsky@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.