From: Don Zickus <dzickus@redhat.com>
To: Russ Anderson <rja@sgi.com>
Cc: linux-kernel@vger.kernel.org, x86@kernel.org,
Thomas Gleixner <tglx@linutronix.de>,
Ingo Molnar <mingo@redhat.com>, "H. Peter Anvin" <hpa@zytor.com>,
Andrew Morton <akpm@linux-foundation.org>,
Greg Kroah-Hartman <gregkh@linuxfoundation.org>,
rja@americas.sgi.com
Subject: Re: [PATCH] x86: Avoid intermixing cpu dump_stack output on multi-processor systems
Date: Tue, 29 May 2012 19:54:07 -0400 [thread overview]
Message-ID: <20120529235407.GH32472@redhat.com> (raw)
In-Reply-To: <20120529231135.GD4592@sgi.com>
On Tue, May 29, 2012 at 06:11:35PM -0500, Russ Anderson wrote:
> > > In this case, I am just using the hardware NMI, which sends the NMI
> > > signal to each logical cpu. Since each cpu receives the NMI at nearly
> > > the exact same time, they end up in dump_stack() at the same time.
> > > Without some form of locking, trace lines from different cpus end
> > > up intermixed, making it impossible to tell what any individual
> > > cpu was doing.
> >
> > I forgot the original reasons for having the NMI go to each CPU instead of
> > just the boot CPU (commit 78c06176), but it seems like if you revert that
> > patch and have the nmi handler just call trigger_all_cpu_backtrace()
> > instead (which does stack trace locking for pretty output), that would
> > solve your problem, no? That locking is safe because it is only called in
> > the NMI context.
>
> We want NMI to hit all the cpus at the same time to get a coherent
> snapshot of what is happening in the system at one point in time.
> Sending an IPI one cpu at a time skews the results, and doesn't
Oh, I thought it was broadcasting, but I see the apic_uv code serializes
it. Though getting all those hardware locks in the nmi handler has to be
time consuming? But I know you guys did some tricks to speed that up.
> really solve the problem of multiple cpus going into dump_stack()
> at the same time. NMI isn't the only possible caller of dump_stack().
I am curious, your NMI handler has locking wrapped around dump_stack,
shouldn't that serialize the output the way you want it? Why isn't that
working?
>
> FWIW, "Wait for up to 10 seconds for all CPUs to do the backtrace" on
> a 4096 cpu system isn't long enough. :-)
Good point. :-)
>
> > Whereas the lock you are proposing can be called in a mixture of NMI and
> > IRQ which could cause deadlocks I believe.
>
> Since this is a lock just around the dump_stack printk, would
> checking for forward progress and a timeout to catch any possible
> deadlock be sufficient? In the unlikely case of a deadlock the
> lock gets broken and some of the cpu backtraces get intermixed.
> That is still a huge improvement over the current case where
> all of the backtraces get intermixed.
I saw your new patch based on Frederick's input. It seems to take care of
deadlock situations though you run into the starving lock problem that
ticketed spinlocks solved. Which is why I am curious why moving the
locking one layer up to the NMI handler (which is where it is currently),
didn't fix your problem.
Cheers,
Don
next prev parent reply other threads:[~2012-05-29 23:54 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-05-24 14:42 [PATCH] x86: Avoid intermixing cpu dump_stack output on multi-processor systems Russ Anderson
2012-05-24 15:34 ` Frederic Weisbecker
2012-05-29 18:50 ` Russ Anderson
2012-05-29 17:53 ` Don Zickus
2012-05-29 19:19 ` Russ Anderson
2012-05-29 22:39 ` Don Zickus
2012-05-29 23:11 ` Russ Anderson
2012-05-29 23:54 ` Don Zickus [this message]
2012-06-01 22:56 ` Russ Anderson
2012-06-04 14:23 ` Don Zickus
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20120529235407.GH32472@redhat.com \
--to=dzickus@redhat.com \
--cc=akpm@linux-foundation.org \
--cc=gregkh@linuxfoundation.org \
--cc=hpa@zytor.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@redhat.com \
--cc=rja@americas.sgi.com \
--cc=rja@sgi.com \
--cc=tglx@linutronix.de \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox