From: Don Zickus <dzickus@redhat.com>
To: DDD <dongdong.deng@windriver.com>
Cc: mingo@elte.hu, peterz@infradead.org, tglx@linutronix.de,
mingo@redhat.com, hpa@zytor.com, x86@kernel.org,
linux-kernel@vger.kernel.org
Subject: Re: [V3 PATCH] x86: avoid calling arch_trigger_all_cpu_backtrace() at the same time on SMP
Date: Thu, 11 Nov 2010 22:39:54 -0500
Message-ID: <20101112033954.GL4823@redhat.com>
In-Reply-To: <4CDCA895.7010806@windriver.com>
On Fri, Nov 12, 2010 at 10:38:13AM +0800, DDD wrote:
> Dongdong Deng wrote:
> >The spin_lock_debug/rcu_cpu_stall detector uses
> >trigger_all_cpu_backtrace() to dump cpu backtrace.
> >Therefore it is possible that trigger_all_cpu_backtrace()
> >could be called at the same time on different CPUs, which
> >triggers an 'unknown reason NMI' warning. The following case
> >illustrates the problem:
> >
> >       CPU1                    CPU2          ...   CPU N
> >                    trigger_all_cpu_backtrace()
> >                    set "backtrace_mask" to cpu mask
> >                               |
> >generate NMI interrupts  generate NMI interrupts ...
> >    \                          |                    /
> >     \                         |                   /
> >    The "backtrace_mask" will be cleared by the first NMI interrupt
> >  at nmi_watchdog_tick(), then the following NMI interrupts generated
> >by other CPUs' arch_trigger_all_cpu_backtrace() will be treated as
> >unknown reason NMI interrupts.
> >
> >This patch uses a lock to avoid the problem, and skips the
> >arch_trigger_all_cpu_backtrace() call to avoid dumping duplicate cpu
> >backtrace info when there is already a trigger_all_cpu_backtrace()
> >in progress.
> >
> >Signed-off-by: Dongdong Deng <dongdong.deng@windriver.com>
> >Reviewed-by: Bruce Ashfield <bruce.ashfield@windriver.com>
> >CC: Thomas Gleixner <tglx@linutronix.de>
> >CC: Ingo Molnar <mingo@redhat.com>
> >CC: "H. Peter Anvin" <hpa@zytor.com>
> >CC: x86@kernel.org
> >CC: linux-kernel@vger.kernel.org
> >Signed-off-by: Don Zickus <dzickus@redhat.com>
> >---
> > arch/x86/kernel/apic/hw_nmi.c | 23 +++++++++++++++++++++++
> > arch/x86/kernel/apic/nmi.c | 23 +++++++++++++++++++++++
> > 2 files changed, 46 insertions(+), 0 deletions(-)
> >
> >diff --git a/arch/x86/kernel/apic/hw_nmi.c b/arch/x86/kernel/apic/hw_nmi.c
> >index cefd694..bfdab3b 100644
> >--- a/arch/x86/kernel/apic/hw_nmi.c
> >+++ b/arch/x86/kernel/apic/hw_nmi.c
> >@@ -26,9 +26,27 @@ u64 hw_nmi_get_sample_period(void)
> > }
> > #ifdef ARCH_HAS_NMI_WATCHDOG
> >+/* "in progress" flag of arch_trigger_all_cpu_backtrace */
> >+static unsigned long backtrace_flag;
> >+
> > void arch_trigger_all_cpu_backtrace(void)
> > {
> > 	int i;
> >+	unsigned long flags;
> >+
> >+	/*
> >+	 * Have to disable irq here, as the
> >+	 * arch_trigger_all_cpu_backtrace() could be
> >+	 * triggered by "spin_lock()" with irqs on.
> >+	 */
> >+	local_irq_save(flags);
> >+
> >+	if (test_and_set_bit(0, &backtrace_flag))
> >+		/*
> >+		 * If there is already a trigger_all_cpu_backtrace() in progress
> >+		 * (backtrace_flag == 1), don't output double cpu dump infos.
> >+		 */
> >+		goto out_restore_irq;
> > 	cpumask_copy(to_cpumask(backtrace_mask), cpu_online_mask);
> >@@ -41,6 +59,11 @@ void arch_trigger_all_cpu_backtrace(void)
> > 			break;
> > 		mdelay(1);
> > 	}
> >+
> >+	clear_bit(0, &backtrace_flag);
>
> Hi Don,
>
> The clear_bit() on x86 (arch/x86/include/asm/bitops.h) does not
> contain a memory barrier. Could you add a memory barrier op here
> when you pick up this patch?
>
> + clear_bit(0, &backtrace_flag);
> + smp_mb__after_clear_bit();
>
>
Ok. Thanks.
Cheers,
Don
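
For readers following the thread, the "in progress" flag idea can be
sketched in userspace with C11 atomics. This is only an illustration
under stated assumptions, not the kernel code from the patch: the names
dump_in_progress, dump_begin() and dump_end() are hypothetical, and
C11 acquire/release ordering is an analogue of, not an exact match for,
clear_bit() followed by an explicit barrier.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical userspace stand-in for the patch's backtrace_flag. */
static atomic_bool dump_in_progress;

/*
 * Try to become the single dumper. Returns true if the caller won and
 * must call dump_end() later; returns false if a dump is already
 * running, in which case the caller should skip its own dump.
 * This plays the role of test_and_set_bit(0, &backtrace_flag).
 */
static bool dump_begin(void)
{
	return !atomic_exchange_explicit(&dump_in_progress, true,
					 memory_order_acquire);
}

/*
 * Release the flag. The release store orders the dumper's earlier
 * writes before the flag is seen as clear. In the kernel, clear_bit()
 * carries no ordering by itself, which is why an explicit barrier was
 * requested next to it.
 */
static void dump_end(void)
{
	/* Only the current owner of the flag may release it. */
	assert(atomic_load_explicit(&dump_in_progress,
				    memory_order_relaxed));
	atomic_store_explicit(&dump_in_progress, false,
			      memory_order_release);
}
```

A caller would wrap the dump as "if (dump_begin()) { ...dump...;
dump_end(); }", so a second caller arriving mid-dump returns early
instead of sending a second round of NMIs.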