From: Sean Christopherson <seanjc@google.com>
To: Thomas Gleixner <tglx@linutronix.de>
Cc: Ingo Molnar <mingo@redhat.com>, Borislav Petkov <bp@alien8.de>,
Dave Hansen <dave.hansen@linux.intel.com>,
x86@kernel.org, "H. Peter Anvin" <hpa@zytor.com>,
linux-kernel@vger.kernel.org,
"Guilherme G . Piccoli" <gpiccoli@igalia.com>,
Vitaly Kuznetsov <vkuznets@redhat.com>,
Paolo Bonzini <pbonzini@redhat.com>
Subject: Re: [PATCH 1/2] x86/crash: Disable virt in core NMI crash handler to avoid double list_add
Date: Thu, 12 May 2022 14:14:20 +0000 [thread overview]
Message-ID: <Yn0WPBGGI4VcbM4S@google.com> (raw)
In-Reply-To: <87wnervxb7.ffs@tglx>
On Thu, May 12, 2022, Thomas Gleixner wrote:
> Sean,
>
> On Wed, May 11 2022 at 23:43, Sean Christopherson wrote:
> > @@ -840,6 +858,20 @@ void nmi_shootdown_cpus(nmi_shootdown_cb callback)
> > unsigned long msecs;
> > local_irq_disable();
> >
> > + /*
> > + * Invoking multiple callbacks is not currently supported, registering
> > + * the NMI handler twice will cause a list_add() double add BUG().
> > + * The exception is the "nop" handler in the emergency reboot path,
> > + * which can run after e.g. kdump's shootdown. Do nothing if the crash
> > + * handler has already run, i.e. has already prepared other CPUs, the
> > + * reboot path doesn't have any work of its to do, it just needs to
> > + * ensure all CPUs have prepared for reboot.
>
> This is confusing at best. The double list add is just one part of the
> problem, which would be trivial enough to fix.
>
> The real point is that after the first shoot down all other CPUs are
> stuck in crash_nmi_callback() and won't respond to the second NMI.
Well that's embarrasingly obvious in hindsight.
>
> So trying to run this twice is completely pointless and guaranteed to
> run into the timeout.
>
> > + */
> > + if (shootdown_callback) {
> > + WARN_ON_ONCE(callback != nmi_shootdown_nop);
> > + return;
>
> Instead of playing games with the callback pointer, I prefer to make
> this all explicit. Delta patch below.
Much better. If you're planning on doing fixup, can you also include a comment
tweak about why the callback is left set? If not, I'll do it in v2. A bit
overkill, but it's another thing that for me was obvious only once I realized "why".
Thanks!
diff --git a/arch/x86/kernel/reboot.c b/arch/x86/kernel/reboot.c
index 4e3a839ae146..808d3e75fb2d 100644
--- a/arch/x86/kernel/reboot.c
+++ b/arch/x86/kernel/reboot.c
@@ -896,7 +896,11 @@ void nmi_shootdown_cpus(nmi_shootdown_cb callback)
msecs--;
}
- /* Leave the nmi callback set */
+ /*
+ * Leave the nmi callback set, shootdown is a one-time thing. Clearing
+ * the callback could result in a NULL pointer dereference if a CPU
+ * (finally) responds after the timeout expires.
+ */
}
static inline void nmi_shootdown_cpus_on_restart(void)
next prev parent reply other threads:[~2022-05-12 14:14 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-05-11 23:43 [PATCH 0/2] x86/crash: Fix double list_add nmi_shootdown bug Sean Christopherson
2022-05-11 23:43 ` [PATCH 1/2] x86/crash: Disable virt in core NMI crash handler to avoid double list_add Sean Christopherson
2022-05-12 9:14 ` Vitaly Kuznetsov
2022-05-12 10:51 ` Thomas Gleixner
2022-05-12 14:14 ` Sean Christopherson [this message]
2022-05-12 14:35 ` Sean Christopherson
2022-05-12 15:48 ` Thomas Gleixner
2022-05-11 23:43 ` [PATCH 2/2] x86/reboot: Disable virtualization in an emergency if SVM is supported Sean Christopherson
2022-05-12 8:37 ` Vitaly Kuznetsov
2022-05-12 10:57 ` Thomas Gleixner
2022-05-12 14:39 ` Sean Christopherson
2022-05-12 15:47 ` Thomas Gleixner
2022-05-13 11:10 ` [PATCH] x86/nmi: Make register_nmi_handler() more robust Thomas Gleixner
2022-05-15 11:37 ` Thomas Gleixner
2022-05-15 11:39 ` [PATCH V2] " Thomas Gleixner
2022-05-17 7:34 ` [tip: x86/core] " tip-bot2 for Thomas Gleixner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Yn0WPBGGI4VcbM4S@google.com \
--to=seanjc@google.com \
--cc=bp@alien8.de \
--cc=dave.hansen@linux.intel.com \
--cc=gpiccoli@igalia.com \
--cc=hpa@zytor.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@redhat.com \
--cc=pbonzini@redhat.com \
--cc=tglx@linutronix.de \
--cc=vkuznets@redhat.com \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.