From mboxrd@z Thu Jan 1 00:00:00 1970 From: Don Zickus Subject: Re: WARNING: at arch/x86/kernel/smp.c:119 native_smp_send_reschedule+0x25/0x43() Date: Fri, 10 Feb 2012 15:31:17 -0500 Message-ID: <20120210203117.GI5650@redhat.com> References: <1328751082.5611.6.camel@lappy> <4F34EC35.7010109@linux.vnet.ibm.com> <1328900283.25989.45.camel@laptop> <1328900633.25989.47.camel@laptop> <20120210200250.GG5650@redhat.com> <1328905121.25989.52.camel@laptop> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Cc: "Srivatsa S. Bhat" , Sasha Levin , Josh Boyer , "H. Peter Anvin" , Ingo Molnar , Thomas Gleixner , Avi Kivity , kvm , linux-kernel , x86 , Suresh B Siddha , Sergey Senozhatsky To: Peter Zijlstra Return-path: Content-Disposition: inline In-Reply-To: <1328905121.25989.52.camel@laptop> Sender: linux-kernel-owner@vger.kernel.org List-Id: kvm.vger.kernel.org On Fri, Feb 10, 2012 at 09:18:41PM +0100, Peter Zijlstra wrote: > On Fri, 2012-02-10 at 15:02 -0500, Don Zickus wrote: > > I also ran into the same problem you did and hacked up another patch that > > checked a global atomic variable that let the system know we were shutting > > down and not to do the WARN_ON (the global is already created for the NMI > > case now). > > system_state seems like that thing.. except it doesn't seem to have a PANIC state, though we could add one I suppose. The thing is even if you reverted my changes: e58d429 x86, reboot: Fix typo in nmi reboot path bda6263 x86, NMI: Add knob to disable using NMI IPIs to stop cpus 3603a25 x86, reboot: Use NMI instead of REBOOT_VECTOR to stop cpus I think you still run into the same problem because the reschedule code changed. So my second patch which I will eventually post will just skip the WARN_ON if the system is going down. Not sure if that is the proper way to address this problem or change all of the stop_this_cpu code to use a different bitmask than the cpu_online bitmask (but then you run the risk of a stuck IPI I guess if the cpu is halted without notifying anyone). Cheers, Don