Subject: Re: [RFC][PATCH 3/3] x86: Add workaround to NMI iret woes
From: Steven Rostedt
To: linux-kernel@vger.kernel.org
Cc: Ingo Molnar, Andrew Morton, Thomas Gleixner, Peter Zijlstra,
	Linus Torvalds, "H. Peter Anvin", Frederic Weisbecker, Jason Baron,
	Mathieu Desnoyers, "H. Peter Anvin", Paul Turner, Borislav Petkov
In-Reply-To: <1323448478.1937.26.camel@frodo>
References: <20111208193003.112037550@goodmis.org>
	<20111208193136.366941904@goodmis.org>
	<1323373012.30977.123.camel@frodo>
	<1323398616.30977.167.camel@frodo>
	<1323448478.1937.26.camel@frodo>
Date: Fri, 09 Dec 2011 12:19:31 -0500
Message-ID: <1323451171.1937.33.camel@frodo>

[ added Boris as he's my AMD guy ]

On Fri, 2011-12-09 at 11:34 -0500, Steven Rostedt wrote:
> On Thu, 2011-12-08 at 21:43 -0500, Steven Rostedt wrote:
>
> > diff --git a/arch/x86/kernel/traps.c b/arch/x86/kernel/traps.c
> > index a8e3eb8..906a02a 100644
> > --- a/arch/x86/kernel/traps.c
> > +++ b/arch/x86/kernel/traps.c
> > @@ -723,4 +723,9 @@ void __init trap_init(void)
> >  	cpu_init();
> >
> >  	x86_init.irqs.trap_init();
> > +
> > +#ifdef CONFIG_X86_64
> > +	memcpy(&nmi_idt_table, &idt_table, IDT_ENTRIES * 16);
> > +	set_nmi_gate(1, &debug);
>
> Frederic Weisbecker told me on IRC that int3 is 3 #bp, not 1 #db. I need
> to also add that:
>
> 	set_nmi_gate(3, &int3)
>
> Mathieu says we need to worry about MCEs, so maybe we can add that stack
> as well.
>
> 	set_nmi_gate(18, &machine_check);

Looking at the documentation, I'm not sure NMIs can interrupt an MCE.
The MCE is higher up on the exception priority chart (thanks Peter for
pointing that out). But the documentation is vague at best.

Boris, H. Peter,

Could you shed some light on this? Can an NMI interrupt an MCE in
progress?

Of course, if it can, the NMI->MCE->NMI case could happen too. And this
problem exists today. Actually, just having an MCE happen inside an NMI
can cause the NMI->NMI problem as well.

-- Steve

>
>
> If we make NMIs not modify any stack, then we can remove the "NMI
> executing variable" on the stack. As any nested NMIs will see that it
> preempted an NMI by just checking the stack. We have to check it anyway,
> and by removing another check, this may be good to do.
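
For anyone following along, here is a rough user-space model of the table-copy idea from the quoted patch. It is only a sketch, not the kernel code: struct gate_model, the *_model names, the IST numbers and the page_fault entry are all made up for illustration, and it assumes that an NMI-time gate keeps the same handler but does not switch to a separate exception stack. Vector 18 is included only because it is the proposal under discussion above.

```c
#include <assert.h>
#include <string.h>

#define IDT_ENTRIES 256

/* Hypothetical, simplified gate; real x86-64 gates are 16-byte descriptors. */
struct gate_model {
	const char *handler;	/* handler name, for illustration only */
	int ist;		/* 0 = stay on current stack, >0 = switch to that stack */
};

static struct gate_model idt_table_model[IDT_ENTRIES];
static struct gate_model nmi_idt_table_model[IDT_ENTRIES];

/* Install a gate in the normal table. */
static void set_gate_model(int vector, const char *handler, int ist)
{
	assert(vector >= 0 && vector < IDT_ENTRIES);
	idt_table_model[vector].handler = handler;
	idt_table_model[vector].ist = ist;
}

/*
 * Assumption: the NMI-time gate keeps the handler but does not switch
 * stacks, so an exception taken inside an NMI leaves the stacks alone.
 */
static void set_nmi_gate_model(int vector, const char *handler)
{
	assert(vector >= 0 && vector < IDT_ENTRIES);
	nmi_idt_table_model[vector].handler = handler;
	nmi_idt_table_model[vector].ist = 0;
}

static void trap_init_model(void)
{
	/* Illustrative normal setup; the stack numbers are arbitrary. */
	set_gate_model(1, "debug", 1);
	set_gate_model(3, "int3", 1);
	set_gate_model(14, "page_fault", 0);
	set_gate_model(18, "machine_check", 2);

	/* Start the NMI table as an exact copy, then override selected gates. */
	memcpy(nmi_idt_table_model, idt_table_model, sizeof(idt_table_model));
	set_nmi_gate_model(1, "debug");		/* #DB */
	set_nmi_gate_model(3, "int3");		/* #BP */
	set_nmi_gate_model(18, "machine_check");	/* #MC, still an open question */
}
```

After trap_init_model() runs, the two tables differ only in vectors 1, 3 and 18. Every other entry in the NMI table is whatever the normal table had at copy time.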