From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756505Ab1CRSYR (ORCPT ); Fri, 18 Mar 2011 14:24:17 -0400 Received: from smtp1.linux-foundation.org ([140.211.169.13]:51299 "EHLO smtp1.linux-foundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755189Ab1CRSYH (ORCPT ); Fri, 18 Mar 2011 14:24:07 -0400 Date: Fri, 18 Mar 2011 11:23:53 -0700 From: Andrew Morton To: Don Zickus Cc: x86@kernel.org, Peter Zijlstra , jwjstone@fastmail.fm, LKML Subject: Re: [PATCH 1/2 v2] watchdog, nmi: Allow hardlockup to panic by default Message-Id: <20110318112353.66ce884b.akpm@linux-foundation.org> In-Reply-To: <20110318171932.GH2743@redhat.com> References: <1299533860-1642-1-git-send-email-dzickus@redhat.com> <20110317185013.7c8be1e0.akpm@linux-foundation.org> <20110318171932.GH2743@redhat.com> X-Mailer: Sylpheed 3.0.2 (GTK+ 2.20.1; x86_64-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 18 Mar 2011 13:19:32 -0400 Don Zickus wrote: > On Thu, Mar 17, 2011 at 06:50:13PM -0700, Andrew Morton wrote: > > On Mon, 7 Mar 2011 16:37:39 -0500 Don Zickus wrote: > > > > > Add a Kconfig option to allow users to set the hardlockup to panic > > > by default. Also add in a 'nmi_watchdog=nopanic' to override this. > > > > > > > Changelog forgot to tell us "why". > > Yeah, sorry about that. > > When a cpu is considered stuck, instead of limping along and just printing > a warning, it is sometimes preferred to just panic, let kdump capture the > vmcore and reboot. This gets the machine back into a stable state quickly > while saving the info that got it into a stuck state to begin with. Ah, makes sense, thanks. I updated the changelog. > > > > > > Format: [state][,regs][,debounce][,die] > > > > > > nmi_watchdog= [KNL,BUGS=X86] Debugging features for SMP kernels > > > - Format: [panic,][num] > > > + Format: [panic,][nopanic,][num] > > > > It would be better to support panic=[0|1], if that can be simply done > > in a back-compatible fashion. > > I am open to the idea, just can't figure the best way to implement that in > a backwards compatible way. It's not worth busting a gut over ;) > Personally I was wondering if there were situations where you would _not_ > want it to panic. If the cpu is stuck spinning after 60 seconds, the odds > of it freeing itself is low and you are probably stuck rebooting anyway. > > > > > > > static int __init hardlockup_panic_setup(char *str) > > > { > > > if (!strncmp(str, "panic", 5)) > > > hardlockup_panic = 1; > > > + else if (!strncmp(str, "nopanic", 5)) > > > > s/5/7/ > > doh. > > I can send a refreshed patch with the above changes. I fixed that up.