Date: Fri, 27 Apr 2007 19:02:30 +0200
From: Andi Kleen
To: Tim Hockin
Cc: vojtech@suse.cz, linux-kernel@vger.kernel.org, akpm@google.com
Subject: Re: [PATCH] x86_64: dynamic MCE poll interval
Message-ID: <20070427170230.GA4574@muc.de>
References: <20070427090917.GA24922@muc.de>
X-Mailing-List: linux-kernel@vger.kernel.org

On Fri, Apr 27, 2007 at 09:58:14AM -0700, Tim Hockin wrote:
> On 27 Apr 2007 11:09:17 +0200, Andi Kleen wrote:
> >On Thu, Apr 26, 2007 at 06:02:52PM -0700, Tim Hockin wrote:
> >> Description:
> >> This patch makes the MCE poller adjust the polling interval dynamically.
> >> If we find an MCE, poll 2x faster (down to 10 ms). When we stop finding
> >> MCEs, poll 2x slower (up to check_interval seconds). The check_interval
> >> tunable becomes the max polling interval.
> >
> >Can you please fix the documentation then?
>
> Which documentation, specifically? :)

Documentation/x86_64/{boot-options.txt,machinecheck}

>
> >> Result:
> >> If you start to take a lot of correctable errors (not exceptions), you
> >> log them faster and more accurately (less chance of overflowing the MCA
> >> registers). If you don't take a lot of errors, you will see no change.
> >
> >Makes sense.
> >
> >AMD RevF can do this using the threshold interrupts too for DIMM errors
> >too without any delays -- perhaps it would also make sense to configure
> >this by default that it always triggers on all DIMM errors.
> >Right now it is just an option in /sys
>
> Can I look at this as a followon patch? I have a number of mce

Sure.

-Andi
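
For readers following the thread, the interval adjustment from the quoted patch description can be sketched roughly as below. This is only an illustration of the halve-on-error, double-when-quiet rule with the 10 ms floor and the check_interval ceiling; it is not code from the patch. The function name, the MCE_POLL_MIN_MS macro, and the use of milliseconds for the ceiling are all assumptions made for the example.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical name; the 10 ms floor comes from the patch description. */
#define MCE_POLL_MIN_MS 10u

/*
 * Hypothetical helper, not taken from the patch.
 * cur_ms:      interval used for the poll that just ran
 * found_error: true if that poll found an MCE
 * max_ms:      ceiling, i.e. check_interval converted to milliseconds
 * Returns the interval to use for the next poll.
 */
unsigned int next_poll_interval_ms(unsigned int cur_ms, bool found_error,
                                   unsigned int max_ms)
{
    assert(max_ms >= MCE_POLL_MIN_MS);

    if (found_error) {
        /* Errors seen: poll twice as fast. */
        cur_ms /= 2;
    } else if (cur_ms > max_ms / 2) {
        /* Doubling would pass the ceiling, so stop at the ceiling. */
        cur_ms = max_ms;
    } else {
        /* Quiet: poll twice as slowly. */
        cur_ms *= 2;
    }

    /* Never poll more often than the floor allows. */
    if (cur_ms < MCE_POLL_MIN_MS)
        cur_ms = MCE_POLL_MIN_MS;

    return cur_ms;
}
```

With a ceiling of 300000 ms (check_interval of 300 seconds, chosen here only as an example value), a poll that finds an error at the maximum interval drops the next interval to 150000 ms, and repeated hits bottom out at 10 ms. Comparing against max_ms / 2 before doubling avoids unsigned overflow near the ceiling.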