From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 2B45CC77B75 for ; Mon, 15 May 2023 20:32:41 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229558AbjEOUck (ORCPT ); Mon, 15 May 2023 16:32:40 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49198 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S245277AbjEOUcj (ORCPT ); Mon, 15 May 2023 16:32:39 -0400 Received: from lobo.ruivo.org (lobo.ruivo.org [173.14.175.98]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 030A4D848 for ; Mon, 15 May 2023 13:32:38 -0700 (PDT) Received: by lobo.ruivo.org (Postfix, from userid 1011) id 05A2752B7C; Mon, 15 May 2023 16:32:37 -0400 (EDT) Received: from jake.ruivo.org (bob.qemu.ruivo [192.168.72.19]) by lobo.ruivo.org (Postfix) with ESMTPA id 4B5D052A1A; Mon, 15 May 2023 16:32:20 -0400 (EDT) Received: by jake.ruivo.org (Postfix, from userid 1000) id 3CBD922005A; Mon, 15 May 2023 16:32:20 -0400 (EDT) Date: Mon, 15 May 2023 16:32:20 -0400 From: Aristeu Rozanski To: "Luck, Tony" Cc: Borislav Petkov , "linux-edac@vger.kernel.org" , Aristeu Rozanski Subject: Re: [PATCH] mce: prevent concurrent polling of MCE events Message-ID: <20230515203220.GE4090740@cathedrallabs.org> References: <20230515143225.GC4090740@cathedrallabs.org> <20230515145227.GGZGJHKyH9sAfToD03@fat_crate.local> <20230515183050.GJZGJ6Wsmr4Yf/H5Ps@fat_crate.local> <20230515194423.GKZGKLl+8mJiLoJAp1@fat_crate.local> <20230515202057.GD4090740@cathedrallabs.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/2.2.9 (2022-11-12) Precedence: bulk List-ID: X-Mailing-List: linux-edac@vger.kernel.org On Mon, May 15, 2023 at 08:27:19PM +0000, Luck, Tony wrote: > If CMCI storms are their problem ... perhaps this is an answer: > > https://lore.kernel.org/all/20230411173841.70491-1-tony.luck@intel.com/ Apparently they're disabling CMCI for CE and UCNA events when OS-first is enabled for two entire lines of products. So there might be more behind this than I suspected. I just sent a message to their engineers to find out what's going on. Now on the CMCI storm mitigation, wouldn't a system that got a storm be subject to this same issue of duplicated events since it switches to polling? -- Aristeu