From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ozlabs.org (ozlabs.org [103.22.144.67]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by lists.ozlabs.org (Postfix) with ESMTPS id 3s4c7p2TYgzDqRt for ; Thu, 4 Aug 2016 14:16:22 +1000 (AEST) Received: from mx0a-001b2d01.pphosted.com (mx0a-001b2d01.pphosted.com [148.163.156.1]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ozlabs.org (Postfix) with ESMTPS id 3s4c7n4lfRz9stY for ; Thu, 4 Aug 2016 14:16:21 +1000 (AEST) Received: from pps.filterd (m0098404.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.11/8.16.0.11) with SMTP id u744EuDW039301 for ; Thu, 4 Aug 2016 00:16:19 -0400 Received: from e18.ny.us.ibm.com (e18.ny.us.ibm.com [129.33.205.208]) by mx0a-001b2d01.pphosted.com with ESMTP id 24kkagfr66-1 (version=TLSv1.2 cipher=AES256-SHA bits=256 verify=NOT) for ; Thu, 04 Aug 2016 00:16:19 -0400 Received: from localhost by e18.ny.us.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Thu, 4 Aug 2016 00:14:48 -0400 From: Stewart Smith To: Mahesh J Salgaonkar , linuxppc-dev , Benjamin Herrenschmidt , Paul Mackerras , Michael Ellerman Cc: "Shreyas B. Prabhu" Subject: Re: [PATCH v2 3/3] powernv: Fix MCE handler to avoid trashing CR0/CR1 registers. In-Reply-To: <20160301054817.15714.20605.stgit@mars.in.ibm.com> References: <20160301054655.15714.48085.stgit@mars.in.ibm.com> <20160301054817.15714.20605.stgit@mars.in.ibm.com> Date: Thu, 04 Aug 2016 14:14:39 +1000 MIME-Version: 1.0 Content-Type: text/plain Message-Id: <87mvkt891s.fsf@linux.vnet.ibm.com> List-Id: Linux on PowerPC Developers Mail List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Mahesh J Salgaonkar writes: > From: Mahesh Salgaonkar > > The current implementation of MCE early handling modifies CR0/1 registers > without saving its old values. Fix this by moving early check for > powersaving mode to machine_check_handle_early(). >>From (internal bug report) it seems as though in a test where one injects continuous SLB Multi Hit errors, this bug could lead to rebooting "due to to Platform error" rather than continuing to recover successfully. It might be a good idea to mention that in commit message here. Also, should this go to stable? -- Stewart Smith OPAL Architect, IBM.