From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id AF5C1CF8548 for ; Wed, 2 Oct 2024 23:51:06 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id ED2B46B04EF; Wed, 2 Oct 2024 19:51:05 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id E5B616B04F0; Wed, 2 Oct 2024 19:51:05 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id CADEB6B04F1; Wed, 2 Oct 2024 19:51:05 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0014.hostedemail.com [216.40.44.14]) by kanga.kvack.org (Postfix) with ESMTP id A41B36B04EF for ; Wed, 2 Oct 2024 19:51:05 -0400 (EDT) Received: from smtpin16.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay07.hostedemail.com (Postfix) with ESMTP id 183DF1609F6 for ; Wed, 2 Oct 2024 23:51:05 +0000 (UTC) X-FDA: 82630310490.16.01F3AEF Received: from mx0a-00069f02.pphosted.com (mx0a-00069f02.pphosted.com [205.220.165.32]) by imf18.hostedemail.com (Postfix) with ESMTP id 8754A1C0014 for ; Wed, 2 Oct 2024 23:51:01 +0000 (UTC) Authentication-Results: imf18.hostedemail.com; dkim=pass header.d=oracle.com header.s=corp-2023-11-20 header.b=cqjI0sbe; dkim=pass header.d=oracle.onmicrosoft.com header.s=selector2-oracle-onmicrosoft-com header.b=DcgmdVoW; dmarc=pass (policy=reject) header.from=oracle.com; arc=pass ("microsoft.com:s=arcselector10001:i=1"); spf=pass (imf18.hostedemail.com: domain of jane.chu@oracle.com designates 205.220.165.32 as permitted sender) smtp.mailfrom=jane.chu@oracle.com ARC-Seal: i=2; s=arc-20220608; d=hostedemail.com; t=1727912967; a=rsa-sha256; cv=pass; b=MbiNP6bNL4Qtl/OF0ykHKIOjDQloo6r3T5CNsGLvfPQs6C4u1+MPmh8tOFOaeHt8nTY0h6 uZA548Ps/nS262HBt94zH8Cwzm5Mt1PVpOh+w7tqGgufbkteZUzao4VrpCWeh4L2CHYafG Uxa5soBIqQWCzkE9+/GEyV2JEElMpK4= ARC-Authentication-Results: i=2; imf18.hostedemail.com; dkim=pass header.d=oracle.com header.s=corp-2023-11-20 header.b=cqjI0sbe; dkim=pass header.d=oracle.onmicrosoft.com header.s=selector2-oracle-onmicrosoft-com header.b=DcgmdVoW; dmarc=pass (policy=reject) header.from=oracle.com; arc=pass ("microsoft.com:s=arcselector10001:i=1"); spf=pass (imf18.hostedemail.com: domain of jane.chu@oracle.com designates 205.220.165.32 as permitted sender) smtp.mailfrom=jane.chu@oracle.com ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1727912967; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=t+sIL9m+J8QtkxGwuiXMjc4hCvQrWxswpAQDDLy20GQ=; b=Smy+CduDW/v5Rjd2KRqyiqjGm6S+SgfGqq2TjIt3UoQDmz+PoTreSwFiMe78fEmnS9Vzrp Aj6FpwSl3o52gdd/ZkbM/OGTgVl7KwtTG04ZPzFY89eTX1GAovZWKH9HOAkk/VMEQOrcyZ 0E/+/JHbAlTeZ9wypBxSQ9g2pVKYBCI= Received: from pps.filterd (m0246627.ppops.net [127.0.0.1]) by mx0b-00069f02.pphosted.com (8.18.1.2/8.18.1.2) with ESMTP id 492MgMfW003377; Wed, 2 Oct 2024 23:50:52 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h= message-id:date:subject:to:cc:references:from:in-reply-to :content-type:content-transfer-encoding:mime-version; s= corp-2023-11-20; bh=t+sIL9m+J8QtkxGwuiXMjc4hCvQrWxswpAQDDLy20GQ=; b= cqjI0sber3fm+rX7RjTCLuy9z7WATY1S5XqsuRe6kJAeJj3Gzsa30nBgifaq/lVZ KMBLOMjWuIEgqupMOuPaplVmUihk/9nJEe6t3qVsHQNxj9qLkowZMCQFJxu//2Mi e5sFmeLmKprfig7WXYke1LMYkuYvc+7GvONysiKcwIbQjSDnMSJ70dK9NWD/rYk4 aqOoGsG8UwjjCZPWBz2Qk4ogmpQVgIpJ+cA3Y3a14xlF7CXUZBAgr0D26QbPbvFs 73ZGzYudZQ0baZzuFFC+L8PQXqQKCZivGF+/3+pAFzkNOfFkFrAJRQHvOZV64Pvd /Iv5CSn0sd839QWxv74QMg== Received: from iadpaimrmta02.imrmtpd1.prodappiadaev1.oraclevcn.com (iadpaimrmta02.appoci.oracle.com [147.154.18.20]) by mx0b-00069f02.pphosted.com (PPS) with ESMTPS id 41x8d1jtxg-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 02 Oct 2024 23:50:51 +0000 (GMT) Received: from pps.filterd (iadpaimrmta02.imrmtpd1.prodappiadaev1.oraclevcn.com [127.0.0.1]) by iadpaimrmta02.imrmtpd1.prodappiadaev1.oraclevcn.com (8.18.1.2/8.18.1.2) with ESMTP id 492LTgUN026273; Wed, 2 Oct 2024 23:50:50 GMT Received: from nam02-sn1-obe.outbound.protection.outlook.com (mail-sn1nam02lp2040.outbound.protection.outlook.com [104.47.57.40]) by iadpaimrmta02.imrmtpd1.prodappiadaev1.oraclevcn.com (PPS) with ESMTPS id 41x889jr73-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 02 Oct 2024 23:50:50 +0000 ARC-Seal: i=1; a=rsa-sha256; s=arcselector10001; d=microsoft.com; cv=none; b=vdUAA2Wb5Y906b6hDqoJMiNZ5s25vKeeQPBQEJk4l6jTueO0DnnL3WlYznZK/umr6cUEFnTaPsLS7UJxZgRM7qTyaI6/Y7Rf9EX/VK0NU73B2WYGJtY0XPmczgLcOWbjWkLmZgjWMbFdsAjlKNi391YP0Q1Q9H06Q00oWZAj5GpDHOFDDPpxXESh4tDZndBt7POTUDs3jrbVgCCmr3CZSw5IEqaJyvxKHNl5jne/7K3u01tLCT0Gaq+03O534IMQApwZ4T4K0FGOktVfXl6Bu5MOlX4TzC7azXT2yMQvyUNJcMfJQ7ZtUPeY2Y+Xcz9ulInH6OnyGtslNwogzEv0nw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector10001; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=t+sIL9m+J8QtkxGwuiXMjc4hCvQrWxswpAQDDLy20GQ=; b=c8c9ZGehUqcSgNNtX46SvL0fbR0Cz/qGa361QjI27ceS4icD5EndOUPxh9/KwFZ7jK5+y7eVvrPmQiTYVumLTc9yntiCkpTHP8RFZ37iJKgCsAh2lEUJmVbkqrO2XLHGW34++A7ddsOcvJb2zmm5yMsq9rMNzjM/6nEdkJy98M2MJgIOYhzaj1d5sjif/V+H/plo3TXmH7igiZRk6YF1HRIenCVMNkUuTHhPMQNH7/KfACPZF7M6cGnK+CN0ldrAxGeNDIfR+PEuJNZcd0KkIneDhJjeAHkdv5z5RnfWrWz4XuPXbDJMkoH1FTu/FZlTill0LlR34zF3v+bdOpOn/A== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=oracle.com; dmarc=pass action=none header.from=oracle.com; dkim=pass header.d=oracle.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.onmicrosoft.com; s=selector2-oracle-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=t+sIL9m+J8QtkxGwuiXMjc4hCvQrWxswpAQDDLy20GQ=; b=DcgmdVoWYKQe/igBkYb9ttjWHefhn8pc+l6bb9uEO0LfrhQO+blKMHNgA/VoK0ZwMVcH0CbhDIckyVopTfXM2Y+pKuxNDDyhG4RoJm21bIQ9oNBncPXVZzVkrrlLmQcLBoR46JqKgBwF0CcsShJQvd9HQBvtbSBww4XiGSpd+ro= Received: from SA2PR10MB4780.namprd10.prod.outlook.com (2603:10b6:806:118::5) by BN0PR10MB5094.namprd10.prod.outlook.com (2603:10b6:408:129::5) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.8026.16; Wed, 2 Oct 2024 23:50:47 +0000 Received: from SA2PR10MB4780.namprd10.prod.outlook.com ([fe80::b66:5132:4bd6:3acb]) by SA2PR10MB4780.namprd10.prod.outlook.com ([fe80::b66:5132:4bd6:3acb%7]) with mapi id 15.20.8026.016; Wed, 2 Oct 2024 23:50:47 +0000 Message-ID: Date: Wed, 2 Oct 2024 16:50:43 -0700 User-Agent: Mozilla Thunderbird Subject: Re: [RFC PATCH v1 1/2] mm/memory-failure: introduce global MFR policy To: Jiaqi Yan , nao.horiguchi@gmail.com, linmiaohe@huawei.com Cc: tony.luck@intel.com, wangkefeng.wang@huawei.com, akpm@linux-foundation.org, osalvador@suse.de, rientjes@google.com, duenwen@google.com, jthoughton@google.com, jgg@nvidia.com, ankita@nvidia.com, peterx@redhat.com, linux-mm@kvack.org, jane.chu@oracle.com References: <20240924043924.3562257-1-jiaqiyan@google.com> <20240924043924.3562257-2-jiaqiyan@google.com> Content-Language: en-US From: jane.chu@oracle.com In-Reply-To: <20240924043924.3562257-2-jiaqiyan@google.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-ClientProxiedBy: BYAPR11CA0107.namprd11.prod.outlook.com (2603:10b6:a03:f4::48) To SA2PR10MB4780.namprd10.prod.outlook.com (2603:10b6:806:118::5) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: SA2PR10MB4780:EE_|BN0PR10MB5094:EE_ X-MS-Office365-Filtering-Correlation-Id: 04253f4f-78e7-4efd-a74b-08dce33d09ce X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0;ARA:13230040|366016|7416014|376014|1800799024; X-Microsoft-Antispam-Message-Info: =?utf-8?B?OU0xUXUzVW5SUG5Rc1ZjZnZwOE9mZDNZMEErVGhKb09KTHB1aVQ2cTYvbk02?= =?utf-8?B?SHBubGI2ZWpxNjNKWU50a3FwLytubHorNlNzOUp1ZW5ES05TdGk4TFhlM05L?= =?utf-8?B?Uk1jN0ZLVXpjajlrTCtraS9IaVlaajQyQm9wc1oyUkNoNWhsVVFRZlpaZThy?= =?utf-8?B?SEM0NXJ1WDVibnBiTlB5eE5pUHM4YmhoT0ptdWxkRlRKS1RTOXBnd25LRGQz?= =?utf-8?B?WDhEZDhUS1lqbnRQUnlyRWFiMGNYMUJiTlNEN1ZhWGRFVDA5MU8zOVhTMldW?= =?utf-8?B?THlMNjQ5a0RzcEhEM0F0cmRwWDNTbHFRS2I1b1ppZWlGNGhTcUkwT2t2ZzBW?= =?utf-8?B?OXZVeVlSQ09Jd0NaVHJoTURwcSt1Nk0wTGFza0hzTDQ4TThQYlFQVGxCYlVQ?= =?utf-8?B?VXVoQnZtYjRLMmpWOU1Cak1lclBxL0RkdU9Rd0RKdHFROEM5OVErcjlvSE41?= =?utf-8?B?dTJoRmUrVGoxOEZXRzBsT0ovY3pxd09UVHVmWWlWYW9qZnJvNEpTSng3OEh2?= =?utf-8?B?dmNBN1hoVFliaitoYlh2TE9XdTNLdDdQSVEyNFR6RmRCTTJDRlFhL29SQTN0?= =?utf-8?B?dGZWTmYvUzNUS0tkOTV6MEVnY0VISkdxMjBKSUI2MW1PbDdTWFBhNkFuWWlh?= =?utf-8?B?UDhWeUl3QTAyemx5MmUzREpVOEx1RnZiZWxWWFhtbFBzWjFGZU10WDVSQWZ5?= =?utf-8?B?SVdoQWlVcnR3M1dzS2MwRjNaeDluMmFBaDhWbDhOL2RIRjJGaXdXaWtFYUp4?= =?utf-8?B?bU9Vc0g1MFhUTmwyWXFJVzZFelJKY0wxZ01tanhSQzNwK2NrS05nNFBxSktm?= =?utf-8?B?ZkMzMHdqZ1kyYXFFRHNJb0dGbENWMUZDalVrYXk1bmdjL3BwaW8rOThYTGQv?= =?utf-8?B?U1ZGb0htT1ZQYXlta2c0a0ZVUlVDYmlOYUZHcjVvY2MvS1dxRDRrYUt1MXRD?= =?utf-8?B?cllGeDdrbWp3WUZhVDdpOHFJOE1hOGcrQ2Z2TG9JMjFQZVRhdHplRERQQnlr?= =?utf-8?B?Uk1LRW5HdDc1SCtjTzRrVDZVemIvOTBkQVhBbmZDQjJCRVJzancyR2taK0lS?= =?utf-8?B?Z1JoTytxTjNiTVI5TjU0cTNERlJsRUNWU0x0Sms1cnFEMzQzcndHcFd3eWdk?= =?utf-8?B?Ny9UR1Q2MDRUM0twNnFoanQ5Uzc1NlQ1c2ZNOXJYZXpPaDh6cHNkMHZvdFZo?= =?utf-8?B?bEsyMGkxUjVzUWFHQ09oRTdYZGV3VHRNZWZjcldtVzhYakhmMEU4RldtVk1r?= =?utf-8?B?Zndmei9NMDdIT3Z6QmE1M0o0TnY2ZVNsQWFFMmpQamNKbWJPeXAzaG5RUVlN?= =?utf-8?B?T09xZVJROXFFQ3lyZ3E3ZDl0OVBhQ1ROSEVEQmIybUtGaTEvZ29qWXN2WFVV?= =?utf-8?B?clhEa1ltL3UyajMzejlDejF0YTBEL0FPOVJ2bk9wOXpmcWdkT1MrMHNlbmxv?= =?utf-8?B?YndwbjFLVHJFTkhHQ0Y1bjdFQ3Z0UWNkMTl1bm5FMDM1Y201MFd6a2taQnlG?= =?utf-8?B?M0VPMEtKVWhzbm0vWmcySWJCN0tScTBDMUczRTJ2SFNpbmFJMVdveU5jNTFW?= =?utf-8?B?M1NTM0lqcS9uV1hPaDQ5emg0OTRFblVNYU5KeFAwRk9HaDVWMFNNdVJTakU5?= =?utf-8?B?aEpibHd4MWdoYlFEVjBEMmd1S0ZlSEFaNDQvTC96amV2aFdJSy9WRmo3WXYr?= =?utf-8?B?dXlQVG1XNUErenppYTBiUi9SN0J4cGRIQkZiWkZXeUxWQkdWb2dLREZBPT0=?= X-Forefront-Antispam-Report: CIP:255.255.255.255;CTRY:;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:SA2PR10MB4780.namprd10.prod.outlook.com;PTR:;CAT:NONE;SFS:(13230040)(366016)(7416014)(376014)(1800799024);DIR:OUT;SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?B?VjlCSk1ub1BFV09RZ2liOUNibCtBM1VrRC9uZkl5ZloxUklNc3J0b29mZTVz?= =?utf-8?B?TGNUbXowUFpyVTNGMytrQkZVWlBZYU5Ld2M3Vk1aTUlyRTd6Y3dFVFpGVmxz?= =?utf-8?B?YzNKMmtzM1h0bld3SzFaNkJwNGgyWlM2dXJ6WGEzYUFPNWdtSkVmMHpkSzI2?= =?utf-8?B?bEtDQUdGN3p5eWc1SWIwMU5hT1JzZGIwVVVGTFA5U0syNm9PdENWM2Z2cXNq?= =?utf-8?B?VUh2Y3dYVnhzN0wwYWxZbTBqNlBySlZkOGlSRmEvMVpzWGdKUW1JOGFyWGIy?= =?utf-8?B?a2tHTUNXUEQ2QTJiRWdzSTkxeU1IU0xWdkFkcUl1NmNHN2V5aW9rb0xLais4?= =?utf-8?B?UTJLWlFNSlRQMzVLZUxSUEZaeGVsOTU2NWZpMGJldC9oRU13d3U5R280ZzNB?= =?utf-8?B?WHNPejB5elh5THhnc0o3eGdYUnMrQnNFaUhRdkk1VE96RG1GL3VYQXZzbjRm?= =?utf-8?B?RWZLVjczL1pMa3ZJWU4xZzJMNEIwZmZMY0N0T3RWMVdBWmU5ZStZN292NVBq?= =?utf-8?B?bUhsOSs3c2IwVzBlZVV5ZEpKNWRjeW50MHRqcE9qei9iNm5iVUYzNGxab1BK?= =?utf-8?B?aVk0SytNRmw0MEtnVU1kVEFvb2FkRkRWWDgwZHY0bUtzc25WOWtHSFJUY0Zn?= =?utf-8?B?NWtScG1YK1FpVFdMZ2FJbWYvc0h1em1EVDd4RVBkZ1BUNlBocDh4U2phcERB?= =?utf-8?B?TE1HeDNMdXRFemJlQnBNdktJeE13bzRJK3BEVld2SFd5T1h6Z2s0Rlo0ZjYr?= =?utf-8?B?RkROdk5KQ1lnN0FUMzJzdGZRcXd3YjV1QjM4cjF3dFJzNUJTUlRLaTZIcnJk?= =?utf-8?B?YWlDcEM2TmVWREw1WGFGRWl2YmQ1dXZsWWJGYVhDODE1Sm9ZSUVWSnVGQ2g2?= =?utf-8?B?MmhIcjNmQnVidnBNM3VUMnJGTDB4V2ZxRGoyaEtxeE1jMHBMMXRtVmJHeGtS?= =?utf-8?B?M0V1cWc4QkxYVzNyRlRvU2M5cjBFWDA5Q1p3ZlVreFNaSUVuajIzbTV2UXgr?= =?utf-8?B?eXhrcVYvMEVyRzR6MDZtSnRWSUIvZ3BZNCtGbGYzVWpTRWd5YjNQS1dXRU9y?= =?utf-8?B?TGRLS0xZdEZmUXY1RmtPYlhFdUtvc3BhMDdGSlFhZnNIZEVzeldDYk9ZSEFZ?= =?utf-8?B?Mk5DSTF1Y1h6QUZCcTY3ME5PcFlYWnFiUU4ralEwNkJjc0NjeG0wamlhOUdn?= =?utf-8?B?K05jYWErTys4aUZzM3Rtdm1CWlF3a3lGUnRTd1BTNUs5a0NzZ29SYVVoQzJi?= =?utf-8?B?YzM3NnpQMkFYc3NubmIyeFZocWNKQVJOcXRSUWw5c05QVGd6RWx2aUtvczJ0?= =?utf-8?B?SUhrU2VZaHlsL2U4bHJaTnBpcW9PdCt1ZkdQb0pUM0t1M0RVMFh1a2ErWHdz?= =?utf-8?B?ZXlLM09pMWVWZVlIaEhKYUxzWXBxSmV1N2YzYjhUNkNXeHpXd2gvTXJ0N3Uy?= =?utf-8?B?R0drT0NLWU9xdmxPR0hGM2pVMkdUSzJ3ODZxQ0RReCtjUVRySXJnNm9qd0NM?= =?utf-8?B?Y3Jnd1ljVjJ1dnhQZEFvVWFZZGNpWWt6OTRXbklna1lDWjhtTzZ0QlFnblk4?= =?utf-8?B?d1hKLzYzVnRmd2RFeGEwemFVRkFFMDc2MGRWUjVOQ2UyK1A0S1QwTHhtMFRp?= =?utf-8?B?OTVOb3Z3YUlMQUd2Q2tSMmMvQ0Z0dDhUOWUycjNUOXV4V3Bqb3ZMVzhYN2FY?= =?utf-8?B?Um5yWEc3NmZ4QWhCUytLOUNZcVl3eVdTamY4NE5mNjdLVklRZVVOelNLYWxI?= =?utf-8?B?V1hOUzA5MTVubklRblRpSGpZT2lCV2NWeHNIam5QVTZ2bHVySTdjdURBcHJU?= =?utf-8?B?UU0xbWJJWjh3Y1U3Vnp0SFp6ck5ncDdEcTIzeEVtNVRad0FzYWVwVDN5RDlN?= =?utf-8?B?STU5Y1ZPbkQxRXNScVRCSnF6b0dWdDFBWVNFR3paZHZ4NGw5UVl5NDJqVmhG?= =?utf-8?B?V0I4OFViRWMwM3ZQeEpGRS8zS0JjYzNxRm5ZSCtIeXgrZzZZalpUODVwV3Nh?= =?utf-8?B?R1U3VmQ4ejJWL0RyQ0FlZWtINHNBK0tXdHBLbG1odlR1ME5xbXVpZXN5MFBy?= =?utf-8?B?OXAyRUxDRmN3Z0djVFlpdjhIWHdCeUorSGMySzJWTWFHZjJkYkR3QW9SZHpk?= =?utf-8?Q?Uucoh3NCeZCY4s9XH0bdclfp+?= X-MS-Exchange-AntiSpam-ExternalHop-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-ExternalHop-MessageData-0: 8bKHCIsi69wbk4qEmCcCWwjBaBG7ofH3vA1SGd58y9gng5MVn4dDtkGIk6QXOjDJeicucjLSTf4DyyajZOHENZWKJEncp+V73dSRm35gMoDFheyr4cgccCe6k6EhsAMvU3XVcWLSp5Hu2p1F0wcqXuWTDvl7zg23grDWWtB4fbF/17ywBzqgLEfCSVZMcOOhBNMABuXAUo/QoKKIAytM/hjWCqbRvQEqeROXcDHZjSqPi976p6oS292wA7xbCqGaHHQlowS7/Sc8zCMQ4OGjQw2yMfbqvIAkifMMrLL5+ele3IhmzKFl7LoRuYfjt8w/76sYPysaXs2LzPrf92TCNR5QkB28jgOrWw4zuLt8JdXVAGi0nRXjDh5zdQCMJeu7zoesTARxF/evJCHn30VjB088npiwecPVhO4+WFoF5R+g+TXM1OlB6Jjr3hOP5yD736HUIERWjvgD3KuZH/x1RmUcugi75hTnuzyxHb6v1/pnu3umA0xEl9dDS5BrqkFX/qF7y/nTb4RUZzxiaFclKVz/U2Qfd7eLWEHSCYVZWj+PmWEsTrIpP5ujDojlKvEOApQrLDlRoYRCVxYeqLMoXE84vJL78t7uSfIpPb3dSP8= X-OriginatorOrg: oracle.com X-MS-Exchange-CrossTenant-Network-Message-Id: 04253f4f-78e7-4efd-a74b-08dce33d09ce X-MS-Exchange-CrossTenant-AuthSource: SA2PR10MB4780.namprd10.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 02 Oct 2024 23:50:47.8350 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: 4e2c6054-71cb-48f1-bd6c-3a9705aca71b X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: Mk0ZnTgQalw5PKVRSMo/d240npeQ+wXi17JaYP0giXxBc3XaV4rHJ3T0RIUwV2h0I3fGYrBHXUDyMWcDU4ailQ== X-MS-Exchange-Transport-CrossTenantHeadersStamped: BN0PR10MB5094 X-Proofpoint-Virus-Version: vendor=baseguard engine=ICAP:2.0.293,Aquarius:18.0.1051,Hydra:6.0.680,FMLib:17.12.62.30 definitions=2024-10-02_21,2024-09-30_01,2024-09-30_01 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 adultscore=0 bulkscore=0 suspectscore=0 malwarescore=0 mlxlogscore=999 phishscore=0 spamscore=0 mlxscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2408220000 definitions=main-2410020169 X-Proofpoint-ORIG-GUID: XSOvi6cCU_KgQqnGb8wPQGV_qg0Fi4W1 X-Proofpoint-GUID: XSOvi6cCU_KgQqnGb8wPQGV_qg0Fi4W1 X-Rspamd-Queue-Id: 8754A1C0014 X-Rspam-User: X-Rspamd-Server: rspam05 X-Stat-Signature: ojh77hzd7b4d851r3zf4eiiy1bqasboj X-HE-Tag: 1727913061-636598 X-HE-Meta: U2FsdGVkX1+LBO4IYgD3JsF4MuVsR07MnNbVwLuGPB3xjmQgInTBtt8mxe9Khky1jWN9E1rDd8uUgmIpj0LMWzV94MxevGOUnKD0fu7oLmwUBqOMFMdVa97S8oFesXLQG4VIZhkSEcmUp2GMUdrwQb8NtlgVTBJ2wds26oRJL+mqN0TvN7b0/RWoG/6Xyp2XQpbofIZL6XFECupErO65cjBh53Hy1FaIUJ+p4PKSt6loXP+pREyn0uOIcWcB0BxzMg2GJIUt3POn54mg+jjNrAYLlMUmIbsBdAD06OIqglPIiNfPnRD/zeQhmJSMsbBzSz6leXHfiFslpp+gwmSBaFqJ9XtQ96n2kxjjfLIr54yNCGSzANKpIOArLRve+L+y10SIf+7olswuyx7/ywO+GtKurHGNXyzMXx8Qf0vxZMdy+iu4WXxmj2SjGALKeZ9E1TT1p41zDxOfgUoP29ja//OFvfOB+9b8zP8mWynbwppCsbgmyUR2GyM4yF8SuqLo/d74qTyy+aFr/nvU/Kuz8cAUjBe/LYEWyCRphM2B9JOgdgSf/6mPp+O1qX0ScpYtumDFuYbqI0ykUWO6i4HJSoZW5w98Ia8GXKTERTIpixr2Scm1uou8pyvIYwR7hzeHLVBil3gLQzK0kkexbNZg6lMF3OXDFgyoQpkknAtLrgGzpFbzGvRTUSHFStuLiuYAJ+Y29wPWRPorFrE0lSLUoKiOfNhjgOCVJxRywMbSzpWMk6x2ghWO7FtqQpksLrAUNn9cuhtYYSHYhUZMX0T3ya/O0u8q8xlInClOONTkDtt8u+zZRaS51jjqGyTODJo/wfbpz29qUhcmRMvnr/6NnKSZ37T0T6oe73Ot8rEGAOcvJA5XOJEfujlQuhuxShcEa2mU0daBMyQe2iLW+4pg/5Y3eGqWP29ENYrqhJf3xZUftVgYYo9z/MU3hpenbb3Q5B3AkZrXcUlrlh1LvKP 7ExhcmHv q62mpvHHdZWUY8vi4IUS4/u1/+G3OTmlkUWbA6sf0IInFWGMiOnjSPzjtfmcFSkFZmFeruk60UYz9qANlcPo4ul6WssILnfGcQoMED4y4YQ+CXph8/JEwvFtdYH2atFloLSUeqk0keEWxVKtBG6eU9m9mMNeyX9GZ3xikHu+whDINmtqlhfMRZlMLc5GA70ZZF+Npqz2Nuen9bZanLN+97C1xAwuajtd8l8/fiPGozinM45HUCY3vnKlKVC2qC6nx9m9CELQxEhMN1fjFvEE0BoQB+YNbSyDTT5sa772IIkrh/0Yva7F8Ag8a7lVVPAc9FdO9vh2r3diDVYQnh0a275FpU/XC4apeA9/KtpebW5OHagOKIYTTTG8mTe1o1sJAkDcfQG1mEbuIeAyT+XDWPa5d80O9t42Kb68j+j0gSourlBE7+wPR69RXAQk/buO/mVat7u5sr2/X7L11yABRhAAC0+sngwN8js0v44xmTNt6IR9/zcX4TKjJwy+GymKF+roQJ4xqvE/D9+FDCISN+i/zKpMsJ44vSx732LiRuRCre/Fi4urm2PVPDFtRpyaG7a/FFUuSPJzFr62MYpm5VeOl0/uz2iII4/Xi7Raa0ZvIsTsvT64ygJkDoUmYsNq6M++oWZjSY51VOLU= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Hi, On 9/23/2024 9:39 PM, Jiaqi Yan wrote: > > + /* > + * On ARM64, if APEI failed to claims SEA, (e.g. GHES driver doesn't > + * register to SEA notifications from firmware), memory_failure will > + * never be synchrounous to the error consumption thread. Notifying > + * it via SIGBUS synchrnously has to be done by either core kernel in > + * do_mem_abort, or KVM in kvm_handle_guest_abort. > + */ > + if (!sysctl_enable_hard_offline) { > + pr_info_once("%#lx: disabled by /proc/sys/vm/enable_hard_offline\n", pfn); > + kill_procs_now(p, pfn, flags, page_folio(p)); > + res = -EOPNOTSUPP; > + goto unlock_mutex; > + } > + I am curious why the SIGBUS is sent without setting PG_hwpoison in the page.   In 0/2 there seems to be indication about threads coordinate with each other such that clean subpages in a poisoned hugetlb page continue to be accessible, and at some point, (or perhaps I misread), the poisoned page (sub- or huge-) will eventually be isolated, because, it's unthinkable to let a poisoned page laying around and kernel treats it like a clean page ?  But I'm not sure how do you plan to handle it without PG_hwpoison while hard_offline is disabled globally. Another thing I'm curious at is whether you have tested with real hardware UE - the one that triggers MCE.  When a real UE is consumed by the training process, the user process must longjmp out in order to avoid getting stuck at the same instruction that fetched a UE memory.  Given a longjmp is needed (unless I am missing something), the training process is already in a situation where it has to figure out things like rewind, where-to-restart-from, does it even keep states? etc. On the whole, whether the burden to ask user application to deal with what's lacking in the kernel, namely the lack of splitting up a hugetlb page, is worthwhile, is something that need to be weighed over. Thanks, -jane