From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 5CAF9C7EE30 for ; Wed, 2 Jul 2025 06:02:32 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id F10796B00C2; Wed, 2 Jul 2025 02:02:31 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id EC0576B00D7; Wed, 2 Jul 2025 02:02:31 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id D88916B00D8; Wed, 2 Jul 2025 02:02:31 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0015.hostedemail.com [216.40.44.15]) by kanga.kvack.org (Postfix) with ESMTP id C4EA76B00C2 for ; Wed, 2 Jul 2025 02:02:31 -0400 (EDT) Received: from smtpin12.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay02.hostedemail.com (Postfix) with ESMTP id 9341812472B for ; Wed, 2 Jul 2025 06:02:31 +0000 (UTC) X-FDA: 83618280102.12.BBBFC1A Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by imf16.hostedemail.com (Postfix) with ESMTP id C8862180008 for ; Wed, 2 Jul 2025 06:02:29 +0000 (UTC) Authentication-Results: imf16.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=CEkudyYT; spf=pass (imf16.hostedemail.com: domain of npache@redhat.com designates 170.10.133.124 as permitted sender) smtp.mailfrom=npache@redhat.com; dmarc=pass (policy=quarantine) header.from=redhat.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1751436149; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=nIGpPqgNzgq47EY5ZTEnTs1FneYRqT/zHlE7fmslHU8=; b=8LZpOKWyhBKJF+mPt54PrtVdEZUUaQOhn5u+YaQaXdRsF4+bJFPADmRWDSk4Ml4Nx0cHVm hsWMApgSVkBHA7sIs0V+8HfqEGxay4rNqwsLa/CuhBRlsAf/GO7J4ddcmnLvH2sHWIOWTs PLeJP1KkBB2cAYWOut9dKzvzUg1p6Yo= ARC-Authentication-Results: i=1; imf16.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=CEkudyYT; spf=pass (imf16.hostedemail.com: domain of npache@redhat.com designates 170.10.133.124 as permitted sender) smtp.mailfrom=npache@redhat.com; dmarc=pass (policy=quarantine) header.from=redhat.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1751436149; a=rsa-sha256; cv=none; b=GdmNpyE29tgXpHKwwsUDdqzJH5ntXu4ZBejHJqkaggO5Anc+8bNXhoD175cbNNbH1m00SQ himYEtp/zr4UIJjL2Tt3ke+Cs9L5DeXQ/nvwv1ufZniEOpzbEyVXmlRLCY6YNEecV1bMSV YO7l2lmRzhhdr7RSt9g7Eqqc73nMiw0= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1751436149; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=nIGpPqgNzgq47EY5ZTEnTs1FneYRqT/zHlE7fmslHU8=; b=CEkudyYT8gvyAG5K5o9PXfT2ya5g9TObKuc45vKeYfGPJxYMes+v9u/kbhxM5wJR44K1o9 JPuyy0Vcnj0ns+YB2ve2Q2cuRwb1w6FI59QUci6m0015RgA7JR/YuL3+mQMmyaX2f0saoJ PW8Oz1vd30VUpcUDb+Zh5jm0mi0f61k= Received: from mx-prod-mc-02.mail-002.prod.us-west-2.aws.redhat.com (ec2-54-186-198-63.us-west-2.compute.amazonaws.com [54.186.198.63]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-636-exMEV-qmP7mWzW_W7wlbDA-1; Wed, 02 Jul 2025 02:02:25 -0400 X-MC-Unique: exMEV-qmP7mWzW_W7wlbDA-1 X-Mimecast-MFC-AGG-ID: exMEV-qmP7mWzW_W7wlbDA_1751436141 Received: from mx-prod-int-06.mail-002.prod.us-west-2.aws.redhat.com (mx-prod-int-06.mail-002.prod.us-west-2.aws.redhat.com [10.30.177.93]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by mx-prod-mc-02.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTPS id 3F2E51955ECA; Wed, 2 Jul 2025 06:02:21 +0000 (UTC) Received: from h1.redhat.com (unknown [10.22.88.112]) by mx-prod-int-06.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTPS id 1B00618003FC; Wed, 2 Jul 2025 06:02:05 +0000 (UTC) From: Nico Pache To: linux-mm@kvack.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-trace-kernel@vger.kernel.org Cc: david@redhat.com, ziy@nvidia.com, baolin.wang@linux.alibaba.com, lorenzo.stoakes@oracle.com, Liam.Howlett@oracle.com, ryan.roberts@arm.com, dev.jain@arm.com, corbet@lwn.net, rostedt@goodmis.org, mhiramat@kernel.org, mathieu.desnoyers@efficios.com, akpm@linux-foundation.org, baohua@kernel.org, willy@infradead.org, peterx@redhat.com, wangkefeng.wang@huawei.com, usamaarif642@gmail.com, sunnanyong@huawei.com, vishal.moola@gmail.com, thomas.hellstrom@linux.intel.com, yang@os.amperecomputing.com, kirill.shutemov@linux.intel.com, aarcange@redhat.com, raquini@redhat.com, anshuman.khandual@arm.com, catalin.marinas@arm.com, tiwai@suse.de, will@kernel.org, dave.hansen@linux.intel.com, jack@suse.cz, cl@gentwo.org, jglisse@google.com, surenb@google.com, zokeefe@google.com, hannes@cmpxchg.org, rientjes@google.com, mhocko@suse.com, rdunlap@infradead.org, Bagas Sanjaya Subject: [PATCH v8 15/15] Documentation: mm: update the admin guide for mTHP collapse Date: Tue, 1 Jul 2025 23:57:42 -0600 Message-ID: <20250702055742.102808-16-npache@redhat.com> In-Reply-To: <20250702055742.102808-1-npache@redhat.com> References: <20250702055742.102808-1-npache@redhat.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Scanned-By: MIMEDefang 3.4.1 on 10.30.177.93 X-Rspamd-Server: rspam02 X-Rspamd-Queue-Id: C8862180008 X-Stat-Signature: tc7jxp8fxs13ussoioktss4ftuje3d3x X-Rspam-User: X-HE-Tag: 1751436149-688957 X-HE-Meta: U2FsdGVkX19nCqfcE0oGld0P80Vy4L0sKcguJ5lmOD5w9pVzKbgFxQgBxr5Dd66to53R86KQLme9PLNhZ8OGSZs0owpq5VLf6Bwok9hEJW5EuY2W7IFOW90h4fxrruOGPIgX69P1xRSANoJjMZ91vK5VsIYPmUv9tUuMTpJFh07RdLS98sp3n6B3/OwRcNXfCwetw0QLBsPDjtzFzuqwoESJuBY7tq1qsU0T3AIHvGCto8d040oO8BZLkZhnMvx4f3zVL323GKCStNu/hN7r+CkXHrpqcrtC9RXzcRuQlAhjNFs3S9AQssQNdhawPvsBKrOXu0BjOsiBwHfawxvkD79iTtjy0pLFfNh4wz9Sj2L5jc24fmAU1LrauHUN3vEYsmBlQa0YmDiaJRIzudxwBhPYUvx/awV+Zky8VacF+6YjJ4UIu1JCvdML93Wo1Ud2wQqVjdxGmUnyHchyjHeCZ4gNciz5SJebzZ1v7ejvGMKfswR744EIZbJxnELOJzK2UrYjOH+oTUJ1u4ieJoRH8fRBzHelcxi9TQHJM/fkUuomv7ivcc7jNgWUOy14WIshmFERVZ4TPCI8LZFO2AWAUfsi7yNohpNEpVeSlEyVZz1SOoyxKLN6CTW5eSEsv75ZLVuz3KbuybtqBQBFMmDsH9CGNFwfb3hPIaE023ze0v+D2n64QQKX06RpduA+swhrMPvJh71K3EE75OHcjvlNuCA9msB2Bxups04gWczrpBYRsEoVz+cOUGyIgWafQZwgpESbh+fGXBwxsoiZR2vKT5UJz73IR7Iecc5G4gNRs2Jnpnb1G+NRKPpRWgCUeFaN4aiGwjrF36g454lokRiLRlSh4Ijsw0hXMCAtOISNXkYZ5tbj1Pni99yQHKID1TnDcjjt6f+DA6RNrC134uTi6i4GGrDJALfKLEX4NEJxE9A1IJ3ILZS/OwUM+a0bCO8TJJvMGkKqEvVUHE/40wI WpC/VXN7 D0fPtIn6ZQEaX4vuArz+Fus+oTfPm2BT4ViAGnBcbKyOh3jb5Fu/UHhgNo4l4KY59Xn4jsFL0SYSBQU9atntPKrgYOy6sSegbiu1XJPr4dN69G/QY32nAlaIcvWkNRNzfQ1pguZvEvGExj5nRsv1jxNy7DMgoaQHjuKkYHJuu6XC7+ZGe2ZgOJDOyaZZXyHQhlJOJNU57U/IpRDoBDY6n4h1diaQ0UTVzDeoTEp/4FTw0MN8bSbzjMeDLgUMvmj+pKs3dnFs9+ru+93kw1rvSQ9N1wgeKh7c3aIgLrHrhKVb/HZ3eeyGIAZO2jRNS4RLM6QRiJ4bavvoQRLPv0ukrP+pI69dcDQljgITJGzYxUNclnA+qSHG4jGc9zLyQS7zCt2ExAf4875sPVSdRmSwjBLhUXToVATcZjz9T5GOTHarvX00= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Now that we can collapse to mTHPs lets update the admin guide to reflect these changes and provide proper guidence on how to utilize it. Reviewed-by: Bagas Sanjaya Signed-off-by: Nico Pache --- Documentation/admin-guide/mm/transhuge.rst | 19 +++++++++++++------ 1 file changed, 13 insertions(+), 6 deletions(-) diff --git a/Documentation/admin-guide/mm/transhuge.rst b/Documentation/admin-guide/mm/transhuge.rst index dff8d5985f0f..878796b4d7d3 100644 --- a/Documentation/admin-guide/mm/transhuge.rst +++ b/Documentation/admin-guide/mm/transhuge.rst @@ -63,7 +63,7 @@ often. THP can be enabled system wide or restricted to certain tasks or even memory ranges inside task's address space. Unless THP is completely disabled, there is ``khugepaged`` daemon that scans memory and -collapses sequences of basic pages into PMD-sized huge pages. +collapses sequences of basic pages into huge pages. The THP behaviour is controlled via :ref:`sysfs ` interface and using madvise(2) and prctl(2) system calls. @@ -144,6 +144,18 @@ hugepage sizes have enabled="never". If enabling multiple hugepage sizes, the kernel will select the most appropriate enabled size for a given allocation. +khugepaged uses max_ptes_none scaled to the order of the enabled mTHP size +to determine collapses. When using mTHPs it's recommended to set +max_ptes_none low-- ideally less than HPAGE_PMD_NR / 2 (255 on 4k page +size). This will prevent undesired "creep" behavior that leads to +continuously collapsing to the largest mTHP size; when we collapse, we are +bringing in new non-zero pages that will, on a subsequent scan, cause the +max_ptes_none check of the +1 order to always be satisfied. By limiting +this to less than half the current order, we make sure we don't cause this +feedback loop. max_ptes_shared and max_ptes_swap have no effect when +collapsing to a mTHP, and mTHP collapse will fail on shared or swapped out +pages. + It's also possible to limit defrag efforts in the VM to generate anonymous hugepages in case they're not immediately free to madvise regions or to never try to defrag memory and simply fallback to regular @@ -221,11 +233,6 @@ top-level control are "never") Khugepaged controls ------------------- -.. note:: - khugepaged currently only searches for opportunities to collapse to - PMD-sized THP and no attempt is made to collapse to other THP - sizes. - khugepaged runs usually at low frequency so while one may not want to invoke defrag algorithms synchronously during the page faults, it should be worth invoking defrag at least in khugepaged. However it's -- 2.49.0