From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from bombadil.infradead.org (bombadil.infradead.org [198.137.202.133]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id D0A6AC4332F for ; Mon, 13 Nov 2023 05:18:47 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=bombadil.20210309; h=Sender: Content-Transfer-Encoding:Content-Type:List-Subscribe:List-Help:List-Post: List-Archive:List-Unsubscribe:List-Id:In-Reply-To:MIME-Version:References: Message-ID:Subject:Cc:To:From:Date:Reply-To:Content-ID:Content-Description: Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID: List-Owner; bh=9OSrexmOu/LTJYMRE+dQ0u73nrotORrHTiMDpXztweQ=; b=CL61J41nLa+plg dy6pb3FAu7zt/mto186QJdYaUDwm6pnoos6hMQ+DbUsqdLJIHvVu4rqm+dQ6axRWGP4JjbCeYwyJi 9etjsI6rtNLrBmaf6bNbCAHW1qWGgWJwpBEsgMcxC6bP7ZoMC/hLsG7Cze5RUjnCxedE5uW9HKT12 9W1PTBfC3rwFRz7/L66Lquho+X1PYMLP0S5rej0Aj1vnfvfcgRAqNU/Cxs0I1lBVA8b9kvEqINVKc Qzfe018xWvVhK/pqk1IX0ML8mjp3PxhDGdLo6HWkggDetM4m+qFSqk3v3BtMhfL6ytKlTnlmwbTji SUAGjx0kXddaF/9Apvfw==; Received: from localhost ([::1] helo=bombadil.infradead.org) by bombadil.infradead.org with esmtp (Exim 4.96 #2 (Red Hat Linux)) id 1r2PKw-00DJGm-2q; Mon, 13 Nov 2023 05:18:14 +0000 Received: from casper.infradead.org ([2001:8b0:10b:1236::1]) by bombadil.infradead.org with esmtps (Exim 4.96 #2 (Red Hat Linux)) id 1r2PKw-00DJGf-07 for linux-arm-kernel@bombadil.infradead.org; Mon, 13 Nov 2023 05:18:14 +0000 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=casper.20170209; h=In-Reply-To:Content-Type:MIME-Version: References:Message-ID:Subject:Cc:To:From:Date:Sender:Reply-To: Content-Transfer-Encoding:Content-ID:Content-Description; bh=Mw5XVGZ53qHqO6W75lYP3KgdM5arMrga6Cb5+Lo7O5w=; b=jEuPH2X3TXGxfIus5MqSUUo0rE WbVw0o+U0Q5hXmq+HbiaQlrcdxkwZDdFRWSX1yy5UzPeVxebXs3+jWvyz3dN5OF1ExcMg8u7ZkYVt Fq1ePl76US38DuAlbHhjPFEwWaQ1fY3u84J/UjOdHoIDuu7txdFJsNAHja7cfk9MHYfRtcb/Cdob3 hB28RS6D8bCKXnNBRCmv5MXJ9BQv89ucTxyUcrkIKyb5tz3AqyfJYjSLCw1nn+H4jlrajcrptNuNK LeiEeGoUS/gHIp54evg6FVSDhmCPJDqMRnstXRABo5Fo0EgV9eHdKHTP0ZRdYjlbbBjcg5E8RJs/6 UZfxZZFQ==; Received: from willy by casper.infradead.org with local (Exim 4.94.2 #2 (Red Hat Linux)) id 1r2PKq-00CLiX-S5; Mon, 13 Nov 2023 05:18:08 +0000 Date: Mon, 13 Nov 2023 05:18:08 +0000 From: Matthew Wilcox To: John Hubbard Cc: Ryan Roberts , Andrew Morton , Yin Fengwei , David Hildenbrand , Yu Zhao , Catalin Marinas , Anshuman Khandual , Yang Shi , "Huang, Ying" , Zi Yan , Luis Chamberlain , Itaru Kitayama , "Kirill A. Shutemov" , David Rientjes , Vlastimil Babka , Hugh Dickins , linux-mm@kvack.org, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org Subject: Re: [PATCH v6 0/9] variable-order, large folios for anonymous memory Message-ID: References: <20230929114421.3761121-1-ryan.roberts@arm.com> MIME-Version: 1.0 Content-Disposition: inline In-Reply-To: X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org On Sun, Nov 12, 2023 at 10:57:47PM -0500, John Hubbard wrote: > I've done some initial performance testing of this patchset on an arm64 > SBSA server. When these patches are combined with the arm64 arch contpte > patches in Ryan's git tree (he has conveniently combined everything > here: [1]), we are seeing a remarkable, consistent speedup of 10.5x on > some memory-intensive workloads. Many test runs, conducted independently > by different engineers and on different machines, have convinced me and > my colleagues that this is an accurate result. > > In order to achieve that result, we used the git tree in [1] with > following settings: > > echo always >/sys/kernel/mm/transparent_hugepage/enabled > echo recommend >/sys/kernel/mm/transparent_hugepage/anon_orders > > This was on a aarch64 machine configure to use a 64KB base page size. > That configuration means that the PMD size is 512MB, which is of course > too large for practical use as a pure PMD-THP. However, with with these > small-size (less than PMD-sized) THPs, we get the improvements in TLB > coverage, while still getting pages that are small enough to be > effectively usable. That is quite remarkable! My hope is to abolish the 64kB page size configuration. ie instead of using the mixture of page sizes that you currently are -- 64k and 1M (right? Order-0, and order-4), that 4k, 64k and 2MB (order-0, order-4 and order-9) will provide better performance. Have you run any experiements with a 4kB page size? _______________________________________________ linux-arm-kernel mailing list linux-arm-kernel@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-arm-kernel