Date: Fri, 12 Apr 2024 15:56:08 +0100
From: Mark Rutland
To: Ryan Roberts, Catalin Marinas, Will Deacon
Cc: Ard Biesheuvel, David Hildenbrand, Donald Dutile, Eric Chanudet,
    linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH v3 0/3] Speed up boot with faster linear map creation
References: <20240412131908.433043-1-ryan.roberts@arm.com>
In-Reply-To: <20240412131908.433043-1-ryan.roberts@arm.com>

On Fri, Apr 12, 2024 at 02:19:05PM +0100, Ryan Roberts wrote:
> Hi All,
>
> It turns out that creating the linear map can take a significant proportion of
> the total boot time, especially when rodata=full. And most of the time is spent
> waiting on superfluous tlb invalidation and memory barriers. This series reworks
> the kernel pgtable generation code to significantly reduce the number of those
> TLBIs, ISBs and DSBs. See each patch for details.
>
> The below shows the execution time of map_mem() across a couple of different
> systems with different RAM configurations.
> We measure after applying each patch
> and show the improvement relative to base (v6.9-rc2):
>
>                | Apple M2 VM | Ampere Altra| Ampere Altra| Ampere Altra
>                | VM, 16G     | VM, 64G     | VM, 256G    | Metal, 512G
> ---------------|-------------|-------------|-------------|-------------
>                |   ms    (%) |   ms    (%) |   ms    (%) |    ms    (%)
> ---------------|-------------|-------------|-------------|-------------
> base           |  168   (0%) | 2198   (0%) | 8644   (0%) | 17447   (0%)
> no-cont-remap  |   78 (-53%) |  435 (-80%) | 1723 (-80%) |  3779 (-78%)
> batch-barriers |   11 (-93%) |  161 (-93%) |  656 (-92%) |  1654 (-91%)
> no-alloc-remap |   10 (-94%) |  104 (-95%) |  438 (-95%) |  1223 (-93%)
>
> This series applies on top of v6.9-rc2. All mm selftests pass. I've compile and
> boot tested various PAGE_SIZE and VA size configs.

Nice!

> Ryan Roberts (3):
>   arm64: mm: Don't remap pgtables per-cont(pte|pmd) block
>   arm64: mm: Batch dsb and isb when populating pgtables
>   arm64: mm: Don't remap pgtables for allocate vs populate

For the series:

Reviewed-by: Mark Rutland

Catalin, Will, are you happy to pick this up?

Mark.