Date: Mon, 3 Jun 2024 10:57:28 +0300
From: Mike Rapoport
To: David Hildenbrand
Cc: Jonathan Cameron, Dan Williams, linux-cxl@vger.kernel.org,
	linux-arm-kernel@lists.infradead.org, Sudeep Holla, Andrew Morton,
	Will Deacon, Jia He, Mike Rapoport, linuxarm@huawei.com,
	catalin.marinas@arm.com, Anshuman.Khandual@arm.com, Yuquan Wang,
	Oscar Salvador, Lorenzo Pieralisi, James Morse
Subject: Re: [RFC PATCH 8/8] HACK: mm: memory_hotplug: Drop memblock_phys_free() call in try_remove_memory()
References: <20240529171236.32002-1-Jonathan.Cameron@huawei.com>
	<20240529171236.32002-9-Jonathan.Cameron@huawei.com>

On Fri, May 31, 2024 at 09:49:32AM +0200, David Hildenbrand wrote:
> On 29.05.24 19:12, Jonathan Cameron wrote:
> > I'm not sure what this is balancing, but if it is necessary then the reserved
> > memblock approach can't be used to stash NUMA
> > node assignments as after the
> > first add / remove cycle the entry is dropped so not available if memory is
> > re-added at the same HPA.
> >
> > This patch is here to hopefully spur comments on what this is there for!
> >
> > Signed-off-by: Jonathan Cameron
> > ---
> >   mm/memory_hotplug.c | 2 +-
> >   1 file changed, 1 insertion(+), 1 deletion(-)
> >
> > diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c
> > index 431b1f6753c0..3d8dd4749dfc 100644
> > --- a/mm/memory_hotplug.c
> > +++ b/mm/memory_hotplug.c
> > @@ -2284,7 +2284,7 @@ static int __ref try_remove_memory(u64 start, u64 size)
> >   	}
> >   	if (IS_ENABLED(CONFIG_ARCH_KEEP_MEMBLOCK)) {
> > -		memblock_phys_free(start, size);
> > +		// memblock_phys_free(start, size);
> >   		memblock_remove(start, size);
> >   	}
>
> memblock_phys_free() works on memblock.reserved, memblock_remove() works on
> memblock.memory.
>
> If you take a look at the doc at the top of memblock.c:
>
> memblock.memory: physical memory available to the system
> memblock.reserved: regions that were allocated [during boot]
>
>
> memblock.memory is supposed to be a superset of memblock.reserved. Your

No, it's not. memblock.reserved is more of "if there is memory, don't touch
it". Some regions in memblock.reserved are boot time allocations and they are
indeed a subset of memblock.memory, but some are reservations done by firmware
(e.g. reserved memory in DT) that just might not have a corresponding region
in memblock.memory. It can happen, for example, when the same firmware runs on
devices with different memory configurations, but still wants to preserve some
physical addresses.

> "hack" here indicates that you somehow would be relying on the opposite
> being true, which indicates that you are doing the wrong thing.

I'm not sure about that, I still have to digest the patches :)

> memblock_remove() indeed balances against memblock_add_node() for hotplugged
> memory [add_memory_resource()].
> There seems to be a case where we would succeed
> in hotunplugging memory that was part of "memblock.reserved".
>
> But how could that happen? I think the following way:
>
> Once the buddy is up and running, memory allocated during early boot is not
> freed back to memblock, but usually we simply go via something like
> free_reserved_page(), not memblock_free() [because the buddy took over]. So
> one could end up unplugging memory that still resides in the
> memblock.reserved set.
>
> So with memblock_phys_free(), we are enforcing the invariant that
> memblock.memory is a superset of memblock.reserved.
>
> Likely, arm64 should store that node assignment elsewhere from where it can
> be queried. Or it should be using something like
> CONFIG_HAVE_MEMBLOCK_PHYS_MAP for these static windows.
>
> --
> Cheers,
>
> David / dhildenb
>

--
Sincerely yours,
Mike.
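
P.S. To make the point about memblock.reserved not necessarily being a subset
of memblock.memory more concrete, here is a toy userspace model. It is only a
sketch: the toy_* names, the addresses and the layout are all made up for
illustration, and none of this is the kernel's memblock code. It models one
RAM bank and two reservations, one that sits inside RAM (like a boot time
allocation) and one that has no RAM behind it (like a firmware reservation on
a board with less memory).

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Toy userspace model; NOT the kernel's memblock implementation. */
struct toy_region {
	uint64_t base;
	uint64_t size;
};

/* Return true if [base, base + size) lies entirely inside one region of arr. */
static bool toy_covered(const struct toy_region *arr, size_t n,
			uint64_t base, uint64_t size)
{
	for (size_t i = 0; i < n; i++) {
		if (base >= arr[i].base &&
		    base + size <= arr[i].base + arr[i].size)
			return true;
	}
	return false;
}

/* Hypothetical layout: a single 1 GiB RAM bank starting at 2 GiB. */
static const struct toy_region toy_memory[] = {
	{ 0x80000000ULL, 0x40000000ULL },
};

/* Hypothetical reservations, 1 MiB each. */
static const struct toy_region toy_reserved[] = {
	{ 0x80200000ULL, 0x00100000ULL },	/* boot time allocation, inside RAM */
	{ 0xc0000000ULL, 0x00100000ULL },	/* firmware reservation, no RAM there */
};

/* Check that only the first reservation is backed by the toy RAM bank. */
static void toy_demo(void)
{
	size_t n = sizeof(toy_memory) / sizeof(toy_memory[0]);

	assert(toy_covered(toy_memory, n,
			   toy_reserved[0].base, toy_reserved[0].size));
	assert(!toy_covered(toy_memory, n,
			    toy_reserved[1].base, toy_reserved[1].size));
}
```

Calling toy_demo() from a main() passes both assertions in this model: the
second reservation is a perfectly valid "don't touch it if it exists" entry
even though no memory region covers it.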