From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on
    aws-us-west-2-korg-lkml-1.web.codeaurora.org
X-Spam-Level:
X-Spam-Status: No, score=-5.5 required=3.0 tests=DKIM_INVALID,DKIM_SIGNED,
    HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,MENTIONS_GIT_HOSTING,
    SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=unavailable
    autolearn_force=no version=3.4.0
Received: from mail.kernel.org (mail.kernel.org [198.145.29.99])
    by smtp.lore.kernel.org (Postfix) with ESMTP id 7225ECA9EAF
    for ; Thu, 24 Oct 2019 19:43:11 +0000 (UTC)
Received: from lists.ozlabs.org (lists.ozlabs.org [203.11.71.2])
    (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
    (No client certificate requested)
    by mail.kernel.org (Postfix) with ESMTPS id D142B21655
    for ; Thu, 24 Oct 2019 19:43:10 +0000 (UTC)
Authentication-Results: mail.kernel.org;
    dkim=fail reason="signature verification failed" (1024-bit key)
    header.d=redhat.com header.i=@redhat.com header.b="MV95PON9"
DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org D142B21655
Authentication-Results: mail.kernel.org;
    dmarc=fail (p=none dis=none) header.from=redhat.com
Authentication-Results: mail.kernel.org;
    spf=pass smtp.mailfrom=linuxppc-dev-bounces+linuxppc-dev=archiver.kernel.org@lists.ozlabs.org
Received: from bilbo.ozlabs.org (lists.ozlabs.org [IPv6:2401:3900:2:1::3])
    by lists.ozlabs.org (Postfix) with ESMTP id 46zd2M5NFnzDqbC
    for ; Fri, 25 Oct 2019 06:43:07 +1100 (AEDT)
Authentication-Results: lists.ozlabs.org;
    spf=pass (sender SPF authorized) smtp.mailfrom=redhat.com
    (client-ip=205.139.110.120; helo=us-smtp-1.mimecast.com;
    envelope-from=david@redhat.com; receiver=)
Authentication-Results: lists.ozlabs.org;
    dmarc=pass (p=none dis=none) header.from=redhat.com
Authentication-Results: lists.ozlabs.org;
    dkim=pass (1024-bit key; unprotected) header.d=redhat.com
    header.i=@redhat.com header.b="MV95PON9"; dkim-atps=neutral
Received: from us-smtp-1.mimecast.com (us-smtp-delivery-1.mimecast.com [205.139.110.120])
    (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
    (No client certificate requested)
    by lists.ozlabs.org (Postfix) with ESMTPS id 46zQzl3YTSzDqTq
    for ; Thu, 24 Oct 2019 23:10:11 +1100 (AEDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com;
    s=mimecast20190719; t=1571919008;
    h=from:from:reply-to:subject:subject:date:date:message-id:message-id:
     to:to:cc:cc:mime-version:mime-version:content-type:content-type:
     content-transfer-encoding:content-transfer-encoding;
    bh=TooFwxaibKWioSMXkjdsd9KqTbD0P4TIgojllh09QfQ=;
    b=MV95PON9VMAJOC6c8dsscFeKqUiGlv6uvdFCLmGF2Wic3u0gcAskpmCqHLztGEIt3CJ+TM
     ch8U/EiehGGNZvheJW7dvWMgY5HaIfEsqQ6iM0+KoTZKAaWLPZRoS1RHFl8FoeW+F8as5t
     WYEue9EOkftDM7dwFbQQi7QqrFUOVgw=
Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4])
    (Using TLS) by relay.mimecast.com with ESMTP id
    us-mta-70-Mt5vh3dPMhmZKGgS4b6ZRA-1; Thu, 24 Oct 2019 08:10:06 -0400
Received: from smtp.corp.redhat.com (int-mx08.intmail.prod.int.phx2.redhat.com [10.5.11.23])
    (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits))
    (No client certificate requested)
    by mimecast-mx01.redhat.com (Postfix) with ESMTPS id 410A4801E5C;
    Thu, 24 Oct 2019 12:10:01 +0000 (UTC)
Received: from t460s.redhat.com (ovpn-116-141.ams2.redhat.com [10.36.116.141])
    by smtp.corp.redhat.com (Postfix) with ESMTP id 314413CCA;
    Thu, 24 Oct 2019 12:09:39 +0000 (UTC)
From: David Hildenbrand
To: linux-kernel@vger.kernel.org
Subject: [PATCH v1 00/10] mm: Don't mark hotplugged pages PG_reserved (including ZONE_DEVICE)
Date: Thu, 24 Oct 2019 14:09:28 +0200
Message-Id: <20191024120938.11237-1-david@redhat.com>
MIME-Version: 1.0
X-Scanned-By: MIMEDefang 2.84 on 10.5.11.23
X-MC-Unique: Mt5vh3dPMhmZKGgS4b6ZRA-1
X-Mimecast-Spam-Score: 0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
X-Mailman-Approved-At: Fri, 25 Oct 2019 06:41:16 +1100
X-BeenThere: linuxppc-dev@lists.ozlabs.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Linux on PowerPC Developers Mail List
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Cc: linux-hyperv@vger.kernel.org, Michal Hocko, Radim Krčmář,
    kvm@vger.kernel.org, David Hildenbrand, KarimAllah Ahmed, Dave Hansen,
    Alexander Duyck, Michal Hocko, linux-mm@kvack.org, Pavel Tatashin,
    Paul Mackerras, "H. Peter Anvin", Wanpeng Li, Alexander Duyck,
    "K. Y. Srinivasan", Dan Williams, Kees Cook, devel@driverdev.osuosl.org,
    Stefano Stabellini, Stephen Hemminger, "Aneesh Kumar K.V", Joerg Roedel,
    x86@kernel.org, YueHaibing, "Matthew Wilcox \(Oracle\)", Mike Rapoport,
    Peter Zijlstra, Ingo Molnar, Vlastimil Babka, Anthony Yznaga,
    Oscar Salvador, "Isaac J. Manjarres", Matt Sickler, Juergen Gross,
    Anshuman Khandual, Haiyang Zhang, Sasha Levin, kvm-ppc@vger.kernel.org,
    Qian Cai, Alex Williamson, Mike Rapoport, Borislav Petkov,
    Nicholas Piggin, Andy Lutomirski, xen-devel@lists.xenproject.org,
    Boris Ostrovsky, Vitaly Kuznetsov, Allison Randal, Jim Mattson,
    Mel Gorman, Cornelia Huck, Pavel Tatashin, Sean Christopherson,
    Thomas Gleixner, Johannes Weiner, Paolo Bonzini, Andrew Morton,
    linuxppc-dev@lists.ozlabs.org
Errors-To: linuxppc-dev-bounces+linuxppc-dev=archiver.kernel.org@lists.ozlabs.org
Sender: "Linuxppc-dev"

This is the result of a recent discussion with Michal ([1], [2]). Right
now we set all pages PG_reserved when initializing hotplugged memmaps. This
includes ZONE_DEVICE memory. In case of system memory, PG_reserved is
cleared again when onlining the memory, in case of ZONE_DEVICE memory
never.

In ancient times, we needed PG_reserved, because there was no way to tell
whether the memmap was already properly initialized. We now have
SECTION_IS_ONLINE for that in the case of !ZONE_DEVICE memory. ZONE_DEVICE
memory is already initialized deferred, and there shouldn't be a visible
change in that regard.
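
As a rough illustration of the lifecycle described above, here is a small
userspace C model. It is not kernel code: every type and function name in it
(model_section, model_init_memmap(), model_online_section()) is a hypothetical
stand-in. It only captures the current behavior as stated: all hotplugged
pages start out marked reserved, onlining clears the marker for system
memory, and ZONE_DEVICE memory keeps it because it is never onlined.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MODEL_PAGES_PER_SECTION 4

/* Hypothetical model types; none of these exist in the kernel. */
enum model_mem_kind { MODEL_SYSTEM_RAM, MODEL_ZONE_DEVICE };

struct model_section {
	enum model_mem_kind kind;
	bool online;                            /* stands in for SECTION_IS_ONLINE */
	bool reserved[MODEL_PAGES_PER_SECTION]; /* stands in for PG_reserved */
};

/* Current behavior: every page of a hotplugged memmap starts out reserved. */
void model_init_memmap(struct model_section *s, enum model_mem_kind kind)
{
	assert(s != NULL);
	s->kind = kind;
	s->online = false;
	for (size_t i = 0; i < MODEL_PAGES_PER_SECTION; i++)
		s->reserved[i] = true;
}

/*
 * Onlining only applies to system memory and clears the reserved marker.
 * ZONE_DEVICE memory is never onlined, so it stays reserved forever.
 */
bool model_online_section(struct model_section *s)
{
	assert(s != NULL);
	if (s->kind != MODEL_SYSTEM_RAM)
		return false;
	for (size_t i = 0; i < MODEL_PAGES_PER_SECTION; i++)
		s->reserved[i] = false;
	s->online = true;
	return true;
}
```

In this model the online flag already distinguishes initialized system memory
from everything else, which is the observation that makes the reserved marker
redundant for hotplugged memory.
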

One of the biggest fears was side effects. I went ahead and audited all
users of PageReserved(). The details can be found in "mm/memory_hotplug:
Don't mark pages PG_reserved when initializing the memmap".

This patch set adapts all relevant users of PageReserved() to keep the
existing behavior with respect to ZONE_DEVICE pages. The biggest part
that needs changes is KVM, to keep the existing behavior (that's all I
care about in this series).

Note that this series is able to rely completely on pfn_to_online_page().
No new is_zone_device_page() calls are introduced (as requested by Dan).
We are currently discussing a way to also mark ZONE_DEVICE memmaps as
active/initialized - pfn_active() - and lightweight locking to make sure
memmaps remain active (e.g., using RCU). We might later be able to convert
some users of pfn_to_online_page() to pfn_active(). Details can be found
in [3]; however, this represents yet another cleanup/fix we'll perform
on top of this cleanup.

I only gave it a quick test with DIMMs on x86-64, but didn't test the
ZONE_DEVICE part at all (any tips for a nice QEMU setup?). Also, I didn't
test the KVM parts (especially with ZONE_DEVICE pages or no memmap at all).
Compile-tested on x86-64 and PPC.

Based on next/master. The current version (kept updated) can be found at:
    https://github.com/davidhildenbrand/linux.git online_reserved_cleanup

RFC -> v1:
- Dropped "staging/gasket: Prepare gasket_release_page() for PG_reserved
  changes"
- Dropped "staging: kpc2000: Prepare transfer_complete_cb() for PG_reserved
  changes"
- Converted "mm/usercopy.c: Prepare check_page_span() for PG_reserved
  changes" to "mm/usercopy.c: Update comment in check_page_span()
  regarding ZONE_DEVICE"
- No new users of is_zone_device_page() are introduced.
- Rephrased comments and patch descriptions.
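
To make the kind of adaptation described above more concrete, here is a
minimal userspace sketch of a PageReserved()-style check that relies on an
online-page lookup instead of on the reserved marker of offline or
ZONE_DEVICE pages. It is a model under stated assumptions, not the code from
any patch in this series: model_pfn_to_online_page() is a hypothetical
stand-in for pfn_to_online_page(), assumed to return no page for ZONE_DEVICE
memory or for PFNs without a memmap, and all other names are invented for
illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MODEL_NR_PFNS 8

/* Hypothetical model types; none of these exist in the kernel. */
struct model_page {
	bool reserved;          /* stands in for PG_reserved */
};

struct model_pfn_state {
	bool has_memmap;        /* a memmap entry exists for this PFN */
	bool online;            /* the memory backing it has been onlined */
	struct model_page page;
};

struct model_pfn_state model_pfns[MODEL_NR_PFNS];

/* Test helper: describe one PFN of the model. */
void model_set_pfn(unsigned long pfn, bool online, bool reserved)
{
	assert(pfn < MODEL_NR_PFNS);
	model_pfns[pfn].has_memmap = true;
	model_pfns[pfn].online = online;
	model_pfns[pfn].page.reserved = reserved;
}

/* Stand-in for pfn_to_online_page(): a page only if it is online. */
struct model_page *model_pfn_to_online_page(unsigned long pfn)
{
	if (pfn >= MODEL_NR_PFNS)
		return NULL;
	if (!model_pfns[pfn].has_memmap || !model_pfns[pfn].online)
		return NULL;
	return &model_pfns[pfn].page;
}

/*
 * Adapted check: online pages are judged by their reserved marker;
 * anything that is not online (ZONE_DEVICE-like memory, no memmap)
 * is treated as reserved, which preserves the old outcome once such
 * pages are no longer marked reserved.
 */
bool model_is_reserved_pfn(unsigned long pfn)
{
	struct model_page *page = model_pfn_to_online_page(pfn);

	if (page)
		return page->reserved;
	return true;
}
```

The point of the sketch is that the caller's answer for a never-onlined page
no longer depends on that page's flags, so dropping the reserved marker from
hotplugged memmaps does not change what such a caller observes.
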

[1] https://lkml.org/lkml/2019/10/21/736
[2] https://lkml.org/lkml/2019/10/21/1034
[3] https://www.spinics.net/lists/linux-mm/msg194112.html

Cc: Michal Hocko
Cc: Dan Williams
Cc: kvm-ppc@vger.kernel.org
Cc: linuxppc-dev@lists.ozlabs.org
Cc: kvm@vger.kernel.org
Cc: linux-hyperv@vger.kernel.org
Cc: devel@driverdev.osuosl.org
Cc: xen-devel@lists.xenproject.org
Cc: x86@kernel.org
Cc: Alexander Duyck

David Hildenbrand (10):
  mm/memory_hotplug: Don't allow to online/offline memory blocks with
    holes
  KVM: x86/mmu: Prepare kvm_is_mmio_pfn() for PG_reserved changes
  KVM: Prepare kvm_is_reserved_pfn() for PG_reserved changes
  vfio/type1: Prepare is_invalid_reserved_pfn() for PG_reserved changes
  powerpc/book3s: Prepare kvmppc_book3s_instantiate_page() for
    PG_reserved changes
  powerpc/64s: Prepare hash_page_do_lazy_icache() for PG_reserved
    changes
  powerpc/mm: Prepare maybe_pte_to_page() for PG_reserved changes
  x86/mm: Prepare __ioremap_check_ram() for PG_reserved changes
  mm/memory_hotplug: Don't mark pages PG_reserved when initializing the
    memmap
  mm/usercopy.c: Update comment in check_page_span() regarding
    ZONE_DEVICE

 arch/powerpc/kvm/book3s_64_mmu_radix.c | 14 +++++----
 arch/powerpc/mm/book3s64/hash_utils.c  | 10 +++---
 arch/powerpc/mm/pgtable.c              | 10 +++---
 arch/x86/kvm/mmu.c                     | 29 ++++++++++-------
 arch/x86/mm/ioremap.c                  | 13 ++++++--
 drivers/hv/hv_balloon.c                |  6 ++++
 drivers/vfio/vfio_iommu_type1.c        | 10 ++++--
 drivers/xen/balloon.c                  |  7 +++++
 include/linux/page-flags.h             |  8 +----
 mm/memory_hotplug.c                    | 43 +++++++++++++++++++-------
 mm/page_alloc.c                        | 11 -------
 mm/usercopy.c                          |  6 ++--
 virt/kvm/kvm_main.c                    | 10 ++++--
 13 files changed, 111 insertions(+), 66 deletions(-)

-- 
2.21.0