From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 00CC3257AFF for ; Mon, 10 Feb 2025 19:38:38 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=170.10.133.124 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1739216322; cv=none; b=VfstJ70K2pcoLaaA/rBXUCsFKY1m0hRTdI3iFNe2jB14L2qa4yr3UvGsi52t8jOEvVH6WHpRH/f/knECi/W7iOI1lFZemAnJ5RrLl5RuQcb3GjCTDqD/fZNXtep4Ug2SXDYXZh9x9HLjY+BUgixs18pUkVBsg7sYiGVI881oVy0= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1739216322; c=relaxed/simple; bh=76d5pEBVKf4Yijczbwj7KuBibjsq0aAlESoqsTf2wbw=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=Ejp9wv6KTrbYl51wnaURQNbfljzlhe4JNF08uENfgHeE49ggp98R0p5KbQOV8Ijzoren2Jbl/mUoMig7BPY7UzGLtsPp41glxICyWTE2AAqVSW8hfvJF3TOgwawxeD6e6BJzicwWX3xhu8wQnYr0khmxDOSjnTXcCLO8w8NQ/t8= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=redhat.com; spf=pass smtp.mailfrom=redhat.com; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b=es0bOGsN; arc=none smtp.client-ip=170.10.133.124 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=redhat.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=redhat.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b="es0bOGsN" DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1739216318; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=8sC+CKPivo9nJ2nwAVN+E4SAgEKJPEiCVQrD/mzwcqM=; b=es0bOGsN+emNWNv0iKtdB+/awn1NRoHB7YrfChy+rLlYA5DTzw/IxgRvietxiTkm8QeMQJ PAAYFe9CnfSBFi2IOFCsoMSOVm2xD9R1eH2egtajc9Z9ybOlUE+b/drXUagETlqCbzaihq 1k1rfy1IcAwFoqUrQWAAeKi5VLJGMC4= Received: from mail-wm1-f70.google.com (mail-wm1-f70.google.com [209.85.128.70]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-492-j4WVX1OcPXyIzAUB6uNPSw-1; Mon, 10 Feb 2025 14:38:36 -0500 X-MC-Unique: j4WVX1OcPXyIzAUB6uNPSw-1 X-Mimecast-MFC-AGG-ID: j4WVX1OcPXyIzAUB6uNPSw Received: by mail-wm1-f70.google.com with SMTP id 5b1f17b1804b1-4394c489babso2566105e9.1 for ; Mon, 10 Feb 2025 11:38:36 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1739216315; x=1739821115; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=8sC+CKPivo9nJ2nwAVN+E4SAgEKJPEiCVQrD/mzwcqM=; b=niudn0467AzJcnh0n5wtJ8Vvbrij4YVQVg80jQTWHXXkuqTSdKf1lLG0VNA2T438JP g17bVud/yz29JoIPJ2YI1/NkwAGPZ4v7htcShzfS5dB1j4sIHiVIvQMFpdEUxTsZ8KSa Wmp7gJ9SALlGbRqBq0DZRdP3BRbtsuvzxBic1cOIa7LgSCoZEPHrFkHsueo4uumhx66N Smcne4L6V/QvPys0BWgKvfYwcv+RrOu9kXd1xVjP7A4JgF4pS5uZVJZG6XL8iiga562t 9dMCqDQyq0oq5LPMd0pZeEbLxKA3esMt7RpzYJRv5wqwqqbbaRU3AWFEd2bvE293HYyT AS/w== X-Forwarded-Encrypted: i=1; AJvYcCUHcke+8uVv99sdc1VW8ncCzPtG14vjw3R84g9otKt5eR9y/SoVIqv13qyDQhesYBjfEnUtvjrBkP6TuvF7XS3I@vger.kernel.org X-Gm-Message-State: AOJu0Yw0C+clsKWiJNvtpuUxAtnz3KY2OdfEx9rZtOdXnKJIpSSaXXGC YTDLQdeM/uEOf27H1ZnvIVhAXH9IMnAzaZbGzGD84FwVPXAP4ljXQvCRle59vxcRDcAAgc4mhw8 jJLcPPRMnovT88cjAfg9R3c/zSDunjKNDCQohLxNpjKUBLiaIcs21h/3y8NHoNeTP2pM= X-Gm-Gg: ASbGnct9WH7SvlLNoGA+CSNoHE8WpiZjZmUPmqX3WJyffIOPXVKhse65YeTO6pzmZeK ftZDPcFMUHcFRsFHZOm77rtUZ7b246s/QIAUl/phd6QTNFve9CUhALIH3Mx6jmXBFWzCpimFfn/ 4GTTDJo7PPketpzsc1SWonTCjmZp2LXRWp8jEf1P+3vpE4TxfBIhLooqovveZP2FAxrZrkVIhPG kVEjX+hJlxfp0QRYx3WvjYooqO9J+BNzyhlYs7Wg45yP0Mm5RyVuuq5Go4N6NJvobDtEmfbNV7b AysCQpSjqO4ldCe5pOef8H3FV3myTLCcB+KXS6L+zZ/AoIzHxlYJbWYdUnGpSBhsyQ== X-Received: by 2002:a05:600c:4e91:b0:439:4637:9d9 with SMTP id 5b1f17b1804b1-43946370d97mr43287525e9.12.1739216315610; Mon, 10 Feb 2025 11:38:35 -0800 (PST) X-Google-Smtp-Source: AGHT+IF6B7q2dhfsb9dj8LY+8M1CJU6DFUQTZK7m2VERaPDmly8AB+qIiZRItOcPeyw5uy4LKosd2g== X-Received: by 2002:a05:600c:4e91:b0:439:4637:9d9 with SMTP id 5b1f17b1804b1-43946370d97mr43287075e9.12.1739216315147; Mon, 10 Feb 2025 11:38:35 -0800 (PST) Received: from localhost (p200300cbc734b80012c465cd348aaee6.dip0.t-ipconnect.de. [2003:cb:c734:b800:12c4:65cd:348a:aee6]) by smtp.gmail.com with UTF8SMTPSA id 5b1f17b1804b1-4390d94d802sm195260345e9.12.2025.02.10.11.38.31 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 10 Feb 2025 11:38:33 -0800 (PST) From: David Hildenbrand To: linux-kernel@vger.kernel.org Cc: linux-doc@vger.kernel.org, dri-devel@lists.freedesktop.org, linux-mm@kvack.org, nouveau@lists.freedesktop.org, linux-trace-kernel@vger.kernel.org, linux-perf-users@vger.kernel.org, damon@lists.linux.dev, David Hildenbrand , Andrew Morton , =?UTF-8?q?J=C3=A9r=C3=B4me=20Glisse?= , Jonathan Corbet , Alex Shi , Yanteng Si , Karol Herbst , Lyude Paul , Danilo Krummrich , David Airlie , Simona Vetter , Masami Hiramatsu , Oleg Nesterov , Peter Zijlstra , SeongJae Park , "Liam R. Howlett" , Lorenzo Stoakes , Vlastimil Babka , Jann Horn , Pasha Tatashin , Peter Xu , Alistair Popple , Jason Gunthorpe Subject: [PATCH v2 08/17] kernel/events/uprobes: handle device-exclusive entries correctly in __replace_page() Date: Mon, 10 Feb 2025 20:37:50 +0100 Message-ID: <20250210193801.781278-9-david@redhat.com> X-Mailer: git-send-email 2.48.1 In-Reply-To: <20250210193801.781278-1-david@redhat.com> References: <20250210193801.781278-1-david@redhat.com> Precedence: bulk X-Mailing-List: linux-perf-users@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Ever since commit b756a3b5e7ea ("mm: device exclusive memory access") we can return with a device-exclusive entry from page_vma_mapped_walk(). __replace_page() is not prepared for that, so teach it about these PFN swap PTEs. Note that device-private entries are so far not applicable on that path, because GUP would never have returned such folios (conversion to device-private happens by page migration, not in-place conversion of the PTE). There is a race between GUP and us locking the folio to look it up using page_vma_mapped_walk(), so this is likely a fix (unless something else could prevent that race, but it doesn't look like). pte_pfn() on something that is not a present pte could give use garbage, and we'd wrongly mess up the mapcount because it was already adjusted by calling folio_remove_rmap_pte() when making the entry device-exclusive. Fixes: b756a3b5e7ea ("mm: device exclusive memory access") Signed-off-by: David Hildenbrand --- kernel/events/uprobes.c | 13 ++++++++++++- 1 file changed, 12 insertions(+), 1 deletion(-) diff --git a/kernel/events/uprobes.c b/kernel/events/uprobes.c index 2ca797cbe465f..cd6105b100325 100644 --- a/kernel/events/uprobes.c +++ b/kernel/events/uprobes.c @@ -173,6 +173,7 @@ static int __replace_page(struct vm_area_struct *vma, unsigned long addr, DEFINE_FOLIO_VMA_WALK(pvmw, old_folio, vma, addr, 0); int err; struct mmu_notifier_range range; + pte_t pte; mmu_notifier_range_init(&range, MMU_NOTIFY_CLEAR, 0, mm, addr, addr + PAGE_SIZE); @@ -192,6 +193,16 @@ static int __replace_page(struct vm_area_struct *vma, unsigned long addr, if (!page_vma_mapped_walk(&pvmw)) goto unlock; VM_BUG_ON_PAGE(addr != pvmw.address, old_page); + pte = ptep_get(pvmw.pte); + + /* + * Handle PFN swap PTES, such as device-exclusive ones, that actually + * map pages: simply trigger GUP again to fix it up. + */ + if (unlikely(!pte_present(pte))) { + page_vma_mapped_walk_done(&pvmw); + goto unlock; + } if (new_page) { folio_get(new_folio); @@ -206,7 +217,7 @@ static int __replace_page(struct vm_area_struct *vma, unsigned long addr, inc_mm_counter(mm, MM_ANONPAGES); } - flush_cache_page(vma, addr, pte_pfn(ptep_get(pvmw.pte))); + flush_cache_page(vma, addr, pte_pfn(pte)); ptep_clear_flush(vma, addr, pvmw.pte); if (new_page) set_pte_at(mm, addr, pvmw.pte, -- 2.48.1