From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id ECF05C43458 for ; Tue, 30 Jun 2026 03:38:12 +0000 (UTC) Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id A5EA310E03D; Tue, 30 Jun 2026 03:38:12 +0000 (UTC) Authentication-Results: gabe.freedesktop.org; dkim=pass (1024-bit key; unprotected) header.d=amd.com header.i=@amd.com header.b="YVicShzG"; dkim-atps=neutral Received: from SJ2PR03CU001.outbound.protection.outlook.com (mail-westusazon11012050.outbound.protection.outlook.com [52.101.43.50]) by gabe.freedesktop.org (Postfix) with ESMTPS id 5C11810E03D for ; Tue, 30 Jun 2026 03:38:11 +0000 (UTC) ARC-Seal: i=1; a=rsa-sha256; s=arcselector10001; d=microsoft.com; cv=none; b=SEi85o4m50hNrQM5rG91IlMbyuoQ8K5EOTXXmmTYdv2Sx2BbWz9hUUZiTKWond5LmSxEXakPrVejzBfmwsC9WOzL54+nqKEOqpJBgYGV5Ohyi1cl+MPecZ/0nEjkUwJN+VhGDGcN5rbs241RUrNs7ypV+dbDiqTbO+yEp+RSL08cYsRR4ykHQCSG0JsSCyFBRG18B/norcEgIU7Hwy7L1+UMWpgD81FFboVmWgXqKPOM3sYhKzviXYagmpnt9EEhRhrYO1mRndzSMjCYFQNOUgfIOKZCvFqe9csP8kh3rls4zRB8yD+arKckqzsVSOaDt964HuSRlqy1vQDOCUuuNQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector10001; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=OcVDMDekZWzHwGAjq2yILnAhbXWJb8Ju1D9FFYiIjNA=; b=tFjKMEBDSfBhxn5feWC2UpsEsxhehaG8VsT/gd4UfZDWxRrSKbrYB56QyiNXmOmR646Tvp0FSl7lIEiRI4R83Jpgm6iuwdokQrxavIGe3ZlYSrUCPxXaaR+bNuYU0U4kdxIN7F6qfL7wFyr6cA4zSHjBrIzujgYnDUUM4+jmDBL4aYHUAFTNnsGzIEhlnK4ysoOKad/D4d5GxMlJFfSQ9Xn/oc+MxmCJkmGgRXskAeOn/htTy62lxwi1flNgCEAUdZFh48q6tRen+Peg4dreYDJi9b+wT5ncLqC6ZkeUL35eyKAJevu10UvBKHw4RnhNmcJQvEY6Eo4R/u5iRHWQJw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 165.204.84.17) smtp.rcpttodomain=lists.freedesktop.org smtp.mailfrom=amd.com; dmarc=pass (p=quarantine sp=quarantine pct=100) action=none header.from=amd.com; dkim=none (message not signed); arc=none (0) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amd.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=OcVDMDekZWzHwGAjq2yILnAhbXWJb8Ju1D9FFYiIjNA=; b=YVicShzGnPT3tAuiawuqQzIlEmlnACXfgj2JCRJtQTMg40t+8dMMa48ShNRp7v9XVQ8MNp3yy5yp81xs23L+PXmeafXugCnglk/VSzcff5xS1qVfN2Zd1JREqg/ggqb5sMwL4GdqgRc0p94F6+dlwsdHLuSuJ/1Pk26tck06fdA= Received: from BY5PR04CA0013.namprd04.prod.outlook.com (2603:10b6:a03:1d0::23) by CY8PR12MB8411.namprd12.prod.outlook.com (2603:10b6:930:6e::6) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.21.159.19; Tue, 30 Jun 2026 03:38:08 +0000 Received: from CO1PEPF00012E66.namprd05.prod.outlook.com (2603:10b6:a03:1d0:cafe::5b) by BY5PR04CA0013.outlook.office365.com (2603:10b6:a03:1d0::23) with Microsoft SMTP Server (version=TLS1_3, cipher=TLS_AES_256_GCM_SHA384) id 15.21.159.19 via Frontend Transport; Tue, 30 Jun 2026 03:38:07 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 165.204.84.17) smtp.mailfrom=amd.com; dkim=none (message not signed) header.d=none;dmarc=pass action=none header.from=amd.com; Received-SPF: Pass (protection.outlook.com: domain of amd.com designates 165.204.84.17 as permitted sender) receiver=protection.outlook.com; client-ip=165.204.84.17; helo=satlexmb07.amd.com; pr=C Received: from satlexmb07.amd.com (165.204.84.17) by CO1PEPF00012E66.mail.protection.outlook.com (10.167.249.75) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.21.181.6 via Frontend Transport; Tue, 30 Jun 2026 03:38:07 +0000 Received: from Satlexmb09.amd.com (10.181.42.218) by satlexmb07.amd.com (10.181.42.216) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.2562.41; Mon, 29 Jun 2026 22:37:56 -0500 Received: from honglei-remote.amd.com (10.180.168.240) by satlexmb09.amd.com (10.181.42.218) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.2562.41; Mon, 29 Jun 2026 20:37:46 -0700 From: Honglei Huang To: CC: Honglei Huang Subject: [PATCH v8 0/5] drm/gpusvm: split MM and device state across gpusvm/range/pages Date: Tue, 30 Jun 2026 11:37:26 +0800 Message-ID: <20260630033731.362150-1-honghuan@amd.com> X-Mailer: git-send-email 2.34.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Content-Type: text/plain X-Originating-IP: [10.180.168.240] X-ClientProxiedBy: satlexmb08.amd.com (10.181.42.217) To satlexmb09.amd.com (10.181.42.218) X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: CO1PEPF00012E66:EE_|CY8PR12MB8411:EE_ X-MS-Office365-Filtering-Correlation-Id: 874ec0f5-c4ba-47f9-873f-08ded6590032 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; ARA:13230040|376014|22122799003|36860700016|82310400026|23010399003|1800799024|18002099003|11063799006|56012099006|3023799007|13003099007; X-Microsoft-Antispam-Message-Info: k1KN9v1Qd7gypRiS43kPQgQW3grpRGm10409FIw85pT62zC7Wt7Ya6/NqMq8S5SEuPUx4yIQZd6U5hAWfd4SF2886Sq307IHhb6uB2c2Gm42rE2YxmiZfVaBvf249qw3FgFhNQbNfVlB80xDNEYjNppo9yjL6VAvlTBML5W40azRWn9yV/MXJD5XUls8vtni4wyQXfgaO3B6qVTUoGcd3BXSYtuJOt87NAD5SVyn/SS0eT+kSNmKiCZpK+boT4ykZhiyDAH/bqrvTeJjYpv3bddRtsCEq3BVq5LFLhLdVfPOTFFoaNEZoqBOK/iugehWIlJYaDG6oRO6zihPFhwEDpSr5ggXSKyWVxVYOOHTX5SpZtb+attmBOItr7NiKoMUUj+RNmAnSL7imreAWl7QLCJLWqp4W1td6ifAdBfafvD8FLNMIRhhJXegin9lUdicFBdCE2UqL9QhplKVjBRHBArd0vN9qIqoT0Ie0LQKWpQXbu1/1GTSX/oAFDT2O9s4aNE7SKmmRQiCQD6qwnl1bVqYzZHwwpRNbFWIZLr6nTMKla2vGKEScfKKGhID+u9snOgGJRADgWI9UUao/SgvCbewOTEHvBJYUwUdrvnBIgNfAUCu+KVowgU5+NKgZjH22b692rmjuKAK4BlUxwifXw== X-Forefront-Antispam-Report: CIP:165.204.84.17; CTRY:US; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:satlexmb07.amd.com; PTR:InfoDomainNonexistent; CAT:NONE; SFS:(13230040)(376014)(22122799003)(36860700016)(82310400026)(23010399003)(1800799024)(18002099003)(11063799006)(56012099006)(3023799007)(13003099007); DIR:OUT; SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: 4WULVZHfdEhE7p5p7grQj1qbromIznl/2tdVzQ+yBYvVj83ezlP1X7cdNZYUzkS0BovyL/n7yDBAxWWxOpm7pWsVBBn+FiZaA84gpNgi6AVRPSb2Uiag29KUgSY5++Xcp3tMYK2l9zwBhXrjjPdcE8QOys+zVIvnIkf2nGjcj5+ZVDrotJ6pFN9ZIo510oQwjONyDZ3hFZhjcd1ENKF9QMoO19Cvty1LDJM/ByDEYwzzl/h3RZU7GgUWJwLm2/OqMriJb7rpIlfdB/x6OjCaFcAbHZaJUI9uWo/TuBpvUa6g0jRQzXiiJOoRqa+3OmJ17Rgth6RlYT+CB8Sxf677T/Y0VC02amxSA0f92of4r/SsaXDJnZExYUwbTXsdRR9zgCDho1lOejiZyY85MRE0Yx0fTTj2+o9GDI7MQQDHPKqKAU9klKxkMGAVho1o/rAN X-OriginatorOrg: amd.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 30 Jun 2026 03:38:07.6107 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 874ec0f5-c4ba-47f9-873f-08ded6590032 X-MS-Exchange-CrossTenant-Id: 3dd8961f-e488-4e60-8e11-a82d994e183d X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=3dd8961f-e488-4e60-8e11-a82d994e183d; Ip=[165.204.84.17]; Helo=[satlexmb07.amd.com] X-MS-Exchange-CrossTenant-AuthSource: CO1PEPF00012E66.namprd05.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: CY8PR12MB8411 X-BeenThere: intel-xe@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Intel Xe graphics driver List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: intel-xe-bounces@lists.freedesktop.org Sender: "Intel-xe" The intent of this series is to make drm_gpusvm more flexible and give drivers more freedom over how they assemble the MM related and device side operations. It implements the direction Matt suggested in [1]: Mirror MR in gitlab: [4] - Move struct drm_gpusvm_pages out of struct drm_gpusvm_range. - Embed a struct drm_device in struct drm_gpusvm_pages and drive all DMA through it. - Drop struct drm_device from struct drm_gpusvm. - Have the driver's range structure embed one or more struct drm_gpusvm_pages in addition to struct drm_gpusvm_range. - Drop the range-based helpers (drm_gpusvm_range_pages_valid, drm_gpusvm_range_get_pages, drm_gpusvm_range_unmap_pages) and update drivers to use the drm_gpusvm_pages helpers instead. In essence the series does only two abstractions, plus the xe adaptation that follows from them: - range vs pages: split drm_gpusvm_range (MM / VA range state) from drm_gpusvm_pages (device physical related), so the two sides can have independent lifetimes and ownership. - drm_gpusvm vs drm_device: make drm_gpusvm pure MM level and push the device side down onto drm_gpusvm_pages, which is where DMA actually happens. - xe is updated to fit the modifications, no functional change intended. V8: - patch 4: add reviewed-by for Matt's review. V7: - patch 1: split MM state flags: the AI review found a KCSAN / memory model cleanliness issue. Address it for consistency with drm_gpusvm_pages_flags, set the range flags with WRITE_ONCE() on __flags and read them with READ_ONCE(). V6: - The AI review flagged a potential DMA free issue: the DMA unmap step was moved into the range_free callback, but on the invalidate path a range can be removed from the MMU interval tree while its DMA mappings are still live, so a concurrent unmap event can miss it. - patch 3: have xe_svm_range embed one drm_gpusvm_pages: explicitly call drm_gpusvm_unmap_pages() before drm_gpusvm_range_remove() in the garbage collector, so a range is never off the tree while still DMA mapped, and document this caller contract in drm_gpusvm_range_remove() kernel-doc. - patch 4: move struct drm_gpusvm_pages out: document the unmap before remove contract in the garbage collector example and note that range_free()'s drm_gpusvm_free_pages() as a final fallback. - patch 1: split MM state flags: return -EACCES directly. - Fold in the pre existing IOVA/DMA unmap fixes the AI review found previously sent separately: the uninitialized dma_addr[0].dir on the get_pages() error path, the whole reservation IOVA free for mixed ranges, and the device mapping leak on the get_pages() error path. [6] V5: - add reviewed-by in patches 1, 2, 3, 5 for Matt's review. V4: - drm_gpusvm_init_pages(): memset() the pages to zero before recording the owning drm_device. - DOC: overview: recommend a zeroing allocator: kcalloc() for the N:1 pages array. - Rebased onto the latest drm-xe. - The AI review of this series flagged two preexisting issues in the IOVA unmap path that are not introduced by this series; they are fixed in a separate series [5]. V3: - Fix a kernel-doc/Sphinx warning from the kernel test robot: use ".. code-block:: c" for the drm_gpusvm_pages example in DOC: overview. - drm_gpusvm_range_set_unmapped(): use WRITE_ONCE() on the whole pages[i].flags.__flags word to pair with the lockless READ_ONCE() readers and avoid a data race. - xe_userptr_setup(): call drm_gpusvm_init_pages() before mmu_interval_notifier_insert() to avoid exposing uninitialized pages.drm to invalidation callbacks. - Fix per commit build of the set_unmapped() pages. V2: - Followed in Matt's v0 review fixups [2]: - keep unmapped flag in pages structures. - add pages_count to drm_gpusvm_range_set_unmapped() to set the pages unmapped flag, so the framework can check unmapped status in drm_gpusvm_get_pages(). - Add drm_gpusvm_init_pages to init the drm_device and sequence number. - Remove drm_device from drm_gpusvm_get_pages() parameters. - Reworked the DOC: overview and usage examples to describe the new model: struct drm_gpusvm_pages, the 1:1 / N:1 driver layouts, and examples that operate on a driver embedded pages object by the drm_gpusvm_pages helpers and etc. - remove WARN_ON_ONCE in __drm_gpusvm_unmap_pages. - Dropped RFC. Follow-up (not in this series): - modify drm_gpusvm_get_pages() to support one time hmm range fault and multi drm device dma mapping. - Add no dma device support for drm_gpusvm_get_pages(). tests: AMDGPU: based on amdgpu adaptation patch in [3], but still SVM:DRM = 1:1, 1:n is on going needs many modifications and testings. Tested on gfx943 (MI300X) and gfx906 (MI60) with XNACK on/off: - KFD test: 95%+ passed. - ROCR test: all passed. - HIP catch test: gfx943 (MI300X): 99% passed. gfx906 (MI60): 99% passed. INTEL XE: Waiting for the xe driver git lab CI result: [4] links: [1] https://lore.kernel.org/amd-gfx/acRgr7QwdULsn6G2@gsse-cloud1/#:~:text=I%20think%20roughly,drm_gpusvm_pages%0A%20%20helpers%20instead. [2] https://gitlab.freedesktop.org/mbrost/xe-kernel-driver-svn-perf-6-15-2025/-/commit/623f6a50c037d9e44f6c9fbe6859a0ba7ad50177 [3] https://lore.kernel.org/amd-gfx/20260603065030.2554403-1-honglei1.huang@amd.com/ [4] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/360 [5] https://lore.kernel.org/all/20260627033325.3795298-1-honglei1.huang@amd.com/ [6] https://lore.kernel.org/all/20260628061757.4093701-1-honglei1.huang@amd.com/ Honglei Huang (5): drm/gpusvm: split MM state flags out of drm_gpusvm_pages_flags drm/gpusvm: embed struct drm_device into drm_gpusvm_pages drm/xe: have xe_svm_range embed one drm_gpusvm_pages drm/gpusvm: move struct drm_gpusvm_pages out of struct drm_gpusvm_range drm/gpusvm: let the drm_gpusvm core context purely MM level drivers/gpu/drm/drm_gpusvm.c | 243 +++++++++++++++++++------------- drivers/gpu/drm/xe/xe_pt.c | 2 +- drivers/gpu/drm/xe/xe_svm.c | 49 +++++-- drivers/gpu/drm/xe/xe_svm.h | 8 +- drivers/gpu/drm/xe/xe_userptr.c | 5 +- include/drm/drm_gpusvm.h | 67 ++++++--- 6 files changed, 235 insertions(+), 139 deletions(-) -- 2.34.1