From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id E731DC43458 for ; Mon, 29 Jun 2026 02:29:44 +0000 (UTC) Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id 8AB1310E395; Mon, 29 Jun 2026 02:29:44 +0000 (UTC) Authentication-Results: gabe.freedesktop.org; dkim=pass (1024-bit key; unprotected) header.d=amd.com header.i=@amd.com header.b="az60XhYF"; dkim-atps=neutral Received: from CH4PR04CU002.outbound.protection.outlook.com (mail-northcentralusazon11013010.outbound.protection.outlook.com [40.107.201.10]) by gabe.freedesktop.org (Postfix) with ESMTPS id 3506D10E395 for ; Mon, 29 Jun 2026 02:29:43 +0000 (UTC) ARC-Seal: i=1; a=rsa-sha256; s=arcselector10001; d=microsoft.com; cv=none; b=q1lAjH8z6BtinSN4bVQ4Gng+HvZrC5LoJgpz4wDL5Gb34HCGySs6Dxzo7MNwIn0BAiXWwievLaABaAnx9ZyjznBjdk3Xhg2S6TrbsEwuLm+KD2USAfQCgmrcu0w9eXAg9dNDwJ1qt2nW+seU6bOy9TnFys3v6q2Zjmdv4c7ntzpTeM3BmUOrS8gxWhdCv74tHWZUvUNMPtd8AUOLgDfdH9c8IMVJuJu/3KheP+CifYo2AUTA69AaVNftYKDTAqbrwzZQ2koBlSfg+ADJDl3FRI+bNpaHmjCyR/cDR/QLoZFaFoWimPYbmPWgPaB4lwBJ7grAMJGPUTUGr2xkhYzawQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector10001; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=ud6FpybWQsBDHTXOQUTZjiPuFz5m79ik+8Es7eF6mEM=; b=fERdt9CeFCCuJuerdLbfpDNzKSCcsv8B/Cj8GZ6wCJf2x1+QuzgapzNeMu8Tbxf8BvH64OsWcnl2dEo4K+f3oILGXxAlxRFA8KskY6GZ9BxZxsdCj75YTvedAq78Pt27GbIdPJlG0Qgu+XE59Dg8t6a/YN66QZ7v0RWEtqTeBbBGDU3aJrd97bWKPQeZk/uQsAZnLKhcbS6dP+NY/8SkoN55gURBMu+38P4iQXc4z3nWr6hHOFzqS7gFZWYQ4HEgPcs/jp9PAoP78distTQ5P06DcKg1UN/mWn53HNGXVnnWgXdkBPReQyNg1Cs0BKlJn1eBA1KqA/piSCZ3vt3TWw== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass (sender ip is 165.204.84.17) smtp.rcpttodomain=lists.freedesktop.org smtp.mailfrom=amd.com; dmarc=pass (p=quarantine sp=quarantine pct=100) action=none header.from=amd.com; dkim=none (message not signed); arc=none (0) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amd.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=ud6FpybWQsBDHTXOQUTZjiPuFz5m79ik+8Es7eF6mEM=; b=az60XhYFObFi3WW7d3r6li2syXbtgH61/eV0oYqRXay7vfyckUjXD/X4Qjk9mngvHZUMda9/666KdoAuqacBQIM1Y3T9hvHblOwgkBefYliFlw2AeuUYLe2FuM5WrjB1Z0WlqkrtiBJAuX1k8388144hTdGFqLcmTO4OVJqtHR0= Received: from SA0PR12CA0005.namprd12.prod.outlook.com (2603:10b6:806:6f::10) by DM4PR12MB5962.namprd12.prod.outlook.com (2603:10b6:8:69::7) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.21.159.19; Mon, 29 Jun 2026 02:29:37 +0000 Received: from SA2PEPF00001508.namprd04.prod.outlook.com (2603:10b6:806:6f:cafe::97) by SA0PR12CA0005.outlook.office365.com (2603:10b6:806:6f::10) with Microsoft SMTP Server (version=TLS1_3, cipher=TLS_AES_256_GCM_SHA384) id 15.21.159.19 via Frontend Transport; Mon, 29 Jun 2026 02:29:37 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 165.204.84.17) smtp.mailfrom=amd.com; dkim=none (message not signed) header.d=none;dmarc=pass action=none header.from=amd.com; Received-SPF: Pass (protection.outlook.com: domain of amd.com designates 165.204.84.17 as permitted sender) receiver=protection.outlook.com; client-ip=165.204.84.17; helo=satlexmb07.amd.com; pr=C Received: from satlexmb07.amd.com (165.204.84.17) by SA2PEPF00001508.mail.protection.outlook.com (10.167.242.40) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.21.181.6 via Frontend Transport; Mon, 29 Jun 2026 02:29:37 +0000 Received: from honglei-remote.amd.com (10.180.168.240) by satlexmb07.amd.com (10.181.42.216) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.2562.41; Sun, 28 Jun 2026 21:29:35 -0500 From: Honglei Huang To: CC: Honglei Huang Subject: [PATCH v7 0/5] drm/gpusvm: split MM and device state across gpusvm/range/pages Date: Mon, 29 Jun 2026 10:29:16 +0800 Message-ID: <20260629022921.17533-1-honghuan@amd.com> X-Mailer: git-send-email 2.34.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Content-Type: text/plain X-Originating-IP: [10.180.168.240] X-ClientProxiedBy: satlexmb08.amd.com (10.181.42.217) To satlexmb07.amd.com (10.181.42.216) X-EOPAttributedMessage: 0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: SA2PEPF00001508:EE_|DM4PR12MB5962:EE_ X-MS-Office365-Filtering-Correlation-Id: e26ecef2-5a78-421d-a482-08ded58643b0 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; ARA:13230040|23010399003|36860700016|376014|82310400026|22122799003|1800799024|13003099007|3023799007|56012099006|11063799006|18002099003; X-Microsoft-Antispam-Message-Info: lhxJzJ+GOzj6NNgbVy/oKvCVc3/NR5B8HdR40sf8UGaZrdteTRp4icsFDd+rmCSsjJFcUDeY66y0mFGVYNWjdvNUHBUZuy7SmJOW2KykLhj3MTYCKWpnG6DskzqiPJsoVFEcvcgEiaRTFuq7vDRTztabaJX3EC0AQ1j/v0C2CmivGLNEaOvbyQFwMZhU5+CgMpRxbHNP8tK3cW+fjr5KCQNKQr7qWj+xE3A10+WegBNcqZJdZv5Dwjg7yYCtyGSEdVz10a3hlN0d7IEcG3RIIuPBO/SeHxDgvDnyjIJS5A47rKs4LwaJi6V3tLMVmA/wEWhjzfEEllivT9+wCJjveRBJIeOE/av9/z8DAP56h7VZpgl3L95CDAMs1K9DWPJfOyUGIkvXAdXxiHS1wYtD0cRVs+bDmtlD/juGhHLWOq+lbcr19jNCwa8vvwpr5cgKZf2uzE2WMDAP5Hzy32qNtMtE5Dv1wFbKqMVm+J8X7XQesZQu2nbAbx/C1gY9WrmsBH3VwptPCJdcueyz6kBs4IOT80FacvZ/axhJ4Vr4IrMH6aOBOmSD24DM7V7fJprVgbUfte5bZQt22BMhxG+Sj7qccMn004tjjYNGjumJSFwgM5BUJCFHkGG1BbnHJNQ6tG5C60e8t8X6biBtzT83RQ== X-Forefront-Antispam-Report: CIP:165.204.84.17; CTRY:US; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:satlexmb07.amd.com; PTR:InfoDomainNonexistent; CAT:NONE; SFS:(13230040)(23010399003)(36860700016)(376014)(82310400026)(22122799003)(1800799024)(13003099007)(3023799007)(56012099006)(11063799006)(18002099003); DIR:OUT; SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: 2m6EnSaegln2d6bhZmpL3Ms4NlT/YjMOu94zuB/Vg+P+nl1pGV9cF4r4E7fn+D8Z+NjAmBlpdHjmYdjHNYKyXhnLDrzxVZA4FUTsTsSwl68ICOMx56RTrP9qHxyf/+wVHq8seYSDnKwyh46rajB6fhfFjZE5VBJWJeaULetYlE/8E7BTLBcq7SasjN1xnaL5IH1oCmMGRZfZYMWqSNKKZZYqCRBNAwV7VM8ATfXZIsoeyiyiPQzqcG8/f7O0/GfWQs/AhO8mwnyTWGxAS+gJNkegI7Sqj9Kt6mD9fPNB7rsgkdlQOV2mUeERm/nsHlnbZPyG5o9BD5kP+gOG2IpQCSQw3JTaeUlGsHYAvNzc4hXCU9F7SxzkFvIQ1TsOq0KozkQ9i5UEYqWg1aZCU9IZfLMZL4iFj9441TUzYzZPkk1oX+Nt8K3v6oOHrm2VzvdS X-OriginatorOrg: amd.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 29 Jun 2026 02:29:37.0516 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: e26ecef2-5a78-421d-a482-08ded58643b0 X-MS-Exchange-CrossTenant-Id: 3dd8961f-e488-4e60-8e11-a82d994e183d X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=3dd8961f-e488-4e60-8e11-a82d994e183d; Ip=[165.204.84.17]; Helo=[satlexmb07.amd.com] X-MS-Exchange-CrossTenant-AuthSource: SA2PEPF00001508.namprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: DM4PR12MB5962 X-BeenThere: intel-xe@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Intel Xe graphics driver List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: intel-xe-bounces@lists.freedesktop.org Sender: "Intel-xe" The intent of this series is to make drm_gpusvm more flexible and give drivers more freedom over how they assemble the MM related and device side operations. It implements the direction Matt suggested in [1]: Mirror MR in gitlab: [4] - Move struct drm_gpusvm_pages out of struct drm_gpusvm_range. - Embed a struct drm_device in struct drm_gpusvm_pages and drive all DMA through it. - Drop struct drm_device from struct drm_gpusvm. - Have the driver's range structure embed one or more struct drm_gpusvm_pages in addition to struct drm_gpusvm_range. - Drop the range-based helpers (drm_gpusvm_range_pages_valid, drm_gpusvm_range_get_pages, drm_gpusvm_range_unmap_pages) and update drivers to use the drm_gpusvm_pages helpers instead. In essence the series does only two abstractions, plus the xe adaptation that follows from them: - range vs pages: split drm_gpusvm_range (MM / VA range state) from drm_gpusvm_pages (device physical related), so the two sides can have independent lifetimes and ownership. - drm_gpusvm vs drm_device: make drm_gpusvm pure MM level and push the device side down onto drm_gpusvm_pages, which is where DMA actually happens. - xe is updated to fit the modifications, no functional change intended. V7: - patch 1: split MM state flags: the AI review found a KCSAN / memory model cleanliness issue. Address it for consistency with drm_gpusvm_pages_flags, set the range flags with WRITE_ONCE() on __flags and read them with READ_ONCE(). V6: - The AI review flagged a potential DMA free issue: the DMA unmap step was moved into the range_free callback, but on the invalidate path a range can be removed from the MMU interval tree while its DMA mappings are still live, so a concurrent unmap event can miss it. - patch 3: have xe_svm_range embed one drm_gpusvm_pages: explicitly call drm_gpusvm_unmap_pages() before drm_gpusvm_range_remove() in the garbage collector, so a range is never off the tree while still DMA mapped, and document this caller contract in drm_gpusvm_range_remove() kernel-doc. - patch 4: move struct drm_gpusvm_pages out: document the unmap before remove contract in the garbage collector example and note that range_free()'s drm_gpusvm_free_pages() as a final fallback. - patch 1: split MM state flags: return -EACCES directly. - Fold in the pre existing IOVA/DMA unmap fixes the AI review found previously sent separately: the uninitialized dma_addr[0].dir on the get_pages() error path, the whole reservation IOVA free for mixed ranges, and the device mapping leak on the get_pages() error path. [6] V5: - add reviewed-by in patches 1, 2, 3, 5 for Matt's review. V4: - drm_gpusvm_init_pages(): memset() the pages to zero before recording the owning drm_device. - DOC: overview: recommend a zeroing allocator: kcalloc() for the N:1 pages array. - Rebased onto the latest drm-xe. - The AI review of this series flagged two preexisting issues in the IOVA unmap path that are not introduced by this series; they are fixed in a separate series [5]. V3: - Fix a kernel-doc/Sphinx warning from the kernel test robot: use ".. code-block:: c" for the drm_gpusvm_pages example in DOC: overview. - drm_gpusvm_range_set_unmapped(): use WRITE_ONCE() on the whole pages[i].flags.__flags word to pair with the lockless READ_ONCE() readers and avoid a data race. - xe_userptr_setup(): call drm_gpusvm_init_pages() before mmu_interval_notifier_insert() to avoid exposing uninitialized pages.drm to invalidation callbacks. - Fix per commit build of the set_unmapped() pages. V2: - Followed in Matt's v0 review fixups [2]: - keep unmapped flag in pages structures. - add pages_count to drm_gpusvm_range_set_unmapped() to set the pages unmapped flag, so the framework can check unmapped status in drm_gpusvm_get_pages(). - Add drm_gpusvm_init_pages to init the drm_device and sequence number. - Remove drm_device from drm_gpusvm_get_pages() parameters. - Reworked the DOC: overview and usage examples to describe the new model: struct drm_gpusvm_pages, the 1:1 / N:1 driver layouts, and examples that operate on a driver embedded pages object by the drm_gpusvm_pages helpers and etc. - remove WARN_ON_ONCE in __drm_gpusvm_unmap_pages. - Dropped RFC. Follow-up (not in this series): - modify drm_gpusvm_get_pages() to support one time hmm range fault and multi drm device dma mapping. - Add no dma device support for drm_gpusvm_get_pages(). tests: AMDGPU: based on amdgpu adaptation patch in [3], but still SVM:DRM = 1:1, 1:n is on going needs many modifications and testings. Tested on gfx943 (MI300X) and gfx906 (MI60) with XNACK on/off: - KFD test: 95%+ passed. - ROCR test: all passed. - HIP catch test: gfx943 (MI300X): 99% passed. gfx906 (MI60): 99% passed. INTEL XE: Waiting for the xe driver git lab CI result: [4] links: [1] https://lore.kernel.org/amd-gfx/acRgr7QwdULsn6G2@gsse-cloud1/#:~:text=I%20think%20roughly,drm_gpusvm_pages%0A%20%20helpers%20instead. [2] https://gitlab.freedesktop.org/mbrost/xe-kernel-driver-svn-perf-6-15-2025/-/commit/623f6a50c037d9e44f6c9fbe6859a0ba7ad50177 [3] https://lore.kernel.org/amd-gfx/20260603065030.2554403-1-honglei1.huang@amd.com/ [4] https://gitlab.freedesktop.org/drm/xe/kernel/-/merge_requests/360 [5] https://lore.kernel.org/all/20260627033325.3795298-1-honglei1.huang@amd.com/ [6] https://lore.kernel.org/all/20260628061757.4093701-1-honglei1.huang@amd.com/ Honglei Huang (5): drm/gpusvm: split MM state flags out of drm_gpusvm_pages_flags drm/gpusvm: embed struct drm_device into drm_gpusvm_pages drm/xe: have xe_svm_range embed one drm_gpusvm_pages drm/gpusvm: move struct drm_gpusvm_pages out of struct drm_gpusvm_range drm/gpusvm: let the drm_gpusvm core context purely MM level drivers/gpu/drm/drm_gpusvm.c | 243 +++++++++++++++++++------------- drivers/gpu/drm/xe/xe_pt.c | 2 +- drivers/gpu/drm/xe/xe_svm.c | 49 +++++-- drivers/gpu/drm/xe/xe_svm.h | 8 +- drivers/gpu/drm/xe/xe_userptr.c | 5 +- include/drm/drm_gpusvm.h | 67 ++++++--- 6 files changed, 235 insertions(+), 139 deletions(-) -- 2.34.1