From: Matthew Brost <matthew.brost@intel.com>
To: "Thomas Hellström" <thomas.hellstrom@linux.intel.com>
Cc: intel-xe@lists.freedesktop.org, dri-devel@lists.freedesktop.org,
	himal.prasad.ghimiray@intel.com, apopple@nvidia.com,
	airlied@gmail.com, "Simona Vetter" <simona.vetter@ffwll.ch>,
	felix.kuehling@amd.com,
	"Christian König" <christian.koenig@amd.com>,
	dakr@kernel.org, "Mrozek, Michal" <michal.mrozek@intel.com>,
	"Joonas Lahtinen" <joonas.lahtinen@linux.intel.com>
Subject: Re: [PATCH v4 21/22] drm/pagemap, drm/xe: Support destination migration over interconnect
Date: Wed, 17 Dec 2025 18:29:07 -0800
Message-ID: <aUNm879rnea1+01Y@lstrano-desk.jf.intel.com>
In-Reply-To: <aUNW2nLEpgAy6qAt@lstrano-desk.jf.intel.com>

On Wed, Dec 17, 2025 at 05:20:26PM -0800, Matthew Brost wrote:

One more idea.

> On Thu, Dec 11, 2025 at 05:59:08PM +0100, Thomas Hellström wrote:
> > Support destination migration over interconnect when migrating from
> > device-private pages with the same dev_pagemap owner.
> > 
> > Since we now also collect device-private pages to migrate,
> > also abort migration if the range to migrate is already
> > fully populated with pages from the desired pagemap.
> > 
> > Finally return -EBUSY from drm_pagemap_populate_mm()
> > if the migration can't be completed without first migrating all
> > pages in the range to system. It is expected that the caller
> > will perform that before retrying the call to
> > drm_pagemap_populate_mm().
> > 
> > Assume for now that the drm_pagemap implementation is *not*
> > capable of migrating data within the pagemap itself. This
> > restriction will be configurable in upcoming patches.
> > 
> > v3:
> > - Fix a bug where the p2p dma-address was never used.
> > - Postpone enabling destination interconnect migration,
> >   since xe devices require source interconnect migration to
> >   ensure the source L2 cache is flushed at migration time.
> > - Update the drm_pagemap_migrate_to_devmem() interface to
> >   pass migration details.
> > v4:
> > - Define XE_INTERCONNECT_P2P unconditionally (CI)
> > - Include a missing header (CI)
> > 
> > Signed-off-by: Thomas Hellström <thomas.hellstrom@linux.intel.com>
> > ---
> >  drivers/gpu/drm/drm_pagemap.c   | 188 +++++++++++++++++++++++---------
> >  drivers/gpu/drm/xe/xe_migrate.c |   4 +-
> >  drivers/gpu/drm/xe/xe_svm.c     |  26 +++--
> >  drivers/gpu/drm/xe/xe_svm.h     |   1 +
> >  include/drm/drm_pagemap.h       |  19 +++-
> >  5 files changed, 179 insertions(+), 59 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/drm_pagemap.c b/drivers/gpu/drm/drm_pagemap.c
> > index 77f8ea5ed802..56bedb622264 100644
> > --- a/drivers/gpu/drm/drm_pagemap.c
> > +++ b/drivers/gpu/drm/drm_pagemap.c
> > @@ -206,10 +206,12 @@ static void drm_pagemap_get_devmem_page(struct page *page,
> >  /**
> >   * drm_pagemap_migrate_map_pages() - Map migration pages for GPU SVM migration
> >   * @dev: The device for which the pages are being mapped
> > + * @local_dpagemap: The drm_pagemap pointer of the local drm_pagemap.
> >   * @pagemap_addr: Array to store DMA information corresponding to mapped pages
> >   * @migrate_pfn: Array of migrate page frame numbers to map
> >   * @npages: Number of pages to map
> >   * @dir: Direction of data transfer (e.g., DMA_BIDIRECTIONAL)
> > + * @mdetails: Details governing the migration behaviour.
> >   *
> >   * This function maps pages of memory for migration usage in GPU SVM. It
> >   * iterates over each page frame number provided in @migrate_pfn, maps the
> > @@ -219,12 +221,15 @@ static void drm_pagemap_get_devmem_page(struct page *page,
> >   * Returns: 0 on success, -EFAULT if an error occurs during mapping.
> >   */
> >  static int drm_pagemap_migrate_map_pages(struct device *dev,
> > +					 struct drm_pagemap *local_dpagemap,
> >  					 struct drm_pagemap_addr *pagemap_addr,
> >  					 unsigned long *migrate_pfn,
> >  					 unsigned long npages,
> > -					 enum dma_data_direction dir)
> > +					 enum dma_data_direction dir,
> > +					 const struct drm_pagemap_migrate_details *mdetails)
> >  {
> >  	unsigned long i;
> > +	unsigned long num_peer_pages = 0;
> >  
> >  	for (i = 0; i < npages;) {
> >  		struct page *page = migrate_pfn_to_page(migrate_pfn[i]);
> > @@ -235,31 +240,50 @@ static int drm_pagemap_migrate_map_pages(struct device *dev,
> >  		if (!page)
> >  			goto next;
> >  
> > -		if (WARN_ON_ONCE(is_zone_device_page(page)))
> > -			return -EFAULT;
> > -
> >  		folio = page_folio(page);
> >  		order = folio_order(folio);
> >  
> > -		dma_addr = dma_map_page(dev, page, 0, page_size(page), dir);
> > -		if (dma_mapping_error(dev, dma_addr))
> > -			return -EFAULT;
> > +		if (is_device_private_page(page)) {
> > +			struct drm_pagemap_zdd *zdd = page->zone_device_data;
> > +			struct drm_pagemap *dpagemap = zdd->dpagemap;
> > +			struct drm_pagemap_addr addr;
> > +
> > +			if (dpagemap == local_dpagemap && !mdetails->can_migrate_same_pagemap)
> > +				goto next;
> >  
> > -		pagemap_addr[i] =
> > -			drm_pagemap_addr_encode(dma_addr,
> > -						DRM_INTERCONNECT_SYSTEM,
> > -						order, dir);
> > +			num_peer_pages += NR_PAGES(order);
> > +			addr = dpagemap->ops->device_map(dpagemap, dev, page, order, dir);
> > +			if (dma_mapping_error(dev, addr.addr))
> > +				return -EFAULT;
> > +
> > +			pagemap_addr[i] = addr;
> > +		} else {
> > +			dma_addr = dma_map_page(dev, page, 0, page_size(page), dir);
> > +			if (dma_mapping_error(dev, dma_addr))
> > +				return -EFAULT;
> > +
> > +			pagemap_addr[i] =
> > +				drm_pagemap_addr_encode(dma_addr,
> > +							DRM_INTERCONNECT_SYSTEM,
> > +							order, dir);
> > +		}
> >  
> >  next:
> >  		i += NR_PAGES(order);
> >  	}
> >  
> > +	if (num_peer_pages)
> > +		drm_dbg(local_dpagemap->drm, "Migrating %lu peer pages over interconnect.\n",
> > +			num_peer_pages);
> > +
> >  	return 0;
> >  }
> >  
> >  /**
> >   * drm_pagemap_migrate_unmap_pages() - Unmap pages previously mapped for GPU SVM migration
> >   * @dev: The device for which the pages were mapped
> > + * @migrate_pfn: Array of migrate pfns set up for the mapped pages. Used to
> > + * determine the drm_pagemap of a peer device private page.
> >   * @pagemap_addr: Array of DMA information corresponding to mapped pages
> >   * @npages: Number of pages to unmap
> >   * @dir: Direction of data transfer (e.g., DMA_BIDIRECTIONAL)
> > @@ -270,16 +294,27 @@ static int drm_pagemap_migrate_map_pages(struct device *dev,
> >   */
> >  static void drm_pagemap_migrate_unmap_pages(struct device *dev,
> >  					    struct drm_pagemap_addr *pagemap_addr,
> > +					    unsigned long *migrate_pfn,
> >  					    unsigned long npages,
> >  					    enum dma_data_direction dir)
> >  {
> >  	unsigned long i;
> >  
> >  	for (i = 0; i < npages;) {
> > -		if (!pagemap_addr[i].addr || dma_mapping_error(dev, pagemap_addr[i].addr))
> > +		struct page *page = migrate_pfn_to_page(migrate_pfn[i]);
> > +
> > +		if (!page || !pagemap_addr[i].addr || dma_mapping_error(dev, pagemap_addr[i].addr))
> >  			goto next;
> >  
> > -		dma_unmap_page(dev, pagemap_addr[i].addr, PAGE_SIZE << pagemap_addr[i].order, dir);
> > +		if (is_zone_device_page(page)) {
> > +			struct drm_pagemap_zdd *zdd = page->zone_device_data;
> > +			struct drm_pagemap *dpagemap = zdd->dpagemap;
> > +
> > +			dpagemap->ops->device_unmap(dpagemap, dev, pagemap_addr[i]);
> > +		} else {
> > +			dma_unmap_page(dev, pagemap_addr[i].addr,
> > +				       PAGE_SIZE << pagemap_addr[i].order, dir);
> > +		}
> >  
> >  next:
> >  		i += NR_PAGES(pagemap_addr[i].order);
> > @@ -301,8 +336,7 @@ npages_in_range(unsigned long start, unsigned long end)
> >   * @mm: Pointer to the struct mm_struct.
> >   * @start: Start of the virtual address range to migrate.
> >   * @end: End of the virtual address range to migrate.
> > - * @timeslice_ms: The time requested for the migrated pagemap pages to
> > - * be present in @mm before being allowed to be migrated back.
> > + * @mdetails: Details to govern the migration.
> >   *
> >   * This function migrates the specified virtual address range to device memory.
> >   * It performs the necessary setup and invokes the driver-specific operations for
> > @@ -320,7 +354,7 @@ npages_in_range(unsigned long start, unsigned long end)
> 
> Update kernel doc to indicate devmem_allocation is consumed on failure?
> 
> >  int drm_pagemap_migrate_to_devmem(struct drm_pagemap_devmem *devmem_allocation,
> >  				  struct mm_struct *mm,
> >  				  unsigned long start, unsigned long end,
> > -				  unsigned long timeslice_ms)
> > +				  const struct drm_pagemap_migrate_details *mdetails)
> >  {
> >  	const struct drm_pagemap_devmem_ops *ops = devmem_allocation->ops;
> >  	struct drm_pagemap *dpagemap = devmem_allocation->dpagemap;
> > @@ -329,9 +363,11 @@ int drm_pagemap_migrate_to_devmem(struct drm_pagemap_devmem *devmem_allocation,
> >  		.start		= start,
> >  		.end		= end,
> >  		.pgmap_owner	= pagemap->owner,
> > -		.flags		= MIGRATE_VMA_SELECT_SYSTEM,
> > +		.flags		= MIGRATE_VMA_SELECT_SYSTEM | MIGRATE_VMA_SELECT_DEVICE_COHERENT |
> > +		(mdetails->source_peer_migrates ? 0 : MIGRATE_VMA_SELECT_DEVICE_PRIVATE),
> >  	};
> >  	unsigned long i, npages = npages_in_range(start, end);
> > +	unsigned long own_pages = 0, migrated_pages = 0;
> >  	struct vm_area_struct *vas;
> >  	struct drm_pagemap_zdd *zdd = NULL;
> >  	struct page **pages;
> > @@ -373,8 +409,10 @@ int drm_pagemap_migrate_to_devmem(struct drm_pagemap_devmem *devmem_allocation,
> >  	zdd = drm_pagemap_zdd_alloc(dpagemap);
> >  	if (!zdd) {
> >  		err = -ENOMEM;
> > -		goto err_free;
> > +		kvfree(buf);
> > +		goto err_out;
> >  	}
> > +	zdd->devmem_allocation = devmem_allocation;	/* Owns ref */
> >  
> >  	migrate.vma = vas;
> >  	migrate.src = buf;
> > @@ -385,55 +423,111 @@ int drm_pagemap_migrate_to_devmem(struct drm_pagemap_devmem *devmem_allocation,
> >  		goto err_free;
> >  
> >  	if (!migrate.cpages) {
> > -		err = -EFAULT;
> > +		/* No pages to migrate. Raced or unknown device pages. */
> > +		err = -EBUSY;
> >  		goto err_free;
> >  	}
> >  
> >  	if (migrate.cpages != npages) {
> > +		/*
> > +		 * Some pages to migrate. But we want to migrate all or
> > +		 * nothing. Raced or unknown device pages.
> > +		 */
> 
> I honestly think this is going to be an issue. Let's say two devices
> fault at the same time and both try to migrate simultaneously—neither
> side is likely to make forward progress, resulting in the migration
> failing even with a retry loop at the caller.
> 
> How about a Xe module-wide migration rwsem? The first call to
> drm_pagemap_populate_mm would take it in read mode, and subsequent
> attempts would take it in write mode. We can't control CPU-side races
> here, but we do have some level of GPU-side control via a lock like the
> one I suggested.
> 
> The other alternative is to restructure our GPU SVM range tree into a
> process-wide structure (rather than per-device VM), which locks the
> range when servicing a fault and supports multiple sets of pages
> attached to the range. This is pretty large work though, so I'd lean
> towards some Xe driver side locking first.
> 

Another possible option is to pass a flag from drm_pagemap_populate_mm()
that makes it to xe_drm_pagemap_populate_mm(), where we take the
validation guard in exclusive mode. This won't help, though, if, say, two
devices both take atomic faults and are trying to migrate to different
pagemaps.

Matt 

> >  		err = -EBUSY;
> > -		goto err_finalize;
> > +		goto err_aborted_migration;
> > +	}
> > +
> > +	/* Count device-private pages to migrate */
> > +	for (i = 0; i < npages; ++i) {
> > +		struct page *src_page = migrate_pfn_to_page(migrate.src[i]);
> > +
> > +		if (src_page && is_zone_device_page(src_page)) {
> > +			if (page_pgmap(src_page) == pagemap)
> > +				own_pages++;
> > +		}
> 
> In an effort to make the 2M transition easier, can this loop increment i
> and own_pages based on the folio order?
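
To make the folio-order request above concrete, here is a minimal
userspace sketch of the stepping, not kernel code. struct src_slot and
count_own_pages() are hypothetical stand-ins: present models a non-NULL
source page, own_pagemap models page_pgmap(src_page) == pagemap, and
order models folio_order(). The sketch assumes each folio is described by
the slot at its head index.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for one migrate.src[] slot; not a kernel type. */
struct src_slot {
	int present;		/* models src_page != NULL */
	int own_pagemap;	/* models page_pgmap(src_page) == pagemap */
	unsigned int order;	/* models folio_order(page_folio(src_page)) */
};

/* Count pages already in the target pagemap, one folio per iteration. */
static unsigned long count_own_pages(const struct src_slot *src,
				     unsigned long npages)
{
	unsigned long i, own_pages = 0;

	assert(src || !npages);

	for (i = 0; i < npages;) {
		unsigned long nr = 1;	/* holes advance by a single page */

		if (src[i].present) {
			nr = 1UL << src[i].order;
			/* Clamp so a large folio cannot overrun the range. */
			if (nr > npages - i)
				nr = npages - i;
			if (src[i].own_pagemap)
				own_pages += nr;
		}
		i += nr;
	}

	return own_pages;
}
```

The same stepping would apply to the other per-page loops flagged in this
review: advance the index by the number of pages in the folio and add that
same amount to the counter.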
> 
> > +	}
> > +
> > +	drm_dbg(dpagemap->drm, "Total pages %lu; Own pages: %lu.\n",
> > +		npages, own_pages);
> > +	if (own_pages == npages) {
> > +		err = 0;
> > +		drm_dbg(dpagemap->drm, "Migration wasn't necessary.\n");
> > +		goto err_aborted_migration;
> > +	} else if (own_pages && mdetails->can_migrate_same_pagemap) {
> > +		err = -EBUSY;
> > +		drm_dbg(dpagemap->drm, "Migration aborted due to fragmentation.\n");
> > +		goto err_aborted_migration;
> >  	}
> >  
> >  	err = ops->populate_devmem_pfn(devmem_allocation, npages, migrate.dst);
> >  	if (err)
> >  		goto err_finalize;
> >  
> > -	err = drm_pagemap_migrate_map_pages(devmem_allocation->dev, pagemap_addr,
> > -					    migrate.src, npages, DMA_TO_DEVICE);
> > +	err = drm_pagemap_migrate_map_pages(devmem_allocation->dev,
> > +					    devmem_allocation->dpagemap, pagemap_addr,
> > +					    migrate.src, npages, DMA_TO_DEVICE,
> > +					    mdetails);
> > +
> > +	if (err) {
> > +		drm_pagemap_migrate_unmap_pages(devmem_allocation->dev, pagemap_addr,
> > +						migrate.src, npages, DMA_TO_DEVICE);
> >  
> > -	if (err)
> >  		goto err_finalize;
> > +	}
> >  
> > +	own_pages = 0;
> >  	for (i = 0; i < npages; ++i) {
> >  		struct page *page = pfn_to_page(migrate.dst[i]);
> > -
> > +		struct page *src_page = migrate_pfn_to_page(migrate.src[i]);
> > +
> > +		if (unlikely(src_page && is_zone_device_page(src_page) &&
> > +			     page_pgmap(src_page) == pagemap &&
> > +			     !mdetails->can_migrate_same_pagemap)) {
> > +			migrate.dst[i] = 0;
> > +			pages[i] = NULL;
> > +			own_pages++;
> > +			continue;
> > +		}
> 
> Same as above, I think logic should be based on folio order?
> 
> >  		pages[i] = page;
> >  		migrate.dst[i] = migrate_pfn(migrate.dst[i]);
> >  		drm_pagemap_get_devmem_page(page, zdd);
> >  	}
> > +	drm_WARN_ON(dpagemap->drm, !!own_pages);
> >  
> >  	err = ops->copy_to_devmem(pages, pagemap_addr, npages,
> >  				  devmem_allocation->pre_migrate_fence);
> > +	drm_pagemap_migrate_unmap_pages(devmem_allocation->dev, pagemap_addr,
> > +					migrate.src, npages, DMA_TO_DEVICE);
> >  	if (err)
> >  		goto err_finalize;
> >  
> >  	/* Upon success bind devmem allocation to range and zdd */
> >  	devmem_allocation->timeslice_expiration = get_jiffies_64() +
> > -		msecs_to_jiffies(timeslice_ms);
> > -	zdd->devmem_allocation = devmem_allocation;	/* Owns ref */
> > +		msecs_to_jiffies(mdetails->timeslice_ms);
> >  
> >  err_finalize:
> >  	if (err)
> >  		drm_pagemap_migration_unlock_put_pages(npages, migrate.dst);
> > +err_aborted_migration:
> >  	migrate_vma_pages(&migrate);
> > +
> > +	for (i = 0; i < npages; ++i)
> > +		if (migrate.src[i] & MIGRATE_PFN_MIGRATE)
> > +			migrated_pages++;
> 
> Again based on folio order?
> 
> > +
> > +	if (!err && migrated_pages < npages - own_pages) {
> > +		drm_dbg(dpagemap->drm, "Raced while finalizing migration.\n");
> > +		err = -EBUSY;
> > +	}
> > +
> >  	migrate_vma_finalize(&migrate);
> > -	drm_pagemap_migrate_unmap_pages(devmem_allocation->dev, pagemap_addr, npages,
> > -					DMA_TO_DEVICE);
> >  err_free:
> > -	if (zdd)
> > -		drm_pagemap_zdd_put(zdd);
> > +	drm_pagemap_zdd_put(zdd);
> >  	kvfree(buf);
> > +	return err;
> > +
> >  err_out:
> > +	devmem_allocation->ops->devmem_release(devmem_allocation);
> >  	return err;
> >  }
> >  EXPORT_SYMBOL_GPL(drm_pagemap_migrate_to_devmem);
> > @@ -706,6 +800,7 @@ EXPORT_SYMBOL(drm_pagemap_put);
> >  int drm_pagemap_evict_to_ram(struct drm_pagemap_devmem *devmem_allocation)
> >  {
> >  	const struct drm_pagemap_devmem_ops *ops = devmem_allocation->ops;
> > +	struct drm_pagemap_migrate_details mdetails = {};
> >  	unsigned long npages, mpages = 0;
> >  	struct page **pages;
> >  	unsigned long *src, *dst;
> > @@ -744,8 +839,10 @@ int drm_pagemap_evict_to_ram(struct drm_pagemap_devmem *devmem_allocation)
> >  	if (err || !mpages)
> >  		goto err_finalize;
> >  
> > -	err = drm_pagemap_migrate_map_pages(devmem_allocation->dev, pagemap_addr,
> > -					    dst, npages, DMA_FROM_DEVICE);
> > +	err = drm_pagemap_migrate_map_pages(devmem_allocation->dev,
> > +					    devmem_allocation->dpagemap, pagemap_addr,
> > +					    dst, npages, DMA_FROM_DEVICE,
> > +					    &mdetails);
> >  	if (err)
> >  		goto err_finalize;
> >  
> > @@ -761,8 +858,9 @@ int drm_pagemap_evict_to_ram(struct drm_pagemap_devmem *devmem_allocation)
> >  		drm_pagemap_migration_unlock_put_pages(npages, dst);
> >  	migrate_device_pages(src, dst, npages);
> >  	migrate_device_finalize(src, dst, npages);
> > -	drm_pagemap_migrate_unmap_pages(devmem_allocation->dev, pagemap_addr, npages,
> > +	drm_pagemap_migrate_unmap_pages(devmem_allocation->dev, pagemap_addr, dst, npages,
> >  					DMA_FROM_DEVICE);
> > +
> >  err_free:
> >  	kvfree(buf);
> >  err_out:
> > @@ -805,6 +903,7 @@ static int __drm_pagemap_migrate_to_ram(struct vm_area_struct *vas,
> >  		MIGRATE_VMA_SELECT_DEVICE_COHERENT,
> >  		.fault_page	= page,
> >  	};
> > +	struct drm_pagemap_migrate_details mdetails = {};
> >  	struct drm_pagemap_zdd *zdd;
> >  	const struct drm_pagemap_devmem_ops *ops;
> >  	struct device *dev = NULL;
> > @@ -853,19 +952,6 @@ static int __drm_pagemap_migrate_to_ram(struct vm_area_struct *vas,
> >  	if (!migrate.cpages)
> >  		goto err_free;
> >  
> > -	if (!page) {
> > -		for (i = 0; i < npages; ++i) {
> > -			if (!(migrate.src[i] & MIGRATE_PFN_MIGRATE))
> > -				continue;
> > -
> > -			page = migrate_pfn_to_page(migrate.src[i]);
> > -			break;
> > -		}
> > -
> > -		if (!page)
> > -			goto err_finalize;
> > -	}
> > -	zdd = page->zone_device_data;
> 
> This isn't actually related to this patch, but I agree this is some
> leftover dead code. Could you break this out into its own patch?
> 
> >  	ops = zdd->devmem_allocation->ops;
> >  	dev = zdd->devmem_allocation->dev;
> >  
> > @@ -875,8 +961,8 @@ static int __drm_pagemap_migrate_to_ram(struct vm_area_struct *vas,
> >  	if (err)
> >  		goto err_finalize;
> >  
> > -	err = drm_pagemap_migrate_map_pages(dev, pagemap_addr, migrate.dst, npages,
> > -					    DMA_FROM_DEVICE);
> > +	err = drm_pagemap_migrate_map_pages(dev, zdd->dpagemap, pagemap_addr, migrate.dst, npages,
> > +					    DMA_FROM_DEVICE, &mdetails);
> >  	if (err)
> >  		goto err_finalize;
> >  
> > @@ -893,8 +979,8 @@ static int __drm_pagemap_migrate_to_ram(struct vm_area_struct *vas,
> >  	migrate_vma_pages(&migrate);
> >  	migrate_vma_finalize(&migrate);
> >  	if (dev)
> > -		drm_pagemap_migrate_unmap_pages(dev, pagemap_addr, npages,
> > -						DMA_FROM_DEVICE);
> > +		drm_pagemap_migrate_unmap_pages(dev, pagemap_addr, migrate.dst,
> > +						npages, DMA_FROM_DEVICE);
> >  err_free:
> >  	kvfree(buf);
> >  err_out:
> > @@ -930,9 +1016,11 @@ static vm_fault_t drm_pagemap_migrate_to_ram(struct vm_fault *vmf)
> >  	struct drm_pagemap_zdd *zdd = vmf->page->zone_device_data;
> >  	int err;
> >  
> > +	drm_pagemap_zdd_get(zdd);
> 
> Can you explain the extra ref here? The page itself should have a ref
> owned by the drm_pagemap_migrate_to_ram caller, right?
> 
> >  	err = __drm_pagemap_migrate_to_ram(vmf->vma,
> >  					   vmf->page, vmf->address,
> >  					   zdd->devmem_allocation->size);
> > +	drm_pagemap_zdd_put(zdd);
> >  
> >  	return err ? VM_FAULT_SIGBUS : 0;
> >  }
> > diff --git a/drivers/gpu/drm/xe/xe_migrate.c b/drivers/gpu/drm/xe/xe_migrate.c
> > index f3b66b55acfb..4edb41548000 100644
> > --- a/drivers/gpu/drm/xe/xe_migrate.c
> > +++ b/drivers/gpu/drm/xe/xe_migrate.c
> > @@ -35,6 +35,7 @@
> >  #include "xe_sa.h"
> >  #include "xe_sched_job.h"
> >  #include "xe_sriov_vf_ccs.h"
> > +#include "xe_svm.h"
> >  #include "xe_sync.h"
> >  #include "xe_trace_bo.h"
> >  #include "xe_validation.h"
> > @@ -2048,7 +2049,8 @@ static void build_pt_update_batch_sram(struct xe_migrate *m,
> >  			u64 pte;
> >  
> >  			xe_tile_assert(m->tile, sram_addr[i].proto ==
> > -				       DRM_INTERCONNECT_SYSTEM);
> > +				       DRM_INTERCONNECT_SYSTEM ||
> > +				       sram_addr[i].proto == XE_INTERCONNECT_P2P);
> >  			xe_tile_assert(m->tile, addr);
> >  			xe_tile_assert(m->tile, PAGE_ALIGNED(addr));
> >  
> > diff --git a/drivers/gpu/drm/xe/xe_svm.c b/drivers/gpu/drm/xe/xe_svm.c
> > index 22281d69e26a..03cc4a24ce27 100644
> > --- a/drivers/gpu/drm/xe/xe_svm.c
> > +++ b/drivers/gpu/drm/xe/xe_svm.c
> > @@ -1058,6 +1058,10 @@ static int xe_drm_pagemap_populate_mm(struct drm_pagemap *dpagemap,
> >  				      unsigned long timeslice_ms)
> >  {
> >  	struct xe_pagemap *xpagemap = container_of(dpagemap, typeof(*xpagemap), dpagemap);
> > +	struct drm_pagemap_migrate_details mdetails = {
> > +		.timeslice_ms = timeslice_ms,
> > +		.source_peer_migrates = 1,
> > +	};
> >  	struct xe_vram_region *vr = xe_pagemap_to_vr(xpagemap);
> >  	struct dma_fence *pre_migrate_fence = NULL;
> >  	struct xe_device *xe = vr->xe;
> > @@ -1109,10 +1113,9 @@ static int xe_drm_pagemap_populate_mm(struct drm_pagemap *dpagemap,
> >  
> >  		/* Ensure the device has a pm ref while there are device pages active. */
> >  		xe_pm_runtime_get_noresume(xe);
> > +		/* Consumes the devmem allocation ref. */
> >  		err = drm_pagemap_migrate_to_devmem(&bo->devmem_allocation, mm,
> > -						    start, end, timeslice_ms);
> > -		if (err)
> > -			xe_svm_devmem_release(&bo->devmem_allocation);
> > +						    start, end, &mdetails);
> >  		xe_bo_unlock(bo);
> >  		xe_bo_put(bo);
> >  	}
> > @@ -1628,6 +1631,7 @@ int xe_svm_alloc_vram(struct xe_svm_range *range, const struct drm_gpusvm_ctx *c
> >  	struct xe_vm *vm = range_to_vm(&range->base);
> >  	enum drm_gpusvm_scan_result migration_state;
> >  	struct xe_device *xe = vm->xe;
> > +	int err, retries = 1;
> >  
> >  	xe_assert(range_to_vm(&range->base)->xe, range->base.pages.flags.migrate_devmem);
> >  	range_debug(range, "ALLOCATE VRAM");
> > @@ -1646,10 +1650,18 @@ int xe_svm_alloc_vram(struct xe_svm_range *range, const struct drm_gpusvm_ctx *c
> >  		drm_dbg(&xe->drm, "Request migration to device memory on \"%s\".\n",
> >  			dpagemap->drm->unique);
> >  
> > -	return drm_pagemap_populate_mm(dpagemap, xe_svm_range_start(range),
> > -				       xe_svm_range_end(range),
> > -				       range->base.gpusvm->mm,
> > -				       ctx->timeslice_ms);
> > +	do {
> > +		err = drm_pagemap_populate_mm(dpagemap, xe_svm_range_start(range),
> > +					      xe_svm_range_end(range),
> > +					      range->base.gpusvm->mm,
> > +					      ctx->timeslice_ms);
> > +
> > +		if (err == -EBUSY && retries)
> > +			drm_gpusvm_range_evict(range->base.gpusvm, &range->base);
> 
> With the above comment, here is where I think we need a module migration
> rwsem.
> 
> > +
> > +	} while (err == -EBUSY && retries--);
> > +
> > +	return err;
> >  }
> >  
> >  static struct drm_pagemap_addr
> > diff --git a/drivers/gpu/drm/xe/xe_svm.h b/drivers/gpu/drm/xe/xe_svm.h
> > index 50e80bc892b6..b7b8eeacf196 100644
> > --- a/drivers/gpu/drm/xe/xe_svm.h
> > +++ b/drivers/gpu/drm/xe/xe_svm.h
> > @@ -205,6 +205,7 @@ struct xe_tile;
> >  struct xe_vram_region;
> >  
> >  #define XE_INTERCONNECT_VRAM 1
> > +#define XE_INTERCONNECT_P2P (XE_INTERCONNECT_VRAM + 1)
> >  
> >  struct xe_svm_range {
> >  	struct {
> > diff --git a/include/drm/drm_pagemap.h b/include/drm/drm_pagemap.h
> > index f73afece42ba..46e9c58f09e0 100644
> > --- a/include/drm/drm_pagemap.h
> > +++ b/include/drm/drm_pagemap.h
> > @@ -317,10 +317,27 @@ struct drm_pagemap_devmem {
> >  	struct dma_fence *pre_migrate_fence;
> >  };
> >  
> > +/**
> > + * struct drm_pagemap_migrate_details - Details to govern migration.
> > + * @timeslice_ms: The time requested for the migrated pagemap pages to
> > + * be present in @mm before being allowed to be migrated back.
> > + * @can_migrate_same_pagemap: Whether the copy function as indicated by
> > + * the @source_peer_migrates flag, can migrate device pages within a
> > + * single drm_pagemap.
> 
> Is this essentially saying 'my copy function is smart enough to skip
> pages that are already in the correct placement', or is it saying 'my
> copy function can copy pages from one position on my device to another'?
> 
> I want to make sure I'm getting this right.
> 
> Matt
> 
> > + * @source_peer_migrates: Whether on p2p migration, The source drm_pagemap
> > + * should use the copy_to_ram() callback rather than the destination
> > + * drm_pagemap should use the copy_to_devmem() callback.
> > + */
> > +struct drm_pagemap_migrate_details {
> > +	unsigned long timeslice_ms;
> > +	u32 can_migrate_same_pagemap : 1;
> > +	u32 source_peer_migrates : 1;
> > +};
> > +
> >  int drm_pagemap_migrate_to_devmem(struct drm_pagemap_devmem *devmem_allocation,
> >  				  struct mm_struct *mm,
> >  				  unsigned long start, unsigned long end,
> > -				  unsigned long timeslice_ms);
> > +				  const struct drm_pagemap_migrate_details *mdetails);
> >  
> >  int drm_pagemap_evict_to_ram(struct drm_pagemap_devmem *devmem_allocation);
> >  
> > -- 
> > 2.51.1
> > 
