Re: [PATCH v5 0/6] Migrate on fault for device pages

All of lore.kernel.org
 help / color / mirror / Atom feed

From: Alistair Popple <apopple@nvidia.com>
To: "David Hildenbrand (Arm)" <david@kernel.org>
Cc: mpenttil@redhat.com, linux-mm@kvack.org,
	linux-kernel@vger.kernel.org,  Jason Gunthorpe <jgg@nvidia.com>,
	Leon Romanovsky <leonro@nvidia.com>,
	 Balbir Singh <balbirs@nvidia.com>, Zi Yan <ziy@nvidia.com>,
	Matthew Brost <matthew.brost@intel.com>
Subject: Re: [PATCH v5 0/6] Migrate on fault for device pages
Date: Fri, 13 Mar 2026 11:27:45 +1100	[thread overview]
Message-ID: <abNZdtC74SHAy9TQ@nvdebian.thelocal> (raw)
In-Reply-To: <76c5576a-65d2-4dde-a969-ef6577855864@kernel.org>

On 2026-03-13 at 06:58 +1100, "David Hildenbrand (Arm)" <david@kernel.org> wrote...
> On 2/11/26 09:12, mpenttil@redhat.com wrote:
> > From: Mika Penttilä <mpenttil@redhat.com>
> > 
> > Currently, the way device page faulting and migration works
> > is not optimal, if you want to do both fault handling and
> > migration at once.
> > 
> > Being able to migrate not present pages (or pages mapped with incorrect
> > permissions, eg. COW) to the GPU requires doing either of the
> > following sequences:
> > 
> > 1. hmm_range_fault() - fault in non-present pages with correct permissions, etc.
> > 2. migrate_vma_*() - migrate the pages
> > 
> > Or:
> > 
> > 1. migrate_vma_*() - migrate present pages
> > 2. If non-present pages detected by migrate_vma_*():
> >    a) call hmm_range_fault() to fault pages in
> >    b) call migrate_vma_*() again to migrate now present pages
> > 
> > The problem with the first sequence is that you always have to do two
> > page walks even when most of the time the pages are present or zero page
> > mappings so the common case takes a performance hit.
> > 
> > The second sequence is better for the common case, but far worse if
> > pages aren't present because now you have to walk the page tables three
> > times (once to find the page is not present, once so hmm_range_fault()
> > can find a non-present page to fault in and once again to setup the
> > migration). It is also tricky to code correctly. One page table walk
> > could costs over 1000 cpu cycles on X86-64, which is a significant hit.
> > 
> > We should be able to walk the page table once, faulting
> > pages in as required and replacing them with migration entries if
> > requested.
> > 
> > Add a new flag to HMM APIs, HMM_PFN_REQ_MIGRATE,
> > which tells to prepare for migration also during fault handling.
> > Also, for the migrate_vma_setup() call paths, a flag, MIGRATE_VMA_FAULT,
> > is added to tell to add fault handling to migrate.
> > 
> > One extra benefit of migrating with hmm_range_fault() path
> > is the migrate_vma.vma gets populated, so no need to
> > retrieve that separataly.
> > 
> > Tested in X86-64 VM with HMM test device, passing the selftests.
> > For performance, the migrate throughput tests from the selftests
> > show similar numbers (within error margin) as unmodified kernel.
> > Tested also rebased on the
> > "Remove device private pages from physical address space" series:
> > https://lore.kernel.org/linux-mm/20260130111050.53670-1-jniethe@nvidia.com/
> > plus a small patch to adjust with no problems.
> 
> Stumbling over this in my backlog, so far we have to RB tags.
> 
> I'm going to take a look at the core MM bits (soon I hope), but it would
> be great if other people could review the HMM bits and provide proper tags.

Yes, sorry I've been meaning to review this for a while but have been backlogged
for a while. I should be able to get to it early next week though.

> -- 
> Cheers,
> 
> David

next prev parent reply	other threads:[~2026-03-13  0:28 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-02-11  8:12 [PATCH v5 0/6] Migrate on fault for device pages mpenttil
2026-02-11  8:12 ` [PATCH v5 1/6] mm:/Kconfig changes for migrate " mpenttil
2026-03-17  8:31   ` David Hildenbrand (Arm)
2026-02-11  8:12 ` [PATCH v5 2/6] mm: Add helper to convert HMM pfn to migrate pfn mpenttil
2026-03-17  9:05   ` David Hildenbrand (Arm)
2026-03-17 13:01     ` Mika Penttilä
2026-03-17 13:45       ` Leon Romanovsky
2026-03-23 17:58         ` Jason Gunthorpe
2026-02-11  8:12 ` [PATCH v5 3/6] mm/hmm: do the plumbing for HMM to participate in migration mpenttil
2026-02-11  8:12 ` [PATCH 4/6] mm: setup device page migration in HMM pagewalk mpenttil
2026-02-11  8:13 ` [PATCH v5 5/6] mm: add new testcase for the migrate on fault case mpenttil
2026-02-11  8:13 ` [PATCH v5 6/6] mm:/migrate_device.c: remove migrate_vma_collect_*() mpenttil
2026-03-12 19:58 ` [PATCH v5 0/6] Migrate on fault for device pages David Hildenbrand (Arm)
2026-03-13  0:27   ` Alistair Popple [this message]
2026-03-17 10:06 ` Balbir Singh
2026-03-17 10:28   ` Mika Penttilä

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=abNZdtC74SHAy9TQ@nvdebian.thelocal \
    --to=apopple@nvidia.com \
    --cc=balbirs@nvidia.com \
    --cc=david@kernel.org \
    --cc=jgg@nvidia.com \
    --cc=leonro@nvidia.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=matthew.brost@intel.com \
    --cc=mpenttil@redhat.com \
    --cc=ziy@nvidia.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.