From: Alistair Popple <apopple@nvidia.com>
To: "David Hildenbrand (Arm)" <david@kernel.org>
Cc: mpenttil@redhat.com, linux-mm@kvack.org,
linux-kernel@vger.kernel.org, Jason Gunthorpe <jgg@nvidia.com>,
Leon Romanovsky <leonro@nvidia.com>,
Balbir Singh <balbirs@nvidia.com>, Zi Yan <ziy@nvidia.com>,
Matthew Brost <matthew.brost@intel.com>
Subject: Re: [PATCH v5 0/6] Migrate on fault for device pages
Date: Fri, 13 Mar 2026 11:27:45 +1100 [thread overview]
Message-ID: <abNZdtC74SHAy9TQ@nvdebian.thelocal> (raw)
In-Reply-To: <76c5576a-65d2-4dde-a969-ef6577855864@kernel.org>
On 2026-03-13 at 06:58 +1100, "David Hildenbrand (Arm)" <david@kernel.org> wrote...
> On 2/11/26 09:12, mpenttil@redhat.com wrote:
> > From: Mika Penttilä <mpenttil@redhat.com>
> >
> > Currently, the way device page faulting and migration works
> > is not optimal, if you want to do both fault handling and
> > migration at once.
> >
> > Being able to migrate not present pages (or pages mapped with incorrect
> > permissions, eg. COW) to the GPU requires doing either of the
> > following sequences:
> >
> > 1. hmm_range_fault() - fault in non-present pages with correct permissions, etc.
> > 2. migrate_vma_*() - migrate the pages
> >
> > Or:
> >
> > 1. migrate_vma_*() - migrate present pages
> > 2. If non-present pages detected by migrate_vma_*():
> > a) call hmm_range_fault() to fault pages in
> > b) call migrate_vma_*() again to migrate now present pages
> >
> > The problem with the first sequence is that you always have to do two
> > page walks even when most of the time the pages are present or zero page
> > mappings so the common case takes a performance hit.
> >
> > The second sequence is better for the common case, but far worse if
> > pages aren't present because now you have to walk the page tables three
> > times (once to find the page is not present, once so hmm_range_fault()
> > can find a non-present page to fault in and once again to setup the
> > migration). It is also tricky to code correctly. One page table walk
> > could costs over 1000 cpu cycles on X86-64, which is a significant hit.
> >
> > We should be able to walk the page table once, faulting
> > pages in as required and replacing them with migration entries if
> > requested.
> >
> > Add a new flag to HMM APIs, HMM_PFN_REQ_MIGRATE,
> > which tells to prepare for migration also during fault handling.
> > Also, for the migrate_vma_setup() call paths, a flag, MIGRATE_VMA_FAULT,
> > is added to tell to add fault handling to migrate.
> >
> > One extra benefit of migrating with hmm_range_fault() path
> > is the migrate_vma.vma gets populated, so no need to
> > retrieve that separataly.
> >
> > Tested in X86-64 VM with HMM test device, passing the selftests.
> > For performance, the migrate throughput tests from the selftests
> > show similar numbers (within error margin) as unmodified kernel.
> > Tested also rebased on the
> > "Remove device private pages from physical address space" series:
> > https://lore.kernel.org/linux-mm/20260130111050.53670-1-jniethe@nvidia.com/
> > plus a small patch to adjust with no problems.
>
> Stumbling over this in my backlog, so far we have to RB tags.
>
> I'm going to take a look at the core MM bits (soon I hope), but it would
> be great if other people could review the HMM bits and provide proper tags.
Yes, sorry I've been meaning to review this for a while but have been backlogged
for a while. I should be able to get to it early next week though.
> --
> Cheers,
>
> David
next prev parent reply other threads:[~2026-03-13 0:28 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-02-11 8:12 [PATCH v5 0/6] Migrate on fault for device pages mpenttil
2026-02-11 8:12 ` [PATCH v5 1/6] mm:/Kconfig changes for migrate " mpenttil
2026-03-17 8:31 ` David Hildenbrand (Arm)
2026-02-11 8:12 ` [PATCH v5 2/6] mm: Add helper to convert HMM pfn to migrate pfn mpenttil
2026-03-17 9:05 ` David Hildenbrand (Arm)
2026-03-17 13:01 ` Mika Penttilä
2026-03-17 13:45 ` Leon Romanovsky
2026-03-23 17:58 ` Jason Gunthorpe
2026-02-11 8:12 ` [PATCH v5 3/6] mm/hmm: do the plumbing for HMM to participate in migration mpenttil
2026-02-11 8:12 ` [PATCH 4/6] mm: setup device page migration in HMM pagewalk mpenttil
2026-02-11 8:13 ` [PATCH v5 5/6] mm: add new testcase for the migrate on fault case mpenttil
2026-02-11 8:13 ` [PATCH v5 6/6] mm:/migrate_device.c: remove migrate_vma_collect_*() mpenttil
2026-03-12 19:58 ` [PATCH v5 0/6] Migrate on fault for device pages David Hildenbrand (Arm)
2026-03-13 0:27 ` Alistair Popple [this message]
2026-03-17 10:06 ` Balbir Singh
2026-03-17 10:28 ` Mika Penttilä
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=abNZdtC74SHAy9TQ@nvdebian.thelocal \
--to=apopple@nvidia.com \
--cc=balbirs@nvidia.com \
--cc=david@kernel.org \
--cc=jgg@nvidia.com \
--cc=leonro@nvidia.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=matthew.brost@intel.com \
--cc=mpenttil@redhat.com \
--cc=ziy@nvidia.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox