From: Boris Brezillon <boris.brezillon@collabora.com>
To: Gaurav Kohli <quic_gkohli@quicinc.com>
Cc: Rob Clark <robdclark@gmail.com>,
Steven Price <steven.price@arm.com>,
Joerg Roedel <joro@8bytes.org>, <iommu@lists.linux.dev>,
Will Deacon <will@kernel.org>,
Robin Murphy <robin.murphy@arm.com>,
<linux-arm-kernel@lists.infradead.org>,
<linux-arm-msm@vger.kernel.org>
Subject: Re: [PATCH 0/2] iommu: Allow passing custom allocators to pgtable drivers
Date: Fri, 10 Nov 2023 10:47:50 +0100 [thread overview]
Message-ID: <20231110104750.198b6967@collabora.com> (raw)
In-Reply-To: <84277a5c-a33f-a412-6dec-2ab786b1a47d@quicinc.com>
Hi Gaurav,
On Tue, 7 Nov 2023 17:22:39 +0530
Gaurav Kohli <quic_gkohli@quicinc.com> wrote:
> On 10/24/2023 2:32 AM, Rob Clark wrote:
> > On Wed, Sep 20, 2023 at 6:12 AM Steven Price <steven.price@arm.com> wrote:
> >>
> >> On 09/08/2023 13:17, Boris Brezillon wrote:
> >>> Hello,
> >>>
> >>> This patchset is an attempt at making page table allocation
> >>> customizable. This is useful to some GPU drivers for various reasons:
> >>>
> >>> - speed-up upcoming page table allocations by managing a pool of free
> >>> pages
> >>> - batch page table allocation instead of allocating one page at a time
> >>> - pre-reserve pages for page tables needed for map/unmap operations and
> >>> return the unused page tables to some pool
> >>>
> >>> The first and last reasons are particularly important for GPU drivers
> >>> wanting to implement asynchronous VM_BIND. Asynchronous VM_BIND requires
> >>> that any page table needed for a map/unmap operation to succeed be
> >>> allocated at VM_BIND job creation time. At the time of the job creation,
> >>> we don't know what the VM will look like when we get to execute the
> >>> map/unmap, and can't guess how many page tables we will need. Because
> >>> of that, we have to over-provision page tables for the worst case
> >>> scenario (page table tree is empty), which means we will allocate/free
> >>> a lot. Having pool a pool of free pages is crucial if we want to
> >>> speed-up VM_BIND requests.
> >>>
> >>> A real example of how such custom allocators can be used is available
> >>> here[1]. v2 of the Panthor driver is approaching submission, and I
> >>> figured I'd try to upstream the dependencies separately, which is
> >>> why I submit this series now, even though the user of this new API
> >>> will come afterwards. If you'd prefer to have those patches submitted
> >>> along with the Panthor driver, let me know.
> >>>
> >>> This approach has been discussed with Robin, and is hopefully not too
> >>> far from what he had in mind.
> >>
> >> The alternative would be to embed a cache of pages into the IOMMU
> >> framework, however kmem_cache sadly doesn't seem to support the
> >> 'reserve' of pages concept that we need. mempools could be a solution
> >> but the mempool would need to be created by the IOMMU framework as the
> >> alloc/free functions are specified when creating the pool. So it would
> >> be a much bigger change (to drivers/iommu).
> >>
> >> So, given that so far it's just Panthor this seems like the right
> >> approach for now - when/if other drivers want the same functionality
> >> then it might make sense to revisit the idea of doing the caching within
> >> the IOMMU framework.
> >
> > I have some plans to use this as well for drm/msm.. but the reasons
> > and requirements are basically the same as for panthor. I think I
> > prefer the custom allocator approach, rather than tying this to IOMMU
> > framework. (But ofc custom allocators, I guess, does not prevent the
> > iommu driver from doing it's own caching.)
> >
> > BR,
> > -R
> >
>
> We have also posted one RFC patch series which is based on this current
> patches by Boris and helping us to define our custom alloc and free
> pgtable call. For our side usecase we have a requirement to create
> pgtable from HLOS and then share it to different entity(VMID) and
> basically that also requires few smc calls and for that we need
> custom alloc/free callbacks.
>
> https://lore.kernel.org/all/20231101071144.16309-1-quic_gkohli@quicinc.com/
>
>
> So custom allocator and free ops is helping for us also. Is there any
> plan to merge these patches from Boris.
Sorry for the late reply. I just sent a v2, but I forgot to add your
Tested-by :-/. Feel free to add it back.
Regards,
Boris
prev parent reply other threads:[~2023-11-10 9:47 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20230809121744.2341454-1-boris.brezillon@collabora.com>
[not found] ` <f72e5ab1-c6f2-ec03-68d6-1686145f3a4d@arm.com>
[not found] ` <CAF6AEGtDxkiiM5NNT7Y9-sY5eCQX15ij=RzU-NX2Mvf6kNk2nA@mail.gmail.com>
2023-11-07 11:52 ` [PATCH 0/2] iommu: Allow passing custom allocators to pgtable drivers Gaurav Kohli
2023-11-07 12:01 ` Gaurav Kohli
2023-11-10 9:47 ` Boris Brezillon [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20231110104750.198b6967@collabora.com \
--to=boris.brezillon@collabora.com \
--cc=iommu@lists.linux.dev \
--cc=joro@8bytes.org \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-arm-msm@vger.kernel.org \
--cc=quic_gkohli@quicinc.com \
--cc=robdclark@gmail.com \
--cc=robin.murphy@arm.com \
--cc=steven.price@arm.com \
--cc=will@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox