From: "Christian König" <christian.koenig@amd.com>
To: Rob Clark <robdclark@gmail.com>
Cc: dri-devel@lists.freedesktop.org, freedreno@lists.freedesktop.org,
linux-arm-msm@vger.kernel.org,
Connor Abbott <cwabbott0@gmail.com>,
Rob Clark <robdclark@chromium.org>,
Abhinav Kumar <quic_abhinavk@quicinc.com>,
Dmitry Baryshkov <lumag@kernel.org>, Sean Paul <sean@poorly.run>,
Marijn Suijten <marijn.suijten@somainline.org>,
David Airlie <airlied@gmail.com>, Simona Vetter <simona@ffwll.ch>,
Konrad Dybcio <konradybcio@kernel.org>,
Maarten Lankhorst <maarten.lankhorst@linux.intel.com>,
Maxime Ripard <mripard@kernel.org>,
Thomas Zimmermann <tzimmermann@suse.de>,
Sumit Semwal <sumit.semwal@linaro.org>,
open list <linux-kernel@vger.kernel.org>,
"open list:DMA BUFFER SHARING
FRAMEWORK:Keyword:bdma_(?:buf|fence|resv)b"
<linux-media@vger.kernel.org>,
"moderated list:DMA BUFFER SHARING
FRAMEWORK:Keyword:bdma_(?:buf|fence|resv)b"
<linaro-mm-sig@lists.linaro.org>
Subject: Re: [PATCH v4 21/33] drm/msm: Add _NO_SHARE flag
Date: Mon, 5 May 2025 17:17:18 +0200 [thread overview]
Message-ID: <5f940da3-a32d-4ca7-966d-8b1df78c0d68@amd.com> (raw)
In-Reply-To: <CAF6AEGtmjLM-tK9Y=gT5XupW62X_eY2fiBJCYUnKqO9A9C4xFg@mail.gmail.com>
On 5/5/25 16:15, Rob Clark wrote:
> On Mon, May 5, 2025 at 12:54 AM Christian König
> <christian.koenig@amd.com> wrote:
>>
>> On 5/2/25 18:56, Rob Clark wrote:
>>> From: Rob Clark <robdclark@chromium.org>
>>>
>>> Buffers that are not shared between contexts can share a single resv
>>> object. This way drm_gpuvm will not track them as external objects, and
>>> submit-time validating overhead will be O(1) for all N non-shared BOs,
>>> instead of O(n).
>>>
>>> Signed-off-by: Rob Clark <robdclark@chromium.org>
>>> ---
>>> drivers/gpu/drm/msm/msm_drv.h | 1 +
>>> drivers/gpu/drm/msm/msm_gem.c | 23 +++++++++++++++++++++++
>>> drivers/gpu/drm/msm/msm_gem_prime.c | 15 +++++++++++++++
>>> include/uapi/drm/msm_drm.h | 14 ++++++++++++++
>>> 4 files changed, 53 insertions(+)
>>>
>>> diff --git a/drivers/gpu/drm/msm/msm_drv.h b/drivers/gpu/drm/msm/msm_drv.h
>>> index b77fd2c531c3..b0add236cbb3 100644
>>> --- a/drivers/gpu/drm/msm/msm_drv.h
>>> +++ b/drivers/gpu/drm/msm/msm_drv.h
>>> @@ -246,6 +246,7 @@ int msm_gem_prime_vmap(struct drm_gem_object *obj, struct iosys_map *map);
>>> void msm_gem_prime_vunmap(struct drm_gem_object *obj, struct iosys_map *map);
>>> struct drm_gem_object *msm_gem_prime_import_sg_table(struct drm_device *dev,
>>> struct dma_buf_attachment *attach, struct sg_table *sg);
>>> +struct dma_buf *msm_gem_prime_export(struct drm_gem_object *obj, int flags);
>>> int msm_gem_prime_pin(struct drm_gem_object *obj);
>>> void msm_gem_prime_unpin(struct drm_gem_object *obj);
>>>
>>> diff --git a/drivers/gpu/drm/msm/msm_gem.c b/drivers/gpu/drm/msm/msm_gem.c
>>> index 3708d4579203..d0f44c981351 100644
>>> --- a/drivers/gpu/drm/msm/msm_gem.c
>>> +++ b/drivers/gpu/drm/msm/msm_gem.c
>>> @@ -532,6 +532,9 @@ static int get_and_pin_iova_range_locked(struct drm_gem_object *obj,
>>>
>>> msm_gem_assert_locked(obj);
>>>
>>> + if (to_msm_bo(obj)->flags & MSM_BO_NO_SHARE)
>>> + return -EINVAL;
>>> +
>>> vma = get_vma_locked(obj, vm, range_start, range_end);
>>> if (IS_ERR(vma))
>>> return PTR_ERR(vma);
>>> @@ -1060,6 +1063,16 @@ static void msm_gem_free_object(struct drm_gem_object *obj)
>>> put_pages(obj);
>>> }
>>>
>>> + if (msm_obj->flags & MSM_BO_NO_SHARE) {
>>> + struct drm_gem_object *r_obj =
>>> + container_of(obj->resv, struct drm_gem_object, _resv);
>>> +
>>> + BUG_ON(obj->resv == &obj->_resv);
>>> +
>>> + /* Drop reference we hold to shared resv obj: */
>>> + drm_gem_object_put(r_obj);
>>> + }
>>> +
>>> drm_gem_object_release(obj);
>>>
>>> kfree(msm_obj->metadata);
>>> @@ -1092,6 +1105,15 @@ int msm_gem_new_handle(struct drm_device *dev, struct drm_file *file,
>>> if (name)
>>> msm_gem_object_set_name(obj, "%s", name);
>>>
>>> + if (flags & MSM_BO_NO_SHARE) {
>>> + struct msm_context *ctx = file->driver_priv;
>>> + struct drm_gem_object *r_obj = drm_gpuvm_resv_obj(ctx->vm);
>>> +
>>> + drm_gem_object_get(r_obj);
>>> +
>>> + obj->resv = r_obj->resv;
>>> + }
>>> +
>>> ret = drm_gem_handle_create(file, obj, handle);
>>>
>>> /* drop reference from allocate - handle holds it now */
>>> @@ -1124,6 +1146,7 @@ static const struct drm_gem_object_funcs msm_gem_object_funcs = {
>>> .free = msm_gem_free_object,
>>> .open = msm_gem_open,
>>> .close = msm_gem_close,
>>> + .export = msm_gem_prime_export,
>>> .pin = msm_gem_prime_pin,
>>> .unpin = msm_gem_prime_unpin,
>>> .get_sg_table = msm_gem_prime_get_sg_table,
>>> diff --git a/drivers/gpu/drm/msm/msm_gem_prime.c b/drivers/gpu/drm/msm/msm_gem_prime.c
>>> index ee267490c935..1a6d8099196a 100644
>>> --- a/drivers/gpu/drm/msm/msm_gem_prime.c
>>> +++ b/drivers/gpu/drm/msm/msm_gem_prime.c
>>> @@ -16,6 +16,9 @@ struct sg_table *msm_gem_prime_get_sg_table(struct drm_gem_object *obj)
>>> struct msm_gem_object *msm_obj = to_msm_bo(obj);
>>> int npages = obj->size >> PAGE_SHIFT;
>>>
>>> + if (msm_obj->flags & MSM_BO_NO_SHARE)
>>> + return ERR_PTR(-EINVAL);
>>> +
>>> if (WARN_ON(!msm_obj->pages)) /* should have already pinned! */
>>> return ERR_PTR(-ENOMEM);
>>>
>>> @@ -45,6 +48,15 @@ struct drm_gem_object *msm_gem_prime_import_sg_table(struct drm_device *dev,
>>> return msm_gem_import(dev, attach->dmabuf, sg);
>>> }
>>>
>>> +
>>> +struct dma_buf *msm_gem_prime_export(struct drm_gem_object *obj, int flags)
>>> +{
>>> + if (to_msm_bo(obj)->flags & MSM_BO_NO_SHARE)
>>> + return ERR_PTR(-EPERM);
>>> +
>>> + return drm_gem_prime_export(obj, flags);
>>> +}
>>> +
>>> int msm_gem_prime_pin(struct drm_gem_object *obj)
>>> {
>>> struct page **pages;
>>> @@ -53,6 +65,9 @@ int msm_gem_prime_pin(struct drm_gem_object *obj)
>>> if (obj->import_attach)
>>> return 0;
>>>
>>> + if (to_msm_bo(obj)->flags & MSM_BO_NO_SHARE)
>>> + return -EINVAL;
>>> +
>>> pages = msm_gem_pin_pages_locked(obj);
>>> if (IS_ERR(pages))
>>> ret = PTR_ERR(pages);
>>> diff --git a/include/uapi/drm/msm_drm.h b/include/uapi/drm/msm_drm.h
>>> index b974f5a24dbc..1bccc347945c 100644
>>> --- a/include/uapi/drm/msm_drm.h
>>> +++ b/include/uapi/drm/msm_drm.h
>>> @@ -140,6 +140,19 @@ struct drm_msm_param {
>>>
>>> #define MSM_BO_SCANOUT 0x00000001 /* scanout capable */
>>> #define MSM_BO_GPU_READONLY 0x00000002
>>> +/* Private buffers do not need to be explicitly listed in the SUBMIT
>>> + * ioctl, unless referenced by a drm_msm_gem_submit_cmd. Private
>>> + * buffers may NOT be imported/exported or used for scanout (or any
>>> + * other situation where buffers can be indefinitely pinned, but
>>> + * cases other than scanout are all kernel owned BOs which are not
>>> + * visible to userspace).
>>
>> Why is pinning for scanout a problem with those?
>>
>> Maybe I missed something but for other drivers that doesn't seem to be a problem.
>
> I guess _technically_ it could be ok because we track pin-count
> separately from dma_resv. But the motivation for that statement was
> simply that _NO_SHARE buffers share a resv obj with the VM, so they
> should not be used in a different VM (in this case, the display, which
> has it's own VM).
Ah, yes that makes perfect sense.
You should indeed avoid importing the BO into a different VM when it shares the reservation object with it. That will only cause trouble.
But at least amdgpu/radeon and I think i915 as well don't need to do that. Scanout is just separate from all VMs.
Regards,
Christian.
>
> BR,
> -R
>
>> Regards,
>> Christian.
>>
>>
>>> + *
>>> + * In exchange for those constraints, all private BOs associated with
>>> + * a single context (drm_file) share a single dma_resv, and if there
>>> + * has been no eviction since the last submit, there are no per-BO
>>> + * bookeeping to do, significantly cutting the SUBMIT overhead.
>>> + */
>>> +#define MSM_BO_NO_SHARE 0x00000004
>>> #define MSM_BO_CACHE_MASK 0x000f0000
>>> /* cache modes */
>>> #define MSM_BO_CACHED 0x00010000
>>> @@ -149,6 +162,7 @@ struct drm_msm_param {
>>>
>>> #define MSM_BO_FLAGS (MSM_BO_SCANOUT | \
>>> MSM_BO_GPU_READONLY | \
>>> + MSM_BO_NO_SHARE | \
>>> MSM_BO_CACHE_MASK)
>>>
>>> struct drm_msm_gem_new {
>>
next prev parent reply other threads:[~2025-05-05 15:17 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-05-02 16:56 [PATCH v4 00/33] drm/msm: sparse / "VM_BIND" support Rob Clark
2025-05-02 16:56 ` [PATCH v4 01/33] drm/gpuvm: Don't require obj lock in destructor path Rob Clark
2025-05-02 16:56 ` [PATCH v4 02/33] drm/gpuvm: Allow VAs to hold soft reference to BOs Rob Clark
2025-05-02 16:56 ` [PATCH v4 03/33] iommu/io-pgtable-arm: Add quirk to quiet WARN_ON() Rob Clark
2025-05-02 19:29 ` ALOK TIWARI
2025-05-02 16:56 ` [PATCH v4 04/33] drm/msm: Rename msm_file_private -> msm_context Rob Clark
2025-05-02 16:56 ` [PATCH v4 05/33] drm/msm: Improve msm_context comments Rob Clark
2025-05-02 16:56 ` [PATCH v4 06/33] drm/msm: Rename msm_gem_address_space -> msm_gem_vm Rob Clark
2025-05-02 16:56 ` [PATCH v4 07/33] drm/msm: Remove vram carveout support Rob Clark
2025-05-02 16:56 ` [PATCH v4 08/33] drm/msm: Collapse vma allocation and initialization Rob Clark
2025-05-02 16:56 ` [PATCH v4 09/33] drm/msm: Collapse vma close and delete Rob Clark
2025-05-02 16:56 ` [PATCH v4 10/33] drm/msm: Don't close VMAs on purge Rob Clark
2025-05-02 16:56 ` [PATCH v4 11/33] drm/msm: drm_gpuvm conversion Rob Clark
2025-05-02 16:56 ` [PATCH v4 12/33] drm/msm: Convert vm locking Rob Clark
2025-05-02 16:56 ` [PATCH v4 13/33] drm/msm: Use drm_gpuvm types more Rob Clark
2025-05-02 16:56 ` [PATCH v4 14/33] drm/msm: Split out helper to get iommu prot flags Rob Clark
2025-05-02 16:56 ` [PATCH v4 15/33] drm/msm: Add mmu support for non-zero offset Rob Clark
2025-05-02 16:56 ` [PATCH v4 16/33] drm/msm: Add PRR support Rob Clark
2025-05-02 16:56 ` [PATCH v4 17/33] drm/msm: Rename msm_gem_vma_purge() -> _unmap() Rob Clark
2025-05-02 16:56 ` [PATCH v4 18/33] drm/msm: Lazily create context VM Rob Clark
2025-05-02 16:56 ` [PATCH v4 19/33] drm/msm: Add opt-in for VM_BIND Rob Clark
2025-05-02 16:56 ` [PATCH v4 20/33] drm/msm: Mark VM as unusable on GPU hangs Rob Clark
2025-05-02 16:56 ` [PATCH v4 21/33] drm/msm: Add _NO_SHARE flag Rob Clark
2025-05-05 7:54 ` Christian König
2025-05-05 14:15 ` Rob Clark
2025-05-05 15:17 ` Christian König [this message]
2025-05-02 16:56 ` [PATCH v4 22/33] drm/msm: Crashdump prep for sparse mappings Rob Clark
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5f940da3-a32d-4ca7-966d-8b1df78c0d68@amd.com \
--to=christian.koenig@amd.com \
--cc=airlied@gmail.com \
--cc=cwabbott0@gmail.com \
--cc=dri-devel@lists.freedesktop.org \
--cc=freedreno@lists.freedesktop.org \
--cc=konradybcio@kernel.org \
--cc=linaro-mm-sig@lists.linaro.org \
--cc=linux-arm-msm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-media@vger.kernel.org \
--cc=lumag@kernel.org \
--cc=maarten.lankhorst@linux.intel.com \
--cc=marijn.suijten@somainline.org \
--cc=mripard@kernel.org \
--cc=quic_abhinavk@quicinc.com \
--cc=robdclark@chromium.org \
--cc=robdclark@gmail.com \
--cc=sean@poorly.run \
--cc=simona@ffwll.ch \
--cc=sumit.semwal@linaro.org \
--cc=tzimmermann@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox