From: "Christian König" <ckoenig.leichtzumerken@gmail.com>
To: Shashank Sharma <shashank.sharma@amd.com>, amd-gfx@lists.freedesktop.org
Cc: Alex Deucher <alexander.deucher@amd.com>,
Christian Koenig <christian.koenig@amd.com>,
arvind.yadav@amd.com, arunpravin.paneerselvam@amd.com
Subject: Re: [RFC 2/7] drm/amdgpu: Add usermode queue for gfx work
Date: Tue, 3 Jan 2023 10:15:39 +0100 [thread overview]
Message-ID: <857effc7-9a67-02ce-f9e3-ae96053b1100@gmail.com> (raw)
In-Reply-To: <a39eab00-ac3f-a217-6ba7-de31680fc4af@amd.com>
Am 03.01.23 um 10:12 schrieb Shashank Sharma:
>
> On 02/01/2023 13:39, Christian König wrote:
>> Hi Shashank,
>>
>> Am 26.12.22 um 11:41 schrieb Shashank Sharma:
>>> [SNIP]
>>>>> /* df */
>>>>> struct amdgpu_df df;
>>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.h
>>>>> b/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.h
>>>>> index 0fa0e56daf67..f7413859b14f 100644
>>>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.h
>>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.h
>>>>> @@ -57,6 +57,7 @@ struct amdgpu_ctx {
>>>>> unsigned long ras_counter_ce;
>>>>> unsigned long ras_counter_ue;
>>>>> uint32_t stable_pstate;
>>>>> + struct amdgpu_usermode_queue *userq;
>>>>
>>>> Why should we have this in the ctx here???
>>>
>>> We are allocating a few things dynamically for the queue, which
>>> would be valid until we destroy this queue. Also we need to save
>>> this queue
>>>
>>> container at some place for the destroy function, and I thought it
>>> would make sense to keep this with the context ptr, as this is how
>>> we are
>>>
>>> identifying the incoming request.
>>
>> I have absolutely no idea how you end up with that design.
>>
>> The ctx object is the CS IOCTL context, that is not even remotely
>> related to anything the user queues should be doing.
>>
>> Please completely drop that relationship and don't use any of the ctx
>> object stuff in the user queue code.
>>
> Historically the workload submission always came with a context (due
> to CS IOCTL), so we thought it would make sense to still have its
> relevance in the new workload submission method. Would you prefer this
> new submission to be independent of AMDGPU context ?
Well not prefer, the point is that this doesn't make any sense at all.
See the amdgpu_ctx object contains the resulting fence pointers for the
CS IOCTL as well as information necessary for the CS IOCTL to work (e.g.
scheduler entities etc...).
I don't see how anything from that stuff would be useful for the MES or
user queues.
Christian.
>
> - Shashank
>
>
>> Christian.
>>
>>>
>>> - Shashank
>>>
>>>>
>>>> Regards,
>>>> Christian.
>>>>
>>>>> };
>>>>> struct amdgpu_ctx_mgr {
>>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_userqueue.c
>>>>> b/drivers/gpu/drm/amd/amdgpu/amdgpu_userqueue.c
>>>>> new file mode 100644
>>>>> index 000000000000..3b6e8f75495c
>>>>> --- /dev/null
>>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_userqueue.c
>>>>> @@ -0,0 +1,187 @@
>>>>> +/*
>>>>> + * Copyright 2022 Advanced Micro Devices, Inc.
>>>>> + *
>>>>> + * Permission is hereby granted, free of charge, to any person
>>>>> obtaining a
>>>>> + * copy of this software and associated documentation files (the
>>>>> "Software"),
>>>>> + * to deal in the Software without restriction, including without
>>>>> limitation
>>>>> + * the rights to use, copy, modify, merge, publish, distribute,
>>>>> sublicense,
>>>>> + * and/or sell copies of the Software, and to permit persons to
>>>>> whom the
>>>>> + * Software is furnished to do so, subject to the following
>>>>> conditions:
>>>>> + *
>>>>> + * The above copyright notice and this permission notice shall be
>>>>> included in
>>>>> + * all copies or substantial portions of the Software.
>>>>> + *
>>>>> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY
>>>>> KIND, EXPRESS OR
>>>>> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
>>>>> MERCHANTABILITY,
>>>>> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO
>>>>> EVENT SHALL
>>>>> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM,
>>>>> DAMAGES OR
>>>>> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR
>>>>> OTHERWISE,
>>>>> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE
>>>>> USE OR
>>>>> + * OTHER DEALINGS IN THE SOFTWARE.
>>>>> + *
>>>>> + */
>>>>> +
>>>>> +#include "amdgpu.h"
>>>>> +#include "amdgpu_vm.h"
>>>>> +#include "amdgpu_mes.h"
>>>>> +#include "amdgpu_usermode_queue.h"
>>>>> +#include "soc15_common.h"
>>>>> +
>>>>> +#define CHECK_ACCESS(a) (access_ok((const void __user *)a,
>>>>> sizeof(__u64)))
>>>>> +
>>>>> +static int
>>>>> +amdgpu_userqueue_index(struct amdgpu_device *adev)
>>>>> +{
>>>>> + int index;
>>>>> + struct amdgpu_userq_globals *uqg = &adev->userq;
>>>>> +
>>>>> + index = ida_simple_get(&uqg->ida, 2, AMDGPU_MAX_USERQ,
>>>>> GFP_KERNEL);
>>>>> + return index;
>>>>> +}
>>>>> +
>>>>> +static void
>>>>> +amdgpu_userqueue_remove_index(struct amdgpu_device *adev, struct
>>>>> amdgpu_usermode_queue *queue)
>>>>> +{
>>>>> + struct amdgpu_userq_globals *uqg = &adev->userq;
>>>>> +
>>>>> + ida_simple_remove(&uqg->ida, queue->queue_id);
>>>>> +}
>>>>> +
>>>>> +static int
>>>>> +amdgpu_userqueue_validate_input(struct amdgpu_device *adev,
>>>>> struct drm_amdgpu_userq_mqd *mqd_in)
>>>>> +{
>>>>> + if (mqd_in->queue_va == 0 || mqd_in->doorbell_handle == 0 ||
>>>>> mqd_in->doorbell_offset == 0) {
>>>>> + DRM_ERROR("Invalid queue object address\n");
>>>>> + return -EINVAL;
>>>>> + }
>>>>> +
>>>>> + if (mqd_in->queue_size == 0 || mqd_in->rptr_va == 0 ||
>>>>> mqd_in->wptr_va == 0) {
>>>>> + DRM_ERROR("Invalid queue object value\n");
>>>>> + return -EINVAL;
>>>>> + }
>>>>> +
>>>>> + if (mqd_in->ip_type < AMDGPU_HW_IP_GFX || mqd_in->ip_type >=
>>>>> AMDGPU_HW_IP_NUM) {
>>>>> + DRM_ERROR("Invalid HW IP type 0x%x\n", mqd_in->ip_type);
>>>>> + return -EINVAL;
>>>>> + }
>>>>> +
>>>>> + if (!CHECK_ACCESS(mqd_in->queue_va) ||
>>>>> !CHECK_ACCESS(mqd_in->rptr_va) ||
>>>>> + !CHECK_ACCESS(mqd_in->wptr_va)) {
>>>>> + DRM_ERROR("Invalid mapping of queue ptrs, access
>>>>> error\n");
>>>>> + return -EINVAL;
>>>>> + }
>>>>> +
>>>>> + DRM_DEBUG_DRIVER("Input parameters to create queue are
>>>>> valid\n");
>>>>> + return 0;
>>>>> +}
>>>>> +
>>>>> +int amdgpu_userqueue_create(struct amdgpu_device *adev, struct
>>>>> drm_file *filp,
>>>>> + union drm_amdgpu_userq *args)
>>>>> +{
>>>>> + int r, pasid;
>>>>> + struct amdgpu_usermode_queue *queue;
>>>>> + struct amdgpu_fpriv *fpriv = filp->driver_priv;
>>>>> + struct amdgpu_vm *vm = &fpriv->vm;
>>>>> + struct amdgpu_ctx *ctx = amdgpu_ctx_get(fpriv, args->in.ctx_id);
>>>>> + struct drm_amdgpu_userq_mqd *mqd_in = &args->in.mqd;
>>>>> +
>>>>> + if (!ctx) {
>>>>> + DRM_ERROR("Invalid GPU context\n");
>>>>> + return -EINVAL;
>>>>> + }
>>>>> +
>>>>> + if (vm->pasid < 0) {
>>>>> + DRM_WARN("No PASID info found\n");
>>>>> + pasid = 0;
>>>>> + }
>>>>> +
>>>>> + mutex_lock(&adev->userq.userq_mutex);
>>>>> +
>>>>> + queue = kzalloc(sizeof(struct amdgpu_usermode_queue),
>>>>> GFP_KERNEL);
>>>>> + if (!queue) {
>>>>> + DRM_ERROR("Failed to allocate memory for queue\n");
>>>>> + mutex_unlock(&adev->userq.userq_mutex);
>>>>> + return -ENOMEM;
>>>>> + }
>>>>> +
>>>>> + r = amdgpu_userqueue_validate_input(adev, mqd_in);
>>>>> + if (r < 0) {
>>>>> + DRM_ERROR("Invalid input to create queue\n");
>>>>> + goto free_queue;
>>>>> + }
>>>>> +
>>>>> + queue->vm = vm;
>>>>> + queue->pasid = pasid;
>>>>> + queue->wptr_gpu_addr = mqd_in->wptr_va;
>>>>> + queue->rptr_gpu_addr = mqd_in->rptr_va;
>>>>> + queue->queue_size = mqd_in->queue_size;
>>>>> + queue->queue_type = mqd_in->ip_type;
>>>>> + queue->paging = false;
>>>>> + queue->flags = mqd_in->flags;
>>>>> + queue->queue_id = amdgpu_userqueue_index(adev);
>>>>> +
>>>>> + ctx->userq = queue;
>>>>> + args->out.q_id = queue->queue_id;
>>>>> + args->out.flags = 0;
>>>>> + mutex_unlock(&adev->userq.userq_mutex);
>>>>> + return 0;
>>>>> +
>>>>> +free_queue:
>>>>> + amdgpu_userqueue_remove_index(adev, queue);
>>>>> + mutex_unlock(&adev->userq.userq_mutex);
>>>>> + kfree(queue);
>>>>> + return r;
>>>>> +}
>>>>> +
>>>>> +void amdgpu_userqueue_destroy(struct amdgpu_device *adev, struct
>>>>> drm_file *filp,
>>>>> + union drm_amdgpu_userq *args)
>>>>> +{
>>>>> + struct amdgpu_fpriv *fpriv = filp->driver_priv;
>>>>> + struct amdgpu_ctx *ctx = amdgpu_ctx_get(fpriv, args->in.ctx_id);
>>>>> + struct amdgpu_usermode_queue *queue = ctx->userq;
>>>>> +
>>>>> + mutex_lock(&adev->userq.userq_mutex);
>>>>> + amdgpu_userqueue_remove_index(adev, queue);
>>>>> + ctx->userq = NULL;
>>>>> + mutex_unlock(&adev->userq.userq_mutex);
>>>>> + kfree(queue);
>>>>> +}
>>>>> +
>>>>> +int amdgpu_userq_ioctl(struct drm_device *dev, void *data,
>>>>> + struct drm_file *filp)
>>>>> +{
>>>>> + union drm_amdgpu_userq *args = data;
>>>>> + struct amdgpu_device *adev = drm_to_adev(dev);
>>>>> + int r = 0;
>>>>> +
>>>>> + switch (args->in.op) {
>>>>> + case AMDGPU_USERQ_OP_CREATE:
>>>>> + r = amdgpu_userqueue_create(adev, filp, args);
>>>>> + if (r)
>>>>> + DRM_ERROR("Failed to create usermode queue\n");
>>>>> + break;
>>>>> +
>>>>> + case AMDGPU_USERQ_OP_FREE:
>>>>> + amdgpu_userqueue_destroy(adev, filp, args);
>>>>> + break;
>>>>> +
>>>>> + default:
>>>>> + DRM_ERROR("Invalid user queue op specified: %d\n",
>>>>> args->in.op);
>>>>> + return -EINVAL;
>>>>> + }
>>>>> +
>>>>> + return r;
>>>>> +}
>>>>> +
>>>>> +int amdgpu_userqueue_init(struct amdgpu_device *adev)
>>>>> +{
>>>>> + struct amdgpu_userq_globals *uqg = &adev->userq;
>>>>> +
>>>>> + mutex_init(&uqg->userq_mutex);
>>>>> + return 0;
>>>>> +}
>>>>> +
>>>>> +void amdgpu_userqueue_fini(struct amdgpu_device *adev)
>>>>> +{
>>>>> +
>>>>> +}
>>>>> diff --git a/drivers/gpu/drm/amd/include/amdgpu_usermode_queue.h
>>>>> b/drivers/gpu/drm/amd/include/amdgpu_usermode_queue.h
>>>>> new file mode 100644
>>>>> index 000000000000..c1fe39ffaf72
>>>>> --- /dev/null
>>>>> +++ b/drivers/gpu/drm/amd/include/amdgpu_usermode_queue.h
>>>>> @@ -0,0 +1,50 @@
>>>>> +/*
>>>>> + * Copyright 2022 Advanced Micro Devices, Inc.
>>>>> + *
>>>>> + * Permission is hereby granted, free of charge, to any person
>>>>> obtaining a
>>>>> + * copy of this software and associated documentation files (the
>>>>> "Software"),
>>>>> + * to deal in the Software without restriction, including without
>>>>> limitation
>>>>> + * the rights to use, copy, modify, merge, publish, distribute,
>>>>> sublicense,
>>>>> + * and/or sell copies of the Software, and to permit persons to
>>>>> whom the
>>>>> + * Software is furnished to do so, subject to the following
>>>>> conditions:
>>>>> + *
>>>>> + * The above copyright notice and this permission notice shall be
>>>>> included in
>>>>> + * all copies or substantial portions of the Software.
>>>>> + *
>>>>> + * THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY
>>>>> KIND, EXPRESS OR
>>>>> + * IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
>>>>> MERCHANTABILITY,
>>>>> + * FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO
>>>>> EVENT SHALL
>>>>> + * THE COPYRIGHT HOLDER(S) OR AUTHOR(S) BE LIABLE FOR ANY CLAIM,
>>>>> DAMAGES OR
>>>>> + * OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR
>>>>> OTHERWISE,
>>>>> + * ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE
>>>>> USE OR
>>>>> + * OTHER DEALINGS IN THE SOFTWARE.
>>>>> + *
>>>>> + */
>>>>> +
>>>>> +#ifndef AMDGPU_USERMODE_QUEUE_H_
>>>>> +#define AMDGPU_USERMODE_QUEUE_H_
>>>>> +
>>>>> +#define AMDGPU_MAX_USERQ 512
>>>>> +
>>>>> +struct amdgpu_usermode_queue {
>>>>> + int queue_id;
>>>>> + int queue_type;
>>>>> + int queue_size;
>>>>> + int paging;
>>>>> + int pasid;
>>>>> + int use_doorbell;
>>>>> + int doorbell_index;
>>>>> +
>>>>> + uint64_t mqd_gpu_addr;
>>>>> + uint64_t wptr_gpu_addr;
>>>>> + uint64_t rptr_gpu_addr;
>>>>> + uint64_t queue_gpu_addr;
>>>>> + uint64_t flags;
>>>>> + void *mqd_cpu_ptr;
>>>>> +
>>>>> + struct amdgpu_bo *mqd_obj;
>>>>> + struct amdgpu_vm *vm;
>>>>> + struct list_head list;
>>>>> +};
>>>>> +
>>>>> +#endif
>>>>
>>
next prev parent reply other threads:[~2023-01-03 9:15 UTC|newest]
Thread overview: 64+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-12-23 19:36 [RFC 0/7] RFC: Usermode queue for AMDGPU driver Shashank Sharma
2022-12-23 19:36 ` [RFC 1/7] drm/amdgpu: UAPI for user queue management Shashank Sharma
2022-12-24 20:20 ` Bas Nieuwenhuizen
2022-12-27 16:58 ` Alex Deucher
2023-01-02 11:27 ` Christian König
2023-01-03 19:51 ` Alex Deucher
2023-01-02 13:26 ` Christian König
2023-01-03 14:23 ` Alex Deucher
2023-01-03 18:29 ` Felix Kuehling
2023-01-03 19:17 ` Liu, Shaoyun
2023-01-03 19:22 ` Alex Deucher
2023-01-03 19:25 ` Liu, Shaoyun
2023-01-03 19:52 ` Alex Deucher
2023-01-03 20:05 ` Felix Kuehling
2023-01-03 19:18 ` Alex Deucher
2022-12-23 19:36 ` [RFC 2/7] drm/amdgpu: Add usermode queue for gfx work Shashank Sharma
2022-12-24 18:19 ` Oded Gabbay
2022-12-26 10:34 ` Shashank Sharma
2022-12-25 15:44 ` Christian König
2022-12-26 10:41 ` Shashank Sharma
2023-01-02 12:39 ` Christian König
2023-01-03 9:12 ` Shashank Sharma
2023-01-03 9:15 ` Christian König [this message]
2023-01-03 9:22 ` Shashank Sharma
2023-01-03 9:35 ` Christian König
2023-01-03 14:34 ` Alex Deucher
2023-01-03 14:50 ` Christian König
2022-12-29 17:41 ` Alex Deucher
2023-01-02 13:53 ` Christian König
2023-01-03 9:32 ` Shashank Sharma
2023-01-03 9:16 ` Shashank Sharma
2023-01-04 8:55 ` Zhu, Jiadong
2023-01-04 8:58 ` Shashank Sharma
2022-12-23 19:36 ` [RFC 3/7] drm/amdgpu: Create MQD for userspace queue Shashank Sharma
2022-12-29 17:47 ` Alex Deucher
2023-01-03 9:36 ` Shashank Sharma
2023-01-03 18:37 ` Felix Kuehling
2023-01-04 6:21 ` Yadav, Arvind
2023-01-04 9:10 ` Christian König
2023-01-04 9:13 ` Shashank Sharma
2023-01-04 9:17 ` Christian König
2023-01-04 9:23 ` Shashank Sharma
2023-01-04 14:35 ` Felix Kuehling
2023-01-04 14:38 ` Yadav, Arvind
2023-01-04 14:41 ` Shashank Sharma
2023-01-04 14:28 ` Alex Deucher
2022-12-23 19:36 ` [RFC 4/7] drm/amdgpu: Allocate doorbell slot for user queue Shashank Sharma
2022-12-29 17:50 ` Alex Deucher
2023-01-03 9:37 ` Shashank Sharma
2022-12-23 19:36 ` [RFC 5/7] drm/amdgpu: Create context for usermode queue Shashank Sharma
2022-12-29 17:54 ` Alex Deucher
2023-01-03 9:40 ` Shashank Sharma
2023-01-03 14:48 ` Alex Deucher
2022-12-23 19:36 ` [RFC 6/7] drm/amdgpu: Map userqueue into HW Shashank Sharma
2022-12-29 17:51 ` Alex Deucher
2023-01-03 9:38 ` Shashank Sharma
2022-12-23 19:36 ` [RFC 7/7] drm/amdgpu: Secure semaphore for usermode queue Shashank Sharma
2022-12-25 10:07 ` Zhang, Yifan
2022-12-27 9:32 ` Arunpravin Paneer Selvam
2022-12-29 18:02 ` [RFC 0/7] RFC: Usermode queue for AMDGPU driver Alex Deucher
2023-01-03 9:43 ` Shashank Sharma
2023-01-03 9:47 ` Christian König
2023-01-03 10:00 ` Shashank Sharma
2023-01-03 10:02 ` Christian König
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=857effc7-9a67-02ce-f9e3-ae96053b1100@gmail.com \
--to=ckoenig.leichtzumerken@gmail.com \
--cc=alexander.deucher@amd.com \
--cc=amd-gfx@lists.freedesktop.org \
--cc=arunpravin.paneerselvam@amd.com \
--cc=arvind.yadav@amd.com \
--cc=christian.koenig@amd.com \
--cc=shashank.sharma@amd.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox