AMD-GFX Archive on lore.kernel.org
 help / color / mirror / Atom feed
From: "Huang, Honglei1" <honghuan@amd.com>
To: "Christian König" <christian.koenig@amd.com>, Felix.Kuehling@amd.com
Cc: Alexander.Deucher@amd.com, Oak.Zeng@amd.com,
	Jenny-Jing.Liu@amd.com, Philip.Yang@amd.com,
	Xiaogang.Chen@amd.com, Ray.Huang@amd.com, Lingshan.Zhu@amd.com,
	Junhua.Shen@amd.com, matthew.brost@intel.com,
	rodrigo.vivi@intel.com, thomas.hellstrom@linux.intel.com,
	dakr@kernel.org, aliceryhl@google.com,
	amd-gfx@lists.freedesktop.org, dri-devel@lists.freedesktop.org
Subject: Re: [RFC/POC PATCH 01/12] drm/amdgpu: add SVM UAPI definitions
Date: Fri, 24 Apr 2026 18:12:24 +0800	[thread overview]
Message-ID: <9c5afd4b-f9f0-445a-9b39-35f56fa2bea4@amd.com> (raw)
In-Reply-To: <daa7239f-7fc9-492d-849f-2d46bd84999b@amd.com>



On 4/23/2026 7:06 PM, Huang, Honglei1 wrote:
> 
> 
> On 4/23/2026 6:39 PM, Christian König wrote:
>> On 4/23/26 08:21, Huang, Honglei1 wrote:
>>>
>>>
>>> On 4/20/2026 11:37 PM, Christian König wrote:
>>>> On 4/20/26 15:30, Huang, Honglei1 wrote:
>>>>> On 4/20/2026 8:15 PM, Christian König wrote:
>>>>>>
>>>>>>
>>>>>> On 4/20/26 14:07, Honglei Huang wrote:
>>>>>>> From: Honglei Huang <honghuan@amd.com>
>>>>>>>
>>>>>>> Add amdgpu drm SVM API definitions built on the
>>>>>>> DRM GPUSVM framework.
>>>>>>>
>>>>>>> This includes:
>>>>>>> - DRM_AMDGPU_GEM_SVM ioctl
>>>>>>> - AMDGPU_SVM_FLAG_* flags
>>>>>>> - AMDGPU_SVM_OP_SET_ATTR / AMDGPU_SVM_OP_GET_ATTR operations
>>>>>>> - AMDGPU_SVM_ATTR_* attribute types
>>>>>>> - AMDGPU_SVM_LOCATION_SYSMEM / AMDGPU_SVM_LOCATION_UNDEFINED
>>>>>>> - struct drm_amdgpu_svm_attribute and struct drm_amdgpu_gem_svm
>>>>>>>
>>>>>>> Signed-off-by: Honglei Huang <honghuan@amd.com>
>>>>>>> ---
>>>>>>>     include/uapi/drm/amdgpu_drm.h | 39 ++++++++++++++++++++++++++ 
>>>>>>> +++++++++
>>>>>>>     1 file changed, 39 insertions(+)
>>>>>>>
>>>>>>> diff --git a/include/uapi/drm/amdgpu_drm.h b/include/uapi/drm/ 
>>>>>>> amdgpu_drm.h
>>>>>>> index 406a42be4..bed71ed9b 100644
>>>>>>> --- a/include/uapi/drm/amdgpu_drm.h
>>>>>>> +++ b/include/uapi/drm/amdgpu_drm.h
>>>>>>> @@ -58,6 +58,7 @@ extern "C" {
>>>>>>>     #define DRM_AMDGPU_USERQ_SIGNAL        0x17
>>>>>>>     #define DRM_AMDGPU_USERQ_WAIT        0x18
>>>>>>>     #define DRM_AMDGPU_GEM_LIST_HANDLES    0x19
>>>>>>> +#define DRM_AMDGPU_GEM_SVM        0x1a
>>>>>>>       #define DRM_IOCTL_AMDGPU_GEM_CREATE    
>>>>>>> DRM_IOWR(DRM_COMMAND_BASE + DRM_AMDGPU_GEM_CREATE, union 
>>>>>>> drm_amdgpu_gem_create)
>>>>>>>     #define DRM_IOCTL_AMDGPU_GEM_MMAP    
>>>>>>> DRM_IOWR(DRM_COMMAND_BASE + DRM_AMDGPU_GEM_MMAP, union 
>>>>>>> drm_amdgpu_gem_mmap)
>>>>>>> @@ -79,6 +80,7 @@ extern "C" {
>>>>>>>     #define DRM_IOCTL_AMDGPU_USERQ_SIGNAL    
>>>>>>> DRM_IOWR(DRM_COMMAND_BASE + DRM_AMDGPU_USERQ_SIGNAL, struct 
>>>>>>> drm_amdgpu_userq_signal)
>>>>>>>     #define DRM_IOCTL_AMDGPU_USERQ_WAIT    
>>>>>>> DRM_IOWR(DRM_COMMAND_BASE + DRM_AMDGPU_USERQ_WAIT, struct 
>>>>>>> drm_amdgpu_userq_wait)
>>>>>>>     #define DRM_IOCTL_AMDGPU_GEM_LIST_HANDLES 
>>>>>>> DRM_IOWR(DRM_COMMAND_BASE + DRM_AMDGPU_GEM_LIST_HANDLES, struct 
>>>>>>> drm_amdgpu_gem_list_handles)
>>>>>>> +#define DRM_IOCTL_AMDGPU_GEM_SVM    DRM_IOWR(DRM_COMMAND_BASE + 
>>>>>>> DRM_AMDGPU_GEM_SVM, struct drm_amdgpu_gem_svm)
>>>>>>>       /**
>>>>>>>      * DOC: memory domains
>>>>>>> @@ -1665,6 +1667,43 @@ struct drm_color_ctm_3x4 {
>>>>>>>         __u64 matrix[12];
>>>>>>>     };
>>>>>>>     +#define AMDGPU_SVM_FLAG_HOST_ACCESS        0x00000001
>>>>>>> +#define AMDGPU_SVM_FLAG_COHERENT        0x00000002
>>>>>>> +#define AMDGPU_SVM_FLAG_HIVE_LOCAL        0x00000004
>>>>>>> +#define AMDGPU_SVM_FLAG_GPU_RO            0x00000008
>>>>>>> +#define AMDGPU_SVM_FLAG_GPU_EXEC        0x00000010
>>>>>>> +#define AMDGPU_SVM_FLAG_GPU_READ_MOSTLY        0x00000020
>>>>>>> +#define AMDGPU_SVM_FLAG_GPU_ALWAYS_MAPPED    0x00000040
>>>>>>> +#define AMDGPU_SVM_FLAG_EXT_COHERENT        0x00000080
>>>>>>> +
>>>>>>> +#define AMDGPU_SVM_OP_SET_ATTR        0
>>>>>>> +#define AMDGPU_SVM_OP_GET_ATTR        1
>>>>>>> +
>>>>>>> +#define AMDGPU_SVM_ATTR_PREFERRED_LOC        0
>>>>>>> +#define AMDGPU_SVM_ATTR_PREFETCH_LOC        1
>>>>>>
>>>>>> Up till here the interface makes perfect sense, but then it 
>>>>>> becomes a bit fuzzy.
>>>>>>
>>>>>>> +#define AMDGPU_SVM_ATTR_ACCESS            2
>>>>>>> +#define AMDGPU_SVM_ATTR_ACCESS_IN_PLACE        3
>>>>>>> +#define AMDGPU_SVM_ATTR_NO_ACCESS        4
>>>>>>
>>>>>> Why are those separate attributes? What is the difference between 
>>>>>> those?
>>>>>
>>>>> Really thanks for the comments, I have some content mistaken in V2, 
>>>>> so I updated the V3 to fix that. For the header they are same. for 
>>>>> other content please review the V3, sorry about that. And will fix 
>>>>> the concern you raised in next version.
>>>>>
>>>>> So the meaning of AMDGPU_SVM_ATTR_ACCESS and 
>>>>> AMDGPU_SVM_ATTR_NO_ACCESS are clear, GPU can access it or not, and 
>>>>> the SVM can set the preferred location, it can be in VRAM or 
>>>>> system, for AMDGPU_SVM_ATTR_ACCESS it can be migrated between RAM 
>>>>> and VRAM. For AMDGPU_SVM_ATTR_ACCESS_IN_PLACE,
>>>>> it can not migrate, GPU only can access it in the initial place.
>>>>
>>>> Yeah but that doesn't then the interface doesn't seem to make sense 
>>>> since such states are mutual exclusive.
>>>>
>>>> It would make sense when you have some attribute which is named (for 
>>>> example) AMDGPU_SVM_ATTR_ACCESS which can have the values 
>>>> INACCESSIBLE, IN_PLACE, MIGRATE.
>>>
>>> Got it so can I change the UAPI to the following format?
>>>
>>> enum amdgpu_ioctl_svm_attr_type {
>>>      AMDGPU_IOCTL_SVM_ATTR_PREFERRED_LOC,
>>>      AMDGPU_IOCTL_SVM_ATTR_PREFETCH_LOC,
>>>      AMDGPU_IOCTL_SVM_ATTR_ACCESS,
>>>      AMDGPU_IOCTL_SVM_ATTR_SET_FLAGS,
>>>      AMDGPU_IOCTL_SVM_ATTR_CLR_FLAGS,
>>>      AMDGPU_IOCTL_SVM_ATTR_GRANULARITY
>>> };
>>>
>>> enum amdgpu_ioctl_svm_location {
>>
>> The enum name could probably be improved, but apart from that looks 
>> reasonable to me.
> 
> Will improve the name.
>>
>>>      AMDGPU_SVM_ACCESS_INACCESSIBLE = 1,
>>>      AMDGPU_SVM_ACCESS_IN_PLACE = 2,
>>>      AMDGPU_SVM_ACCESS_MIGRATE = 3,
>>> };
>>>
>>>>
>>>>>>> +#define AMDGPU_SVM_ATTR_SET_FLAGS        5
>>>>>>> +#define AMDGPU_SVM_ATTR_CLR_FLAGS        6
>>>>>>
>>>>>> Why is that separated into set and clear flags?
>>>>>
>>>>> This method inherits from KFD and is also designed to be compatible 
>>>>> with upper layer applications such as ROCR.
>>>>
>>>> That is *not* sufficient as justification. We need to document why 
>>>> that is necessary and *not* just say ROCR works that way.
>>>>
>>>> As far as I can see just a SET_FLAGS should be sufficient.
>>>
>>> Accoding to the reply form Felix, CLR_FLAGS provides a convenient 
>>> method for deleting large-scale flags, do we need to redesign this part?
>>
>> I think we should expose those flags as individual attributes then.
> 
> Got it will do.
> 
>>
>>>>
>>>>>>> +#define AMDGPU_SVM_ATTR_GRANULARITY        7
>>>>>>> +
>>>>>>> +#define AMDGPU_SVM_LOCATION_SYSMEM        0
>>>>>>> +#define AMDGPU_SVM_LOCATION_UNDEFINED        0xffffffff
>>>>>>
>>>>>> No location for device local memory?
>>>>>
>>>>> Vaule > 0 means for device memory, in xe_svm, it seems like it uses 
>>>>> fd for device local memory.
>>>>
>>>> Absolute clear NAK for that approach. This interface is per FD!
>>>>
>>>> We need some value AMDGPU_SVM_LOCATION_DEVICE which means that the 
>>>> memory should be migrated to the current device.
>>>>
>>>> We also need to make sure that setting attributes for different 
>>>> devices doesn't affect each other.
>>>
>>>
>>> I Totally agreed with your thoughts, but according to the reply from 
>>> matt, it seems like we need to consider the P2P/multi GPU situation.
>>
>> When the drm_svm or pagemap component has already code to deal with 
>> that then it is probably ok to have the same interface.
>>
>> When when XE only hacked that together on their own then that is a bit 
>> questionable because getting the lifetime right is usually tricky.
>>
>>>
>>> So do I need to add a AMDGPU_SVM_LOCATION_DEVICE  flag or do I need 
>>> to modify the UAPI to align with xe_svm?
>>
>> I think we need to full clarify how XE works here. E.g. that you can 
>> specify both 0 as well as give the fd to get the memory migrated to 
>> the local device sounds odd.
> 
> Got it, for this part maybe require more discussion and time to fully 
> understand, so this part will remain unchanged in the next version.
> 

Hi Christian, Felix,

Based on the v3 review, I've reworked the SVM UAPI. Please let me know 
if anything still looks off before I post v4.

Changes with v3 UAPI:
   - OP / ATTR_TYPE / ACCESS / LOCATION converted to documented enums.
   - Three ACCESS_* attribute types collapsed into a single
     AMDGPU_SVM_ATTR_ACCESS carrying enum amdgpu_ioctl_svm_access.
   - ACCESS/MIGRATE renamed to ALLOW_MIGRATE to means permitted, not 
required
   - SET_FLAGS / CLR_FLAGS removed; each former flag is now its own
     attribute carrying enum amdgpu_ioctl_svm_flag_value: CLR / SET


   enum amdgpu_ioctl_svm_op {
           AMDGPU_SVM_OP_SET_ATTR = 0,
           AMDGPU_SVM_OP_GET_ATTR = 1,
   };

   enum amdgpu_ioctl_svm_attr_type {
           AMDGPU_SVM_ATTR_PREFERRED_LOC           = 0,
           AMDGPU_SVM_ATTR_PREFETCH_LOC            = 1,
           AMDGPU_SVM_ATTR_ACCESS                  = 2,
           AMDGPU_SVM_ATTR_GRANULARITY             = 3,
           AMDGPU_SVM_ATTR_HOST_ACCESS             = 4,
           AMDGPU_SVM_ATTR_COHERENT                = 5,
           AMDGPU_SVM_ATTR_EXT_COHERENT            = 6,
           AMDGPU_SVM_ATTR_HIVE_LOCAL              = 7,
           AMDGPU_SVM_ATTR_GPU_RO                  = 8,
           AMDGPU_SVM_ATTR_GPU_EXEC                = 9,
           AMDGPU_SVM_ATTR_GPU_READ_MOSTLY         = 10,
           AMDGPU_SVM_ATTR_GPU_ALWAYS_MAPPED       = 11,
   };

   enum amdgpu_ioctl_svm_access {
           AMDGPU_SVM_ACCESS_INACCESSIBLE          = 0,
           AMDGPU_SVM_ACCESS_IN_PLACE              = 1,
           AMDGPU_SVM_ACCESS_ALLOW_MIGRATE         = 2,
   };

   enum amdgpu_ioctl_svm_location {
           AMDGPU_SVM_LOCATION_SYSMEM              = 0,
           AMDGPU_SVM_LOCATION_UNDEFINED           = 0xffffffff,
   };

   enum amdgpu_ioctl_svm_flag_value {
           AMDGPU_SVM_FLAG_CLR                     = 0,
           AMDGPU_SVM_FLAG_SET                     = 1,
   };


Regards,
Honglei

> Regards,
> Honglei
> 
>>
>> Regards,
>> Christian.
>>
>>>
>>> Regards,
>>> Honglei
>>>
>>>>
>>>> Regards,
>>>> Christian.
>>>>
>>>>>
>>>>>>
>>>>>>> +
>>>>>>> +struct drm_amdgpu_svm_attribute {
>>>>>>> +    __u32 type;
>>>>>>> +    __u32 value;
>>>>>>> +};
>>>>>>> +
>>>>>>> +struct drm_amdgpu_gem_svm {
>>>>>>> +    __u64 start_addr;
>>>>>>> +    __u64 size;
>>>>>>> +    __u32 operation;
>>>>>>> +    __u32 nattr;
>>>>>>> +    __u64 attrs_ptr;
>>>>>>> +};
>>>>>>
>>>>>> Those struct make perfect sense but clearly need documentation. 
>>>>>> Preferable as kerneldoc.
>>>>>>
>>>>>> And we usually use unions in this header to separate the input 
>>>>>> from the output parameters.
>>>>>
>>>>> Got it will add documentation for it and will use unions in next 
>>>>> version. Really thanks for the comments.
>>>>>
>>>>> Regards,
>>>>> Honglei
>>>>>
>>>>>>
>>>>>> Regards,
>>>>>> Christian.
>>>>>>
>>>>>>> +
>>>>>>>     #if defined(__cplusplus)
>>>>>>>     }
>>>>>>>     #endif
>>>>>>
>>>>>
>>>>
>>>
>>
> 


  parent reply	other threads:[~2026-04-24 10:12 UTC|newest]

Thread overview: 37+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-04-20 12:07 [RFC V2 00/12] drm/amdgpu: SVM implementation based on drm_gpusvm Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 01/12] drm/amdgpu: add SVM UAPI definitions Honglei Huang
2026-04-20 12:15   ` Christian König
2026-04-20 13:30     ` Huang, Honglei1
2026-04-20 15:37       ` Christian König
2026-04-20 16:06         ` Matthew Brost
2026-04-20 16:28           ` Thomas Hellström
2026-04-20 18:07           ` Christian König
2026-04-21  5:08             ` Matthew Brost
2026-04-21  6:19               ` Christian König
2026-04-21  6:48                 ` Matthew Brost
2026-04-21  7:13                   ` Christian König
2026-04-21  9:52           ` Huang, Honglei1
2026-04-23  6:21         ` Huang, Honglei1
2026-04-23 10:39           ` Christian König
2026-04-23 11:06             ` Huang, Honglei1
2026-04-23 20:02               ` Matthew Brost
2026-04-24 10:20                 ` Huang, Honglei1
2026-04-24 10:12               ` Huang, Honglei1 [this message]
2026-04-27 21:05                 ` Felix Kuehling
2026-04-28  2:24                   ` Huang, Honglei1
2026-04-28  6:49                   ` Christian König
2026-04-28  7:00                     ` Huang, Honglei1
2026-04-21  3:37     ` Kuehling, Felix
2026-04-21  6:39       ` Christian König
2026-04-20 12:07 ` [RFC/POC PATCH 02/12] drm/amdgpu: add SVM data structures and header Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 03/12] drm/amdgpu: add SVM attribute data structures Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 04/12] drm/amdgpu: implement SVM attribute tree operations Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 05/12] drm/amdgpu: implement SVM attribute set Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 06/12] drm/amdgpu: add SVM range data structures Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 07/12] drm/amdgpu: implement SVM range PTE flags and GPU mapping Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 08/12] drm/amdgpu: implement SVM range notifier and invalidation Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 09/12] drm/amdgpu: implement SVM range workers Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 10/12] drm/amdgpu: implement SVM core initialization and fini Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 11/12] drm/amdgpu: implement SVM ioctl and fault handler Honglei Huang
2026-04-20 12:07 ` [RFC/POC PATCH 12/12] drm/amdgpu: wire up SVM build system " Honglei Huang
  -- strict thread matches above, loose matches on Subject: below --
2026-03-17 11:29 [RFC/POC PATCH 00/12] POC SVM implementation in AMDGPU based on drm_gpusvm Honglei Huang
2026-03-17 11:29 ` [RFC/POC PATCH 01/12] drm/amdgpu: add SVM UAPI definitions Honglei Huang

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=9c5afd4b-f9f0-445a-9b39-35f56fa2bea4@amd.com \
    --to=honghuan@amd.com \
    --cc=Alexander.Deucher@amd.com \
    --cc=Felix.Kuehling@amd.com \
    --cc=Jenny-Jing.Liu@amd.com \
    --cc=Junhua.Shen@amd.com \
    --cc=Lingshan.Zhu@amd.com \
    --cc=Oak.Zeng@amd.com \
    --cc=Philip.Yang@amd.com \
    --cc=Ray.Huang@amd.com \
    --cc=Xiaogang.Chen@amd.com \
    --cc=aliceryhl@google.com \
    --cc=amd-gfx@lists.freedesktop.org \
    --cc=christian.koenig@amd.com \
    --cc=dakr@kernel.org \
    --cc=dri-devel@lists.freedesktop.org \
    --cc=matthew.brost@intel.com \
    --cc=rodrigo.vivi@intel.com \
    --cc=thomas.hellstrom@linux.intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox