* [PATCH v2] drm/amdkfd: Don't clear PT after process killed
@ 2025-10-31 15:14 Philip Yang
2025-11-03 10:49 ` Christian König
0 siblings, 1 reply; 2+ messages in thread
From: Philip Yang @ 2025-10-31 15:14 UTC (permalink / raw)
To: amd-gfx; +Cc: Felix.Kuehling, christian.koenig, Gang.Ba, Philip Yang
If process is killed. the vm entity is stopped, submit pt update job
will trigger the error message "*ERROR* Trying to push to a killed
entity", job will not execute.
Suggested-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Philip Yang <Philip.Yang@amd.com>
---
drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c | 4 ++++
1 file changed, 4 insertions(+)
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
index 96ccd5ade031..b1c24c8fa686 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
@@ -1274,6 +1274,10 @@ static int unmap_bo_from_gpuvm(struct kgd_mem *mem,
(void)amdgpu_vm_bo_unmap(adev, bo_va, entry->va);
+ /* VM entity stopped if process killed, don't clear freed pt bo */
+ if (!amdgpu_vm_ready(vm))
+ return 0;
+
(void)amdgpu_vm_clear_freed(adev, vm, &bo_va->last_pt_update);
(void)amdgpu_sync_fence(sync, bo_va->last_pt_update, GFP_KERNEL);
--
2.49.0
^ permalink raw reply related [flat|nested] 2+ messages in thread
* Re: [PATCH v2] drm/amdkfd: Don't clear PT after process killed
2025-10-31 15:14 [PATCH v2] drm/amdkfd: Don't clear PT after process killed Philip Yang
@ 2025-11-03 10:49 ` Christian König
0 siblings, 0 replies; 2+ messages in thread
From: Christian König @ 2025-11-03 10:49 UTC (permalink / raw)
To: Philip Yang, amd-gfx; +Cc: Felix.Kuehling, Gang.Ba
On 10/31/25 16:14, Philip Yang wrote:
> If process is killed. the vm entity is stopped, submit pt update job
> will trigger the error message "*ERROR* Trying to push to a killed
> entity", job will not execute.
>
> Suggested-by: Christian König <christian.koenig@amd.com>
> Signed-off-by: Philip Yang <Philip.Yang@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
> ---
> drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c | 4 ++++
> 1 file changed, 4 insertions(+)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
> index 96ccd5ade031..b1c24c8fa686 100644
> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
> @@ -1274,6 +1274,10 @@ static int unmap_bo_from_gpuvm(struct kgd_mem *mem,
>
> (void)amdgpu_vm_bo_unmap(adev, bo_va, entry->va);
>
> + /* VM entity stopped if process killed, don't clear freed pt bo */
> + if (!amdgpu_vm_ready(vm))
> + return 0;
> +
> (void)amdgpu_vm_clear_freed(adev, vm, &bo_va->last_pt_update);
>
> (void)amdgpu_sync_fence(sync, bo_va->last_pt_update, GFP_KERNEL);
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2025-11-03 10:49 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-10-31 15:14 [PATCH v2] drm/amdkfd: Don't clear PT after process killed Philip Yang
2025-11-03 10:49 ` Christian König
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox