public inbox for amd-gfx@lists.freedesktop.org
 help / color / mirror / Atom feed
From: Mario Limonciello <superm1@kernel.org>
To: "Christian König" <christian.koenig@amd.com>,
	"Mario Limonciello" <mario.limonciello@amd.com>,
	"Liang, Prike" <Prike.Liang@amd.com>,
	"Deucher, Alexander" <Alexander.Deucher@amd.com>,
	"amd-gfx@lists.freedesktop.org" <amd-gfx@lists.freedesktop.org>
Subject: Re: [PATCH] Revert "drm/amdgpu: attach tlb fence to the PTs update"
Date: Fri, 13 Mar 2026 18:58:37 -0500	[thread overview]
Message-ID: <9bbdbe21-010a-4fe3-b480-1a94c55a0ea3@kernel.org> (raw)
In-Reply-To: <2bce8ba2-c36f-4c64-b54e-aeb964a47ebc@amd.com>

Well the original patch was intended for userq but is causing issues on 
systems without userq.  How about just narrowing it down to only userq 
for now until we have a root cause?

On 3/13/26 7:03 AM, Christian König wrote:
> Yeah, but that is still not the root cause.
> 
> Attaching the TLB fence all the time just makes more use of the MES, it doesn't cause any additional problems which wouldn't have been there before.
> 
> Regards,
> Christian.
> 
> On 3/12/26 22:08, Mario Limonciello wrote:
>> There is actually a contingent of two people who claim that this patch is the cause for MES resets here:
>>
>> https://gitlab.freedesktop.org/drm/amd/-/issues/4749
>>
>>
>> On 3/5/2026 3:43 AM, Christian König wrote:
>>> The original reporter already mentioned on the ticket that this patch is not the actual cause of the issues.
>>>
>>> It basically just changes timing to create and eventually wait for the TLB fence to signal.
>>>
>>> Let's see what the reporter finds with his extended bisect.
>>>
>>> Regards,
>>> Christian.
>>>
>>> On 3/5/26 07:48, Liang, Prike wrote:
>>>> [Public]
>>>>
>>>> It’s possible that we failed to save and invalidate some active pages during suspend, which then prevents those pages from being restored correctly on resume.
>>>>
>>>> For now, we still rely on this patch to keep the userq page tables updated and synchronized. Until the full solution is ready, how about we fall back to the initial approach and restrict this TLB flush to only the userq path?
>>>>
>>>> Regards,
>>>>         Prike
>>>>
>>>>> -----Original Message-----
>>>>> From: Koenig, Christian <Christian.Koenig@amd.com>
>>>>> Sent: Wednesday, March 4, 2026 9:57 PM
>>>>> To: Deucher, Alexander <Alexander.Deucher@amd.com>; amd-
>>>>> gfx@lists.freedesktop.org
>>>>> Cc: Liang, Prike <Prike.Liang@amd.com>
>>>>> Subject: Re: [PATCH] Revert "drm/amdgpu: attach tlb fence to the PTs update"
>>>>>
>>>>> On 3/4/26 14:54, Alex Deucher wrote:
>>>>>> This reverts commit f3854e04b708d73276c4488231a8bd66d30b4671.
>>>>>>
>>>>>> This causes framebuffer corruption after suspend.
>>>>>
>>>>> But prevents massive memory corruption with userqueues.
>>>>>
>>>>> I have strong doubts that this is related to the FB corruption in any way, it will just
>>>>> change the timing.
>>>>>
>>>>> Regards,
>>>>> Christian.
>>>>>
>>>>>>
>>>>>> Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4798
>>>>>> Cc: Christian König <christian.koenig@amd.com>
>>>>>> Cc: Prike Liang <Prike.Liang@amd.com>
>>>>>> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
>>>>>> ---
>>>>>>    drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c | 2 +-
>>>>>>    1 file changed, 1 insertion(+), 1 deletion(-)
>>>>>>
>>>>>> diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c
>>>>> b/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c
>>>>>> index 01fef0e4f4085..25b1d679ba262 100644
>>>>>> --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c
>>>>>> +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c
>>>>>> @@ -1073,7 +1073,7 @@ amdgpu_vm_tlb_flush(struct
>>>>> amdgpu_vm_update_params *params,
>>>>>>       }
>>>>>>
>>>>>>       /* Prepare a TLB flush fence to be attached to PTs */
>>>>>> -   if (!params->unlocked) {
>>>>>> +   if (!params->unlocked && vm->is_compute_context) {
>>>>>>               amdgpu_vm_tlb_fence_create(params->adev, vm, fence);
>>>>>>
>>>>>>               /* Makes sure no PD/PT is freed before the flush */
>>>>
>>>
>>>
>>
> 
> 


      reply	other threads:[~2026-03-13 23:58 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-03-04 13:54 [PATCH] Revert "drm/amdgpu: attach tlb fence to the PTs update" Alex Deucher
2026-03-04 13:56 ` Christian König
2026-03-05  6:48   ` Liang, Prike
2026-03-05  9:43     ` Christian König
2026-03-12 21:08       ` Mario Limonciello
2026-03-13 12:03         ` Christian König
2026-03-13 23:58           ` Mario Limonciello [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=9bbdbe21-010a-4fe3-b480-1a94c55a0ea3@kernel.org \
    --to=superm1@kernel.org \
    --cc=Alexander.Deucher@amd.com \
    --cc=Prike.Liang@amd.com \
    --cc=amd-gfx@lists.freedesktop.org \
    --cc=christian.koenig@amd.com \
    --cc=mario.limonciello@amd.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox