From: "Christian König" <ckoenig.leichtzumerken-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
To: Andrey Grodzovsky
<andrey.grodzovsky-5C7GfCeVMHo@public.gmane.org>,
amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org
Cc: Alexander.Deucher-5C7GfCeVMHo@public.gmane.org,
Hawking.Zhang-5C7GfCeVMHo@public.gmane.org
Subject: Re: [PATCH] drm/amdgpu: Fix compute ring 1.0.0 failure after reset
Date: Fri, 26 Oct 2018 10:05:02 +0200
Message-ID: <c402ce16-e8e3-78ee-3fb2-666d09b0807b@gmail.com>
In-Reply-To: <1540498601-5270-1-git-send-email-andrey.grodzovsky-5C7GfCeVMHo@public.gmane.org>
On 25.10.18 at 22:16, Andrey Grodzovsky wrote:
> Problem: After GPU reset on dGPUs with gfx8 compute ring
> 1.0.0 fails to pass the ring test. Ring registers inspection
> shows that it's active and no hang is observed (rptr == wptr)
> No significant diffs were observed between CP_HQD* registers
> for the ring in good and bad shape.
>
> Fix: No clear reason why but reversing the order of ring tests
> fixes the problem.
>
> Signed-off-by: Andrey Grodzovsky <andrey.grodzovsky@amd.com>
Mhm, maybe try adding a delay before the ring test?
It could be that the rings are started in reverse order as well, and for
some reason the first one is tested too quickly after a reset.
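
To make the timing hypothesis concrete, here is a minimal userspace sketch in
C. It is a toy model only, not amdgpu code: struct sim_ring, sim_init_rings(),
sim_ring_test() and sim_test_rings() are hypothetical names, and the
assumption that ring 0 becomes ready last is just the guess from above, not
something confirmed on hardware. In the driver, the settle delay would be
something like udelay() or msleep() before the test loop.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a compute ring; NOT the amdgpu structure. */
struct sim_ring {
	unsigned int ready_at_us;	/* simulated time at which the ring accepts work */
	bool tested_ok;			/* result of the last simulated ring test */
};

/* Simulated clock in microseconds, advanced only by sim_delay_us(). */
static unsigned int sim_now_us;

static void sim_delay_us(unsigned int us)
{
	sim_now_us += us;
}

/*
 * Assumption: rings are started in reverse order, so the highest index
 * is ready first and ring 0 is ready last. Also resets the clock.
 */
static void sim_init_rings(struct sim_ring *rings, size_t n, unsigned int step_us)
{
	size_t i;

	sim_now_us = 0;
	for (i = 0; i < n; i++) {
		rings[i].ready_at_us = (unsigned int)(n - 1 - i) * step_us;
		rings[i].tested_ok = false;
	}
}

/* Simulated ring test: succeeds only if the ring is already ready. */
static int sim_ring_test(struct sim_ring *ring)
{
	ring->tested_ok = sim_now_us >= ring->ready_at_us;
	return ring->tested_ok ? 0 : -1;
}

/*
 * Test the rings in forward order after an optional settle delay.
 * Returns the number of rings that failed the simulated test.
 */
static int sim_test_rings(struct sim_ring *rings, size_t n, unsigned int settle_us)
{
	int failures = 0;
	size_t i;

	assert(rings != NULL || n == 0);

	sim_delay_us(settle_us);
	for (i = 0; i < n; i++) {
		if (sim_ring_test(&rings[i]))
			failures++;
		sim_delay_us(10);	/* each test takes some simulated time */
	}
	return failures;
}
```

In this model, four rings with a 5 us start-up step fail only on ring 0 when
tested in forward order without a delay, and all pass once the settle delay
covers the slowest ring. If a real delay has the same effect, that would point
at timing rather than at the test order itself.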
Anyway, the patch is Acked-by: Christian König <christian.koenig@amd.com>
Thanks,
Christian.
> ---
> drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c | 6 ++++--
> 1 file changed, 4 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c b/drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c
> index b2e1376..02f8ca5 100644
> --- a/drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c
> +++ b/drivers/gpu/drm/amd/amdgpu/gfx_v8_0.c
> @@ -4811,8 +4811,10 @@ static int gfx_v8_0_kcq_resume(struct amdgpu_device *adev)
>  	if (r)
>  		goto done;
>
> -	/* Test KCQs */
> -	for (i = 0; i < adev->gfx.num_compute_rings; i++) {
> +	/* Test KCQs - reversing the order of rings seems to fix ring test failure
> +	 * after GPU reset
> +	 */
> +	for (i = adev->gfx.num_compute_rings - 1; i >= 0; i--) {
>  		ring = &adev->gfx.compute_ring[i];
>  		r = amdgpu_ring_test_helper(ring);
>  	}
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx