Igt-dev Archive on lore.kernel.org
 help / color / mirror / Atom feed
From: Matthew Brost <matthew.brost@intel.com>
To: "Bommu, Krishnaiah" <krishnaiah.bommu@intel.com>
Cc: "Auld, Matthew" <matthew.auld@intel.com>,
	"Bernatowicz, Marcin" <marcin.bernatowicz@linux.intel.com>,
	"igt-dev@lists.freedesktop.org" <igt-dev@lists.freedesktop.org>,
	"Summers, Stuart" <stuart.summers@intel.com>
Subject: Re: [PATCH i-g-t] tests/intel/xe_exec_reset: Skip syncobj_wait during the gt_reset
Date: Thu, 3 Oct 2024 16:08:38 +0000	[thread overview]
Message-ID: <Zv7BhpgImS06j/3m@DUT025-TGLU.fm.intel.com> (raw)
In-Reply-To: <DM4PR11MB52932237D9C02F73FCD9CD6A9D762@DM4PR11MB5293.namprd11.prod.outlook.com>

On Mon, Sep 30, 2024 at 04:18:49AM -0600, Bommu, Krishnaiah wrote:
> 
> 
> > -----Original Message-----
> > From: Auld, Matthew <matthew.auld@intel.com>
> > Sent: Friday, September 27, 2024 3:52 PM
> > To: Bernatowicz, Marcin <marcin.bernatowicz@linux.intel.com>; Bommu,
> > Krishnaiah <krishnaiah.bommu@intel.com>; igt-dev@lists.freedesktop.org
> > Cc: Summers, Stuart <stuart.summers@intel.com>; Brost, Matthew
> > <matthew.brost@intel.com>
> > Subject: Re: [PATCH i-g-t] tests/intel/xe_exec_reset: Skip syncobj_wait during
> > the gt_reset
> > 
> > On 27/09/2024 11:05, Bernatowicz, Marcin wrote:
> > >
> > >
> > > On 9/25/2024 12:31 PM, Bommu Krishnaiah wrote:
> > >> From: "Bommu Krishnaiah" <krishnaiah.bommu@intel.com>
> > >>
> > >> Skipping the syncobj_wait for the workloads which is submitted before
> > >> gt reset, since After gt reset There is no expectation from the
> > >> hardware/GuC/KMD that the workload will then re-execute and complete.
> > >>
> > >> Signed-off-by: Bommu Krishnaiah <krishnaiah.bommu@intel.com>
> > >> Cc: Stuart Summers <stuart.summers@intel.com>
> > >> ---
> > >>   tests/intel/xe_exec_reset.c | 8 +++++---
> > >>   1 file changed, 5 insertions(+), 3 deletions(-)
> > >>
> > >> diff --git a/tests/intel/xe_exec_reset.c
> > >> b/tests/intel/xe_exec_reset.c index b5d5f43ea..b1a7548c6 100644
> > >> --- a/tests/intel/xe_exec_reset.c
> > >> +++ b/tests/intel/xe_exec_reset.c
> > >> @@ -263,8 +263,9 @@ test_balancer(int fd, int gt, int class, int
> > >> n_exec_queues, int n_execs,
> > >>       }
> > >>       for (i = 0; i < n_exec_queues && n_execs; i++)
> > >> -        igt_assert(syncobj_wait(fd, &syncobjs[i], 1, INT64_MAX, 0,
> > >> -                    NULL));
> > >> +        if (!(flags & GT_RESET))
> > >> +            igt_assert(syncobj_wait(fd, &syncobjs[i], 1, INT64_MAX,
> > >
> > > What happens when the user waits on syncobj in case of GT reset ?
> > > Maybe there is no expectation that there will be re-execute, but
> > > shouldn't the syncobj be notified or a timeout hit ?
> > 
> > Yeah, this sounds like KMD bug. Expectation is that dma fences should
> > eventually signal no matter what, and in a reasonable amount of time.
> > 
> > Possibly relevant fix (very recently merged):
> > https://patchwork.freedesktop.org/patch/605681/?series=136463&rev=1

Matt Auld is correct here, this is a KMD bug not a test bug.

I thought the above patch would have fixed this problem.

> 
> I verified with this(https://patchwork.freedesktop.org/patch/605681/?series=136463&rev=1) patch, still I see the failure with this patch also.
> 

This is unfortunate. Seems like we still have some KMD issue here. I
just chatted with Himal about this and gave him a bit a direction.

If this persists and easy to reproduce perhaps I can jump in to take a
look in a few days if needed. Corner case submission issues are pretty
difficult to debug and happy to help if needed.

Matt 

> Regards,
> Krishna.
> 
> > 
> > >
> > >> +                        0, NULL));
> > >>       igt_assert(syncobj_wait(fd, &sync[0].handle, 1, INT64_MAX, 0,
> > >> NULL));
> > >>       sync[0].flags |= DRM_XE_SYNC_FLAG_SIGNAL; @@ -410,7 +411,8 @@
> > >> test_legacy_mode(int fd, struct drm_xe_engine_class_instance *eci,
> > >>       }
> > >>       for (i = 0; i < n_exec_queues && n_execs; i++)
> > >> -        igt_assert(syncobj_wait(fd, &syncobjs[i], 1, INT64_MAX, 0,
> > >> +        if (!(flags & GT_RESET))
> > >> +            igt_assert(syncobj_wait(fd, &syncobjs[i], 1, INT64_MAX,
> > >> +0,
> > >>                       NULL));
> > >>       igt_assert(syncobj_wait(fd, &sync[0].handle, 1, INT64_MAX, 0,
> > >> NULL));
> > >

      reply	other threads:[~2024-10-03 16:08 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-09-25 10:31 [PATCH i-g-t] tests/intel/xe_exec_reset: Skip syncobj_wait during the gt_reset Bommu Krishnaiah
2024-09-26  5:18 ` ✗ Fi.CI.BAT: failure for " Patchwork
2024-09-26  5:24 ` ✓ CI.xeBAT: success " Patchwork
2024-09-26 17:03 ` ✗ CI.xeFULL: failure " Patchwork
2024-09-27 10:05 ` [PATCH i-g-t] " Bernatowicz, Marcin
2024-09-27 10:22   ` Matthew Auld
2024-09-30 10:18     ` Bommu, Krishnaiah
2024-10-03 16:08       ` Matthew Brost [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=Zv7BhpgImS06j/3m@DUT025-TGLU.fm.intel.com \
    --to=matthew.brost@intel.com \
    --cc=igt-dev@lists.freedesktop.org \
    --cc=krishnaiah.bommu@intel.com \
    --cc=marcin.bernatowicz@linux.intel.com \
    --cc=matthew.auld@intel.com \
    --cc=stuart.summers@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox