Igt-dev Archive on lore.kernel.org
 help / color / mirror / Atom feed
From: "Welty, Brian" <brian.welty@intel.com>
To: Kamil Konieczny <kamil.konieczny@linux.intel.com>,
	"igt-dev@lists.freedesktop.org" <igt-dev@lists.freedesktop.org>,
	"Cavitt, Jonathan" <jonathan.cavitt@intel.com>,
	"Gupta, saurabhg" <saurabhg.gupta@intel.com>,
	"Mistat, Tomasz" <tomasz.mistat@intel.com>,
	"Girotra, Himanshu" <himanshu.girotra@intel.com>
Subject: Re: [PATCH i-g-t 1/1] lib/xe/xe_query: Wait for xe_supports_faults
Date: Wed, 8 May 2024 18:27:59 -0700	[thread overview]
Message-ID: <662f9756-b04d-462a-893a-8d0c4385c552@intel.com> (raw)
In-Reply-To: <20240508171127.cjl3ty7vra5fsszj@kamilkon-DESK.igk.intel.com>



On 5/8/2024 10:11 AM, Kamil Konieczny wrote:
> Hi,
> On 2024-05-08 at 16:43:04 +0000, Cavitt, Jonathan wrote:
>> -----Original Message-----
>> From: Kamil Konieczny <kamil.konieczny@linux.intel.com>
>> Sent: Wednesday, May 8, 2024 9:24 AM
>> To: igt-dev@lists.freedesktop.org
>> Cc: Cavitt, Jonathan <jonathan.cavitt@intel.com>; Gupta, saurabhg <saurabhg.gupta@intel.com>; Welty, Brian <brian.welty@intel.com>; Mistat, Tomasz <tomasz.mistat@intel.com>; Girotra, Himanshu <himanshu.girotra@intel.com>
>> Subject: Re: [PATCH i-g-t 1/1] lib/xe/xe_query: Wait for xe_supports_faults
>>>
>>> Hi Jonathan,
>>> On 2024-05-03 at 12:37:14 -0700, Jonathan Cavitt wrote:
>>>> It's possible for xe_supports_faults to return false if the system is
>>>> busy with multiple running tests.  This is because the check looks for
>>>> all active VMs and searches for VMs that do not have faults enabled,
>>>> returning false if any exist.  Recently, this check has been changed to
>>>> return EBUSY when the check fails in this way, so wait for up to ten
>>>> seconds for all the active VMs to flush out before proceeding.
>>>>
>>>> Suggested-by: Brian Welty <brian.welty@intel.com>
>>>> Signed-off-by: Jonathan Cavitt <jonathan.cavitt@intel.com>
>>>> ---
>>>>   lib/xe/xe_query.c | 9 +++++++--
>>>>   1 file changed, 7 insertions(+), 2 deletions(-)
>>>>
>>>> diff --git a/lib/xe/xe_query.c b/lib/xe/xe_query.c
>>>> index 6df8f42649..5458c73417 100644
>>>> --- a/lib/xe/xe_query.c
>>>> +++ b/lib/xe/xe_query.c
>>>> @@ -314,8 +314,13 @@ bool xe_supports_faults(int fd)
>>>>   		.flags = DRM_XE_VM_CREATE_FLAG_LR_MODE |
>>>>   			 DRM_XE_VM_CREATE_FLAG_FAULT_MODE,
>>>>   	};
>>>> -
>>>> -	supports_faults = !igt_ioctl(fd, DRM_IOCTL_XE_VM_CREATE, &create);
>>>> +	struct timespec tv = {};
>>>> +	int result, timeout;
>>>> +	do {
>>>> +		result = igt_ioctl(fd, DRM_IOCTL_XE_VM_CREATE, &create);
>>>> +		supports_faults = !result;
>>>> +		timeout = igt_seconds_elapsed(&tv);
>>>> +	} while (result == -EBUSY && timeout < 10);
>>> -------------------------------- ^^^^^^^^^^^^
>>>
>>> Waiting any number of seconds in library function is way too much,
>>> imho this is ok in test itself, not on lib.
>>
>> Is the suggestion here that we should perform the wait "on" xe_supports_faults,
>> rather than "in" xe_supports_faults?  I.E. that we do something like this instead
>> in xe_exec_fault_mode.c:
>>
>>          igt_fixture {
>>                  bool supports_faults;
>>                  struct timespec tv = {};
>>                  fd = drm_open_driver(DRIVER_XE);
>>                  do {
>>                          supports_faults = xe_supports_faults(fd);
>>                  } while (!supports_faults && igt_seconds_elapsed(&tv) < 10);
>>                  igt_require(supports_faults);
>>          }
>>
>> I can do this, though if xe_supports_faults returns false for any non-EBUSY related
>> reasons, we won't be able to detect it from here and we'll spend ten seconds waiting
>> for xe_supports_faults to return true when it strictly cannot.  Is this an acceptable
>> tradeoff?
>> -Jonathan Cavitt
>>
> 
> If you are doing this just after igt_main you do not expect
> it will return EBUSY. Other way would be to return errno from
> lib function and let subtest decide if it want to wait.
> There you could also account for simulation, where it can
> take longer. Btw is there any sysfs param for it?
> 
> Regards,
> Kamil
> 

Problem is the issue is that 2 VMs cannot be created in the 2 different 
modes.
Cannot have one VM in 'fault mode' at same time as one is in 'non-fault 
mode'.   So the correct place for the check is not even inside 
xe_supports_fault_mode(), but in xe_vm_create().

As every single IGT test that creates an IGT can fail with -EBUSY from
xe_vm_create() as creating 2 such  VMs is not allowed.  Whoever tries to 
create the VM second loses (fails).

Maybe a good first step is to have xe_vm_create() raise a SKIP on -EBUSY 
errors?

Maybe a good second step is to figure when and where adding a 
timeout+retry might make sense.....   maybe since this is a shard run (I 
think?),  this test can be attempted again later in the batch of tests?

-Brian




>>>
>>> Regards,
>>> Kamil
>>>
>>>>   
>>>>   	if (supports_faults)
>>>>   		xe_vm_destroy(fd, create.vm_id);
>>>> -- 
>>>> 2.25.1
>>>>
>>>

  reply	other threads:[~2024-05-09  1:28 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-05-03 19:37 [PATCH i-g-t 0/1] lib/xe/xe_query: Wait for xe_supports_faults Jonathan Cavitt
2024-05-03 19:37 ` [PATCH i-g-t 1/1] " Jonathan Cavitt
2024-05-08 16:23   ` Kamil Konieczny
2024-05-08 16:43     ` Cavitt, Jonathan
2024-05-08 17:11       ` Kamil Konieczny
2024-05-09  1:27         ` Welty, Brian [this message]
2024-05-09 14:27           ` Cavitt, Jonathan
2024-05-03 20:31 ` ✓ Fi.CI.BAT: success for " Patchwork
2024-05-03 20:47 ` ✓ CI.xeBAT: " Patchwork
2024-05-03 23:16 ` ✗ CI.xeFULL: failure " Patchwork
2024-05-04  7:50 ` ✗ Fi.CI.IGT: " Patchwork
  -- strict thread matches above, loose matches on Subject: below --
2024-05-08 19:35 [PATCH i-g-t 0/1] " Jonathan Cavitt
2024-05-08 19:35 ` [PATCH i-g-t 1/1] " Jonathan Cavitt
2024-05-09 16:20   ` Kamil Konieczny
2024-05-09 16:30     ` Cavitt, Jonathan
2024-05-10 15:58       ` Kamil Konieczny

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=662f9756-b04d-462a-893a-8d0c4385c552@intel.com \
    --to=brian.welty@intel.com \
    --cc=himanshu.girotra@intel.com \
    --cc=igt-dev@lists.freedesktop.org \
    --cc=jonathan.cavitt@intel.com \
    --cc=kamil.konieczny@linux.intel.com \
    --cc=saurabhg.gupta@intel.com \
    --cc=tomasz.mistat@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox