From: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
To: phasta@kernel.org, amd-gfx@lists.freedesktop.org,
dri-devel@lists.freedesktop.org
Cc: kernel-dev@igalia.com,
"Christian König" <christian.koenig@amd.com>,
"Danilo Krummrich" <dakr@kernel.org>,
"Matthew Brost" <matthew.brost@intel.com>,
"Pierre-Eric Pelloux-Prayer" <pierre-eric.pelloux-prayer@amd.com>
Subject: Re: [PATCH 03/28] drm/sched: Add some more scheduling quality unit tests
Date: Sat, 11 Oct 2025 14:21:06 +0100 [thread overview]
Message-ID: <64149773-e4b2-4874-8afc-43dffa63a5a2@igalia.com> (raw)
In-Reply-To: <ffe8556a939fac3edaf9c7007c3c4b5cf1c7c74d.camel@mailbox.org>
On 10/10/2025 10:48, Philipp Stanner wrote:
> On Wed, 2025-10-08 at 09:53 +0100, Tvrtko Ursulin wrote:
>> This time round we explore the rate of submitted job queue processing
>> with multiple identical parallel clients.
>>
>> Example test output:
>>
>> 3 clients:
>> t cycle: min avg max : ...
>> + 0ms 0 0 0 : 0 0 0
>> + 102ms 2 2 2 : 2 2 2
>> + 208ms 5 6 6 : 6 5 5
>> + 310ms 8 9 9 : 9 9 8
>> ...
>> + 2616ms 82 83 83 : 83 83 82
>> + 2717ms 83 83 83 : 83 83 83
>> avg_max_min_delta(x100)=60
>>
>> Every 100ms for the duration of the test test logs how many jobs each
>> client had completed, prefixed by minimum, average and maximum numbers.
>> When finished overall average delta between max and min is output as a
>> rough indicator to scheduling fairness.
>>
>> Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
>> Cc: Christian König <christian.koenig@amd.com>
>> Cc: Danilo Krummrich <dakr@kernel.org>
>> Cc: Matthew Brost <matthew.brost@intel.com>
>> Cc: Philipp Stanner <phasta@kernel.org>
>> Cc: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>
>> Acked-by: Christian König <christian.koenig@amd.com>
>> ---
>> .../gpu/drm/scheduler/tests/tests_scheduler.c | 186 +++++++++++++++++-
>> 1 file changed, 185 insertions(+), 1 deletion(-)
>>
>> diff --git a/drivers/gpu/drm/scheduler/tests/tests_scheduler.c b/drivers/gpu/drm/scheduler/tests/tests_scheduler.c
>> index c66c151a66d2..77b02c5e8d52 100644
>> --- a/drivers/gpu/drm/scheduler/tests/tests_scheduler.c
>> +++ b/drivers/gpu/drm/scheduler/tests/tests_scheduler.c
>> @@ -195,6 +195,7 @@ struct drm_sched_client_params {
>>
>> struct drm_sched_test_params {
>> const char *description;
>> + unsigned int num_clients;
>> struct drm_sched_client_params client[2];
>> };
>>
>> @@ -689,6 +690,189 @@ static struct kunit_suite drm_sched_scheduler_two_clients2 = {
>> .test_cases = drm_sched_scheduler_two_clients_tests,
>> };
>>
>> +
>> +static const struct drm_sched_test_params drm_sched_many_cases[] = {
>> + {
>> + .description = "2 clients",
>> + .num_clients = 2,
>> + .client[0] = {
>> + .priority = DRM_SCHED_PRIORITY_NORMAL,
>> + .job_cnt = 4,
>> + .job_us = 1000,
>> + .wait_us = 0,
>> + .sync = true,
>> + },
>> + },
>> + {
>> + .description = "3 clients",
>> + .num_clients = 3,
>> + .client[0] = {
>> + .priority = DRM_SCHED_PRIORITY_NORMAL,
>> + .job_cnt = 4,
>> + .job_us = 1000,
>> + .wait_us = 0,
>> + .sync = true,
>> + },
>> + },
>> + {
>> + .description = "7 clients",
>> + .num_clients = 7,
>> + .client[0] = {
>> + .priority = DRM_SCHED_PRIORITY_NORMAL,
>> + .job_cnt = 4,
>> + .job_us = 1000,
>> + .wait_us = 0,
>> + .sync = true,
>> + },
>> + },
>> + {
>> + .description = "13 clients",
>> + .num_clients = 13,
>> + .client[0] = {
>> + .priority = DRM_SCHED_PRIORITY_NORMAL,
>> + .job_cnt = 4,
>> + .job_us = 1000,
>> + .wait_us = 0,
>> + .sync = true,
>> + },
>> + },
>> + {
>> + .description = "31 clients",
>> + .num_clients = 31,
>> + .client[0] = {
>> + .priority = DRM_SCHED_PRIORITY_NORMAL,
>> + .job_cnt = 2,
>> + .job_us = 1000,
>> + .wait_us = 0,
>> + .sync = true,
>> + },
>> + },
>> +};
>> +
>> +KUNIT_ARRAY_PARAM(drm_sched_scheduler_many_clients,
>> + drm_sched_many_cases,
>> + drm_sched_desc);
>> +
>> +static void drm_sched_scheduler_many_clients_test(struct kunit *test)
>> +{
>> + const struct drm_sched_test_params *params = test->param_value;
>> + struct drm_mock_scheduler *sched = test->priv;
>> + const unsigned int clients = params->num_clients;
>> + unsigned int i, j, delta_total = 0, loops = 0;
>> + struct test_client *client;
>> + unsigned int *prev_cycle;
>> + ktime_t start;
>> + char *buf;
>> +
>> + /*
>> + * Many clients with deep-ish async queues.
>> + */
>> +
>> + buf = kunit_kmalloc(test, PAGE_SIZE, GFP_KERNEL);
>> + client = kunit_kcalloc(test, clients, sizeof(*client), GFP_KERNEL);
>> + prev_cycle = kunit_kcalloc(test, clients, sizeof(*prev_cycle),
>> + GFP_KERNEL);
>
> No error handling necessary??
Ha, fixed. I probably got confused thinking kunit does it for us.
>> +
>> + for (i = 0; i < clients; i++)
>> + client[i].entity =
>> + drm_mock_sched_entity_new(test,
>> + DRM_SCHED_PRIORITY_NORMAL,
>> + sched);
>> +
>> + for (i = 0; i < clients; i++) {
>> + client[i].test = test;
>> + client[i].id = i;
>> + client[i].params = params->client[0];
>> + client[i].duration = ms_to_ktime(1000 / clients);
>> + client[i].cycle_time.min_us = ~0U;
>> + client[i].latency_time.min_us = ~0U;
>> + client[i].worker =
>> + kthread_create_worker(0, "%s-%u", __func__, i);
>> + if (IS_ERR(client[i].worker)) {
>> + for (j = 0; j < i; j++)
>> + kthread_destroy_worker(client[j].worker);
>> + KUNIT_FAIL(test, "Failed to create worker!\n");
>> + }
>> +
>> + kthread_init_work(&client[i].work, drm_sched_client_work);
>> + }
>> +
>> + for (i = 0; i < clients; i++)
>> + kthread_queue_work(client[i].worker, &client[i].work);
>> +
>> + start = ktime_get();
>> + pr_info("%u clients:\n\tt\t\tcycle:\t min avg max : ...\n", clients);
>> + for (;;) {
>> + unsigned int min = ~0;
>
> Why is min initialized to UINT_MAX?
So that "if (val < min) min = val" works.
>> + unsigned int max = 0;
>> + unsigned int total = 0;
>> + bool done = true;
>> + char pbuf[16];
>> +
>> + memset(buf, 0, PAGE_SIZE);
>> + for (i = 0; i < clients; i++) {
>> + unsigned int cycle, cycles;
>> +
>> + cycle = READ_ONCE(client[i].cycle);
>> + cycles = READ_ONCE(client[i].cycles);
>
> I think I had asked why READ_ONCE is necessary. It's not super obvious.
Those values are updated in a different thread, and even though I don't
think compiler can omit those with the kernel settings, I like to use
READ_ONCE/WRITE_ONCE pairs for documentation. I added a comment.
Regards,
Tvrtko
>
>
> P.
>
>> +
>> + snprintf(pbuf, sizeof(pbuf), " %3d", cycle);
>> + strncat(buf, pbuf, PAGE_SIZE);
>> +
>> + total += cycle;
>> + if (cycle < min)
>> + min = cycle;
>> + if (cycle > max)
>> + max = cycle;
>> +
>> + if (!min || (cycle + 1) < cycles)
>> + done = false;
>> + }
>> +
>> + loops++;
>> + delta_total += max - min;
>> +
>> + pr_info("\t+%6lldms\t\t %3u %3u %3u :%s\n",
>> + ktime_to_ms(ktime_sub(ktime_get(), start)),
>> + min, DIV_ROUND_UP(total, clients), max, buf);
>> +
>> + if (done)
>> + break;
>> +
>> + msleep(100);
>> + }
>> +
>> + pr_info(" avg_max_min_delta(x100)=%u\n",
>> + loops ? DIV_ROUND_UP(delta_total * 100, loops) : 0);
>> +
>> + for (i = 0; i < clients; i++) {
>> + kthread_flush_work(&client[i].work);
>> + kthread_destroy_worker(client[i].worker);
>> + }
>> +
>> + for (i = 0; i < clients; i++)
>> + drm_mock_sched_entity_free(client[i].entity);
>> +}
>> +
>> +static const struct kunit_attributes drm_sched_scheduler_many_clients_attr = {
>> + .speed = KUNIT_SPEED_SLOW,
>> +};
>> +
>> +static struct kunit_case drm_sched_scheduler_many_clients_tests[] = {
>> + KUNIT_CASE_PARAM_ATTR(drm_sched_scheduler_many_clients_test,
>> + drm_sched_scheduler_many_clients_gen_params,
>> + drm_sched_scheduler_many_clients_attr),
>> + {}
>> +};
>> +
>> +static struct kunit_suite drm_sched_scheduler_many_clients = {
>> + .name = "drm_sched_scheduler_many_clients_tests",
>> + .init = drm_sched_scheduler_init2,
>> + .exit = drm_sched_scheduler_exit,
>> + .test_cases = drm_sched_scheduler_many_clients_tests,
>> +};
>> +
>> kunit_test_suites(&drm_sched_scheduler_overhead,
>> &drm_sched_scheduler_two_clients1,
>> - &drm_sched_scheduler_two_clients2);
>> + &drm_sched_scheduler_two_clients2,
>> + &drm_sched_scheduler_many_clients);
>
next prev parent reply other threads:[~2025-10-11 13:21 UTC|newest]
Thread overview: 76+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-10-08 8:53 [PATCH 00/28] Fair DRM scheduler Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 01/28] drm/sched: Reverse drm_sched_rq_init arguments Tvrtko Ursulin
2025-10-10 8:55 ` Philipp Stanner
2025-10-10 9:46 ` Tvrtko Ursulin
2025-10-10 10:36 ` Philipp Stanner
2025-10-11 13:21 ` Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 02/28] drm/sched: Add some scheduling quality unit tests Tvrtko Ursulin
2025-10-10 9:38 ` Philipp Stanner
2025-10-11 13:09 ` Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 03/28] drm/sched: Add some more " Tvrtko Ursulin
2025-10-10 9:48 ` Philipp Stanner
2025-10-11 13:21 ` Tvrtko Ursulin [this message]
2025-10-08 8:53 ` [PATCH 04/28] drm/sched: Implement RR via FIFO Tvrtko Ursulin
2025-10-10 10:18 ` Philipp Stanner
2025-10-11 13:30 ` Tvrtko Ursulin
2025-10-14 6:40 ` Philipp Stanner
2025-10-08 8:53 ` [PATCH 05/28] drm/sched: Consolidate entity run queue management Tvrtko Ursulin
2025-10-10 10:49 ` Philipp Stanner
2025-10-11 14:19 ` Tvrtko Ursulin
2025-10-14 6:53 ` Philipp Stanner
2025-10-14 7:26 ` Tvrtko Ursulin
2025-10-14 8:52 ` Philipp Stanner
2025-10-14 10:04 ` Tvrtko Ursulin
2025-10-14 11:23 ` Philipp Stanner
2025-10-08 8:53 ` [PATCH 06/28] drm/sched: Move run queue related code into a separate file Tvrtko Ursulin
2025-10-08 22:49 ` Matthew Brost
2025-10-08 8:53 ` [PATCH 07/28] drm/sched: Free all finished jobs at once Tvrtko Ursulin
2025-10-08 22:48 ` Matthew Brost
2025-10-08 8:53 ` [PATCH 08/28] drm/sched: Account entity GPU time Tvrtko Ursulin
2025-10-10 12:22 ` Philipp Stanner
2025-10-11 14:56 ` Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 09/28] drm/sched: Remove idle entity from tree Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 10/28] drm/sched: Add fair scheduling policy Tvrtko Ursulin
2025-10-14 10:27 ` Philipp Stanner
2025-10-14 12:56 ` Tvrtko Ursulin
2025-10-14 14:02 ` Philipp Stanner
2025-10-14 14:32 ` Simona Vetter
2025-10-14 14:58 ` Tvrtko Ursulin
2025-10-16 7:06 ` Philipp Stanner
2025-10-16 8:42 ` Tvrtko Ursulin
2025-10-16 9:50 ` Danilo Krummrich
2025-10-16 10:54 ` Tvrtko Ursulin
2025-10-16 11:14 ` Danilo Krummrich
2025-10-08 8:53 ` [PATCH 11/28] drm/sched: Favour interactive clients slightly Tvrtko Ursulin
2025-10-14 10:53 ` Philipp Stanner
2025-10-14 12:20 ` Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 12/28] drm/sched: Switch default policy to fair Tvrtko Ursulin
2025-10-10 12:56 ` Philipp Stanner
2025-10-08 8:53 ` [PATCH 13/28] drm/sched: Remove FIFO and RR and simplify to a single run queue Tvrtko Ursulin
2025-10-14 11:16 ` Philipp Stanner
2025-10-14 13:16 ` Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 14/28] drm/sched: Embed run queue singleton into the scheduler Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 15/28] accel/amdxdna: Remove drm_sched_init_args->num_rqs usage Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 16/28] accel/rocket: " Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 17/28] drm/amdgpu: " Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 18/28] drm/etnaviv: " Tvrtko Ursulin
2025-10-08 10:31 ` Christian Gmeiner
2025-10-08 8:53 ` [PATCH 19/28] drm/imagination: " Tvrtko Ursulin
2025-10-10 14:29 ` Matt Coster
2025-10-08 8:53 ` [PATCH 20/28] drm/lima: " Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 21/28] drm/msm: " Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 22/28] drm/nouveau: " Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 23/28] drm/panfrost: " Tvrtko Ursulin
2025-10-08 14:55 ` Steven Price
2025-10-08 8:53 ` [PATCH 24/28] drm/panthor: " Tvrtko Ursulin
2025-10-08 14:55 ` Steven Price
2025-10-10 10:02 ` Liviu Dudau
2025-10-08 8:53 ` [PATCH 25/28] drm/sched: " Tvrtko Ursulin
2025-10-08 22:44 ` Matthew Brost
2025-10-08 8:53 ` [PATCH 26/28] drm/v3d: " Tvrtko Ursulin
2025-10-10 14:15 ` Melissa Wen
2025-10-08 8:53 ` [PATCH 27/28] drm/xe: " Tvrtko Ursulin
2025-10-08 8:53 ` [PATCH 28/28] drm/sched: Remove drm_sched_init_args->num_rqs Tvrtko Ursulin
2025-10-10 13:00 ` Philipp Stanner
2025-10-11 14:58 ` Tvrtko Ursulin
2025-10-10 8:59 ` [PATCH 00/28] Fair DRM scheduler Philipp Stanner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=64149773-e4b2-4874-8afc-43dffa63a5a2@igalia.com \
--to=tvrtko.ursulin@igalia.com \
--cc=amd-gfx@lists.freedesktop.org \
--cc=christian.koenig@amd.com \
--cc=dakr@kernel.org \
--cc=dri-devel@lists.freedesktop.org \
--cc=kernel-dev@igalia.com \
--cc=matthew.brost@intel.com \
--cc=phasta@kernel.org \
--cc=pierre-eric.pelloux-prayer@amd.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox