From: Tvrtko Ursulin <tvrtko.ursulin@linux.intel.com>
To: Chris Wilson <chris@chris-wilson.co.uk>, intel-gfx@lists.freedesktop.org
Cc: Ben Widawsky <ben@bwidawsk.net>,
Eero Tamminen <eero.t.tamminen@intel.com>
Subject: Re: [PATCH] drm/i915: Increase the busyspin durations for i915_wait_request
Date: Fri, 15 Sep 2017 11:01:02 +0100 [thread overview]
Message-ID: <f438922f-ac0a-fe7e-a1d0-3521363325fd@linux.intel.com> (raw)
In-Reply-To: <20170914095807.16359-1-chris@chris-wilson.co.uk>
On 14/09/2017 10:58, Chris Wilson wrote:
> An interesting discussion regarding "hybrid interrupt polling" for NVMe
> came to the conclusion that the ideal busyspin before sleeping was half
> of the expected request latency (and better if it was already halfway
> through that request). This suggested that we too should look again at
> our tradeoff between spinning and waiting. Currently, our spin simply
> tries to hide the cost of enabling the interrupt, which is good to avoid
> penalising nop requests (i.e. test throughput) and not much else.
> Studying real world workloads suggests that a spin of upto 500us can
What workloads and and power/perf testing?
> dramatically boost performance, but the suggestion is that this is not
> from avoiding interrupt latency per-se, but from secondary effects of
> sleeping such as allowing the CPU reduce cstate and context switch away.
Maybe the second part of the sentence would be clearer if not in a way
in inverted form. Like longer spin = more performance = less sleeping =
less cstate switching? Or just add "but from _avoiding_ secondary
effects of sleeping"?
> To offset those costs from penalising the active client, bump the initial
> spin somewhat to 250us and the secondary spin to 20us to balance the cost
> of another context switch following the interrupt.
>
> Suggested-by: Sagar Kamble <sagar.a.kamble@intel.com>
> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
> Cc: Sagar Kamble <sagar.a.kamble@intel.com>
> Cc: Eero Tamminen <eero.t.tamminen@intel.com>
> Cc: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
> Cc: Ben Widawsky <ben@bwidawsk.net>
> Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
> ---
> drivers/gpu/drm/i915/i915_gem_request.c | 25 +++++++++++++++++++++----
> 1 file changed, 21 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/gpu/drm/i915/i915_gem_request.c b/drivers/gpu/drm/i915/i915_gem_request.c
> index 813a3b546d6e..ccbdaf6a0e4d 100644
> --- a/drivers/gpu/drm/i915/i915_gem_request.c
> +++ b/drivers/gpu/drm/i915/i915_gem_request.c
> @@ -1155,8 +1155,20 @@ long i915_wait_request(struct drm_i915_gem_request *req,
> GEM_BUG_ON(!intel_wait_has_seqno(&wait));
> GEM_BUG_ON(!i915_sw_fence_signaled(&req->submit));
>
> - /* Optimistic short spin before touching IRQs */
> - if (i915_spin_request(req, state, 5))
> + /* Optimistic short spin before touching IRQs.
So it's not short any more. "Optimistic busy spin" ?
> + *
> + * We use a rather large value here to offset the penalty of switching
> + * away from the active task. Frequently, the client will wait upon
> + * an old swapbuffer to throttle itself to remain within a frame of
> + * the gpu. If the client is running in lockstep with the gpu, then
> + * it should not be waiting long at all, and a sleep now will incur
> + * extra scheduler latency in producing the next frame. So we sleep
> + * for longer to try and keep the client running.
> + *
250us sounds quite long and worrying to me.
In the waiting on swapbuffer case, what are the clients waiting for? GPU
rendering to finish or previous vblank or something?
I am thinking if it would be possible to add a special API just for this
sort of waits and internally know how long it is likely to take. So then
decide based on that whether to spin or sleep. Like next vblank is
coming in 5ms, no point in busy spinning or something like that.
Regards,
Tvrtko
> + * We need ~5us to enable the irq, ~20us to hide a context switch,
> + * we use 250us to keep the cache hot.
> + */
> + if (i915_spin_request(req, state, 250))
> goto complete;
>
> set_current_state(state);
> @@ -1212,8 +1224,13 @@ long i915_wait_request(struct drm_i915_gem_request *req,
> __i915_wait_request_check_and_reset(req))
> continue;
>
> - /* Only spin if we know the GPU is processing this request */
> - if (i915_spin_request(req, state, 2))
> + /*
> + * A quick spin now we are on the CPU to offset the cost of
> + * context switching away (and so spin for roughly the same as
> + * the scheduler latency). We only spin if we know the GPU is
> + * processing this request, and so likely to finish shortly.
> + */
> + if (i915_spin_request(req, state, 20))
> break;
>
> if (!intel_wait_check_request(&wait, req)) {
>
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx
next prev parent reply other threads:[~2017-09-15 10:01 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-09-14 9:58 [PATCH] drm/i915: Increase the busyspin durations for i915_wait_request Chris Wilson
2017-09-14 10:18 ` ✓ Fi.CI.BAT: success for " Patchwork
2017-09-14 11:17 ` ✗ Fi.CI.IGT: failure " Patchwork
2017-09-15 9:15 ` [PATCH] " Kamble, Sagar A
2017-09-15 9:23 ` Chris Wilson
2017-09-15 10:01 ` Tvrtko Ursulin [this message]
2017-09-15 10:18 ` Chris Wilson
2017-10-23 9:05 ` Sagar Arun Kamble
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=f438922f-ac0a-fe7e-a1d0-3521363325fd@linux.intel.com \
--to=tvrtko.ursulin@linux.intel.com \
--cc=ben@bwidawsk.net \
--cc=chris@chris-wilson.co.uk \
--cc=eero.t.tamminen@intel.com \
--cc=intel-gfx@lists.freedesktop.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox