From: Dave Gordon <david.s.gordon@intel.com>
To: Tvrtko Ursulin <tvrtko.ursulin@linux.intel.com>,
Intel-GFX@Lists.FreeDesktop.Org, "Daniel,
Thomas" <thomas.daniel@intel.com>,
Daniel Vetter <daniel@ffwll.ch>
Subject: Re: [PATCH] drm/i915: Update to post-reset execlist queue clean-up
Date: Fri, 11 Dec 2015 14:14:00 +0000 [thread overview]
Message-ID: <566ADA28.60506@intel.com> (raw)
In-Reply-To: <565D88B3.4040401@linux.intel.com>
On 01/12/15 11:46, Tvrtko Ursulin wrote:
>
> On 23/10/15 18:02, Tomas Elf wrote:
>> When clearing an execlist queue, instead of traversing it and
>> unreferencing all
>> requests while holding the spinlock (which might lead to thread
>> sleeping with
>> IRQs are turned off - bad news!), just move all requests to the retire
>> request
>> list while holding spinlock and then drop spinlock and invoke the
>> execlists
>> request retirement path, which already deals with the intricacies of
>> purging/dereferencing execlist queue requests.
>>
>> This patch can be considered v3 of:
>>
>> commit b96db8b81c54ef30485ddb5992d63305d86ea8d3
>> Author: Tomas Elf <tomas.elf@intel.com>
>> drm/i915: Grab execlist spinlock to avoid post-reset concurrency
>> issues
>>
>> This patch assumes v2 of the above patch is part of the baseline,
>> reverts v2
>> and adds changes on top to turn it into v3.
>>
>> Signed-off-by: Tomas Elf <tomas.elf@intel.com>
>> Cc: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
>> Cc: Chris Wilson <chris@chris-wilson.co.uk>
>> ---
>> drivers/gpu/drm/i915/i915_gem.c | 15 ++++-----------
>> 1 file changed, 4 insertions(+), 11 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/i915/i915_gem.c
>> b/drivers/gpu/drm/i915/i915_gem.c
>> index 2c7a0b7..b492603 100644
>> --- a/drivers/gpu/drm/i915/i915_gem.c
>> +++ b/drivers/gpu/drm/i915/i915_gem.c
>> @@ -2756,20 +2756,13 @@ static void i915_gem_reset_ring_cleanup(struct
>> drm_i915_private *dev_priv,
>>
>> if (i915.enable_execlists) {
>> spin_lock_irq(&ring->execlist_lock);
>> - while (!list_empty(&ring->execlist_queue)) {
>> - struct drm_i915_gem_request *submit_req;
>>
>> - submit_req = list_first_entry(&ring->execlist_queue,
>> - struct drm_i915_gem_request,
>> - execlist_link);
>> - list_del(&submit_req->execlist_link);
>> + /* list_splice_tail_init checks for empty lists */
>> + list_splice_tail_init(&ring->execlist_queue,
>> + &ring->execlist_retired_req_list);
>>
>> - if (submit_req->ctx != ring->default_context)
>> - intel_lr_context_unpin(submit_req);
>> -
>> - i915_gem_request_unreference(submit_req);
>> - }
>> spin_unlock_irq(&ring->execlist_lock);
>> + intel_execlists_retire_requests(ring);
>> }
>>
>> /*
>
> Fallen through the cracks..
>
> This looks to be even more serious, since lockdep notices possible
> deadlock involving vmap_area_lock:
>
> Possible interrupt unsafe locking scenario:
>
> CPU0 CPU1
> ---- ----
> lock(vmap_area_lock);
> local_irq_disable();
> lock(&(&ring->execlist_lock)->rlock);
> lock(vmap_area_lock);
> <Interrupt>
> lock(&(&ring->execlist_lock)->rlock);
>
> *** DEADLOCK ***
>
> Because it unpins LRC context and ringbuffer which ends up in the VM
> code under the execlist_lock.
>
> intel_execlists_retire_requests is slightly different from the code in
> the reset handler because it concerns itself with ctx_obj existence
> which the other one doesn't.
>
> Could people more knowledgeable of this code check if it is OK and R-B?
>
> Regards,
>
> Tvrtko
Hi Tvrtko,
I didn't understand this message at first, I thought you'd found a
problem with this ("v3") patch, but now I see what you actually meant is
that there is indeed a problem with the (v2) that got merged, not the
original question about unreferencing an object while holding a spinlock
(because it can't be the last reference), but rather because of the
unpin, which can indeed cause a problem with a non-i915-defined kernel lock.
So we should certainly update the current (v2) upstream with this.
Thomas Daniel already R-B'd this code on 23rd October, when it was:
[PATCH v3 7/8] drm/i915: Grab execlist spinlock to avoid post-reset
concurrency issues.
and it hasn't changed in substance since then, so you can carry his R-B
over, plus I said on that same day that this was a better solution. So:
Reviewed-by: Thomas Daniel <thomas.daniel@intel.com>
Reviewed-by: Dave Gordon <dave.gordon@intel.com>
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx
next prev parent reply other threads:[~2015-12-11 14:14 UTC|newest]
Thread overview: 68+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-10-08 18:31 [PATCH 0/8] Stability improvements to error state capture Tomas Elf
2015-10-08 18:31 ` [PATCH 1/8] drm/i915: Early exit from semaphore_waits_for for execlist mode Tomas Elf
2015-10-08 18:31 ` [PATCH 2/8] drm/i915: Migrate to safe iterators in error state capture Tomas Elf
2015-10-09 7:49 ` Chris Wilson
2015-10-09 11:38 ` Tomas Elf
2015-10-09 8:27 ` Daniel Vetter
2015-10-09 11:40 ` Tomas Elf
2015-10-13 11:37 ` Daniel Vetter
2015-10-13 11:47 ` Chris Wilson
2015-10-08 18:31 ` [PATCH 3/8] drm/i915: Cope with request list state change during " Tomas Elf
2015-10-09 7:48 ` Chris Wilson
2015-10-09 11:25 ` Tomas Elf
2015-10-13 11:39 ` Daniel Vetter
2015-10-14 11:46 ` Tomas Elf
2015-10-14 12:45 ` Daniel Vetter
2015-10-09 8:28 ` Daniel Vetter
2015-10-09 11:45 ` Tomas Elf
2015-10-13 11:40 ` Daniel Vetter
2015-10-08 18:31 ` [PATCH 4/8] drm/i915: NULL checking when capturing buffer objects " Tomas Elf
2015-10-09 7:49 ` Chris Wilson
2015-10-09 11:34 ` Tomas Elf
2015-10-09 8:32 ` Daniel Vetter
2015-10-09 8:47 ` Chris Wilson
2015-10-09 11:52 ` Tomas Elf
2015-10-09 11:45 ` Tomas Elf
2015-10-08 18:31 ` [PATCH 5/8] drm/i915: vma NULL pointer check Tomas Elf
2015-10-09 7:48 ` Chris Wilson
2015-10-09 11:30 ` Tomas Elf
2015-10-09 11:59 ` Chris Wilson
2015-10-13 11:43 ` Daniel Vetter
2015-10-09 8:33 ` Daniel Vetter
2015-10-09 11:46 ` Tomas Elf
2015-10-08 18:31 ` [PATCH 6/8] drm/i915: Use safe list iterators Tomas Elf
2015-10-09 7:41 ` Chris Wilson
2015-10-09 10:27 ` Tomas Elf
2015-10-09 10:38 ` Chris Wilson
2015-10-09 12:00 ` Tomas Elf
2015-10-08 18:31 ` [PATCH 7/8] drm/i915: Grab execlist spinlock to avoid post-reset concurrency issues Tomas Elf
2015-10-09 7:45 ` Chris Wilson
2015-10-09 10:28 ` Tomas Elf
2015-10-09 8:38 ` Daniel Vetter
2015-10-09 8:45 ` Chris Wilson
2015-10-13 11:46 ` Daniel Vetter
2015-10-13 11:45 ` Chris Wilson
2015-10-13 13:46 ` Daniel Vetter
2015-10-13 14:00 ` Chris Wilson
2015-10-19 15:32 ` [PATCH v2 " Tomas Elf
2015-10-22 16:49 ` Dave Gordon
2015-10-22 17:35 ` Daniel Vetter
2015-10-23 8:42 ` Tvrtko Ursulin
2015-10-23 8:59 ` Daniel Vetter
2015-10-23 11:02 ` Tomas Elf
2015-10-23 12:49 ` Dave Gordon
2015-10-23 13:08 ` [PATCH v3 " Tomas Elf
2015-10-23 14:53 ` Daniel, Thomas
2015-10-23 17:02 ` [PATCH] drm/i915: Update to post-reset execlist queue clean-up Tomas Elf
2015-12-01 11:46 ` Tvrtko Ursulin
2015-12-11 14:14 ` Dave Gordon [this message]
2015-12-11 16:40 ` Daniel Vetter
2015-12-14 10:21 ` Mika Kuoppala
2015-10-08 18:31 ` [PATCH 8/8] drm/i915: NULL check of unpin_work Tomas Elf
2015-10-09 7:46 ` Chris Wilson
2015-10-09 8:39 ` Daniel Vetter
2015-10-09 11:50 ` Tomas Elf
2015-10-09 10:30 ` Tomas Elf
2015-10-09 10:44 ` Chris Wilson
2015-10-09 12:06 ` Tomas Elf
2015-10-13 11:51 ` Daniel Vetter
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=566ADA28.60506@intel.com \
--to=david.s.gordon@intel.com \
--cc=Intel-GFX@Lists.FreeDesktop.Org \
--cc=daniel@ffwll.ch \
--cc=thomas.daniel@intel.com \
--cc=tvrtko.ursulin@linux.intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).