From: Jesse Barnes <jbarnes@virtuousgeek.org>
To: imre.deak@intel.com
Cc: intel-gfx@lists.freedesktop.org
Subject: Re: [PATCH 1/2] drm/i915/bxt: work around HW coherency issue when accessing GPU seqno
Date: Wed, 10 Jun 2015 08:33:26 -0700 [thread overview]
Message-ID: <557858C6.5040304@virtuousgeek.org> (raw)
In-Reply-To: <1433950018.25216.94.camel@intel.com>
On 06/10/2015 08:26 AM, Imre Deak wrote:
> On ke, 2015-06-10 at 08:10 -0700, Jesse Barnes wrote:
>> On 06/10/2015 03:59 AM, Imre Deak wrote:
>>> I think the discussion here is about two separate things:
>>> 1. Possible ordering issue between the seqno store and the completion
>>> interrupt
>>> 2. Coherency issue that leaves the CPU with a stale view of the seqno
>>> indefinitely, which this patch works around
>>>
>>> I'm confident that in my case the problem is not due to ordering. If it
>>> was "only" ordering then the value would show up eventually. This is not
>>> the case though, __wait_for_request will see the stale value
>>> indefinitely even though it gets woken up periodically afterwards by the
>>> lost IRQ logic (with hangcheck disabled).
>>
>> Yeah, based on your workaround it sounds like the write from the CS is
>> landing in memory but failing to invalidate the associated CPU
>> cacheline. I assume mapping the HWSP as uncached also works around this
>> issue?
>
> I assume it would, but it would of course have a bigger overhead. Based
> on my testing the coherency problem happens only occasionally, so for
> the rest of the times we still would want to benefit from cached reads.
> See especially __i915_spin_request().
Yeah, pretty sure we want it cached given how often we read from it. I
was just curious if the UC mapping would address this just to narrow
things down even further.
>> I wonder if this is just an issue with GGTT mappings on BXT. If we had
>> per context HSWPs using PPGTT (and maybe even 48 bit PPGTT) mappings,
>> the snoop may be performed correctly... Looks like we don't have a
>> store_dword variant for explicit coherent or incoherent buffer writes
>> (though it does test PPGTT writes at least); that would make this easy
>> to test.
>
> Well, I tried to play around with the GGTT PTE/PAT flags to see if we're
> not setting something or there is a bit that is significant despite the
> specification. The spec says it's only the snoop flag that matters and
> everything maps to PAT index 0. That looks correct in our code. In any
> case I would expect that the MI_STORE_DATA_IMM would be coherent since
> this is explicitly stated in the spec.
>
> I also added a new testcase to gem_storedw_batches_loop that tests the
> same thing via PPGTT mappings, it fails the same way.
Ok interesting, so it sounds like a more general problem than just the GGTT.
Thanks,
Jesse
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx
next prev parent reply other threads:[~2015-06-10 15:31 UTC|newest]
Thread overview: 29+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-06-08 16:28 [PATCH 0/2] drm/i915/bxt: work around HW coherency issue Imre Deak
2015-06-08 16:28 ` [PATCH 1/2] drm/i915/bxt: work around HW coherency issue when accessing GPU seqno Imre Deak
2015-06-08 17:08 ` Dave Gordon
2015-06-08 17:12 ` Chris Wilson
2015-06-08 17:34 ` Ville Syrjälä
2015-06-08 18:00 ` Chris Wilson
2015-06-08 18:40 ` Ville Syrjälä
2015-06-08 19:33 ` Dave Gordon
2015-06-10 10:59 ` Imre Deak
2015-06-10 15:10 ` Jesse Barnes
2015-06-10 15:26 ` Imre Deak
2015-06-10 15:33 ` Jesse Barnes [this message]
2015-06-10 15:55 ` Imre Deak
2015-06-10 15:52 ` Chris Wilson
2015-06-11 8:02 ` Dave Gordon
2015-06-11 8:20 ` Chris Wilson
2015-06-11 19:14 ` Imre Deak
2015-06-08 17:14 ` Imre Deak
2015-06-09 8:21 ` Jani Nikula
2015-06-10 14:07 ` Imre Deak
2015-06-10 14:21 ` Chris Wilson
2015-06-10 14:55 ` Imre Deak
2015-06-10 15:00 ` Ville Syrjälä
2015-06-10 15:16 ` Imre Deak
2015-06-10 15:35 ` Chris Wilson
2015-07-01 13:40 ` Mika Kuoppala
2015-07-01 13:53 ` Mika Kuoppala
2015-06-08 16:28 ` [PATCH 2/2] drm/i915/bxt: work around HW coherency issue for cached GEM mappings Imre Deak
2015-06-13 18:04 ` shuang.he
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=557858C6.5040304@virtuousgeek.org \
--to=jbarnes@virtuousgeek.org \
--cc=imre.deak@intel.com \
--cc=intel-gfx@lists.freedesktop.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox