From: Tvrtko Ursulin <tvrtko.ursulin@linux.intel.com>
To: Logan Gunthorpe <logang@deltatee.com>, Intel-gfx@lists.freedesktop.org
Cc: Tom Murphy <murphyt7@tcd.ie>,
Chris Wilson <chris@chris-wilson.co.uk>,
Matthew Auld <matthew.auld@intel.com>,
Lu Baolu <baolu.lu@linux.intel.com>
Subject: Re: [Intel-gfx] [PATCH] drm/i915: Fix DMA mapped scatterlist walks
Date: Thu, 10 Sep 2020 11:27:05 +0100 [thread overview]
Message-ID: <6afea622-4b07-569a-2610-dbe37de58d37@linux.intel.com> (raw)
In-Reply-To: <0d89654d-bc36-52ca-9066-f4d5c6e2a508@deltatee.com>
On 09/09/2020 20:29, Logan Gunthorpe wrote:
> On 2020-09-09 6:44 a.m., Tvrtko Ursulin wrote:
>> From: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
>>
>> When walking DMA mapped scatterlists sg_dma_len has to be used since it
>> can be different (coalesced) from the backing store entry.
>>
>> This also means we have to end the walk when encountering a zero length
>> DMA entry and cannot rely on the normal sg list end marker.
>>
>> Both issues were there in theory for some time but were hidden by the fact
>> Intel IOMMU driver was never coalescing entries. As there are ongoing
>> efforts to change this we need to start handling it.
>>
>> Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
>> References: 85d1225ec066 ("drm/i915: Introduce & use new lightweight SGL iterators")
>> References: b31144c0daa8 ("drm/i915: Micro-optimise gen6_ppgtt_insert_entries()")
>> Reported-by: Tom Murphy <murphyt7@tcd.ie>
>> Suggested-by: Tom Murphy <murphyt7@tcd.ie> # __sgt_iter
>> Suggested-by: Logan Gunthorpe <logang@deltatee.com> # __sgt_iter
>> Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
>> Cc: Chris Wilson <chris@chris-wilson.co.uk>
>> Cc: Matthew Auld <matthew.auld@intel.com>
>> Cc: Lu Baolu <baolu.lu@linux.intel.com>
>> ---
>> drivers/gpu/drm/i915/gt/gen6_ppgtt.c | 6 +++---
>> drivers/gpu/drm/i915/gt/gen8_ppgtt.c | 17 ++++++++++-------
>> drivers/gpu/drm/i915/gt/intel_gtt.h | 2 +-
>> drivers/gpu/drm/i915/i915_scatterlist.h | 12 ++++++++----
>> 4 files changed, 22 insertions(+), 15 deletions(-)
>>
>> diff --git a/drivers/gpu/drm/i915/gt/gen6_ppgtt.c b/drivers/gpu/drm/i915/gt/gen6_ppgtt.c
>> index fd0d24d28763..c0d17f87b00f 100644
>> --- a/drivers/gpu/drm/i915/gt/gen6_ppgtt.c
>> +++ b/drivers/gpu/drm/i915/gt/gen6_ppgtt.c
>> @@ -131,17 +131,17 @@ static void gen6_ppgtt_insert_entries(struct i915_address_space *vm,
>>
>> vaddr = kmap_atomic_px(i915_pt_entry(pd, act_pt));
>> do {
>> - GEM_BUG_ON(iter.sg->length < I915_GTT_PAGE_SIZE);
>> + GEM_BUG_ON(sg_dma_len(iter.sg) < I915_GTT_PAGE_SIZE);
>> vaddr[act_pte] = pte_encode | GEN6_PTE_ADDR_ENCODE(iter.dma);
>>
>> iter.dma += I915_GTT_PAGE_SIZE;
>> if (iter.dma == iter.max) {
>> iter.sg = __sg_next(iter.sg);
>> - if (!iter.sg)
>> + if (!iter.sg || sg_dma_len(iter.sg) == 0)
>> break;
>>
>> iter.dma = sg_dma_address(iter.sg);
>> - iter.max = iter.dma + iter.sg->length;
>> + iter.max = iter.dma + sg_dma_len(iter.sg);
>> }
>>
>> if (++act_pte == GEN6_PTES) {
>> diff --git a/drivers/gpu/drm/i915/gt/gen8_ppgtt.c b/drivers/gpu/drm/i915/gt/gen8_ppgtt.c
>> index eb64f474a78c..0361b3dfdc72 100644
>> --- a/drivers/gpu/drm/i915/gt/gen8_ppgtt.c
>> +++ b/drivers/gpu/drm/i915/gt/gen8_ppgtt.c
>> @@ -372,19 +372,19 @@ gen8_ppgtt_insert_pte(struct i915_ppgtt *ppgtt,
>> pd = i915_pd_entry(pdp, gen8_pd_index(idx, 2));
>> vaddr = kmap_atomic_px(i915_pt_entry(pd, gen8_pd_index(idx, 1)));
>> do {
>> - GEM_BUG_ON(iter->sg->length < I915_GTT_PAGE_SIZE);
>> + GEM_BUG_ON(sg_dma_len(iter->sg) < I915_GTT_PAGE_SIZE);
>> vaddr[gen8_pd_index(idx, 0)] = pte_encode | iter->dma;
>>
>> iter->dma += I915_GTT_PAGE_SIZE;
>> if (iter->dma >= iter->max) {
>> iter->sg = __sg_next(iter->sg);
>> - if (!iter->sg) {
>> + if (!iter->sg || sg_dma_len(iter->sg) == 0) {
>> idx = 0;
>> break;
>> }
>>
>> iter->dma = sg_dma_address(iter->sg);
>> - iter->max = iter->dma + iter->sg->length;
>> + iter->max = iter->dma + sg_dma_len(iter->sg);
>> }
>>
>> if (gen8_pd_index(++idx, 0) == 0) {
>> @@ -414,7 +414,7 @@ static void gen8_ppgtt_insert_huge(struct i915_vma *vma,
>> {
>> const gen8_pte_t pte_encode = gen8_pte_encode(0, cache_level, flags);
>> u64 start = vma->node.start;
>> - dma_addr_t rem = iter->sg->length;
>> + dma_addr_t rem = sg_dma_len(iter->sg);
>
> Seems a little odd to me to be storing a length in a dma_addr_t. But
> besides that small nit, this all makes sense to me.
>
> Reviewed-by: Logan Gunthorpe <logang@deltatee.com>
I did not spot that, thanks. I'll improve that in v2 since I need to add
a 2nd patch to completely prepare i915 for this.
Regards,
Tvrtko
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx
prev parent reply other threads:[~2020-09-10 10:27 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-09-09 12:44 [Intel-gfx] [PATCH] drm/i915: Fix DMA mapped scatterlist walks Tvrtko Ursulin
2020-09-09 13:03 ` Chris Wilson
2020-09-09 13:43 ` Tvrtko Ursulin
2020-09-09 13:07 ` [Intel-gfx] ✓ Fi.CI.BAT: success for " Patchwork
2020-09-09 14:07 ` [Intel-gfx] ✓ Fi.CI.IGT: " Patchwork
2020-09-09 19:29 ` [Intel-gfx] [PATCH] " Logan Gunthorpe
2020-09-10 10:27 ` Tvrtko Ursulin [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=6afea622-4b07-569a-2610-dbe37de58d37@linux.intel.com \
--to=tvrtko.ursulin@linux.intel.com \
--cc=Intel-gfx@lists.freedesktop.org \
--cc=baolu.lu@linux.intel.com \
--cc=chris@chris-wilson.co.uk \
--cc=logang@deltatee.com \
--cc=matthew.auld@intel.com \
--cc=murphyt7@tcd.ie \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox