Re: [PATCH v12 11/13] drm/xe: Force flush system memory AuxCCS framebuffers before scan out

Intel-XE Archive on lore.kernel.org
 help / color / mirror / Atom feed

From: "Ville Syrjälä" <ville.syrjala@linux.intel.com>
To: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
Cc: intel-xe@lists.freedesktop.org, kernel-dev@igalia.com
Subject: Re: [PATCH v12 11/13] drm/xe: Force flush system memory AuxCCS framebuffers before scan out
Date: Thu, 2 Oct 2025 20:14:03 +0300	[thread overview]
Message-ID: <aN6y2-2GTgOVmbmR@intel.com> (raw)
In-Reply-To: <6649a3e8-eb62-49a6-9f09-46c28515d45a@igalia.com>

On Thu, Oct 02, 2025 at 06:04:28PM +0100, Tvrtko Ursulin wrote:
> 
> On 02/10/2025 17:23, Ville Syrjälä wrote:
> > On Thu, Oct 02, 2025 at 03:01:08PM +0100, Tvrtko Ursulin wrote:
> >>
> >> Hi,
> >>
> >> On 26/09/2025 20:35, Ville Syrjälä wrote:
> >>> On Fri, Sep 26, 2025 at 10:41:56AM +0300, Ville Syrjälä wrote:
> >>>> I reverse engineered this a bit and there's definitely a
> >>>> MOCS issue at play.
> >>>>
> >>>> First I noticed that if filled the entire MOCS table with
> >>>> UC the problem went away. I then filled the entire table
> >>>> with WB and essentially bisected what I need to make UC
> >>>> to fix it. And I had to repeat that same process starting
> >>>> from the other end of table.
> >>>>
> >>>> Looks like there is some undocumented magic in the hardware.
> >>>>
> >>>> MOCS 61 really is special:
> >>>> - MOCS 61 UC, others WB, select MOCS 61 -> no corruption
> >>>>
> >>>> MOCS 0 and 63 are special in other ways:
> >>>> - MOCS X UC, others WB, select MOCS X -> corruption
> >>>> - MOCS X+0 UC, others WB, select MOCS X -> corruption
> >>>> - MOCS X+63 UC, others WB, select MOCS X -> corruption
> >>>> - MOCS X+0+63 UC, others WB, select MOCS X -> no corruption
> >>>>     where X != 61
> >>>
> >>> OK, the MOCS 63 issue was caused by me having L3=WB still in
> >>> MOCS X. If I change MOCS X to L3=UC, MOCS 63 no longer makes
> >>> a difference. I suppose that means MOCS 63 is still used for
> >>> L3 evictions, even though bspec no longer mentions that fact
> >>> explicitly.
> >>>
> >>> So MOCS 0 is the thing that really matters for CCS. And for
> >>> MOCS 0 only the LLC WB vs. UC selection matters. L3 WB vs. UC
> >>> doesn't seem to make any difference.
> >>>
> >>> It's interesting that MOCS 60 is documented as a "CCS special case",
> >>> but in reality it's MOCS 0 that matters for CCS. I wonder if some
> >>> wires got crossed in the hw design and the wrong MOCS entry ended
> >>> up being used for CCS and no one noticed...
> >>
> >> Oh wow, that is an amazing discovery!
> >>
> >> I verified it on my end too. Setting MOCS 0 to uncached and cache dirt
> >> is gone. No need to the explicit cache flush patch on first pin.
> >>
> >> Luckily ADL is unsupported so we could change it to UC. I will send a
> >> series for CI to see what it will say.
> 
> So the MOCS 0 UC experiment did not seem to be 100% glitch free. It 
> *looks* it helps, maybe even a lot, but not fully - three tests still 
> failed due CRC mismatches.
> 
> > I think the real fix is to change igt to use MOCS 61 for tgl/adl.
> > That is what Mesa uses as well.
> I somehow glossed over the fact you initially wrote 61 worked fine for 
> you and focused only on your X+0+63 combinations. :(
> 
> 61 works fine for me locally too. Very curious hw behaviour.
> 
> It would be nice to do a CI run with IGT changed to 61 but AFAIK the xe 
> patchwork/CI does not support the Test-with tag.
> 
> > Looks like Mesa uses a different MOCS for DG1 and DG2. Those
> > do seem to like up with what's in bspec, so probably someone
> > needs to just copy the whole MOCS thing from Mesa into igt.
> 
> I can have a look.
> 
> > Looks like Mesa doesn't even use a UC MOCS for anything except
> > on MTL, so possibly we can just change the TGL MOCS 0 to be the
> > same WB as on ADL, and maybe that gives some performance benefit
> > in some cases.
> 
> On xe, i915 or both?

Both.

Does xe not program the table already according to bspec? I doubt
we should really care about the "ancient Mesa + xe + TGL" case,
so the special TGL MOCS table shouldn't be needed on xe IMO.

> 
> >>>> I didn't actually test all values of X there, but I did spot
> >>>> check a handful of them.
> >>>>
> >>>> Also, ADL is affected, but TGL doesn't seem to be. Though I
> >>>> still need to check the situation on TGL a bit more thoroughly.
> >>>
> >>> TGL actually works exactly the same as ADL. The only reason why
> >>> TGL worked correctly out of the box was that we use a different
> >>> MOCS table for TGL/RKL (IIRC because we started out with the
> >>> wrong table and early Mesa versions depended on that), and in
> >>> that table MOCS 0 is just 0x0, whereas on ADL MOCS 0 is WB.
> >>
> >> Kind of sounds familiar but the only commit I found was 3f027d61663f
> >> ("drm/i915/gt: Add separate MOCS table for Gen12 devices other than
> >> TGL/RKL") but it is about MOCS 1. What am I missing? Are the hw defaults
> >> maybe different and not the code?
> > 
> > The defaults are somehow populated differently dependign on
> > unused_entries_index which is also being set in a very confusing
> > way (first set it to 1(PTE) on everything and the overwritten
> > with some other value for some of the platforms). The code could
> > certainly use a good cleanup pass.
> > 
> > Anyways, the default index ends up being different on TGL and ADL
> > and thus MOCS 0 ends up different as well.
> 
> Yep. I missed it and forgot about cfbe5291a189 ("drm/i915/gt: Initialize 
> unused MOCS entries with device specific values").
> 
> > Since MOCS 0 seems to be special, we should probablya populate
> > it explicitly. And I suppose we should first figure out if
> > other platforms are also affected.
> 
> Yeah. If we could only get the full understanding on the details of 
> "specialness".

I filed a bspec issue for it now. I guess we'll see if anyone
cares anymore...

And I do still want to reverse engineer this on other platforms
as well.

-- 
Ville Syrjälä
Intel

next prev parent reply	other threads:[~2025-10-02 17:14 UTC|newest]

Thread overview: 45+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-09-23 10:07 [PATCH v12 00/13] AuxCCS handling and render compression modifiers Tvrtko Ursulin
2025-09-23 10:07 ` [PATCH v12 01/13] drm/xe/xelpg: Flush CCS when flushing caches Tvrtko Ursulin
2025-09-23 10:07 ` [PATCH v12 02/13] drm/xe/xelp: Quiesce memory traffic before invalidating AuxCCS Tvrtko Ursulin
2025-10-01 15:47   ` Rodrigo Vivi
2025-09-23 10:07 ` [PATCH v12 03/13] drm/xe/xelp: Support auxccs invalidation on blitter Tvrtko Ursulin
2025-09-23 10:07 ` [PATCH v12 04/13] drm/xe/xelp: Use MI_FLUSH_DW_CCS on auxccs platforms Tvrtko Ursulin
2025-09-23 10:07 ` [PATCH v12 05/13] drm/xe/xelp: Wait for AuxCCS invalidation to complete Tvrtko Ursulin
2025-09-23 10:07 ` [PATCH v12 06/13] drm/xe: Export xe_emit_aux_table_inv Tvrtko Ursulin
2025-09-23 10:08 ` [PATCH v12 07/13] drm/xe/xelp: Add AuxCCS invalidation to the indirect context workarounds Tvrtko Ursulin
2025-09-23 10:08 ` [PATCH v12 08/13] drm/xe: Flush GGTT writes after populating DPT Tvrtko Ursulin
2025-09-23 10:08 ` [PATCH v12 09/13] drm/xe: Handle DPT in system memory Tvrtko Ursulin
2025-09-23 10:08 ` [PATCH v12 10/13] drm/xe/display: Add support for AuxCCS Tvrtko Ursulin
2025-09-23 10:08 ` [PATCH v12 11/13] drm/xe: Force flush system memory AuxCCS framebuffers before scan out Tvrtko Ursulin
2025-09-23 10:19   ` Ville Syrjälä
2025-09-23 10:48     ` Tvrtko Ursulin
2025-09-23 12:01       ` Ville Syrjälä
2025-09-23 12:25         ` Tvrtko Ursulin
2025-09-23 13:20           ` Ville Syrjälä
2025-09-23 14:40             ` Tvrtko Ursulin
2025-09-23 14:52               ` Ville Syrjälä
2025-09-24 13:09                 ` Tvrtko Ursulin
2025-09-24 22:35                   ` Ville Syrjälä
2025-09-25  7:24                     ` Tvrtko Ursulin
2025-09-25 10:08                       ` Tvrtko Ursulin
2025-09-26  7:41                         ` Ville Syrjälä
2025-09-26 19:35                           ` Ville Syrjälä
2025-10-02 14:01                             ` Tvrtko Ursulin
2025-10-02 14:36                               ` Tvrtko Ursulin
2025-10-02 16:23                               ` Ville Syrjälä
2025-10-02 17:04                                 ` Tvrtko Ursulin
2025-10-02 17:14                                   ` Ville Syrjälä [this message]
2025-10-02 22:02                                     ` Ville Syrjälä
2025-09-23 10:44   ` [PATCH v13 " Tvrtko Ursulin
2025-09-23 10:08 ` [PATCH v12 12/13] drm/xe: Do not use stolen memory for DPT on IGFX and AuxCCS Tvrtko Ursulin
2025-09-23 10:08 ` [PATCH v12 13/13] drm/i915/display: Expose AuxCCS frame buffer modifiers for Xe Tvrtko Ursulin
2025-09-23 10:15 ` ✗ CI.checkpatch: warning for AuxCCS handling and render compression modifiers (rev15) Patchwork
2025-09-23 10:16 ` ✓ CI.KUnit: success " Patchwork
2025-09-23 11:15 ` ✓ Xe.CI.BAT: " Patchwork
2025-09-23 11:21 ` ✗ CI.checkpatch: warning for AuxCCS handling and render compression modifiers (rev16) Patchwork
2025-09-23 11:22 ` ✓ CI.KUnit: success " Patchwork
2025-09-23 12:03 ` ✓ Xe.CI.BAT: " Patchwork
2025-09-23 13:26 ` ✗ Xe.CI.Full: failure for AuxCCS handling and render compression modifiers (rev15) Patchwork
2025-09-23 14:12 ` ✗ Xe.CI.Full: failure for AuxCCS handling and render compression modifiers (rev16) Patchwork
2025-09-23 20:12 ` [PATCH v12 00/13] AuxCCS handling and render compression modifiers Ville Syrjälä
2025-09-24  7:59   ` Tvrtko Ursulin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aN6y2-2GTgOVmbmR@intel.com \
    --to=ville.syrjala@linux.intel.com \
    --cc=intel-xe@lists.freedesktop.org \
    --cc=kernel-dev@igalia.com \
    --cc=tvrtko.ursulin@igalia.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox