From: "Vivekanandan, Balasubramani" <balasubramani.vivekanandan@intel.com>
To: Lucas De Marchi <lucas.demarchi@intel.com>
Cc: Matt Roper <matthew.d.roper@intel.com>,
<intel-xe@lists.freedesktop.org>,
Shekhar Chauhan <shekhar.chauhan@intel.com>,
Tejas Upadhyay <tejas.upadhyay@intel.com>,
Brian Welty <brian.welty@intel.com>
Subject: Re: [PATCH 03/23] drm/xe/xe3p_lpm: Configure MAIN_GAMCTRL_QUEUE_SELECT
Date: Thu, 16 Oct 2025 19:50:48 +0530 [thread overview]
Message-ID: <aPD_QK8r_DvW1SOX@bvivekan-mobl1> (raw)
In-Reply-To: <jvumbsccpra6yu4vqn5xtjlmfn5n5cflr6yaofjazfylrbfk7t@qd5lkufxluxp>
On 15.10.2025 09:03, Lucas De Marchi wrote:
> On Wed, Oct 15, 2025 at 12:06:55PM +0530, Vivekanandan, Balasubramani wrote:
> > On 14.10.2025 21:28, Lucas De Marchi wrote:
> > > On Tue, Oct 14, 2025 at 09:34:03AM -0700, Matt Roper wrote:
> > > > On Mon, Oct 13, 2025 at 08:24:35PM -0700, Lucas De Marchi wrote:
> > > > > From: Brian Welty <brian.welty@intel.com>
> > > > >
> > > > > Starting from Xe3p, there are two different copies of some of the GAM
> > > > > registers: the traditional MCR variant at their old locations, and a
> > > > > new unicast copy known as "main_gamctrl." The Xe driver doesn't use
> > > > > these registers directly, but we need to instruct the GuC on which set
> > > > > it should use. Since the new, unicast registers are preferred (since
> > > > > they avoid the need for unnecessary MCR synchronization), set a new GuC
> > > > > feature flag, GUC_CTL_MAIN_GAMCTRL_QUEUES to convey this decision. A
> > > > > new helper function, xe_guc_using_main_gamctrl_queues(), is added for
> > > > > use in the 3 independent places that need to handle configuration of the
> > > > > new reporting queues.
> > > > >
> > > > > The mmio write to enable the main gamctl is only done during the general
> > > > > GuC upload. The gamctrl registers are not accessed by the GuC during
> > > > > hwconfig load.
> > > > >
> > > > > Last, the ADS blob for communicating the queue addresses contains both a
> > > > > DPA and GGTT offset. The GuC documentation states that DPA is now MBZ
> > > > > when using the MAIN_GAMCTRL queues.
> > > > >
> > > > > Signed-off-by: Brian Welty <brian.welty@intel.com>
> > > > > Signed-off-by: Lucas De Marchi <lucas.demarchi@intel.com>
> > > > > ---
> > > > > drivers/gpu/drm/xe/regs/xe_gt_regs.h | 3 +++
> > > > > drivers/gpu/drm/xe/xe_gt.h | 6 ++++++
> > > > > drivers/gpu/drm/xe/xe_guc.c | 27 +++++++++++++++++++++++++++
> > > > > drivers/gpu/drm/xe/xe_guc.h | 1 +
> > > > > drivers/gpu/drm/xe/xe_guc_ads.c | 6 +++++-
> > > > > drivers/gpu/drm/xe/xe_guc_fwif.h | 1 +
> > > > > 6 files changed, 43 insertions(+), 1 deletion(-)
> > > > >
> > > > > diff --git a/drivers/gpu/drm/xe/regs/xe_gt_regs.h b/drivers/gpu/drm/xe/regs/xe_gt_regs.h
> > > > > index 51f2a03847f9d..47e13a3fb9072 100644
> > > > > --- a/drivers/gpu/drm/xe/regs/xe_gt_regs.h
> > > > > +++ b/drivers/gpu/drm/xe/regs/xe_gt_regs.h
> > > > > @@ -545,6 +545,9 @@
> > > > > #define SARB_CHICKEN1 XE_REG_MCR(0xe90c)
> > > > > #define COMP_CKN_IN REG_GENMASK(30, 29)
> > > > >
> > > > > +#define MAIN_GAMCTRL_MODE XE_REG(0xef00)
> > > > > +#define MAIN_GAMCTRL_QUEUE_SELECT REG_BIT(0)
> > > > > +
> > > > > #define RCU_MODE XE_REG(0x14800, XE_REG_OPTION_MASKED)
> > > > > #define RCU_MODE_FIXED_SLICE_CCS_MODE REG_BIT(1)
> > > > > #define RCU_MODE_CCS_ENABLE REG_BIT(0)
> > > > > diff --git a/drivers/gpu/drm/xe/xe_gt.h b/drivers/gpu/drm/xe/xe_gt.h
> > > > > index 5df2ffe3ff838..9d710049da455 100644
> > > > > --- a/drivers/gpu/drm/xe/xe_gt.h
> > > > > +++ b/drivers/gpu/drm/xe/xe_gt.h
> > > > > @@ -22,6 +22,12 @@
> > > > >
> > > > > #define CCS_MASK(gt) (((gt)->info.engine_mask & XE_HW_ENGINE_CCS_MASK) >> XE_HW_ENGINE_CCS0)
> > > > >
> > > > > +#define GT_VER(gt) ({ \
> > > > > + typeof(gt) gt_ = (gt); \
> > > > > + struct xe_device *xe = gt_to_xe(gt_); \
> > > > > + xe_gt_is_media_type(gt_) ? MEDIA_VER(xe) : GRAPHICS_VER(xe); \
> > > > > +})
> > > > > +
> > > > > extern struct fault_attr gt_reset_failure;
> > > > > static inline bool xe_fault_inject_gt_reset(void)
> > > > > {
> > > > > diff --git a/drivers/gpu/drm/xe/xe_guc.c b/drivers/gpu/drm/xe/xe_guc.c
> > > > > index d94490979adc0..37e3735f34e63 100644
> > > > > --- a/drivers/gpu/drm/xe/xe_guc.c
> > > > > +++ b/drivers/gpu/drm/xe/xe_guc.c
> > > > > @@ -91,6 +91,9 @@ static u32 guc_ctl_feature_flags(struct xe_guc *guc)
> > > > > if (xe_configfs_get_psmi_enabled(to_pci_dev(xe->drm.dev)))
> > > > > flags |= GUC_CTL_ENABLE_PSMI_LOGGING;
> > > > >
> > > > > + if (xe_guc_using_main_gamctrl_queues(guc))
> > > > > + flags |= GUC_CTL_MAIN_GAMCTRL_QUEUES;
> > > > > +
> > > > > return flags;
> > > > > }
> > > > >
> > > > > @@ -1255,8 +1258,13 @@ int xe_guc_min_load_for_hwconfig(struct xe_guc *guc)
> > > > >
> > > > > int xe_guc_upload(struct xe_guc *guc)
> > > > > {
> > > > > + struct xe_gt *gt = guc_to_gt(guc);
> > > > > +
> > > > > xe_guc_ads_populate(&guc->ads);
> > > > >
> > > > > + if (xe_guc_using_main_gamctrl_queues(guc))
> > > > > + xe_mmio_write32(>->mmio, MAIN_GAMCTRL_MODE, MAIN_GAMCTRL_QUEUE_SELECT);
> > > > > +
> > > > > return __xe_guc_upload(guc);
> > > > > }
> > > > >
> > > > > @@ -1657,6 +1665,25 @@ void xe_guc_declare_wedged(struct xe_guc *guc)
> > > > > xe_guc_submit_wedge(guc);
> > > > > }
> > > > >
> > > > > +/**
> > > > > + * xe_guc_using_main_gamctrl_queues() - Detect which reporting queues to use.
> > > > > + * @guc: The GuC object
> > > > > + *
> > > > > + * For Xe3p and beyond, we want to program the hardware to use the
> > > > > + * "Main GAMCTRL queue" rather than the legacy queue before we upload
> > > > > + * the GuC firmware. This will allow the GuC to use a new set of
> > > > > + * registers for pagefault handling and avoid some unnecessary
> > > > > + * complications with MCR register range handling.
> > > > > + *
> > > > > + * Return: true if can use new main gamctrl queues.
> > > > > + */
> > > > > +bool xe_guc_using_main_gamctrl_queues(struct xe_guc *guc)
> > > > > +{
> > > > > + struct xe_gt *gt = guc_to_gt(guc);
> > > > > +
> > > > > + return GT_VER(gt) >= 35;
> > > >
> > > > Revisiting the spec on this, I'm not sure whether using GT_VER() here is
> > > > actually the right thing to do. As far as I can see, the media GT does
> > > > not actually have a "main gamctrl" register range at all (i.e., bspec
> > > > 76445 only lists MCR gamctrl ranges, and 0x38EF00-0x38EFFF falls within
> > > > a reserved/unused block). That means that registers like
> > >
> > > true
> > >
> > > > MAIN_GAMCTRL_MODE that we're trying to write during xe_guc_upload above
> > >
> > > We do have MAIN_GAMCTRL_MODE for xe3p_lpm in bspec 73540.
> > > The tagging is odd, but there is a reference to the register at
> > > 0x38EF00. It wouldn't make sense to have a 0x38xxxx register instance
> > > for primary/graphics GT.
> > >
> > > So either
> > >
> > > 1) we don't have it like you said
> > > 2) we have it and there's something missing in the MCR ranges
> > >
> > > > don't actually exist for the media GT. Furthermore, the tagging on the
> > > > register detail page 73540 also seems to imply that this only applies to
> > > > the primary/graphics GT.
> > >
> > > so... in that case we'd need to check both the gt type and graphics ver,
> > > so we load GuC with those different flags. I will take a shot at that
> > > and check if anything breaks.
> > >
> > > Bala, since you did the patch following this, do you remember of
> > > anything related to this?
> >
> > There seems to be a conflict between the Bspec and the HSD. HSD
> > 16017994409 and the HAS claim the MainGAMCTRL is supported in Xe3p Media
> > IP whereas the Bspec page 73542 and the Steering table show the
> > MainGAMCTRL register range as not available for Xe3p Media.
> >
> > For me it looks like a Bspec update issue. We have tested and verified
> > the current implementation which enables MainGAMCTRL for Media.
> > I would say lets keep the code as it is though it conflicts with the
> > spec and raise a Bspec issue to get it clarified.
>
> I'm circling this back with HW and Spec people. I also did some changes
> local changes to test. Although since the old queues are still valid, I
> guess it will just pass, but without this optimization. On the other
> hand, I'd expect it to not work at all with the current code if it
> didn't apply to media.
There is an Bspec update to the steering table yesterday which now lists
the address range 0x38EF00 - 0x38EFFF as non-MCR. But still the Bspec
filtering is not correct. So the register range doesn't show up when
filtered for Xe3p Media.
Regards,
Bala
>
> Another point is that we can delay this patch a little bit, since the
> next patch turns this off for media with the special bits set.
>
> thanks
> Lucas De Marchi
next prev parent reply other threads:[~2025-10-16 14:21 UTC|newest]
Thread overview: 78+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-10-14 3:24 [PATCH 00/23] drm/xe: Add Xe3p support Lucas De Marchi
2025-10-14 3:24 ` [PATCH 01/23] drm/xe/xe3: Add support for graphics IP versions 30.04 & 30.05 Lucas De Marchi
2025-10-14 6:17 ` Shekhar Chauhan
2025-10-14 16:11 ` Matt Roper
2025-10-14 3:24 ` [PATCH 02/23] drm/xe/xe3p_lpm: Add support for media IP versions 35.00 & 35.03 Lucas De Marchi
2025-10-14 6:22 ` Shekhar Chauhan
2025-10-14 16:14 ` Matt Roper
2025-10-14 3:24 ` [PATCH 03/23] drm/xe/xe3p_lpm: Configure MAIN_GAMCTRL_QUEUE_SELECT Lucas De Marchi
2025-10-14 16:34 ` Matt Roper
2025-10-15 2:28 ` Lucas De Marchi
2025-10-15 6:36 ` Vivekanandan, Balasubramani
2025-10-15 14:03 ` Lucas De Marchi
2025-10-16 14:20 ` Vivekanandan, Balasubramani [this message]
2025-10-15 14:59 ` Matt Roper
2025-10-14 3:24 ` [PATCH 04/23] drm/xe/xe3p_lpm: Add special check in Media GT for Main GAMCTRL Lucas De Marchi
2025-10-14 16:36 ` Matt Roper
2025-10-14 3:24 ` [PATCH 05/23] drm/xe/xe3p_lpm: Stop reading the CTC_MODE register Lucas De Marchi
2025-10-14 11:58 ` Shekhar Chauhan
2025-10-14 16:52 ` Matt Roper
2025-10-15 3:41 ` Lucas De Marchi
2025-10-15 9:19 ` Vivekanandan, Balasubramani
2025-10-15 15:04 ` Matt Roper
2025-10-14 16:40 ` Matt Roper
2025-10-14 16:53 ` Matt Roper
2025-10-14 3:24 ` [PATCH 06/23] drm/xe/xe3p_lpm: Skip disabling NOA on unsupported IPs Lucas De Marchi
2025-10-14 17:04 ` Matt Roper
2025-10-14 3:24 ` [PATCH 07/23] drm/xe/xe3p_lpm: Handle MCR steering Lucas De Marchi
2025-10-15 9:56 ` Vivekanandan, Balasubramani
2025-10-14 3:24 ` [PATCH 08/23] drm/xe/xe3p: Stop programming RCU_MODE's fixed slice mode setting Lucas De Marchi
2025-10-15 12:24 ` Vivekanandan, Balasubramani
2025-10-14 3:24 ` [PATCH 09/23] drm/xe/xe3p: Determine service copy availability from fuse Lucas De Marchi
2025-10-15 20:14 ` Gustavo Sousa
2025-10-14 3:24 ` [PATCH 10/23] drm/xe/xe3p: Skip TD flush Lucas De Marchi
2025-10-14 19:35 ` Matt Roper
2025-10-14 3:24 ` [PATCH 11/23] drm/xe/xe3p: Enable L2 flush optimization feature Lucas De Marchi
2025-10-14 19:43 ` Matt Roper
2025-10-15 4:02 ` Lucas De Marchi
2025-10-14 3:24 ` [PATCH 12/23] drm/xe/xe3p: Flush userptr/shrinker bo cachelines manually Lucas De Marchi
2025-10-14 12:58 ` Thomas Hellström
2025-10-15 18:42 ` Lucas De Marchi
2025-10-14 3:24 ` [PATCH 13/23] drm/xe: Dump CURRENT_LRCA and CSMQDEBUG registers Lucas De Marchi
2025-10-14 17:24 ` Matt Roper
2025-10-15 4:07 ` Lucas De Marchi
2025-10-14 3:24 ` [PATCH 14/23] drm/xe/nvl: Define NVL-S platform Lucas De Marchi
2025-10-14 7:34 ` Shekhar Chauhan
2025-10-14 3:24 ` [PATCH 15/23] drm/xe/nvls: Define GuC firmware for NVL-S Lucas De Marchi
2025-10-15 18:49 ` Lucas De Marchi
2025-10-14 3:24 ` [PATCH 16/23] drm/xe/nvls: Attach MOCS table " Lucas De Marchi
2025-10-14 7:45 ` Shekhar Chauhan
2025-10-14 3:24 ` [PATCH 17/23] drm/xe/xe3p_xpc: Add Xe3p_XPC IP definition Lucas De Marchi
2025-10-14 8:04 ` Shekhar Chauhan
2025-10-14 8:12 ` Shekhar Chauhan
2025-10-14 17:33 ` Matt Roper
2025-10-15 2:47 ` Shekhar Chauhan
2025-10-14 17:36 ` Matt Roper
2025-10-14 3:24 ` [PATCH 18/23] drm/xe/xe3p_xpc: Add L3 bank mask Lucas De Marchi
2025-10-14 17:46 ` Matt Roper
2025-10-14 3:24 ` [PATCH 19/23] drm/xe/xe3p_xpc: Add MCR steering Lucas De Marchi
2025-10-14 3:24 ` [PATCH 20/23] drm/xe/xe3p_xpc: Add support for compute walker for non-MSIx Lucas De Marchi
2025-10-14 18:07 ` Matt Roper
2025-10-15 17:07 ` Lucas De Marchi
2025-10-15 17:12 ` Matt Roper
2025-10-14 3:24 ` [PATCH 21/23] drm/xe/xe3p_xpc: Skip compression tuning on platforms without flatccs Lucas De Marchi
2025-10-14 8:09 ` Shekhar Chauhan
2025-10-14 8:13 ` Shekhar Chauhan
2025-10-14 18:14 ` Matt Roper
2025-10-15 2:52 ` Shekhar Chauhan
2025-10-15 14:38 ` Lucas De Marchi
2025-10-15 15:34 ` Shekhar Chauhan
2025-10-14 3:24 ` [PATCH 22/23] drm/xe/xe3p_xpc: Setup PAT table Lucas De Marchi
2025-10-16 14:07 ` Vivekanandan, Balasubramani
2025-10-14 3:24 ` [PATCH 23/23] drm/xe/xe3p: Add xe3p EU stall data format Lucas De Marchi
2025-10-14 19:11 ` Matt Roper
2025-10-15 23:05 ` Harish Chegondi
2025-10-14 3:45 ` ✗ CI.checkpatch: warning for drm/xe: Add Xe3p support Patchwork
2025-10-14 3:47 ` ✓ CI.KUnit: success " Patchwork
2025-10-14 4:22 ` ✓ Xe.CI.BAT: " Patchwork
2025-10-14 12:06 ` ✓ Xe.CI.Full: " Patchwork
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aPD_QK8r_DvW1SOX@bvivekan-mobl1 \
--to=balasubramani.vivekanandan@intel.com \
--cc=brian.welty@intel.com \
--cc=intel-xe@lists.freedesktop.org \
--cc=lucas.demarchi@intel.com \
--cc=matthew.d.roper@intel.com \
--cc=shekhar.chauhan@intel.com \
--cc=tejas.upadhyay@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox