From: Matthew Auld <matthew.auld@intel.com>
To: igt-dev@lists.freedesktop.org
Subject: [igt-dev] [PATCH i-g-t v6 01/15] drm-uapi/xe_drm: sync to get pat and coherency bits
Date: Thu, 2 Nov 2023 17:46:46 +0000 [thread overview]
Message-ID: <20231102174700.298135-2-matthew.auld@intel.com> (raw)
In-Reply-To: <20231102174700.298135-1-matthew.auld@intel.com>
Grab the PAT & coherency uapi additions.
Signed-off-by: Matthew Auld <matthew.auld@intel.com>
Cc: José Roberto de Souza <jose.souza@intel.com>
Cc: Pallavi Mishra <pallavi.mishra@intel.com>
Reviewed-by: Pallavi Mishra <pallavi.mishra@intel.com>
---
include/drm-uapi/xe_drm.h | 93 +++++++++++++++++++++++++++++++++++++--
1 file changed, 90 insertions(+), 3 deletions(-)
diff --git a/include/drm-uapi/xe_drm.h b/include/drm-uapi/xe_drm.h
index 6ff1106e4..d9ed75f53 100644
--- a/include/drm-uapi/xe_drm.h
+++ b/include/drm-uapi/xe_drm.h
@@ -548,8 +548,54 @@ struct drm_xe_gem_create {
*/
__u32 handle;
- /** @pad: MBZ */
- __u32 pad;
+ /**
+ * @coh_mode: The coherency mode for this object. This will limit the
+ * possible @cpu_caching values.
+ *
+ * Supported values:
+ *
+ * DRM_XE_GEM_COH_NONE: GPU access is assumed to be not coherent with
+ * CPU. CPU caches are not snooped.
+ *
+ * DRM_XE_GEM_COH_AT_LEAST_1WAY:
+ *
+ * CPU-GPU coherency must be at least 1WAY.
+ *
+ * If 1WAY then GPU access is coherent with CPU (CPU caches are snooped)
+ * until GPU acquires. The acquire by the GPU is not tracked by CPU
+ * caches.
+ *
+ * If 2WAY then should be fully coherent between GPU and CPU. Fully
+ * tracked by CPU caches. Both CPU and GPU caches are snooped.
+ *
+ * Note: On dgpu the GPU device never caches system memory. The device
+ * should be thought of as always 1WAY coherent, with the addition that
+ * the GPU never caches system memory. At least on current dgpu HW there
+ * is no way to turn off snooping so likely the different coherency
+ * modes of the pat_index make no difference for system memory.
+ */
+#define DRM_XE_GEM_COH_NONE 1
+#define DRM_XE_GEM_COH_AT_LEAST_1WAY 2
+ __u16 coh_mode;
+
+ /**
+ * @cpu_caching: The CPU caching mode to select for this object. If
+ * mmaping the object the mode selected here will also be used.
+ *
+ * Supported values:
+ *
+ * DRM_XE_GEM_CPU_CACHING_WB: Allocate the pages with write-back caching.
+ * On iGPU this can't be used for scanout surfaces. The @coh_mode must
+ * be DRM_XE_GEM_COH_AT_LEAST_1WAY. Currently not allowed for objects placed
+ * in VRAM.
+ *
+ * DRM_XE_GEM_CPU_CACHING_WC: Allocate the pages as write-combined. This is
+ * uncached. Any @coh_mode is permitted. Scanout surfaces should likely
+ * use this. All objects that can be placed in VRAM must use this.
+ */
+#define DRM_XE_GEM_CPU_CACHING_WB 1
+#define DRM_XE_GEM_CPU_CACHING_WC 2
+ __u16 cpu_caching;
/** @reserved: Reserved */
__u64 reserved[2];
@@ -626,8 +672,49 @@ struct drm_xe_vm_bind_op {
*/
__u32 obj;
+ /**
+ * @pat_index: The platform defined @pat_index to use for this mapping.
+ * The index basically maps to some predefined memory attributes,
+ * including things like caching, coherency, compression etc. The exact
+ * meaning of the pat_index is platform specific and defined in the
+ * Bspec and PRMs. When the KMD sets up the binding the index here is
+ * encoded into the ppGTT PTE.
+ *
+ * For coherency the @pat_index needs to be least as coherent as
+ * drm_xe_gem_create.coh_mode. i.e coh_mode(pat_index) >=
+ * drm_xe_gem_create.coh_mode. The KMD will extract the coherency mode
+ * from the @pat_index and reject if there is a mismatch (see note below
+ * for pre-MTL platforms).
+ *
+ * Note: On pre-MTL platforms there is only a caching mode and no
+ * explicit coherency mode, but on such hardware there is always a
+ * shared-LLC (or is dgpu) so all GT memory accesses are coherent with
+ * CPU caches even with the caching mode set as uncached. It's only the
+ * display engine that is incoherent (on dgpu it must be in VRAM which
+ * is always mapped as WC on the CPU). However to keep the uapi somewhat
+ * consistent with newer platforms the KMD groups the different cache
+ * levels into the following coherency buckets on all pre-MTL platforms:
+ *
+ * ppGTT UC -> DRM_XE_GEM_COH_NONE
+ * ppGTT WC -> DRM_XE_GEM_COH_NONE
+ * ppGTT WT -> DRM_XE_GEM_COH_NONE
+ * ppGTT WB -> DRM_XE_GEM_COH_AT_LEAST_1WAY
+ *
+ * In practice UC/WC/WT should only ever used for scanout surfaces on
+ * such platforms (or perhaps in general for dma-buf if shared with
+ * another device) since it is only the display engine that is actually
+ * incoherent. Everything else should typically use WB given that we
+ * have a shared-LLC. On MTL+ this completely changes and the HW
+ * defines the coherency mode as part of the @pat_index, where
+ * incoherent GT access is possible.
+ *
+ * Note: For userptr and externally imported dma-buf the kernel expects
+ * either 1WAY or 2WAY for the @pat_index.
+ */
+ __u16 pat_index;
+
/** @pad: MBZ */
- __u32 pad;
+ __u16 pad;
union {
/**
--
2.41.0
next prev parent reply other threads:[~2023-11-02 17:47 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-11-02 17:46 [igt-dev] [PATCH i-g-t v6 00/15] PAT and cache coherency support Matthew Auld
2023-11-02 17:46 ` Matthew Auld [this message]
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 02/15] lib/igt_fb: mark buffers as SCANOUT Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 03/15] lib/igt_draw: " Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 04/15] lib/xe: support cpu_caching and coh_mod for gem_create Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 05/15] tests/xe/mmap: add some tests for cpu_caching and coh_mode Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 06/15] lib/intel_pat: add helpers for common pat_index modes Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 07/15] lib/allocator: add get_offset_pat_index() helper Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 08/15] lib/intel_blt: support pat_index Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 09/15] lib/intel_buf: " Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 10/15] lib/xe_ioctl: update vm_bind to account for pat_index Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 11/15] lib/intel_allocator: treat default_alignment as the minimum Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 12/15] lib/intel_blt: tidy up alignment usage Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 13/15] lib/intel_batchbuffer: extend to include optional alignment Matthew Auld
2023-11-02 17:46 ` [igt-dev] [PATCH i-g-t v6 14/15] tests/xe: add some vm_bind pat_index tests Matthew Auld
2023-11-02 17:47 ` [igt-dev] [PATCH i-g-t v6 15/15] tests/intel-ci/xe: add pat and caching related tests Matthew Auld
2023-11-02 19:02 ` [igt-dev] ✓ Fi.CI.BAT: success for PAT and cache coherency support (rev6) Patchwork
2023-11-02 19:24 ` [igt-dev] ✗ CI.xeBAT: failure " Patchwork
2023-11-03 16:28 ` [igt-dev] ✗ Fi.CI.IGT: " Patchwork
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20231102174700.298135-2-matthew.auld@intel.com \
--to=matthew.auld@intel.com \
--cc=igt-dev@lists.freedesktop.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox