public inbox for intel-xe@lists.freedesktop.org
* [PATCH v1 0/2] drm/xe: Add support for GPU health indicator
@ 2026-04-16  9:36 Soham Purkait
  2026-04-16  9:36 ` [PATCH v1 1/2] drm/xe/xe_ras: Add structures and commands for RAS " Soham Purkait
                   ` (5 more replies)
  0 siblings, 6 replies; 18+ messages in thread
From: Soham Purkait @ 2026-04-16  9:36 UTC (permalink / raw)
  To: intel-xe, riana.tauro, anshuman.gupta, aravind.iddamsetty,
	badal.nilawar, raag.jadav, ravi.kishore.koppuravuri,
	mallesh.koujalagi
  Cc: soham.purkait, anoop.c.vijay

Large-scale service providers and tier-1 datacenters commonly
deploy various reactive health-monitoring approaches. The Xe GPU health
indicator is intended to fit into such reactive monitoring flows, where
it can be consumed by management and orchestration software.

This series adds Xe GPU health indicator support as a RAS feature
through the System Controller mailbox and exposes it through sysfs.

It introduces the health command IDs and request/response structures
used by the System Controller mailbox, and integrates the feature into
Xe through the gpu_health sysfs interface.

The sysfs file, gpu_health, is created at the device level and
provides a simple interface for observing and updating the reported
GPU health state. It is exposed as read-write on PF/native functions
and read-only on VFs.
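
For illustration only, a management agent might poll the attribute
from userspace roughly as sketched below. This is a minimal sketch and
not part of the series: the helper name read_sysfs_attr() is
hypothetical, and the caller is assumed to supply the full path to
gpu_health, since neither the exact sysfs path nor the format of the
reported value is spelled out in this cover letter.

```c
#include <stdio.h>
#include <string.h>

/*
 * Hypothetical helper (not from the series): read a single-line sysfs
 * attribute into buf and strip the trailing newline.
 * Returns 0 on success, -1 on any failure.
 */
static int read_sysfs_attr(const char *path, char *buf, size_t len)
{
	FILE *f;

	if (!path || !buf || len == 0)
		return -1;

	f = fopen(path, "r");
	if (!f)
		return -1;

	/* sysfs attributes are small; one fgets() call is enough here */
	if (!fgets(buf, (int)len, f)) {
		fclose(f);
		return -1;
	}
	fclose(f);

	/* Drop the newline that sysfs show() callbacks usually append */
	buf[strcspn(buf, "\n")] = '\0';
	return 0;
}
```

A caller would pass the device-level gpu_health path and a small
buffer, then interpret the returned string according to whatever
format the final interface documents. On a VF the same read should
work, while writes would be expected to fail because the file is
read-only there.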

Soham Purkait (2):
  drm/xe/xe_ras: Add structures and commands for RAS GPU health
    indicator
  drm/xe/xe_ras: Add RAS support for GPU health indicator

 drivers/gpu/drm/xe/Makefile                   |   1 +
 drivers/gpu/drm/xe/xe_device.c                |   3 +
 drivers/gpu/drm/xe/xe_ras.c                   | 181 ++++++++++++++++++
 drivers/gpu/drm/xe/xe_ras.h                   |  13 ++
 drivers/gpu/drm/xe/xe_ras_types.h             |  65 +++++++
 drivers/gpu/drm/xe/xe_sysctrl_mailbox_types.h |  15 ++
 6 files changed, 278 insertions(+)
 create mode 100644 drivers/gpu/drm/xe/xe_ras.c
 create mode 100644 drivers/gpu/drm/xe/xe_ras.h
 create mode 100644 drivers/gpu/drm/xe/xe_ras_types.h

-- 
2.34.1



end of thread, other threads:[~2026-04-22  6:05 UTC | newest]

Thread overview: 18+ messages
2026-04-16  9:36 [PATCH v1 0/2] drm/xe: Add support for GPU health indicator Soham Purkait
2026-04-16  9:36 ` [PATCH v1 1/2] drm/xe/xe_ras: Add structures and commands for RAS " Soham Purkait
2026-04-16 11:39   ` Andi Shyti
2026-04-17 14:45   ` Rodrigo Vivi
2026-04-16  9:36 ` [PATCH v1 2/2] drm/xe/xe_ras: Add RAS support for " Soham Purkait
2026-04-16 11:54   ` Andi Shyti
2026-04-17 14:51     ` Rodrigo Vivi
2026-04-20 15:26       ` Andi Shyti
2026-04-20 19:51         ` Rodrigo Vivi
2026-04-21 12:56           ` Andi Shyti
2026-04-21 13:21             ` Rodrigo Vivi
2026-04-22  6:05           ` Purkait, Soham
2026-04-20 16:19     ` Purkait, Soham
2026-04-20 17:35       ` Andi Shyti
2026-04-16  9:55 ` ✗ CI.checkpatch: warning for drm/xe: Add " Patchwork
2026-04-16  9:56 ` ✓ CI.KUnit: success " Patchwork
2026-04-16 10:58 ` ✓ Xe.CI.BAT: " Patchwork
2026-04-16 12:01 ` ✗ Xe.CI.FULL: failure " Patchwork
