From: Mallesh Koujalagi
To: intel-xe@lists.freedesktop.org, dri-devel@lists.freedesktop.org,
	rodrigo.vivi@intel.com
Cc: andrealmeid@igalia.com, christian.koenig@amd.com, airlied@gmail.com,
	simona.vetter@ffwll.ch, mripard@kernel.org, anshuman.gupta@intel.com,
	badal.nilawar@intel.com, riana.tauro@intel.com, karthik.poosa@intel.com,
	sk.anirban@intel.com, raag.jadav@intel.com, Mallesh Koujalagi
Subject: [PATCH 1/4] drm: Add DRM_WEDGE_RECOVERY_COLD_RESET for critical error
Date: Wed, 11 Feb 2026 17:29:48 +0530
Message-ID: <20260211115946.2014051-7-mallesh.koujalagi@intel.com>
In-Reply-To: <20260211115946.2014051-6-mallesh.koujalagi@intel.com>
References: <20260211115946.2014051-6-mallesh.koujalagi@intel.com>

Introduce the DRM_WEDGE_RECOVERY_COLD_RESET (BIT(4)) recovery method to
handle critical errors that require a complete power cycle of the device.

This method addresses scenarios where the other recovery mechanisms
(driver reload, PCIe reset, etc.) are insufficient to restore device
functionality. When set, it indicates to userspace that only a full cold
reset can recover the device from its current error state. The cold
reset method serves as a last resort for critical errors.

Signed-off-by: Mallesh Koujalagi
---
 drivers/gpu/drm/drm_drv.c | 2 ++
 include/drm/drm_device.h  | 1 +
 2 files changed, 3 insertions(+)

diff --git a/drivers/gpu/drm/drm_drv.c b/drivers/gpu/drm/drm_drv.c
index 2915118436ce..48d269d470a3 100644
--- a/drivers/gpu/drm/drm_drv.c
+++ b/drivers/gpu/drm/drm_drv.c
@@ -534,6 +534,8 @@ static const char *drm_get_wedge_recovery(unsigned int opt)
 		return "bus-reset";
 	case DRM_WEDGE_RECOVERY_VENDOR:
 		return "vendor-specific";
+	case DRM_WEDGE_RECOVERY_COLD_RESET:
+		return "cold-reset";
 	default:
 		return NULL;
 	}
diff --git a/include/drm/drm_device.h b/include/drm/drm_device.h
index bc78fb77cc27..3e386eb42023 100644
--- a/include/drm/drm_device.h
+++ b/include/drm/drm_device.h
@@ -37,6 +37,7 @@ struct pci_controller;
 #define DRM_WEDGE_RECOVERY_REBIND	BIT(1)	/* unbind + bind driver */
 #define DRM_WEDGE_RECOVERY_BUS_RESET	BIT(2)	/* unbind + reset bus device + bind */
 #define DRM_WEDGE_RECOVERY_VENDOR	BIT(3)	/* vendor specific recovery method */
+#define DRM_WEDGE_RECOVERY_COLD_RESET	BIT(4)	/* full device cold reset */
 
 /**
  * struct drm_wedge_task_info - information about the guilty task of a wedge dev
-- 
2.34.1
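
For readers who want to see how the new bit would sit alongside the existing ones, here is a minimal userspace sketch, not kernel code. It copies the bit values from the drm_device.h hunk above and mirrors the switch in drm_get_wedge_recovery(). The names wedge_recovery_name() and wedge_methods_to_string() are hypothetical helpers invented for illustration, the local BIT() macro stands in for the kernel's, and the "rebind" string is an assumption because that case is not visible in the hunk.

```c
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/* Local stand-in for the kernel's BIT() macro. */
#define BIT(n) (1UL << (n))

/* Bit values as they appear in the include/drm/drm_device.h hunk. */
#define DRM_WEDGE_RECOVERY_REBIND	BIT(1)
#define DRM_WEDGE_RECOVERY_BUS_RESET	BIT(2)
#define DRM_WEDGE_RECOVERY_VENDOR	BIT(3)
#define DRM_WEDGE_RECOVERY_COLD_RESET	BIT(4)

/* Hypothetical mirror of drm_get_wedge_recovery(): one bit in, one name out. */
static const char *wedge_recovery_name(unsigned long opt)
{
	switch (opt) {
	case DRM_WEDGE_RECOVERY_REBIND:
		return "rebind";	/* assumed string, not shown in the hunk */
	case DRM_WEDGE_RECOVERY_BUS_RESET:
		return "bus-reset";
	case DRM_WEDGE_RECOVERY_VENDOR:
		return "vendor-specific";
	case DRM_WEDGE_RECOVERY_COLD_RESET:
		return "cold-reset";
	default:
		return NULL;
	}
}

/*
 * Hypothetical helper: write the comma-separated names of all known bits
 * set in @methods into @buf (at most @len bytes, always NUL-terminated
 * when @len > 0). Unknown bits are skipped. Returns @buf.
 */
static char *wedge_methods_to_string(unsigned long methods, char *buf, size_t len)
{
	size_t used = 0;
	unsigned int bit;

	if (len == 0)
		return buf;
	buf[0] = '\0';

	for (bit = 0; bit < 8 * sizeof(methods); bit++) {
		const char *name;
		int n;

		if (!(methods & BIT(bit)))
			continue;
		name = wedge_recovery_name(BIT(bit));
		if (!name)
			continue;

		n = snprintf(buf + used, len - used, "%s%s", used ? "," : "", name);
		if (n < 0 || (size_t)n >= len - used) {
			buf[used] = '\0';	/* drop the partially written name */
			break;
		}
		used += (size_t)n;
	}
	return buf;
}
```

Because each method is a separate bit, a driver could in principle advertise cold reset together with other methods, and a consumer that does not recognise bit 4 would simply skip it in a loop like the one above.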