From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 09387C87FCA for ; Fri, 25 Jul 2025 05:50:15 +0000 (UTC) Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id AFB3710E425; Fri, 25 Jul 2025 05:50:15 +0000 (UTC) Authentication-Results: gabe.freedesktop.org; dkim=pass (2048-bit key; unprotected) header.d=intel.com header.i=@intel.com header.b="Z9gQxLqt"; dkim-atps=neutral Received: from mgamail.intel.com (mgamail.intel.com [192.198.163.13]) by gabe.freedesktop.org (Postfix) with ESMTPS id 8BF3D10E425; Fri, 25 Jul 2025 05:50:14 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1753422614; x=1784958614; h=date:from:to:cc:subject:message-id:references: mime-version:in-reply-to; bh=o86HXC8fS175u4T+p5NLxbkvugG8XecNDwVMDSSC+QA=; b=Z9gQxLqttE6Y41xl4uSHvTMvnAVh/wxSf0qdGjsjLDAhhZeyqGrgcyuZ ia3KAe45YKHTtxOqA/GSjw6xyUXdkMDrG0H5bGIDDacry0JOoSw8u/XZ+ 1IE/+pNnPvMQOkiaBLplEs4AUszij4jJl3vqc5hjf++5/1Lp+gG78NvEt DzKHi9GgX4gAiLgF2pUdq8p6yxi0OTQmlS1BvXDRm2BcJnbgLEbbbCsvM TLUuFRlmHQ4jROCJI8Lfrs/TITsFQb1qyvXeRjLKFWdi7t/3kmy01ki39 q+a8QCZ9wC/DFZSnwaQHVqOiJSqcOqtt4j+cxY6wU/3+1y4NLpJ3XvgXC g==; X-CSE-ConnectionGUID: t3D+j2owTuyJQ8bk/fpOoA== X-CSE-MsgGUID: ROtHAntbQA2p88DrcYx40A== X-IronPort-AV: E=McAfee;i="6800,10657,11501"; a="58371623" X-IronPort-AV: E=Sophos;i="6.16,338,1744095600"; d="scan'208";a="58371623" Received: from fmviesa010.fm.intel.com ([10.60.135.150]) by fmvoesa107.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jul 2025 22:50:14 -0700 X-CSE-ConnectionGUID: mmatlTa/TpuHp5ohl2/gfg== X-CSE-MsgGUID: YfGJA1/8Smiw1RzWguPaDw== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.16,338,1744095600"; d="scan'208";a="161453275" Received: from black.fi.intel.com ([10.237.72.28]) by fmviesa010.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Jul 2025 22:50:11 -0700 Date: Fri, 25 Jul 2025 08:50:08 +0300 From: Raag Jadav To: Riana Tauro Cc: intel-xe@lists.freedesktop.org, dri-devel@lists.freedesktop.org, anshuman.gupta@intel.com, rodrigo.vivi@intel.com, lucas.demarchi@intel.com, aravind.iddamsetty@linux.intel.com, umesh.nerlige.ramappa@intel.com, frank.scarbrough@intel.com, sk.anirban@intel.com, simona.vetter@ffwll.ch Subject: Re: [PATCH v6 6/9] drm/xe/doc: Document device wedged and runtime survivability Message-ID: References: <20250724143440.232862-1-riana.tauro@intel.com> <20250724143440.232862-7-riana.tauro@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20250724143440.232862-7-riana.tauro@intel.com> X-BeenThere: intel-xe@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Intel Xe graphics driver List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: intel-xe-bounces@lists.freedesktop.org Sender: "Intel-xe" On Thu, Jul 24, 2025 at 08:04:35PM +0530, Riana Tauro wrote: > Add documentation for vendor specific device wedged recovery method > and runtime survivability. ... > /** > + * DOC: Xe Device Wedging > + * > + * Xe driver uses drm device wedged uevent as documented in Documentation/gpu/drm-uapi.rst. > + * > + * When device is in wedged state, every IOCTL will be blocked and GT cannot be > + * used. Certain critical errors like gt reset failure, firmware failures can cause > + * the device to be wedged. The default recovery method for a wedged state > + * is rebind/bus-reset. > + * > + * Another recovery method is vendor-specific. Below are the usecases that trigger > + * vendor-specific drm device wedged uevent and the procedure to be performed > + * to recover the device. > + * > + * Case 1: CSC firmware errors require a firmware flash to restore normal device > + * operation. Since firmware flash is a vendor-specific action > + * ``WEDGED=vendor-specific`` recovery method along with > + * :ref:`runtime survivability mode ` is used to > + * notify userspace. User can then initiate a firmware flash using userspace tools > + * like fwupd to restore device to normal situation. s/situation/operation Reviewed-by: Raag Jadav