From: Rodrigo Vivi <rodrigo.vivi@intel.com>
To: Asahi Lina <lina@asahilina.net>
Cc: Lucas De Marchi <lucas.demarchi@intel.com>,
Raag Jadav <raag.jadav@intel.com>, <airlied@gmail.com>,
<daniel@ffwll.ch>, <thomas.hellstrom@linux.intel.com>,
<jani.nikula@linux.intel.com>, <joonas.lahtinen@linux.intel.com>,
<tursulin@ursulin.net>, <intel-xe@lists.freedesktop.org>,
<intel-gfx@lists.freedesktop.org>,
<dri-devel@lists.freedesktop.org>,
<himal.prasad.ghimiray@intel.com>, <francois.dugast@intel.com>,
<aravind.iddamsetty@linux.intel.com>, <anshuman.gupta@intel.com>
Subject: Re: [PATCH v4 1/3] drm: Introduce device wedged event
Date: Mon, 9 Sep 2024 16:43:15 -0400 [thread overview]
Message-ID: <Zt9d4wDYEhRLJ_RZ@intel.com> (raw)
In-Reply-To: <4c9ce216-9762-4dfd-80c7-fe285cae05c2@asahilina.net>
On Sun, Sep 08, 2024 at 11:08:39PM +0900, Asahi Lina wrote:
>
>
> On 9/8/24 12:07 AM, Lucas De Marchi wrote:
> > On Sat, Sep 07, 2024 at 08:38:30PM GMT, Asahi Lina wrote:
> >>
> >>
> >> On 9/6/24 6:42 PM, Raag Jadav wrote:
> >>> Introduce device wedged event, which will notify userspace of wedged
> >>> (hanged/unusable) state of the DRM device through a uevent. This is
> >>> useful especially in cases where the device is in unrecoverable state
> >>> and requires userspace intervention for recovery.
> >>>
> >>> Purpose of this implementation is to be vendor agnostic. Userspace
> >>> consumers (sysadmin) can define udev rules to parse this event and
> >>> take respective action to recover the device.
> >>>
> >>> Consumer expectations:
> >>> ----------------------
> >>> 1) Unbind driver
> >>> 2) Reset bus device
> >>> 3) Re-bind driver
> >>
> >> Is this supposed to be normative? For drm/asahi we have a "wedged"
> >> concept (firmware crashed), but the only possible recovery action is a
> >> full system reboot (which might still be desirable to allow userspace to
> >> trigger automatically in some scenarios) since there is no bus-level
> >> reset and no firmware reload possible.
> >
> > maybe let drivers hint possible/supported recovery mechanisms and then
> > sysadmin chooses what to do?
>
> How would we do this? A textual value for the event or something like
> that? ("WEDGED=bus-reset" vs "WEDGED=reboot"?)
Looks like a good idea.
Although in our case it is not just a 'bus-reset' but unbind+bus_reset+rebind,
but that should be okay to have 'bus-reset' kind of text and driver
to document the meaning.
>
> ~~ Lina
next prev parent reply other threads:[~2024-09-09 20:43 UTC|newest]
Thread overview: 25+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-09-06 9:42 [PATCH v4 0/3] Introduce DRM device wedged event Raag Jadav
2024-09-06 9:42 ` [PATCH v4 1/3] drm: Introduce " Raag Jadav
2024-09-07 11:38 ` Asahi Lina
2024-09-07 15:07 ` Lucas De Marchi
2024-09-08 14:08 ` Asahi Lina
2024-09-09 20:01 ` Lucas De Marchi
2024-09-10 15:53 ` Raag Jadav
2024-09-10 16:06 ` Lucas De Marchi
2024-09-09 20:43 ` Rodrigo Vivi [this message]
2024-09-09 21:53 ` Matt Roper
2024-09-10 15:49 ` Raag Jadav
2024-09-24 9:37 ` Simona Vetter
2024-09-06 9:42 ` [PATCH v4 2/3] drm/xe: Use " Raag Jadav
2024-09-06 9:42 ` [PATCH v4 3/3] drm/i915: " Raag Jadav
2024-09-06 9:51 ` ✓ CI.Patch_applied: success for Introduce DRM device wedged event (rev2) Patchwork
2024-09-06 9:51 ` ✗ CI.checkpatch: warning " Patchwork
2024-09-06 9:53 ` ✓ CI.KUnit: success " Patchwork
2024-09-06 10:05 ` ✓ CI.Build: " Patchwork
2024-09-06 10:07 ` ✓ CI.Hooks: " Patchwork
2024-09-06 10:08 ` ✗ CI.checksparse: warning " Patchwork
2024-09-06 10:23 ` ✓ CI.BAT: success " Patchwork
2024-09-06 10:52 ` ✗ Fi.CI.CHECKPATCH: warning " Patchwork
2024-09-06 10:52 ` ✗ Fi.CI.SPARSE: " Patchwork
2024-09-06 10:59 ` ✓ Fi.CI.BAT: success " Patchwork
2024-09-10 8:53 ` ✗ Fi.CI.IGT: failure " Patchwork
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Zt9d4wDYEhRLJ_RZ@intel.com \
--to=rodrigo.vivi@intel.com \
--cc=airlied@gmail.com \
--cc=anshuman.gupta@intel.com \
--cc=aravind.iddamsetty@linux.intel.com \
--cc=daniel@ffwll.ch \
--cc=dri-devel@lists.freedesktop.org \
--cc=francois.dugast@intel.com \
--cc=himal.prasad.ghimiray@intel.com \
--cc=intel-gfx@lists.freedesktop.org \
--cc=intel-xe@lists.freedesktop.org \
--cc=jani.nikula@linux.intel.com \
--cc=joonas.lahtinen@linux.intel.com \
--cc=lina@asahilina.net \
--cc=lucas.demarchi@intel.com \
--cc=raag.jadav@intel.com \
--cc=thomas.hellstrom@linux.intel.com \
--cc=tursulin@ursulin.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.