From: Rodrigo Vivi <rodrigo.vivi@intel.com>
To: Linus Torvalds <torvalds@linux-foundation.org>
Cc: "Maarten Lankhorst" <dev@lankhorst.se>,
"Matthew Brost" <matthew.brost@intel.com>,
"Thomas Hellström" <thomas.hellstrom@linux.intel.com>,
"David Airlie" <airlied@gmail.com>,
"Simona Vetter" <simona@ffwll.ch>,
intel-xe@lists.freedesktop.org,
dri-devel <dri-devel@lists.freedesktop.org>
Subject: Re: drm: xe: Kernel-submitted job timed out
Date: Fri, 22 May 2026 16:44:02 -0400 [thread overview]
Message-ID: <ahDAErDSE-SFA0oM@intel.com> (raw)
In-Reply-To: <CAHk-=wj5aE1yVMb8RM2AuNUHSMnQDaGT+Vy+At5ds1vxBA+Kng@mail.gmail.com>
On Fri, May 22, 2026 at 12:05:35PM -0700, Linus Torvalds wrote:
> On Fri, 22 May 2026 at 11:55, Maarten Lankhorst <dev@lankhorst.se> wrote:
> >
> > There's a
> > May 22 11:09:19 3970x kernel: xe 0000:4b:00.0: [drm] Tile0: GT0: Timedout job: seqno=4485322, lrc_seqno=4485322, guc_id=0, flags=0x73 in no process [-1]
> > May 22 11:09:19 3970x kernel: xe 0000:4b:00.0: [drm] Xe device coredump has been created
> > May 22 11:09:19 3970x kernel: xe 0000:4b:00.0: [drm] Check your /sys/class/drm/card0/device/devcoredump/data
> >
> > Do you have this coredump too?
>
> Nope. I was assuming it didn't survive the reboot.
It doesn't. In this kind of setup the best way to deal with devcoredump
is to create a udev rule that copies the data file to a persistent place.
>
> (This machine doesn't allow any remote logins - very much on purpose -
> so when the GPU hangs, it's toast).
Any journal saving the kernel buf log of previous boots? Preferably with
some drm.debug flags enabled 0xf likely
Also:
Any bisect possible in this setup? I imagine it might be painful though...
What was the last drm-fixes pull you got in this 7.1.0-rc3-00073-ga6920214ba75 ?
I believe the quickest path might be to simply drop the xe fixes you might
have recently gotten there while we don't identify the culprit.
Thanks,
Rodrigo.
>
> Linus
next prev parent reply other threads:[~2026-05-22 20:44 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-05-22 18:52 drm: xe: Kernel-submitted job timed out Linus Torvalds
2026-05-22 18:55 ` Maarten Lankhorst
2026-05-22 19:05 ` Linus Torvalds
2026-05-22 20:44 ` Rodrigo Vivi [this message]
2026-05-22 20:54 ` Linus Torvalds
2026-05-23 8:29 ` Maarten Lankhorst
2026-05-23 14:48 ` Linus Torvalds
2026-06-09 16:30 ` Matthew Brost
2026-06-11 13:46 ` Rodrigo Vivi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ahDAErDSE-SFA0oM@intel.com \
--to=rodrigo.vivi@intel.com \
--cc=airlied@gmail.com \
--cc=dev@lankhorst.se \
--cc=dri-devel@lists.freedesktop.org \
--cc=intel-xe@lists.freedesktop.org \
--cc=matthew.brost@intel.com \
--cc=simona@ffwll.ch \
--cc=thomas.hellstrom@linux.intel.com \
--cc=torvalds@linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox