From: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
To: Thorsten Leemhuis <regressions@leemhuis.info>
Cc: "Mark Somerville" <mark@qpok.net>,
stable@vger.kernel.org, "Mario Limonciello" <superm1@kernel.org>,
regressions@lists.linux.dev,
"Alex Deucher" <alexander.deucher@amd.com>,
"Christian König" <christian.koenig@amd.com>,
"Xinhui Pan" <Xinhui.Pan@amd.com>,
"Sasha Levin" <sashal@kernel.org>
Subject: Re: [REGRESSION] Unable to pass AMD RX 6400 GPU via VFIO
Date: Fri, 20 Mar 2026 13:34:17 +0100
Message-ID: <2026032040-yelp-antidote-3f60@gregkh>
In-Reply-To: <2fd785ce-bf37-4875-a9a4-92422870f5b5@leemhuis.info>

On Fri, Mar 20, 2026 at 12:42:04PM +0100, Thorsten Leemhuis wrote:
> @greg/@sasha: I might be missing something, but looks like one patch
> that was backported missed two series where it's needed (see below for
> details):
>
> On 3/20/26 10:28, Mark Somerville wrote:
> >
> > I run Debian 13 stable (6.12 kernel) and have encountered a regression.
> >
> > My machine has three GPUs, the iGPU that is part of my 7950X and two dGPUs - one NVIDIA 3090 and one AMD RX 6400. I use the iGPU for the host and only use the two dGPUs with virtual machines via VFIO with libvirt.
> >
> > Although I have specified kernel parameters vfio_pci.ids for the GPUs, I have not blacklisted the amdgpu driver so that the host iGPU can operate. Previously, starting a VM with the RX 6400 dGPU assigned to it (via VFIO) would work fine. However, doing this with more recent stable kernels causes the machine to hang immediately (and then, ultimately, reset after a while - ~30s). No errors are logged, at least as things are configured just now.
> >
> > I can reliably reproduce this crash and a bisection revealed the commit that introduced the problem: 8140ac7c55e75093a01c6110a2c4025fe7177c57.
>
> That is 28695ca09d3264 ("drm/amd: Clean up kfd node on surprise
> disconnect") [v6.19-rc6, v6.18.7, v6.12.67 (as 8140ac7c55e750), v6.6.122].
>
> A fix for that is f7afda7fcd169a ("drm/amd: Fix hang on amdgpu unload by
> using pci_dev_is_disconnected()") [v7.0-rc1, v6.18.17, v6.12.77].
>
> @greg/@sasha: Wondering why it's not in 6.19.y and 6.6.y. It failed
> there first, but later was applied to 6.18.y and 6.12.y:
>
> https://lore.kernel.org/all/?q=%22Fix+hang+on+amdgpu+unload+by+using+pci_dev_is_disconnected%22+%28f%3Agreg+OR+f%3Asasha%29

It's in the queue for 6.6.y, and I've queued it up for 6.19.y now too.

thanks,

greg k-h
Thread overview: 6+ messages
2026-03-20 9:28 [REGRESSION] Unable to pass AMD RX 6400 GPU via VFIO Mark Somerville
2026-03-20 11:42 ` Thorsten Leemhuis
2026-03-20 12:34 ` Greg Kroah-Hartman [this message]
2026-03-20 11:42 ` Mario Limonciello
2026-03-20 13:33 ` Mark Somerville
2026-03-20 13:51 ` Mario Limonciello (AMD) (kernel.org)