linux-pci.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* commit 54f45a30c0 ("PCI/MSI: Add startup/shutdown for per device domains") causing boot hangs on my laptop
@ 2025-10-02 17:04 Kenneth Crudup
  2025-10-02 22:28 ` Inochi Amaoto
                   ` (2 more replies)
  0 siblings, 3 replies; 28+ messages in thread
From: Kenneth Crudup @ 2025-10-02 17:04 UTC (permalink / raw)
  To: inochiama; +Cc: tglx, bhelgaas, unicorn_wang, linux-pci, Me


I'm running Linus' master (as of 7f7072574).

I bisected it to the above named commit (but had to back out ba9d484ed3 
(""PCI/MSI: Remove the conditional parent [un]mask logic") and then 
727e914bbfbbd ("PCI/MSI: Check MSI_FLAG_PCI_MSI_MASK_PARENT in 
cond_[startup|shutdown]_parent()") first for a clean revert.)

I have a Dell XPS-9320 laptop, and booting would hang before it switched 
to the xe video driver from the EFI FB driver (not sure if this is a 
symptom or partial cause) and I'd see a message akin to "not being able 
to set up DP tunnels, destroying" as the last thing printed before it 
hangs. (If it's important to see those messages I believe I can force a 
pstore crash to get them where they can be saved off, let me know).

LMK if you need further info,

-Kenny

-- 
Kenneth R. Crudup / Sr. SW Engineer, Scott County Consulting, Orange 
County CA


^ permalink raw reply	[flat|nested] 28+ messages in thread
[parent not found: <lhrbiugb4o0da3rtcvl0aduk.1759451570558@email.android.com>]
* Re: commit 54f45a30c0 ("PCI/MSI: Add startup/shutdown for per device domains") causing boot hangs on my laptop
@ 2025-10-03  0:58 Kenneth Crudup
  2025-10-03  1:06 ` Inochi Amaoto
  0 siblings, 1 reply; 28+ messages in thread
From: Kenneth Crudup @ 2025-10-03  0:58 UTC (permalink / raw)
  To: Inochi Amaoto; +Cc: tglx, bhelgaas, unicorn_wang, linux-pci


Resending to re-add linux-pci (Vger thinks my tablet's MUA is "Spammy")

I'm going to figure out which line is is that's killing my NVMe IRQs.

FWIW, my NVMe is behind a VMD bridge(? I guess that's what it is):

----
-[0000:00]-+-00.0  Intel Corporation 12th Gen Core Processor Host 
Bridge/DRAM Registers
            +-02.0  Intel Corporation Alder Lake-P GT2 [Iris Xe Graphics]
            +-04.0  Intel Corporation Alder Lake Innovation Platform 
Framework Processor Participant
            +-05.0  Intel Corporation Alder Lake Imaging Signal Processor
            +-06.0  Intel Corporation RST VMD Managed Controller
            +-07.0-[01-3f]--
            +-07.2-[40-7e]----00.0-[41-7e]--+-00.0-[42]--
            |                               +-01.0-[43-56]--
            |                               +-02.0-[57-6a]--
            |                               +-03.0-[6b-7d]--
            |                               \-04.0-[7e]--
            +-08.0  Intel Corporation 12th Gen Core Processor Gaussian & 
Neural Accelerator
            +-0d.0  Intel Corporation Alder Lake-P Thunderbolt 4 USB 
Controller
            +-0d.2  Intel Corporation Alder Lake-P Thunderbolt 4 NHI #0
            +-0d.3  Intel Corporation Alder Lake-P Thunderbolt 4 NHI #1
            +-0e.0  Intel Corporation Volume Management Device NVMe RAID 
Controller
            +-12.0  Intel Corporation Alder Lake-P Integrated Sensor Hub
            +-14.0  Intel Corporation Alder Lake PCH USB 3.2 xHCI Host 
Controller
            +-14.2  Intel Corporation Alder Lake PCH Shared SRAM
            +-14.3  Intel Corporation Alder Lake-P PCH CNVi WiFi
            +-15.0  Intel Corporation Alder Lake PCH Serial IO I2C 
Controller #0
            +-15.1  Intel Corporation Alder Lake PCH Serial IO I2C 
Controller #1
            +-16.0  Intel Corporation Alder Lake PCH HECI Controller
            +-1e.0  Intel Corporation Alder Lake PCH UART #0
            +-1e.3  Intel Corporation Alder Lake SPI Controller
            +-1f.0  Intel Corporation Alder Lake PCH eSPI Controller
            +-1f.3  Intel Corporation Alder Lake PCH-P High Definition 
Audio Controller
            +-1f.4  Intel Corporation Alder Lake PCH-P SMBus Host Controller
            \-1f.5  Intel Corporation Alder Lake-P PCH SPI Controller
-[10000:e0]---06.0-[e1]----00.0  SK hynix Platinum P41/PC801 NVMe Solid 
State Drive
----

On 10/2/25 17:42, Kenneth R. Crudup wrote:
> 
> Yeah, it's definitely IRQ/NVMe related (this is a Google Lens 
> transcription of a camera picture, but it's close enough):
> 
> ----
> 1206] thunderbolt 0000:00:00.3: 0:5 <-> 1:13 (DP): not active, tearing down
> T167] thunderbolt 0000:00:00.3: 0:6 <-> 1:13 (DP): not active, tearing down
> 
> 199] nume nume0: 1/0 tag 20 (1014) QID O timeout, completion polled
> 
> 11511 nume nume0: 20/0/0 default/read/poll queues
> -----
> 
> ... and it does limp forward and continues a bit, then oops in an IRQ 
> routine somewhere (getting that next).
> 
> So, anything I can try to solve this? (I've since updated to Linus' 
> master as of a few mins ago, FWIW).
> 
> Thanks, Kenny
> 
> --
> Sent from my Tab S9+
> 
> 
> -------- Original message --------
> From: Inochi Amaoto <inochiama@gmail.com>
> Date: 10/2/25 16:46 (GMT-08:00)
> To: Kenneth Crudup <kenny@panix.com>, Inochi Amaoto <inochiama@gmail.com>
> Cc: tglx@linutronix.de, bhelgaas@google.com, unicorn_wang@outlook.com, 
> linux-pci@vger.kernel.org
> Subject: Re: commit 54f45a30c0 ("PCI/MSI: Add startup/shutdown for per 
> device domains") causing boot hangs on my laptop
> 
> On Thu, Oct 02, 2025 at 04:42:40PM -0700, Kenneth Crudup wrote:
>  >
>  > On 10/2/25 16:37, Inochi Amaoto wrote:
>  >
>  > > I think it is good to have some more information like call trace to 
> know
>  > > whether is caused by this change, or the side effect from other commit.
>  >
>  > Yeah, let me make a branch with the commits back in place, then see 
> if I can
>  > get the traces in pstore.
>  >
>  > > I also suggest adding someone related to the xe driver ...
>  > Nah, I honestly think it may be related to VMD or my NVMe; it's like 
> it does
>  > everything it can except do disk I/O.
>  >
> 
> If this is related to the NVMe, I think you can check dmesg to see if 
> there is
> some log like "nvme nvme0: I/O tag XXX (XXX) QID XX timeout, completion 
> polled",
> which indicate the NVMe is broken.
> 
> Regards,
> Inochi
> 

-- 
Kenneth R. Crudup / Sr. SW Engineer, Scott County Consulting, Orange 
County CA


^ permalink raw reply	[flat|nested] 28+ messages in thread

end of thread, other threads:[~2025-10-30 22:27 UTC | newest]

Thread overview: 28+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-10-02 17:04 commit 54f45a30c0 ("PCI/MSI: Add startup/shutdown for per device domains") causing boot hangs on my laptop Kenneth Crudup
2025-10-02 22:28 ` Inochi Amaoto
2025-10-02 22:50   ` Kenneth Crudup
2025-10-02 23:37     ` Inochi Amaoto
2025-10-02 23:42       ` Kenneth Crudup
2025-10-02 23:45         ` Inochi Amaoto
2025-10-03  1:06           ` Kenneth Crudup
2025-10-13 21:18 ` Bjorn Helgaas
2025-10-13 21:19   ` Bjorn Helgaas
2025-10-28 22:52     ` Bjorn Helgaas
2025-10-30 22:27       ` Bjorn Helgaas
2025-10-14  0:13 ` Inochi Amaoto
2025-10-14  1:05   ` Kenneth Crudup
2025-10-14  3:10     ` Inochi Amaoto
     [not found] <lhrbiugb4o0da3rtcvl0aduk.1759451570558@email.android.com>
     [not found] ` <68df1c05.050a0220.31415.096bSMTPIN_ADDED_MISSING@mx.google.com>
2025-10-03  0:53   ` commit 54f45a30c0 ("PCI/MSI: Add startup/shutdown for per device? " Inochi Amaoto
2025-10-03  1:00     ` Kenneth Crudup
  -- strict thread matches above, loose matches on Subject: below --
2025-10-03  0:58 commit 54f45a30c0 ("PCI/MSI: Add startup/shutdown for per device " Kenneth Crudup
2025-10-03  1:06 ` Inochi Amaoto
2025-10-03  1:08   ` Kenneth Crudup
2025-10-03  1:10   ` Kenneth Crudup
2025-10-03  1:18     ` Inochi Amaoto
2025-10-03  1:30       ` Inochi Amaoto
2025-10-03  1:51         ` Kenneth Crudup
2025-10-03  2:03           ` Inochi Amaoto
2025-10-03  5:36             ` Kenneth Crudup
2025-10-03 12:01               ` Inochi Amaoto
2025-10-03 13:18                 ` Kenneth Crudup
2025-10-04  2:09                   ` Inochi Amaoto

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).