public inbox for linux-pci@vger.kernel.org
 help / color / mirror / Atom feed
* IWL errors when reading PCI config through /sys
@ 2024-11-03 12:52 Jan Šídlo
  2024-11-04 21:22 ` Jan Šídlo
  2024-11-04 23:33 ` Bjorn Helgaas
  0 siblings, 2 replies; 5+ messages in thread
From: Jan Šídlo @ 2024-11-03 12:52 UTC (permalink / raw)
  To: linux-pci

Hello,

I'm not sure if this is the right place - if not, I'm sorry! It is the first time I'm trying to join a linux
mailing list so I may have missed something or I may have done something incorrectly. I'm not even sure if
this is the right way to send a message, but I have to start somewhere :)

I'm trying to hunt down few issues with my new-ish HP ZBook not wanting to go to deeper C-stsates, which is
kind of painful for a laptop (battery drain is ~5-10%/hour). For this I created a little python script that
gathers all the info about all the components from the system and periodically reports the status (every 3s or
so) including PCI and USB devices. To gather some information (specifically about ASPM) I'm reading /config
file for each PCI device in /sys device tree and parsing it. I'm not reading only /config but it is a prime
suspect, because I excluded WLAN card from this reading routine and the crash took much longer to occur -
hours instead of minutes.

When I run this script, the IWL subsystem crashes after some time (minutes to hours). There is clearly
something going on the PCI bus that I don't really understand. Since the error I get from IWL is changing, I
suspect there is some kind of race condition that is triggered by my script. I opened a bug [1] and after some
back and forth with Emmanuel Grumbach, he said that this kind of error is caused by IWL not being able to talk
to the WLAN device (at all) and to try to get your opinion on the matter :)

I have tried two different kernel versions (6.11.5 and 6.10.10), two different WLAN cards (BE200NGW and
AX211NGW) and multiple versions of firmware for the cards. The error is still present, so I would say I'd need
to dig deeper, but I'm not really familiar with PCI subsystem and how to debug it efficiently given the amount
of data going through.

What can I do to debug this issue further?

Thanks
Jan

1 - https://bugzilla.kernel.org/show_bug.cgi?id=219457

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2024-11-05 14:34 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-11-03 12:52 IWL errors when reading PCI config through /sys Jan Šídlo
2024-11-04 21:22 ` Jan Šídlo
2024-11-04 23:33 ` Bjorn Helgaas
2024-11-05  0:24   ` Jan Šídlo
2024-11-05 14:34     ` Bjorn Helgaas

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox