From: "Neftin, Sasha" <sasha.neftin@intel.com>
To: Kyle Sanderson <kyle.leet@gmail.com>,
<intel-wired-lan@lists.osuosl.org>, <jesse.brandeburg@intel.com>,
<anthony.l.nguyen@intel.com>,
"MP, Sureshkumar" <sureshkumar.mp@intel.com>
Cc: netdev@vger.kernel.org,
Linux-Kernel <linux-kernel@vger.kernel.org>,
Greg KH <gregkh@linuxfoundation.org>,
"Ruinskiy, Dima" <dima.ruinskiy@intel.com>,
Linus Torvalds <torvalds@linux-foundation.org>,
"Avivi, Amir" <amir.avivi@intel.com>,
therbert@google.com
Subject: Re: [Intel-wired-lan] igc: 5.10.146 Kernel BUG at 0xffffffff813ce19f
Date: Wed, 28 Dec 2022 09:04:48 +0200 [thread overview]
Message-ID: <70eea40e-808c-e9ee-9aab-617ebe67d67c@intel.com> (raw)
In-Reply-To: <CACsaVZLh0WFu1p7TUxE=RwucoTcZwsfQ5+ivorcbwCiRneeVFg@mail.gmail.com>
On 12/28/2022 06:45, Kyle Sanderson wrote:
> hi Intel IGC Maintainers,
>
> I've managed to reproduce this issue on 5.15.85 (same steps to
> reproduce), and have symbols and line numbers in the below panic.
> There's no device support in 5.4 for this hardware, so I was unable to
> reproduce the issue there in igc.
>
> From the Kernel BUG_ON, it's being asked to read beyond the array
> size. The min call looks very suspicious (igb, and other drives don't
> appear to do that), but I don't know if that's where the issue is.
>
> Please let me know if there's anything more I can do to help.
I do not know if it is an SW problem.
1. Does the problem reproduce on the latest upstream kernel?(worth to check)
2. I do not see this crash in our labs. I haven't a platform with six
i226 parts.(Trying find folks who work with this platform.)
3. I am working on a patch to address .ndo_tx_timeout support. (pass the
reset task to netdev while the link disconnected during traffic, under
testing). It could be related and worth checking - please, let me know
if you want to apply on your platform (against upstream). Reach us
(Dima, Amir, and me) directly off the list.
>
> [ 223.725003] igc 0000:01:00.0 eth0: Reset adapter
> [ 233.139441] kernel BUG at lib/dynamic_queue_limits.c:27!
> [ 233.146814] invalid opcode: 0000 [#1] SMP NOPTI
> [ 233.146816] refcount_t: saturated; leaking memory.
> [ 233.146833] WARNING: CPU: 0 PID: 0 at lib/refcount.c:19
> refcount_warn_saturate+0x97/0x110
> [ 233.153243] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G W
> 5.15.85-amd64-vyos #1
> [ 233.159216] Modules linked in:
> [ 233.168451] Hardware name: Default string Default string/Default
> string, BIOS 5.19 09/23/2022
> [ 233.177895] wireguard
> [ 233.181645] RIP: 0010:dql_completed+0x12f/0x140
> [ 233.191360] curve25519_x86_64
> [ 233.194406] Code: cf c9 00 48 89 57 58 e9 54 ff ff ff 85 ed 40 0f
> 95 c5 41 39 d8 41 0f 95 c0 44 84 c5 74 04 85 d2 78 0a 44 89 d8 e9 36
> ff ff ff <0f> 0b 01 f6 44 89 da 29 f2 0f 48 d0 eb 8d cc cc cc 41 56 49
> 89 f3
> [ 233.199767] libcurve25519_generic
> [ 233.203540] RSP: 0018:ffffa85dc0134e20 EFLAGS: 00010283
> [ 233.225248] libchacha20poly1305
> [ 233.229417]
> [ 233.229417] RAX: 0000000000000001 RBX: ffff934002104b40 RCX: 00000000000005ea
> [ 233.235539] chacha_x86_64
> [ 233.239508] RDX: ffff934002110000 RSI: 0000000000001d92 RDI: ffff93400211a200
> [ 233.241606] poly1305_x86_64
> [ 233.249796] RBP: 0000000000000000 R08: 000000000004ad4e R09: 0000000000000000
> [ 233.253226] ip6_udp_tunnel
> [ 233.261445] R10: 000000000004b338 R11: ffffffffbabfee80 R12: 0000000000001d92
> [ 233.261446] R13: ffff934002104b40 R14: ffffa85dc09d1450 R15: 00000000ffffffa6
> [ 233.265054] udp_tunnel
> [ 233.273314] FS: 0000000000000000(0000) GS:ffff934f3fe80000(0000)
> knlGS:0000000000000000
> [ 233.276826] libchacha
> [ 233.285023] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 233.285025] CR2: 00007f294393fe84 CR3: 0000000605e10000 CR4: 0000000000350ee0
> [ 233.285026] Call Trace:
> [ 233.285026] <IRQ>
> [ 233.285027] igc_poll+0x19d/0x14b0 [igc]
> [ 233.293242] vrf
> [ 233.296396] __napi_poll+0x22/0x110
> [ 233.305688] nft_masq
> [ 233.308763] net_rx_action+0xe9/0x250
> [ 233.315455] nf_nat_tftp
> [ 233.323756] ? igc_msix_ring+0x51/0x60 [igc]
> [ 233.326946] nf_conntrack_tftp
> [ 233.329661] __do_softirq+0xb8/0x1e9
> [ 233.334471] nf_nat_sip
> [ 233.336991] irq_exit_rcu+0x84/0xb0
> [ 233.341290] nf_conntrack_sip
> [ 233.344284] common_interrupt+0x78/0x90
> [ 233.348778] nf_nat_pptp
> [ 233.352104] </IRQ>
> [ 233.357240] nf_conntrack_pptp
> [ 233.361052] <TASK>
> [ 233.365360] nf_nat_h323
> [ 233.368484] asm_common_interrupt+0x22/0x40
> [ 233.372723] nf_conntrack_h323
> [ 233.376363] RIP: 0010:cpuidle_enter_state+0xb5/0x2a0
> [ 233.380952] nf_nat_ftp
> [ 233.384155] Code: c1 48 b2 ff 65 8b 3d b2 58 49 46 e8 65 47 b2 ff
> 31 ff 49 89 c5 e8 6b 52 b2 ff 45 84 f6 0f 85 85 01 00 00 fb 66 0f 1f
> 44 00 00 <45> 85 ff 0f 88 bb 00 00 00 49 63 c7 4c 2b 2c 24 48 8d 14 40
> 48 8d
> [ 233.386840] nf_conntrack_ftp
> [ 233.390553] RSP: 0018:ffffa85dc00efea8 EFLAGS: 00000246
> [ 233.393224] nft_objref
> [ 233.396340]
> [ 233.396340] RAX: ffff934f3fea3440 RBX: 0000000000000003 RCX: 000000000000001f
> [ 233.401256] nft_counter
> [ 233.404981] RDX: 0000000000000000 RSI: 0000000046ec0743 RDI: 0000000000000000
> [ 233.410769] nft_ct
> [ 233.413816] RBP: ffff934f3feac910 R08: 00000036481c5d1b R09: 0000003605db0041
> [ 233.435320] nft_chain_nat
> [ 233.438947] R10: 0000000000000072 R11: 0000000000000164 R12: ffffffffba6ccb40
> [ 233.445014] nf_nat
> [ 233.448065] R13: 00000036481c5d1b R14: 0000000000000000 R15: 0000000000000003
> [ 233.450073] nf_tables
> [ 233.458210] ? cpuidle_enter_state+0xa5/0x2a0
> [ 233.461335] nfnetlink_cthelper
> [ 233.469449] cpuidle_enter+0x24/0x40
> [ 233.472106] nf_conntrack
> [ 233.480247] do_idle+0x1e4/0x280
> [ 233.483580] nf_defrag_ipv6
> [ 233.491703] cpu_startup_entry+0x14/0x20
> [ 233.494399] nf_defrag_ipv4
> [ 233.502517] secondary_startup_64_no_verify+0xb0/0xbb
> [ 233.505503] libcrc32c
> [ 233.510641] </TASK>
> [ 233.514474] nfnetlink
> [ 233.518787] Modules linked in: wireguard
> [ 233.522065] af_packet
> [ 233.525975] curve25519_x86_64
> [ 233.529441] x86_pkg_temp_thermal
> [ 233.534136] libcurve25519_generic
> [ 233.537612] intel_powerclamp
> [ 233.543511] libchacha20poly1305
> [ 233.546508] coretemp
> [ 233.549313] chacha_x86_64 poly1305_x86_64
> [ 233.552304] crct10dif_pclmul
> [ 233.556981] ip6_udp_tunnel udp_tunnel libchacha vrf nft_masq
> nf_nat_tftp nf_conntrack_tftp nf_nat_sip nf_conntrack_sip nf_nat_pptp
> nf_conntrack_pptp nf_nat_h323 nf_conntrack_h323 nf_nat_ftp
> nf_conntrack_ftp
> [ 233.559990] crc32_pclmul
> [ 233.563754] nft_objref nft_counter
> [ 233.567791] ghash_clmulni_intel
> [ 233.571912] nft_ct
> [ 233.575569] aesni_intel
> [ 233.579500] nft_chain_nat
> [ 233.582390] crypto_simd
> [ 233.587225] nf_nat
> [ 233.590841] cryptd
> [ 233.612012] nf_tables
> [ 233.615288] intel_cstate
> [ 233.619486] nfnetlink_cthelper
> [ 233.623388] iTCO_wdt
> [ 233.626063] nf_conntrack
> [ 233.629196] efi_pstore
> [ 233.632499] nf_defrag_ipv6
> [ 233.635597] pcspkr
> [ 233.638218] nf_defrag_ipv4
> [ 233.640825] evdev
> [ 233.643700] libcrc32c
> [ 233.646869] iTCO_vendor_support
> [ 233.650591] nfnetlink
> [ 233.653355] sg
> [ 233.656497] af_packet
> [ 233.659446] tpm_crb
> [ 233.662775] x86_pkg_temp_thermal
> [ 233.665337] tpm_tis
> [ 233.668670] intel_powerclamp
> [ 233.671144] tpm_tis_core
> [ 233.673993] coretemp
> [ 233.677768] tpm
> [ 233.680591] crct10dif_pclmul
> [ 233.682782] rng_core
> [ 233.685624] crc32_pclmul
> [ 233.688271] mei_me
> [ 233.692161] ghash_clmulni_intel
> [ 233.694799] mei
> [ 233.698290] aesni_intel
> [ 233.701384] button
> [ 233.704125] crypto_simd
> [ 233.706379] acpi_pad
> [ 233.709861] cryptd
> [ 233.712587] mpls_iptunnel
> [ 233.715682] intel_cstate
> [ 233.718177] mpls_router
> [ 233.721872] iTCO_wdt
> [ 233.724077] ip_tunnel
> [ 233.727034] efi_pstore
> [ 233.729533] br_netfilter
> [ 233.732471] pcspkr
> [ 233.735139] bridge
> [ 233.737627] evdev
> [ 233.740768] stp
> [ 233.743827] iTCO_vendor_support
> [ 233.746789] llc
> [ 233.749457] sg
> [ 233.752222] fuse
> [ 233.755071] tpm_crb
> [ 233.758113] configfs
> [ 233.760589] tpm_tis
> [ 233.763065] efivarfs
> [ 233.765437] tpm_tis_core
> [ 233.767622] ip_tables
> [ 233.771314] tpm
> [ 233.773511] x_tables
> [ 233.775607] rng_core
> [ 233.777893] autofs4
> [ 233.780456] mei_me
> [ 233.783120] usb_storage
> [ 233.785686] mei
> [ 233.788319] ohci_hcd
> [ 233.791358] button
> [ 233.794104] uhci_hcd
> [ 233.796287] acpi_pad
> [ 233.798948] ehci_hcd
> [ 233.801608] mpls_iptunnel
> [ 233.804146] squashfs
> [ 233.806598] mpls_router
> [ 233.809530] zstd_decompress
> [ 233.811719] ip_tunnel
> [ 233.814378] lz4_decompress
> [ 233.816841] br_netfilter
> [ 233.819492] loop
> [ 233.822152] bridge
> [ 233.824802] overlay
> [ 233.827927] stp
> [ 233.830564] ext4
> [ 233.833498] llc
> [ 233.836805] crc32c_generic
> [ 233.839557] fuse
> [ 233.842787] crc16
> [ 233.845815] configfs
> [ 233.848084] mbcache
> [ 233.850564] efivarfs
> [ 233.853117] jbd2
> [ 233.855296] ip_tables
> [ 233.857561] nls_cp437
> [ 233.859722] x_tables autofs4
> [ 233.862950] vfat
> [ 233.865216] usb_storage
> [ 233.867585] fat
> [ 233.870239] ohci_hcd uhci_hcd
> [ 233.872779] efivars
> [ 233.875414] ehci_hcd
> [ 233.877693] nls_ascii
> [ 233.880433] squashfs zstd_decompress
> [ 233.883172] hid_generic
> [ 233.886580] lz4_decompress
> [ 233.888861] usbhid
> [ 233.891803] loop
> [ 233.893980] hid
> [ 233.897493] overlay
> [ 233.900050] sd_mod
> [ 233.902702] ext4
> [ 233.905446] t10_pi
> [ 233.909612] crc32c_generic
> [ 233.912548] ahci
> [ 233.915776] crc16
> [ 233.918244] libahci
> [ 233.920540] mbcache
> [ 233.922740] crc32c_intel
> [ 233.925303] jbd2
> [ 233.927777] libata
> [ 233.930058] nls_cp437
> [ 233.932530] i2c_i801
> [ 233.935740] vfat fat
> [ 233.938022] i2c_smbus
> [ 233.940397] efivars
> [ 233.942945] xhci_pci
> [ 233.945504] nls_ascii hid_generic
> [ 233.948535] xhci_hcd
> [ 233.950814] usbhid
> [ 233.953282] scsi_mod
> [ 233.956022] hid
> [ 233.958671] scsi_common
> [ 233.961327] sd_mod t10_pi
> [ 233.964066] igc
> [ 233.966618] ahci
> [ 233.969274] thermal
> [ 233.973168] libahci
> [ 233.975830] fan
> [ 233.978310] crc32c_intel
> [ 233.980975]
> [ 233.983158] libata
> [ 233.986113] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G W
> 5.15.85-amd64-vyos #1
> [ 233.989257] i2c_i801
> [ 233.991441] Hardware name: Default string Default string/Default
> string, BIOS 5.19 09/23/2022
> [ 233.993730] i2c_smbus
> [ 233.996286] RIP: 0010:refcount_warn_saturate+0x97/0x110
> [ 233.998850] xhci_pci
> [ 234.001055] Code: 00 01 e8 cb 40 42 00 0f 0b c3 cc cc cc cc 80 3d
> 39 f4 da 00 00 75 a8 48 c7 c7 d8 13 43 ba c6 05 29 f4 da 00 01 e8 a8
> 40 42 00 <0f> 0b c3 cc cc cc cc 80 3d 13 f4 da 00 00 75 85 48 c7 c7 30
> 14 43
> [ 234.004069] xhci_hcd scsi_mod
> [ 234.005878] RSP: 0018:ffffa85dc0003ae0 EFLAGS: 00010282
> [ 234.008348] scsi_common igc
> [ 234.017611]
> [ 234.020297] thermal fan
> [ 234.029764] RAX: 0000000000000000 RBX: 0000000000005837 RCX: 0000000000000000
> [ 234.032559]
> [ 234.032585] ---[ end trace 8acd09a29bf2e660 ]---
> [ 234.038458] RDX: ffff934f3fe1f3e0 RSI: ffff934f3fe1c490 RDI: 0000000000000300
> [ 234.141617] RIP: 0010:dql_completed+0x12f/0x140
> [ 234.146459] RBP: ffff9340074b28c0 R08: 0000000000000000 R09: ffffa85dc0003908
> [ 234.150075] Code: cf c9 00 48 89 57 58 e9 54 ff ff ff 85 ed 40 0f
> 95 c5 41 39 d8 41 0f 95 c0 44 84 c5 74 04 85 d2 78 0a 44 89 d8 e9 36
> ff ff ff <0f> 0b 01 f6 44 89 da 29 f2 0f 48 d0 eb 8d cc cc cc 41 56 49
> 89 f3
> [ 234.156048] R10: ffffa85dc0003900 R11: ffffffffba6b0ce8 R12: ffff9340074b2908
> [ 234.159502] RSP: 0018:ffffa85dc0134e20 EFLAGS: 00010283
> [ 234.161442] R13: ffffffffba28eb60 R14: fffffffffffffff0 R15: ffffa85dc0003b40
> [ 234.164506]
> [ 234.172573] FS: 0000000000000000(0000) GS:ffff934f3fe00000(0000)
> knlGS:0000000000000000
> [ 234.174545] RAX: 0000000000000001 RBX: ffff934002104b40 RCX: 00000000000005ea
> [ 234.179914] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 234.188023] RDX: ffff934002110000 RSI: 0000000000001d92 RDI: ffff93400211a200
> [ 234.193301] CR2: 000055e26436ee10 CR3: 0000000605e10000 CR4: 0000000000350ef0
> [ 234.201457] RBP: 0000000000000000 R08: 000000000004ad4e R09: 0000000000000000
> [ 234.223063] Call Trace:
> [ 234.231267] R10: 000000000004b338 R11: ffffffffbabfee80 R12: 0000000000001d92
> [ 234.237398] <IRQ>
> [ 234.245613] R13: ffff934002104b40 R14: ffffa85dc09d1450 R15: 00000000ffffffa6
> [ 234.247734] __nf_conntrack_find_get+0x331/0x340 [nf_conntrack]
> [ 234.256997] FS: 0000000000000000(0000) GS:ffff934f3fe80000(0000)
> knlGS:0000000000000000
> [ 234.265245] nf_conntrack_in+0x1e1/0x760 [nf_conntrack]
> [ 234.271954] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 234.280252] nf_hook_slow+0x37/0xb0
> [ 234.288537] CR2: 00007f294393fe84 CR3: 000000011da48000 CR4: 0000000000350ee0
> [ 234.296788] nf_hook_slow_list+0x8c/0x130
> [ 234.300002] Kernel panic - not syncing: Fatal exception in interrupt
> [ 234.308339] ip_sublist_rcv+0x1fa/0x220
> [ 234.319422] Kernel Offset: 0x38600000 from 0xffffffff81000000
> (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
> [ 234.494681] ---[ end Kernel panic - not syncing: Fatal exception in
> interrupt ]---
>
> Kyle.
>
> On Tue, Dec 20, 2022 at 10:29 AM Kyle Sanderson <kyle.leet@gmail.com> wrote:
>>
>> re-sending as plain text - my apologies.
>>
>>> On Sun, 18 Dec 2022, 23:31 Neftin, Sasha wrote:
>>> What is a board in use (LAN on board or NIC)?
>>> What is lspci, lspci -t and lspci -s 0000:[lan bus:device.function] -vvv output?
>>
>> It's embedded on the board, could very well be on a bridge though as a
>> card. The box has 6 ports, 2 were in-use while testing.
>>
>> 00:00.0 Host bridge: Intel Corporation Device 4522 (rev 01)
>> 00:02.0 VGA compatible controller: Intel Corporation Elkhart Lake [UHD
>> Graphics Gen11 16EU] (rev 01)
>> 00:08.0 System peripheral: Intel Corporation Device 4511 (rev 01)
>> 00:14.0 USB controller: Intel Corporation Device 4b7d (rev 11)
>> 00:14.2 RAM memory: Intel Corporation Device 4b7f (rev 11)
>> 00:16.0 Communication controller: Intel Corporation Device 4b70 (rev 11)
>> 00:17.0 SATA controller: Intel Corporation Device 4b63 (rev 11)
>> 00:1c.0 PCI bridge: Intel Corporation Device 4b38 (rev 11)
>> 00:1c.1 PCI bridge: Intel Corporation Device 4b39 (rev 11)
>> 00:1c.2 PCI bridge: Intel Corporation Device 4b3a (rev 11)
>> 00:1c.3 PCI bridge: Intel Corporation Device 4b3b (rev 11)
>> 00:1c.4 PCI bridge: Intel Corporation Device 4b3c (rev 11)
>> 00:1c.6 PCI bridge: Intel Corporation Device 4b3e (rev 11)
>> 00:1f.0 ISA bridge: Intel Corporation Device 4b00 (rev 11)
>> 00:1f.3 Audio device: Intel Corporation Device 4b58 (rev 11)
>> 00:1f.4 SMBus: Intel Corporation Device 4b23 (rev 11)
>> 00:1f.5 Serial bus controller: Intel Corporation Device 4b24 (rev 11)
>> 01:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 02:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 03:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 04:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 05:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 06:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>>
>> -[0000:00]-+-00.0
>> +-02.0
>> +-08.0
>> +-14.0
>> +-14.2
>> +-16.0
>> +-17.0
>> +-1c.0-[01]----00.0
>> +-1c.1-[02]----00.0
>> +-1c.2-[03]----00.0
>> +-1c.3-[04]----00.0
>> +-1c.4-[05]----00.0
>> +-1c.6-[06]----00.0
>> +-1f.0
>> +-1f.3
>> +-1f.4
>> \-1f.5
>>
>>
>> 01:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> Subsystem: Intel Corporation Device 0000
>> Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
>> Stepping- SERR- FastB2B- DisINTx+
>> Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
>> <TAbort- <MAbort- >SERR- <PERR- INTx-
>> Latency: 0
>> Interrupt: pin A routed to IRQ 16
>> Region 0: Memory at 80600000 (32-bit, non-prefetchable) [size=1M]
>> Region 3: Memory at 80700000 (32-bit, non-prefetchable) [size=16K]
>> Capabilities: [40] Power Management version 3
>> Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
>> Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=1 PME-
>> Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
>> Address: 0000000000000000 Data: 0000
>> Masking: 00000000 Pending: 00000000
>> Capabilities: [70] MSI-X: Enable+ Count=5 Masked-
>> Vector table: BAR=3 offset=00000000
>> PBA: BAR=3 offset=00002000
>> Capabilities: [a0] Express (v2) Endpoint, MSI 00
>> DevCap: MaxPayload 512 bytes, PhantFunc 0, Latency L0s <512ns, L1 <64us
>> ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset+ SlotPowerLimit 0W
>> DevCtl: CorrErr+ NonFatalErr+ FatalErr+ UnsupReq+
>> RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ FLReset-
>> MaxPayload 128 bytes, MaxReadReq 512 bytes
>> DevSta: CorrErr- NonFatalErr- FatalErr- UnsupReq- AuxPwr+ TransPend-
>> LnkCap: Port #0, Speed 5GT/s, Width x1, ASPM L1, Exit Latency L1 <4us
>> ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
>> LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
>> ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
>> LnkSta: Speed 5GT/s, Width x1
>> TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
>> DevCap2: Completion Timeout: Range ABCD, TimeoutDis+ NROPrPrP- LTR+
>> 10BitTagComp- 10BitTagReq- OBFF Not Supported, ExtFmt- EETLPPrefix-
>> EmergencyPowerReduction Not Supported, EmergencyPowerReductionInit-
>> FRS- TPHComp- ExtTPHComp-
>> AtomicOpsCap: 32bit- 64bit- 128bitCAS-
>> DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis- LTR+
>> 10BitTagReq- OBFF Disabled,
>> AtomicOpsCtl: ReqEn-
>> LnkCtl2: Target Link Speed: 5GT/s, EnterCompliance- SpeedDis-
>> Transmit Margin: Normal Operating Range, EnterModifiedCompliance-
>> ComplianceSOS-
>> Compliance Preset/De-emphasis: -6dB de-emphasis, 0dB preshoot
>> LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-
>> EqualizationPhase1-
>> EqualizationPhase2- EqualizationPhase3- LinkEqualizationRequest-
>> Retimer- 2Retimers- CrosslinkRes: unsupported
>> Capabilities: [100 v2] Advanced Error Reporting
>> UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF-
>> MalfTLP- ECRC- UnsupReq- ACSViol-
>> UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF-
>> MalfTLP- ECRC- UnsupReq- ACSViol-
>> UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+
>> MalfTLP+ ECRC- UnsupReq- ACSViol-
>> CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr-
>> CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr+
>> AERCap: First Error Pointer: 00, ECRCGenCap+ ECRCGenEn- ECRCChkCap+ ECRCChkEn-
>> MultHdrRecCap- MultHdrRecEn- TLPPfxPres- HdrLogCap-
>> HeaderLog: 00000000 00000000 00000000 00000000
>> Capabilities: [140 v1] Device Serial Number e4-3a-6e-ff-ff-5d-bb-54
>> Capabilities: [1c0 v1] Latency Tolerance Reporting
>> Max snoop latency: 3145728ns
>> Max no snoop latency: 3145728ns
>> Capabilities: [1f0 v1] Precision Time Measurement
>> PTMCap: Requester:+ Responder:- Root:-
>> PTMClockGranularity: 4ns
>> PTMControl: Enabled:- RootSelected:-
>> PTMEffectiveGranularity: Unknown
>> Capabilities: [1e0 v1] L1 PM Substates
>> L1SubCap: PCI-PM_L1.2+ PCI-PM_L1.1+ ASPM_L1.2+ ASPM_L1.1+ L1_PM_Substates+
>> PortCommonModeRestoreTime=55us PortTPowerOnTime=70us
>> L1SubCtl1: PCI-PM_L1.2- PCI-PM_L1.1- ASPM_L1.2- ASPM_L1.1-
>> T_CommonMode=0us LTR1.2_Threshold=81920ns
>> L1SubCtl2: T_PwrOn=50us
>> Kernel driver in use: igc
>> Kernel modules: igc
>>
>> On Sun, Dec 18, 2022 at 10:31 PM Neftin, Sasha <sasha.neftin@intel.com> wrote:
>>>
>>> On 12/16/2022 00:28, Kyle Sanderson wrote:
>>>> (Un)fortunately I can reproduce this bug by simply removing the
>>>> ethernet cable from the box while there is traffic flowing. kprint
>>>> below from a console line. Please CC / to me for any additional
>>>> information I can provide for this panic.
>>> What is a board in use (LAN on board or NIC)? What is lspci, lspci -t
>>> and lspci -s 0000:[lan bus:device.function] -vvv output?
>>>>
>>>> [ 156.707054] igc 0000:01:00.0 eth0: NIC Link is Down
>>>> [ 156.712981] br-lan: port 1(eth0) entered disabled state
>>>> [ 156.719246] igc 0000:01:00.0 eth0: Register Dump
>>>> [ 156.724784] igc 0000:01:00.0 eth0: Register Name Value
>>>> [ 156.731067] igc 0000:01:00.0 eth0: CTRL 181c0641
>>>> [ 156.737607] igc 0000:01:00.0 eth0: STATUS 00380681
>>>> [ 156.744133] igc 0000:01:00.0 eth0: CTRL_EXT 100000c0
>>>> [ 156.750759] igc 0000:01:00.0 eth0: MDIC 18017949
>>>> [ 156.757258] igc 0000:01:00.0 eth0: ICR 00000001
>>>> [ 156.763785] igc 0000:01:00.0 eth0: RCTL 0440803a
>>>> [ 156.770324] igc 0000:01:00.0 eth0: RDLEN[0-3] 00001000
>>>> 00001000 00001000 00001000
>>>> [ 156.779457] igc 0000:01:00.0 eth0: RDH[0-3] 000000ef
>>>> 000000a1 00000092 000000ba
>>>> [ 156.788500] igc 0000:01:00.0 eth0: RDT[0-3] 000000ee
>>>> 000000a0 00000091 000000b9
>>>> [ 156.797650] igc 0000:01:00.0 eth0: RXDCTL[0-3] 02040808
>>>> 02040808 02040808 02040808
>>>> [ 156.806688] igc 0000:01:00.0 eth0: RDBAL[0-3] 02f43000
>>>> 02180000 02e7f000 02278000
>>>> [ 156.815781] igc 0000:01:00.0 eth0: RDBAH[0-3] 00000001
>>>> 00000001 00000001 00000001
>>>> [ 156.824928] igc 0000:01:00.0 eth0: TCTL a503f0fa
>>>> [ 156.831587] igc 0000:01:00.0 eth0: TDBAL[0-3] 02f43000
>>>> 02180000 02e7f000 02278000
>>>> [ 156.840637] igc 0000:01:00.0 eth0: TDBAH[0-3] 00000001
>>>> 00000001 00000001 00000001
>>>> [ 156.849753] igc 0000:01:00.0 eth0: TDLEN[0-3] 00001000
>>>> 00001000 00001000 00001000
>>>> [ 156.858760] igc 0000:01:00.0 eth0: TDH[0-3] 000000d4
>>>> 0000003d 000000af 0000002a
>>>> [ 156.867771] igc 0000:01:00.0 eth0: TDT[0-3] 000000e4
>>>> 0000005a 000000c8 0000002a
>>>> [ 156.876864] igc 0000:01:00.0 eth0: TXDCTL[0-3] 02100108
>>>> 02100108 02100108 02100108
>>>> [ 156.885905] igc 0000:01:00.0 eth0: Reset adapter
>>>> [ 160.307195] igc 0000:01:00.0 eth0: NIC Link is Up 1000 Mbps Full
>>>> Duplex, Flow Control: RX/TX
>>>> [ 160.317974] br-lan: port 1(eth0) entered blocking state
>>>> [ 160.324532] br-lan: port 1(eth0) entered forwarding state
>>>> [ 161.197263] ------------[ cut here ]------------
>>>> [ 161.202669] Kernel BUG at 0xffffffff813ce19f [verbose debug info unavailable]
>>>> [ 161.210769] invalid opcode: 0000 [#1] SMP NOPTI
>>>> [ 161.216022] CPU: 1 PID: 0 Comm: swapper/1 Not tainted 5.10.146 #0
>>>> [ 161.222980] Hardware name: Default string Default string/Default
>>>> string, BIOS 5.19 09/23/2022
>>>> [ 161.232546] RIP: 0010:0xffffffff813ce19f
>>>> [ 161.237167] Code: 03 01 4c 89 48 58 e9 2f ff ff ff 85 db 41 0f 95
>>>> c2 45 39 d9 41 0f 95 c1 45 84 ca 74 05 45 85 e4 78 0a 44 89 c2 e9 10
>>>> ff ff ff <0f> 0b 01 d2 45 89 c1 41 29 d1 ba 00 00 00 00 44 0f 48 ca eb
>>>> 80 cc
>>>> [ 161.258651] RSP: 0018:ffffc90000118e88 EFLAGS: 00010283
>>>> [ 161.264736] RAX: ffff888101f8f200 RBX: ffffc900006f9bd0 RCX: 000000000000050e
>>>> [ 161.272837] RDX: ffff888101fec000 RSI: 0000000000000a1c RDI: 0000000000061a10
>>>> [ 161.280942] RBP: ffffc90000118ef8 R08: 0000000000000000 R09: 0000000000061502
>>>> [ 161.289089] R10: 0000000000000000 R11: 0000000000000000 R12: 00000000ffffff3f
>>>> [ 161.297229] R13: ffff888101f8f140 R14: 0000000000000000 R15: ffff888100ad9b00
>>>> [ 161.305345] FS: 0000000000000000(0000) GS:ffff88903fe80000(0000)
>>>> knlGS:00000 00000000000
>>>> [ 161.314492] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>>>> [ 161.321139] CR2: 00007f941ad43a9b CR3: 000000000340a000 CR4: 0000000000350ee0
>>>> [ 161.329284] Call Trace:
>>>> [ 161.332373] <IRQ>
>>>> [ 161.334981] ? 0xffffffffa0185f78 [igc@00000000f400031b+0x13000]
>>>> [ 161.341949] 0xffffffff8185b047
>>>> [ 161.345797] 0xffffffff8185b2ca
>>>> [ 161.349637] 0xffffffff81e000bb
>>>> [ 161.353465] 0xffffffff81c0109f
>>>> [ 161.357304] </IRQ>
>>>> [ 161.359988] 0xffffffff8102cdac
>>>> [ 161.363783] 0xffffffff810bfdaf
>>>> [ 161.367584] 0xffffffff81a2e616
>>>> [ 161.371374] 0xffffffff81c00c9e
>>>> [ 161.375192] RIP: 0010:0xffffffff817e331b
>>>> [ 161.379840] Code: 21 90 ff 65 8b 3d 45 23 83 7e e8 80 20 90 ff 31
>>>> ff 49 89 c6 e8 26 2d 90 ff 80 7d d7 00 0f 85 9e 01 00 00 fb 66 0f 1f
>>>> 44 00 00 <45> 85 ff 0f 88 cf 00 00 00 49 63 cf 48 8d 04 49 48 8d 14 81
>>>> 48 c1
>>>> [ 161.401397] RSP: 0018:ffffc900000d3e80 EFLAGS: 00000246
>>>> [ 161.407493] RAX: ffff88903fea5180 RBX: ffff88903feadf00 RCX: 000000000000001f
>>>> [ 161.415648] RDX: 0000000000000000 RSI: 0000000046ec0743 RDI: 0000000000000000
>>>> [ 161.423811] RBP: ffffc900000d3eb8 R08: 00000025881a3b81 R09: ffff888100317340
>>>> [ 161.432003] R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000003
>>>> [ 161.440154] R13: ffffffff824c7bc0 R14: 00000025881a3b81 R15: 0000000000000003
>>>> [ 161.448285] 0xffffffff817e357f
>>>> [ 161.452123] 0xffffffff810e6258
>>>> [ 161.455938] 0xffffffff810e63fb
>>>> [ 161.459746] 0xffffffff8104bec0
>>>> [ 161.463526] 0xffffffff810000f5
>>>> [ 161.467290] Modules linked in: pppoe ppp_async nft_fib_inet
>>>> nf_flow_table_ipv 6 nf_flow_table_ipv4 nf_flow_table_inet wireguard
>>>> pppox ppp_generic nft_reject_i pv6 nft_reject_ipv4 nft_reject_inet
>>>> nft_reject nft_redir nft_quota nft_objref nf t_numgen nft_nat nft_masq
>>>> nft_log nft_limit nft_hash nft_flow_offload nft_fib_ip v6 nft_fib_ipv4
>>>> nft_fib nft_ct nft_counter nft_chain_nat nf_tables nf_nat nf_flo
>>>> w_table nf_conntrack libchacha20poly1305 curve25519_x86_64
>>>> chacha_x86_64 slhc r8 169 poly1305_x86_64 nfnetlink nf_reject_ipv6
>>>> nf_reject_ipv4 nf_log_ipv6 nf_log_i pv4 nf_log_common nf_defrag_ipv6
>>>> nf_defrag_ipv4 libcurve25519_generic libcrc32c libchacha igc forcedeth
>>>> e1000e crc_ccitt bnx2 i2c_dev ixgbe e1000 amd_xgbe ip6_u dp_tunnel
>>>> udp_tunnel mdio nls_utf8 ena kpp nls_iso8859_1 nls_cp437 vfat fat igb
>>>> button_hotplug tg3 ptp realtek pps_core mii
>>>> [ 161.550507] ---[ end trace b1cb18ab2d1741bd ]---
>>>> [ 161.555938] RIP: 0010:0xffffffff813ce19f
>>>> [ 161.560634] Code: 03 01 4c 89 48 58 e9 2f ff ff ff 85 db 41 0f 95
>>>> c2 45 39 d9 41 0f 95 c1 45 84 ca 74 05 45 85 e4 78 0a 44 89 c2 e9 10
>>>> ff ff ff <0f> 0b 01 d2 45 89 c1 41 29 d1 ba 00 00 00 00 44 0f 48 ca eb
>>>> 80 cc
>>>> [ 161.582281] RSP: 0018:ffffc90000118e88 EFLAGS: 00010283
>>>> [ 161.588426] RAX: ffff888101f8f200 RBX: ffffc900006f9bd0 RCX: 000000000000050e
>>>> [ 161.596668] RDX: ffff888101fec000 RSI: 0000000000000a1c RDI: 0000000000061a10
>>>> [ 161.604860] RBP: ffffc90000118ef8 R08: 0000000000000000 R09: 0000000000061502
>>>> [ 161.613052] R10: 0000000000000000 R11: 0000000000000000 R12: 00000000ffffff3f
>>>> [ 161.621291] R13: ffff888101f8f140 R14: 0000000000000000 R15: ffff888100ad9b00
>>>> [ 161.629505] FS: 0000000000000000(0000) GS:ffff88903fe80000(0000)
>>>> knlGS:00000 00000000000
>>>> [ 161.638781] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>>>> [ 161.645549] CR2: 00007f941ad43a9b CR3: 000000000340a000 CR4: 0000000000350ee0
>>>> [ 161.653841] Kernel panic - not syncing: Fatal exception in interrupt
>>>> [ 161.661287] Kernel Offset: disabled
>>>> [ 161.665644] Rebooting in 3 seconds..
>>>> [ 164.670313] ACPI MEMORY or I/O RESET_REG.
>>>>
>>>> Kyle.
>>>> _______________________________________________
>>>> Intel-wired-lan mailing list
>>>> Intel-wired-lan@osuosl.org
>>>> https://lists.osuosl.org/mailman/listinfo/intel-wired-lan
>>>
_______________________________________________
Intel-wired-lan mailing list
Intel-wired-lan@osuosl.org
https://lists.osuosl.org/mailman/listinfo/intel-wired-lan
next prev parent reply other threads:[~2022-12-28 7:07 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-12-15 22:28 [Intel-wired-lan] igc: 5.10.146 Kernel BUG at 0xffffffff813ce19f Kyle Sanderson
2022-12-18 0:20 ` Kyle Sanderson
2022-12-19 6:30 ` Neftin, Sasha
2022-12-19 23:08 ` Kyle Sanderson
2022-12-20 18:29 ` Kyle Sanderson
2022-12-28 4:45 ` Kyle Sanderson
2022-12-28 7:04 ` Neftin, Sasha [this message]
2022-12-28 22:34 ` Kyle Sanderson
2022-12-29 2:48 ` Kyle Sanderson
2022-12-29 4:12 ` MP, Sureshkumar
2022-12-29 4:28 ` Kyle Sanderson
2022-12-29 9:21 ` MP, Sureshkumar
2022-12-30 0:49 ` Kyle Sanderson
2023-01-04 5:23 ` Kyle Sanderson
2023-01-11 14:11 ` Ruinskiy, Dima
2023-01-13 20:33 ` Kyle Sanderson
2023-01-15 9:13 ` Ruinskiy, Dima
2023-02-07 7:23 ` Kyle Sanderson
2023-03-11 1:47 ` [Intel-wired-lan] igc: 5.15.98 Kernel PANIC on igc_down Kyle Sanderson
2023-03-12 7:17 ` Neftin, Sasha
2023-03-27 7:22 ` Kyle Sanderson
2023-03-28 6:05 ` Kyle Sanderson
2023-03-29 16:03 ` Zulkifli, Muhammad Husaini
2023-04-10 7:07 ` Zulkifli, Muhammad Husaini
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=70eea40e-808c-e9ee-9aab-617ebe67d67c@intel.com \
--to=sasha.neftin@intel.com \
--cc=amir.avivi@intel.com \
--cc=anthony.l.nguyen@intel.com \
--cc=dima.ruinskiy@intel.com \
--cc=gregkh@linuxfoundation.org \
--cc=intel-wired-lan@lists.osuosl.org \
--cc=jesse.brandeburg@intel.com \
--cc=kyle.leet@gmail.com \
--cc=linux-kernel@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=sureshkumar.mp@intel.com \
--cc=therbert@google.com \
--cc=torvalds@linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox