Intel-Wired-Lan Archive on lore.kernel.org
 help / color / mirror / Atom feed
From: "Neftin, Sasha" <sasha.neftin@intel.com>
To: Kyle Sanderson <kyle.leet@gmail.com>,
	<intel-wired-lan@lists.osuosl.org>,  <jesse.brandeburg@intel.com>,
	<anthony.l.nguyen@intel.com>,
	"MP, Sureshkumar" <sureshkumar.mp@intel.com>
Cc: netdev@vger.kernel.org,
	Linux-Kernel <linux-kernel@vger.kernel.org>,
	Greg KH <gregkh@linuxfoundation.org>,
	"Ruinskiy, Dima" <dima.ruinskiy@intel.com>,
	Linus Torvalds <torvalds@linux-foundation.org>,
	"Avivi, Amir" <amir.avivi@intel.com>,
	therbert@google.com
Subject: Re: [Intel-wired-lan] igc: 5.10.146 Kernel BUG at 0xffffffff813ce19f
Date: Wed, 28 Dec 2022 09:04:48 +0200	[thread overview]
Message-ID: <70eea40e-808c-e9ee-9aab-617ebe67d67c@intel.com> (raw)
In-Reply-To: <CACsaVZLh0WFu1p7TUxE=RwucoTcZwsfQ5+ivorcbwCiRneeVFg@mail.gmail.com>

On 12/28/2022 06:45, Kyle Sanderson wrote:
> hi Intel IGC Maintainers,
> 
> I've managed to reproduce this issue on 5.15.85 (same steps to
> reproduce), and have symbols and line numbers in the below panic.
> There's no device support in 5.4 for this hardware, so I was unable to
> reproduce the issue there in igc.
> 
>  From the Kernel BUG_ON, it's being asked to read beyond the array
> size. The min call looks very suspicious (igb, and other drives don't
> appear to do that), but I don't know if that's where the issue is.
> 
> Please let me know if there's anything more I can do to help.
I do not know if it is an SW problem.
1. Does the problem reproduce on the latest upstream kernel?(worth to check)
2. I do not see this crash in our labs. I haven't a platform with six 
i226 parts.(Trying find folks who work with this platform.)
3. I am working on a patch to address .ndo_tx_timeout support. (pass the 
reset task to netdev while the link disconnected during traffic, under 
testing). It could be related and worth checking - please, let me know 
if you want to apply on your platform (against upstream). Reach us 
(Dima, Amir, and me) directly off the list.
> 
> [  223.725003] igc 0000:01:00.0 eth0: Reset adapter
> [  233.139441] kernel BUG at lib/dynamic_queue_limits.c:27!
> [  233.146814] invalid opcode: 0000 [#1] SMP NOPTI
> [  233.146816] refcount_t: saturated; leaking memory.
> [  233.146833] WARNING: CPU: 0 PID: 0 at lib/refcount.c:19
> refcount_warn_saturate+0x97/0x110
> [  233.153243] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G        W
>    5.15.85-amd64-vyos #1
> [  233.159216] Modules linked in:
> [  233.168451] Hardware name: Default string Default string/Default
> string, BIOS 5.19 09/23/2022
> [  233.177895]  wireguard
> [  233.181645] RIP: 0010:dql_completed+0x12f/0x140
> [  233.191360]  curve25519_x86_64
> [  233.194406] Code: cf c9 00 48 89 57 58 e9 54 ff ff ff 85 ed 40 0f
> 95 c5 41 39 d8 41 0f 95 c0 44 84 c5 74 04 85 d2 78 0a 44 89 d8 e9 36
> ff ff ff <0f> 0b 01 f6 44 89 da 29 f2 0f 48 d0 eb 8d cc cc cc 41 56 49
> 89 f3
> [  233.199767]  libcurve25519_generic
> [  233.203540] RSP: 0018:ffffa85dc0134e20 EFLAGS: 00010283
> [  233.225248]  libchacha20poly1305
> [  233.229417]
> [  233.229417] RAX: 0000000000000001 RBX: ffff934002104b40 RCX: 00000000000005ea
> [  233.235539]  chacha_x86_64
> [  233.239508] RDX: ffff934002110000 RSI: 0000000000001d92 RDI: ffff93400211a200
> [  233.241606]  poly1305_x86_64
> [  233.249796] RBP: 0000000000000000 R08: 000000000004ad4e R09: 0000000000000000
> [  233.253226]  ip6_udp_tunnel
> [  233.261445] R10: 000000000004b338 R11: ffffffffbabfee80 R12: 0000000000001d92
> [  233.261446] R13: ffff934002104b40 R14: ffffa85dc09d1450 R15: 00000000ffffffa6
> [  233.265054]  udp_tunnel
> [  233.273314] FS:  0000000000000000(0000) GS:ffff934f3fe80000(0000)
> knlGS:0000000000000000
> [  233.276826]  libchacha
> [  233.285023] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  233.285025] CR2: 00007f294393fe84 CR3: 0000000605e10000 CR4: 0000000000350ee0
> [  233.285026] Call Trace:
> [  233.285026]  <IRQ>
> [  233.285027]  igc_poll+0x19d/0x14b0 [igc]
> [  233.293242]  vrf
> [  233.296396]  __napi_poll+0x22/0x110
> [  233.305688]  nft_masq
> [  233.308763]  net_rx_action+0xe9/0x250
> [  233.315455]  nf_nat_tftp
> [  233.323756]  ? igc_msix_ring+0x51/0x60 [igc]
> [  233.326946]  nf_conntrack_tftp
> [  233.329661]  __do_softirq+0xb8/0x1e9
> [  233.334471]  nf_nat_sip
> [  233.336991]  irq_exit_rcu+0x84/0xb0
> [  233.341290]  nf_conntrack_sip
> [  233.344284]  common_interrupt+0x78/0x90
> [  233.348778]  nf_nat_pptp
> [  233.352104]  </IRQ>
> [  233.357240]  nf_conntrack_pptp
> [  233.361052]  <TASK>
> [  233.365360]  nf_nat_h323
> [  233.368484]  asm_common_interrupt+0x22/0x40
> [  233.372723]  nf_conntrack_h323
> [  233.376363] RIP: 0010:cpuidle_enter_state+0xb5/0x2a0
> [  233.380952]  nf_nat_ftp
> [  233.384155] Code: c1 48 b2 ff 65 8b 3d b2 58 49 46 e8 65 47 b2 ff
> 31 ff 49 89 c5 e8 6b 52 b2 ff 45 84 f6 0f 85 85 01 00 00 fb 66 0f 1f
> 44 00 00 <45> 85 ff 0f 88 bb 00 00 00 49 63 c7 4c 2b 2c 24 48 8d 14 40
> 48 8d
> [  233.386840]  nf_conntrack_ftp
> [  233.390553] RSP: 0018:ffffa85dc00efea8 EFLAGS: 00000246
> [  233.393224]  nft_objref
> [  233.396340]
> [  233.396340] RAX: ffff934f3fea3440 RBX: 0000000000000003 RCX: 000000000000001f
> [  233.401256]  nft_counter
> [  233.404981] RDX: 0000000000000000 RSI: 0000000046ec0743 RDI: 0000000000000000
> [  233.410769]  nft_ct
> [  233.413816] RBP: ffff934f3feac910 R08: 00000036481c5d1b R09: 0000003605db0041
> [  233.435320]  nft_chain_nat
> [  233.438947] R10: 0000000000000072 R11: 0000000000000164 R12: ffffffffba6ccb40
> [  233.445014]  nf_nat
> [  233.448065] R13: 00000036481c5d1b R14: 0000000000000000 R15: 0000000000000003
> [  233.450073]  nf_tables
> [  233.458210]  ? cpuidle_enter_state+0xa5/0x2a0
> [  233.461335]  nfnetlink_cthelper
> [  233.469449]  cpuidle_enter+0x24/0x40
> [  233.472106]  nf_conntrack
> [  233.480247]  do_idle+0x1e4/0x280
> [  233.483580]  nf_defrag_ipv6
> [  233.491703]  cpu_startup_entry+0x14/0x20
> [  233.494399]  nf_defrag_ipv4
> [  233.502517]  secondary_startup_64_no_verify+0xb0/0xbb
> [  233.505503]  libcrc32c
> [  233.510641]  </TASK>
> [  233.514474]  nfnetlink
> [  233.518787] Modules linked in: wireguard
> [  233.522065]  af_packet
> [  233.525975]  curve25519_x86_64
> [  233.529441]  x86_pkg_temp_thermal
> [  233.534136]  libcurve25519_generic
> [  233.537612]  intel_powerclamp
> [  233.543511]  libchacha20poly1305
> [  233.546508]  coretemp
> [  233.549313]  chacha_x86_64 poly1305_x86_64
> [  233.552304]  crct10dif_pclmul
> [  233.556981]  ip6_udp_tunnel udp_tunnel libchacha vrf nft_masq
> nf_nat_tftp nf_conntrack_tftp nf_nat_sip nf_conntrack_sip nf_nat_pptp
> nf_conntrack_pptp nf_nat_h323 nf_conntrack_h323 nf_nat_ftp
> nf_conntrack_ftp
> [  233.559990]  crc32_pclmul
> [  233.563754]  nft_objref nft_counter
> [  233.567791]  ghash_clmulni_intel
> [  233.571912]  nft_ct
> [  233.575569]  aesni_intel
> [  233.579500]  nft_chain_nat
> [  233.582390]  crypto_simd
> [  233.587225]  nf_nat
> [  233.590841]  cryptd
> [  233.612012]  nf_tables
> [  233.615288]  intel_cstate
> [  233.619486]  nfnetlink_cthelper
> [  233.623388]  iTCO_wdt
> [  233.626063]  nf_conntrack
> [  233.629196]  efi_pstore
> [  233.632499]  nf_defrag_ipv6
> [  233.635597]  pcspkr
> [  233.638218]  nf_defrag_ipv4
> [  233.640825]  evdev
> [  233.643700]  libcrc32c
> [  233.646869]  iTCO_vendor_support
> [  233.650591]  nfnetlink
> [  233.653355]  sg
> [  233.656497]  af_packet
> [  233.659446]  tpm_crb
> [  233.662775]  x86_pkg_temp_thermal
> [  233.665337]  tpm_tis
> [  233.668670]  intel_powerclamp
> [  233.671144]  tpm_tis_core
> [  233.673993]  coretemp
> [  233.677768]  tpm
> [  233.680591]  crct10dif_pclmul
> [  233.682782]  rng_core
> [  233.685624]  crc32_pclmul
> [  233.688271]  mei_me
> [  233.692161]  ghash_clmulni_intel
> [  233.694799]  mei
> [  233.698290]  aesni_intel
> [  233.701384]  button
> [  233.704125]  crypto_simd
> [  233.706379]  acpi_pad
> [  233.709861]  cryptd
> [  233.712587]  mpls_iptunnel
> [  233.715682]  intel_cstate
> [  233.718177]  mpls_router
> [  233.721872]  iTCO_wdt
> [  233.724077]  ip_tunnel
> [  233.727034]  efi_pstore
> [  233.729533]  br_netfilter
> [  233.732471]  pcspkr
> [  233.735139]  bridge
> [  233.737627]  evdev
> [  233.740768]  stp
> [  233.743827]  iTCO_vendor_support
> [  233.746789]  llc
> [  233.749457]  sg
> [  233.752222]  fuse
> [  233.755071]  tpm_crb
> [  233.758113]  configfs
> [  233.760589]  tpm_tis
> [  233.763065]  efivarfs
> [  233.765437]  tpm_tis_core
> [  233.767622]  ip_tables
> [  233.771314]  tpm
> [  233.773511]  x_tables
> [  233.775607]  rng_core
> [  233.777893]  autofs4
> [  233.780456]  mei_me
> [  233.783120]  usb_storage
> [  233.785686]  mei
> [  233.788319]  ohci_hcd
> [  233.791358]  button
> [  233.794104]  uhci_hcd
> [  233.796287]  acpi_pad
> [  233.798948]  ehci_hcd
> [  233.801608]  mpls_iptunnel
> [  233.804146]  squashfs
> [  233.806598]  mpls_router
> [  233.809530]  zstd_decompress
> [  233.811719]  ip_tunnel
> [  233.814378]  lz4_decompress
> [  233.816841]  br_netfilter
> [  233.819492]  loop
> [  233.822152]  bridge
> [  233.824802]  overlay
> [  233.827927]  stp
> [  233.830564]  ext4
> [  233.833498]  llc
> [  233.836805]  crc32c_generic
> [  233.839557]  fuse
> [  233.842787]  crc16
> [  233.845815]  configfs
> [  233.848084]  mbcache
> [  233.850564]  efivarfs
> [  233.853117]  jbd2
> [  233.855296]  ip_tables
> [  233.857561]  nls_cp437
> [  233.859722]  x_tables autofs4
> [  233.862950]  vfat
> [  233.865216]  usb_storage
> [  233.867585]  fat
> [  233.870239]  ohci_hcd uhci_hcd
> [  233.872779]  efivars
> [  233.875414]  ehci_hcd
> [  233.877693]  nls_ascii
> [  233.880433]  squashfs zstd_decompress
> [  233.883172]  hid_generic
> [  233.886580]  lz4_decompress
> [  233.888861]  usbhid
> [  233.891803]  loop
> [  233.893980]  hid
> [  233.897493]  overlay
> [  233.900050]  sd_mod
> [  233.902702]  ext4
> [  233.905446]  t10_pi
> [  233.909612]  crc32c_generic
> [  233.912548]  ahci
> [  233.915776]  crc16
> [  233.918244]  libahci
> [  233.920540]  mbcache
> [  233.922740]  crc32c_intel
> [  233.925303]  jbd2
> [  233.927777]  libata
> [  233.930058]  nls_cp437
> [  233.932530]  i2c_i801
> [  233.935740]  vfat fat
> [  233.938022]  i2c_smbus
> [  233.940397]  efivars
> [  233.942945]  xhci_pci
> [  233.945504]  nls_ascii hid_generic
> [  233.948535]  xhci_hcd
> [  233.950814]  usbhid
> [  233.953282]  scsi_mod
> [  233.956022]  hid
> [  233.958671]  scsi_common
> [  233.961327]  sd_mod t10_pi
> [  233.964066]  igc
> [  233.966618]  ahci
> [  233.969274]  thermal
> [  233.973168]  libahci
> [  233.975830]  fan
> [  233.978310]  crc32c_intel
> [  233.980975]
> [  233.983158]  libata
> [  233.986113] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G        W
>    5.15.85-amd64-vyos #1
> [  233.989257]  i2c_i801
> [  233.991441] Hardware name: Default string Default string/Default
> string, BIOS 5.19 09/23/2022
> [  233.993730]  i2c_smbus
> [  233.996286] RIP: 0010:refcount_warn_saturate+0x97/0x110
> [  233.998850]  xhci_pci
> [  234.001055] Code: 00 01 e8 cb 40 42 00 0f 0b c3 cc cc cc cc 80 3d
> 39 f4 da 00 00 75 a8 48 c7 c7 d8 13 43 ba c6 05 29 f4 da 00 01 e8 a8
> 40 42 00 <0f> 0b c3 cc cc cc cc 80 3d 13 f4 da 00 00 75 85 48 c7 c7 30
> 14 43
> [  234.004069]  xhci_hcd scsi_mod
> [  234.005878] RSP: 0018:ffffa85dc0003ae0 EFLAGS: 00010282
> [  234.008348]  scsi_common igc
> [  234.017611]
> [  234.020297]  thermal fan
> [  234.029764] RAX: 0000000000000000 RBX: 0000000000005837 RCX: 0000000000000000
> [  234.032559]
> [  234.032585] ---[ end trace 8acd09a29bf2e660 ]---
> [  234.038458] RDX: ffff934f3fe1f3e0 RSI: ffff934f3fe1c490 RDI: 0000000000000300
> [  234.141617] RIP: 0010:dql_completed+0x12f/0x140
> [  234.146459] RBP: ffff9340074b28c0 R08: 0000000000000000 R09: ffffa85dc0003908
> [  234.150075] Code: cf c9 00 48 89 57 58 e9 54 ff ff ff 85 ed 40 0f
> 95 c5 41 39 d8 41 0f 95 c0 44 84 c5 74 04 85 d2 78 0a 44 89 d8 e9 36
> ff ff ff <0f> 0b 01 f6 44 89 da 29 f2 0f 48 d0 eb 8d cc cc cc 41 56 49
> 89 f3
> [  234.156048] R10: ffffa85dc0003900 R11: ffffffffba6b0ce8 R12: ffff9340074b2908
> [  234.159502] RSP: 0018:ffffa85dc0134e20 EFLAGS: 00010283
> [  234.161442] R13: ffffffffba28eb60 R14: fffffffffffffff0 R15: ffffa85dc0003b40
> [  234.164506]
> [  234.172573] FS:  0000000000000000(0000) GS:ffff934f3fe00000(0000)
> knlGS:0000000000000000
> [  234.174545] RAX: 0000000000000001 RBX: ffff934002104b40 RCX: 00000000000005ea
> [  234.179914] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  234.188023] RDX: ffff934002110000 RSI: 0000000000001d92 RDI: ffff93400211a200
> [  234.193301] CR2: 000055e26436ee10 CR3: 0000000605e10000 CR4: 0000000000350ef0
> [  234.201457] RBP: 0000000000000000 R08: 000000000004ad4e R09: 0000000000000000
> [  234.223063] Call Trace:
> [  234.231267] R10: 000000000004b338 R11: ffffffffbabfee80 R12: 0000000000001d92
> [  234.237398]  <IRQ>
> [  234.245613] R13: ffff934002104b40 R14: ffffa85dc09d1450 R15: 00000000ffffffa6
> [  234.247734]  __nf_conntrack_find_get+0x331/0x340 [nf_conntrack]
> [  234.256997] FS:  0000000000000000(0000) GS:ffff934f3fe80000(0000)
> knlGS:0000000000000000
> [  234.265245]  nf_conntrack_in+0x1e1/0x760 [nf_conntrack]
> [  234.271954] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  234.280252]  nf_hook_slow+0x37/0xb0
> [  234.288537] CR2: 00007f294393fe84 CR3: 000000011da48000 CR4: 0000000000350ee0
> [  234.296788]  nf_hook_slow_list+0x8c/0x130
> [  234.300002] Kernel panic - not syncing: Fatal exception in interrupt
> [  234.308339]  ip_sublist_rcv+0x1fa/0x220
> [  234.319422] Kernel Offset: 0x38600000 from 0xffffffff81000000
> (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
> [  234.494681] ---[ end Kernel panic - not syncing: Fatal exception in
> interrupt ]---
> 
> Kyle.
> 
> On Tue, Dec 20, 2022 at 10:29 AM Kyle Sanderson <kyle.leet@gmail.com> wrote:
>>
>> re-sending as plain text - my apologies.
>>
>>> On Sun, 18 Dec 2022, 23:31 Neftin, Sasha wrote:
>>> What is a board in use (LAN on board or NIC)?
>>> What is lspci, lspci -t and lspci -s 0000:[lan bus:device.function] -vvv output?
>>
>> It's embedded on the board, could very well be on a bridge though as a
>> card. The box has 6 ports, 2 were in-use while testing.
>>
>> 00:00.0 Host bridge: Intel Corporation Device 4522 (rev 01)
>> 00:02.0 VGA compatible controller: Intel Corporation Elkhart Lake [UHD
>> Graphics Gen11 16EU] (rev 01)
>> 00:08.0 System peripheral: Intel Corporation Device 4511 (rev 01)
>> 00:14.0 USB controller: Intel Corporation Device 4b7d (rev 11)
>> 00:14.2 RAM memory: Intel Corporation Device 4b7f (rev 11)
>> 00:16.0 Communication controller: Intel Corporation Device 4b70 (rev 11)
>> 00:17.0 SATA controller: Intel Corporation Device 4b63 (rev 11)
>> 00:1c.0 PCI bridge: Intel Corporation Device 4b38 (rev 11)
>> 00:1c.1 PCI bridge: Intel Corporation Device 4b39 (rev 11)
>> 00:1c.2 PCI bridge: Intel Corporation Device 4b3a (rev 11)
>> 00:1c.3 PCI bridge: Intel Corporation Device 4b3b (rev 11)
>> 00:1c.4 PCI bridge: Intel Corporation Device 4b3c (rev 11)
>> 00:1c.6 PCI bridge: Intel Corporation Device 4b3e (rev 11)
>> 00:1f.0 ISA bridge: Intel Corporation Device 4b00 (rev 11)
>> 00:1f.3 Audio device: Intel Corporation Device 4b58 (rev 11)
>> 00:1f.4 SMBus: Intel Corporation Device 4b23 (rev 11)
>> 00:1f.5 Serial bus controller: Intel Corporation Device 4b24 (rev 11)
>> 01:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 02:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 03:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 04:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 05:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>> 06:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>>
>> -[0000:00]-+-00.0
>>             +-02.0
>>             +-08.0
>>             +-14.0
>>             +-14.2
>>             +-16.0
>>             +-17.0
>>             +-1c.0-[01]----00.0
>>             +-1c.1-[02]----00.0
>>             +-1c.2-[03]----00.0
>>             +-1c.3-[04]----00.0
>>             +-1c.4-[05]----00.0
>>             +-1c.6-[06]----00.0
>>             +-1f.0
>>             +-1f.3
>>             +-1f.4
>>             \-1f.5
>>
>>
>> 01:00.0 Ethernet controller: Intel Corporation Device 125c (rev 04)
>>   Subsystem: Intel Corporation Device 0000
>>   Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
>> Stepping- SERR- FastB2B- DisINTx+
>>   Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
>> <TAbort- <MAbort- >SERR- <PERR- INTx-
>>   Latency: 0
>>   Interrupt: pin A routed to IRQ 16
>>   Region 0: Memory at 80600000 (32-bit, non-prefetchable) [size=1M]
>>   Region 3: Memory at 80700000 (32-bit, non-prefetchable) [size=16K]
>>   Capabilities: [40] Power Management version 3
>>    Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
>>    Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=1 PME-
>>   Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
>>    Address: 0000000000000000 Data: 0000
>>    Masking: 00000000 Pending: 00000000
>>   Capabilities: [70] MSI-X: Enable+ Count=5 Masked-
>>    Vector table: BAR=3 offset=00000000
>>    PBA: BAR=3 offset=00002000
>>   Capabilities: [a0] Express (v2) Endpoint, MSI 00
>>    DevCap: MaxPayload 512 bytes, PhantFunc 0, Latency L0s <512ns, L1 <64us
>>     ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset+ SlotPowerLimit 0W
>>    DevCtl: CorrErr+ NonFatalErr+ FatalErr+ UnsupReq+
>>     RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ FLReset-
>>     MaxPayload 128 bytes, MaxReadReq 512 bytes
>>    DevSta: CorrErr- NonFatalErr- FatalErr- UnsupReq- AuxPwr+ TransPend-
>>    LnkCap: Port #0, Speed 5GT/s, Width x1, ASPM L1, Exit Latency L1 <4us
>>     ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
>>    LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
>>     ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
>>    LnkSta: Speed 5GT/s, Width x1
>>     TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
>>    DevCap2: Completion Timeout: Range ABCD, TimeoutDis+ NROPrPrP- LTR+
>>      10BitTagComp- 10BitTagReq- OBFF Not Supported, ExtFmt- EETLPPrefix-
>>      EmergencyPowerReduction Not Supported, EmergencyPowerReductionInit-
>>      FRS- TPHComp- ExtTPHComp-
>>      AtomicOpsCap: 32bit- 64bit- 128bitCAS-
>>    DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis- LTR+
>> 10BitTagReq- OBFF Disabled,
>>      AtomicOpsCtl: ReqEn-
>>    LnkCtl2: Target Link Speed: 5GT/s, EnterCompliance- SpeedDis-
>>      Transmit Margin: Normal Operating Range, EnterModifiedCompliance-
>> ComplianceSOS-
>>      Compliance Preset/De-emphasis: -6dB de-emphasis, 0dB preshoot
>>    LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-
>> EqualizationPhase1-
>>      EqualizationPhase2- EqualizationPhase3- LinkEqualizationRequest-
>>      Retimer- 2Retimers- CrosslinkRes: unsupported
>>   Capabilities: [100 v2] Advanced Error Reporting
>>    UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF-
>> MalfTLP- ECRC- UnsupReq- ACSViol-
>>    UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF-
>> MalfTLP- ECRC- UnsupReq- ACSViol-
>>    UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+
>> MalfTLP+ ECRC- UnsupReq- ACSViol-
>>    CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr-
>>    CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr+
>>    AERCap: First Error Pointer: 00, ECRCGenCap+ ECRCGenEn- ECRCChkCap+ ECRCChkEn-
>>     MultHdrRecCap- MultHdrRecEn- TLPPfxPres- HdrLogCap-
>>    HeaderLog: 00000000 00000000 00000000 00000000
>>   Capabilities: [140 v1] Device Serial Number e4-3a-6e-ff-ff-5d-bb-54
>>   Capabilities: [1c0 v1] Latency Tolerance Reporting
>>    Max snoop latency: 3145728ns
>>    Max no snoop latency: 3145728ns
>>   Capabilities: [1f0 v1] Precision Time Measurement
>>    PTMCap: Requester:+ Responder:- Root:-
>>    PTMClockGranularity: 4ns
>>    PTMControl: Enabled:- RootSelected:-
>>    PTMEffectiveGranularity: Unknown
>>   Capabilities: [1e0 v1] L1 PM Substates
>>    L1SubCap: PCI-PM_L1.2+ PCI-PM_L1.1+ ASPM_L1.2+ ASPM_L1.1+ L1_PM_Substates+
>>       PortCommonModeRestoreTime=55us PortTPowerOnTime=70us
>>    L1SubCtl1: PCI-PM_L1.2- PCI-PM_L1.1- ASPM_L1.2- ASPM_L1.1-
>>        T_CommonMode=0us LTR1.2_Threshold=81920ns
>>    L1SubCtl2: T_PwrOn=50us
>>   Kernel driver in use: igc
>>   Kernel modules: igc
>>
>> On Sun, Dec 18, 2022 at 10:31 PM Neftin, Sasha <sasha.neftin@intel.com> wrote:
>>>
>>> On 12/16/2022 00:28, Kyle Sanderson wrote:
>>>> (Un)fortunately I can reproduce this bug by simply removing the
>>>> ethernet cable from the box while there is traffic flowing. kprint
>>>> below from a console line. Please CC / to me for any additional
>>>> information I can provide for this panic.
>>> What is a board in use (LAN on board or NIC)? What is lspci, lspci -t
>>> and lspci -s 0000:[lan bus:device.function] -vvv output?
>>>>
>>>> [  156.707054] igc 0000:01:00.0 eth0: NIC Link is Down
>>>> [  156.712981] br-lan: port 1(eth0) entered disabled state
>>>> [  156.719246] igc 0000:01:00.0 eth0: Register Dump
>>>> [  156.724784] igc 0000:01:00.0 eth0: Register Name   Value
>>>> [  156.731067] igc 0000:01:00.0 eth0: CTRL            181c0641
>>>> [  156.737607] igc 0000:01:00.0 eth0: STATUS          00380681
>>>> [  156.744133] igc 0000:01:00.0 eth0: CTRL_EXT        100000c0
>>>> [  156.750759] igc 0000:01:00.0 eth0: MDIC            18017949
>>>> [  156.757258] igc 0000:01:00.0 eth0: ICR             00000001
>>>> [  156.763785] igc 0000:01:00.0 eth0: RCTL            0440803a
>>>> [  156.770324] igc 0000:01:00.0 eth0: RDLEN[0-3]      00001000
>>>> 00001000 00001000 00001000
>>>> [  156.779457] igc 0000:01:00.0 eth0: RDH[0-3]        000000ef
>>>> 000000a1 00000092 000000ba
>>>> [  156.788500] igc 0000:01:00.0 eth0: RDT[0-3]        000000ee
>>>> 000000a0 00000091 000000b9
>>>> [  156.797650] igc 0000:01:00.0 eth0: RXDCTL[0-3]     02040808
>>>> 02040808 02040808 02040808
>>>> [  156.806688] igc 0000:01:00.0 eth0: RDBAL[0-3]      02f43000
>>>> 02180000 02e7f000 02278000
>>>> [  156.815781] igc 0000:01:00.0 eth0: RDBAH[0-3]      00000001
>>>> 00000001 00000001 00000001
>>>> [  156.824928] igc 0000:01:00.0 eth0: TCTL            a503f0fa
>>>> [  156.831587] igc 0000:01:00.0 eth0: TDBAL[0-3]      02f43000
>>>> 02180000 02e7f000 02278000
>>>> [  156.840637] igc 0000:01:00.0 eth0: TDBAH[0-3]      00000001
>>>> 00000001 00000001 00000001
>>>> [  156.849753] igc 0000:01:00.0 eth0: TDLEN[0-3]      00001000
>>>> 00001000 00001000 00001000
>>>> [  156.858760] igc 0000:01:00.0 eth0: TDH[0-3]        000000d4
>>>> 0000003d 000000af 0000002a
>>>> [  156.867771] igc 0000:01:00.0 eth0: TDT[0-3]        000000e4
>>>> 0000005a 000000c8 0000002a
>>>> [  156.876864] igc 0000:01:00.0 eth0: TXDCTL[0-3]     02100108
>>>> 02100108 02100108 02100108
>>>> [  156.885905] igc 0000:01:00.0 eth0: Reset adapter
>>>> [  160.307195] igc 0000:01:00.0 eth0: NIC Link is Up 1000 Mbps Full
>>>> Duplex, Flow Control: RX/TX
>>>> [  160.317974] br-lan: port 1(eth0) entered blocking state
>>>> [  160.324532] br-lan: port 1(eth0) entered forwarding state
>>>> [  161.197263] ------------[ cut here ]------------
>>>> [  161.202669] Kernel BUG at 0xffffffff813ce19f [verbose debug info unavailable]
>>>> [  161.210769] invalid opcode: 0000 [#1] SMP NOPTI
>>>> [  161.216022] CPU: 1 PID: 0 Comm: swapper/1 Not tainted 5.10.146 #0
>>>> [  161.222980] Hardware name: Default string Default string/Default
>>>> string, BIOS 5.19 09/23/2022
>>>> [  161.232546] RIP: 0010:0xffffffff813ce19f
>>>> [  161.237167] Code: 03 01 4c 89 48 58 e9 2f ff ff ff 85 db 41 0f 95
>>>> c2 45 39 d9 41 0f 95 c1 45 84 ca 74 05 45 85 e4 78 0a 44 89 c2 e9 10
>>>> ff ff ff <0f> 0b 01 d2 45 89 c1 41 29 d1 ba 00 00 00 00 44 0f 48 ca eb
>>>> 80 cc
>>>> [  161.258651] RSP: 0018:ffffc90000118e88 EFLAGS: 00010283
>>>> [  161.264736] RAX: ffff888101f8f200 RBX: ffffc900006f9bd0 RCX: 000000000000050e
>>>> [  161.272837] RDX: ffff888101fec000 RSI: 0000000000000a1c RDI: 0000000000061a10
>>>> [  161.280942] RBP: ffffc90000118ef8 R08: 0000000000000000 R09: 0000000000061502
>>>> [  161.289089] R10: 0000000000000000 R11: 0000000000000000 R12: 00000000ffffff3f
>>>> [  161.297229] R13: ffff888101f8f140 R14: 0000000000000000 R15: ffff888100ad9b00
>>>> [  161.305345] FS:  0000000000000000(0000) GS:ffff88903fe80000(0000)
>>>> knlGS:00000 00000000000
>>>> [  161.314492] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>>>> [  161.321139] CR2: 00007f941ad43a9b CR3: 000000000340a000 CR4: 0000000000350ee0
>>>> [  161.329284] Call Trace:
>>>> [  161.332373]  <IRQ>
>>>> [  161.334981]  ? 0xffffffffa0185f78 [igc@00000000f400031b+0x13000]
>>>> [  161.341949]  0xffffffff8185b047
>>>> [  161.345797]  0xffffffff8185b2ca
>>>> [  161.349637]  0xffffffff81e000bb
>>>> [  161.353465]  0xffffffff81c0109f
>>>> [  161.357304]  </IRQ>
>>>> [  161.359988]  0xffffffff8102cdac
>>>> [  161.363783]  0xffffffff810bfdaf
>>>> [  161.367584]  0xffffffff81a2e616
>>>> [  161.371374]  0xffffffff81c00c9e
>>>> [  161.375192] RIP: 0010:0xffffffff817e331b
>>>> [  161.379840] Code: 21 90 ff 65 8b 3d 45 23 83 7e e8 80 20 90 ff 31
>>>> ff 49 89 c6 e8 26 2d 90 ff 80 7d d7 00 0f 85 9e 01 00 00 fb 66 0f 1f
>>>> 44 00 00 <45> 85 ff 0f 88 cf 00 00 00 49 63 cf 48 8d 04 49 48 8d 14 81
>>>> 48 c1
>>>> [  161.401397] RSP: 0018:ffffc900000d3e80 EFLAGS: 00000246
>>>> [  161.407493] RAX: ffff88903fea5180 RBX: ffff88903feadf00 RCX: 000000000000001f
>>>> [  161.415648] RDX: 0000000000000000 RSI: 0000000046ec0743 RDI: 0000000000000000
>>>> [  161.423811] RBP: ffffc900000d3eb8 R08: 00000025881a3b81 R09: ffff888100317340
>>>> [  161.432003] R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000003
>>>> [  161.440154] R13: ffffffff824c7bc0 R14: 00000025881a3b81 R15: 0000000000000003
>>>> [  161.448285]  0xffffffff817e357f
>>>> [  161.452123]  0xffffffff810e6258
>>>> [  161.455938]  0xffffffff810e63fb
>>>> [  161.459746]  0xffffffff8104bec0
>>>> [  161.463526]  0xffffffff810000f5
>>>> [  161.467290] Modules linked in: pppoe ppp_async nft_fib_inet
>>>> nf_flow_table_ipv 6 nf_flow_table_ipv4 nf_flow_table_inet wireguard
>>>> pppox ppp_generic nft_reject_i pv6 nft_reject_ipv4 nft_reject_inet
>>>> nft_reject nft_redir nft_quota nft_objref nf t_numgen nft_nat nft_masq
>>>> nft_log nft_limit nft_hash nft_flow_offload nft_fib_ip v6 nft_fib_ipv4
>>>> nft_fib nft_ct nft_counter nft_chain_nat nf_tables nf_nat nf_flo
>>>> w_table nf_conntrack libchacha20poly1305 curve25519_x86_64
>>>> chacha_x86_64 slhc r8 169 poly1305_x86_64 nfnetlink nf_reject_ipv6
>>>> nf_reject_ipv4 nf_log_ipv6 nf_log_i pv4 nf_log_common nf_defrag_ipv6
>>>> nf_defrag_ipv4 libcurve25519_generic libcrc32c libchacha igc forcedeth
>>>> e1000e crc_ccitt bnx2 i2c_dev ixgbe e1000 amd_xgbe ip6_u dp_tunnel
>>>> udp_tunnel mdio nls_utf8 ena kpp nls_iso8859_1 nls_cp437 vfat fat igb
>>>> button_hotplug tg3 ptp realtek pps_core mii
>>>> [  161.550507] ---[ end trace b1cb18ab2d1741bd ]---
>>>> [  161.555938] RIP: 0010:0xffffffff813ce19f
>>>> [  161.560634] Code: 03 01 4c 89 48 58 e9 2f ff ff ff 85 db 41 0f 95
>>>> c2 45 39 d9 41 0f 95 c1 45 84 ca 74 05 45 85 e4 78 0a 44 89 c2 e9 10
>>>> ff ff ff <0f> 0b 01 d2 45 89 c1 41 29 d1 ba 00 00 00 00 44 0f 48 ca eb
>>>> 80 cc
>>>> [  161.582281] RSP: 0018:ffffc90000118e88 EFLAGS: 00010283
>>>> [  161.588426] RAX: ffff888101f8f200 RBX: ffffc900006f9bd0 RCX: 000000000000050e
>>>> [  161.596668] RDX: ffff888101fec000 RSI: 0000000000000a1c RDI: 0000000000061a10
>>>> [  161.604860] RBP: ffffc90000118ef8 R08: 0000000000000000 R09: 0000000000061502
>>>> [  161.613052] R10: 0000000000000000 R11: 0000000000000000 R12: 00000000ffffff3f
>>>> [  161.621291] R13: ffff888101f8f140 R14: 0000000000000000 R15: ffff888100ad9b00
>>>> [  161.629505] FS:  0000000000000000(0000) GS:ffff88903fe80000(0000)
>>>> knlGS:00000 00000000000
>>>> [  161.638781] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>>>> [  161.645549] CR2: 00007f941ad43a9b CR3: 000000000340a000 CR4: 0000000000350ee0
>>>> [  161.653841] Kernel panic - not syncing: Fatal exception in interrupt
>>>> [  161.661287] Kernel Offset: disabled
>>>> [  161.665644] Rebooting in 3 seconds..
>>>> [  164.670313] ACPI MEMORY or I/O RESET_REG.
>>>>
>>>> Kyle.
>>>> _______________________________________________
>>>> Intel-wired-lan mailing list
>>>> Intel-wired-lan@osuosl.org
>>>> https://lists.osuosl.org/mailman/listinfo/intel-wired-lan
>>>


_______________________________________________
Intel-wired-lan mailing list
Intel-wired-lan@osuosl.org
https://lists.osuosl.org/mailman/listinfo/intel-wired-lan


  reply	other threads:[~2022-12-28  7:07 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-12-15 22:28 [Intel-wired-lan] igc: 5.10.146 Kernel BUG at 0xffffffff813ce19f Kyle Sanderson
2022-12-18  0:20 ` Kyle Sanderson
2022-12-19  6:30 ` Neftin, Sasha
2022-12-19 23:08   ` Kyle Sanderson
2022-12-20 18:29   ` Kyle Sanderson
2022-12-28  4:45     ` Kyle Sanderson
2022-12-28  7:04       ` Neftin, Sasha [this message]
2022-12-28 22:34         ` Kyle Sanderson
2022-12-29  2:48           ` Kyle Sanderson
2022-12-29  4:12             ` MP, Sureshkumar
2022-12-29  4:28               ` Kyle Sanderson
2022-12-29  9:21                 ` MP, Sureshkumar
2022-12-30  0:49                   ` Kyle Sanderson
2023-01-04  5:23                     ` Kyle Sanderson
2023-01-11 14:11                       ` Ruinskiy, Dima
2023-01-13 20:33                         ` Kyle Sanderson
2023-01-15  9:13                           ` Ruinskiy, Dima
2023-02-07  7:23                             ` Kyle Sanderson
2023-03-11  1:47                               ` [Intel-wired-lan] igc: 5.15.98 Kernel PANIC on igc_down Kyle Sanderson
2023-03-12  7:17                                 ` Neftin, Sasha
2023-03-27  7:22                                   ` Kyle Sanderson
2023-03-28  6:05                                     ` Kyle Sanderson
2023-03-29 16:03                                       ` Zulkifli, Muhammad Husaini
2023-04-10  7:07                                       ` Zulkifli, Muhammad Husaini

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=70eea40e-808c-e9ee-9aab-617ebe67d67c@intel.com \
    --to=sasha.neftin@intel.com \
    --cc=amir.avivi@intel.com \
    --cc=anthony.l.nguyen@intel.com \
    --cc=dima.ruinskiy@intel.com \
    --cc=gregkh@linuxfoundation.org \
    --cc=intel-wired-lan@lists.osuosl.org \
    --cc=jesse.brandeburg@intel.com \
    --cc=kyle.leet@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=netdev@vger.kernel.org \
    --cc=sureshkumar.mp@intel.com \
    --cc=therbert@google.com \
    --cc=torvalds@linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox