From: "Ethy H. Brito" <ethy.brito@inexo.com.br>
To: "xdp-newbies@vger.kernel.org" <xdp-newbies@vger.kernel.org>
Subject: Re: watchdog: BUG: soft lockup
Date: Sun, 1 Aug 2021 21:57:56 -0300 [thread overview]
Message-ID: <20210801215756.65966bd9@babalu> (raw)
In-Reply-To: <20210729010457.680241f7@babalu>
Hi guys.
Hasn't anybody had any experience with these?
Ethy
On Thu, 29 Jul 2021 01:04:57 -0300
"Ethy H. Brito" <ethy.brito@inexo.com.br> wrote:
> Hi everyone.
>
> Is anyone using MQ+HFSC as a class discipline and xdp-cpumap-tc?
>
> Today I tried to put this combination in production and got a freezed box with dozens of the
> messages bellow in syslog (with three of 48 virtualized cores at 100% load).
> I could still ping the box thru a management interface however, but that's about it.
>
> This setup worked nicely in a lab environment, but with no real traffic.
> Just a few successful, single user, iperf3 100mbps upload and download simulations.
>
> As I was debugging a vlan issue earlier (solved elsewhere [1]), I left around about 20 bpf_debug messages at
> xdp_iphash/tc_classify kernel code.
> Could these messages be causing the lockup?
> If not, what could it be?
>
> The packet rate just before launching the script was as low as 100kpps WAN->LAN.
>
> Thanks for your time.
>
> Ethy
>
> [1]
> https://www.spinics.net/lists/xdp-newbies/msg02085.html
>
> -----------------------------------------------8<--------------------------------------------------
>
>
> Jul 28 02:28:38 quiron kernel: [511672.119473] watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [cpumap/7/map:1:1171030]
>
> Jul 28 02:28:38 quiron kernel: [511672.127497] Modules linked in: cls_bpf algif_hash af_alg sch_ingress sch_fq_codel sch_hfsc
> cls_u32 sch_sfq sch_htb xt_connt rack xt_nat xt_tcpudp ip_set_list_set ip_set_hash_netport ip_set_hash_ip ip_set_hash_net
> ip_set 8021q garp mrp veth nf_tables nfnetlink bridge stp llc ip6table_filter ip6table_raw ip6_tables iptable_filter
> iptable_raw iptable_nat nf_nat bpfilter dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua intel_rapl_msr intel_rapl_common
> ipmi_ssif isst_if_common skx_edac nfit x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul
> ghash_clmulni_intel aesni_i ntel crypto_simd cryptd glue_helper mgag200 drm_vram_helper ttm drm_kms_helper dell_smbios
> ipmi_si fb_sys_fops ipmi_devintf mei_me syscopyarea sysfillrect rapl sysimgblt wmi_bmof dcdbas dell_wmi_descriptor mei
> intel_cstate ipmi_msghandler mac_hid acpi_power_meter sch_fq i5500_temp tcp_bbr nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4
> dummy drm ip_tables x_tables autofs4 raid10 raid456 async_raid6_recov
>
> Jul 28 02:28:38 quiron kernel: [511672.127531] async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0
> multipath linear igb i2c_algo_bit ahci i2c_i801 lpc_ich dca crc32_pclmul i40e libahci wmi
>
> Jul 28 02:28:38 quiron kernel: [511672.127541] CPU: 7 PID: 1171030 Comm: cpumap/7/map:1 Not tainted 5.4.0-66-generic
> #74-Ubuntu
>
> Jul 28 02:28:38 quiron kernel: [511672.127541] Hardware name: Dell Inc. PowerEdge R640/0X45NX, BIOS 2.10.2 02/24/2021
>
> Jul 28 02:28:38 quiron kernel: [511672.127548] RIP: 0010:rb_first+0x8/0x20
>
> Jul 28 02:28:38 quiron kernel: [511672.127550] Code: 10 e9 da fe ff ff 4c 89 e9 4c 89 f2 4d 89 ee 49 89 c5 e9 81 fe ff ff 66
> 66 2e 0f 1f 84 00 00 00 00 00 48 8b 07 48 85 c0 74 10 <49> 89 c0 48 8b 40 10 48 85 c0 75 f4 4c 89 c0 c3 45 31 c0 eb f7 0f
>
> Jul 28 02:28:38 quiron kernel: [511672.127551] RSP: 0018:ffffba23c4837928 EFLAGS: 00000286 ORIG_RAX: ffffffffffffff13
>
> Jul 28 02:28:38 quiron kernel: [511672.127552] RAX: ffff9568749e9090 RBX: ffff9566470a1800 RCX: 0000000000000018
>
> Jul 28 02:28:38 quiron kernel: [511672.127553] RDX: 000042e5af1e6e28 RSI: 000a7768f6487f18 RDI: ffff9566470a1c40
>
> Jul 28 02:28:38 quiron kernel: [511672.127553] RBP: ffffba23c4837958 R08: ffff9568749e9090 R09: 0000000000000000
>
> Jul 28 02:28:38 quiron kernel: [511672.127554] R10: 00000000559e0129 R11: 00000074559e0129 R12: 0000000000000000
>
> Jul 28 02:28:38 quiron kernel: [511672.127555] R13: 0000074559e01298 R14: ffff9566470a1c40 R15: ffff9566470a1800
>
> Jul 28 02:28:38 quiron kernel: [511672.127555] FS: 0000000000000000(0000) GS:ffff9568778c0000(0000) knlGS:0000000000000000
>
> Jul 28 02:28:38 quiron kernel: [511672.127556] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>
> Jul 28 02:28:38 quiron kernel: [511672.127557] CR2: 00007fa33136b290 CR3: 000000039680a001 CR4: 00000000007606e0
>
> Jul 28 02:28:38 quiron kernel: [511672.127557] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
>
> Jul 28 02:28:38 quiron kernel: [511672.127558] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
>
> Jul 28 02:28:38 quiron kernel: [511672.127558] PKRU: 55555554
>
> Jul 28 02:28:38 quiron kernel: [511672.127559] Call Trace:
>
> Jul 28 02:28:38 quiron kernel: [511672.127564] ? hfsc_dequeue+0x41/0x380 [sch_hfsc]
>
> Jul 28 02:28:38 quiron kernel: [511672.127569] __qdisc_run+0x94/0x5b0
>
> Jul 28 02:28:38 quiron kernel: [511672.127573] __dev_queue_xmit+0x489/0x8e0
>
> Jul 28 02:28:38 quiron kernel: [511672.127575] dev_queue_xmit+0x10/0x20
>
> Jul 28 02:28:38 quiron kernel: [511672.127578] vlan_dev_hard_start_xmit+0x97/0x120 [8021q]
>
> Jul 28 02:28:38 quiron kernel: [511672.127579] dev_hard_start_xmit+0x91/0x1f0
>
> Jul 28 02:28:38 quiron kernel: [511672.127580] __dev_queue_xmit+0x720/0x8e0
>
> Jul 28 02:28:38 quiron kernel: [511672.127582] ? eth_header+0x2b/0xc0
>
> Jul 28 02:28:38 quiron kernel: [511672.127583] dev_queue_xmit+0x10/0x20
>
> Jul 28 02:28:38 quiron kernel: [511672.127585] neigh_connected_output+0xc1/0xf0
>
> Jul 28 02:28:38 quiron kernel: [511672.127589] ip_finish_output2+0x19b/0x590
>
> Jul 28 02:28:38 quiron kernel: [511672.127590] __ip_finish_output+0xbf/0x1e0
>
> Jul 28 02:28:38 quiron kernel: [511672.127592] ip_finish_output+0x2d/0xb0
>
> Jul 28 02:28:38 quiron kernel: [511672.127593] ip_output+0x75/0xf0
>
> Jul 28 02:28:38 quiron kernel: [511672.127595] ? __ip_finish_output+0x1e0/0x1e0
>
> Jul 28 02:28:38 quiron kernel: [511672.127596] ip_forward_finish+0x58/0x90
>
> Jul 28 02:28:38 quiron kernel: [511672.127597] ip_forward+0x3a7/0x4b0
>
> Jul 28 02:28:38 quiron kernel: [511672.127598] ? ip_frag_queue.cold+0x32/0x32
>
> Jul 28 02:28:38 quiron kernel: [511672.127600] ip_rcv_finish+0x85/0xa0
>
> Jul 28 02:28:38 quiron kernel: [511672.127601] ip_rcv+0xbc/0xd0
>
> Jul 28 02:28:38 quiron kernel: [511672.127602] ? ip_rcv_finish_core.isra.0+0x3c0/0x3c0
>
> Jul 28 02:28:38 quiron kernel: [511672.127603] __netif_receive_skb_one_core+0x88/0xa0
>
> Jul 28 02:28:38 quiron kernel: [511672.127604] netif_receive_skb_core+0x10/0x20
>
> Jul 28 02:28:38 quiron kernel: [511672.127608] cpu_map_kthread_run+0x275/0x540
>
> Jul 28 02:28:38 quiron kernel: [511672.127612] ? __schedule+0x2eb/0x740
>
> Jul 28 02:28:38 quiron kernel: [511672.127616] kthread+0x104/0x140
>
> Jul 28 02:28:38 quiron kernel: [511672.127617] ? __cpu_map_entry_free+0x70/0x70
>
> Jul 28 02:28:38 quiron kernel: [511672.127618] ? kthread_park+0x90/0x90
>
> Jul 28 02:28:38 quiron kernel: [511672.127621] ret_from_fork+0x1f/0x40
>
>
--
Ethy H. Brito /"\
InterNexo Ltda. \ / CAMPANHA DA FITA ASCII - CONTRA MAIL HTML
+55 (12) 3797-6860 X ASCII RIBBON CAMPAIGN - AGAINST HTML MAIL
S.J.Campos - Brasil / \
PGP key: http://www.inexo.com.br/~ethy/0xC3F222A0.asc
next prev parent reply other threads:[~2021-08-02 0:58 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-07-29 4:04 watchdog: BUG: soft lockup Ethy H. Brito
2021-08-02 0:57 ` Ethy H. Brito [this message]
-- strict thread matches above, loose matches on Subject: below --
2024-12-23 6:32 wzs
2025-01-02 16:10 ` Petr Mladek
2025-01-08 17:27 ` Doug Anderson
2025-01-13 8:27 ` wzs
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20210801215756.65966bd9@babalu \
--to=ethy.brito@inexo.com.br \
--cc=xdp-newbies@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.