* [matttbe-net-next:gh-issue-617-stalls-starting-vsock] [sched/mmcid] 5e4b47fd15: BUG:soft_lockup-CPU##stuck_for#s![make:#]
From: kernel test robot @ 2026-03-13 8:11 UTC
To: Matthieu Baerts; +Cc: oe-lkp, lkp, matttbe, oliver.sang
Hello,
kernel test robot noticed "BUG:soft_lockup-CPU##stuck_for#s![make:#]" on:
commit: 5e4b47fd150c2ab74396945eae895ed419505daa ("Revert "sched/mmcid: Switch over to the new mechanism"")
https://git.kernel.org/cgit/linux/kernel/git/matttbe/net-next.git gh-issue-617-stalls-starting-vsock
in testcase: perf-stat-tests
version:
with the following parameters:
config: x86_64-rhel-9.4-bpf
compiler: gcc-14
test machine: 4 threads Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz (Skylake) with 32G memory
(please refer to the attached dmesg/kmsg for the entire log/backtrace)
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add the following tags:
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202603131533.347f08fb-lkp@intel.com
[ 203.398029][ C1] watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [make:6528]
[ 203.398035][ C1] Modules linked in: btrfs intel_rapl_msr intel_rapl_common libblake2b xor zstd_compress raid6_pq x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel platform_profile sd_mod kvm sg snd_hda_intel dell_wmi irqbypass ghash_clmulni_intel mei_wdt snd_soc_avs rfkill dell_smbios rapl dcdbas intel_cstate dell_smm_hwmon sparse_keymap wmi_bmof snd_soc_hda_codec dell_wmi_descriptor snd_hda_ext_core snd_hda_codec snd_hda_core drm_buddy snd_intel_dspcfg snd_intel_sdw_acpi ttm i2c_i801 snd_hwdep intel_uncore drm_display_helper pcspkr i2c_smbus snd_soc_core cec drm_client_lib snd_compress intel_pmc_core drm_kms_helper snd_pcm pmt_telemetry ahci pmt_discovery intel_gtt snd_timer pmt_class agpgart mei_me intel_pmc_ssram_telemetry libahci snd video acpi_pad intel_vsec mei soundcore libata intel_pch_thermal wmi binfmt_misc drm nfnetlink ip_tables x_tables sch_fq_codel
[ 203.398124][ C1] irq event stamp: 44440
[ 203.398125][ C1] hardirqs last enabled at (44439): asm_sysvec_apic_timer_interrupt (arch/x86/include/asm/idtentry.h:569)
[ 203.398133][ C1] hardirqs last disabled at (44440): sysvec_apic_timer_interrupt (arch/x86/include/asm/hardirq.h:81 (discriminator 2) arch/x86/kernel/apic/apic.c:1056 (discriminator 2))
[ 203.398139][ C1] softirqs last enabled at (44380): handle_softirqs (kernel/softirq.c:469 (discriminator 2) kernel/softirq.c:650 (discriminator 2))
[ 203.398156][ C1] softirqs last disabled at (44375): __irq_exit_rcu (kernel/softirq.c:657 kernel/softirq.c:496 kernel/softirq.c:723)
[ 203.398163][ C1] CPU: 1 UID: 0 PID: 6528 Comm: make Tainted: G S I 7.0.0-rc2-00273-g5e4b47fd150c #1 PREEMPT(full)
[ 203.398184][ C1] Tainted: [S]=CPU_OUT_OF_SPEC, [I]=FIRMWARE_WORKAROUND
[ 203.398186][ C1] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1 10/07/2015
[ 203.398188][ C1] RIP: 0010:sched_mm_cid_after_execve (kernel/sched/sched.h:4026 (discriminator 2) kernel/sched/sched.h:4045 (discriminator 2) kernel/sched/sched.h:4061 (discriminator 2) kernel/sched/core.c:10741 (discriminator 2))
[ 203.398194][ C1] Code: 1f 49 8d 94 47 40 0b 00 00 48 89 c8 48 c1 f8 06 48 89 55 c8 48 8d 3c c2 e8 27 90 5a 00 48 8b 55 c8 48 8b 4d b8 f0 48 0f ab 0a <72> 42 48 8b 7d d0 e8 0f 7a 5a 00 45 89 ac 24 30 1b 00 00 49 8d bc
All code
========
0: 1f (bad)
1: 49 8d 94 47 40 0b 00 lea 0xb40(%r15,%rax,2),%rdx
8: 00
9: 48 89 c8 mov %rcx,%rax
c: 48 c1 f8 06 sar $0x6,%rax
10: 48 89 55 c8 mov %rdx,-0x38(%rbp)
14: 48 8d 3c c2 lea (%rdx,%rax,8),%rdi
18: e8 27 90 5a 00 call 0x5a9044
1d: 48 8b 55 c8 mov -0x38(%rbp),%rdx
21: 48 8b 4d b8 mov -0x48(%rbp),%rcx
25: f0 48 0f ab 0a lock bts %rcx,(%rdx)
2a:* 72 42 jb 0x6e <-- trapping instruction
2c: 48 8b 7d d0 mov -0x30(%rbp),%rdi
30: e8 0f 7a 5a 00 call 0x5a7a44
35: 45 89 ac 24 30 1b 00 mov %r13d,0x1b30(%r12)
3c: 00
3d: 49 rex.WB
3e: 8d .byte 0x8d
3f: bc .byte 0xbc
Code starting with the faulting instruction
===========================================
0: 72 42 jb 0x44
2: 48 8b 7d d0 mov -0x30(%rbp),%rdi
6: e8 0f 7a 5a 00 call 0x5a7a1a
b: 45 89 ac 24 30 1b 00 mov %r13d,0x1b30(%r12)
12: 00
13: 49 rex.WB
14: 8d .byte 0x8d
15: bc .byte 0xbc
[ 203.398197][ C1] RSP: 0018:ffff888805237ad0 EFLAGS: 00000247
[ 203.398201][ C1] RAX: 0000000000000001 RBX: ffff8888052a6000 RCX: 0000000000000000
[ 203.398204][ C1] RDX: ffff8888052a6b50 RSI: 0000000000000008 RDI: ffff8888052a6b50
[ 203.398206][ C1] RBP: ffff888805237b18 R08: 0000000000000001 R09: ffffed1100a54d6a
[ 203.398209][ C1] R10: ffff8888052a6b57 R11: 0000000000000000 R12: ffff888805240000
[ 203.398224][ C1] R13: 0000000000000000 R14: 0000000000000001 R15: ffff8888052a6000
[ 203.398227][ C1] FS: 0000000000000000(0000) GS:ffff888815bbd000(0000) knlGS:0000000000000000
[ 203.398230][ C1] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 203.398232][ C1] CR2: 00007fff149c5e49 CR3: 000000084d6a2005 CR4: 00000000003726f0
[ 203.398235][ C1] Call Trace:
[ 203.398237][ C1] <TASK>
[ 203.398244][ C1] bprm_execve (fs/exec.c:383 fs/exec.c:554)
[ 203.398252][ C1] do_execveat_common (fs/exec.c:1846)
[ 203.398261][ C1] __x64_sys_execve (include/linux/fs.h:2539 fs/exec.c:1929 fs/exec.c:1924 fs/exec.c:1924)
[ 203.398267][ C1] do_syscall_64 (arch/x86/entry/syscall_64.c:63 arch/x86/entry/syscall_64.c:94)
[ 203.398273][ C1] ? _raw_spin_unlock_irq (include/linux/spinlock_api_smp.h:188 (discriminator 3) kernel/locking/spinlock.c:202 (discriminator 3))
[ 203.398277][ C1] ? __x64_sys_rt_sigprocmask (kernel/signal.c:3338 kernel/signal.c:3316 kernel/signal.c:3316)
[ 203.398283][ C1] ? __pfx___x64_sys_rt_sigprocmask (kernel/signal.c:3316)
[ 203.398293][ C1] ? lock_acquire (include/trace/events/lock.h:24 (discriminator 15) include/trace/events/lock.h:24 (discriminator 15) kernel/locking/lockdep.c:5831 (discriminator 15))
[ 203.398299][ C1] ? do_syscall_64 (include/linux/irq-entry-common.h:298 include/linux/entry-common.h:327 arch/x86/entry/syscall_64.c:100)
[ 203.398305][ C1] ? trace_hardirqs_on_prepare (kernel/trace/trace_preemptirq.c:64 (discriminator 4) kernel/trace/trace_preemptirq.c:59 (discriminator 4))
[ 203.398311][ C1] ? do_syscall_64 (arch/x86/entry/syscall_64.c:113)
[ 203.398315][ C1] ? __lock_release+0x5d/0x1b0
[ 203.398321][ C1] ? __might_fault (mm/memory.c:7217 (discriminator 1))
[ 203.398326][ C1] ? lock_release (kernel/locking/lockdep.c:470 (discriminator 4) kernel/locking/lockdep.c:5891 (discriminator 4) kernel/locking/lockdep.c:5875 (discriminator 4))
[ 203.398332][ C1] ? _copy_to_user (arch/x86/include/asm/uaccess_64.h:126 arch/x86/include/asm/uaccess_64.h:147 include/linux/uaccess.h:206 lib/usercopy.c:26)
[ 203.398352][ C1] ? __x64_sys_rt_sigprocmask (kernel/signal.c:3339 (discriminator 1) kernel/signal.c:3316 (discriminator 1) kernel/signal.c:3316 (discriminator 1))
[ 203.398357][ C1] ? __pfx___x64_sys_rt_sigprocmask (kernel/signal.c:3316)
[ 203.398378][ C1] ? rcu_is_watching (arch/x86/include/asm/atomic.h:23 include/linux/atomic/atomic-arch-fallback.h:457 include/linux/context_tracking.h:128 kernel/rcu/tree.c:752)
[ 203.398398][ C1] ? find_held_lock (kernel/locking/lockdep.c:5350 (discriminator 1))
[ 203.398403][ C1] ? do_syscall_64 (include/linux/irq-entry-common.h:298 include/linux/entry-common.h:327 arch/x86/entry/syscall_64.c:100)
[ 203.398409][ C1] ? trace_hardirqs_on_prepare (kernel/trace/trace_preemptirq.c:64 (discriminator 4) kernel/trace/trace_preemptirq.c:59 (discriminator 4))
[ 203.398415][ C1] ? do_syscall_64 (arch/x86/entry/syscall_64.c:113)
[ 203.398419][ C1] ? find_held_lock (kernel/locking/lockdep.c:5350 (discriminator 1))
[ 203.398438][ C1] ? exc_page_fault (arch/x86/include/asm/irqflags.h:26 arch/x86/include/asm/irqflags.h:109 arch/x86/include/asm/irqflags.h:151 arch/x86/mm/fault.c:1480 arch/x86/mm/fault.c:1527)
[ 203.398441][ C1] ? __lock_release+0x5d/0x1b0
[ 203.398446][ C1] ? handle_mm_fault (mm/memory.c:6509 (discriminator 1) mm/memory.c:6649 (discriminator 1))
[ 203.398451][ C1] ? exc_page_fault (arch/x86/include/asm/irqflags.h:26 arch/x86/include/asm/irqflags.h:109 arch/x86/include/asm/irqflags.h:151 arch/x86/mm/fault.c:1480 arch/x86/mm/fault.c:1527)
[ 203.398456][ C1] ? lock_release (kernel/locking/lockdep.c:470 (discriminator 4) kernel/locking/lockdep.c:5891 (discriminator 4) kernel/locking/lockdep.c:5875 (discriminator 4))
[ 203.398461][ C1] ? irqentry_exit (include/linux/irq-entry-common.h:298 include/linux/irq-entry-common.h:341 kernel/entry/common.c:219)
[ 203.398467][ C1] ? trace_hardirqs_on_prepare (kernel/trace/trace_preemptirq.c:64 (discriminator 4) kernel/trace/trace_preemptirq.c:59 (discriminator 4))
[ 203.398472][ C1] ? lockdep_hardirqs_on_prepare (kernel/locking/lockdep.c:4629 (discriminator 4))
[ 203.398477][ C1] ? irqentry_exit (arch/x86/include/asm/jump_label.h:37 include/linux/context_tracking_state.h:138 include/linux/context_tracking.h:41 include/linux/irq-entry-common.h:301 include/linux/irq-entry-common.h:341 kernel/entry/common.c:219)
[ 203.398497][ C1] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:130)
[ 203.398501][ C1] RIP: 0033:0x7fd5f465b280
[ 203.398523][ C1] Code: Unable to access opcode bytes at 0x7fd5f465b256.
[ 203.398525][ C1] RSP: 002b:00007fff149c5880 EFLAGS: 00000202 ORIG_RAX: 000000000000003b
[ 203.398529][ C1] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 0000000000000000
[ 203.398531][ C1] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 203.398533][ C1] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 203.398535][ C1] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[ 203.398537][ C1] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 203.398563][ C1] </TASK>
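As a reading aid for the register dump above (not part of the original report): the saved RIP points at the `jb` that follows `lock bts %rcx,(%rdx)`, so the saved EFLAGS value 00000247 reflects the result of that bts. The Python sketch below decodes the flag bits and models how bts sets the carry flag. The helper names `decode_eflags` and `bts` are made up for illustration and are not kernel functions.

```python
# Architectural x86 EFLAGS bit positions (status flags plus IF and DF).
EFLAGS_BITS = {"CF": 0, "PF": 2, "AF": 4, "ZF": 6, "SF": 7,
               "IF": 9, "DF": 10, "OF": 11}

def decode_eflags(value):
    """Return the set of flag names that are set in an EFLAGS value."""
    return {name for name, bit in EFLAGS_BITS.items() if (value >> bit) & 1}

def bts(word, bit):
    """Toy model of x86 `bts`: set `bit` in `word`.

    Returns (new_word, carry), where carry is the previous value of the bit,
    which is what the CPU places in CF.
    """
    carry = (word >> bit) & 1
    return word | (1 << bit), carry

# EFLAGS value taken from the first register dump in the report.
flags = decode_eflags(0x247)
print(sorted(flags))          # ['CF', 'IF', 'PF', 'ZF']

# With RCX = 0, bts targets bit 0. CF = 1 means the bit was already set.
print(bts(0b1, 0))            # (1, 1): bit already set, jb would be taken
print(bts(0b0, 0))            # (1, 0): bit was clear, jb would fall through
```

With CF set, the `jb` is taken, which is consistent with bit 0 (RCX = 0) at the address in RDX already being set when this sample was captured. The sketch says nothing about why the bit remains set.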
[ 203.398565][ C1] Kernel panic - not syncing: softlockup: hung tasks
[ 203.895548][ C1] CPU: 1 UID: 0 PID: 6528 Comm: make Tainted: G S I L 7.0.0-rc2-00273-g5e4b47fd150c #1 PREEMPT(full)
[ 203.907621][ C1] Tainted: [S]=CPU_OUT_OF_SPEC, [I]=FIRMWARE_WORKAROUND, [L]=SOFTLOCKUP
[ 203.915919][ C1] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1 10/07/2015
[ 203.924142][ C1] Call Trace:
[ 203.927472][ C1] <IRQ>
[ 203.930315][ C1] dump_stack_lvl (lib/dump_stack.c:122)
[ 203.934808][ C1] vpanic (kernel/panic.c:651)
[ 203.938758][ C1] panic (kernel/panic.c:787)
[ 203.942483][ C1] ? __pfx_panic (kernel/panic.c:783)
[ 203.946911][ C1] watchdog_timer_fn.cold (kernel/watchdog.c:869)
[ 203.952198][ C1] ? __pfx_watchdog_timer_fn (kernel/watchdog.c:774)
[ 203.957632][ C1] __hrtimer_run_queues (kernel/time/hrtimer.c:1785 kernel/time/hrtimer.c:1849)
[ 203.962805][ C1] ? __pfx___hrtimer_run_queues (kernel/time/hrtimer.c:1819)
[ 203.968564][ C1] ? lock_release (kernel/locking/lockdep.c:470 (discriminator 4) kernel/locking/lockdep.c:5891 (discriminator 4) kernel/locking/lockdep.c:5875 (discriminator 4))
[ 203.973119][ C1] ? __pfx_read_tsc (arch/x86/include/asm/tsc.h:57 arch/x86/kernel/tsc.c:1143)
[ 203.977760][ C1] ? ktime_get_update_offsets_now (kernel/time/timekeeping.c:2576)
[ 203.983802][ C1] hrtimer_interrupt (kernel/time/hrtimer.c:1914)
[ 203.988708][ C1] __sysvec_apic_timer_interrupt (arch/x86/include/asm/jump_label.h:37 arch/x86/include/asm/trace/irq_vectors.h:40 arch/x86/kernel/apic/apic.c:1063)
[ 203.994589][ C1] sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1056 (discriminator 35) arch/x86/kernel/apic/apic.c:1056 (discriminator 35))
[ 204.000223][ C1] </IRQ>
[ 204.003142][ C1] <TASK>
[ 204.006064][ C1] asm_sysvec_apic_timer_interrupt (arch/x86/include/asm/idtentry.h:569)
[ 204.012017][ C1] RIP: 0010:sched_mm_cid_after_execve (kernel/sched/sched.h:4026 (discriminator 2) kernel/sched/sched.h:4045 (discriminator 2) kernel/sched/sched.h:4061 (discriminator 2) kernel/sched/core.c:10741 (discriminator 2))
[ 204.018447][ C1] Code: 1f 49 8d 94 47 40 0b 00 00 48 89 c8 48 c1 f8 06 48 89 55 c8 48 8d 3c c2 e8 27 90 5a 00 48 8b 55 c8 48 8b 4d b8 f0 48 0f ab 0a <72> 42 48 8b 7d d0 e8 0f 7a 5a 00 45 89 ac 24 30 1b 00 00 49 8d bc
All code
========
0: 1f (bad)
1: 49 8d 94 47 40 0b 00 lea 0xb40(%r15,%rax,2),%rdx
8: 00
9: 48 89 c8 mov %rcx,%rax
c: 48 c1 f8 06 sar $0x6,%rax
10: 48 89 55 c8 mov %rdx,-0x38(%rbp)
14: 48 8d 3c c2 lea (%rdx,%rax,8),%rdi
18: e8 27 90 5a 00 call 0x5a9044
1d: 48 8b 55 c8 mov -0x38(%rbp),%rdx
21: 48 8b 4d b8 mov -0x48(%rbp),%rcx
25: f0 48 0f ab 0a lock bts %rcx,(%rdx)
2a:* 72 42 jb 0x6e <-- trapping instruction
2c: 48 8b 7d d0 mov -0x30(%rbp),%rdi
30: e8 0f 7a 5a 00 call 0x5a7a44
35: 45 89 ac 24 30 1b 00 mov %r13d,0x1b30(%r12)
3c: 00
3d: 49 rex.WB
3e: 8d .byte 0x8d
3f: bc .byte 0xbc
Code starting with the faulting instruction
===========================================
0: 72 42 jb 0x44
2: 48 8b 7d d0 mov -0x30(%rbp),%rdi
6: e8 0f 7a 5a 00 call 0x5a7a1a
b: 45 89 ac 24 30 1b 00 mov %r13d,0x1b30(%r12)
12: 00
13: 49 rex.WB
14: 8d .byte 0x8d
15: bc .byte 0xbc
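A note on reading the two listings above: the same bytes `72 42` decode as `jb 0x6e` under "All code" and as `jb 0x44` under "Code starting with the faulting instruction". The printed targets are relative to the start of each listing, not real kernel addresses. A small Python sketch of that arithmetic, with hypothetical helper names (`rel8_target`, `rel32_target`):

```python
def rel8_target(offset, insn_bytes):
    """Target of a 2-byte short jump: opcode + signed 8-bit displacement."""
    disp = int.from_bytes(insn_bytes[1:2], "little", signed=True)
    return offset + len(insn_bytes) + disp

def rel32_target(offset, insn_bytes):
    """Target of a 5-byte near call: e8 + signed 32-bit little-endian displacement."""
    disp = int.from_bytes(insn_bytes[1:5], "little", signed=True)
    return offset + len(insn_bytes) + disp

jb = bytes.fromhex("7242")          # the trapping instruction
call = bytes.fromhex("e80f7a5a00")  # the call that follows it

# Same bytes, different listing offsets, different printed targets.
print(hex(rel8_target(0x2a, jb)), hex(rel8_target(0x0, jb)))        # 0x6e 0x44
print(hex(rel32_target(0x30, call)), hex(rel32_target(0x6, call)))  # 0x5a7a44 0x5a7a1a
```

The targets are therefore offsets within the dumped byte window, so the callee of each `call` cannot be identified from them alone.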
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20260313/202603131533.347f08fb-lkp@intel.com
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
* Re: [matttbe-net-next:gh-issue-617-stalls-starting-vsock] [sched/mmcid] 5e4b47fd15: BUG:soft_lockup-CPU##stuck_for#s![make:#]
From: Matthieu Baerts @ 2026-03-13 8:51 UTC
To: kernel test robot; +Cc: oe-lkp, lkp
Hello,
On 13/03/2026 09:11, kernel test robot wrote:
> kernel test robot noticed "BUG:soft_lockup-CPU##stuck_for#s![make:#]" on:
>
> commit: 5e4b47fd150c2ab74396945eae895ed419505daa ("Revert "sched/mmcid: Switch over to the new mechanism"")
> https://git.kernel.org/cgit/linux/kernel/git/matttbe/net-next.git gh-issue-617-stalls-starting-vsock
Thank you for the bug report. A newer version exists without this issue:
https://lore.kernel.org/all/20260310201009.257617049@kernel.org
It has been applied in tip: sched/urgent.
Cheers,
Matt
--
Sponsored by the NGI0 Core fund.