public inbox for oe-lkp@lists.linux.dev
* [matttbe-net-next:gh-issue-617-stalls-starting-vsock] [sched/mmcid] 5e4b47fd15: BUG:soft_lockup-CPU##stuck_for#s![make:#]
From: kernel test robot @ 2026-03-13  8:11 UTC
  To: Matthieu Baerts; +Cc: oe-lkp, lkp, matttbe, oliver.sang



Hello,

kernel test robot noticed "BUG:soft_lockup-CPU##stuck_for#s![make:#]" on:

commit: 5e4b47fd150c2ab74396945eae895ed419505daa ("Revert "sched/mmcid: Switch over to the new mechanism"")
https://git.kernel.org/cgit/linux/kernel/git/matttbe/net-next.git gh-issue-617-stalls-starting-vsock

in testcase: perf-stat-tests
version: 
with following parameters:


config: x86_64-rhel-9.4-bpf
compiler: gcc-14
test machine: 4 threads Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz (Skylake) with 32G memory

(please refer to attached dmesg/kmsg for entire log/backtrace)



If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add the following tags:
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202603131533.347f08fb-lkp@intel.com



[  203.398029][    C1] watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [make:6528]
[  203.398035][    C1] Modules linked in: btrfs intel_rapl_msr intel_rapl_common libblake2b xor zstd_compress raid6_pq x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel platform_profile sd_mod kvm sg snd_hda_intel dell_wmi irqbypass ghash_clmulni_intel mei_wdt snd_soc_avs rfkill dell_smbios rapl dcdbas intel_cstate dell_smm_hwmon sparse_keymap wmi_bmof snd_soc_hda_codec dell_wmi_descriptor snd_hda_ext_core snd_hda_codec snd_hda_core drm_buddy snd_intel_dspcfg snd_intel_sdw_acpi ttm i2c_i801 snd_hwdep intel_uncore drm_display_helper pcspkr i2c_smbus snd_soc_core cec drm_client_lib snd_compress intel_pmc_core drm_kms_helper snd_pcm pmt_telemetry ahci pmt_discovery intel_gtt snd_timer pmt_class agpgart mei_me intel_pmc_ssram_telemetry libahci snd video acpi_pad intel_vsec mei soundcore libata intel_pch_thermal wmi binfmt_misc drm nfnetlink ip_tables x_tables sch_fq_codel
[  203.398124][    C1] irq event stamp: 44440
[  203.398125][    C1] hardirqs last  enabled at (44439): asm_sysvec_apic_timer_interrupt (arch/x86/include/asm/idtentry.h:569)
[  203.398133][    C1] hardirqs last disabled at (44440): sysvec_apic_timer_interrupt (arch/x86/include/asm/hardirq.h:81 (discriminator 2) arch/x86/kernel/apic/apic.c:1056 (discriminator 2))
[  203.398139][    C1] softirqs last  enabled at (44380): handle_softirqs (kernel/softirq.c:469 (discriminator 2) kernel/softirq.c:650 (discriminator 2))
[  203.398156][    C1] softirqs last disabled at (44375): __irq_exit_rcu (kernel/softirq.c:657 kernel/softirq.c:496 kernel/softirq.c:723)
[  203.398163][    C1] CPU: 1 UID: 0 PID: 6528 Comm: make Tainted: G S        I         7.0.0-rc2-00273-g5e4b47fd150c #1 PREEMPT(full)
[  203.398184][    C1] Tainted: [S]=CPU_OUT_OF_SPEC, [I]=FIRMWARE_WORKAROUND
[  203.398186][    C1] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1 10/07/2015
[  203.398188][    C1] RIP: 0010:sched_mm_cid_after_execve (kernel/sched/sched.h:4026 (discriminator 2) kernel/sched/sched.h:4045 (discriminator 2) kernel/sched/sched.h:4061 (discriminator 2) kernel/sched/core.c:10741 (discriminator 2))
[  203.398194][    C1] Code: 1f 49 8d 94 47 40 0b 00 00 48 89 c8 48 c1 f8 06 48 89 55 c8 48 8d 3c c2 e8 27 90 5a 00 48 8b 55 c8 48 8b 4d b8 f0 48 0f ab 0a <72> 42 48 8b 7d d0 e8 0f 7a 5a 00 45 89 ac 24 30 1b 00 00 49 8d bc
All code
========
   0:	1f                   	(bad)
   1:	49 8d 94 47 40 0b 00 	lea    0xb40(%r15,%rax,2),%rdx
   8:	00 
   9:	48 89 c8             	mov    %rcx,%rax
   c:	48 c1 f8 06          	sar    $0x6,%rax
  10:	48 89 55 c8          	mov    %rdx,-0x38(%rbp)
  14:	48 8d 3c c2          	lea    (%rdx,%rax,8),%rdi
  18:	e8 27 90 5a 00       	call   0x5a9044
  1d:	48 8b 55 c8          	mov    -0x38(%rbp),%rdx
  21:	48 8b 4d b8          	mov    -0x48(%rbp),%rcx
  25:	f0 48 0f ab 0a       	lock bts %rcx,(%rdx)
  2a:*	72 42                	jb     0x6e		<-- trapping instruction
  2c:	48 8b 7d d0          	mov    -0x30(%rbp),%rdi
  30:	e8 0f 7a 5a 00       	call   0x5a7a44
  35:	45 89 ac 24 30 1b 00 	mov    %r13d,0x1b30(%r12)
  3c:	00 
  3d:	49                   	rex.WB
  3e:	8d                   	.byte 0x8d
  3f:	bc                   	.byte 0xbc

Code starting with the faulting instruction
===========================================
   0:	72 42                	jb     0x44
   2:	48 8b 7d d0          	mov    -0x30(%rbp),%rdi
   6:	e8 0f 7a 5a 00       	call   0x5a7a1a
   b:	45 89 ac 24 30 1b 00 	mov    %r13d,0x1b30(%r12)
  12:	00 
  13:	49                   	rex.WB
  14:	8d                   	.byte 0x8d
  15:	bc                   	.byte 0xbc
[  203.398197][    C1] RSP: 0018:ffff888805237ad0 EFLAGS: 00000247
[  203.398201][    C1] RAX: 0000000000000001 RBX: ffff8888052a6000 RCX: 0000000000000000
[  203.398204][    C1] RDX: ffff8888052a6b50 RSI: 0000000000000008 RDI: ffff8888052a6b50
[  203.398206][    C1] RBP: ffff888805237b18 R08: 0000000000000001 R09: ffffed1100a54d6a
[  203.398209][    C1] R10: ffff8888052a6b57 R11: 0000000000000000 R12: ffff888805240000
[  203.398224][    C1] R13: 0000000000000000 R14: 0000000000000001 R15: ffff8888052a6000
[  203.398227][    C1] FS:  0000000000000000(0000) GS:ffff888815bbd000(0000) knlGS:0000000000000000
[  203.398230][    C1] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  203.398232][    C1] CR2: 00007fff149c5e49 CR3: 000000084d6a2005 CR4: 00000000003726f0
[  203.398235][    C1] Call Trace:
[  203.398237][    C1]  <TASK>
[  203.398244][    C1]  bprm_execve (fs/exec.c:383 fs/exec.c:554)
[  203.398252][    C1]  do_execveat_common (fs/exec.c:1846)
[  203.398261][    C1]  __x64_sys_execve (include/linux/fs.h:2539 fs/exec.c:1929 fs/exec.c:1924 fs/exec.c:1924)
[  203.398267][    C1]  do_syscall_64 (arch/x86/entry/syscall_64.c:63 arch/x86/entry/syscall_64.c:94)
[  203.398273][    C1]  ? _raw_spin_unlock_irq (include/linux/spinlock_api_smp.h:188 (discriminator 3) kernel/locking/spinlock.c:202 (discriminator 3))
[  203.398277][    C1]  ? __x64_sys_rt_sigprocmask (kernel/signal.c:3338 kernel/signal.c:3316 kernel/signal.c:3316)
[  203.398283][    C1]  ? __pfx___x64_sys_rt_sigprocmask (kernel/signal.c:3316)
[  203.398293][    C1]  ? lock_acquire (include/trace/events/lock.h:24 (discriminator 15) include/trace/events/lock.h:24 (discriminator 15) kernel/locking/lockdep.c:5831 (discriminator 15))
[  203.398299][    C1]  ? do_syscall_64 (include/linux/irq-entry-common.h:298 include/linux/entry-common.h:327 arch/x86/entry/syscall_64.c:100)
[  203.398305][    C1]  ? trace_hardirqs_on_prepare (kernel/trace/trace_preemptirq.c:64 (discriminator 4) kernel/trace/trace_preemptirq.c:59 (discriminator 4))
[  203.398311][    C1]  ? do_syscall_64 (arch/x86/entry/syscall_64.c:113)
[  203.398315][    C1]  ? __lock_release+0x5d/0x1b0
[  203.398321][    C1]  ? __might_fault (mm/memory.c:7217 (discriminator 1))
[  203.398326][    C1]  ? lock_release (kernel/locking/lockdep.c:470 (discriminator 4) kernel/locking/lockdep.c:5891 (discriminator 4) kernel/locking/lockdep.c:5875 (discriminator 4))
[  203.398332][    C1]  ? _copy_to_user (arch/x86/include/asm/uaccess_64.h:126 arch/x86/include/asm/uaccess_64.h:147 include/linux/uaccess.h:206 lib/usercopy.c:26)
[  203.398352][    C1]  ? __x64_sys_rt_sigprocmask (kernel/signal.c:3339 (discriminator 1) kernel/signal.c:3316 (discriminator 1) kernel/signal.c:3316 (discriminator 1))
[  203.398357][    C1]  ? __pfx___x64_sys_rt_sigprocmask (kernel/signal.c:3316)
[  203.398378][    C1]  ? rcu_is_watching (arch/x86/include/asm/atomic.h:23 include/linux/atomic/atomic-arch-fallback.h:457 include/linux/context_tracking.h:128 kernel/rcu/tree.c:752)
[  203.398398][    C1]  ? find_held_lock (kernel/locking/lockdep.c:5350 (discriminator 1))
[  203.398403][    C1]  ? do_syscall_64 (include/linux/irq-entry-common.h:298 include/linux/entry-common.h:327 arch/x86/entry/syscall_64.c:100)
[  203.398409][    C1]  ? trace_hardirqs_on_prepare (kernel/trace/trace_preemptirq.c:64 (discriminator 4) kernel/trace/trace_preemptirq.c:59 (discriminator 4))
[  203.398415][    C1]  ? do_syscall_64 (arch/x86/entry/syscall_64.c:113)
[  203.398419][    C1]  ? find_held_lock (kernel/locking/lockdep.c:5350 (discriminator 1))
[  203.398438][    C1]  ? exc_page_fault (arch/x86/include/asm/irqflags.h:26 arch/x86/include/asm/irqflags.h:109 arch/x86/include/asm/irqflags.h:151 arch/x86/mm/fault.c:1480 arch/x86/mm/fault.c:1527)
[  203.398441][    C1]  ? __lock_release+0x5d/0x1b0
[  203.398446][    C1]  ? handle_mm_fault (mm/memory.c:6509 (discriminator 1) mm/memory.c:6649 (discriminator 1))
[  203.398451][    C1]  ? exc_page_fault (arch/x86/include/asm/irqflags.h:26 arch/x86/include/asm/irqflags.h:109 arch/x86/include/asm/irqflags.h:151 arch/x86/mm/fault.c:1480 arch/x86/mm/fault.c:1527)
[  203.398456][    C1]  ? lock_release (kernel/locking/lockdep.c:470 (discriminator 4) kernel/locking/lockdep.c:5891 (discriminator 4) kernel/locking/lockdep.c:5875 (discriminator 4))
[  203.398461][    C1]  ? irqentry_exit (include/linux/irq-entry-common.h:298 include/linux/irq-entry-common.h:341 kernel/entry/common.c:219)
[  203.398467][    C1]  ? trace_hardirqs_on_prepare (kernel/trace/trace_preemptirq.c:64 (discriminator 4) kernel/trace/trace_preemptirq.c:59 (discriminator 4))
[  203.398472][    C1]  ? lockdep_hardirqs_on_prepare (kernel/locking/lockdep.c:4629 (discriminator 4))
[  203.398477][    C1]  ? irqentry_exit (arch/x86/include/asm/jump_label.h:37 include/linux/context_tracking_state.h:138 include/linux/context_tracking.h:41 include/linux/irq-entry-common.h:301 include/linux/irq-entry-common.h:341 kernel/entry/common.c:219)
[  203.398497][    C1]  entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:130)
[  203.398501][    C1] RIP: 0033:0x7fd5f465b280
[  203.398523][    C1] Code: Unable to access opcode bytes at 0x7fd5f465b256.

[  203.398525][    C1] RSP: 002b:00007fff149c5880 EFLAGS: 00000202 ORIG_RAX: 000000000000003b
[  203.398529][    C1] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 0000000000000000
[  203.398531][    C1] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  203.398533][    C1] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[  203.398535][    C1] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[  203.398537][    C1] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[  203.398563][    C1]  </TASK>
[  203.398565][    C1] Kernel panic - not syncing: softlockup: hung tasks
[  203.895548][    C1] CPU: 1 UID: 0 PID: 6528 Comm: make Tainted: G S        I  L      7.0.0-rc2-00273-g5e4b47fd150c #1 PREEMPT(full)
[  203.907621][    C1] Tainted: [S]=CPU_OUT_OF_SPEC, [I]=FIRMWARE_WORKAROUND, [L]=SOFTLOCKUP
[  203.915919][    C1] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1 10/07/2015
[  203.924142][    C1] Call Trace:
[  203.927472][    C1]  <IRQ>
[  203.930315][    C1]  dump_stack_lvl (lib/dump_stack.c:122)
[  203.934808][    C1]  vpanic (kernel/panic.c:651)
[  203.938758][    C1]  panic (kernel/panic.c:787)
[  203.942483][    C1]  ? __pfx_panic (kernel/panic.c:783)
[  203.946911][    C1]  watchdog_timer_fn.cold (kernel/watchdog.c:869)
[  203.952198][    C1]  ? __pfx_watchdog_timer_fn (kernel/watchdog.c:774)
[  203.957632][    C1]  __hrtimer_run_queues (kernel/time/hrtimer.c:1785 kernel/time/hrtimer.c:1849)
[  203.962805][    C1]  ? __pfx___hrtimer_run_queues (kernel/time/hrtimer.c:1819)
[  203.968564][    C1]  ? lock_release (kernel/locking/lockdep.c:470 (discriminator 4) kernel/locking/lockdep.c:5891 (discriminator 4) kernel/locking/lockdep.c:5875 (discriminator 4))
[  203.973119][    C1]  ? __pfx_read_tsc (arch/x86/include/asm/tsc.h:57 arch/x86/kernel/tsc.c:1143)
[  203.977760][    C1]  ? ktime_get_update_offsets_now (kernel/time/timekeeping.c:2576)
[  203.983802][    C1]  hrtimer_interrupt (kernel/time/hrtimer.c:1914)
[  203.988708][    C1]  __sysvec_apic_timer_interrupt (arch/x86/include/asm/jump_label.h:37 arch/x86/include/asm/trace/irq_vectors.h:40 arch/x86/kernel/apic/apic.c:1063)
[  203.994589][    C1]  sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1056 (discriminator 35) arch/x86/kernel/apic/apic.c:1056 (discriminator 35))
[  204.000223][    C1]  </IRQ>
[  204.003142][    C1]  <TASK>
[  204.006064][    C1]  asm_sysvec_apic_timer_interrupt (arch/x86/include/asm/idtentry.h:569)
[  204.012017][    C1] RIP: 0010:sched_mm_cid_after_execve (kernel/sched/sched.h:4026 (discriminator 2) kernel/sched/sched.h:4045 (discriminator 2) kernel/sched/sched.h:4061 (discriminator 2) kernel/sched/core.c:10741 (discriminator 2))
[  204.018447][    C1] Code: 1f 49 8d 94 47 40 0b 00 00 48 89 c8 48 c1 f8 06 48 89 55 c8 48 8d 3c c2 e8 27 90 5a 00 48 8b 55 c8 48 8b 4d b8 f0 48 0f ab 0a <72> 42 48 8b 7d d0 e8 0f 7a 5a 00 45 89 ac 24 30 1b 00 00 49 8d bc
All code
========
   0:	1f                   	(bad)
   1:	49 8d 94 47 40 0b 00 	lea    0xb40(%r15,%rax,2),%rdx
   8:	00 
   9:	48 89 c8             	mov    %rcx,%rax
   c:	48 c1 f8 06          	sar    $0x6,%rax
  10:	48 89 55 c8          	mov    %rdx,-0x38(%rbp)
  14:	48 8d 3c c2          	lea    (%rdx,%rax,8),%rdi
  18:	e8 27 90 5a 00       	call   0x5a9044
  1d:	48 8b 55 c8          	mov    -0x38(%rbp),%rdx
  21:	48 8b 4d b8          	mov    -0x48(%rbp),%rcx
  25:	f0 48 0f ab 0a       	lock bts %rcx,(%rdx)
  2a:*	72 42                	jb     0x6e		<-- trapping instruction
  2c:	48 8b 7d d0          	mov    -0x30(%rbp),%rdi
  30:	e8 0f 7a 5a 00       	call   0x5a7a44
  35:	45 89 ac 24 30 1b 00 	mov    %r13d,0x1b30(%r12)
  3c:	00 
  3d:	49                   	rex.WB
  3e:	8d                   	.byte 0x8d
  3f:	bc                   	.byte 0xbc

Code starting with the faulting instruction
===========================================
   0:	72 42                	jb     0x44
   2:	48 8b 7d d0          	mov    -0x30(%rbp),%rdi
   6:	e8 0f 7a 5a 00       	call   0x5a7a1a
   b:	45 89 ac 24 30 1b 00 	mov    %r13d,0x1b30(%r12)
  12:	00 
  13:	49                   	rex.WB
  14:	8d                   	.byte 0x8d
  15:	bc                   	.byte 0xbc
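
The decoded bytes above show a `lock bts %rcx,(%rdx)` immediately followed by the interrupted `jb`, with RCX = 0 and bit 0 (the carry flag) set in the saved EFLAGS value 00000247. One plausible reading, not confirmed anywhere in this thread, is that an atomic test-and-set on a bitmap keeps finding the bit already set and the caller keeps retrying. Below is a minimal userspace sketch of that pattern with C11 atomics. It is not the kernel's code: every `demo_` name is hypothetical, and the `max_rounds` bound is an assumption added only so that the sketch terminates.

```c
#include <assert.h>
#include <limits.h>
#include <stdatomic.h>
#include <stdbool.h>

#define DEMO_BITS_PER_WORD (sizeof(unsigned long) * CHAR_BIT)

/*
 * Hypothetical stand-in for an atomic test-and-set on a bitmap.
 * Returns the previous value of the bit, which is what the carry
 * flag holds after `lock bts`. With 64-bit words, nr / 64 matches
 * the `sar $0x6` word-index computation seen in the disassembly.
 */
static bool demo_test_and_set_bit(unsigned long nr, _Atomic unsigned long *map)
{
	unsigned long mask = 1UL << (nr % DEMO_BITS_PER_WORD);
	unsigned long old = atomic_fetch_or(&map[nr / DEMO_BITS_PER_WORD], mask);

	return (old & mask) != 0;
}

/*
 * Try to claim any free bit in [0, nbits). Each round scans the whole
 * range; the scan is retried at most max_rounds times. Returns the
 * claimed bit number, or -1 if every attempt found the bit already set.
 * Without the max_rounds bound (an assumption of this sketch), a map
 * that never gains a free bit would keep the caller spinning forever.
 */
static long demo_claim_bit(_Atomic unsigned long *map, unsigned long nbits,
			   unsigned long max_rounds)
{
	assert(nbits > 0);

	for (unsigned long round = 0; round < max_rounds; round++) {
		for (unsigned long nr = 0; nr < nbits; nr++) {
			if (!demo_test_and_set_bit(nr, map))
				return (long)nr;
		}
	}
	return -1;
}

/* CF is bit 0 of the x86 EFLAGS register. */
static bool demo_carry_set(unsigned long eflags)
{
	return (eflags & 0x1UL) != 0;
}
```

Starting from an all-zero one-word map, two successive calls to `demo_test_and_set_bit(0, map)` return false and then true. Applied to the register dumps above, `demo_carry_set()` is true for the kernel-side EFLAGS value 0x247 and false for the user-side value 0x202.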


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20260313/202603131533.347f08fb-lkp@intel.com



-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki



* Re: [matttbe-net-next:gh-issue-617-stalls-starting-vsock] [sched/mmcid] 5e4b47fd15: BUG:soft_lockup-CPU##stuck_for#s![make:#]
From: Matthieu Baerts @ 2026-03-13  8:51 UTC
  To: kernel test robot; +Cc: oe-lkp, lkp

Hello,

On 13/03/2026 09:11, kernel test robot wrote:
> kernel test robot noticed "BUG:soft_lockup-CPU##stuck_for#s![make:#]" on:
> 
> commit: 5e4b47fd150c2ab74396945eae895ed419505daa ("Revert "sched/mmcid: Switch over to the new mechanism"")
> https://git.kernel.org/cgit/linux/kernel/git/matttbe/net-next.git gh-issue-617-stalls-starting-vsock

Thank you for the bug report. A newer version exists without this issue:

https://lore.kernel.org/all/20260310201009.257617049@kernel.org

Applied in tip: sched/urgent

Cheers,
Matt
-- 
Sponsored by the NGI0 Core fund.

