* [bigeasy-staging:futex_local_v4.5] [futex]  51319c5cb6: BUG:sleeping_function_called_from_invalid_context_at_kernel/nsproxy.c
@ 2024-12-24  7:36 kernel test robot
From: kernel test robot @ 2024-12-24  7:36 UTC (permalink / raw)
  To: Sebastian Andrzej Siewior; +Cc: oe-lkp, lkp, oliver.sang



Hello,

we reported
"[bigeasy-staging:futex_local_v4.5] [futex]  51319c5cb6: stress-ng.pthread.ops_per_sec 68.4% regression"
in
https://lore.kernel.org/all/202412201111.1290bdf8-lkp@intel.com/

Now we have also noticed one new issue, shown below. It does not always
reproduce, but it occurs at a considerable rate (17 of 52 runs), while the
parent commit stays clean over 50 runs.

7ee6fb8b9098b494 51319c5cb6c2f84994a14d11de0
---------------- ---------------------------
       fail:runs  %reproduction    fail:runs
           |             |             |
           :50          34%          17:52    dmesg.BUG:scheduling_while_atomic
           :50          34%          17:52    dmesg.BUG:sleeping_function_called_from_invalid_context_at_kernel/nsproxy.c


The full report is below, FYI.


kernel test robot noticed "BUG:sleeping_function_called_from_invalid_context_at_kernel/nsproxy.c" on:

commit: 51319c5cb6c2f84994a14d11de0fc26321bed99d ("futex: Resize futex hash table based on number of threads.")
https://git.kernel.org/cgit/linux/kernel/git/bigeasy/staging.git futex_local_v4.5

in testcase: xfstests
version: xfstests-x86_64-8467552f-1_20241215
with the following parameters:

	disk: 4HDD
	fs: xfs
	test: xfs-scratch-reflink-06



config: x86_64-rhel-9.4-func
compiler: gcc-12
test machine: 4 threads Intel(R) Xeon(R) CPU E3-1225 v5 @ 3.30GHz (Skylake) with 16G memory

(please refer to attached dmesg/kmsg for entire log/backtrace)



If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add the following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202412241517.b81e285e-lkp@intel.com


[  145.073215][    C1] BUG: sleeping function called from invalid context at kernel/nsproxy.c:243
[  145.073222][    C1] in_atomic(): 1, irqs_disabled(): 0, non_block: 0, pid: 3374, name: xfs_repair
[  145.073224][    C1] preempt_count: 7ffffffe, expected: 0
[  145.073226][    C1] RCU nest depth: 0, expected: 0
[  145.073229][    C1] CPU: 1 UID: 0 PID: 3374 Comm: xfs_repair Tainted: G S                 6.13.0-rc1-00008-g51319c5cb6c2 #1
[  145.073234][    C1] Tainted: [S]=CPU_OUT_OF_SPEC
[  145.073235][    C1] Hardware name: HP HP Z238 Microtower Workstation/8183, BIOS N51 Ver. 01.63 10/05/2017
[  145.073236][    C1] Call Trace:
[  145.073238][    C1]  <TASK>
[ 145.073239][ C1] dump_stack_lvl (lib/dump_stack.c:123 (discriminator 1))
[ 145.073247][ C1] __might_resched (kernel/sched/core.c:8759)
[ 145.073252][ C1] ? free_pid (kernel/pid.c:163)
[ 145.073255][ C1] switch_task_namespaces (include/linux/kernel.h:73 kernel/nsproxy.c:243)
[ 145.073259][ C1] copy_process (kernel/fork.c:2678)
[ 145.073264][ C1] ? __pfx_copy_process (kernel/fork.c:2153)
[ 145.073267][ C1] ? _inline_copy_from_user (arch/x86/include/asm/uaccess_64.h:134 arch/x86/include/asm/uaccess_64.h:134 arch/x86/include/asm/uaccess_64.h:141 include/linux/uaccess.h:178)
[ 145.073271][ C1] ? copy_clone_args_from_user (include/linux/uaccess.h:401 kernel/fork.c:2999)
[ 145.073275][ C1] kernel_clone (include/linux/random.h:26 kernel/fork.c:2832)
[ 145.073277][ C1] ? __pfx_kernel_clone (kernel/fork.c:2791)
[ 145.073279][ C1] ? __pfx___alloc_pages_noprof (mm/page_alloc.c:4717)
[ 145.073283][ C1] ? __pfx_mm_cid_get+0x10/0x10
[ 145.073286][ C1] __do_sys_clone3 (kernel/fork.c:3135)
[ 145.073289][ C1] ? __pfx___do_sys_clone3 (kernel/fork.c:3115)
[ 145.073292][ C1] ? __pfx_try_charge_memcg (mm/memcontrol.c:2186)
[ 145.073296][ C1] ? __memcg_slab_post_alloc_hook (mm/memcontrol.c:3001 (discriminator 2))
[ 145.073299][ C1] ? _raw_spin_lock_irq (arch/x86/include/asm/atomic.h:107 include/linux/atomic/atomic-arch-fallback.h:2170 include/linux/atomic/atomic-instrumented.h:1302 include/asm-generic/qspinlock.h:111 include/linux/spinlock.h:187 include/linux/spinlock_api_smp.h:120 kernel/locking/spinlock.c:170)
[ 145.073302][ C1] ? __pfx__raw_spin_lock_irq (kernel/locking/spinlock.c:169)
[ 145.073304][ C1] ? __mod_memcg_lruvec_state (mm/memcontrol.c:583 mm/memcontrol.c:766)
[ 145.073307][ C1] ? __pfx__raw_spin_lock (kernel/locking/spinlock.c:153)
[ 145.073309][ C1] ? recalc_sigpending (arch/x86/include/asm/bitops.h:75 include/asm-generic/bitops/instrumented-atomic.h:42 include/linux/thread_info.h:102 kernel/signal.c:180 kernel/signal.c:177)
[ 145.073313][ C1] ? sigprocmask (kernel/signal.c:3242)
[ 145.073315][ C1] ? __lruvec_stat_mod_folio (mm/memcontrol.c:811)
[ 145.073318][ C1] do_syscall_64 (arch/x86/entry/common.c:52 arch/x86/entry/common.c:83)
[ 145.073321][ C1] ? __pfx_sigprocmask (kernel/signal.c:3219)
[ 145.073324][ C1] ? __folio_mod_stat (mm/rmap.c:1303)
[ 145.073328][ C1] ? _copy_to_user (arch/x86/include/asm/uaccess_64.h:126 arch/x86/include/asm/uaccess_64.h:147 include/linux/uaccess.h:197 lib/usercopy.c:26)
[ 145.073330][ C1] ? __x64_sys_rt_sigprocmask (kernel/signal.c:3324 kernel/signal.c:3301 kernel/signal.c:3301)
[ 145.073333][ C1] ? __pfx___x64_sys_rt_sigprocmask (kernel/signal.c:3301)
[ 145.073337][ C1] ? syscall_exit_to_user_mode (arch/x86/include/asm/irqflags.h:37 arch/x86/include/asm/irqflags.h:92 include/linux/entry-common.h:232 kernel/entry/common.c:206 kernel/entry/common.c:218)
[ 145.073340][ C1] ? do_syscall_64 (arch/x86/entry/common.c:102)
[ 145.073343][ C1] ? _raw_spin_lock_irqsave (arch/x86/include/asm/atomic.h:107 include/linux/atomic/atomic-arch-fallback.h:2170 include/linux/atomic/atomic-instrumented.h:1302 include/asm-generic/qspinlock.h:111 include/linux/spinlock.h:187 include/linux/spinlock_api_smp.h:111 kernel/locking/spinlock.c:162)
[ 145.073345][ C1] ? __pfx__raw_spin_lock_irqsave (kernel/locking/spinlock.c:161)
[ 145.073347][ C1] ? mt_find (lib/maple_tree.c:6950)
[ 145.073350][ C1] ? try_to_wake_up (kernel/sched/core.c:4168)
[ 145.073353][ C1] ? wake_q_add (arch/x86/include/asm/atomic.h:93 include/linux/atomic/atomic-arch-fallback.h:749 include/linux/atomic/atomic-instrumented.h:253 include/linux/refcount.h:184 include/linux/refcount.h:241 include/linux/refcount.h:258 include/linux/sched/task.h:119 kernel/sched/core.c:1024)
[ 145.073356][ C1] ? __pfx_try_to_wake_up (kernel/sched/core.c:4168)
[ 145.073359][ C1] ? rwsem_mark_wake (kernel/locking/rwsem.c:434)
[ 145.073363][ C1] ? __pfx_rwsem_mark_wake (kernel/locking/rwsem.c:412)
[ 145.073366][ C1] ? __asan_memset (mm/kasan/shadow.c:84)
[ 145.073369][ C1] ? rseq_ip_fixup (kernel/rseq.c:257 kernel/rseq.c:291)
[ 145.073373][ C1] ? __pfx_rseq_ip_fixup (kernel/rseq.c:275)
[ 145.073376][ C1] ? __pfx_mem_cgroup_handle_over_high (mm/memcontrol.c:2092)
[ 145.073379][ C1] ? do_user_addr_fault (include/linux/mmap_lock.h:171 arch/x86/mm/fault.c:1417)
[ 145.073383][ C1] ? fpregs_restore_userregs (arch/x86/include/asm/bitops.h:75 include/asm-generic/bitops/instrumented-atomic.h:42 include/linux/thread_info.h:102 arch/x86/kernel/fpu/context.h:79)
[ 145.073387][ C1] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:130)
[  145.073391][    C1] RIP: 0033:0x7f340071d849
[ 145.073394][ C1] Code: 31 ed e9 44 ff ff ff e8 25 e9 00 00 0f 1f 44 00 00 b8 ea ff ff ff 48 85 ff 74 2c 48 85 d2 74 27 49 89 c8 b8 b3 01 00 00 0f 05 <48> 85 c0 7c 18 74 01 c3 31 ed 48 83 e4 f0 4c 89 c7 ff d2 48 89 c7
All code
========
   0:	31 ed                	xor    %ebp,%ebp
   2:	e9 44 ff ff ff       	jmp    0xffffffffffffff4b
   7:	e8 25 e9 00 00       	call   0xe931
   c:	0f 1f 44 00 00       	nopl   0x0(%rax,%rax,1)
  11:	b8 ea ff ff ff       	mov    $0xffffffea,%eax
  16:	48 85 ff             	test   %rdi,%rdi
  19:	74 2c                	je     0x47
  1b:	48 85 d2             	test   %rdx,%rdx
  1e:	74 27                	je     0x47
  20:	49 89 c8             	mov    %rcx,%r8
  23:	b8 b3 01 00 00       	mov    $0x1b3,%eax
  28:	0f 05                	syscall
  2a:*	48 85 c0             	test   %rax,%rax		<-- trapping instruction
  2d:	7c 18                	jl     0x47
  2f:	74 01                	je     0x32
  31:	c3                   	ret
  32:	31 ed                	xor    %ebp,%ebp
  34:	48 83 e4 f0          	and    $0xfffffffffffffff0,%rsp
  38:	4c 89 c7             	mov    %r8,%rdi
  3b:	ff d2                	call   *%rdx
  3d:	48 89 c7             	mov    %rax,%rdi

Code starting with the faulting instruction
===========================================
   0:	48 85 c0             	test   %rax,%rax
   3:	7c 18                	jl     0x1d
   5:	74 01                	je     0x8
   7:	c3                   	ret
   8:	31 ed                	xor    %ebp,%ebp
   a:	48 83 e4 f0          	and    $0xfffffffffffffff0,%rsp
   e:	4c 89 c7             	mov    %r8,%rdi
  11:	ff d2                	call   *%rdx
  13:	48 89 c7             	mov    %rax,%rdi
[  145.073397][    C1] RSP: 002b:00007f33b4ff8988 EFLAGS: 00000206 ORIG_RAX: 00000000000001b3
[  145.073400][    C1] RAX: ffffffffffffffda RBX: 00007f340069cef0 RCX: 00007f340071d849
[  145.073402][    C1] RDX: 00007f340069cef0 RSI: 0000000000000058 RDI: 00007f33b4ff89d0
[  145.073404][    C1] RBP: 00007f33f20fa680 R08: 00007f33f20fa680 R09: 00007f33b4ff8ac7
[  145.073405][    C1] R10: 0000000000000008 R11: 0000000000000206 R12: fffffffffffffd80
[  145.073407][    C1] R13: 0000000000000000 R14: 00007f33b4ff89d0 R15: 00007f33f18fa000
[  145.073410][    C1]  </TASK>
[  145.073416][    C1] BUG: scheduling while atomic: xfs_repair/3374/0x7fffffff
[  145.073418][    C1] Modules linked in:
[  145.073420][ T3374]  xfs btrfs blake2b_generic xor zstd_compress raid6_pq libcrc32c snd_soc_avs snd_soc_hda_codec snd_hda_codec_hdmi snd_hda_ext_core sd_mod intel_rapl_msr intel_rapl_common snd_hda_codec_realtek snd_soc_core snd_hda_codec_generic snd_hda_scodec_component x86_pkg_temp_thermal intel_powerclamp snd_compress coretemp sg snd_hda_intel snd_intel_dspcfg kvm_intel snd_intel_sdw_acpi ipmi_devintf ipmi_msghandler i915 kvm snd_hda_codec cec crct10dif_pclmul crc32_pclmul snd_hda_core intel_gtt crc32c_intel snd_hwdep drm_buddy ghash_clmulni_intel mei_wdt wmi_bmof ttm snd_pcm rapl ahci libahci intel_cstate drm_display_helper snd_timer snd drm_kms_helper video serio_raw pcspkr intel_uncore soundcore mei_me i2c_i801 intel_pmc_core intel_pch_thermal libata ie31200_edac i2c_smbus mei intel_vsec pmt_telemetry pmt_class wmi acpi_pad binfmt_misc loop fuse drm dm_mod ip_tables
[  145.448098][    C1]
[  145.448101][    C1] CPU: 1 UID: 0 PID: 3374 Comm: xfs_repair Tainted: G S      W          6.13.0-rc1-00008-g51319c5cb6c2 #1
[  145.448106][    C1] Tainted: [S]=CPU_OUT_OF_SPEC, [W]=WARN
[  145.448107][    C1] Hardware name: HP HP Z238 Microtower Workstation/8183, BIOS N51 Ver. 01.63 10/05/2017
[  145.448108][    C1] Call Trace:
[  145.448124][    C1]  <TASK>
[ 145.448125][ C1] dump_stack_lvl (lib/dump_stack.c:123 (discriminator 1))
[ 145.448131][ C1] __schedule_bug (kernel/sched/core.c:5930)
[ 145.448136][ C1] schedule_debug (arch/x86/include/asm/preempt.h:33 kernel/sched/core.c:5957)
[ 145.448138][ C1] ? __do_sys_clone3 (kernel/fork.c:3135)
[ 145.448141][ C1] __schedule (arch/x86/include/asm/jump_label.h:36 include/linux/jump_label.h:207 kernel/sched/features.h:57 kernel/sched/core.c:6657)
[ 145.448144][ C1] ? __pfx___schedule (kernel/sched/core.c:6637)
[ 145.448147][ C1] ? __pfx_try_charge_memcg (mm/memcontrol.c:2186)
[ 145.448164][ C1] ? __memcg_slab_post_alloc_hook (mm/memcontrol.c:3001 (discriminator 2))
[ 145.448167][ C1] ? _raw_spin_lock_irq (arch/x86/include/asm/atomic.h:107 include/linux/atomic/atomic-arch-fallback.h:2170 include/linux/atomic/atomic-instrumented.h:1302 include/asm-generic/qspinlock.h:111 include/linux/spinlock.h:187 include/linux/spinlock_api_smp.h:120 kernel/locking/spinlock.c:170)
[ 145.448170][ C1] schedule (arch/x86/include/asm/preempt.h:84 kernel/sched/core.c:6834 kernel/sched/core.c:6848)
[ 145.448173][ C1] syscall_exit_to_user_mode (kernel/entry/common.c:102 include/linux/entry-common.h:329 kernel/entry/common.c:207 kernel/entry/common.c:218)
[ 145.448176][ C1] do_syscall_64 (arch/x86/entry/common.c:102)
[ 145.448178][ C1] ? __pfx_sigprocmask (kernel/signal.c:3219)
[ 145.448181][ C1] ? __folio_mod_stat (mm/rmap.c:1303)
[ 145.448185][ C1] ? _copy_to_user (arch/x86/include/asm/uaccess_64.h:126 arch/x86/include/asm/uaccess_64.h:147 include/linux/uaccess.h:197 lib/usercopy.c:26)
[ 145.448189][ C1] ? __x64_sys_rt_sigprocmask (kernel/signal.c:3324 kernel/signal.c:3301 kernel/signal.c:3301)
[ 145.448207][ C1] ? __pfx___x64_sys_rt_sigprocmask (kernel/signal.c:3301)
[ 145.448211][ C1] ? syscall_exit_to_user_mode (arch/x86/include/asm/irqflags.h:37 arch/x86/include/asm/irqflags.h:92 include/linux/entry-common.h:232 kernel/entry/common.c:206 kernel/entry/common.c:218)
[ 145.448213][ C1] ? do_syscall_64 (arch/x86/entry/common.c:102)
[ 145.448215][ C1] ? _raw_spin_lock_irqsave (arch/x86/include/asm/atomic.h:107 include/linux/atomic/atomic-arch-fallback.h:2170 include/linux/atomic/atomic-instrumented.h:1302 include/asm-generic/qspinlock.h:111 include/linux/spinlock.h:187 include/linux/spinlock_api_smp.h:111 kernel/locking/spinlock.c:162)
[ 145.448218][ C1] ? __pfx__raw_spin_lock_irqsave (kernel/locking/spinlock.c:161)
[ 145.448233][ C1] ? mt_find (lib/maple_tree.c:6950)
[ 145.448236][ C1] ? try_to_wake_up (kernel/sched/core.c:4168)
[ 145.448239][ C1] ? wake_q_add (arch/x86/include/asm/atomic.h:93 include/linux/atomic/atomic-arch-fallback.h:749 include/linux/atomic/atomic-instrumented.h:253 include/linux/refcount.h:184 include/linux/refcount.h:241 include/linux/refcount.h:258 include/linux/sched/task.h:119 kernel/sched/core.c:1024)
[ 145.448242][ C1] ? __pfx_try_to_wake_up (kernel/sched/core.c:4168)
[ 145.448245][ C1] ? rwsem_mark_wake (kernel/locking/rwsem.c:434)
[ 145.448249][ C1] ? __pfx_rwsem_mark_wake (kernel/locking/rwsem.c:412)
[ 145.448252][ C1] ? __asan_memset (mm/kasan/shadow.c:84)
[ 145.448256][ C1] ? rseq_ip_fixup (kernel/rseq.c:257 kernel/rseq.c:291)
[ 145.448259][ C1] ? __pfx_rseq_ip_fixup (kernel/rseq.c:275)
[ 145.448262][ C1] ? __pfx_mem_cgroup_handle_over_high (mm/memcontrol.c:2092)
[ 145.448265][ C1] ? do_user_addr_fault (include/linux/mmap_lock.h:171 arch/x86/mm/fault.c:1417)
[ 145.448268][ C1] ? fpregs_restore_userregs (arch/x86/include/asm/bitops.h:75 include/asm-generic/bitops/instrumented-atomic.h:42 include/linux/thread_info.h:102 arch/x86/kernel/fpu/context.h:79)
[ 145.448273][ C1] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:130)
[  145.448277][    C1] RIP: 0033:0x7f340071d849
[ 145.448280][ C1] Code: 31 ed e9 44 ff ff ff e8 25 e9 00 00 0f 1f 44 00 00 b8 ea ff ff ff 48 85 ff 74 2c 48 85 d2 74 27 49 89 c8 b8 b3 01 00 00 0f 05 <48> 85 c0 7c 18 74 01 c3 31 ed 48 83 e4 f0 4c 89 c7 ff d2 48 89 c7
All code
========
   0:	31 ed                	xor    %ebp,%ebp
   2:	e9 44 ff ff ff       	jmp    0xffffffffffffff4b
   7:	e8 25 e9 00 00       	call   0xe931
   c:	0f 1f 44 00 00       	nopl   0x0(%rax,%rax,1)
  11:	b8 ea ff ff ff       	mov    $0xffffffea,%eax
  16:	48 85 ff             	test   %rdi,%rdi
  19:	74 2c                	je     0x47
  1b:	48 85 d2             	test   %rdx,%rdx
  1e:	74 27                	je     0x47
  20:	49 89 c8             	mov    %rcx,%r8
  23:	b8 b3 01 00 00       	mov    $0x1b3,%eax
  28:	0f 05                	syscall
  2a:*	48 85 c0             	test   %rax,%rax		<-- trapping instruction
  2d:	7c 18                	jl     0x47
  2f:	74 01                	je     0x32
  31:	c3                   	ret
  32:	31 ed                	xor    %ebp,%ebp
  34:	48 83 e4 f0          	and    $0xfffffffffffffff0,%rsp
  38:	4c 89 c7             	mov    %r8,%rdi
  3b:	ff d2                	call   *%rdx
  3d:	48 89 c7             	mov    %rax,%rdi

Code starting with the faulting instruction
===========================================
   0:	48 85 c0             	test   %rax,%rax
   3:	7c 18                	jl     0x1d
   5:	74 01                	je     0x8
   7:	c3                   	ret
   8:	31 ed                	xor    %ebp,%ebp
   a:	48 83 e4 f0          	and    $0xfffffffffffffff0,%rsp
   e:	4c 89 c7             	mov    %r8,%rdi
  11:	ff d2                	call   *%rdx
  13:	48 89 c7             	mov    %rax,%rdi
[  145.448282][    C1] RSP: 002b:00007f33b4ff8988 EFLAGS: 00000206 ORIG_RAX: 00000000000001b3
[  145.448285][    C1] RAX: ffffffffffffffea RBX: 00007f340069cef0 RCX: 00007f340071d849
[  145.448287][    C1] RDX: 00007f340069cef0 RSI: 0000000000000058 RDI: 00007f33b4ff89d0
[  145.448289][    C1] RBP: 00007f33f20fa680 R08: 00007f33f20fa680 R09: 00007f33b4ff8ac7
[  145.448290][    C1] R10: 0000000000000008 R11: 0000000000000206 R12: fffffffffffffd80
[  145.448292][    C1] R13: 0000000000000000 R14: 00007f33b4ff89d0 R15: 00007f33f18fa000
[  145.448295][    C1]  </TASK>
[  156.427599][  T281] _check_xfs_filesystem: filesystem on /dev/sda4 is inconsistent (r)
[  156.427609][  T281]
[  156.439078][  T281] (see /lkp/benchmarks/xfstests/results//xfs/127.full for details)
[  156.439125][  T281]
[  156.472482][  T281] _check_dmesg: something found in dmesg (see /lkp/benchmarks/xfstests/results//xfs/127.dmesg)
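
The raw values in the two splats above can be decoded with a short sketch.
It assumes that preempt_count() on x86 masks out the PREEMPT_NEED_RESCHED
bit (bit 31), so the printed value is a 31-bit quantity, and that RAX holds
a Linux x86_64 return value where -4095..-1 encodes an errno. The helper
names signed_preempt_count() and decode_rax() are hypothetical and exist
only for this illustration.

```python
# Linux x86_64 errno values needed here, hard-coded so the sketch does not
# depend on the host platform's errno table.
ERRNO_NAMES = {22: "EINVAL", 38: "ENOSYS"}

# Assumption: bit 31 of the x86 preempt count is PREEMPT_NEED_RESCHED and is
# masked out before the value is printed in the splat.
PREEMPT_NEED_RESCHED = 0x80000000


def signed_preempt_count(value: int) -> int:
    """Sign-extend the 31-bit preempt count printed in the log."""
    value &= ~PREEMPT_NEED_RESCHED & 0xFFFFFFFF
    # Bit 30 is the sign bit of the remaining 31-bit field.
    return value - (1 << 31) if value & (1 << 30) else value


def decode_rax(value: int) -> str:
    """Interpret a 64-bit RAX as a syscall return value or -errno."""
    signed = value - (1 << 64) if value & (1 << 63) else value
    if -4095 <= signed < 0:
        return "-" + ERRNO_NAMES.get(-signed, str(-signed))
    return str(signed)


if __name__ == "__main__":
    print(signed_preempt_count(0x7FFFFFFE))   # "preempt_count: 7ffffffe"
    print(signed_preempt_count(0x7FFFFFFF))   # "scheduling while atomic ... 0x7fffffff"
    print(0x1B3)                              # ORIG_RAX, the syscall number
    print(decode_rax(0xFFFFFFFFFFFFFFDA))     # RAX in the first register dump
    print(decode_rax(0xFFFFFFFFFFFFFFEA))     # RAX in the second register dump
```

Read this way, the preempt counts are -2 and -1, ORIG_RAX is 435 (clone3 on
x86_64, matching __do_sys_clone3 in the traces), and RAX changes from
-ENOSYS to -EINVAL between the two dumps. A negative count would be
consistent with an unbalanced preempt_enable() (or equivalent) somewhere on
the clone3() path, but that is an interpretation of the log and is not
confirmed by this report.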


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20241224/202412241517.b81e285e-lkp@intel.com



-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki

