All of lore.kernel.org
 help / color / mirror / Atom feed
From: kernel test robot <oliver.sang@intel.com>
To: Breno Leitao <leitao@debian.org>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>,
	<linux-crypto@vger.kernel.org>,
	Herbert Xu <herbert@gondor.apana.org.au>,
	Tejun Heo <tj@kernel.org>, <netdev@vger.kernel.org>,
	<oliver.sang@intel.com>
Subject: [herbert-cryptodev-2.6:master] [rhashtable] e1d3422c95: WARNING:at_mm/util.c:#__kvmalloc_node_noprof
Date: Tue, 31 Dec 2024 13:24:06 +0800	[thread overview]
Message-ID: <202412311213.4e69877e-lkp@intel.com> (raw)



Hello,


we reported
"[herbert-cryptodev-2.6:master] [rhashtable]  e1d3422c95: stress-ng.syscall.ops_per_sec 98.9% regression"
in
https://lore.kernel.org/all/202412271017.cad7675-lkp@intel.com/

now we observed below WARNING in another tests. it doesn't always happen.
11 out of 20 runs as below. but keeps clean on parent.


f916e44487f56df4 e1d3422c95f003eba241c176adf
---------------- ---------------------------
       fail:runs  %reproduction    fail:runs
           |             |             |
           :20          55%          11:20    dmesg.RIP:__kvmalloc_node_noprof
           :20          55%          11:20    dmesg.WARNING:at_mm/util.c:#__kvmalloc_node_noprof

below full report FYI.


kernel test robot noticed "WARNING:at_mm/util.c:#__kvmalloc_node_noprof" on:

commit: e1d3422c95f003eba241c176adfe593c33e8a8f6 ("rhashtable: Fix potential deadlock by moving schedule_work outside lock")
https://git.kernel.org/cgit/linux/kernel/git/herbert/cryptodev-2.6.git master

in testcase: reaim
version: reaim-x86_64-7.0.1.13-1_20240229
with following parameters:

	runtime: 300s
	nr_task: 100%
	test: short
	cpufreq_governor: performance



config: x86_64-rhel-9.4
compiler: gcc-12
test machine: 48 threads 2 sockets Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz (Ivy Bridge-EP) with 64G memory

(please refer to attached dmesg/kmsg for entire log/backtrace)



If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202412311213.4e69877e-lkp@intel.com


kern  :warn  : [   45.855882] ------------[ cut here ]------------
kern :warn : [   45.861844] WARNING: CPU: 5 PID: 379 at mm/util.c:662 __kvmalloc_node_noprof (mm/util.c:662 (discriminator 1))
kern  :warn  : [   45.871496] Modules linked in: btrfs blake2b_generic xor raid6_pq libcrc32c sr_mod sd_mod cdrom sg intel_rapl_msr intel_rapl_common binfmt_misc sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul isci crc32c_intel ghash_clmulni_intel snd_pcm libsas ahci mgag200 ipmi_si libahci scsi_transport_sas rapl snd_timer drm_client_lib snd intel_cstate mei_me drm_shmem_helper ipmi_devintf i2c_i801 soundcore ioatdma libata mei intel_uncore drm_kms_helper ipmi_msghandler lpc_ich pcspkr i2c_smbus dca wmi joydev drm fuse loop dm_mod ip_tables
kern  :warn  : [   45.927716] CPU: 5 UID: 0 PID: 379 Comm: kworker/5:1 Not tainted 6.13.0-rc2-00035-ge1d3422c95f0 #1
kern  :warn  : [   45.938497] Hardware name: Intel Corporation S2600CP/S2600CP, BIOS SE5C600.86B.99.99.x069.071520130923 07/15/2013
kern  :warn  : [   45.950732] Workqueue: events rht_deferred_worker
kern :warn : [   45.956775] RIP: 0010:__kvmalloc_node_noprof (mm/util.c:662 (discriminator 1))
kern :warn : [ 45.963290] Code: 00 00 48 8d 4c 0a ff e8 1d 54 05 00 48 83 c4 18 5b 5d 41 5c c3 cc cc cc cc 48 b9 00 00 00 00 00 20 00 00 eb c3 80 e7 20 75 e6 <0f> 0b eb e2 66 66 2e 0f 1f 84 00 00 00 00 00 66 66 2e 0f 1f 84 00
All code
========
   0:	00 00                	add    %al,(%rax)
   2:	48 8d 4c 0a ff       	lea    -0x1(%rdx,%rcx,1),%rcx
   7:	e8 1d 54 05 00       	call   0x55429
   c:	48 83 c4 18          	add    $0x18,%rsp
  10:	5b                   	pop    %rbx
  11:	5d                   	pop    %rbp
  12:	41 5c                	pop    %r12
  14:	c3                   	ret
  15:	cc                   	int3
  16:	cc                   	int3
  17:	cc                   	int3
  18:	cc                   	int3
  19:	48 b9 00 00 00 00 00 	movabs $0x200000000000,%rcx
  20:	20 00 00 
  23:	eb c3                	jmp    0xffffffffffffffe8
  25:	80 e7 20             	and    $0x20,%bh
  28:	75 e6                	jne    0x10
  2a:*	0f 0b                	ud2		<-- trapping instruction
  2c:	eb e2                	jmp    0x10
  2e:	66 66 2e 0f 1f 84 00 	data16 cs nopw 0x0(%rax,%rax,1)
  35:	00 00 00 00 
  39:	66                   	data16
  3a:	66                   	data16
  3b:	2e                   	cs
  3c:	0f                   	.byte 0xf
  3d:	1f                   	(bad)
  3e:	84 00                	test   %al,(%rax)

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2
   2:	eb e2                	jmp    0xffffffffffffffe6
   4:	66 66 2e 0f 1f 84 00 	data16 cs nopw 0x0(%rax,%rax,1)
   b:	00 00 00 00 
   f:	66                   	data16
  10:	66                   	data16
  11:	2e                   	cs
  12:	0f                   	.byte 0xf
  13:	1f                   	(bad)
  14:	84 00                	test   %al,(%rax)
kern  :warn  : [   45.985030] RSP: 0018:ffffc9000726bdf8 EFLAGS: 00010246
kern  :warn  : [   45.991655] RAX: 0000000000000000 RBX: 00000000000000c0 RCX: 0000000000000013
kern  :warn  : [   46.000399] RDX: 0000000000000013 RSI: ffffffff8143fb29 RDI: 0000000000052dc0
kern  :warn  : [   46.009144] RBP: 0000000080000040 R08: ffff888100050ec0 R09: 00000000000003bf
kern  :warn  : [   46.017889] R10: ffffc9000726bdf8 R11: 0000000000000000 R12: 00000000ffffffff
kern  :warn  : [   46.026632] R13: 0000000000000dc0 R14: ffffffff832bdb78 R15: ffffc9003f200000
kern  :warn  : [   46.035377] FS:  0000000000000000(0000) GS:ffff88881f480000(0000) knlGS:0000000000000000
kern  :warn  : [   46.045189] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kern  :warn  : [   46.052381] CR2: 00007fb81ab04010 CR3: 00000001ddf98002 CR4: 00000000001706f0
kern  :warn  : [   46.061140] Call Trace:
kern  :warn  : [   46.064650]  <TASK>
kern :warn : [   46.067773] ? __warn (kernel/panic.c:748)
kern :warn : [   46.072169] ? __kvmalloc_node_noprof (mm/util.c:662 (discriminator 1))
kern :warn : [   46.078004] ? report_bug (lib/bug.c:180 lib/bug.c:219)
kern :warn : [   46.082885] ? handle_bug (arch/x86/kernel/traps.c:285)
kern :warn : [   46.087570] ? exc_invalid_op (arch/x86/kernel/traps.c:309 (discriminator 1))
kern :warn : [   46.092640] ? asm_exc_invalid_op (arch/x86/include/asm/idtentry.h:621)
kern :warn : [   46.098091] ? __kmalloc_node_noprof (arch/x86/include/asm/bitops.h:417 include/asm-generic/getorder.h:46 mm/slub.c:4273 mm/slub.c:4289)
kern :warn : [   46.104039] ? __kvmalloc_node_noprof (mm/util.c:662 (discriminator 1))
kern :warn : [   46.109874] ? __kvmalloc_node_noprof (mm/util.c:653)
kern :warn : [   46.115710] bucket_table_alloc+0x49/0x170
kern :warn : [   46.121753] rht_deferred_worker (lib/rhashtable.c:367 lib/rhashtable.c:427)
kern :warn : [   46.127216] process_one_work (kernel/workqueue.c:3234)
kern :warn : [   46.132492] worker_thread (kernel/workqueue.c:3304 kernel/workqueue.c:3391)
kern :warn : [   46.137471] ? __pfx_worker_thread (kernel/workqueue.c:3337)
kern :warn : [   46.143031] ? __pfx_worker_thread (kernel/workqueue.c:3337)
kern :warn : [   46.148587] kthread (kernel/kthread.c:389)
kern :warn : [   46.152789] ? __pfx_kthread (kernel/kthread.c:342)
kern :warn : [   46.157765] ret_from_fork (arch/x86/kernel/process.c:153)
kern :warn : [   46.162551] ? __pfx_kthread (kernel/kthread.c:342)
kern :warn : [   46.167527] ret_from_fork_asm (arch/x86/entry/entry_64.S:257)
kern  :warn  : [   46.172691]  </TASK>
kern  :warn  : [   46.175920] ---[ end trace 0000000000000000 ]---
user  :err   : [   48.391965] create_shared_memory(): can't create semaphore, pausing...

user  :err   : [   48.404021] create_shared_memory(): can't create semaphore, pausing...

user  :err   : [   48.415885] create_shared_memory(): can't create semaphore, pausing...

user  :err   : [   48.427763] create_shared_memory(): can't create semaphore, pausing...

user  :err   : [   48.439628] create_shared_memory(): can't create semaphore, pausing...

user  :err   : [   48.451489] create_shared_memory(): can't create semaphore, pausing...




The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20241231/202412311213.4e69877e-lkp@intel.com



-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


                 reply	other threads:[~2024-12-31  5:24 UTC|newest]

Thread overview: [no followups] expand[flat|nested]  mbox.gz  Atom feed

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=202412311213.4e69877e-lkp@intel.com \
    --to=oliver.sang@intel.com \
    --cc=herbert@gondor.apana.org.au \
    --cc=leitao@debian.org \
    --cc=linux-crypto@vger.kernel.org \
    --cc=lkp@intel.com \
    --cc=netdev@vger.kernel.org \
    --cc=oe-lkp@lists.linux.dev \
    --cc=tj@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.