All of lore.kernel.org
 help / color / mirror / Atom feed
From: kernel test robot <oliver.sang@intel.com>
To: Zi Yan <ziy@nvidia.com>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>,
	<linux-kernel@vger.kernel.org>,
	Andrew Morton <akpm@linux-foundation.org>,
	Baolin Wang <baolin.wang@linux.alibaba.com>,
	David Hildenbrand <david@redhat.com>,
	"Huang, Ying" <ying.huang@intel.com>,
	"Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>,
	Matthew Wilcox <willy@infradead.org>,
	Ryan Roberts <ryan.roberts@arm.com>,
	SeongJae Park <sj@kernel.org>, "Yang Shi" <shy828301@gmail.com>,
	Yin Fengwei <fengwei.yin@intel.com>, <linux-mm@kvack.org>,
	<oliver.sang@intel.com>
Subject: [linus:master] [mm/migrate]  7262f208ca: kernel_BUG_at_mm/migrate.c
Date: Mon, 17 Jun 2024 15:03:08 +0800	[thread overview]
Message-ID: <202406171436.a30c129-oliver.sang@intel.com> (raw)



Hello,

kernel test robot noticed "kernel_BUG_at_mm/migrate.c" on:

commit: 7262f208ca681385d133844be8a58d9b4ca185f7 ("mm/migrate: split source folio if it is on deferred split list")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

[test failed on linus/master      32f88d65f01bf6f45476d7edbe675e44fb9e1d58]
[test failed on linux-next/master 234cb065ad82915ff8d06ce01e01c3e640b674d2]

in testcase: vm-scalability
version: vm-scalability-x86_64-6f4ef16-0_20240303
with following parameters:

	runtime: 300s
	size: 8T
	test: anon-cow-seq
	cpufreq_governor: performance



compiler: gcc-13
test machine: 96 threads 2 sockets Intel(R) Xeon(R) Platinum 8260L CPU @ 2.40GHz (Cascade Lake) with 128G memory

(please refer to attached dmesg/kmsg for entire log/backtrace)



If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <oliver.sang@intel.com>
| Closes: https://lore.kernel.org/oe-lkp/202406171436.a30c129-oliver.sang@intel.com


[   84.214952][ T6581] ------------[ cut here ]------------
[   84.219158][ T1289] 781916337 bytes / 1533701 usecs = 497874 KB/s
[   84.219928][ T6581] kernel BUG at mm/migrate.c:2634!
[   84.225273][ T1289]
[   84.226742][ T1289] 781916337 bytes / 1533702 usecs = 497873 KB/s
[   84.231379][ T6581] invalid opcode: 0000 [#1] SMP NOPTI
[   84.236360][ T1289]
[   84.238534][ T6581] CPU: 15 PID: 6581 Comm: usemem Tainted: G S                 6.9.0-rc4-00136-g7262f208ca68 #1
[   84.238538][ T6581] Hardware name: Intel Corporation S2600WFD/S2600WFD, BIOS SE5C620.86B.0D.01.0286.011120190816 01/11/2019
[   84.246187][ T1289] 781916337 bytes / 1533701 usecs = 497874 KB/s
[ 84.249854][ T6581] RIP: 0010:migrate_misplaced_folio (mm/migrate.c:2634 (discriminator 1)) 
[   84.252050][ T1289]
[ 84.262214][ T6581] Code: a0 b4 1b 83 e8 a8 23 f6 ff 48 89 df e8 a0 3f f5 ff 45 31 e4 8b 44 24 1c 85 c0 75 10 48 8b 44 24 20 48 39 e8 0f 84 27 fe ff ff <0f> 0b 41 89 c5 65 4c 01 2d ba 1d bf 7e 48 8b 3b 48 c1 ef 36 e8 2e
All code
========
   0:	a0 b4 1b 83 e8 a8 23 	movabs 0xfff623a8e8831bb4,%al
   7:	f6 ff 
   9:	48 89 df             	mov    %rbx,%rdi
   c:	e8 a0 3f f5 ff       	callq  0xfffffffffff53fb1
  11:	45 31 e4             	xor    %r12d,%r12d
  14:	8b 44 24 1c          	mov    0x1c(%rsp),%eax
  18:	85 c0                	test   %eax,%eax
  1a:	75 10                	jne    0x2c
  1c:	48 8b 44 24 20       	mov    0x20(%rsp),%rax
  21:	48 39 e8             	cmp    %rbp,%rax
  24:	0f 84 27 fe ff ff    	je     0xfffffffffffffe51
  2a:*	0f 0b                	ud2    		<-- trapping instruction
  2c:	41 89 c5             	mov    %eax,%r13d
  2f:	65 4c 01 2d ba 1d bf 	add    %r13,%gs:0x7ebf1dba(%rip)        # 0x7ebf1df1
  36:	7e 
  37:	48 8b 3b             	mov    (%rbx),%rdi
  3a:	48 c1 ef 36          	shr    $0x36,%rdi
  3e:	e8                   	.byte 0xe8
  3f:	2e                   	cs

Code starting with the faulting instruction
===========================================
   0:	0f 0b                	ud2    
   2:	41 89 c5             	mov    %eax,%r13d
   5:	65 4c 01 2d ba 1d bf 	add    %r13,%gs:0x7ebf1dba(%rip)        # 0x7ebf1dc7
   c:	7e 
   d:	48 8b 3b             	mov    (%rbx),%rdi
  10:	48 c1 ef 36          	shr    $0x36,%rdi
  14:	e8                   	.byte 0xe8
  15:	2e                   	cs
[   84.262217][ T6581] RSP: 0000:ffffc9002080fd08 EFLAGS: 00010206
[   84.262221][ T6581] RAX: ffffea01487467c8 RBX: ffffea0148740000 RCX: 0000000000000000
[   84.262223][ T6581] RDX: 000000000000027f RSI: 00000000000001ff RDI: 0000000000000001
[   84.262225][ T6581] RBP: ffffc9002080fd28 R08: 0000000000000000 R09: 0000000000000001
[   84.262226][ T6581] R10: 000000000000080c R11: 0000000000000000 R12: 0000000000000001
[   84.274946][ T1289] 781916337 bytes / 1533699 usecs = 497874 KB/s
[   84.279439][ T6581] R13: 00000000000001ff R14: 0000000000000200 R15: ffff88907ffd5000
[   84.279441][ T6581] FS:  00007f1b11213740(0000) GS:ffff88903f9c0000(0000) knlGS:0000000000000000
[   84.285537][ T1289]
[   84.287725][ T6581] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   84.287728][ T6581] CR2: 00007f1b0fe00000 CR3: 0000005f54c9e003 CR4: 00000000007706f0
[   84.287730][ T6581] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   84.287731][ T6581] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   84.308684][ T1289] 781916337 bytes / 1533744 usecs = 497860 KB/s
[   84.313094][ T6581] PKRU: 55555554
[   84.313097][ T6581] Call Trace:
[   84.313100][ T6581]  <TASK>
[   84.320929][ T1289]
[ 84.328756][ T6581] ? die (arch/x86/kernel/dumpstack.c:421 arch/x86/kernel/dumpstack.c:434 arch/x86/kernel/dumpstack.c:447) 
[   84.337626][ T1289] 781916337 bytes / 1533702 usecs = 497873 KB/s
[ 84.344414][ T6581] ? do_trap (arch/x86/kernel/traps.c:114 arch/x86/kernel/traps.c:155) 
[   84.350509][ T1289]
[ 84.358331][ T6581] ? migrate_misplaced_folio (mm/migrate.c:2634 (discriminator 1)) 
[   84.368158][ T1289] 781916337 bytes / 1533806 usecs = 497840 KB/s
[ 84.369304][ T6581] ? do_error_trap (arch/x86/include/asm/traps.h:58 arch/x86/kernel/traps.c:176) 
[ 84.369306][ T6581] ? migrate_misplaced_folio (mm/migrate.c:2634 (discriminator 1)) 
[   84.375745][ T1289]
[ 84.383566][ T6581] ? exc_invalid_op (arch/x86/kernel/traps.c:267) 
[   84.392383][ T1289] 781916337 bytes / 1537066 usecs = 496784 KB/s
[ 84.399219][ T6581] ? migrate_misplaced_folio (mm/migrate.c:2634 (discriminator 1)) 
[ 84.399222][ T6581] ? asm_exc_invalid_op (arch/x86/include/asm/idtentry.h:621) 
[   84.405314][ T1289]
[ 84.408718][ T6581] ? migrate_misplaced_folio (mm/migrate.c:2634 (discriminator 1)) 
[ 84.408721][ T6581] ? migrate_misplaced_folio (mm/migrate.c:2630 (discriminator 2)) 
[   84.412861][ T1289] 781916337 bytes / 1533702 usecs = 497873 KB/s
[ 84.414661][ T6581] do_huge_pmd_numa_page (mm/huge_memory.c:1759) 
[   84.416855][ T1289]
[ 84.420436][ T6581] __handle_mm_fault (mm/memory.c:5429) 
[   84.427485][ T1289] 781916337 bytes / 1545381 usecs = 494111 KB/s
[ 84.430542][ T6581] handle_mm_fault (mm/memory.c:5608) 
[   84.432735][ T1289]
[ 84.438220][ T6581] do_user_addr_fault (arch/x86/mm/fault.c:1364) 
[   84.445291][ T1289] 781916337 bytes / 1545380 usecs = 494111 KB/s
[ 84.448765][ T6581] exc_page_fault (arch/x86/include/asm/irqflags.h:37 arch/x86/include/asm/irqflags.h:72 arch/x86/mm/fault.c:1514 arch/x86/mm/fault.c:1564) 
[   84.454252][ T1289]
[ 84.456445][ T6581] asm_exc_page_fault (arch/x86/include/asm/idtentry.h:623) 
[   84.456448][ T6581] RIP: 0033:0x561f6c00dad4
[   84.461953][ T1289] 781916337 bytes / 1545382 usecs = 494110 KB/s
[ 84.467077][ T6581] Code: 01 00 00 00 e8 0d f9 ff ff 89 c7 e8 6c ff ff ff bf 00 00 00 00 e8 fc f8 ff ff 85 d2 74 08 48 8d 04 f7 48 8b 00 c3 48 8d 04 f7 <48> 89 30 b8 00 00 00 00 c3 41 54 55 53 48 85 ff 0f 84 23 01 00 00
All code
========
   0:	01 00                	add    %eax,(%rax)
   2:	00 00                	add    %al,(%rax)
   4:	e8 0d f9 ff ff       	callq  0xfffffffffffff916
   9:	89 c7                	mov    %eax,%edi
   b:	e8 6c ff ff ff       	callq  0xffffffffffffff7c
  10:	bf 00 00 00 00       	mov    $0x0,%edi
  15:	e8 fc f8 ff ff       	callq  0xfffffffffffff916
  1a:	85 d2                	test   %edx,%edx
  1c:	74 08                	je     0x26
  1e:	48 8d 04 f7          	lea    (%rdi,%rsi,8),%rax
  22:	48 8b 00             	mov    (%rax),%rax
  25:	c3                   	retq   
  26:	48 8d 04 f7          	lea    (%rdi,%rsi,8),%rax
  2a:*	48 89 30             	mov    %rsi,(%rax)		<-- trapping instruction
  2d:	b8 00 00 00 00       	mov    $0x0,%eax
  32:	c3                   	retq   
  33:	41 54                	push   %r12
  35:	55                   	push   %rbp
  36:	53                   	push   %rbx
  37:	48 85 ff             	test   %rdi,%rdi
  3a:	0f 84 23 01 00 00    	je     0x163

Code starting with the faulting instruction
===========================================
   0:	48 89 30             	mov    %rsi,(%rax)
   3:	b8 00 00 00 00       	mov    $0x0,%eax
   8:	c3                   	retq   
   9:	41 54                	push   %r12
   b:	55                   	push   %rbp
   c:	53                   	push   %rbx
   d:	48 85 ff             	test   %rdi,%rdi
  10:	0f 84 23 01 00 00    	je     0x139


The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240617/202406171436.a30c129-oliver.sang@intel.com



-- 
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki


             reply	other threads:[~2024-06-17  7:03 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-06-17  7:03 kernel test robot [this message]
2024-06-17 23:28 ` [linus:master] [mm/migrate] 7262f208ca: kernel_BUG_at_mm/migrate.c Hugh Dickins

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=202406171436.a30c129-oliver.sang@intel.com \
    --to=oliver.sang@intel.com \
    --cc=akpm@linux-foundation.org \
    --cc=baolin.wang@linux.alibaba.com \
    --cc=david@redhat.com \
    --cc=fengwei.yin@intel.com \
    --cc=kirill.shutemov@linux.intel.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=lkp@intel.com \
    --cc=oe-lkp@lists.linux.dev \
    --cc=ryan.roberts@arm.com \
    --cc=shy828301@gmail.com \
    --cc=sj@kernel.org \
    --cc=willy@infradead.org \
    --cc=ying.huang@intel.com \
    --cc=ziy@nvidia.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.