From: "Yajun Deng" <yajun.deng@linux.dev>
To: "Mike Rapoport" <rppt@linux.ibm.com>
Cc: "Sachin Sant" <sachinp@linux.ibm.com>,
"linuxppc-dev" <linuxppc-dev@lists.ozlabs.org>,
"open list" <linux-kernel@vger.kernel.org>,
linux-mm@kvack.org
Subject: Re: [6.1.0-rc3-next-20221104] Boot failure - kernel BUG at mm/memblock.c:519
Date: Wed, 09 Nov 2022 10:03:03 +0000 [thread overview]
Message-ID: <e00989c4a69943cb4f60fc7ffaa06f8c@linux.dev> (raw)
In-Reply-To: <Y2oLYB7Tu7J91tVm@linux.ibm.com>
[-- Attachment #1: Type: text/plain, Size: 4409 bytes --]
Hey Mike,
Can you help me test the attached file?
Please use this new patch instead of the one in memblock tree.
November 8, 2022 3:55 PM, "Mike Rapoport" <rppt@linux.ibm.com> wrote:
> Hi Yajun,
>
> On Tue, Nov 08, 2022 at 02:27:53AM +0000, Yajun Deng wrote:
>
>> Hi Sachin,
>> I didn't have a powerpc architecture machine. I don't know why this happened.
>>
>> Hi Mike,
>> Do you have any suggestions?
>
> You can try reproducing the bug qemu or work with Sachin to debug the
> issue.
>
>> I tested in tools/testing/memblock, and it was successful.
>
> Memblock tests provide limited coverage still and they don't deal with all
> possible cases.
>
> For now I'm dropping this patch from the memblock tree until the issue is
> fixed.
>
>> November 6, 2022 8:07 PM, "Sachin Sant" <sachinp@linux.ibm.com> wrote:
>>
>> While booting recent linux-next on a IBM Power10 Server LPAR
>> following crash is observed:
>>
>> [ 0.000000] numa: Partition configured for 32 NUMA nodes.
>> [ 0.000000] ------------[ cut here ]------------
>> [ 0.000000] kernel BUG at mm/memblock.c:519!
>> [ 0.000000] Oops: Exception in kernel mode, sig: 5 [#1]
>> [ 0.000000] LE PAGE_SIZE=64K MMU=Radix SMP NR_CPUS=2048 NUMA pSeries
>> [ 0.000000] Modules linked in:
>> [ 0.000000] CPU: 0 PID: 0 Comm: swapper Not tainted 6.1.0-rc3-next-20221104 #1
>> [ 0.000000] Hardware name: IBM,9080-HEX POWER10 (raw) 0x800200 0xf000006 of:IBM,FW1030.00
>> (NH1030_026) hv:phyp pSeries
>> [ 0.000000] NIP: c0000000004ba240 LR: c0000000004bb240 CTR: c0000000004ba210
>> [ 0.000000] REGS: c000000002a8b7b0 TRAP: 0700 Not tainted (6.1.0-rc3-next-20221104)
>> [ 0.000000] MSR: 8000000000021033 <SF,ME,IR,DR,RI,LE> CR: 24042424 XER: 00000001
>> [ 0.000000] CFAR: c0000000004ba290 IRQMASK: 1
>> [ 0.000000] GPR00: c0000000004bb240 c000000002a8ba50 c00000000136ee00 c0000010f3ac00a8
>> [ 0.000000] GPR04: 0000000000000000 c0000010f3ac0090 00000010f3ac0000 0000000000000d00
>> [ 0.000000] GPR08: 0000000000000001 0000000000000007 0000000000000001 0000000000000081
>> [ 0.000000] GPR12: c0000000004ba210 c000000002e10000 0000000000000000 000000000000000d
>> [ 0.000000] GPR16: 000000000f6be620 000000000f6be8e8 000000000f6be788 000000000f6bed58
>> [ 0.000000] GPR20: 000000000f6f6d58 c0000000029a8de8 00000010f3ad8800 0000000000000080
>> [ 0.000000] GPR24: 00000010f3ad7b00 0000000000000000 0000000000000100 0000000000000d00
>> [ 0.000000] GPR28: 00000010f3ad7b00 c0000000029a8de8 c0000000029a8e00 0000000000000006
>> [ 0.000000] NIP [c0000000004ba240] memblock_merge_regions.isra.12+0x40/0x130
>> [ 0.000000] LR [c0000000004bb240] memblock_add_range+0x190/0x300
>> [ 0.000000] Call Trace:
>> [ 0.000000] [c000000002a8ba50] [0000000000000100] 0x100 (unreliable)
>> [ 0.000000] [c000000002a8ba90] [c0000000004bb240] memblock_add_range+0x190/0x300
>> [ 0.000000] [c000000002a8bb10] [c0000000004bb5e0] memblock_reserve+0x70/0xd0
>> [ 0.000000] [c000000002a8bba0] [c000000002045234] memblock_alloc_range_nid+0x11c/0x1e8
>> [ 0.000000] [c000000002a8bc60] [c0000000020453a4] memblock_alloc_internal+0xa4/0x110
>> [ 0.000000] [c000000002a8bcb0] [c0000000020456cc] memblock_alloc_try_nid+0x94/0xcc
>> [ 0.000000] [c000000002a8bd40] [c00000000200b570] alloc_paca_data+0x7c/0xcc
>> [ 0.000000] [c000000002a8bdb0] [c00000000200b770] allocate_paca+0x8c/0x28c
>> [ 0.000000] [c000000002a8be50] [c00000000200a26c] setup_arch+0x1c4/0x4d8
>> [ 0.000000] [c000000002a8bed0] [c000000002004378] start_kernel+0xb4/0xa84
>> [ 0.000000] [c000000002a8bf90] [c00000000000da90] start_here_common+0x1c/0x20
>> [ 0.000000] Instruction dump:
>> [ 0.000000] 7c0802a6 fba1ffe8 fbc1fff0 fbe1fff8 7c7d1b78 7c9e2378 3be00000 f8010010
>> [ 0.000000] f821ffc1 e9230000 3969ffff 4800000c <0b0a0000> 7d3f4b78 393f0001 7fbf5840
>> [ 0.000000] ---[ end trace 0000000000000000 ]---
>> [ 0.000000]
>> [ 0.000000] Kernel panic - not syncing: Fatal exception
>> [ 0.000000] Rebooting in 180 seconds..
>>
>> This problem was introduced with next-20221101. Git bisect points to
>> following patch
>>
>> commit 3f82c9c4ac377082e1230f5299e0ccce07b15e12
>> Date: Tue Oct 25 15:09:43 2022 +0800
>> memblock: don't run loop in memblock_add_range() twice
>>
>> Reverting this patch helps boot the kernel to login prompt.
>>
>> Have attached .config
>>
>> - Sachin
>
> --
> Sincerely yours,
> Mike.
[-- Attachment #2: 0001-memblock-don-t-run-loop-in-memblock_add_range-twice-.patch --]
[-- Type: application/octet-stream, Size: 4191 bytes --]
From 0a48980d7346c582eeb7198d822d56a4c1926632 Mon Sep 17 00:00:00 2001
From: Yajun Deng <yajun.deng@linux.dev>
Date: Wed, 9 Nov 2022 17:35:27 +0800
Subject: [PATCH v3] memblock: don't run loop in memblock_add_range() twice in
most case
There is no need round twice in memblock_add_range() in most case.
We can call memblock_double_array() to extend the size if type->cnt
greater or equal to type->max before memblock_insert_region(); otherwise,
we can insert the new region directly.
Special: memblock_double_array() would nested call memblock_add_range()
if slab_is_available() is NULL, so we should repeat again before insert
in that case.
v3:
- Fix nested call in memblock_double_array().
v2:
- Add a comment when the allocation is required.
Signed-off-by: Yajun Deng <yajun.deng@linux.dev>
---
mm/memblock.c | 76 +++++++++++++++++++++++----------------------------
1 file changed, 34 insertions(+), 42 deletions(-)
diff --git a/mm/memblock.c b/mm/memblock.c
index 511d4783dcf1..42d3e4157f4a 100644
--- a/mm/memblock.c
+++ b/mm/memblock.c
@@ -578,11 +578,11 @@ static int __init_memblock memblock_add_range(struct memblock_type *type,
phys_addr_t base, phys_addr_t size,
int nid, enum memblock_flags flags)
{
- bool insert = false;
phys_addr_t obase = base;
phys_addr_t end = base + memblock_cap_size(base, &size);
- int idx, nr_new;
+ unsigned long ocnt = type->cnt;
struct memblock_region *rgn;
+ int idx;
if (!size)
return 0;
@@ -598,24 +598,8 @@ static int __init_memblock memblock_add_range(struct memblock_type *type,
return 0;
}
- /*
- * The worst case is when new range overlaps all existing regions,
- * then we'll need type->cnt + 1 empty regions in @type. So if
- * type->cnt * 2 + 1 is less than type->max, we know
- * that there is enough empty regions in @type, and we can insert
- * regions directly.
- */
- if (type->cnt * 2 + 1 < type->max)
- insert = true;
-
repeat:
- /*
- * The following is executed twice. Once with %false @insert and
- * then with %true. The first counts the number of regions needed
- * to accommodate the new area. The second actually inserts them.
- */
base = obase;
- nr_new = 0;
for_each_memblock_type(idx, type, rgn) {
phys_addr_t rbase = rgn->base;
@@ -634,11 +618,25 @@ static int __init_memblock memblock_add_range(struct memblock_type *type,
WARN_ON(nid != memblock_get_region_node(rgn));
#endif
WARN_ON(flags != rgn->flags);
- nr_new++;
- if (insert)
- memblock_insert_region(type, idx++, base,
- rbase - base, nid,
- flags);
+
+ /*
+ * if type->cnt greater or equal to type->max,
+ * resize array; otherwise, insert directly.
+ *
+ * special: there may be nested call this function
+ * in memblock_double_array(), so it should repeat in
+ * that case.
+ */
+ if (type->cnt >= type->max) {
+ if (memblock_double_array(type, obase, size))
+ return -ENOMEM;
+ else if (!slab_is_available())
+ goto repeat;
+ }
+
+ memblock_insert_region(type, idx++, base,
+ rbase - base, nid,
+ flags);
}
/* area below @rend is dealt with, forget about it */
base = min(rend, end);
@@ -646,29 +644,23 @@ static int __init_memblock memblock_add_range(struct memblock_type *type,
/* insert the remaining portion */
if (base < end) {
- nr_new++;
- if (insert)
- memblock_insert_region(type, idx, base, end - base,
- nid, flags);
+ if (type->cnt >= type->max) {
+ if (memblock_double_array(type, obase, size))
+ return -ENOMEM;
+ else if (!slab_is_available())
+ goto repeat;
+ }
+
+ memblock_insert_region(type, idx, base, end - base,
+ nid, flags);
}
- if (!nr_new)
+ if (ocnt == type->cnt)
return 0;
- /*
- * If this was the first round, resize array and repeat for actual
- * insertions; otherwise, merge and return.
- */
- if (!insert) {
- while (type->cnt + nr_new > type->max)
- if (memblock_double_array(type, obase, size) < 0)
- return -ENOMEM;
- insert = true;
- goto repeat;
- } else {
- memblock_merge_regions(type);
- return 0;
- }
+ memblock_merge_regions(type);
+
+ return 0;
}
/**
--
2.25.1
next prev parent reply other threads:[~2022-11-09 10:04 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-11-06 12:07 [6.1.0-rc3-next-20221104] Boot failure - kernel BUG at mm/memblock.c:519 Sachin Sant
2022-11-08 2:27 ` Yajun Deng
2022-11-08 7:55 ` Mike Rapoport
2022-11-08 9:59 ` Yajun Deng
2022-11-09 10:03 ` Yajun Deng [this message]
2022-11-09 10:25 ` Yajun Deng
2022-11-09 10:55 ` Sachin Sant
2022-11-09 11:32 ` Yajun Deng
2022-11-09 11:42 ` Mike Rapoport
2022-11-09 11:48 ` Yajun Deng
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=e00989c4a69943cb4f60fc7ffaa06f8c@linux.dev \
--to=yajun.deng@linux.dev \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=linuxppc-dev@lists.ozlabs.org \
--cc=rppt@linux.ibm.com \
--cc=sachinp@linux.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).