From: Alex Chiang <achiang@hp.com>
To: Christoph Lameter <cl@linux-foundation.org>
Cc: penberg@cs.helsinki.fi, linux-ia64@vger.kernel.org, linux-mm@kvack.org
Subject: Re: SLUB ia64 linux-next crash bisected to 756dee75
Date: Thu, 14 Jan 2010 00:53:04 +0000 [thread overview]
Message-ID: <20100114005304.GC27766@ldl.fc.hp.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1001130851310.24496@router.home>
* Christoph Lameter <cl@linux-foundation.org>:
> On Tue, 12 Jan 2010, Alex Chiang wrote:
>
> > My HP rx8640 (ia64, 16 CPUs) is experiencing a bad paging request
> > during boot.
>
> Hmmm... Thats with 64k page size?
Yes.
CONFIG_IA64_PAGE_SIZE_64KB=y
> > SLUB: Unable to allocate memory from node 2
> > SLUB: Allocating a useless per node structure in order to be able to continue
>
> Huh? What wrong with node 2?
I've seen that message for quite some time now.
Here's some info from the EFI shell.
Shell> dimmconfig
MEMORY INFORMATION
Cab/ Total Active Failed SW Deconf HW Deconf
Cell Slot Mem Mem DIMMs DIMMs DIMMs Unknown
---- ----- --------- --------- ------ --------- --------- -------
0 0/0 32768 MB 32768 MB 0 0 0 0
1 0/1 32768 MB 32768 MB 0 0 0 0
Active Memory : 65536 MB
Interleaved Memory : 512 MB
NonInterleaved Memory : 65024 MB
Installed Memory : 65536 MB
Firmware puts each cell into a NUMA node, so we should really
only have 2 nodes, but for some reason, that 3rd node gets
created too. I haven't inspected the SRAT/SLIT on this machine
recently, but can do so if you want me to.
> > [<a0000001001a7c60>] kmem_cache_open+0x420/0xca0
> > spà0007860955fdf0 bspà000786095512e0
> > [<a0000001001a8cf0>] dma_kmalloc_cache+0x2d0/0x440
> > spà0007860955fdf0 bspà00078609551290
>
> Maybe we miscalculated the number of DMA caches needed.
>
> Does this patch fix it?
Nope, same oops.
Hm... from the boot log
ACPI: SLIT table looks invalid. Not used.
Number of logical nodes in system = 3
Number of memory chunks in system = 5
...
Virtual mem_map starts at 0xa07ffffe5a400000
Zone PFN ranges:
DMA 0x00000001 -> 0x00010000
Normal 0x00010000 -> 0x0787fc00
Movable zone start PFN for each node
early_node_map[5] active PFN ranges
2: 0x00000001 -> 0x00001ffe
0: 0x07002000 -> 0x07005db7
0: 0x07005db8 -> 0x0707fb00
1: 0x07800000 -> 0x0787fbd9
1: 0x0787fbe8 -> 0x0787fbfd
On node 0 totalpages: 514815
free_area_init_node: node 0, pgdat e000070020080000, node_mem_map a07fffffe2470000
Normal zone: 440 pages used for memmap
Normal zone: 514375 pages, LIFO batch:1
On node 1 totalpages: 523246
free_area_init_node: node 1, pgdat e000078000090080, node_mem_map a07ffffffe400000
Normal zone: 448 pages used for memmap
Normal zone: 522798 pages, LIFO batch:1
On node 2 totalpages: 8189
free_area_init_node: node 2, pgdat e000000000120100, node_mem_map a07ffffe5a400000
DMA zone: 7 pages used for memmap
DMA zone: 0 pages reserved
DMA zone: 8182 pages, LIFO batch:0
So the kernel doesn't like the SLIT; does it go off and create
its own NUMA nodes then?
/ac
WARNING: multiple messages have this Message-ID (diff)
From: Alex Chiang <achiang@hp.com>
To: Christoph Lameter <cl@linux-foundation.org>
Cc: penberg@cs.helsinki.fi, linux-ia64@vger.kernel.org, linux-mm@kvack.org
Subject: Re: SLUB ia64 linux-next crash bisected to 756dee75
Date: Wed, 13 Jan 2010 17:53:04 -0700 [thread overview]
Message-ID: <20100114005304.GC27766@ldl.fc.hp.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1001130851310.24496@router.home>
* Christoph Lameter <cl@linux-foundation.org>:
> On Tue, 12 Jan 2010, Alex Chiang wrote:
>
> > My HP rx8640 (ia64, 16 CPUs) is experiencing a bad paging request
> > during boot.
>
> Hmmm... Thats with 64k page size?
Yes.
CONFIG_IA64_PAGE_SIZE_64KB=y
> > SLUB: Unable to allocate memory from node 2
> > SLUB: Allocating a useless per node structure in order to be able to continue
>
> Huh? What wrong with node 2?
I've seen that message for quite some time now.
Here's some info from the EFI shell.
Shell> dimmconfig
MEMORY INFORMATION
Cab/ Total Active Failed SW Deconf HW Deconf
Cell Slot Mem Mem DIMMs DIMMs DIMMs Unknown
---- ----- --------- --------- ------ --------- --------- -------
0 0/0 32768 MB 32768 MB 0 0 0 0
1 0/1 32768 MB 32768 MB 0 0 0 0
Active Memory : 65536 MB
Interleaved Memory : 512 MB
NonInterleaved Memory : 65024 MB
Installed Memory : 65536 MB
Firmware puts each cell into a NUMA node, so we should really
only have 2 nodes, but for some reason, that 3rd node gets
created too. I haven't inspected the SRAT/SLIT on this machine
recently, but can do so if you want me to.
> > [<a0000001001a7c60>] kmem_cache_open+0x420/0xca0
> > sp=e00007860955fdf0 bsp=e0000786095512e0
> > [<a0000001001a8cf0>] dma_kmalloc_cache+0x2d0/0x440
> > sp=e00007860955fdf0 bsp=e000078609551290
>
> Maybe we miscalculated the number of DMA caches needed.
>
> Does this patch fix it?
Nope, same oops.
Hm... from the boot log
ACPI: SLIT table looks invalid. Not used.
Number of logical nodes in system = 3
Number of memory chunks in system = 5
...
Virtual mem_map starts at 0xa07ffffe5a400000
Zone PFN ranges:
DMA 0x00000001 -> 0x00010000
Normal 0x00010000 -> 0x0787fc00
Movable zone start PFN for each node
early_node_map[5] active PFN ranges
2: 0x00000001 -> 0x00001ffe
0: 0x07002000 -> 0x07005db7
0: 0x07005db8 -> 0x0707fb00
1: 0x07800000 -> 0x0787fbd9
1: 0x0787fbe8 -> 0x0787fbfd
On node 0 totalpages: 514815
free_area_init_node: node 0, pgdat e000070020080000, node_mem_map a07fffffe2470000
Normal zone: 440 pages used for memmap
Normal zone: 514375 pages, LIFO batch:1
On node 1 totalpages: 523246
free_area_init_node: node 1, pgdat e000078000090080, node_mem_map a07ffffffe400000
Normal zone: 448 pages used for memmap
Normal zone: 522798 pages, LIFO batch:1
On node 2 totalpages: 8189
free_area_init_node: node 2, pgdat e000000000120100, node_mem_map a07ffffe5a400000
DMA zone: 7 pages used for memmap
DMA zone: 0 pages reserved
DMA zone: 8182 pages, LIFO batch:0
So the kernel doesn't like the SLIT; does it go off and create
its own NUMA nodes then?
/ac
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2010-01-14 0:53 UTC|newest]
Thread overview: 59+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-01-13 0:29 SLUB ia64 linux-next crash bisected to 756dee75 Alex Chiang
2010-01-13 14:59 ` Christoph Lameter
2010-01-13 14:59 ` Christoph Lameter
2010-01-14 0:53 ` Alex Chiang [this message]
2010-01-14 0:53 ` Alex Chiang
2010-01-14 15:01 ` Christoph Lameter
2010-01-14 15:01 ` Christoph Lameter
2010-01-14 18:01 ` Alex Chiang
2010-01-14 18:01 ` Alex Chiang
2010-01-14 15:18 ` Christoph Lameter
2010-01-14 15:18 ` Christoph Lameter
2010-01-14 18:22 ` Alex Chiang
2010-01-14 18:22 ` Alex Chiang
2010-01-14 19:17 ` Pekka Enberg
2010-01-14 19:17 ` Pekka Enberg
2010-01-14 20:32 ` Alex Chiang
2010-01-14 20:32 ` Alex Chiang
2010-01-14 20:58 ` Christoph Lameter
2010-01-14 20:58 ` Christoph Lameter
2010-01-14 21:29 ` Alex Chiang
2010-01-14 21:29 ` Alex Chiang
2010-01-14 21:31 ` Pekka Enberg
2010-01-14 21:31 ` Pekka Enberg
2010-01-14 21:59 ` Christoph Lameter
2010-01-14 21:59 ` Christoph Lameter
2010-01-14 22:01 ` Christoph Lameter
2010-01-14 22:01 ` Christoph Lameter
2010-01-15 20:07 ` Christoph Lameter
2010-01-15 20:07 ` Christoph Lameter
2010-01-15 20:35 ` Lee Schermerhorn
2010-01-15 20:35 ` Lee Schermerhorn
2010-01-15 23:32 ` Christoph Lameter
2010-01-15 23:32 ` Christoph Lameter
2010-01-19 18:53 ` Christoph Lameter
2010-01-19 18:53 ` Christoph Lameter
2010-01-19 20:02 ` Alex Chiang
2010-01-19 20:02 ` Alex Chiang
2010-01-19 20:29 ` Christoph Lameter
2010-01-19 20:29 ` Christoph Lameter
2010-01-19 21:29 ` Alex Chiang
2010-01-19 21:29 ` Alex Chiang
2010-01-19 21:50 ` Christoph Lameter
2010-01-19 21:50 ` Christoph Lameter
2010-01-20 22:46 ` Alex Chiang
2010-01-20 22:46 ` Alex Chiang
2010-01-21 21:47 ` Alex Chiang
2010-01-21 21:47 ` Alex Chiang
2010-01-21 22:45 ` Christoph Lameter
2010-01-21 22:45 ` Christoph Lameter
2010-01-21 23:05 ` Alex Chiang
2010-01-21 23:05 ` Alex Chiang
2010-01-21 23:43 ` Christoph Lameter
2010-01-21 23:43 ` Christoph Lameter
2010-01-22 0:15 ` Alex Chiang
2010-01-22 0:15 ` Alex Chiang
2010-01-22 14:43 ` Christoph Lameter
2010-01-22 14:43 ` Christoph Lameter
2010-01-22 16:37 ` Pekka Enberg
2010-01-22 16:37 ` Pekka Enberg
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20100114005304.GC27766@ldl.fc.hp.com \
--to=achiang@hp.com \
--cc=cl@linux-foundation.org \
--cc=linux-ia64@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=penberg@cs.helsinki.fi \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.