All of lore.kernel.org
 help / color / mirror / Atom feed
From: Alex Chiang <achiang@hp.com>
To: Christoph Lameter <cl@linux-foundation.org>
Cc: penberg@cs.helsinki.fi, linux-ia64@vger.kernel.org, linux-mm@kvack.org
Subject: Re: SLUB ia64 linux-next crash bisected to 756dee75
Date: Thu, 14 Jan 2010 00:53:04 +0000	[thread overview]
Message-ID: <20100114005304.GC27766@ldl.fc.hp.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1001130851310.24496@router.home>

* Christoph Lameter <cl@linux-foundation.org>:
> On Tue, 12 Jan 2010, Alex Chiang wrote:
> 
> > My HP rx8640 (ia64, 16 CPUs) is experiencing a bad paging request
> > during boot.
> 
> Hmmm... Thats with 64k page size?

Yes.

CONFIG_IA64_PAGE_SIZE_64KB=y

> > SLUB: Unable to allocate memory from node 2
> > SLUB: Allocating a useless per node structure in order to be able to continue
> 
> Huh? What wrong with node 2?

I've seen that message for quite some time now.

Here's some info from the EFI shell.

Shell> dimmconfig

MEMORY INFORMATION

         Cab/     Total    Active   Failed  SW Deconf HW Deconf
   Cell  Slot      Mem      Mem     DIMMs     DIMMs     DIMMs   Unknown
   ----  -----  --------- --------- ------  --------- --------- -------
     0    0/0    32768 MB  32768 MB    0         0         0        0
     1    0/1    32768 MB  32768 MB    0         0         0        0

   Active Memory           : 65536 MB
     Interleaved Memory    :   512 MB
     NonInterleaved Memory : 65024 MB
   Installed Memory        : 65536 MB

Firmware puts each cell into a NUMA node, so we should really
only have 2 nodes, but for some reason, that 3rd node gets
created too. I haven't inspected the SRAT/SLIT on this machine
recently, but can do so if you want me to.

> >  [<a0000001001a7c60>] kmem_cache_open+0x420/0xca0
> >                                 spà0007860955fdf0 bspà000786095512e0
> >  [<a0000001001a8cf0>] dma_kmalloc_cache+0x2d0/0x440
> >                                 spà0007860955fdf0 bspà00078609551290
> 
> Maybe we miscalculated the number of DMA caches needed.
> 
> Does this patch fix it?

Nope, same oops.

Hm... from the boot log

ACPI: SLIT table looks invalid. Not used.
Number of logical nodes in system = 3
Number of memory chunks in system = 5
...
Virtual mem_map starts at 0xa07ffffe5a400000
Zone PFN ranges:
  DMA      0x00000001 -> 0x00010000
  Normal   0x00010000 -> 0x0787fc00
Movable zone start PFN for each node
early_node_map[5] active PFN ranges
    2: 0x00000001 -> 0x00001ffe
    0: 0x07002000 -> 0x07005db7
    0: 0x07005db8 -> 0x0707fb00
    1: 0x07800000 -> 0x0787fbd9
    1: 0x0787fbe8 -> 0x0787fbfd
On node 0 totalpages: 514815
free_area_init_node: node 0, pgdat e000070020080000, node_mem_map a07fffffe2470000
  Normal zone: 440 pages used for memmap
  Normal zone: 514375 pages, LIFO batch:1
On node 1 totalpages: 523246
free_area_init_node: node 1, pgdat e000078000090080, node_mem_map a07ffffffe400000
  Normal zone: 448 pages used for memmap
  Normal zone: 522798 pages, LIFO batch:1
On node 2 totalpages: 8189
free_area_init_node: node 2, pgdat e000000000120100, node_mem_map a07ffffe5a400000
  DMA zone: 7 pages used for memmap
  DMA zone: 0 pages reserved
  DMA zone: 8182 pages, LIFO batch:0

So the kernel doesn't like the SLIT; does it go off and create
its own NUMA nodes then?

/ac


WARNING: multiple messages have this Message-ID (diff)
From: Alex Chiang <achiang@hp.com>
To: Christoph Lameter <cl@linux-foundation.org>
Cc: penberg@cs.helsinki.fi, linux-ia64@vger.kernel.org, linux-mm@kvack.org
Subject: Re: SLUB ia64 linux-next crash bisected to 756dee75
Date: Wed, 13 Jan 2010 17:53:04 -0700	[thread overview]
Message-ID: <20100114005304.GC27766@ldl.fc.hp.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1001130851310.24496@router.home>

* Christoph Lameter <cl@linux-foundation.org>:
> On Tue, 12 Jan 2010, Alex Chiang wrote:
> 
> > My HP rx8640 (ia64, 16 CPUs) is experiencing a bad paging request
> > during boot.
> 
> Hmmm... Thats with 64k page size?

Yes.

CONFIG_IA64_PAGE_SIZE_64KB=y

> > SLUB: Unable to allocate memory from node 2
> > SLUB: Allocating a useless per node structure in order to be able to continue
> 
> Huh? What wrong with node 2?

I've seen that message for quite some time now.

Here's some info from the EFI shell.

Shell> dimmconfig

MEMORY INFORMATION

         Cab/     Total    Active   Failed  SW Deconf HW Deconf
   Cell  Slot      Mem      Mem     DIMMs     DIMMs     DIMMs   Unknown
   ----  -----  --------- --------- ------  --------- --------- -------
     0    0/0    32768 MB  32768 MB    0         0         0        0
     1    0/1    32768 MB  32768 MB    0         0         0        0

   Active Memory           : 65536 MB
     Interleaved Memory    :   512 MB
     NonInterleaved Memory : 65024 MB
   Installed Memory        : 65536 MB

Firmware puts each cell into a NUMA node, so we should really
only have 2 nodes, but for some reason, that 3rd node gets
created too. I haven't inspected the SRAT/SLIT on this machine
recently, but can do so if you want me to.

> >  [<a0000001001a7c60>] kmem_cache_open+0x420/0xca0
> >                                 sp=e00007860955fdf0 bsp=e0000786095512e0
> >  [<a0000001001a8cf0>] dma_kmalloc_cache+0x2d0/0x440
> >                                 sp=e00007860955fdf0 bsp=e000078609551290
> 
> Maybe we miscalculated the number of DMA caches needed.
> 
> Does this patch fix it?

Nope, same oops.

Hm... from the boot log

ACPI: SLIT table looks invalid. Not used.
Number of logical nodes in system = 3
Number of memory chunks in system = 5
...
Virtual mem_map starts at 0xa07ffffe5a400000
Zone PFN ranges:
  DMA      0x00000001 -> 0x00010000
  Normal   0x00010000 -> 0x0787fc00
Movable zone start PFN for each node
early_node_map[5] active PFN ranges
    2: 0x00000001 -> 0x00001ffe
    0: 0x07002000 -> 0x07005db7
    0: 0x07005db8 -> 0x0707fb00
    1: 0x07800000 -> 0x0787fbd9
    1: 0x0787fbe8 -> 0x0787fbfd
On node 0 totalpages: 514815
free_area_init_node: node 0, pgdat e000070020080000, node_mem_map a07fffffe2470000
  Normal zone: 440 pages used for memmap
  Normal zone: 514375 pages, LIFO batch:1
On node 1 totalpages: 523246
free_area_init_node: node 1, pgdat e000078000090080, node_mem_map a07ffffffe400000
  Normal zone: 448 pages used for memmap
  Normal zone: 522798 pages, LIFO batch:1
On node 2 totalpages: 8189
free_area_init_node: node 2, pgdat e000000000120100, node_mem_map a07ffffe5a400000
  DMA zone: 7 pages used for memmap
  DMA zone: 0 pages reserved
  DMA zone: 8182 pages, LIFO batch:0

So the kernel doesn't like the SLIT; does it go off and create
its own NUMA nodes then?

/ac

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2010-01-14  0:53 UTC|newest]

Thread overview: 59+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-01-13  0:29 SLUB ia64 linux-next crash bisected to 756dee75 Alex Chiang
2010-01-13 14:59 ` Christoph Lameter
2010-01-13 14:59   ` Christoph Lameter
2010-01-14  0:53   ` Alex Chiang [this message]
2010-01-14  0:53     ` Alex Chiang
2010-01-14 15:01     ` Christoph Lameter
2010-01-14 15:01       ` Christoph Lameter
2010-01-14 18:01       ` Alex Chiang
2010-01-14 18:01         ` Alex Chiang
2010-01-14 15:18 ` Christoph Lameter
2010-01-14 15:18   ` Christoph Lameter
2010-01-14 18:22   ` Alex Chiang
2010-01-14 18:22     ` Alex Chiang
2010-01-14 19:17     ` Pekka Enberg
2010-01-14 19:17       ` Pekka Enberg
2010-01-14 20:32       ` Alex Chiang
2010-01-14 20:32         ` Alex Chiang
2010-01-14 20:58         ` Christoph Lameter
2010-01-14 20:58           ` Christoph Lameter
2010-01-14 21:29           ` Alex Chiang
2010-01-14 21:29             ` Alex Chiang
2010-01-14 21:31             ` Pekka Enberg
2010-01-14 21:31               ` Pekka Enberg
2010-01-14 21:59               ` Christoph Lameter
2010-01-14 21:59                 ` Christoph Lameter
2010-01-14 22:01             ` Christoph Lameter
2010-01-14 22:01               ` Christoph Lameter
2010-01-15 20:07 ` Christoph Lameter
2010-01-15 20:07   ` Christoph Lameter
2010-01-15 20:35   ` Lee Schermerhorn
2010-01-15 20:35     ` Lee Schermerhorn
2010-01-15 23:32     ` Christoph Lameter
2010-01-15 23:32       ` Christoph Lameter
2010-01-19 18:53       ` Christoph Lameter
2010-01-19 18:53         ` Christoph Lameter
2010-01-19 20:02         ` Alex Chiang
2010-01-19 20:02           ` Alex Chiang
2010-01-19 20:29           ` Christoph Lameter
2010-01-19 20:29             ` Christoph Lameter
2010-01-19 21:29             ` Alex Chiang
2010-01-19 21:29               ` Alex Chiang
2010-01-19 21:50               ` Christoph Lameter
2010-01-19 21:50                 ` Christoph Lameter
2010-01-20 22:46                 ` Alex Chiang
2010-01-20 22:46                   ` Alex Chiang
2010-01-21 21:47                 ` Alex Chiang
2010-01-21 21:47                   ` Alex Chiang
2010-01-21 22:45                   ` Christoph Lameter
2010-01-21 22:45                     ` Christoph Lameter
2010-01-21 23:05                     ` Alex Chiang
2010-01-21 23:05                       ` Alex Chiang
2010-01-21 23:43                       ` Christoph Lameter
2010-01-21 23:43                         ` Christoph Lameter
2010-01-22  0:15                         ` Alex Chiang
2010-01-22  0:15                           ` Alex Chiang
2010-01-22 14:43                           ` Christoph Lameter
2010-01-22 14:43                             ` Christoph Lameter
2010-01-22 16:37                             ` Pekka Enberg
2010-01-22 16:37                               ` Pekka Enberg

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20100114005304.GC27766@ldl.fc.hp.com \
    --to=achiang@hp.com \
    --cc=cl@linux-foundation.org \
    --cc=linux-ia64@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=penberg@cs.helsinki.fi \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.