All of lore.kernel.org
 help / color / mirror / Atom feed
From: Andrew Morton <akpm@linux-foundation.org>
To: "Luck, Tony" <tony.luck@intel.com>
Cc: Yasunori Goto <y-goto@jp.fujitsu.com>, Mel Gorman <mel@skynet.ie>,
	Jeremy Higdon <jeremy@sgi.com>,
	Kamalesh Babulal <kamalesh@linux.vnet.ibm.com>,
	Andi Kleen <ak@suse.de>,
	linux-kernel@vger.kernel.org,
	Balbir Singh <balbir@linux.vnet.ibm.com>,
	linux-ia64@vger.kernel.org, Christoph Lameter <clameter@sgi.com>,
	linux-mm@kvack.org
Subject: Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory
Date: Thu, 23 Aug 2007 21:21:33 +0000	[thread overview]
Message-ID: <20070823142133.9359a1ce.akpm@linux-foundation.org> (raw)
In-Reply-To: <617E1C2C70743745A92448908E030B2A023EB020@scsmsx411.amr.corp.intel.com>

On Thu, 23 Aug 2007 10:22:26 -0700
"Luck, Tony" <tony.luck@intel.com> wrote:

> > __get_free_pages() of swiotlb_alloc_coherent() fails in rc3-mm1.
> > But, it doesn't fail on rc2-mm2, and kernel can boot up.
> 
> That looks to be part of the problem here ... failing an order=3
> allocation during boot on a system that just a few lines earlier
> in the boot log reported "Memory: 37474000k/37680640k available"
> looks bad ... but perhaps having *more* memory is part of your problem.
> You may have run low on GFP_DMA memory because some allocation
> scaled by memory size has chewed up a lot of your memory.  To check
> this try booting with a "mem=4G" parameter and see if that helps
> you.
> 
> But it is also bad that the swiotlb() code failed to handle this.
> Can you check whether the problem is related to the size of the
> allocation being just over 256K (a magic number for swiotlb since
> IO_TLB_SEGSIZE is 128 times a slab size of 2k).  Try changing
> lib/swiotlb.c to set IO_TLB_SEGSIZE to 256 instead.
> 

Others are reporting machines which fail int he memory allcoator much
earlier, and which claim to have four CPUs and 16 nodes.  So something is
very wonky in the rc3-mm1 page allocator.

I guess suspicion has to be directed at the memoryless-nodes patches, but
until that's cleared up I don't think there's much to be gained from
chasing this iommu problem, now that you've worked out that it's a bogus
memory allocation failure (thanks).



WARNING: multiple messages have this Message-ID (diff)
From: Andrew Morton <akpm@linux-foundation.org>
To: "Luck, Tony" <tony.luck@intel.com>
Cc: "Yasunori Goto" <y-goto@jp.fujitsu.com>,
	"Mel Gorman" <mel@skynet.ie>, "Jeremy Higdon" <jeremy@sgi.com>,
	"Kamalesh Babulal" <kamalesh@linux.vnet.ibm.com>,
	"Andi Kleen" <ak@suse.de>, <linux-kernel@vger.kernel.org>,
	"Balbir Singh" <balbir@linux.vnet.ibm.com>,
	<linux-ia64@vger.kernel.org>,
	Christoph Lameter <clameter@sgi.com>,
	linux-mm@kvack.org
Subject: Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted
Date: Thu, 23 Aug 2007 14:21:33 -0700	[thread overview]
Message-ID: <20070823142133.9359a1ce.akpm@linux-foundation.org> (raw)
In-Reply-To: <617E1C2C70743745A92448908E030B2A023EB020@scsmsx411.amr.corp.intel.com>

On Thu, 23 Aug 2007 10:22:26 -0700
"Luck, Tony" <tony.luck@intel.com> wrote:

> > __get_free_pages() of swiotlb_alloc_coherent() fails in rc3-mm1.
> > But, it doesn't fail on rc2-mm2, and kernel can boot up.
> 
> That looks to be part of the problem here ... failing an order=3
> allocation during boot on a system that just a few lines earlier
> in the boot log reported "Memory: 37474000k/37680640k available"
> looks bad ... but perhaps having *more* memory is part of your problem.
> You may have run low on GFP_DMA memory because some allocation
> scaled by memory size has chewed up a lot of your memory.  To check
> this try booting with a "mem=4G" parameter and see if that helps
> you.
> 
> But it is also bad that the swiotlb() code failed to handle this.
> Can you check whether the problem is related to the size of the
> allocation being just over 256K (a magic number for swiotlb since
> IO_TLB_SEGSIZE is 128 times a slab size of 2k).  Try changing
> lib/swiotlb.c to set IO_TLB_SEGSIZE to 256 instead.
> 

Others are reporting machines which fail int he memory allcoator much
earlier, and which claim to have four CPUs and 16 nodes.  So something is
very wonky in the rc3-mm1 page allocator.

I guess suspicion has to be directed at the memoryless-nodes patches, but
until that's cleared up I don't think there's much to be gained from
chasing this iommu problem, now that you've worked out that it's a bogus
memory allocation failure (thanks).



WARNING: multiple messages have this Message-ID (diff)
From: Andrew Morton <akpm@linux-foundation.org>
To: "Luck, Tony" <tony.luck@intel.com>
Cc: Yasunori Goto <y-goto@jp.fujitsu.com>, Mel Gorman <mel@skynet.ie>,
	Jeremy Higdon <jeremy@sgi.com>,
	Kamalesh Babulal <kamalesh@linux.vnet.ibm.com>,
	Andi Kleen <ak@suse.de>,
	linux-kernel@vger.kernel.org,
	Balbir Singh <balbir@linux.vnet.ibm.com>,
	linux-ia64@vger.kernel.org, Christoph Lameter <clameter@sgi.com>,
	linux-mm@kvack.org
Subject: Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted
Date: Thu, 23 Aug 2007 14:21:33 -0700	[thread overview]
Message-ID: <20070823142133.9359a1ce.akpm@linux-foundation.org> (raw)
In-Reply-To: <617E1C2C70743745A92448908E030B2A023EB020@scsmsx411.amr.corp.intel.com>

On Thu, 23 Aug 2007 10:22:26 -0700
"Luck, Tony" <tony.luck@intel.com> wrote:

> > __get_free_pages() of swiotlb_alloc_coherent() fails in rc3-mm1.
> > But, it doesn't fail on rc2-mm2, and kernel can boot up.
> 
> That looks to be part of the problem here ... failing an order=3
> allocation during boot on a system that just a few lines earlier
> in the boot log reported "Memory: 37474000k/37680640k available"
> looks bad ... but perhaps having *more* memory is part of your problem.
> You may have run low on GFP_DMA memory because some allocation
> scaled by memory size has chewed up a lot of your memory.  To check
> this try booting with a "mem=4G" parameter and see if that helps
> you.
> 
> But it is also bad that the swiotlb() code failed to handle this.
> Can you check whether the problem is related to the size of the
> allocation being just over 256K (a magic number for swiotlb since
> IO_TLB_SEGSIZE is 128 times a slab size of 2k).  Try changing
> lib/swiotlb.c to set IO_TLB_SEGSIZE to 256 instead.
> 

Others are reporting machines which fail int he memory allcoator much
earlier, and which claim to have four CPUs and 16 nodes.  So something is
very wonky in the rc3-mm1 page allocator.

I guess suspicion has to be directed at the memoryless-nodes patches, but
until that's cleared up I don't think there's much to be gained from
chasing this iommu problem, now that you've worked out that it's a bogus
memory allocation failure (thanks).


--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2007-08-23 21:21 UTC|newest]

Thread overview: 63+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-08-22 14:32 [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted Kamalesh Babulal
2007-08-22 16:19 ` [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory Andrew Morton
2007-08-22 16:19   ` [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted Andrew Morton
2007-08-22 16:35   ` Andi Kleen
2007-08-22 17:25     ` Andi Kleen
2007-08-22 18:31     ` Kamalesh Babulal
2007-08-22 18:43       ` [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory Kamalesh Babulal
2007-08-22 21:04       ` [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted Luck, Tony
2007-08-22 21:04         ` Luck, Tony
2007-08-22 22:24         ` Kamalesh Babulal
2007-08-22 22:36           ` [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory Kamalesh Babulal
2007-08-22 22:56           ` [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted Luck, Tony
2007-08-22 22:56             ` Luck, Tony
2007-08-22 23:11             ` Jeremy Higdon
2007-08-22 23:11               ` Jeremy Higdon
2007-08-22 23:27               ` Luck, Tony
2007-08-22 23:27                 ` Luck, Tony
2007-08-22 23:54                 ` Jeremy Higdon
2007-08-22 23:54                   ` Jeremy Higdon
2007-08-23  0:05                   ` Luck, Tony
2007-08-23  0:05                     ` Luck, Tony
2007-08-23  1:09                     ` Jeremy Higdon
2007-08-23  1:09                       ` Jeremy Higdon
2007-08-23  1:16                       ` Jeremy Higdon
2007-08-23  1:16                         ` Jeremy Higdon
2007-08-23  9:15                 ` Mel Gorman
2007-08-23  9:15                   ` Mel Gorman
2007-08-23 13:27                   ` Yasunori Goto
2007-08-23 13:27                     ` Yasunori Goto
2007-08-23 17:22                     ` Luck, Tony
2007-08-23 17:22                       ` Luck, Tony
2007-08-23 21:21                       ` Andrew Morton [this message]
2007-08-23 21:21                         ` Andrew Morton
2007-08-23 21:21                         ` Andrew Morton
2007-08-24  6:53                         ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory wo Yasunori Goto
2007-08-24  6:53                           ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted) Yasunori Goto
2007-08-24  6:53                           ` Yasunori Goto
2007-08-24 14:52                           ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memor Mel Gorman
2007-08-24 14:52                             ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted) Mel Gorman
2007-08-24 14:52                             ` Mel Gorman
2007-08-24 15:49                             ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Lee Schermerhorn
2007-08-24 15:49                               ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted) Lee Schermerhorn
2007-08-24 15:49                               ` Lee Schermerhorn
2007-08-24 17:00                               ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel Christoph Lameter
2007-08-24 17:00                                 ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted) Christoph Lameter
2007-08-24 17:00                                 ` Christoph Lameter
2007-08-24 18:03                                 ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Lee Schermerhorn
2007-08-24 18:03                                   ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted) Lee Schermerhorn
2007-08-24 18:03                                   ` Lee Schermerhorn
2007-08-24 18:08                                   ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel Christoph Lameter
2007-08-24 18:08                                     ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted) Christoph Lameter
2007-08-24 18:08                                     ` Christoph Lameter
2007-08-24 17:02                             ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel Christoph Lameter
2007-08-24 17:02                               ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted) Christoph Lameter
2007-08-24 17:02                               ` Christoph Lameter
2007-08-24 16:46                           ` Kamalesh Babulal
2007-08-24 16:58                             ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel Kamalesh Babulal
2007-08-24 16:46                             ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted) Kamalesh Babulal
2007-08-28 22:41                           ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Adam Litke
2007-08-28 22:41                             ` [PATCH] Fix find_next_best_node (Re: [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted) Adam Litke
2007-08-28 22:41                             ` Adam Litke
2007-08-23  9:22                 ` [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory would be corrupted Kamalesh Babulal
2007-08-23  9:34                   ` [BUG] 2.6.23-rc3-mm1 Kernel panic - not syncing: DMA: Memory Kamalesh Babulal

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20070823142133.9359a1ce.akpm@linux-foundation.org \
    --to=akpm@linux-foundation.org \
    --cc=ak@suse.de \
    --cc=balbir@linux.vnet.ibm.com \
    --cc=clameter@sgi.com \
    --cc=jeremy@sgi.com \
    --cc=kamalesh@linux.vnet.ibm.com \
    --cc=linux-ia64@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=mel@skynet.ie \
    --cc=tony.luck@intel.com \
    --cc=y-goto@jp.fujitsu.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.