From: Andy Whitcroft <apw@shadowen.org>
To: Andy Whitcroft <apw@shadowen.org>
Cc: "H. Peter Anvin" <hpa@zytor.com>, Mel Gorman <mel@csn.ul.ie>,
Andrew Morton <akpm@linux-foundation.org>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
Steve Fox <drfickle@us.ibm.com>
Subject: Re: 2.6.22-rc1-mm1
Date: Mon, 11 Jun 2007 14:58:36 +0100
Message-ID: <466D550C.4030600@shadowen.org>
In-Reply-To: <4667D4BC.2020806@shadowen.org>
Andy Whitcroft wrote:
> H. Peter Anvin wrote:
>> Andy Whitcroft wrote:
>>>> It definitely sounds like a memory clobber of some sort.
>>>>
>>>> Usual suspects, in addition to the input/output buffers you already
>>>> looked at, would be the heap and the stack. Finding where the stack
>>>> pointer lives would be my first, instinctive guess.
>>> The stack seems to be where it should be and stays pretty much in the
>>> same place, as it should. Checks I added show the heap also stays
>>> within bounds. I've tried making the stack and the heap 64k to no
>>> effect.
>>>
>>> Moving the kernel to other places in memory seems to kill the decode
>>> completely during gunzip(), which may be a hint; I am not sure.
>>>
>>> This thing is trying to ruin my mind.
>>>
>> Yours and mine both. Seems like *something* is clobbering memory, but
>> what and why is a mystery. The fact that putting the kernel at a higher
>> point in memory breaks the decode is a good indication that this
>> clobber is at a relatively high address.
>>
>> How much RAM does this machine have?
>
> This is a 12GB machine with 3 NUMA nodes.
>
> I checked out the location of the IDT and GDT and both seem sane, in the
> 9xxxx range below the kernel destination.
>
> I also note that on another machine of this type, with only one node in
> that case, some of the "did work" cases do not work. Also, when I
> applied some of my patches on top, "working" cases stopped working. So
> whatever it is, it is definitely related to the shape of the kernel to
> be loaded. Very confusing.

OK, in fact when the kernel is moved elsewhere in the address space it
does decode properly. There was a check in there for not loading at the
right address which was catching me out, and errors do not show up
because we have no serial support. Doh.
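
As an illustration of how such a check can fail silently, here is a
minimal sketch in C. It is not the actual decompressor code:
check_destination(), EXPECTED_LOAD_ADDR and REQUIRED_ALIGN are
hypothetical names and values chosen for the example. The point is that
the check only produces a reason string; if nothing is able to print it
(no serial console, say), the failure just looks like a hang.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical values for illustration only; not taken from the kernel. */
#define EXPECTED_LOAD_ADDR 0x01000000u  /* 16MB, assumed configured base */
#define REQUIRED_ALIGN     0x00100000u  /* 1MB, assumed alignment */

/*
 * Returns NULL when the destination address is acceptable, otherwise a
 * short reason string. The caller is responsible for reporting it; with
 * no working console the reason is simply lost.
 */
static const char *check_destination(uint32_t output)
{
    /* Alignment is a power of two, so a mask test is sufficient. */
    if (output & (REQUIRED_ALIGN - 1))
        return "destination address not aligned";

    /* A non-relocatable image must land exactly at its configured base. */
    if (output != EXPECTED_LOAD_ADDR)
        return "wrong destination address";

    return NULL;
}
```

With these assumed values only 0x01000000 (16MB) passes. 0x10000000
(256MB) is aligned but is still rejected as the wrong destination unless
EXPECTED_LOAD_ADDR is changed to match.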

Once I had this decoding at other addresses I simply tried moving the
base address for the kernel elsewhere. I am able to successfully boot
the kernel at 16MB and 256MB. This looks like something outside the
decoder scribbling on memory.

I would not normally recommend moving the base address of the kernel.
However, this problem has, at least so far, only shown up on the NUMA-Q
platform, which is best described as a very low-volume sub-architecture.
There are areas in which it differs from a mainstream BIOS, and we are no
longer able to get details of these differences.

We actually have no proof as yet that this is or is not a NUMA-Q
specific problem. For instance, these machines tend to run fewer modules
and more built-in code than average due to an owner's dislike of
modules. So we could have a lurking kernel size issue or similar.

I am therefore proposing to change the base address for NUMA-Q only
(patch to follow this email), and that we remain aware of the issue and
on the lookout for similar breakage on mainstream x86 platforms. At
least with this patch we can get wider testing of the rest of the kernel.

-apw