* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
@ 2004-03-09 17:14 ` Aron Griffis
2004-03-09 17:21 ` Aron Griffis
` (10 subsequent siblings)
11 siblings, 0 replies; 13+ messages in thread
From: Aron Griffis @ 2004-03-09 17:14 UTC (permalink / raw)
To: linux-ia64
Aron Griffis wrote: [Tue Mar 09 2004, 11:09:04AM EST]
> I'm running davidm's bk kernel, version 2.6.4-rc1, built with gcc-3.3.
^^^^^^^^
Sorry, I meant gcc-3.3.3
--
Aron Griffis
hp Linux and Open Source Lab
Key fingerprint = 4601 AE87 379D A917 BA62 5263 C284 0366 5E6A 3C6B
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
2004-03-09 17:14 ` Aron Griffis
@ 2004-03-09 17:21 ` Aron Griffis
2004-03-09 17:54 ` Jesse Barnes
` (9 subsequent siblings)
11 siblings, 0 replies; 13+ messages in thread
From: Aron Griffis @ 2004-03-09 17:21 UTC (permalink / raw)
To: linux-ia64
Aron Griffis wrote: [Tue Mar 09 2004, 11:09:04AM EST]
> <<00>>KIenr nienlt eprarnuipct: hAainedel,e rk i-l lniontg siynntceirnrugp
> h a t
Deciphered:
Kernel panic: Aiee, killing interrupt
In interrupt handler - not syncing
--
Aron Griffis
hp Linux and Open Source Lab
Key fingerprint = 4601 AE87 379D A917 BA62 5263 C284 0366 5E6A 3C6B
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
2004-03-09 17:14 ` Aron Griffis
2004-03-09 17:21 ` Aron Griffis
@ 2004-03-09 17:54 ` Jesse Barnes
2004-03-09 18:47 ` Aron Griffis
` (8 subsequent siblings)
11 siblings, 0 replies; 13+ messages in thread
From: Jesse Barnes @ 2004-03-09 17:54 UTC (permalink / raw)
To: linux-ia64
On Tue, Mar 09, 2004 at 11:09:04AM -0500, Aron Griffis wrote:
> I'm running davidm's bk kernel, version 2.6.4-rc1, built with gcc-3.3.
> I've been poking around with SysRq and hit a crash last evening with the
> following footprint. After this the machine was wedged so badly that
> the Ethernet MP stopped responding completely, so I was unable to power
> cycle the machine until I arrived at the office.
What kind of machine are you running on? What's your .config look like?
Jesse
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
` (2 preceding siblings ...)
2004-03-09 17:54 ` Jesse Barnes
@ 2004-03-09 18:47 ` Aron Griffis
2004-03-09 18:57 ` Grant Grundler
` (7 subsequent siblings)
11 siblings, 0 replies; 13+ messages in thread
From: Aron Griffis @ 2004-03-09 18:47 UTC (permalink / raw)
To: linux-ia64
Jesse Barnes wrote: [Tue Mar 09 2004, 12:54:25PM EST]
> What kind of machine are you running on? What's your .config look like?
Long's Peak. I've dropped the config at
http://dev.gentoo.org/~agriffis/sysrq_oops/
--
Aron Griffis
hp Linux and Open Source Lab
Key fingerprint = 4601 AE87 379D A917 BA62 5263 C284 0366 5E6A 3C6B
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
` (3 preceding siblings ...)
2004-03-09 18:47 ` Aron Griffis
@ 2004-03-09 18:57 ` Grant Grundler
2004-03-09 19:25 ` Jesse Barnes
` (6 subsequent siblings)
11 siblings, 0 replies; 13+ messages in thread
From: Grant Grundler @ 2004-03-09 18:57 UTC (permalink / raw)
To: linux-ia64
On Tue, Mar 09, 2004 at 01:47:15PM -0500, Aron Griffis wrote:
> Jesse Barnes wrote: [Tue Mar 09 2004, 12:54:25PM EST]
> > What kind of machine are you running on? What's your .config look like?
>
> Long's Peak.
That's an RX2600 for anyone wanting to find the product on www.hp.com
grant
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
` (4 preceding siblings ...)
2004-03-09 18:57 ` Grant Grundler
@ 2004-03-09 19:25 ` Jesse Barnes
2004-03-09 20:10 ` Kenneth Chen
` (5 subsequent siblings)
11 siblings, 0 replies; 13+ messages in thread
From: Jesse Barnes @ 2004-03-09 19:25 UTC (permalink / raw)
To: linux-ia64
On Tue, Mar 09, 2004 at 10:57:48AM -0800, Grant Grundler wrote:
> On Tue, Mar 09, 2004 at 01:47:15PM -0500, Aron Griffis wrote:
> > Jesse Barnes wrote: [Tue Mar 09 2004, 12:54:25PM EST]
> > > What kind of machine are you running on? What's your .config look like?
> >
> > Long's Peak.
>
> That's an RX2600 for anyone wanting to find the product on www.hp.com
Ah, zx1 then. I accidentally deleted the original backtrace and it
doesn't appear to be in the list archives yet, but I'm guessing that
since the panic occured at arch/ia64/mm/contig.c:show_mem+0x100, I'm
guessing the problem is somewhere after the call to show_free_areas().
I don't have a zx1 box that's easy to test with, so I probably won't be
much help.
Jesse
^ permalink raw reply [flat|nested] 13+ messages in thread* RE: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
` (5 preceding siblings ...)
2004-03-09 19:25 ` Jesse Barnes
@ 2004-03-09 20:10 ` Kenneth Chen
2004-03-09 21:13 ` Aron Griffis
` (4 subsequent siblings)
11 siblings, 0 replies; 13+ messages in thread
From: Kenneth Chen @ 2004-03-09 20:10 UTC (permalink / raw)
To: linux-ia64
Jesse Barnes wrote on Tue, March 09, 2004 11:26 AM
> On Tue, Mar 09, 2004 at 10:57:48AM -0800, Grant Grundler wrote:
> > On Tue, Mar 09, 2004 at 01:47:15PM -0500, Aron Griffis wrote:
> > > Jesse Barnes wrote: [Tue Mar 09 2004, 12:54:25PM EST]
> > > > What kind of machine are you running on?
> > >
> > > Long's Peak.
> >
> > That's an RX2600 for anyone wanting to find the product on www.hp.com
>
> Ah, zx1 then. I accidentally deleted the original backtrace and it
> doesn't appear to be in the list archives yet, but I'm guessing that
> since the panic occured at arch/ia64/mm/contig.c:show_mem+0x100, I'm
> guessing the problem is somewhere after the call to show_free_areas().
> I don't have a zx1 box that's easy to test with, so I probably won't be
> much help.
Looks like it passed beyond show_free_areas(), faulting IP was in the
while loop accessing mem_map variable.
By the way, the local variable total is redundant in that function.
Same data already exists with max_mapnr.
- Ken
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
` (6 preceding siblings ...)
2004-03-09 20:10 ` Kenneth Chen
@ 2004-03-09 21:13 ` Aron Griffis
2004-03-09 21:36 ` Aron Griffis
` (3 subsequent siblings)
11 siblings, 0 replies; 13+ messages in thread
From: Aron Griffis @ 2004-03-09 21:13 UTC (permalink / raw)
To: linux-ia64
Jesse Barnes wrote: [Tue Mar 09 2004, 02:25:41PM EST]
> Ah, zx1 then. I accidentally deleted the original backtrace
http://dev.gentoo.org/~agriffis/oops.txt
> and it
> doesn't appear to be in the list archives yet, but I'm guessing that
> since the panic occured at arch/ia64/mm/contig.c:show_mem+0x100, I'm
> guessing the problem is somewhere after the call to show_free_areas().
> I don't have a zx1 box that's easy to test with, so I probably won't be
> much help.
So I have the vmlinux which was built with debugging symbols (I assume
this is why I got a backtrace as well)... however I'm not immediately
seeing how to trace this back into show_mem()
$ gdb vmlinux
(gdb) list *(show_mem+0x100)
0xa00000010005b9e0 is in show_mem (bitops.h:280).
275 }
276
277 static __inline__ int
278 test_bit (int nr, const volatile void *addr)
279 {
280 return 1 & (((const volatile __u32 *) addr)[nr >> 5] >> (nr & 31));
281 }
282
283 /**
284 * ffz - find the first zero bit in a long word
--
Aron Griffis
hp Linux and Open Source Lab
Key fingerprint = 4601 AE87 379D A917 BA62 5263 C284 0366 5E6A 3C6B
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
` (7 preceding siblings ...)
2004-03-09 21:13 ` Aron Griffis
@ 2004-03-09 21:36 ` Aron Griffis
2004-03-09 22:49 ` Kenneth Chen
` (2 subsequent siblings)
11 siblings, 0 replies; 13+ messages in thread
From: Aron Griffis @ 2004-03-09 21:36 UTC (permalink / raw)
To: linux-ia64
Kenneth Chen wrote: [Tue Mar 09 2004, 03:10:50PM EST]
> Looks like it passed beyond show_free_areas(), faulting IP was in the
> while loop accessing mem_map variable.
How did you determine that?
--
Aron Griffis
hp Linux and Open Source Lab
Key fingerprint = 4601 AE87 379D A917 BA62 5263 C284 0366 5E6A 3C6B
^ permalink raw reply [flat|nested] 13+ messages in thread* RE: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
` (8 preceding siblings ...)
2004-03-09 21:36 ` Aron Griffis
@ 2004-03-09 22:49 ` Kenneth Chen
2004-03-09 23:31 ` Aron Griffis
2004-03-09 23:39 ` Andreas Schwab
11 siblings, 0 replies; 13+ messages in thread
From: Kenneth Chen @ 2004-03-09 22:49 UTC (permalink / raw)
To: linux-ia64
Aron Griffis wrote on Tue, March 09, 2004 1:36 PM
> > Looks like it passed beyond show_free_areas(), faulting IP was
> > in the while loop accessing mem_map variable.
> How did you determine that?
Do objdump on vmlinux, then look at the disassembled code at
faulting IP:
Show_mem+0x100
a00000010003cd00: [MMI] ld4.acq r21=[r16];;
It is page faulting on r16, lookup r16 in the panic dump, it shows:
r16 : a0007fffff0fffb0, which basically points to a page struct in
the mem_map array, you might want to verify whether that address is
valid or not.
Exam the code up/down a little bit, you would realize the while loop
is in between show_mem+0xe0 and show_mem+0x1dc, variable i is stored
in r17/r3, lookup up these instructions to see how the pointer is
calculated with index i:
(i*5*16 + base mem_map, sizeof(struct page) = 80).
a00000010003cce6: sxt4 r23=r17;;
a00000010003ccec: shladd r22=r23,2,r23;;
a00000010003ccf0: [MMI] shladd r16=r22,4,r19;;
Anyway, hope you get the idea ....
- Ken
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
` (9 preceding siblings ...)
2004-03-09 22:49 ` Kenneth Chen
@ 2004-03-09 23:31 ` Aron Griffis
2004-03-09 23:39 ` Andreas Schwab
11 siblings, 0 replies; 13+ messages in thread
From: Aron Griffis @ 2004-03-09 23:31 UTC (permalink / raw)
To: linux-ia64
Kenneth Chen wrote: [Tue Mar 09 2004, 05:49:26PM EST]
> Anyway, hope you get the idea ....
Thanks! That's exactly what I was trying to do but lacked the
know-how...
--
Aron Griffis
hp Linux and Open Source Lab
Key fingerprint = 4601 AE87 379D A917 BA62 5263 C284 0366 5E6A 3C6B
^ permalink raw reply [flat|nested] 13+ messages in thread* Re: crash on SysRq : Show Memory
2004-03-09 16:09 crash on SysRq : Show Memory Aron Griffis
` (10 preceding siblings ...)
2004-03-09 23:31 ` Aron Griffis
@ 2004-03-09 23:39 ` Andreas Schwab
11 siblings, 0 replies; 13+ messages in thread
From: Andreas Schwab @ 2004-03-09 23:39 UTC (permalink / raw)
To: linux-ia64
jbarnes@sgi.com (Jesse Barnes) writes:
> On Tue, Mar 09, 2004 at 10:57:48AM -0800, Grant Grundler wrote:
>> On Tue, Mar 09, 2004 at 01:47:15PM -0500, Aron Griffis wrote:
>> > Jesse Barnes wrote: [Tue Mar 09 2004, 12:54:25PM EST]
>> > > What kind of machine are you running on? What's your .config look like?
>> >
>> > Long's Peak.
>>
>> That's an RX2600 for anyone wanting to find the product on www.hp.com
>
> Ah, zx1 then.
I can reproduce it on Tiger as well. Doesn't seem to be platform
dependent.
Andreas.
--
Andreas Schwab, SuSE Labs, schwab@suse.de
SuSE Linux AG, Maxfeldstraße 5, 90409 Nürnberg, Germany
Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5
"And now for something completely different."
^ permalink raw reply [flat|nested] 13+ messages in thread