* 2.4.25 + 040218.diff OOPS
@ 2004-03-20 11:50 Tomas Szepe
2004-03-23 17:12 ` Bjorn Helgaas
` (5 more replies)
0 siblings, 6 replies; 7+ messages in thread
From: Tomas Szepe @ 2004-03-20 11:50 UTC (permalink / raw)
To: linux-ia64
[-- Attachment #1: Type: text/plain, Size: 9190 bytes --]
Hi,
I just got the following OOPS. Running the dump through ksymoops doesn't
give me anything useful on account of these errors:
...
Warning (compare_maps): mismatch on symbol QUIRK_LIST, ksyms_base says e00000000480d6e0, System.map says e00000000461fe40.
Ignoring ksyms_base entry
Warning (compare_maps): mismatch on symbol SELECT_DRIVE, ksyms_base says e00000000480f070, System.map says e00000000461fbe0.
Ignoring ksyms_base entry
...
The box is an HP integrity rx1600, bootup dmesg gzipped/attached.
The kernel (version $subj) has been compiled using gcc-3.3.3 and
binutils-2.14.90.0.8. This is no particular distribution, I've
compiled the userland all by myself (there's no ia64 slackware).
Any help appreciated!
Mar 20 11:46:07 columbia kernel: bash[15068]: NaT consumption 2216203124768
Mar 20 11:46:07 columbia kernel:
Mar 20 11:46:07 columbia kernel: Pid: 15068, CPU 0, comm: bash
Mar 20 11:46:07 columbia kernel: psr : 0000121008022038 ifs : 800000000000070f ip : [<e000000004449e71>] Not tainted
Mar 20 11:46:07 columbia kernel: ip is at (no symbol)
Mar 20 11:46:07 columbia kernel: unat: 0000000000000000 pfs : 0000000000000309 rsc : 0000000000000003
Mar 20 11:46:07 columbia kernel: rnat: 000000000000048c bsps: 0000000000000003 pr : 0000000000056a59
Mar 20 11:46:07 columbia kernel: ldrs: 0000000000000000 ccv : 0000000000000121 fpsr: 0009804c0270033f
Mar 20 11:46:07 columbia kernel: csd : 0000000000000000 ssd : 0000000000000000
Mar 20 11:46:07 columbia kernel: b0 : e0000000045b88e0 b6 : e000000004402d60 b7 : e0000000045b0c80
Mar 20 11:46:07 columbia kernel: f6 : 0ffff8000000000000000 f7 : 000000000000000000000
Mar 20 11:46:07 columbia kernel: f8 : 000000000000000000000 f9 : 000000000000000000000
Mar 20 11:46:07 columbia kernel: f10 : 000000000000000000000 f11 : 000000000000000000000
Mar 20 11:46:07 columbia kernel: r1 : e000000004ad5d30 r2 : 0000000000000000 r3 : e0000000385b8000
Mar 20 11:46:07 columbia kernel: r8 : e0000000045b0c80 r9 : 0000000000000121 r10 : 0000000000000120
Mar 20 11:46:07 columbia kernel: r11 : 0000000000000121 r12 : e000000037257e00 r13 : e000000037250000
Mar 20 11:46:07 columbia kernel: r14 : e000000004818028 r15 : e0000000385b81b8 r16 : 0000000000000121
Mar 20 11:46:07 columbia kernel: r17 : 0000000000000001 r18 : fffffffffffffffe r19 : e0000000385b91e0
Mar 20 11:46:07 columbia kernel: r20 : 0000000000000000 r21 : e0000000385bb000 r22 : e0000000385b9b10
Mar 20 11:46:07 columbia kernel: r23 : 0000000000000020 r24 : 00000fffffffa420 r25 : 0000000000000000
Mar 20 11:46:07 columbia kernel: r26 : 0000000000000001 r27 : 0000000000000001 r28 : 60000fffffffa421
Mar 20 11:46:07 columbia kernel: r29 : 60000fffffffa420 r30 : 0000000000004000 r31 : 0000000000000001
Mar 20 11:46:07 columbia kernel:
Mar 20 11:46:07 columbia kernel: Call Trace:
Mar 20 11:46:07 columbia kernel: [<e00000000440f820>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257960 bsp=e0000000372510d0
Mar 20 11:46:07 columbia kernel: [<e000000004429390>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257b30 bsp=e000000037251098
Mar 20 11:46:07 columbia kernel: [<e00000000442a080>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257b30 bsp=e000000037251050
Mar 20 11:46:07 columbia kernel: [<e00000000440a460>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257c30 bsp=e000000037251050
Mar 20 11:46:07 columbia kernel: [<e000000004449e70>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257e00 bsp=e000000037250fd0
Mar 20 11:46:07 columbia kernel: [<e0000000045b88e0>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257e00 bsp=e000000037250fa0
Mar 20 11:46:07 columbia kernel: [<e0000000045af2a0>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257e00 bsp=e000000037250f80
Mar 20 11:46:07 columbia kernel: [<e0000000045b29f0>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257e00 bsp=e000000037250ea0
Mar 20 11:46:07 columbia kernel: [<e0000000045a95d0>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257e30 bsp=e000000037250e60
Mar 20 11:46:07 columbia kernel: [<e0000000044a0140>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257e30 bsp=e000000037250dd8
Mar 20 11:46:07 columbia kernel: [<e00000000440a260>] (no symbol)
Mar 20 11:46:07 columbia kernel: sp=e000000037257e30 bsp=e000000037250dd8
Mar 20 11:47:51 columbia kernel: <1>Unable to handle kernel NULL pointer dereferencemc[15066]: Oops 11012296146944
Mar 20 11:47:51 columbia kernel:
Mar 20 11:47:51 columbia kernel: Pid: 15066, CPU 0, comm: mc
Mar 20 11:47:51 columbia kernel: psr : 0000121008026018 ifs : 800000000000038c ip : [<e0000000045a7be1>] Not tainted
Mar 20 11:47:51 columbia kernel: ip is at (no symbol)
Mar 20 11:47:51 columbia kernel: unat: 0000000000000000 pfs : 0000000000000894 rsc : 0000000000000003
Mar 20 11:47:51 columbia kernel: rnat: 0000000000000000 bsps: 0000000000000000 pr : 000000000005a955
Mar 20 11:47:51 columbia kernel: ldrs: 0000000000000000 ccv : 0000000000000045 fpsr: 0009804c8a70433f
Mar 20 11:47:51 columbia kernel: csd : 0000000000000000 ssd : 0000000000000000
Mar 20 11:47:51 columbia kernel: b0 : e0000000045aa780 b6 : e000000004402d60 b7 : e0000000045abac0
Mar 20 11:47:51 columbia kernel: f6 : 1003e0000000000000000 f7 : 1003e0000000000000078
Mar 20 11:47:51 columbia kernel: f8 : 1003e000000000000dc61 f9 : 1003e000000000009782b
Mar 20 11:47:51 columbia kernel: f10 : 1003e0000000000000474 f11 : 1003e0000000000000000
Mar 20 11:47:51 columbia kernel: r1 : e000000004ad5d30 r2 : 0000000000000502 r3 : 0000000000000090
Mar 20 11:47:51 columbia kernel: r8 : e000000003ca0680 r9 : e000000006144588 r10 : e000000006144580
Mar 20 11:47:51 columbia kernel: r11 : e000000002ef5390 r12 : e00000003e0dfdf0 r13 : e00000003e0d8000
Mar 20 11:47:51 columbia kernel: r14 : e0000000048135a8 r15 : 0000000000000000 r16 : e0000000385b8ac8
Mar 20 11:47:51 columbia kernel: r17 : 0000000000005401 r18 : 0000000000005401 r19 : e0000000048bce44
Mar 20 11:47:51 columbia kernel: r20 : 0000000000000000 r21 : e000000003ca07b0 r22 : e0000000061479b0
Mar 20 11:47:51 columbia kernel: r23 : e0000000061479a8 r24 : e0000000048bce40 r25 : e0000000048d7b38
Mar 20 11:47:51 columbia kernel: r26 : e0000000048d6b48 r27 : e000000003c53288 r28 : e0000000048bc580
Mar 20 11:47:51 columbia kernel: r29 : e000000002ef5688 r30 : e000000002ef5680 r31 : e000000003c52088
Mar 20 11:47:51 columbia kernel:
Mar 20 11:47:51 columbia kernel: Call Trace:
Mar 20 11:47:51 columbia kernel: [<e00000000440f820>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0df9c0 bsp=e00000003e0d9170
Mar 20 11:47:51 columbia kernel: [<e000000004429390>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfb90 bsp=e00000003e0d9138
Mar 20 11:47:51 columbia kernel: [<e000000004444970>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfb90 bsp=e00000003e0d90d8
Mar 20 11:47:51 columbia kernel: [<e00000000440a460>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfc20 bsp=e00000003e0d90d8
Mar 20 11:47:51 columbia kernel: [<e0000000045a7be0>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfdf0 bsp=e00000003e0d9078
Mar 20 11:47:51 columbia kernel: [<e0000000045aa780>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfdf0 bsp=e00000003e0d8fe8
Mar 20 11:47:51 columbia kernel: [<e0000000045abae0>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfe30 bsp=e00000003e0d8fc0
Mar 20 11:47:51 columbia kernel: [<e0000000044a2440>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfe30 bsp=e00000003e0d8f70
Mar 20 11:47:51 columbia kernel: [<e00000000449ec40>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfe30 bsp=e00000003e0d8f40
Mar 20 11:47:51 columbia kernel: [<e0000000044575b0>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfe30 bsp=e00000003e0d8ee0
Mar 20 11:47:51 columbia kernel: [<e0000000044581d0>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfe30 bsp=e00000003e0d8e50
Mar 20 11:47:51 columbia kernel: [<e000000004458480>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfe30 bsp=e00000003e0d8df0
Mar 20 11:47:51 columbia kernel: [<e0000000045aa780>] (no symbol)
Mar 20 11:47:51 columbia kernel: sp=e00000003e0dfe30 bsp=e00000003e0d8d98
--
Tomas Szepe <szepe@pinerecords.com>
[-- Attachment #2: dmesg.gz --]
[-- Type: application/x-gunzip, Size: 3013 bytes --]
^ permalink raw reply [flat|nested] 7+ messages in thread* Re: 2.4.25 + 040218.diff OOPS 2004-03-20 11:50 2.4.25 + 040218.diff OOPS Tomas Szepe @ 2004-03-23 17:12 ` Bjorn Helgaas 2004-03-24 10:54 ` Tomas Szepe ` (4 subsequent siblings) 5 siblings, 0 replies; 7+ messages in thread From: Bjorn Helgaas @ 2004-03-23 17:12 UTC (permalink / raw) To: linux-ia64 Can you look up these IPs by hand, since ksymoops doesn't work? (I always use gdb: "info line *0xe00000000447cfe2"). > Mar 20 11:46:07 columbia kernel: bash[15068]: NaT consumption 2216203124768 > Mar 20 11:46:07 columbia kernel: > Mar 20 11:46:07 columbia kernel: Pid: 15068, CPU 0, comm: bash > Mar 20 11:46:07 columbia kernel: psr : 0000121008022038 ifs : 800000000000070f ip : [<e000000004449e71>] Not tainted > Mar 20 11:47:51 columbia kernel: <1>Unable to handle kernel NULL pointer dereferencemc[15066]: Oops 11012296146944 > Mar 20 11:47:51 columbia kernel: > Mar 20 11:47:51 columbia kernel: Pid: 15066, CPU 0, comm: mc > Mar 20 11:47:51 columbia kernel: psr : 0000121008026018 ifs : 800000000000038c ip : [<e0000000045a7be1>] Not tainted If these are reproducible, it would be useful to figure out how, so I could try it here. ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: 2.4.25 + 040218.diff OOPS 2004-03-20 11:50 2.4.25 + 040218.diff OOPS Tomas Szepe 2004-03-23 17:12 ` Bjorn Helgaas @ 2004-03-24 10:54 ` Tomas Szepe 2004-03-24 13:00 ` Tomas Szepe ` (3 subsequent siblings) 5 siblings, 0 replies; 7+ messages in thread From: Tomas Szepe @ 2004-03-24 10:54 UTC (permalink / raw) To: linux-ia64 On Mar-23 2004, Tue, 10:12 -0700 Bjorn Helgaas <bjorn.helgaas@hp.com> wrote: > Can you look up these IPs by hand, since ksymoops doesn't work? > (I always use gdb: "info line *0xe00000000447cfe2"). Ok, I'll try to when (if) the OOPS reappears. > > Mar 20 11:46:07 columbia kernel: bash[15068]: NaT consumption 2216203124768 > > Mar 20 11:46:07 columbia kernel: > > Mar 20 11:46:07 columbia kernel: Pid: 15068, CPU 0, comm: bash > > Mar 20 11:46:07 columbia kernel: psr : 0000121008022038 ifs : 800000000000070f ip : [<e000000004449e71>] Not tainted > > > Mar 20 11:47:51 columbia kernel: <1>Unable to handle kernel NULL pointer dereferencemc[15066]: Oops 11012296146944 > > Mar 20 11:47:51 columbia kernel: > > Mar 20 11:47:51 columbia kernel: Pid: 15066, CPU 0, comm: mc > > Mar 20 11:47:51 columbia kernel: psr : 0000121008026018 ifs : 800000000000038c ip : [<e0000000045a7be1>] Not tainted > > If these are reproducible, it would be useful to figure out how, > so I could try it here. No "luck" so far, the machine has been running solid these days (except I'm getting occassional segfaults in userland, mainly from make and bash). Thanks! -- Tomas Szepe <szepe@pinerecords.com> ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: 2.4.25 + 040218.diff OOPS 2004-03-20 11:50 2.4.25 + 040218.diff OOPS Tomas Szepe 2004-03-23 17:12 ` Bjorn Helgaas 2004-03-24 10:54 ` Tomas Szepe @ 2004-03-24 13:00 ` Tomas Szepe 2004-03-24 14:34 ` Keith Owens ` (2 subsequent siblings) 5 siblings, 0 replies; 7+ messages in thread From: Tomas Szepe @ 2004-03-24 13:00 UTC (permalink / raw) To: linux-ia64 On Mar-20 2004, Sat, 12:50 +0100 Tomas Szepe <szepe@pinerecords.com> wrote: > Mar 20 11:47:51 columbia kernel: <1>Unable to handle kernel NULL pointer dereferencemc[15066]: Oops 11012296146944 > Mar 20 11:47:51 columbia kernel: > Mar 20 11:47:51 columbia kernel: Pid: 15066, CPU 0, comm: mc > Mar 20 11:47:51 columbia kernel: psr : 0000121008026018 ifs : 800000000000038c ip : [<e0000000045a7be1>] Not tainted > Mar 20 11:47:51 columbia kernel: ip is at (no symbol) [snip] Ok, this was most likely a false alarm. HP has just contacted us to say sorry for having sent us a busted server (they said something about broken capacitors affecting the bus). A replacement beast should appear in about 10 days, I'll be back to report how things are going. Thanks for understanding! -- Tomas Szepe <szepe@pinerecords.com> ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: 2.4.25 + 040218.diff OOPS 2004-03-20 11:50 2.4.25 + 040218.diff OOPS Tomas Szepe ` (2 preceding siblings ...) 2004-03-24 13:00 ` Tomas Szepe @ 2004-03-24 14:34 ` Keith Owens 2004-03-24 15:29 ` Andreas Schwab 2004-03-24 15:53 ` Bjorn Helgaas 5 siblings, 0 replies; 7+ messages in thread From: Keith Owens @ 2004-03-24 14:34 UTC (permalink / raw) To: linux-ia64 On Mar-23 2004, Tue, 10:12 -0700 Bjorn Helgaas <bjorn.helgaas@hp.com> wrote: > Can you look up these IPs by hand, since ksymoops doesn't work? Because IA64 does not distinguish between the address of the function and the address of the function descriptor. Sometimes the symbol points at the function code, sometimes at the descriptor. It is impossible to match maps that use the same symbol to point to different locations. PPC64 got it right. They use 'func' for the descriptor and '.func' for the code body, and are consistent about which is which. Too late to do anything about it for IA64, the maps are permanently inconsistent. ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: 2.4.25 + 040218.diff OOPS 2004-03-20 11:50 2.4.25 + 040218.diff OOPS Tomas Szepe ` (3 preceding siblings ...) 2004-03-24 14:34 ` Keith Owens @ 2004-03-24 15:29 ` Andreas Schwab 2004-03-24 15:53 ` Bjorn Helgaas 5 siblings, 0 replies; 7+ messages in thread From: Andreas Schwab @ 2004-03-24 15:29 UTC (permalink / raw) To: linux-ia64 Keith Owens <kaos@ocs.com.au> writes: > PPC64 got it right. Don't tell that the binutils people. :-) Andreas. -- Andreas Schwab, SuSE Labs, schwab@suse.de SuSE Linux AG, Maxfeldstraße 5, 90409 Nürnberg, Germany Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5 "And now for something completely different." ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: 2.4.25 + 040218.diff OOPS 2004-03-20 11:50 2.4.25 + 040218.diff OOPS Tomas Szepe ` (4 preceding siblings ...) 2004-03-24 15:29 ` Andreas Schwab @ 2004-03-24 15:53 ` Bjorn Helgaas 5 siblings, 0 replies; 7+ messages in thread From: Bjorn Helgaas @ 2004-03-24 15:53 UTC (permalink / raw) To: linux-ia64 On Wednesday 24 March 2004 3:54 am, Tomas Szepe wrote: > On Mar-23 2004, Tue, 10:12 -0700 > Bjorn Helgaas <bjorn.helgaas@hp.com> wrote: > > > Can you look up these IPs by hand, since ksymoops doesn't work? > > (I always use gdb: "info line *0xe00000000447cfe2"). > > Ok, I'll try to when (if) the OOPS reappears. If you still have the kernel image that was running when you saw the oops, you can still run gdb on that image. ^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2004-03-24 15:53 UTC | newest] Thread overview: 7+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2004-03-20 11:50 2.4.25 + 040218.diff OOPS Tomas Szepe 2004-03-23 17:12 ` Bjorn Helgaas 2004-03-24 10:54 ` Tomas Szepe 2004-03-24 13:00 ` Tomas Szepe 2004-03-24 14:34 ` Keith Owens 2004-03-24 15:29 ` Andreas Schwab 2004-03-24 15:53 ` Bjorn Helgaas
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox