linuxppc-dev.lists.ozlabs.org archive mirror
 help / color / mirror / Atom feed
* Unstable MPC8245 board
@ 2003-02-25 22:46 William A. Gatliff
  2003-02-25 23:03 ` Chris Zimman
  0 siblings, 1 reply; 6+ messages in thread
From: William A. Gatliff @ 2003-02-25 22:46 UTC (permalink / raw)
  To: linuxppc-embedded


Guys:


I'm trying to run a recent (two weeks ago, Makefile has 2.4.21-pre4)
MV kernel on a semi-custom MPC8245 board that looks a lot like a
Cogent CSB281.  I'm using binutils-2.13.1, gcc-3.2.1, uClibc-0.9.17,
busybox-0.60.4.  I'm also using the uMON bootloader from the CSB281.

The kernel boots up fine, and mounts an initial ramdisk (romfs-0.3.1)
containing a very minimal busybox environment.  After a few ls -lR's,
the system freezes hard.

According to my bdi2000, I usually end up in trap 700 before the CPU
halts.  But sometimes I get pretty decent oops messages first:

tulip_interrupt: ioaddr=c3000c00 tulip_debug=5
Oops: kernel access of bad area, sig: 11
NIP: C00F6EB4 XER: 20000000 LR: C00F7CE4 SP: C0117180 REGS: c01170d0 TRAP: 0300
   Not tainted
MSR: 00001032 EE: 0 PR: 0 FP: 0 ME: 1 IR/DR: 11
DAR: 504067FC, DSISR: 20000000
TASK = c0115440[0] 'swapper' Last syscall: 120
last math c0902000 last altivec 00000000
GPR00: 00000004 C0117180 C0115440 504067FC FFFFFFFE C01052FD C0117220 0000000A
GPR08: 00000000 504067FC FFFFFFFF C01171B8 C0140000 00000000 305C8801 B80688E6
GPR16: D0040604 C3180200 00022412 00000000 00001032 00000000 00000000 00000019
GPR24: 00000400 C0144FB4 C0117220 FFFFFFFF 504067FC C01453B3 00000000 C0144FB4
Call backtrace:
C0015F94 C0015D0C C008DAB4 C0006F4C C0007010 C0005FAC C0007558
C000756C C0003B38 C01275DC 00003828
Kernel panic: Aiee, killing interrupt handler!
In interrupt handler - not syncing
 <0>Rebooting in 180 seconds..Initializing the MPC8245 PCI Bridge

(Yes, I also have problems without _any_ ethernet drivers installed).

The rest of the time, everything is simply scrambled by the time I
type HALT on the bdi.

I know what your answer will be, especially now that I've read the
mailing list archives over and over: "memory interface problem".  I
tend to agree--- it's my main theory at the moment, in fact.  I'm just
not sure what to do about it.  Some questions:

* Is there any reasonable explanation besides a memory problem?

* I see the checksum for the kernel's .text region changing at
runtime.  I'm pretty sure this isn't normal, is it?

* Does anyone have a working CSB281 kernel (or similar) that I could
borrow?  :^) Sandpoint won't get it, I don't have all the extra
hardware onboard (I think I look more like a Muenski).

* Can anyone suggest a really conservative memory interface setup to
test with?  The MPC8245 is a bit of a stretch for me at the moment...


Here's my .config.  For brevity, I omitted everything that's not set.

CONFIG_RWSEM_XCHGADD_ALGORITHM=y
CONFIG_HAVE_DEC_LOCK=y
CONFIG_EXPERIMENTAL=y
CONFIG_ADVANCED_OPTIONS=y
CONFIG_PPC=y
CONFIG_PPC32=y
CONFIG_6xx=y
CONFIG_PPC_STD_MMU=y
CONFIG_CSB281=y
CONFIG_PPC_ISATIMER=y
CONFIG_PCI=y
CONFIG_NET=y
CONFIG_SYSCTL=y
CONFIG_SYSVIPC=y
CONFIG_KCORE_ELF=y
CONFIG_BINFMT_ELF=y
CONFIG_KERNEL_ELF=y
CONFIG_BINFMT_MISC=y
CONFIG_PPC_RTC=y
CONFIG_CMDLINE_BOOL=y
CONFIG_CMDLINE="console=ttyS0,38400 root=/dev/ram"
CONFIG_BLK_DEV_LOOP=y
CONFIG_BLK_DEV_RAM=y
CONFIG_BLK_DEV_RAM_SIZE=4096
CONFIG_BLK_DEV_INITRD=y
CONFIG_UNIX=y
CONFIG_INET=y
CONFIG_IP_PNP=y
CONFIG_IP_PNP_DHCP=y
CONFIG_IP_PNP_BOOTP=y
CONFIG_NET_ETHERNET=y
CONFIG_NET_PCI=y
CONFIG_PCNET32=y
CONFIG_TULIP=y
CONFIG_TULIP_MMIO=y
CONFIG_NATSEMI=y
CONFIG_NE2K_PCI=y
CONFIG_8139CP=y
CONFIG_SERIAL=y
CONFIG_SERIAL_CONSOLE=y
CONFIG_UNIX98_PTYS=y
CONFIG_UNIX98_PTY_COUNT=256
CONFIG_TMPFS=y
CONFIG_RAMFS=y
CONFIG_PROC_FS=y
CONFIG_DEVPTS_FS=y
CONFIG_ROMFS_FS=y
CONFIG_NFS_FS=y
CONFIG_ROOT_NFS=y
CONFIG_SUNRPC=y
CONFIG_LOCKD=y
CONFIG_MSDOS_PARTITION=y
CONFIG_DEBUG_KERNEL=y
CONFIG_BDI_SWITCH=y
CONFIG_MORE_COMPILE_OPTIONS=y
CONFIG_COMPILE_OPTIONS="-g -ggdb"




I'm stumped.  Thanks to _anyone_ who can help me make any progress on
this...




b.g.
--
Bill Gatliff
http://billgatliff.com

** Sent via the linuxppc-embedded mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Unstable MPC8245 board
  2003-02-25 22:46 Unstable MPC8245 board William A. Gatliff
@ 2003-02-25 23:03 ` Chris Zimman
  2003-02-26  1:34   ` William A. Gatliff
  2003-02-26 19:33   ` Linux on Cogent CSB281? William A. Gatliff
  0 siblings, 2 replies; 6+ messages in thread
From: Chris Zimman @ 2003-02-25 23:03 UTC (permalink / raw)
  To: William A. Gatliff; +Cc: linuxppc-embedded


Bill,

Just a few things to try before launching into kernel stuff:

I'm not familiar with uMON, but I would guess it has some memory tests
in it.  Can you do burst reads/writes inside of uMON reliably?

Without knowing anything else, from what I see here, I'm guessing that
your problem is related to memory.  But testing outside of the kernel
will definitely help to rule that out.

--Chris


On Tue, Feb 25, 2003 at 04:46:19PM -0600, William A. Gatliff wrote:
>
> Guys:
>
>
> I'm trying to run a recent (two weeks ago, Makefile has 2.4.21-pre4)
> MV kernel on a semi-custom MPC8245 board that looks a lot like a
> Cogent CSB281.  I'm using binutils-2.13.1, gcc-3.2.1, uClibc-0.9.17,
> busybox-0.60.4.  I'm also using the uMON bootloader from the CSB281.
>
> The kernel boots up fine, and mounts an initial ramdisk (romfs-0.3.1)
> containing a very minimal busybox environment.  After a few ls -lR's,
> the system freezes hard.
>
> According to my bdi2000, I usually end up in trap 700 before the CPU
> halts.  But sometimes I get pretty decent oops messages first:
>
> tulip_interrupt: ioaddr=c3000c00 tulip_debug=5
> Oops: kernel access of bad area, sig: 11
> NIP: C00F6EB4 XER: 20000000 LR: C00F7CE4 SP: C0117180 REGS: c01170d0 TRAP: 0300
>    Not tainted
> MSR: 00001032 EE: 0 PR: 0 FP: 0 ME: 1 IR/DR: 11
> DAR: 504067FC, DSISR: 20000000
> TASK = c0115440[0] 'swapper' Last syscall: 120
> last math c0902000 last altivec 00000000
> GPR00: 00000004 C0117180 C0115440 504067FC FFFFFFFE C01052FD C0117220 0000000A
> GPR08: 00000000 504067FC FFFFFFFF C01171B8 C0140000 00000000 305C8801 B80688E6
> GPR16: D0040604 C3180200 00022412 00000000 00001032 00000000 00000000 00000019
> GPR24: 00000400 C0144FB4 C0117220 FFFFFFFF 504067FC C01453B3 00000000 C0144FB4
> Call backtrace:
> C0015F94 C0015D0C C008DAB4 C0006F4C C0007010 C0005FAC C0007558
> C000756C C0003B38 C01275DC 00003828
> Kernel panic: Aiee, killing interrupt handler!
> In interrupt handler - not syncing
>  <0>Rebooting in 180 seconds..Initializing the MPC8245 PCI Bridge
>
> (Yes, I also have problems without _any_ ethernet drivers installed).
>
> The rest of the time, everything is simply scrambled by the time I
> type HALT on the bdi.
>
> I know what your answer will be, especially now that I've read the
> mailing list archives over and over: "memory interface problem".  I
> tend to agree--- it's my main theory at the moment, in fact.  I'm just
> not sure what to do about it.  Some questions:
>
> * Is there any reasonable explanation besides a memory problem?
>
> * I see the checksum for the kernel's .text region changing at
> runtime.  I'm pretty sure this isn't normal, is it?
>
> * Does anyone have a working CSB281 kernel (or similar) that I could
> borrow?  :^) Sandpoint won't get it, I don't have all the extra
> hardware onboard (I think I look more like a Muenski).
>
> * Can anyone suggest a really conservative memory interface setup to
> test with?  The MPC8245 is a bit of a stretch for me at the moment...
>
>
> Here's my .config.  For brevity, I omitted everything that's not set.
>
> CONFIG_RWSEM_XCHGADD_ALGORITHM=y
> CONFIG_HAVE_DEC_LOCK=y
> CONFIG_EXPERIMENTAL=y
> CONFIG_ADVANCED_OPTIONS=y
> CONFIG_PPC=y
> CONFIG_PPC32=y
> CONFIG_6xx=y
> CONFIG_PPC_STD_MMU=y
> CONFIG_CSB281=y
> CONFIG_PPC_ISATIMER=y
> CONFIG_PCI=y
> CONFIG_NET=y
> CONFIG_SYSCTL=y
> CONFIG_SYSVIPC=y
> CONFIG_KCORE_ELF=y
> CONFIG_BINFMT_ELF=y
> CONFIG_KERNEL_ELF=y
> CONFIG_BINFMT_MISC=y
> CONFIG_PPC_RTC=y
> CONFIG_CMDLINE_BOOL=y
> CONFIG_CMDLINE="console=ttyS0,38400 root=/dev/ram"
> CONFIG_BLK_DEV_LOOP=y
> CONFIG_BLK_DEV_RAM=y
> CONFIG_BLK_DEV_RAM_SIZE=4096
> CONFIG_BLK_DEV_INITRD=y
> CONFIG_UNIX=y
> CONFIG_INET=y
> CONFIG_IP_PNP=y
> CONFIG_IP_PNP_DHCP=y
> CONFIG_IP_PNP_BOOTP=y
> CONFIG_NET_ETHERNET=y
> CONFIG_NET_PCI=y
> CONFIG_PCNET32=y
> CONFIG_TULIP=y
> CONFIG_TULIP_MMIO=y
> CONFIG_NATSEMI=y
> CONFIG_NE2K_PCI=y
> CONFIG_8139CP=y
> CONFIG_SERIAL=y
> CONFIG_SERIAL_CONSOLE=y
> CONFIG_UNIX98_PTYS=y
> CONFIG_UNIX98_PTY_COUNT=256
> CONFIG_TMPFS=y
> CONFIG_RAMFS=y
> CONFIG_PROC_FS=y
> CONFIG_DEVPTS_FS=y
> CONFIG_ROMFS_FS=y
> CONFIG_NFS_FS=y
> CONFIG_ROOT_NFS=y
> CONFIG_SUNRPC=y
> CONFIG_LOCKD=y
> CONFIG_MSDOS_PARTITION=y
> CONFIG_DEBUG_KERNEL=y
> CONFIG_BDI_SWITCH=y
> CONFIG_MORE_COMPILE_OPTIONS=y
> CONFIG_COMPILE_OPTIONS="-g -ggdb"
>
>
>
>
> I'm stumped.  Thanks to _anyone_ who can help me make any progress on
> this...
>
>
>
>
> b.g.
> --
> Bill Gatliff
> http://billgatliff.com
>

** Sent via the linuxppc-embedded mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Unstable MPC8245 board
  2003-02-25 23:03 ` Chris Zimman
@ 2003-02-26  1:34   ` William A. Gatliff
  2003-02-26 16:02     ` Jan Damborsky
  2003-02-26 19:33   ` Linux on Cogent CSB281? William A. Gatliff
  1 sibling, 1 reply; 6+ messages in thread
From: William A. Gatliff @ 2003-02-26  1:34 UTC (permalink / raw)
  To: Chris Zimman; +Cc: linuxppc-embedded


Chris:


> I'm not familiar with uMON, but I would guess it has some memory tests
> in it.  Can you do burst reads/writes inside of uMON reliably?

Good question.  I have the source, I could add such a test.  (uMON has
a memory test, but it's a "walking ones and zeros" for verifying
hardware connectivity, not all that other stuff that's a problem with
SDRAMs).

What's a C construct that would virtually guarantee a burst read or
write?

> Without knowing anything else, from what I see here, I'm guessing that
> your problem is related to memory.  But testing outside of the kernel
> will definitely help to rule that out.

Ok.  Thanks, that's a start...


b.g.
--
Bill Gatliff
http://billgatliff.com

** Sent via the linuxppc-embedded mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Unstable MPC8245 board
  2003-02-26  1:34   ` William A. Gatliff
@ 2003-02-26 16:02     ` Jan Damborsky
  0 siblings, 0 replies; 6+ messages in thread
From: Jan Damborsky @ 2003-02-26 16:02 UTC (permalink / raw)
  To: linuxppc-embedded


William A. Gatliff wrote:

 >What's a C construct that would virtually guarantee a burst read or
 >write?
 >
 >
I use MPC8xx and I had some problem with RWM bursting a two month ago.
PPC core uses bursting in several situations, not always.
For example, with I-cache and D-cache disabled, there are no bursts when
reading
instructiontions or reading or writting data from or to RWM, there are
only single beat
read or write cycles occuring on bus.
When enabled I-cache or D-cache in write-through mode, there are only
burst reads used,
no burst writes. In order to force PPC core to use burst-write cycles,
it is necessary to enable
D-cache working in copyback mode.
So when I want to test burst reads and burst writes:
[1] I enable D-cache in copyback mode (not so trivial as enable only
I-cache working)
[2] allocate a large array of longs (for example ten times more then
size of D-cache is)
[3] writting pseudo random data to this array.
[4] Verifying written data.
What happens:
When writting data, these are stored in D-cache until there is no free
place in D-cache.
Then when writting more data, older data from D-cache are stored in RWM
by means
of burst writes in order to make free place for newer data.
When verifiyng, data are read to cache from RWM by means of burst reads.
It is only a little example I used.
                                                                      HANYS


** Sent via the linuxppc-embedded mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Linux on Cogent CSB281?
  2003-02-25 23:03 ` Chris Zimman
  2003-02-26  1:34   ` William A. Gatliff
@ 2003-02-26 19:33   ` William A. Gatliff
  2003-02-26 19:48     ` Gary Thomas
  1 sibling, 1 reply; 6+ messages in thread
From: William A. Gatliff @ 2003-02-26 19:33 UTC (permalink / raw)
  To: linuxppc-embedded


Guys:


Does anyone have any linux-2.4.x running on a Cogent CSB281?

I would love to have a parallel solution to mine to look at.  My
kernel (mv's -dev tree, 2.4.21-pre4) boots and runs off a ramdisk
mostly fine, but crashes in ways that make me suspect a memory
controller problem.

I'm only interested in the kernel itself, I don't care at the moment
about PCI or anything else.  I'll sort through those problems (and
contribute patches if anyone else is interested) once I have the
kernel itself running solid.


Thanks!


b.g.
 --
Bill Gatliff
http://billgatliff.com

** Sent via the linuxppc-embedded mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Linux on Cogent CSB281?
  2003-02-26 19:33   ` Linux on Cogent CSB281? William A. Gatliff
@ 2003-02-26 19:48     ` Gary Thomas
  0 siblings, 0 replies; 6+ messages in thread
From: Gary Thomas @ 2003-02-26 19:48 UTC (permalink / raw)
  To: bgat; +Cc: linuxppc embedded


On Wed, 2003-02-26 at 12:33, William A. Gatliff wrote:
>
> Guys:
>
>
> Does anyone have any linux-2.4.x running on a Cogent CSB281?
>
> I would love to have a parallel solution to mine to look at.  My
> kernel (mv's -dev tree, 2.4.21-pre4) boots and runs off a ramdisk
> mostly fine, but crashes in ways that make me suspect a memory
> controller problem.
>
> I'm only interested in the kernel itself, I don't care at the moment
> about PCI or anything else.  I'll sort through those problems (and
> contribute patches if anyone else is interested) once I have the
> kernel itself running solid.

What sort of strange things do you see?  Can you duplicate them,
or are they random?

I've seen this board act weirdly running eCos as well.  I don't
have a Linux kernel for this board, but I'd be glad to try yours
just to see if the behavior is consistent across boards.

--
------------------------------------------------------------
Gary Thomas                 |
MLB Associates              |  Consulting for the
+1 (970) 229-1963           |    Embedded world
http://www.mlbassoc.com/    |
email: <gary@mlbassoc.com>  |
gpg: http://www.chez-thomas.org/gary/gpg_key.asc
------------------------------------------------------------


** Sent via the linuxppc-embedded mail list. See http://lists.linuxppc.org/

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2003-02-26 19:48 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-02-25 22:46 Unstable MPC8245 board William A. Gatliff
2003-02-25 23:03 ` Chris Zimman
2003-02-26  1:34   ` William A. Gatliff
2003-02-26 16:02     ` Jan Damborsky
2003-02-26 19:33   ` Linux on Cogent CSB281? William A. Gatliff
2003-02-26 19:48     ` Gary Thomas

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).