* Re: sparc32 SMP debugging advice needed
2007-03-23 21:42 sparc32 SMP debugging advice needed Robert Reif
@ 2007-03-24 10:55 ` BERTRAND Joël
2007-03-24 16:34 ` Tom "spot" Callaway
2007-03-24 16:53 ` Jiri Gaisler
2 siblings, 0 replies; 4+ messages in thread
From: BERTRAND Joël @ 2007-03-24 10:55 UTC (permalink / raw)
To: sparclinux
Hello,
Robert Reif a écrit :
> I'm trying to get a git rc4 SMP kernel working on an ss20 and am running
> into a few problems.
I have built the same kernel with some debug information about
deadlocks and spinlocks. This kernel works fine with one SuperSPARC but
not with one HyperSPARC (cache aliasing management mistake). With dual
SuperSPARC (SM71), kernel randomly hangs with spinlock bug. Sometimes,
it can boot, sometimes not, and I don't know how debug this kernel. When
it completely boots, I can see some deadlocks (for example, run "make
-j2" to build a kernel randomly stops with cc in D state), some randomly
oops (syscall_to_hard) when pipe syscall is executed.
> An SMP kernel boots and runs fine using debian etch and a single
> CPU. However I do get this in kern.log:
>
> Mar 23 10:59:53 ss20 kernel: Kernel command line: root=/dev/sda2 ro
> Mar 23 10:59:53 ss20 kernel: PID hash table entries: 1024 (order: 10,
> 4096 bytes)
> Mar 23 10:59:53 ss20 kernel: start_kernel(): bug: interrupts were
> enabled early
> Mar 23 10:59:53 ss20 kernel: Console: colour dummy device 80x25
>
> It looks like something similar was fixed a while back on sparc64 but
> I don't know if this is related. I'm recompiling a kernel with more
> debugging turned on to see what happens.
>
> I get a hang when booting the same kernel with 2 CPUs at:
>
> Begin: Loading essential drivers... ...
>
> Nothing interesting before that.
>
> Any advice on how to approach fixing these issues? I can test patches.
I can test too. If someones has some idea about this trouble or a
method to debug this kernel, he is welcome ;-)
Regards,
JKB
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: sparc32 SMP debugging advice needed
2007-03-23 21:42 sparc32 SMP debugging advice needed Robert Reif
2007-03-24 10:55 ` BERTRAND Joël
@ 2007-03-24 16:34 ` Tom "spot" Callaway
2007-03-24 16:53 ` Jiri Gaisler
2 siblings, 0 replies; 4+ messages in thread
From: Tom "spot" Callaway @ 2007-03-24 16:34 UTC (permalink / raw)
To: sparclinux
On Fri, 2007-03-23 at 17:42 -0400, Robert Reif wrote:
> Any advice on how to approach fixing these issues? I can test
> patches.
Sadly, it seems that in the case of sparc32, no one who is willing is
able, and no one who is able is willing.
There is no maintainer, Dave Miller isn't willing to take it on (not
that I blame him), and much of the hardware is starting to fail, making
debugging that much more fun.
Given that UP has so many issues, I don't even fathom how SMP is going
to work right anytime soon.
I'd be happy to be proven wrong here, but the likelyhood of a stable
sparc32 kernel is very very low.
~spot
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: sparc32 SMP debugging advice needed
2007-03-23 21:42 sparc32 SMP debugging advice needed Robert Reif
2007-03-24 10:55 ` BERTRAND Joël
2007-03-24 16:34 ` Tom "spot" Callaway
@ 2007-03-24 16:53 ` Jiri Gaisler
2 siblings, 0 replies; 4+ messages in thread
From: Jiri Gaisler @ 2007-03-24 16:53 UTC (permalink / raw)
To: sparclinux
I have a feeling that the broken sparc32 SMP is related to
the CPU-specific SMP code, rather than the whole sparc32 port.
We have added SMP support for the leon3 (V8) processor to
linux-2.6.18.1, and are happily running systems with 8 CPUs
in hardware and up to 16 CPUs in simulation. This is done
using cache snooping to synchronize processors, i.e. we do
NOT flush or disable data caches to keep the system running.
I would therefore appreciate if the sparc32 SMP code was left
in the kernel, and not removed because it does not work on
legacy Sun systems. Leon3/SMP will be used in several projects
(and products), including the UC Berkeley RAMP massively parallel
computer system. We will make an effort to sync our leon3
port to the latest kernel version (2.6.21 ?), and try (again)
to submit the patches for review and inclusion in the
mainstream kernel.
Jiri Gaisler.
Gaisler Research.
Tom "spot" Callaway wrote:
> On Fri, 2007-03-23 at 17:42 -0400, Robert Reif wrote:
>> Any advice on how to approach fixing these issues? I can test
>> patches.
>
> Sadly, it seems that in the case of sparc32, no one who is willing is
> able, and no one who is able is willing.
>
> There is no maintainer, Dave Miller isn't willing to take it on (not
> that I blame him), and much of the hardware is starting to fail, making
> debugging that much more fun.
>
> Given that UP has so many issues, I don't even fathom how SMP is going
> to work right anytime soon.
>
> I'd be happy to be proven wrong here, but the likelyhood of a stable
> sparc32 kernel is very very low.
>
> ~spot
>
> -
> To unsubscribe from this list: send the line "unsubscribe sparclinux" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
>
^ permalink raw reply [flat|nested] 4+ messages in thread