* [parisc-linux] SMP kernel problems on a D350
@ 2002-09-19 7:38 Istvan Gyenes
0 siblings, 0 replies; 29+ messages in thread
From: Istvan Gyenes @ 2002-09-19 7:38 UTC (permalink / raw)
To: parisc-linux
Hello List,
I'm trying to compile an SMP kernel for my D 350 (2cpu) server without
success.
The kernel source is 2.4.19-pa18 and in non-smp configuration it works
well. Anyway the SMP kernel compiles fine but when I try to boot it
stops at the "If this is the last message you see, you may need to switch
your console" line. I've switched the console but got no other output.
(It was the same with 2.4.19-pa14)
The system was installed from a debian 3.0 CD.
Can somebody send me a working .config? Or what can be the problem?
Thanks in advance,
__
Steve
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
[not found] <200209190805.KAA0000032531@simba.sch.bme.hu>
@ 2002-09-19 9:17 ` Istvan Gyenes
2002-09-19 12:21 ` J.Steindlberger
2002-09-19 22:46 ` Grant Grundler
0 siblings, 2 replies; 29+ messages in thread
From: Istvan Gyenes @ 2002-09-19 9:17 UTC (permalink / raw)
To: J.Steindlberger; +Cc: parisc-linux
Hello Joerg,
The only difference between the non-smp and smp kernel config file is
CONFIG_SMP=yes , AFAIK.
I made a "make menuconfig" and the only thing I've changed is SMP support.
The strange thing is that the precompiled smp kernel from the install cd
boots fine. (2.4.18-smp)
BTW where can I select 32bit/64bit support?
__
Steve
On Thu, 19 Sep 2002, J.Steindlberger wrote:
> Hi Steve,
>
> I'm no developer, but I know, that there are a few restrictions with D-Class
> machines. Did You select 64bit support? This machine is a 32bit architecture.
>
> Regards
> Joerg
>
> On Thursday 19 September 2002 09:38, you wrote:
> > I'm trying to compile an SMP kernel for my D 350 (2cpu) server without
> > success.
> > The kernel source is 2.4.19-pa18 and in non-smp configuration it works
> > well. Anyway the SMP kernel compiles fine but when I try to boot it
> > stops at the "If this is the last message you see, you may need to switch
> > your console" line. I've switched the console but got no other output.
> > (It was the same with 2.4.19-pa14)
> > The system was installed from a debian 3.0 CD.
> > Can somebody send me a working .config? Or what can be the problem?
>
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-19 9:17 ` [parisc-linux] SMP kernel problems on a D350 Istvan Gyenes
@ 2002-09-19 12:21 ` J.Steindlberger
2002-09-19 12:29 ` Ryan Bradetich
2002-09-19 22:46 ` Grant Grundler
1 sibling, 1 reply; 29+ messages in thread
From: J.Steindlberger @ 2002-09-19 12:21 UTC (permalink / raw)
To: Istvan Gyenes; +Cc: parisc-linux
Hi,
You can choose 32bit/64bit in the "Processor type" section. And only if You
choose a processor that supports more than 32bit.
Did You try the config file included in the 2.4.18-smp kernel image package?
Perhaps it's different in more than that SMP-section. So far for my ideas. If
You still get problems, there is something to be fixed and the answering is
up to an expert -- not me ;-) , sorry. I hope I could help You anyway.
Joerg
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-19 12:21 ` J.Steindlberger
@ 2002-09-19 12:29 ` Ryan Bradetich
0 siblings, 0 replies; 29+ messages in thread
From: Ryan Bradetich @ 2002-09-19 12:29 UTC (permalink / raw)
To: J.Steindlberger; +Cc: Istvan Gyenes, parisc-linux
On Thu, 2002-09-19 at 06:21, J.Steindlberger wrote:
> Hi,
>
> You can choose 32bit/64bit in the "Processor type" section. And only if You
> choose a processor that supports more than 32bit.
ie PA8x00 chipset. From the hwdb the D350 has the following processor:
UL Proc 1-way T'100 (821/D250,D350) (Processor) (PA7200 (PCX-T'))
so 64-bit would not be supported on this system.
- Ryan
> Did You try the config file included in the 2.4.18-smp kernel image package?
> Perhaps it's different in more than that SMP-section. So far for my ideas. If
> You still get problems, there is something to be fixed and the answering is
> up to an expert -- not me ;-) , sorry. I hope I could help You anyway.
>
> Joerg
> _______________________________________________
> parisc-linux mailing list
> parisc-linux@lists.parisc-linux.org
> http://lists.parisc-linux.org/mailman/listinfo/parisc-linux
>
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-19 9:17 ` [parisc-linux] SMP kernel problems on a D350 Istvan Gyenes
2002-09-19 12:21 ` J.Steindlberger
@ 2002-09-19 22:46 ` Grant Grundler
2002-09-20 8:28 ` Istvan Gyenes
1 sibling, 1 reply; 29+ messages in thread
From: Grant Grundler @ 2002-09-19 22:46 UTC (permalink / raw)
To: Istvan Gyenes; +Cc: J.Steindlberger, parisc-linux
Istvan Gyenes wrote:
> Hello Joerg,
>
> The only difference between the non-smp and smp kernel config file is
> CONFIG_SMP=yes , AFAIK.
> I made a "make menuconfig" and the only thing I've changed is SMP support.
I'm paranoid. I do "make distclean" when doing anything other than
adding/removing drivers. Save/restore the .config if you need to before
running "make distclean". I don't trust the Makefiles to rebuild
everything correctly for "global" CONFIG_ changes like "SMP".
> The strange thing is that the precompiled smp kernel from the install cd
> boots fine. (2.4.18-smp)
SMP on 2.4.19 isn't as stable yet. So that's no surprise.
If you want to debug this further, define "EARLY_BOOTUP_DEBUG"
in arch/parisc/kernel/pdc_cons.c and you should get more output
about how far the kernel gets before it crashes/hangs.
grant
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-19 22:46 ` Grant Grundler
@ 2002-09-20 8:28 ` Istvan Gyenes
2002-09-20 19:48 ` Carlos O'Donell
2002-09-21 4:31 ` Grant Grundler
0 siblings, 2 replies; 29+ messages in thread
From: Istvan Gyenes @ 2002-09-20 8:28 UTC (permalink / raw)
To: Grant Grundler; +Cc: parisc-linux
Thanks I'll try that!
Another question: If 2.4.19 SMP not enough stable where can I find the
latest stable smp kernel source?
Thanks,
__
Steve
On Thu, 19 Sep 2002, Grant Grundler wrote:
> Istvan Gyenes wrote:
> > Hello Joerg,
> >
> > The only difference between the non-smp and smp kernel config file is
> > CONFIG_SMP=yes , AFAIK.
> > I made a "make menuconfig" and the only thing I've changed is SMP support.
>
> I'm paranoid. I do "make distclean" when doing anything other than
> adding/removing drivers. Save/restore the .config if you need to before
> running "make distclean". I don't trust the Makefiles to rebuild
> everything correctly for "global" CONFIG_ changes like "SMP".
>
> > The strange thing is that the precompiled smp kernel from the install cd
> > boots fine. (2.4.18-smp)
>
> SMP on 2.4.19 isn't as stable yet. So that's no surprise.
>
> If you want to debug this further, define "EARLY_BOOTUP_DEBUG"
> in arch/parisc/kernel/pdc_cons.c and you should get more output
> about how far the kernel gets before it crashes/hangs.
>
> grant
> _______________________________________________
> parisc-linux mailing list
> parisc-linux@lists.parisc-linux.org
> http://lists.parisc-linux.org/mailman/listinfo/parisc-linux
>
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 8:28 ` Istvan Gyenes
@ 2002-09-20 19:48 ` Carlos O'Donell
2002-09-20 20:02 ` Jeremy Drake
2002-09-21 4:31 ` Grant Grundler
1 sibling, 1 reply; 29+ messages in thread
From: Carlos O'Donell @ 2002-09-20 19:48 UTC (permalink / raw)
To: Istvan Gyenes; +Cc: Grant Grundler, parisc-linux
> Thanks I'll try that!
>
> Another question: If 2.4.19 SMP not enough stable where can I find the
> latest stable smp kernel source?
>
> Thanks,
>
> Steve
Nothing up this sleeve, nothing up this sleeve...
<carlos pulls a stable smp kernel out of his hat>
Tada! ;)
I'm not quite certain that we ever had a stable
SMP kernel. While an older kernel might seem to
give you SMP stability, it does so at the cost of
speed and the introduction of old bugs.
If you can find some test cases for Non-SMP vs.
SMP stability, then we'll be a step in the right
direction.
c.
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 19:48 ` Carlos O'Donell
@ 2002-09-20 20:02 ` Jeremy Drake
2002-09-20 20:37 ` Carlos O'Donell
2002-09-20 20:37 ` [parisc-linux] SMP kernel problems on a D350 Bdale Garbee
0 siblings, 2 replies; 29+ messages in thread
From: Jeremy Drake @ 2002-09-20 20:02 UTC (permalink / raw)
To: Carlos O'Donell; +Cc: Istvan Gyenes, Grant Grundler, parisc-linux
On Fri, 20 Sep 2002, Carlos O'Donell wrote:
> > Thanks I'll try that!
> >
> > Another question: If 2.4.19 SMP not enough stable where can I find the
> > latest stable smp kernel source?
> >
> > Thanks,
> >
> > Steve
>
> Nothing up this sleeve, nothing up this sleeve...
> <carlos pulls a stable smp kernel out of his hat>
>
> Tada! ;)
>
> I'm not quite certain that we ever had a stable
> SMP kernel. While an older kernel might seem to
> give you SMP stability, it does so at the cost of
> speed and the introduction of old bugs.
For me, the last kernel that didn't crash on my J5000 in smp mode while
doing apt-get update was kernel-image-2.4.17-32-smp_23.1_hppa.deb
I wouldn't recommend using it, however...
>
> If you can find some test cases for Non-SMP vs.
> SMP stability, then we'll be a step in the right
> direction.
>
> c.
>
> _______________________________________________
> parisc-linux mailing list
> parisc-linux@lists.parisc-linux.org
> http://lists.parisc-linux.org/mailman/listinfo/parisc-linux
>
--
Mason's First Law of Synergism:
The one day you'd sell your soul for something, souls are a glut.
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:02 ` Jeremy Drake
@ 2002-09-20 20:37 ` Carlos O'Donell
2002-09-20 20:46 ` John David Anglin
2002-09-21 3:38 ` [parisc-linux] malloc limits John David Anglin
2002-09-20 20:37 ` [parisc-linux] SMP kernel problems on a D350 Bdale Garbee
1 sibling, 2 replies; 29+ messages in thread
From: Carlos O'Donell @ 2002-09-20 20:37 UTC (permalink / raw)
To: Jeremy Drake; +Cc: Istvan Gyenes, Grant Grundler, parisc-linux
>
> For me, the last kernel that didn't crash on my J5000 in smp mode while
> doing apt-get update was kernel-image-2.4.17-32-smp_23.1_hppa.deb
>
> I wouldn't recommend using it, however...
If I _wasn't_ using my A500 for contiual binutils/glibc/gcc
builds, I'd be testing out the SMP problems :)
When running an SMP kernel and doing multiple compiles, the
box was rather unsuable e.g. random process death.
As Randolph noted to me on IRC, it looks like fixing the mmap
issues we have would be a step in the right direction. When I
get my current projects completed (glibc fixing)... I want to
look at this :) then again, maybe I'll be doing glibc work for
the rest of my usefull days ;)
We also fail many of the LTP tests having to do with
signals, mmap'ing, and direct IO.
c.
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:02 ` Jeremy Drake
2002-09-20 20:37 ` Carlos O'Donell
@ 2002-09-20 20:37 ` Bdale Garbee
2002-09-20 20:52 ` Carlos O'Donell
2002-09-20 23:11 ` Jeremy Drake
1 sibling, 2 replies; 29+ messages in thread
From: Bdale Garbee @ 2002-09-20 20:37 UTC (permalink / raw)
To: parisc-linux
jeremyd@apptechsys.com (Jeremy Drake) writes:
> For me, the last kernel that didn't crash on my J5000 in smp mode while
> doing apt-get update was kernel-image-2.4.17-32-smp_23.1_hppa.deb
I've had good luck with the 2.4.19 kernel images I've uploaded to unstable,
which are built and running on my J5000 in 64-bit SMP mode. I don't promise
they're "stable", but the apt-get update problem is gone.
Bdale
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:37 ` Carlos O'Donell
@ 2002-09-20 20:46 ` John David Anglin
2002-09-20 20:50 ` Randolph Chung
2002-09-21 3:38 ` [parisc-linux] malloc limits John David Anglin
1 sibling, 1 reply; 29+ messages in thread
From: John David Anglin @ 2002-09-20 20:46 UTC (permalink / raw)
To: Carlos O'Donell; +Cc: jeremyd, frts, grundler, parisc-linux
> When running an SMP kernel and doing multiple compiles, the
> box was rather unsuable e.g. random process death.
Is there a way to turn off the unaligned handler? It may be hiding
bad stuff going on in userland. There are still cases where expect
causes a continuous sequence of unaligned faults.
Dave
--
J. David Anglin dave.anglin@nrc.ca
National Research Council of Canada (613) 990-0752 (FAX: 952-6605)
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:46 ` John David Anglin
@ 2002-09-20 20:50 ` Randolph Chung
2002-09-20 20:55 ` Carlos O'Donell
2002-09-20 20:55 ` John David Anglin
0 siblings, 2 replies; 29+ messages in thread
From: Randolph Chung @ 2002-09-20 20:50 UTC (permalink / raw)
To: John David Anglin
Cc: Carlos O'Donell, jeremyd, frts, grundler, parisc-linux
> Is there a way to turn off the unaligned handler? It may be hiding
> bad stuff going on in userland. There are still cases where expect
> causes a continuous sequence of unaligned faults.
not at runtime, but i can build a kernel with this turned off and let
you test it.
randolph
--
Randolph Chung
Debian GNU/Linux Developer, hppa/ia64 ports
http://www.tausq.org/
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:37 ` [parisc-linux] SMP kernel problems on a D350 Bdale Garbee
@ 2002-09-20 20:52 ` Carlos O'Donell
2002-09-20 23:11 ` Jeremy Drake
1 sibling, 0 replies; 29+ messages in thread
From: Carlos O'Donell @ 2002-09-20 20:52 UTC (permalink / raw)
To: Bdale Garbee; +Cc: parisc-linux
>
> > For me, the last kernel that didn't crash on my J5000 in smp mode while
> > doing apt-get update was kernel-image-2.4.17-32-smp_23.1_hppa.deb
>
> I've had good luck with the 2.4.19 kernel images I've uploaded to unstable,
> which are built and running on my J5000 in 64-bit SMP mode. I don't promise
> they're "stable", but the apt-get update problem is gone.
What type of workloads do you have that machine doing?
c.
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:50 ` Randolph Chung
@ 2002-09-20 20:55 ` Carlos O'Donell
2002-09-21 23:20 ` Randolph Chung
2002-09-20 20:55 ` John David Anglin
1 sibling, 1 reply; 29+ messages in thread
From: Carlos O'Donell @ 2002-09-20 20:55 UTC (permalink / raw)
To: Randolph Chung; +Cc: John David Anglin, parisc-linux
> > Is there a way to turn off the unaligned handler? It may be hiding
> > bad stuff going on in userland. There are still cases where expect
> > causes a continuous sequence of unaligned faults.
>
> not at runtime, but i can build a kernel with this turned off and let
> you test it.
>
> randolph
Would be nice to have a proc interface for this.
I would like to do consecutive testing with it
enabled and disabled.
c.
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:50 ` Randolph Chung
2002-09-20 20:55 ` Carlos O'Donell
@ 2002-09-20 20:55 ` John David Anglin
2002-09-20 21:51 ` Randolph Chung
1 sibling, 1 reply; 29+ messages in thread
From: John David Anglin @ 2002-09-20 20:55 UTC (permalink / raw)
To: randolph; +Cc: carlos, jeremyd, frts, grundler, parisc-linux
> > Is there a way to turn off the unaligned handler? It may be hiding
> > bad stuff going on in userland. There are still cases where expect
> > causes a continuous sequence of unaligned faults.
>
> not at runtime, but i can build a kernel with this turned off and let
> you test it.
Is there any software that actually needs the unaligned handler?
I think it would be useful for test purposes to have a kernel with
it off.
Dave
--
J. David Anglin dave.anglin@nrc.ca
National Research Council of Canada (613) 990-0752 (FAX: 952-6605)
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:55 ` John David Anglin
@ 2002-09-20 21:51 ` Randolph Chung
0 siblings, 0 replies; 29+ messages in thread
From: Randolph Chung @ 2002-09-20 21:51 UTC (permalink / raw)
To: John David Anglin; +Cc: carlos, jeremyd, frts, grundler, parisc-linux
> Is there any software that actually needs the unaligned handler?
> I think it would be useful for test purposes to have a kernel with
> it off.
off the top of my head....
in the kernel, the usb driver has some unaligned accesses.
in userspace, several of the network utilities (tcpdump, nmap, etc) make
unaligned accesses
randolph
--
Randolph Chung
Debian GNU/Linux Developer, hppa/ia64 ports
http://www.tausq.org/
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:37 ` [parisc-linux] SMP kernel problems on a D350 Bdale Garbee
2002-09-20 20:52 ` Carlos O'Donell
@ 2002-09-20 23:11 ` Jeremy Drake
2002-09-20 23:46 ` Jeremy Drake
2002-09-20 23:55 ` Robert Stanford
1 sibling, 2 replies; 29+ messages in thread
From: Jeremy Drake @ 2002-09-20 23:11 UTC (permalink / raw)
To: Bdale Garbee; +Cc: parisc-linux
On 20 Sep 2002, Bdale Garbee wrote:
> jeremyd@apptechsys.com (Jeremy Drake) writes:
>
> > For me, the last kernel that didn't crash on my J5000 in smp mode while
> > doing apt-get update was kernel-image-2.4.17-32-smp_23.1_hppa.deb
>
> I've had good luck with the 2.4.19 kernel images I've uploaded to unstable,
> which are built and running on my J5000 in 64-bit SMP mode. I don't promise
> they're "stable", but the apt-get update problem is gone.
I have installed 2.4.19-32-smp and 2.4.19-64-smp from unstable. The
32-bit one seems to still have the apt-get update problem, and will not
run X (no surprise there). However, 64bit seems to be fairly stable (will
do apt-get update and will run X). And here I thought 64-bit was LESS
stable than 32 :)
> > Bdale
> _______________________________________________
> parisc-linux mailing list
> parisc-linux@lists.parisc-linux.org
> http://lists.parisc-linux.org/mailman/listinfo/parisc-linux
>
--
The rose of yore is but a name, mere names are left to us.
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 23:11 ` Jeremy Drake
@ 2002-09-20 23:46 ` Jeremy Drake
2002-09-20 23:55 ` Robert Stanford
1 sibling, 0 replies; 29+ messages in thread
From: Jeremy Drake @ 2002-09-20 23:46 UTC (permalink / raw)
To: Bdale Garbee; +Cc: parisc-linux
Sorry for replying to myself, but I forgot to mention the one problem I
have had with the 2.4.19-64-smp so far. Setserial locked it up on
2.4.19-64-smp (no hpmc or error message, just locked up). After disabling
setserial, everything was fine.
On Fri, 20 Sep 2002, Jeremy Drake wrote:
> On 20 Sep 2002, Bdale Garbee wrote:
>
> > jeremyd@apptechsys.com (Jeremy Drake) writes:
> >
> > > For me, the last kernel that didn't crash on my J5000 in smp mode while
> > > doing apt-get update was kernel-image-2.4.17-32-smp_23.1_hppa.deb
> >
> > I've had good luck with the 2.4.19 kernel images I've uploaded to unstable,
> > which are built and running on my J5000 in 64-bit SMP mode. I don't promise
> > they're "stable", but the apt-get update problem is gone.
> I have installed 2.4.19-32-smp and 2.4.19-64-smp from unstable. The
> 32-bit one seems to still have the apt-get update problem, and will not
> run X (no surprise there). However, 64bit seems to be fairly stable (will
> do apt-get update and will run X). And here I thought 64-bit was LESS
> stable than 32 :)
>
>
> > > Bdale
> > _______________________________________________
> > parisc-linux mailing list
> > parisc-linux@lists.parisc-linux.org
> > http://lists.parisc-linux.org/mailman/listinfo/parisc-linux
> >
>
>
--
"You need tender loving care once a week - so that I can slap you into shape."
- Ellyn Mustard
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 23:11 ` Jeremy Drake
2002-09-20 23:46 ` Jeremy Drake
@ 2002-09-20 23:55 ` Robert Stanford
1 sibling, 0 replies; 29+ messages in thread
From: Robert Stanford @ 2002-09-20 23:55 UTC (permalink / raw)
To: Parisc
Well things seem to be getting better on my 580
k580:~# uname -a
Linux k580 2.4.19-pa18 #26 SMP Sat Sep 21 09:09:22 EST 2002 parisc
unknown
k580:~# apt-get update
Get:1 http://ftp.au.debian.org unstable/main Packages [1962kB]
Get:2 http://ftp.au.debian.org unstable/main Release [82B]
Get:3 http://ftp.au.debian.org unstable/non-free Packages [50.0kB]
Get:4 http://ftp.au.debian.org unstable/non-free Release [86B]
Get:5 http://ftp.au.debian.org unstable/contrib Packages [47.8kB]
Get:6 http://ftp.au.debian.org unstable/contrib Release [85B]
Fetched 2060kB in 11m39s (2944B/s)
apt-get(181): unaligned access to 0x403ce08c at ip=0x4005e4f7
apt-get(181): unaligned access to 0xef20c024 at ip=0x4005e4fb
isr verification failed (isr: 00000000, sr7: 000000ac
Unaligned handler failed, ret = 1
k580:~# /etc/init.d/samba start
Starting Samba daemons: nmbd smbdsmbd(175): unaligned access to
0x4001a2b8 at if
Robert Stanford
^ permalink raw reply [flat|nested] 29+ messages in thread
* [parisc-linux] malloc limits
2002-09-20 20:37 ` Carlos O'Donell
2002-09-20 20:46 ` John David Anglin
@ 2002-09-21 3:38 ` John David Anglin
2002-09-21 4:14 ` Matthew Wilcox
1 sibling, 1 reply; 29+ messages in thread
From: John David Anglin @ 2002-09-21 3:38 UTC (permalink / raw)
To: Carlos O'Donell; +Cc: parisc-linux
In looking at the failure of the gcc v3 pthread2 test, I see it dies
with a segv in chunk_free when next is larger than 0x80000000:
#0 chunk_free (ar_ptr=0x400ad16c, p=0x4010c744) at malloc.c:3179
3179 nextsz = chunksize(next);
(gdb) p next
$2 = (struct malloc_chunk *) 0x802191cc
I thought there was a flat memory model. If so, shouldn't it be possible
for the data section to expand past 0x80000000?
The test will pass if I cut max_loop_count to 30000.
Dave
--
J. David Anglin dave.anglin@nrc.ca
National Research Council of Canada (613) 990-0752 (FAX: 952-6605)
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] malloc limits
2002-09-21 3:38 ` [parisc-linux] malloc limits John David Anglin
@ 2002-09-21 4:14 ` Matthew Wilcox
2002-09-21 4:46 ` Grant Grundler
0 siblings, 1 reply; 29+ messages in thread
From: Matthew Wilcox @ 2002-09-21 4:14 UTC (permalink / raw)
To: John David Anglin; +Cc: Carlos O'Donell, parisc-linux
On Fri, Sep 20, 2002 at 11:38:37PM -0400, John David Anglin wrote:
> I thought there was a flat memory model. If so, shouldn't it be possible
> for the data section to expand past 0x80000000?
There is a flat memory model... libs are mapped at 0x4000'0000 so that's
not it. worth looking at /proc/$pid/maps for that process, maybe?
--
Revolutions do not require corporate support.
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 8:28 ` Istvan Gyenes
2002-09-20 19:48 ` Carlos O'Donell
@ 2002-09-21 4:31 ` Grant Grundler
1 sibling, 0 replies; 29+ messages in thread
From: Grant Grundler @ 2002-09-21 4:31 UTC (permalink / raw)
To: Istvan Gyenes; +Cc: parisc-linux
Istvan Gyenes wrote:
> Thanks I'll try that!
>
> Another question: If 2.4.19 SMP not enough stable where can I find the
> latest stable smp kernel source?
I'd advise using the 2.4.19 images uploaded by Bdale to debian.org.
Mostly because it's "fall-out-of-bed" easy to get matching source in
case you need to change something or want to try something out.
If that doesn't work for you, for A500, one of the better ones is:
ftp://ftp.parisc-linux.org/kernels/a500/2.4.18-pa54.tgz
Look in kernels/32 or kernel/64 for other revs using default configs.
I'm convinced SMP instability is because of timing (race conditions)
and/or D-cache problems. My gut feeling if the 4-way associative
cache isn't getting flushed properly in all locations it needs to be.
I'm hoping someone who has more clue about VM and virtually indexed
caches could dig into this.
hth,
grant
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] malloc limits
2002-09-21 4:14 ` Matthew Wilcox
@ 2002-09-21 4:46 ` Grant Grundler
2002-09-21 5:24 ` John David Anglin
0 siblings, 1 reply; 29+ messages in thread
From: Grant Grundler @ 2002-09-21 4:46 UTC (permalink / raw)
To: John David Anglin; +Cc: Matthew Wilcox, Carlos O'Donell, parisc-linux
Matthew Wilcox wrote:
> On Fri, Sep 20, 2002 at 11:38:37PM -0400, John David Anglin wrote:
> > I thought there was a flat memory model. If so, shouldn't it be possible
> > for the data section to expand past 0x80000000?
>
> There is a flat memory model... libs are mapped at 0x4000'0000 so that's
> not it. worth looking at /proc/$pid/maps for that process, maybe?
is 0x80000000 the address or the size?
If it's the size then you get up into 0xc0000000 (which is ok).
Getting up into 0xf0000000 - 0xffffffff address is not.
grant
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] malloc limits
2002-09-21 4:46 ` Grant Grundler
@ 2002-09-21 5:24 ` John David Anglin
2002-09-21 22:33 ` Grant Grundler
0 siblings, 1 reply; 29+ messages in thread
From: John David Anglin @ 2002-09-21 5:24 UTC (permalink / raw)
To: Grant Grundler; +Cc: willy, carlos, parisc-linux
> is 0x80000000 the address or the size?
It's the address of the next contiguous chunk. This is roughly the sum
of the address plus the size of the chunk to be freed. The segv occurs
loading the size of the next chunk using the address.
I haven't been successful debugging the code directly. I can get the
code to seg fault by setting SIG37 to nostop noprint, but the debugger
seems to think the fault occurs following the INLINE_SYSCALL in
__sigsuspend. However, the address points to an ldi instruction
which can't seg fault, so I don't know what's up. The data that
I posted were from a core dump.
> If it's the size then you get up into 0xc0000000 (which is ok).
> Getting up into 0xf0000000 - 0xffffffff address is not.
Dave
--
J. David Anglin dave.anglin@nrc.ca
National Research Council of Canada (613) 990-0752 (FAX: 952-6605)
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] malloc limits
2002-09-21 5:24 ` John David Anglin
@ 2002-09-21 22:33 ` Grant Grundler
2002-09-22 5:43 ` John David Anglin
0 siblings, 1 reply; 29+ messages in thread
From: Grant Grundler @ 2002-09-21 22:33 UTC (permalink / raw)
To: John David Anglin; +Cc: willy, carlos, parisc-linux
"John David Anglin" wrote:
> It's the address of the next contiguous chunk. This is roughly the sum
> of the address plus the size of the chunk to be freed. The segv occurs
> loading the size of the next chunk using the address.
I'll assume this is happening on the A500 (PA2.0) and wonder if it's
a signed/unsigned bug. Look closely at how PA2.0 extends register
values and make sure code is treating addresses and sizes as unsigned.
> I haven't been successful debugging the code directly. I can get the
> code to seg fault by setting SIG37 to nostop noprint, but the debugger
> seems to think the fault occurs following the INLINE_SYSCALL in
> __sigsuspend. However, the address points to an ldi instruction
> which can't seg fault, so I don't know what's up.
Not all instructions trap precisely. FP ops definitely do not and
I thought a few others didn't either.
I'm wondering what happens when unaligned access should segfault.
Does the unaligned code handle check for that?
I'll take a quick look at that code path.
thanks,
grant
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-20 20:55 ` Carlos O'Donell
@ 2002-09-21 23:20 ` Randolph Chung
2002-09-22 0:57 ` Grant Grundler
0 siblings, 1 reply; 29+ messages in thread
From: Randolph Chung @ 2002-09-21 23:20 UTC (permalink / raw)
To: Carlos O'Donell, John David Anglin, parisc-linux
> Would be nice to have a proc interface for this.
> I would like to do consecutive testing with it
> enabled and disabled.
ftp://ftp.parisc-linux.org/patches/unaligned-procfs.diff
legolas:/home/randolph# cat /proc/sys/kernel/unaligned
Unaligned trap handler is enabled
legolas:/home/randolph# ./t; echo $?
0
legolas:/home/randolph# echo 0 >> /proc/sys/kernel/unaligned
legolas:/home/randolph# cat /proc/sys/kernel/unaligned
Unaligned trap handler is not enabled
legolas:/home/randolph# ./t; echo $?
Bus error
138
legolas:/home/randolph# echo 1 >> /proc/sys/kernel/unaligned
legolas:/home/randolph# cat /proc/sys/kernel/unaligned
Unaligned trap handler is enabled
legolas:/home/randolph# ./t; echo $?
0
if someone can review this real quick before i commit it to cvs, i'd
appreciate it. in particular, the point where it decides that the
unaligned trap is not enabled and forces the SIGBUS is not exactly at
the beginning of the trap handler -- it still prints the unaligned
message....
randolph
--
Randolph Chung
Debian GNU/Linux Developer, hppa/ia64 ports
http://www.tausq.org/
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
2002-09-21 23:20 ` Randolph Chung
@ 2002-09-22 0:57 ` Grant Grundler
0 siblings, 0 replies; 29+ messages in thread
From: Grant Grundler @ 2002-09-22 0:57 UTC (permalink / raw)
To: Randolph Chung; +Cc: parisc-linux
Randolph Chung wrote:
> legolas:/home/randolph# cat /proc/sys/kernel/unaligned
> Unaligned trap handler is enabled
> legolas:/home/randolph# ./t; echo $?
> 0
> legolas:/home/randolph# echo 0 >> /proc/sys/kernel/unaligned
Cool!
After reviewing the diff (on ftp.p-l.o/patches), only two nits
that have nothing to do with the code:
o cat output should relate to what I have to "echo" into the /proc file.
ie only display '0' or '1' when catting.
Or is "blah is enabled" by convention?
o SYSCTL_FILENAME should be "sys/kernel/unaligned_trap"
and then I think 0 or 1 should be clear enough to anyone
daring to mess with it.
> if someone can review this real quick before i commit it to cvs,
If you don't like my suggestions, I'm ok with committing
what you've already got.
> i'd
> appreciate it. in particular, the point where it decides that the
> unaligned trap is not enabled and forces the SIGBUS is not exactly at
> the beginning of the trap handler -- it still prints the unaligned
> message....
hmmm...if running under a debugger, one gets that info anyway.
But that's not always easy to do. I think it's ok since we don't
like to see unligned traps happen anyway.
Maybe a "unaligned_trap_msg" tunable?
/me runs...
thanks
grant
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] malloc limits
2002-09-21 22:33 ` Grant Grundler
@ 2002-09-22 5:43 ` John David Anglin
0 siblings, 0 replies; 29+ messages in thread
From: John David Anglin @ 2002-09-22 5:43 UTC (permalink / raw)
To: Grant Grundler; +Cc: willy, carlos, parisc-linux
> I'll assume this is happening on the A500 (PA2.0) and wonder if it's
> a signed/unsigned bug. Look closely at how PA2.0 extends register
> values and make sure code is treating addresses and sizes as unsigned.
This is the code that adds the chunk pointer plus size of chunk and
then tries to load the size of the next check:
0x402611d4 <chunk_free+32>: add,l r25,ret1,r31
0x402611d8 <chunk_free+36>: ldw 4(sr0,r31),r20
The add is a 64-bit add on a PA2.0 machine, so the result won't be
signed extended. My understanding is that the upper 32-bits are
truncated when the PSW W bit is zero. So, it isn't obvious to
me how this can be a signed/unsigned bug unless it is in the
kernel.
> > I haven't been successful debugging the code directly. I can get the
> > code to seg fault by setting SIG37 to nostop noprint, but the debugger
> > seems to think the fault occurs following the INLINE_SYSCALL in
> > __sigsuspend. However, the address points to an ldi instruction
> > which can't seg fault, so I don't know what's up.
>
> Not all instructions trap precisely. FP ops definitely do not and
> I thought a few others didn't either.
>
> I'm wondering what happens when unaligned access should segfault.
> Does the unaligned code handle check for that?
> I'll take a quick look at that code path.
There is definitely something strange with this program. It doesn't
seg fault 100% of the time. This suggests either a timing/lock problem
or something that isn't being properly initialized. I don't know
how to debug it under gdb because it seems to change the way traps
are handled. When I set a break, it appears that the code under test
catches the trap instead of gdb. The system also dumps core.
I've tried setting breaks in chunk_free and __pthread_mutex_lock
where the unaligned faults occur with a condition matching the
unaligned pointer value which i see in /var/log/debug. However,
I get the following:
Program received signal SIGTRAP, Trace/breakpoint trap.
0x4021e114 in __sigsuspend (set=0x25)
at ../sysdeps/unix/sysv/linux/sigsuspend.c:45
45 return INLINE_SYSCALL (rt_sigsuspend, 2, CHECK_SIGSET (set), _NSIG / 8);
(gdb) info proc
process 20194
cmdline = '/home/dave/pthread2.x0g'
warning: unable to read link '/proc/20194/cwd'
warning: unable to read link '/proc/20194/exe'
dave 20193 20041 0 21:41 pts/2 00:00:02 gdb pthread2.x0g
dave 20194 20193 0 21:43 pts/2 00:00:00 /home/dave/pthread2.x0g
dave 20199 20194 0 21:46 pts/2 00:00:00 [pthread2.x0g <defunct>]
I tried setting follow-fork-mode to child but it doesn't seem to follow
the child.
I don't think fp exceptions are involved.
I can see in debug that two traps occur associated with each run. They
are both type 15 (Data TLB Miss Fault) and they seem to both occur at
the same location.
The program pthread2.x0g is in my home directory on gsyprf11. If you
want to try it, it probably best to set
LD_LIBRARY_PATH=/home/dave/opt/gnu/lib
It may take several tries to get it to seg fault.
Dave
--
J. David Anglin dave.anglin@nrc.ca
National Research Council of Canada (613) 990-0752 (FAX: 952-6605)
^ permalink raw reply [flat|nested] 29+ messages in thread
* Re: [parisc-linux] SMP kernel problems on a D350
[not found] <20020921044102.3DD104829@dsl2.external.hp.com>
@ 2002-09-23 17:23 ` Jeremy Drake
0 siblings, 0 replies; 29+ messages in thread
From: Jeremy Drake @ 2002-09-23 17:23 UTC (permalink / raw)
To: Grant Grundler; +Cc: parisc-linux
On Fri, 20 Sep 2002, Grant Grundler wrote:
> If you have time, could you reproduce the "lockup" and then hit "TOC" button?
> If you cancel autoboot and get a PDC prompt (eg "BOOT_ADMIN>"), "ser pim"
> output will contain machine state when it was TOCed. Capture and Post that
> output to the mailing list and someone might see what the problem is.
>
OK. Here it is.... The kernel is 2.4.19-64-smp from unstable, version
18.1, so the System.map can be found there...
krakatoa:~# /bin/sh /etc/init.d/setserial start
Loading the saved-state of the serial devices...
Cannot set serial info: Device or resource busy
/dev/ttyS0 at 0x03f8 (irq = 195) is a 16550A
Firmware Version 5.0
Duplex Console IO Dependent Code (IODC) revision 1
------------------------------------------------------------------------------
(c) Copyright 1995-2000, Hewlett-Packard Company, All rights reserved
------------------------------------------------------------------------------
Processor Speed State Coprocessor State I/D Cache
--------- -------- --------------------- ----------------- -------------
0 440 MHz Active Functional 512 kB/1 MB
1 440 MHz Idle Functional 512 kB/1 MB
Central Bus Speed: 120 MHz
Available memory: 536870912 bytes
Good memory required: 46678016 bytes
Primary boot path: FWSCSI.5.0
Alternate boot path: FWSCSI.6.0
Console path: SERIAL_1.9600.8.none
Keyboard path: PCI8.0.0
Processor is booting from first available device.
To discontinue, press any key within 10 seconds.
\aBoot terminated.
----- Main Menu -------------------------------------------------------------
Command Description
------- -----------
BOot [PRI|ALT|<path>] Boot from specified path
PAth [PRI|ALT|CON|KEY [<path>]] Display or modify a path
SEArch [DIsplay|[[IPL] [<path>]]] Search for boot devices
COnfiguration [<command>] Access Configuration menu/commands
INformation [<command>] Access Information menu/commands
SERvice [<command>] Access Service menu/commands
DIsplay Redisplay the current menu
HElp [<menu>|<command>] Display help for menu or command
RESET Restart the system
-----
Main Menu: Enter command > ser pim toc
PROCESSOR PIM INFORMATION
----------------- Processor 0 TOC Information -------------------
General Registers 0 - 31
00-03 0000000000000000 0000000010478480 000000001010ded4 0000000010435ac8
04-07 0000000000000041 0000000000000001 0000000000000000 0000000000000000
08-11 000000001054b5a0 0000000000000003 00000000105b8b40 00000000105646cc
12-15 0000000000000000 00000000ffffffff 0000000000000001 00000000f0400004
16-19 00000000105b8b40 00000000f000017c 00000000f0000174 0000000000000000
20-23 0000000000000000 0000000000000000 00000000104456a0 00000000105555a0
24-27 00000000105b8b40 0000000000000041 0000000010435ac8 000000001054b5a0
28-31 0000000000000000 00000000105b8ef0 00000000105b9000 000000001055e5a0
<Press any key to continue (q to quit)>
Control Registers 0 - 31
00-03 0000000000000000 0000000000000000 0000000000000000 0000000000000000
04-07 0000000000000000 0000000000000000 0000000000000000 0000000000000000
08-11 000000000000295a 0000000000000000 00000000000000c0 000000000000003f
12-15 0000000000000000 0000000000000000 0000000000107000 0000000000000000
16-19 00005f379490dc48 0000000000000000 000000001010de1c 0000000000000000
20-23 0000000000000000 0000000000000000 0000007f082cff0e a000000000000000
24-27 0000000000487000 0000000015bc1000 0000000000044021 00000000f0412000
28-31 0000000055555555 0000000055555555 00000000105b8000 00000000105c0000
Space Registers 0 - 7
00-03 000a5680 000a5680 00000000 000a5680
04-07 00000000 00000000 00000000 00000000
IIA Space = 0x0000000000000000
IIA Offset = 0x000000001010de0c
CPU State = 0x9e000001
Main Menu: Enter command >
Main Menu: Enter command > reset
Resetting...
--
If a child annoys you, quiet him by brushing his hair. If this doesn't
work, use the other side of the brush on the other end of the child.
^ permalink raw reply [flat|nested] 29+ messages in thread
end of thread, other threads:[~2002-09-23 17:23 UTC | newest]
Thread overview: 29+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <200209190805.KAA0000032531@simba.sch.bme.hu>
2002-09-19 9:17 ` [parisc-linux] SMP kernel problems on a D350 Istvan Gyenes
2002-09-19 12:21 ` J.Steindlberger
2002-09-19 12:29 ` Ryan Bradetich
2002-09-19 22:46 ` Grant Grundler
2002-09-20 8:28 ` Istvan Gyenes
2002-09-20 19:48 ` Carlos O'Donell
2002-09-20 20:02 ` Jeremy Drake
2002-09-20 20:37 ` Carlos O'Donell
2002-09-20 20:46 ` John David Anglin
2002-09-20 20:50 ` Randolph Chung
2002-09-20 20:55 ` Carlos O'Donell
2002-09-21 23:20 ` Randolph Chung
2002-09-22 0:57 ` Grant Grundler
2002-09-20 20:55 ` John David Anglin
2002-09-20 21:51 ` Randolph Chung
2002-09-21 3:38 ` [parisc-linux] malloc limits John David Anglin
2002-09-21 4:14 ` Matthew Wilcox
2002-09-21 4:46 ` Grant Grundler
2002-09-21 5:24 ` John David Anglin
2002-09-21 22:33 ` Grant Grundler
2002-09-22 5:43 ` John David Anglin
2002-09-20 20:37 ` [parisc-linux] SMP kernel problems on a D350 Bdale Garbee
2002-09-20 20:52 ` Carlos O'Donell
2002-09-20 23:11 ` Jeremy Drake
2002-09-20 23:46 ` Jeremy Drake
2002-09-20 23:55 ` Robert Stanford
2002-09-21 4:31 ` Grant Grundler
[not found] <20020921044102.3DD104829@dsl2.external.hp.com>
2002-09-23 17:23 ` Jeremy Drake
2002-09-19 7:38 Istvan Gyenes
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.