Linux PARISC architecture development
* [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up
@ 2005-01-15 14:02 Joel Soete
  2005-01-15 14:41 ` Matthew Wilcox
  0 siblings, 1 reply; 4+ messages in thread
From: Joel Soete @ 2005-01-15 14:02 UTC (permalink / raw)
  To: List Parisc

Hi all,

To stress the kernel a bit, I launched my usual loop test on the n4k (2-way, 4 GB RAM):
# while true ; do readprofile -r ; make $OPTB clean ; make $OPTB oldconfig; make -j2 $OPTB vmlinux 2>&1 | tee -a /var/logs/k-loop; readprofile >> /var/logs/prof-loop.doc ; done

With the previous kernel, the kernel panicked at random, anywhere from 1 to more than 200 iterations in. With this 2.6.11-rc3-pa3,
after about 49 runs, some "Segmentation fault" errors occur:
make[3]: *** [drivers/pci/probe.o] Segmentation fault
make[3]: *** [drivers/block/ioctl.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/fpudispatch.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/frnd.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/denormal.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/frnd.o] Segmentation fault
make[3]: *** [drivers/md/raid0.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/frnd.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/frnd.o] Segmentation fault

in some child shells (I attached the dmesg output because it is very long).
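
A log like the one above can be summarised per object file. The following is a minimal sketch, not part of the original report: the function name `tally_segfaults` is hypothetical, and it assumes the make errors appear one per line in exactly the format shown above.

```python
import re
from collections import Counter

# Matches lines such as:
#   make[3]: *** [drivers/pci/probe.o] Segmentation fault
SEGV_RE = re.compile(
    r"^make\[\d+\]: \*\*\* \[(?P<target>[^\]]+)\] Segmentation fault"
)


def tally_segfaults(lines):
    """Count "Segmentation fault" make errors per target.

    `lines` is any iterable of strings (a list, or an open log file).
    Lines that do not match the make error format are ignored.
    """
    counts = Counter()
    for line in lines:
        match = SEGV_RE.match(line.strip())
        if match:
            counts[match.group("target")] += 1
    return counts
```

Passing an open handle on the build log (here /var/logs/k-loop) and printing `most_common()` would show whether the failures cluster on a few files, as they appear to do on arch/parisc/math-emu in the excerpt above, or are spread evenly.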

So I stopped this loop, rebooted with the same kernel built as UP (uniprocessor), and relaunched the loop.
After about 12 hours it has already run 80 times without any segmentation fault.

Any idea?

Thanks,
     Joel

PS: this latest kernel was built with gcc-3.3-hppa64 and binutils-hppa64, while the previous one was built with hppa64-gcc-3.0 and the same binutils.

[-- Attachment #2: PageFault.doc --]
[-- Type: application/msword, Size: 28529 bytes --]

_______________________________________________
parisc-linux mailing list
parisc-linux@lists.parisc-linux.org
http://lists.parisc-linux.org/mailman/listinfo/parisc-linux

* Re: [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up
  2005-01-15 14:02 [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up Joel Soete
@ 2005-01-15 14:41 ` Matthew Wilcox
       [not found]   ` <41E9952A.1000907@tiscali.be>
  0 siblings, 1 reply; 4+ messages in thread
From: Matthew Wilcox @ 2005-01-15 14:41 UTC (permalink / raw)
  To: Joel Soete; +Cc: List Parisc

On Sat, Jan 15, 2005 at 02:02:00PM +0000, Joel Soete wrote:
> With the previous kernel, the kernel panicked at random, anywhere from 1 to
> more than 200 iterations in. With this 2.6.11-rc3-pa3, after about 49 runs,
> some "Segmentation fault" errors occur:
> make[3]: *** [drivers/pci/probe.o] Segmentation fault
> 
> in some child shells (I attached the dmesg output because it is very long).
> 
> So I stopped this loop, rebooted with the same kernel built as UP
> (uniprocessor), and relaunched the loop.
> After about 12 hours it has already run 80 times without any segmentation fault.
> 
> Any idea?

I suspect the new clear_user_page code.  Could you try backing that out
and rerunning your test?

-- 
"Next the statesmen will invent cheap lies, putting the blame upon 
the nation that is attacked, and every man will be glad of those
conscience-soothing falsities, and will diligently study them, and refuse
to examine any refutations of them; and thus he will by and by convince 
himself that the war is just, and will thank God for the better sleep 
he enjoys after this process of grotesque self-deception." -- Mark Twain

* Re: [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up
       [not found]     ` <20050115224118.GC10070@colo.lackof.org>
@ 2005-01-16  0:32       ` Joel Soete
  2005-01-25 13:56         ` Same pb on b2k 64bit + 2.6.11-rc2-pa0 [Was: [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up] Joel Soete
  0 siblings, 1 reply; 4+ messages in thread
From: Joel Soete @ 2005-01-16  0:32 UTC (permalink / raw)
  To: Grant Grundler; +Cc: List Parisc, Matthew Wilcox

Hello Grant,

Sorry for the delay, but the n4k sits in the office, and to reach it from home I need to use a VPN connection, which disables my local LAN
(and also my POP3 access to my ISP :-( and obviously IRC).

Grant Grundler wrote:
> On Sat, Jan 15, 2005 at 10:11:54PM +0000, Joel Soete wrote:
> 
>>this patch:
>><http://cvs.parisc-linux.org/linux-2.6/arch/parisc/kernel/pacache.S?r1=1.9&r2=1.10> ?
> 
> 
> I don't think willy is referring to any particular issue.
> 
> 
>>Well, when in doubt, I reverted all changes since:
>><http://cvs.parisc-linux.org/*checkout*/linux-2.6/arch/parisc/kernel/pacache.S?rev=1.9.20.2>
>>
>>Btw, what do you suspect that Grant (and I) could have missed (we just halved
>>the number of loop iterations thanks to dword insns)?
> 
> 
> No - I mangled that. The code you submitted was correct in this regard.
Agreed (but what else could we imagine on a system which has only one atomic insn, iirc?)

> I fixed the off-by-2x bug in the next commit:
> http://cvs.parisc-linux.org/linux-2.6/arch/parisc/kernel/pacache.S?r1=1.10&r2=1.11
> 
> My suspicion is the prefetching might be causing problems since it's going
> past the end of the page. The kernel tmp alias pages might be causing problems
> since we don't flush past the end of those. (Or something like that).
> 
Sorry, I haven't yet understood all the details of prefetching.

> Can you remove only the prefetching in that algorithm?
> 
Yes, I did, and I have already run one iteration of the loop without a problem ... so we have to be patient to see whether the
first-mentioned problem recurs (failures after some number of loop iterations: about 50 last time) :-)

> 
>>But already strange results:
> 
> ...
> 
>>and after those first 11 failed it seems to continue??
> 
> 
> Do you mean after restarting the make the build completed?

No, I just let the loop continue (that doesn't make sense to me, but in fact I already encountered this case with 2.6.10-paXX and
didn't have time to report it).

Does nicol also encounter the same problem (just in case of a small difference in hw config)?

Thanks,
	Joel

* Same pb on b2k 64bit + 2.6.11-rc2-pa0 [Was: [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up]
  2005-01-16  0:32       ` Joel Soete
@ 2005-01-25 13:56         ` Joel Soete
  0 siblings, 0 replies; 4+ messages in thread
From: Joel Soete @ 2005-01-25 13:56 UTC (permalink / raw)
  To: Grant Grundler; +Cc: Matthew Wilcox, List Parisc

Hello Grant,
> 
[...]
> > My suspicion is the prefetching might be causing problems since it's going
> > past the end of the page. The kernel tmp alias pages might be causing problems
> > since we don't flush past the end of those. (Or something like that).
> > 
> Sorry, I haven't yet understood all the details of prefetching.
> 
> > Can you remove only the prefetching in that algorithm?
> > 
> Yes, I did, and I have already run one iteration of the loop without a
> problem ... so we have to be patient to see whether the first-mentioned
> problem recurs (failures after some number of loop iterations: about 50
> last time) :-)
> 
Hmm, I was about to forget why the n4k was looping on a kernel build, when
the b2k panicked due to the same problem with the latest 2.6.11-rc2-pa0 (+workaround):
patst005 login: Backtrace:
 [<000000001017cd00>] do_wp_page+0x250/0x950
 [<000000001017ee14>] handle_mm_fault+0x1b4/0x1e0
 [<0000000010110ec4>] do_page_fault+0x2bc/0x360
 [<000000001011299c>] handle_interruption+0x34c/0x748
 [<0000000010107094>] intr_check_sig+0x0/0xc


Kernel Fault: Code=15 regs=000000001d68c540 (Addr=0000000020000000)

     YZrvWESTHLNXBCVMcbcbcbcbOGFRQPDI
PSW: 00001000000001001110011100001111 Not tainted
r00-03  0000000000000000 0000000000000000 000000001017cd00 0000000010a0dfc8
r04-07  00000000105e2020 000000001d874000 0000000010983960 000000000007e2b0
r08-11  000000001d71cad0 000000001e7e9c00 000000000ffffb25 000000001d4043f0
r12-15  0000000010983960 000000001d8dc000 0000000000000001 000000000006a404
r16-19  000000001d432740 000000000006a404 000000000006b404 0000000000000000
r20-23  0000000000000000 0000000000000000 0000000000000000 0000000000000000
r24-27  00000000000000d2 0000000020000000 000000001d875000 00000000105e2020
r28-31  0000000010983960 000000001d68c510 000000001d68c540 0000000000000020
sr0-3   0000000000429800 0000000000000000 0000000000000000 0000000000429800
sr4-7   0000000000000000 0000000000000000 0000000000000000 0000000000000000

IASQ: 0000000000000000 0000000000000000 IAOQ: 00000000101012c4 00000000101012c8
 IIR: 0f2010d3    ISR: 0000000000000000  IOR: 0000000020000000
 CPU:        0   CR30: 000000001d68c000 CR31: 00000000105a0000
 ORIG_R28: 00000000103e3e68
 IAOQ[0]: copy_user_page_asm+0xa4/0xb0
 IAOQ[1]: copy_user_page_asm+0xa8/0xb0
 RP(r2): do_wp_page+0x250/0x950
Kernel panic - not syncing: Kernel Fault

So I applied the same workaround:
--- arch/parisc/kernel/pacache.S.Orig   2005-01-25 13:44:26.000000000 +0100
+++ arch/parisc/kernel/pacache.S        2005-01-25 13:44:55.000000000 +0100
@@ -306,13 +306,18 @@
 
        ldd             0(%r25), %r19           /* bundle 1 */
        ldi             32, %r1                 /* PAGE_SIZE/128 == 32 */
+
+#if    0       /* Comment out prefetching awaiting better fix */
        ldw             64(%r25), %r0           /* prefetch 1 cacheline ahead */
        ldw             128(%r25), %r0          /* prefetch 2 */
+#endif
 
 1:     ldd             8(%r25), %r20
+#if    0       /* Comment out prefetching awaiting better fix */
        ldw             192(%r25), %r0          /* prefetch 3 */
        ldw             256(%r25), %r0          /* prefetch 4 */
        nop     /* pad to preserve alignment */
+#endif
 
        ldd             16(%r25), %r21          /* bundle 2 */
        ldd             24(%r25), %r22
=========><=========

which helps too.

(I ran the kernel build loop on the SMP n4k for a week: 612 runs, of which
111 failed because of various page faults :-)
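
To make the "prefetching past the end of the page" suspicion concrete, here is a rough sketch of the arithmetic for the in-loop prefetches disabled in the patch above. It is only a model, not the kernel code: it takes the 32 iterations of 128 bytes and the offsets 192 and 256 from the patch, and it assumes (the patch excerpt does not show this) that the source pointer %r25 sits at iteration * 128 bytes into the page when the two prefetch loads are issued. The function name is hypothetical.

```python
PAGE_SIZE = 4096                 # from the patch comment: PAGE_SIZE/128 == 32
BYTES_PER_ITERATION = 128        # one loop pass copies 128 bytes
IN_LOOP_PREFETCH_OFFSETS = (192, 256)   # "prefetch 3" and "prefetch 4"


def prefetches_past_page(page_size=PAGE_SIZE,
                         step=BYTES_PER_ITERATION,
                         offsets=IN_LOOP_PREFETCH_OFFSETS):
    """List (iteration, offset_from_page_start) for every in-loop prefetch
    whose target lies at or beyond the end of the source page.

    Assumption: the source pointer equals iteration * step when the
    prefetch loads of that iteration are issued.
    """
    overruns = []
    for iteration in range(page_size // step):
        pointer = iteration * step
        for offset in offsets:
            target = pointer + offset
            if target >= page_size:
                overruns.append((iteration, target))
    return overruns
```

Under that assumption, the last two passes of the loop would touch up to 128 bytes into whatever follows the source page, which matches the kind of access the quoted suspicion describes. Whether this is what actually triggers the faults is not established in this thread.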

Thanks,
Joel

