* [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up
@ 2005-01-15 14:02 Joel Soete
2005-01-15 14:41 ` Matthew Wilcox
0 siblings, 1 reply; 4+ messages in thread
From: Joel Soete @ 2005-01-15 14:02 UTC (permalink / raw)
To: List Parisc
[-- Attachment #1: Type: text/plain, Size: 1678 bytes --]
Hi all,
To stress the kernel a bit, I launched my usual loop test on the n4k (2-way, 4 GB RAM):
# while true ; do readprofile -r ; make $OPTB clean ; make $OPTB oldconfig; make -j2 $OPTB vmlinux 2>&1 | tee -a /var/logs/k-loop;
readprofile >> /var/logs/prof-loop.doc ; done
With the previous kernel this made the kernel panic at random, after anywhere from 1 to more than 200 iterations. With this
2.6.11-rc3-pa3, after about 49 runs, some "Segmentation fault" errors occurred:
make[3]: *** [drivers/pci/probe.o] Segmentation fault
make[3]: *** [drivers/block/ioctl.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/fpudispatch.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/frnd.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/denormal.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/frnd.o] Segmentation fault
make[3]: *** [drivers/md/raid0.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/frnd.o] Segmentation fault
make[2]: *** [arch/parisc/math-emu/frnd.o] Segmentation fault
in some child shells (I attached the dmesg output because it is very long).
So I stopped this loop, rebooted with the same kernel built as UP, and relaunched the loop.
After about 12 hours it has already run 80 times without any segmentation fault.
Any ideas?
Thanks,
Joel
PS: this latest kernel was built with gcc-3.3-hppa64 and binutils-hppa64, while the previous one was built with hppa64-gcc-3.0 and the same binutils.
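To see which objects fail most often across many loop iterations, the make errors in the log can be tallied per target. The following is only a minimal sketch: the regular expression is modelled on the error lines quoted above, `tally_segfaults` is a hypothetical helper name, and the sample list stands in for the log file written by `tee -a`.

```python
import re
from collections import Counter

# Matches GNU make error lines of the form seen in the report, e.g.
#   make[3]: *** [drivers/pci/probe.o] Segmentation fault
SEGV_RE = re.compile(
    r"^make\[\d+\]: \*\*\* \[(?P<target>[^\]]+)\] Segmentation fault"
)


def tally_segfaults(lines):
    """Count 'Segmentation fault' make errors per build target."""
    counts = Counter()
    for line in lines:
        match = SEGV_RE.match(line.strip())
        if match:
            counts[match.group("target")] += 1
    return counts


# Sample lines copied from the report, plus one ordinary build line that
# must be ignored. A real run would iterate over the log file instead.
sample = [
    "make[3]: *** [drivers/pci/probe.o] Segmentation fault",
    "make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault",
    "make[2]: *** [arch/parisc/math-emu/driver.o] Segmentation fault",
    "make[2]: *** [arch/parisc/math-emu/frnd.o] Segmentation fault",
    "  CC      drivers/md/raid0.o",
]

counts = tally_segfaults(sample)
for target, n in counts.most_common():
    print(f"{n:3d}  {target}")
```

Passing an open file object instead of `sample` works the same way, since the function only iterates over lines.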
[-- Attachment #2: PageFault.doc --]
[-- Type: application/msword, Size: 28529 bytes --]
[-- Attachment #3: Type: text/plain, Size: 169 bytes --]
_______________________________________________
parisc-linux mailing list
parisc-linux@lists.parisc-linux.org
http://lists.parisc-linux.org/mailman/listinfo/parisc-linux
* Re: [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up
2005-01-15 14:02 [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up Joel Soete
@ 2005-01-15 14:41 ` Matthew Wilcox
[not found] ` <41E9952A.1000907@tiscali.be>
0 siblings, 1 reply; 4+ messages in thread
From: Matthew Wilcox @ 2005-01-15 14:41 UTC (permalink / raw)
To: Joel Soete; +Cc: List Parisc
On Sat, Jan 15, 2005 at 02:02:00PM +0000, Joel Soete wrote:
> With the previous kernel this made the kernel panic at random, after
> anywhere from 1 to more than 200 iterations. With this 2.6.11-rc3-pa3,
> after about 49 runs, some "Segmentation fault" errors occurred:
> make[3]: *** [drivers/pci/probe.o] Segmentation fault
>
> in some child shells (I attached the dmesg output because it is very long).
>
> So I stopped this loop, rebooted with the same kernel built as UP, and
> relaunched the loop.
> After about 12 hours it has already run 80 times without any segmentation fault.
>
> Any ideas?
I suspect the new clear_user_page code. Could you try backing that out
and rerunning your test?
--
"Next the statesmen will invent cheap lies, putting the blame upon
the nation that is attacked, and every man will be glad of those
conscience-soothing falsities, and will diligently study them, and refuse
to examine any refutations of them; and thus he will by and by convince
himself that the war is just, and will thank God for the better sleep
he enjoys after this process of grotesque self-deception." -- Mark Twain
* Re: [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up
[not found] ` <20050115224118.GC10070@colo.lackof.org>
@ 2005-01-16 0:32 ` Joel Soete
2005-01-25 13:56 ` Same pb on b2k 64bit + 2.6.11-rc2-pa0 [Was: [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up] Joel Soete
0 siblings, 1 reply; 4+ messages in thread
From: Joel Soete @ 2005-01-16 0:32 UTC (permalink / raw)
To: Grant Grundler; +Cc: List Parisc, Matthew Wilcox
Hello Grant,
Sorry for the delay, but the n4k sits in the office, and to reach it from home I need to use a VPN connection, which disables my
local LAN (and also POP3 access to my ISP :-( and obviously IRC).
Grant Grundler wrote:
> On Sat, Jan 15, 2005 at 10:11:54PM +0000, Joel Soete wrote:
>
>>this patch:
>><http://cvs.parisc-linux.org/linux-2.6/arch/parisc/kernel/pacache.S?r1=1.9&r2=1.10> ?
>
>
> I don't think willy is referring to any particular issue.
>
>
>>Well, to be safe I reverted all changes since:
>><http://cvs.parisc-linux.org/*checkout*/linux-2.6/arch/parisc/kernel/pacache.S?rev=1.9.20.2>
>>
>>Btw, what do you suspect that Grant (and I) could have missed (we just divided
>>the number of loop iterations by 2 thanks to dword insns)?
>
>
> No - I mangled that. The code you submitted was correct in this regard.
Agreed (but what could we imagine on a system that has only one atomic insn, IIRC?)
> I fixed the off-by-2x bug in the next commit:
> http://cvs.parisc-linux.org/linux-2.6/arch/parisc/kernel/pacache.S?r1=1.10&r2=1.11
>
> My suspicion is the prefetching might be causing problems since it's going
> past the end of the page. The kernel tmp alias pages might be causing problems
> since we don't flush past the end of those. (Or something like that).
>
Sorry, I don't yet understand all the details of prefetching.
> Can you remove only the prefetching in that algorithm?
>
Yes, I did, and I have already run one iteration of the loop without problems... so we have to be patient to see whether the
first-mentioned problem (failures after some number of loop iterations: about 50 last time) comes back :-)
>
>>But already strange results:
>
> ...
>
>>and after those first 11 failures it seems to continue??
>
>
> Do you mean after restarting the make the build completed?
No, I just let the loop continue (that doesn't make sense to me, but in fact I already encountered this case with 2.6.10-paXX and
didn't have time to report it).
Does nicol also encounter the same problem (just in case there is a small difference in hw config)?
Thanks,
Joel
* Same pb on b2k 64bit + 2.6.11-rc2-pa0 [Was: [parisc-linux] 2.6.11-rc3-pa3 smp "Segmentation fault" but not up]
2005-01-16 0:32 ` Joel Soete
@ 2005-01-25 13:56 ` Joel Soete
0 siblings, 0 replies; 4+ messages in thread
From: Joel Soete @ 2005-01-25 13:56 UTC (permalink / raw)
To: Grant Grundler; +Cc: Matthew Wilcox, List Parisc
Hello Grant,
>
[...]
> > My suspicion is the prefetching might be causing problems since it's going
> > past the end of the page. The kernel tmp alias pages might be causing problems
> > since we don't flush past the end of those. (Or something like that).
> >
> Sorry, I don't yet understand all the details of prefetching.
>
> > Can you remove only the prefetching in that algorithm?
> >
> Yes, I did, and I have already run one iteration of the loop without
> problems... so we have to be patient to see whether the first-mentioned
> problem (failures after some number of loop iterations: about 50 last
> time) comes back :-)
>
Hmm, I was about to forget why the n4k was looping on a kernel build, when the b2k
panicked due to the same problem with the latest 2.6.11-rc2-pa0 (+workaround):
patst005 login: Backtrace:
[<000000001017cd00>] do_wp_page+0x250/0x950
[<000000001017ee14>] handle_mm_fault+0x1b4/0x1e0
[<0000000010110ec4>] do_page_fault+0x2bc/0x360
[<000000001011299c>] handle_interruption+0x34c/0x748
[<0000000010107094>] intr_check_sig+0x0/0xc
Kernel Fault: Code=15 regs=000000001d68c540 (Addr=0000000020000000)
YZrvWESTHLNXBCVMcbcbcbcbOGFRQPDI
PSW: 00001000000001001110011100001111 Not tainted
r00-03 0000000000000000 0000000000000000 000000001017cd00 0000000010a0dfc8
r04-07 00000000105e2020 000000001d874000 0000000010983960 000000000007e2b0
r08-11 000000001d71cad0 000000001e7e9c00 000000000ffffb25 000000001d4043f0
r12-15 0000000010983960 000000001d8dc000 0000000000000001 000000000006a404
r16-19 000000001d432740 000000000006a404 000000000006b404 0000000000000000
r20-23 0000000000000000 0000000000000000 0000000000000000 0000000000000000
r24-27 00000000000000d2 0000000020000000 000000001d875000 00000000105e2020
r28-31 0000000010983960 000000001d68c510 000000001d68c540 0000000000000020
sr0-3 0000000000429800 0000000000000000 0000000000000000 0000000000429800
sr4-7 0000000000000000 0000000000000000 0000000000000000 0000000000000000
IASQ: 0000000000000000 0000000000000000 IAOQ: 00000000101012c4 00000000101012c8
IIR: 0f2010d3 ISR: 0000000000000000 IOR: 0000000020000000
CPU: 0 CR30: 000000001d68c000 CR31: 00000000105a0000
ORIG_R28: 00000000103e3e68
IAOQ[0]: copy_user_page_asm+0xa4/0xb0
IAOQ[1]: copy_user_page_asm+0xa8/0xb0
RP(r2): do_wp_page+0x250/0x950
Kernel panic - not syncing: Kernel Fault
So I applied the same workaround:
--- arch/parisc/kernel/pacache.S.Orig 2005-01-25 13:44:26.000000000 +0100
+++ arch/parisc/kernel/pacache.S 2005-01-25 13:44:55.000000000 +0100
@@ -306,13 +306,18 @@
 	ldd 0(%r25), %r19 /* bundle 1 */
 	ldi 32, %r1 /* PAGE_SIZE/128 == 32 */
+
+#if 0 /* Comment out prefetching awaiting better fix */
 	ldw 64(%r25), %r0 /* prefetch 1 cacheline ahead */
 	ldw 128(%r25), %r0 /* prefetch 2 */
+#endif
 1:	ldd 8(%r25), %r20
+#if 0 /* Comment out prefetching awaiting better fix */
 	ldw 192(%r25), %r0 /* prefetch 3 */
 	ldw 256(%r25), %r0 /* prefetch 4 */
 	nop /* pad to preserve alignment */
+#endif
 	ldd 16(%r25), %r21 /* bundle 2 */
 	ldd 24(%r25), %r22
=========><=========
which helps too.
(I ran the kernel build loop on the SMP n4k for a week: 612 runs, of which
111 failed because of various page faults :-)
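The suspicion quoted earlier in this message, that the in-loop prefetches run past the end of the source page, can be checked with simple arithmetic. The sketch below rests on assumptions not shown in the patch excerpt: that %r25 advances by 128 bytes per loop pass (suggested by the `PAGE_SIZE/128 == 32` comment, which also implies a 4096-byte page) and that the two `ldw` prefetches execute before %r25 is incremented. `overruns` is a hypothetical helper name.

```python
PAGE_SIZE = 4096       # implied by the patch comment: PAGE_SIZE/128 == 32
BYTES_PER_ITER = 128   # assumption: %r25 advances 128 bytes per loop pass
LOOP_PREFETCH_OFFSETS = (192, 256)  # "prefetch 3" and "prefetch 4"


def overruns(page_size=PAGE_SIZE, step=BYTES_PER_ITER,
             offsets=LOOP_PREFETCH_OFFSETS):
    """List (iteration, offset, address) for prefetches at or past page end.

    Addresses are byte offsets from the start of the source page, assuming
    the prefetch is issued before the pointer is advanced in that pass.
    """
    hits = []
    for i in range(page_size // step):
        base = i * step
        for off in offsets:
            addr = base + off
            if addr >= page_size:
                hits.append((i, off, addr))
    return hits


hits = overruns()
for i, off, addr in hits:
    print(f"pass {i:2d}: ldw {off}(%r25) touches page offset {addr} "
          f"({addr - PAGE_SIZE} bytes past the last valid byte + 1)")
```

Under these assumptions only the last two passes reach beyond the page, which would be consistent with a failure that depends on what happens to be mapped right after the source page. If the pointer is in fact advanced before the prefetches, every address shifts by one step and the overrun grows accordingly.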
Thanks,
Joel