* random errors with bzip2
@ 2001-06-19 15:25 Rodrigo Ventura
2001-06-19 16:11 ` André Dahlqvist
From: Rodrigo Ventura @ 2001-06-19 15:25 UTC
To: linux-kernel
Hi. Let me first apologize if this message is on the wrong
mailing list, but I haven't the faintest idea where I should
send this.
Randomly, bzip2 gives a CRC error when checking a big bz2
file, e.g. linux-2.4.5.tar.bz2 (20M), on one particular machine (asus
a7v, athlon@1.2G, 256M). The same file (with the same md5sum, over NFS)
gives no problem whatsoever on other machines. On the asus
machine, sometimes the file checks ok, other times it fails (CRC
error). When several bzip2 processes are launched at the same time on
the same file, some of them fail, others return ok. These have been my
latest observations:
- avoiding shared IRQs (by moving PCI cards to different
slots) seems to reduce the probability of error;
- a fresh reboot seems to solve the problem, but after some
hours of uptime it comes back;
- all machines are using 2.4.6-pre3; the file is local to the
asus machine but on NFS for the others. The local filesystem is
*ReiserFS*;
- the bzip2 executable is ok (as far as md5sum can tell);
- it could be a memory problem, but if it were, lots of kernel
oops were expected, right?
- could it be a process pre-emption problem? (FPU?) the kernel
is compiled for generic i586 (to be shared by several machines);
- could it be a shared IRQ problem? I've been experiencing
several problems (freezes) related to IRQ sharing on asus boards, but
I'm not totally sure about this;
- I've read some posts about file shifting, can it cause these
kinds of bzip problems?
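The "sometimes ok, sometimes CRC error" behaviour described above can be checked in a repeatable way with something like the rough sketch below. It is only an illustration: it uses Python's standard bz2 module instead of the bzip2 command line, SHA-256 instead of md5sum, and the function name repeat_check is made up for this example. On healthy hardware every run over the same input should give the same digest; a mix of digests or intermittent errors would suggest the data is being corrupted somewhere between storage, memory and CPU.

```python
import bz2
import hashlib
from collections import Counter


def repeat_check(compressed, runs=20):
    """Decompress the same bz2 data `runs` times and tally the outcomes.

    Returns a Counter mapping each outcome (a SHA-256 hex digest of the
    decompressed data, or an "error: ..." string) to how often it occurred.
    Hypothetical helper, not part of bzip2 or of any tool named in this thread.
    """
    outcomes = Counter()
    for _ in range(runs):
        try:
            data = bz2.decompress(compressed)
            outcomes[hashlib.sha256(data).hexdigest()] += 1
        except (OSError, ValueError, EOFError) as exc:
            # bz2 reports corrupt or truncated streams through these exceptions.
            outcomes["error: " + type(exc).__name__] += 1
    return outcomes
```

To try it on a real file, read the file once in binary mode and pass the bytes to repeat_check; more than one key in the result means the runs disagreed with each other.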
Cheers,
--
*** Rodrigo Martins de Matos Ventura <yoda@isr.ist.utl.pt>
*** Web page: http://www.isr.ist.utl.pt/~yoda
*** Teaching Assistant and PhD Student at ISR:
*** Instituto de Sistemas e Robotica, Polo de Lisboa
*** Instituto Superior Tecnico, Lisboa, PORTUGAL
*** PGP fingerprint = 0119 AD13 9EEE 264A 3F10 31D3 89B3 C6C4 60C6 4585
* Re: random errors with bzip2
2001-06-19 15:25 random errors with bzip2 Rodrigo Ventura
@ 2001-06-19 16:11 ` André Dahlqvist
2001-06-20 7:27 ` Ville Herva
From: André Dahlqvist @ 2001-06-19 16:11 UTC
To: linux-kernel
Rodrigo Ventura <yoda@isr.ist.utl.pt> wrote:
> - it could be a memory problem, but if it were, lots of kernel
> oops were expected, right?
This certainly sounds like a memory problem. I experienced almost the same
behaviour with a box some years ago, and it turned out to be memory. The
kernel didn't oops, and I actually had to run several kernel compiles at
the same time to have gcc die.
Try memtest86 on the suspect box.
--
André Dahlqvist <andre.dahlqvist@telia.com>
* Re: random errors with bzip2
2001-06-19 16:11 ` André Dahlqvist
@ 2001-06-20 7:27 ` Ville Herva
From: Ville Herva @ 2001-06-20 7:27 UTC
To: linux-kernel
On Tue, Jun 19, 2001 at 06:11:48PM +0200, you [André Dahlqvist] claimed:
> Rodrigo Ventura <yoda@isr.ist.utl.pt> wrote:
>
> > - it could be a memory problem, but if it were, lots of kernel
> > oops were expected, right?
>
> This certainly sounds like a memory problem. I experienced almost the same
> behaviour with a box some years ago, and it turned out to be memory. The
> kernel didn't oops, and I actually had to run several kernel compiles at
> the same time to have gcc die.
>
> Try memtest86 on the suspect box.
Seconded.
Exactly the same symptoms (bzip2); the culprit turned out to be memory.
That's when I wrote memburn (http://v.iki.fi/~vherva/memburn.c) for quick
testing without a boot (it did find the problem) and I then verified the
problem with memtest86 (http://reality.sgi.com/cbrady_denver/memtest86/).
You do have to run either for hours, probably for days, to be sure.
The box has now run perfectly for a year or so with the BadRam patch from
Rick van Rein (http://rick.vanrein.org/linux/badram/).
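For readers wondering what a quick user-space memory test looks like in principle, here is a minimal sketch. It is not memburn.c and makes no claim to resemble it; the names pattern_pass and burn, the buffer size and the byte patterns are all assumptions chosen for illustration. It fills a buffer with a known pattern, reads it back, and records any offset that does not match. A test like this can only touch memory the OS happens to hand to the process, and CPU caches can hide errors, so it is far weaker than a boot-time tester such as memtest86.

```python
def pattern_pass(buf, pattern):
    """Fill `buf` with one byte value, read it back, return mismatching offsets."""
    expected = bytes([pattern]) * len(buf)
    buf[:] = expected              # write the pattern over the whole buffer
    if buf == expected:            # fast path: everything read back correctly
        return []
    return [i for i, b in enumerate(buf) if b != pattern]


def burn(size_bytes=1 << 20, passes=4, patterns=(0x00, 0xFF, 0x55, 0xAA)):
    """Run several fill-and-verify passes over one buffer.

    Returns a list of (pass_number, pattern, offset) tuples, one per mismatch.
    An empty list means no error was seen, not that the memory is good.
    """
    buf = bytearray(size_bytes)
    errors = []
    for n in range(passes):
        for pattern in patterns:
            for offset in pattern_pass(buf, pattern):
                errors.append((n, pattern, offset))
    return errors
```

In practice the buffer would need to cover a large share of RAM and the passes would have to run for hours, as noted above, before a clean result means much.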
-- v --
v@iki.fi