* 2.4.19-pre7 / corruption on unwanted reboot
@ 2002-05-04 21:11 Dirk Mueller
2002-05-05 2:07 ` Chris Mason
` (2 more replies)
0 siblings, 3 replies; 9+ messages in thread
From: Dirk Mueller @ 2002-05-04 21:11 UTC (permalink / raw)
To: reiserfs-list
Hi,
I've seen HEAVY file corruption on unwanted reboots (like pressing the reset
button accidently) on reiserfs with this kernel on 3 machines now.
The symptom is that it finds a LOT of files to "unlink" on journal replay,
which I find suspicious as those machines are lightly loaded.
I didn't follow the development too closely the last few weeks, but I
believe that something turned worse in this respect lately.
Note that reiserfsck doesn't find any error in the file system structure
before and after the journal replay on reboot,
still many files (especially those that were not touched for several hours
before the reboot) contain complete garbage after the journal replay.
Dirk
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: 2.4.19-pre7 / corruption on unwanted reboot
2002-05-04 21:11 2.4.19-pre7 / corruption on unwanted reboot Dirk Mueller
@ 2002-05-05 2:07 ` Chris Mason
2002-05-06 12:36 ` Dirk Mueller
2002-05-05 8:09 ` Oleg Drokin
2002-05-06 10:59 ` Hans Reiser
2 siblings, 1 reply; 9+ messages in thread
From: Chris Mason @ 2002-05-05 2:07 UTC (permalink / raw)
To: Dirk Mueller; +Cc: reiserfs-list
On Sat, 2002-05-04 at 17:11, Dirk Mueller wrote:
> Hi,
>
> I've seen HEAVY file corruption on unwanted reboots (like pressing the reset
> button accidently) on reiserfs with this kernel on 3 machines now.
>
> The symptom is that it finds a LOT of files to "unlink" on journal replay,
> which I find suspicious as those machines are lightly loaded.
>
> I didn't follow the development too closely the last few weeks, but I
> believe that something turned worse in this respect lately.
>
> Note that reiserfsck doesn't find any error in the file system structure
> before and after the journal replay on reboot,
> still many files (especially those that were not touched for several hours
> before the reboot) contain complete garbage after the journal replay.
Hmmm, not good at all. Are these 3 systems IDE or scsi? Do they run
additional patches on top of pre7? What kernels < pre7 have you tried
that didn't show this problem?
-chris
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: 2.4.19-pre7 / corruption on unwanted reboot
2002-05-04 21:11 2.4.19-pre7 / corruption on unwanted reboot Dirk Mueller
2002-05-05 2:07 ` Chris Mason
@ 2002-05-05 8:09 ` Oleg Drokin
2002-05-06 10:59 ` Hans Reiser
2 siblings, 0 replies; 9+ messages in thread
From: Oleg Drokin @ 2002-05-05 8:09 UTC (permalink / raw)
To: Dirk Mueller; +Cc: reiserfs-list
Hello!
On Sat, May 04, 2002 at 11:11:47PM +0200, Dirk Mueller wrote:
> I've seen HEAVY file corruption on unwanted reboots (like pressing the reset
> button accidently) on reiserfs with this kernel on 3 machines now.
This is bad. Does any earlier (but still relatively recent) kernel excibits
this same behavior?
> The symptom is that it finds a LOT of files to "unlink" on journal replay,
> which I find suspicious as those machines are lightly loaded.
How much is this "a LOT"? I think that even 10+ files that are unlinked
after the unclean reboot/crash is pretty normal. Programs tend to use
deleted files after the deletion, so this number highly depends
on kind of programs you usually run.
> I didn't follow the development too closely the last few weeks, but I
> believe that something turned worse in this respect lately.
In fact nothing unexpected went into 2.4.19-pre7, that was not available in
our pending patches directory on our ftp site.
> Note that reiserfsck doesn't find any error in the file system structure
> before and after the journal replay on reboot,
> still many files (especially those that were not touched for several hours
> before the reboot) contain complete garbage after the journal replay.
I sthere any other pattern for damaged files besides these were not touched for
several hours prior to reboot?
Do their mtimes confirm they were modified long ago?
Thanks for the report.
Bye,
Oleg
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: 2.4.19-pre7 / corruption on unwanted reboot
2002-05-04 21:11 2.4.19-pre7 / corruption on unwanted reboot Dirk Mueller
2002-05-05 2:07 ` Chris Mason
2002-05-05 8:09 ` Oleg Drokin
@ 2002-05-06 10:59 ` Hans Reiser
2 siblings, 0 replies; 9+ messages in thread
From: Hans Reiser @ 2002-05-06 10:59 UTC (permalink / raw)
To: Dirk Mueller; +Cc: reiserfs-list
Dirk Mueller wrote:
>Hi,
>
>I've seen HEAVY file corruption on unwanted reboots (like pressing the reset
>button accidently) on reiserfs with this kernel on 3 machines now.
>
>The symptom is that it finds a LOT of files to "unlink" on journal replay,
>which I find suspicious as those machines are lightly loaded.
>
>I didn't follow the development too closely the last few weeks, but I
>believe that something turned worse in this respect lately.
>
>Note that reiserfsck doesn't find any error in the file system structure
>before and after the journal replay on reboot,
>still many files (especially those that were not touched for several hours
>before the reboot) contain complete garbage after the journal replay.
>
>
>Dirk
>
>
>
>
Were these files being written to near the time of the reboot?
hans
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: 2.4.19-pre7 / corruption on unwanted reboot
2002-05-05 2:07 ` Chris Mason
@ 2002-05-06 12:36 ` Dirk Mueller
2002-05-06 12:54 ` Chris Mason
0 siblings, 1 reply; 9+ messages in thread
From: Dirk Mueller @ 2002-05-06 12:36 UTC (permalink / raw)
To: Chris Mason; +Cc: reiserfs-list
On Sam, 04 Mai 2002, Chris Mason wrote:
> Hmmm, not good at all. Are these 3 systems IDE or scsi? Do they run
> additional patches on top of pre7? What kernels < pre7 have you tried
> that didn't show this problem?
All IDE. The kernel that didn't show this problem was 2.4.16 (plain). No
additional patches on 2.4.19-pre7.
Dirk
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: 2.4.19-pre7 / corruption on unwanted reboot
2002-05-06 12:36 ` Dirk Mueller
@ 2002-05-06 12:54 ` Chris Mason
2002-05-06 13:59 ` Dirk Mueller
0 siblings, 1 reply; 9+ messages in thread
From: Chris Mason @ 2002-05-06 12:54 UTC (permalink / raw)
To: Dirk Mueller; +Cc: reiserfs-list, axboe
On Mon, 2002-05-06 at 08:36, Dirk Mueller wrote:
> On Sam, 04 Mai 2002, Chris Mason wrote:
[ reiserfs corruption after a crash, 2.4.19pre7 ]
>
> > Hmmm, not good at all. Are these 3 systems IDE or scsi? Do they run
> > additional patches on top of pre7? What kernels < pre7 have you tried
> > that didn't show this problem?
>
> All IDE. The kernel that didn't show this problem was 2.4.16 (plain). No
> additional patches on 2.4.19-pre7.
Please tell us everything about your IDE config. Jens and I are already
trying to track down some odd reiserfs + ide problems on 2.4.19pre7, but
so far that was only with our barrier write patches applied.
-chris
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: 2.4.19-pre7 / corruption on unwanted reboot
2002-05-06 12:54 ` Chris Mason
@ 2002-05-06 13:59 ` Dirk Mueller
2002-05-06 14:26 ` Chris Mason
0 siblings, 1 reply; 9+ messages in thread
From: Dirk Mueller @ 2002-05-06 13:59 UTC (permalink / raw)
To: reiserfs-list
On Mon, 06 Mai 2002, Chris Mason wrote:
> Please tell us everything about your IDE config. Jens and I are already
> trying to track down some odd reiserfs + ide problems on 2.4.19pre7, but
> so far that was only with our barrier write patches applied.
There is not much common. two of them are VIA 686 southbridge (KT133A,
KT333), one is something older, a Pentium chipset.
DMA 100 and DMA 66. We all use those Maxtor 80GB EIDE disks.
Dirk
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: 2.4.19-pre7 / corruption on unwanted reboot
2002-05-06 13:59 ` Dirk Mueller
@ 2002-05-06 14:26 ` Chris Mason
2002-05-06 18:37 ` Dirk Mueller
0 siblings, 1 reply; 9+ messages in thread
From: Chris Mason @ 2002-05-06 14:26 UTC (permalink / raw)
To: Dirk Mueller; +Cc: reiserfs-list
On Mon, 2002-05-06 at 09:59, Dirk Mueller wrote:
> On Mon, 06 Mai 2002, Chris Mason wrote:
>
> > Please tell us everything about your IDE config. Jens and I are already
> > trying to track down some odd reiserfs + ide problems on 2.4.19pre7, but
> > so far that was only with our barrier write patches applied.
>
> There is not much common. two of them are VIA 686 southbridge (KT133A,
> KT333), one is something older, a Pentium chipset.
>
> DMA 100 and DMA 66. We all use those Maxtor 80GB EIDE disks.
Any suggestions on how I might reproduce locally?
-chris
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: 2.4.19-pre7 / corruption on unwanted reboot
2002-05-06 14:26 ` Chris Mason
@ 2002-05-06 18:37 ` Dirk Mueller
0 siblings, 0 replies; 9+ messages in thread
From: Dirk Mueller @ 2002-05-06 18:37 UTC (permalink / raw)
To: reiserfs-list
On Mon, 06 Mai 2002, Chris Mason wrote:
> Any suggestions on how I might reproduce locally?
not much. maybe try a lot of open, unlinked files when pressing reset and
then check the md5sum's of all files..
Dirk
^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2002-05-06 18:37 UTC | newest]
Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2002-05-04 21:11 2.4.19-pre7 / corruption on unwanted reboot Dirk Mueller
2002-05-05 2:07 ` Chris Mason
2002-05-06 12:36 ` Dirk Mueller
2002-05-06 12:54 ` Chris Mason
2002-05-06 13:59 ` Dirk Mueller
2002-05-06 14:26 ` Chris Mason
2002-05-06 18:37 ` Dirk Mueller
2002-05-05 8:09 ` Oleg Drokin
2002-05-06 10:59 ` Hans Reiser
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.