* REISER4 "strange" partition size
@ 2004-01-23 14:46 Paolo Correnti
2004-01-23 14:52 ` Nikita Danilov
0 siblings, 1 reply; 9+ messages in thread
From: Paolo Correnti @ 2004-01-23 14:46 UTC (permalink / raw)
To: REISER LIST
Hallo,
as anyone experience with this e-mail object ?
I'm testing kernel 2.6.1 + latest snapshot
and I've noticed that if a use mkreiserfs
and mkfs.reiser4 on 2 partitions of the same size
with df I obtain that reiser4 partition has "lost"
about 5% of space (for example on a 4 Gb partition
I see 3,8 Gb).
Then making intensive writing test on this reiser4
partition (for example: writing 1 million rows on an
Oracle table in archivelog mode) on each test I obtain
a corrupted redolog file [sometimes is recoverable,
sometimes I've to rebuild the entire DB).
I've never had this kind of problem with kernel 2.6.0
and last 2003 snapshot.
Best regards
Paolo
__________________________________
Do you Yahoo!?
Yahoo! SiteBuilder - Free web site building tool. Try it!
http://webhosting.yahoo.com/ps/sb/
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: REISER4 "strange" partition size
2004-01-23 14:46 REISER4 "strange" partition size Paolo Correnti
@ 2004-01-23 14:52 ` Nikita Danilov
2004-01-23 15:12 ` Paolo Correnti
0 siblings, 1 reply; 9+ messages in thread
From: Nikita Danilov @ 2004-01-23 14:52 UTC (permalink / raw)
To: Paolo Correnti; +Cc: REISER LIST
Paolo Correnti writes:
> Hallo,
>
> as anyone experience with this e-mail object ?
>
> I'm testing kernel 2.6.1 + latest snapshot
> and I've noticed that if a use mkreiserfs
> and mkfs.reiser4 on 2 partitions of the same size
> with df I obtain that reiser4 partition has "lost"
> about 5% of space (for example on a 4 Gb partition
> I see 3,8 Gb).
Good catch!
These 5% are reserved by reiser4 for internal use.
>
> Then making intensive writing test on this reiser4
> partition (for example: writing 1 million rows on an
> Oracle table in archivelog mode) on each test I obtain
> a corrupted redolog file [sometimes is recoverable,
> sometimes I've to rebuild the entire DB).
>
Hmm...
It is not clear how to diagnose this. What oracle are you using? Is it
available freely?
Also you can compile reiser4 with CONFIG_REISER4_CHECK and
CONFIG_REISER4_DEBUG on, but it will run _much_ _more_ _slowly_.
>
> I've never had this kind of problem with kernel 2.6.0
> and last 2003 snapshot.
>
> Best regards
>
> Paolo
>
Nikita.
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: REISER4 "strange" partition size
2004-01-23 14:52 ` Nikita Danilov
@ 2004-01-23 15:12 ` Paolo Correnti
2004-01-26 13:52 ` REISER4 corruptions errors Paolo Correnti
0 siblings, 1 reply; 9+ messages in thread
From: Paolo Correnti @ 2004-01-23 15:12 UTC (permalink / raw)
To: Nikita Danilov; +Cc: REISER LIST
--- Nikita Danilov <Nikita@Namesys.COM> wrote:
> >
> > Then making intensive writing test on this
> reiser4
> > partition (for example: writing 1 million rows on
> an
> > Oracle table in archivelog mode) on each test I
> obtain
> > a corrupted redolog file [sometimes is
> recoverable,
> > sometimes I've to rebuild the entire DB).
> >
>
> Hmm...
>
> It is not clear how to diagnose this. What oracle
> are you using? Is it
> available freely?
>
I'm using Oracle 9.2 on Linux (Slackware 9.1)
> Also you can compile reiser4 with
> CONFIG_REISER4_CHECK and
> CONFIG_REISER4_DEBUG on, but it will run _much_
> _more_ _slowly_.
>
OK, I'll try, also if like I've already wrote
I've never had this kind of problem with kernel
2.6.0 and last 2003 snapshot
Best regards
Paolo
__________________________________
Do you Yahoo!?
Yahoo! SiteBuilder - Free web site building tool. Try it!
http://webhosting.yahoo.com/ps/sb/
^ permalink raw reply [flat|nested] 9+ messages in thread
* REISER4 corruptions errors
2004-01-23 15:12 ` Paolo Correnti
@ 2004-01-26 13:52 ` Paolo Correnti
2004-01-26 13:59 ` Nikita Danilov
0 siblings, 1 reply; 9+ messages in thread
From: Paolo Correnti @ 2004-01-26 13:52 UTC (permalink / raw)
To: Nikita Danilov; +Cc: REISER LIST
In my last message "REISER4 strange partition size"
I wrote:
"... making intensive writing test on this
reiser4 partition (for example: writing 1 million rows
on an Oracle table in archivelog mode) on each test I
obtain a corrupted redolog file [sometimes is
recoverable, sometimes I've to rebuild the entire
DB)."
And Nikita answered:
"It is not clear how to diagnose this. What oracle
are you using? Is it available freely?"
Really I think this test is not so easy to prepare
(you have to download Oracle, install it and bla bla
...) but I think my last test is more "human".
I recomplied Kernel 2.6.0 with
CONFIG_REISER4_BADBLOCKS=y
I'm using 2.6.0 + 20031223 snapshot because
2.6.1 + 20040119_fixed always gives me some file
corrupted and I was spending too much time re-creating
Oracle DB.
Reiser4 partition is mounted with tmgr.atom_max_age=20
option.
I noticed that if you copy a large amount of data (at
least 500 Mb) from a partition to a Reiser4 partition,
umounting Reiser4 part and using fsck.reiser4
you always obtain from 5 to 40 (and more) fixable
corruptions detected.
It seems that with 2.6.0 + 20031223 snapshot these
corruptions are not so "dangerous" to corrupt files
(I mean, also with these corruptions I'm able to work
with Oracle). Unfortunately I cannot say the same
with 2.6.1 and latest snapshot.
Any idea ?
All the best
Paolo
__________________________________
Do you Yahoo!?
Yahoo! SiteBuilder - Free web site building tool. Try it!
http://webhosting.yahoo.com/ps/sb/
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: REISER4 corruptions errors
2004-01-26 13:52 ` REISER4 corruptions errors Paolo Correnti
@ 2004-01-26 13:59 ` Nikita Danilov
2004-01-26 14:25 ` Paolo Correnti
0 siblings, 1 reply; 9+ messages in thread
From: Nikita Danilov @ 2004-01-26 13:59 UTC (permalink / raw)
To: Paolo Correnti; +Cc: REISER LIST
Paolo Correnti writes:
> In my last message "REISER4 strange partition size"
> I wrote:
>
> "... making intensive writing test on this
> reiser4 partition (for example: writing 1 million rows
> on an Oracle table in archivelog mode) on each test I
> obtain a corrupted redolog file [sometimes is
> recoverable, sometimes I've to rebuild the entire
> DB)."
>
> And Nikita answered:
>
> "It is not clear how to diagnose this. What oracle
> are you using? Is it available freely?"
>
> Really I think this test is not so easy to prepare
> (you have to download Oracle, install it and bla bla
> ...) but I think my last test is more "human".
>
> I recomplied Kernel 2.6.0 with
> CONFIG_REISER4_BADBLOCKS=y
>
> I'm using 2.6.0 + 20031223 snapshot because
> 2.6.1 + 20040119_fixed always gives me some file
> corrupted and I was spending too much time re-creating
> Oracle DB.
>
> Reiser4 partition is mounted with tmgr.atom_max_age=20
> option.
>
> I noticed that if you copy a large amount of data (at
> least 500 Mb) from a partition to a Reiser4 partition,
> umounting Reiser4 part and using fsck.reiser4
> you always obtain from 5 to 40 (and more) fixable
> corruptions detected.
>
> It seems that with 2.6.0 + 20031223 snapshot these
> corruptions are not so "dangerous" to corrupt files
> (I mean, also with these corruptions I'm able to work
> with Oracle). Unfortunately I cannot say the same
> with 2.6.1 and latest snapshot.
Well, for all I know, this very well may be a bug in the fsck, rather
than corruption of the on-disk data structures. Are you experiencing any
problems when _using_ this partition (error messages in the kernel log,
crashes, deadlocks, etc.)?
>
> Any idea ?
>
> All the best
>
> Paolo
>
Nikita.
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: REISER4 corruptions errors
2004-01-26 13:59 ` Nikita Danilov
@ 2004-01-26 14:25 ` Paolo Correnti
2004-01-26 15:26 ` Alex Zarochentsev
0 siblings, 1 reply; 9+ messages in thread
From: Paolo Correnti @ 2004-01-26 14:25 UTC (permalink / raw)
To: Nikita Danilov; +Cc: REISER LIST
--- Nikita Danilov <Nikita@Namesys.COM> wrote:
> Well, for all I know, this very well may be a bug in
> the fsck, rather than corruption of the on-disk data
> structures. Are
> you experiencing any
> problems when _using_ this partition (error messages
> in the kernel log,
> crashes, deadlocks, etc.)?
>
I have problems using the "20040119-fixed" partition
with 2.6.1 (after writing many MB of data, almost
always I obtain a file corrupted so that Oracle
doesn't start). With 2.6.0 and 20031223 snapshot I
have no problems using the "20031223" partition.
But in both tests I saw that fsck.reiser4 gave me
from 5 to 35 and more corruptions errors, all of type
Error: Node (210326), item (7): StatData of the file
[10001:1616662635f5445:10002] has the wrong bytes
(3625472), Should be
(3629056). Plugin (stat40).
So I was thinking (perhaps strangely ...) that this
kind of corruption was more dangerous with 2.6.1 +
20040119-fixed (I'm speaking about Oracle logfile
corrupted after 5 million rows written) than with
2.6.0 + 20030123 (which never gave me an Oracle file
corrupted, also after 10 million rows written).
I made the same test on 2 different disks so I'm
almost sure this is not an hardware problem.
Best regards
Paolo
__________________________________
Do you Yahoo!?
Yahoo! SiteBuilder - Free web site building tool. Try it!
http://webhosting.yahoo.com/ps/sb/
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: REISER4 corruptions errors
2004-01-26 14:25 ` Paolo Correnti
@ 2004-01-26 15:26 ` Alex Zarochentsev
2004-01-26 16:36 ` Carl-Daniel Hailfinger
0 siblings, 1 reply; 9+ messages in thread
From: Alex Zarochentsev @ 2004-01-26 15:26 UTC (permalink / raw)
To: Paolo Correnti; +Cc: Nikita Danilov, REISER LIST
On Mon, Jan 26, 2004 at 06:25:28AM -0800, Paolo Correnti wrote:
>
> --- Nikita Danilov <Nikita@Namesys.COM> wrote:
>
> > Well, for all I know, this very well may be a bug in
> > the fsck, rather than corruption of the on-disk data
> > structures. Are
> > you experiencing any
> > problems when _using_ this partition (error messages
> > in the kernel log,
> > crashes, deadlocks, etc.)?
> >
>
> I have problems using the "20040119-fixed" partition
> with 2.6.1 (after writing many MB of data, almost
> always I obtain a file corrupted so that Oracle
> doesn't start). With 2.6.0 and 20031223 snapshot I
> have no problems using the "20031223" partition.
>
> But in both tests I saw that fsck.reiser4 gave me
> from 5 to 35 and more corruptions errors, all of type
>
> Error: Node (210326), item (7): StatData of the file
> [10001:1616662635f5445:10002] has the wrong bytes
> (3625472), Should be
> (3629056). Plugin (stat40).
>
> So I was thinking (perhaps strangely ...) that this
> kind of corruption was more dangerous with 2.6.1 +
> 20040119-fixed (I'm speaking about Oracle logfile
I guess it is not a dangerous corruption. Probaply Oracle does too strong
checks for its log files. The file content and size should be OK, except
i_blocks and i_bytes fields.
I think the source of that curruption is in the reiser4 deletion optimization
performed by the cut_tree() routine (inode_sub_bytes() is not called in some
cases).
this patch should help (! not tested):
===== tree.c 1.562 vs edited =====
--- 1.562/tree.c Wed Jan 14 11:46:20 2004
+++ edited/tree.c Mon Jan 26 18:24:39 2004
@@ -1468,7 +1468,7 @@
if ((result != 0) && (result != -E_NO_NEIGHBOR))
break;
/* Check can we delete the node as a whole. */
- if (iterations && znode_get_level(node) == LEAF_LEVEL &&
+ if (0 && iterations && znode_get_level(node) == LEAF_LEVEL &&
UNDER_RW(dk, current_tree, read,
keyle(from_key, znode_get_ld_key(node)))) {
result = delete_node(next_node_lock.node,
> corrupted after 5 million rows written) than with
> 2.6.0 + 20030123 (which never gave me an Oracle file
> corrupted, also after 10 million rows written).
>
> I made the same test on 2 different disks so I'm
> almost sure this is not an hardware problem.
>
> Best regards
>
> Paolo
>
>
> __________________________________
> Do you Yahoo!?
> Yahoo! SiteBuilder - Free web site building tool. Try it!
> http://webhosting.yahoo.com/ps/sb/
--
Alex.
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: REISER4 corruptions errors
2004-01-26 16:36 ` Carl-Daniel Hailfinger
@ 2004-01-26 15:51 ` Alex Zarochentsev
0 siblings, 0 replies; 9+ messages in thread
From: Alex Zarochentsev @ 2004-01-26 15:51 UTC (permalink / raw)
To: Carl-Daniel Hailfinger; +Cc: Paolo Correnti, REISER LIST
On Mon, Jan 26, 2004 at 05:36:10PM +0100, Carl-Daniel Hailfinger wrote:
> Alex Zarochentsev wrote:
>
> > this patch should help (! not tested):
> >
> > ===== tree.c 1.562 vs edited =====
> > --- 1.562/tree.c Wed Jan 14 11:46:20 2004
> > +++ edited/tree.c Mon Jan 26 18:24:39 2004
> > @@ -1468,7 +1468,7 @@
> > if ((result != 0) && (result != -E_NO_NEIGHBOR))
> > break;
> > /* Check can we delete the node as a whole. */
> > - if (iterations && znode_get_level(node) == LEAF_LEVEL &&
> > + if (0 && iterations && znode_get_level(node) == LEAF_LEVEL &&
> > UNDER_RW(dk, current_tree, read,
> > keyle(from_key, znode_get_ld_key(node)))) {
> > result = delete_node(next_node_lock.node,
>
> Was it your intention to disable the above code path?
no, it will be fixed in proper way. my fix is a temporary solution.
--
Alex.
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: REISER4 corruptions errors
2004-01-26 15:26 ` Alex Zarochentsev
@ 2004-01-26 16:36 ` Carl-Daniel Hailfinger
2004-01-26 15:51 ` Alex Zarochentsev
0 siblings, 1 reply; 9+ messages in thread
From: Carl-Daniel Hailfinger @ 2004-01-26 16:36 UTC (permalink / raw)
To: Alex Zarochentsev; +Cc: Paolo Correnti, Nikita Danilov, REISER LIST
Alex Zarochentsev wrote:
> this patch should help (! not tested):
>
> ===== tree.c 1.562 vs edited =====
> --- 1.562/tree.c Wed Jan 14 11:46:20 2004
> +++ edited/tree.c Mon Jan 26 18:24:39 2004
> @@ -1468,7 +1468,7 @@
> if ((result != 0) && (result != -E_NO_NEIGHBOR))
> break;
> /* Check can we delete the node as a whole. */
> - if (iterations && znode_get_level(node) == LEAF_LEVEL &&
> + if (0 && iterations && znode_get_level(node) == LEAF_LEVEL &&
> UNDER_RW(dk, current_tree, read,
> keyle(from_key, znode_get_ld_key(node)))) {
> result = delete_node(next_node_lock.node,
Was it your intention to disable the above code path?
Carl-Daniel
--
http://www.hailfinger.org/
^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2004-01-26 16:36 UTC | newest]
Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-01-23 14:46 REISER4 "strange" partition size Paolo Correnti
2004-01-23 14:52 ` Nikita Danilov
2004-01-23 15:12 ` Paolo Correnti
2004-01-26 13:52 ` REISER4 corruptions errors Paolo Correnti
2004-01-26 13:59 ` Nikita Danilov
2004-01-26 14:25 ` Paolo Correnti
2004-01-26 15:26 ` Alex Zarochentsev
2004-01-26 16:36 ` Carl-Daniel Hailfinger
2004-01-26 15:51 ` Alex Zarochentsev
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.