All of lore.kernel.org
 help / color / mirror / Atom feed
From: Will Smith <will@willsmith.org>
To: "Markus Törnqvist" <mjt@nysv.org>
Cc: reiserfs-list@namesys.com
Subject: Re: Benchmark : ext3 vs reiser4 and effects of fragmentation.
Date: Thu, 16 Sep 2004 22:50:43 +0800	[thread overview]
Message-ID: <4149A843.7010804@willsmith.org> (raw)
In-Reply-To: <20040916130809.GI26192@nysv.org>


Markus Törnqvist wrote:

>>yes, we had similar results.  it is so cool that reiser4 repacker effect
>>is visible outside namesys :)
> 
> S'cuse me, but why does it have to be run several times?

I don't pretend to know the maths/logic of why, but from the reiser4 
description (http://www.namesys.com/v4/v4.html#repacker)

"This repacker goes through the entire tree ordering, from left to right 
and then from right to left, alternating each time it runs."

"In the absence of FS activity the effect of this over time is to sort 
by tree order (defragment), and to pack with perfect efficiency."


My testing showed that this is indeed the case -

If I ran the repacker once (say left to right), the fragmentation was 
not fully reduced.

If I ran the repacker several times (left->right, right->left, and
repeat) then fragmentation fell down to almost 0.


Userspace Tool for Repacker
----------------------------

I take it from an earlier comment by Hans that the repacker is not
intended to be used yet?  If it is indeed ready for primetime,
is anybody working on userspace tools
(/etc/cron.daily/repacker.reiser4 or similar?)  I'd be happy to
write these if nobody else is - but it will be in /bin/bash shell.

My proposed design specs would include:

1) runs in /etc/cron.daily near the end (since cron causes lots of
    disk activity)
2) the last-run-direction should be persistently stored per filesystem
3) parallel operation on multiple filesystems where possible,
    but filesystems on a single physical device should
    run in serial (same as /sbin/fsck -A)
4) some kind of config file in /etc/, parameters could include
     o) maximum run time, global and per filesystem
     o) run/dont run flag, global and per filesystem


Any thoughts?  Shall I go ahead and write a first cut?



Will Smith






  reply	other threads:[~2004-09-16 14:50 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-09-15 22:02 Benchmark : ext3 vs reiser4 and effects of fragmentation Will Smith
2004-09-16  8:52 ` Alex Zarochentsev
2004-09-16 13:08   ` mjt
2004-09-16 14:50     ` Will Smith [this message]
2004-09-16 15:03       ` mjt
2004-09-16 17:16       ` Hans Reiser
2004-09-16 15:38     ` Alex Zarochentsev
2004-09-16 15:52       ` mjt
2004-09-16 17:26         ` Hans Reiser
2004-09-16 18:38         ` Alex Zarochentsev
2004-09-16 19:00           ` mjt
2004-09-16 19:05             ` Chris Mason
2004-09-17  2:29             ` Hans Reiser
2004-09-17 17:01               ` Philippe Gramoullé
2004-09-16 15:38 ` Christian Mayrhuber

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4149A843.7010804@willsmith.org \
    --to=will@willsmith.org \
    --cc=mjt@nysv.org \
    --cc=reiserfs-list@namesys.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.