From: eazgwmir@umail.furryterror.org (Zygo Blaxell)
To: reiserfs-list@namesys.com
Subject: Re: what do you do that stresses your filesystem?
Date: 11 Jan 2003 10:25:54 -0500 [thread overview]
Message-ID: <avpd22$o41$1@satsuki.furryterror.org> (raw)
In-Reply-To: 3E06F360.7000708@namesys.com
In article <3E06F360.7000708@namesys.com>,
Hans Reiser <reiser@namesys.com> wrote:
>We were discussing how to optimize reiser4 best, and came to realize
>that us developers did not have a good enough intuition for what users
>do that stresses their filesystem enough that they care about its
>performance.
>Booting the machine seems like one activity that many users end up
>waiting on the FS for. Yes?
>Starting up complex and big applications like xemacs and mozilla would
>be another. Yes?
Not for me. I reboot only for upgrades, crashes, power failures, and
dead hardware replacement. I try to avoid complex and big applications
for many reasons including startup time, but startup time is relatively
unimportant.
>Others?
Lots of rsync's, cp -al's, rm -rf's, sometimes involving millions of ~12K
files at a time. apt-get dist-upgrade on Debian. Big C/C++ compiles.
Often a number of these at the same time on the same machine.
I store multi-gigabyte collections of files about 12K in size each.
This isn't usually a performance issue as the files are normally accessed
at the rate of a few hundred per hour; however, relatively infrequently
I need to copy one of these collections to another machine, or process
all of the files at once, or transfer the collection to or from tape.
Often I want to _avoid_ putting stress on my filesystem. I want to do
full system backups, but I don't really care how long they take, and I
don't want any other users of the machine to be inconvenienced by them.
Obviously if the machine is otherwise idle, I want the backups to be fast.
My favorite toy benchmark application involves creating a bootable CD-ROM
from a chroot Debian system. This involves four phases:
1. Build a compressed initrd (mostly CPU bound)
2. Scan the chroot filesystem for identical files and
hardlinking them together (mostly FS bound)
3. mkzftree --parallelism 16 (CPU and FS bound)
4. mkisofs (mostly FS bound)
All input and output is on the same filesystem.
Phase 2 is interesting because the other filesystems (xfs, ext[23], tmpfs)
are all more than twice as slow as Reiser. This phase represents about
one third of the total time. It's a perl script that runs 'find', sorts
the results by size, then reads all files of identical size sequentially
to generate sha1sums, then replaces files with identical sha1sums with
hardlinks to one of the files.
Phase 3 is interesting because the distribution of system and real time
varies wildly depending on filesystem (and not, as one might expect, on
things like whether the machine has one or two CPU's or whether a disk
array is used or a single disk). reiser finished in 30% less real time
than the other filesystems despite being rather heavy in CPU system time.
Reiser is the fastest filesystem at each of these phases individually,
and overall reiser is about 40% faster than xfs or ext[23] over a wide
variety of system configurations. tmpfs was so slow that I didn't bother
letting it finish (I killed it after 90 minutes on a machine that runs
the benchmark on reiser in 22 minutes).
--
Zygo Blaxell (Laptop) <zblaxell@feedme.hungrycats.org>
GPG = D13D 6651 F446 9787 600B AD1E CCF3 6F93 2823 44AD
prev parent reply other threads:[~2003-01-11 15:25 UTC|newest]
Thread overview: 41+ messages / expand[flat|nested] mbox.gz Atom feed top
2002-12-23 11:28 what do you do that stresses your filesystem? Hans Reiser
2002-12-23 11:37 ` Anders Widman
2002-12-23 11:45 ` Hans Reiser
2002-12-23 12:36 ` Ed Tomlinson
2002-12-24 9:22 ` Hans Reiser
2002-12-23 22:02 ` Dieter Nützel
2002-12-23 11:49 ` new reiserfs4 snapshots? Ookhoi
2002-12-23 12:03 ` what do you do that stresses your filesystem? Hendrik Visage
2002-12-23 12:10 ` Oleg Drokin
2002-12-23 12:12 ` Hans Reiser
2002-12-23 12:21 ` Hendrik Visage
2002-12-23 12:26 ` Oleg Drokin
2002-12-23 12:37 ` Hendrik Visage
2002-12-23 14:18 ` Oleg Drokin
2002-12-23 14:37 ` Eric Whiting
2002-12-23 13:21 ` Russell Coker
2002-12-23 16:00 ` Alexander G. M. Smith
2002-12-24 9:26 ` Hans Reiser
2002-12-24 10:15 ` Ookhoi
2002-12-24 10:19 ` Oleg Drokin
2002-12-24 10:26 ` Ookhoi
2002-12-24 11:57 ` Hans Reiser
2002-12-24 12:41 ` Russell Coker
2002-12-23 14:44 ` bscott
2002-12-23 15:56 ` Ross Vandegrift
2002-12-23 20:12 ` Russell Coker
2002-12-23 16:33 ` Chris Haynes
2002-12-24 9:36 ` Hans Reiser
2003-01-06 15:14 ` Chris Mason
2002-12-23 18:54 ` Matthew Johnson
2002-12-23 21:04 ` Manuel Krause
2002-12-23 21:14 ` Andrew Clausen
2002-12-28 5:27 ` Brian Tinsley
2003-01-05 8:17 ` Hans Reiser
2003-01-05 11:49 ` Legato (was: " Hendrik Visage
2003-01-05 17:00 ` Brian Tinsley
2003-01-06 7:00 ` Hans Reiser
2003-01-05 16:51 ` Brian Tinsley
2003-01-06 7:10 ` Hans Reiser
2003-01-05 15:01 ` Philipp Gühring
2003-01-11 15:25 ` Zygo Blaxell [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='avpd22$o41$1@satsuki.furryterror.org' \
--to=eazgwmir@umail.furryterror.org \
--cc=reiserfs-list@namesys.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.