From: Jeff Mahoney <jeffm@suse.com>
To: Hans Reiser <reiser@namesys.com>
Cc: Sonny Rao <sonny@burdell.org>, reiserfs-list@namesys.com
Subject: Why larger extent counts aren't necessarily bad (was Re: Odd Block allocation behavior on Reiser3)
Date: Tue, 10 Aug 2004 16:12:10 -0400 [thread overview]
Message-ID: <41192C1A.6080909@suse.com> (raw)
In-Reply-To: <41190B5A.7070503@namesys.com>
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hans Reiser wrote:
| Sonny Rao wrote:
|> Below I made 24 one gigabyte files in sequence
|> All of them are similarly fragmented:
| this could explain some reiser3 performance problems. This is what
| happens when I spend all my time chasing funding and don't spend it
| reviewing code and benchmarks, sigh.
|
| Thanks for spotting this. I would be curious if this is occuring near
| the transition between unformatted nodes and their parents, or something
| else.
Having a high extent count isn't indicative of fragmentation negatively
affecting performance. In fact, it may be just the opposite.
I modified filefrag.c to display the displacement between the extents,
and the average extent length. My disk was only 9 GB, so I had to limit
my test to 8 1 GB files, but the results are the same - it's a
sequential write. The number of files has no bearing on the result.
For this workload, the patterns are so simple, it's distributed almost
perfectly. Even using the skip_busy algorithm by itself (a practice I
warned about over a year ago) produces acceptible results. The results
showed an median extent length of 1023 extents (1 less than contained in
an indirect pointer block), followed by a median extent displacement of
2 blocks. For all intents and purposes, the file is contiguous, with
metadata interspersed.
The pattern of a streaming read/write operation would be like so:
Locate file, locate first indirect pointer block, read blocks, find next
indirect pointer block, read blocks, ...
In the perceived "ideal" fragementation pattern of 9 fragments
(1024MB/128MB - 4k per bitmap + 8*4k remainder), the metadata is not
interspersed with the file data. It makes the fragmentation extent
number look nice and low, but it really means that every time we need to
read another indirect pointer block, we're seeking outside the data
stream, reading a few blocks (readahead), and seeking back.
In the fragmentation pattern created using the new allocation algorithms
that Chris and I developed, you'll get a higher fragmentation number,
but the extents are close together, and the pointer blocks are already
read in due to readahead. The "actual" fragmentation is lower than the
"ideal" case above since less seeking is required.
- -Jeff
- --
Jeff Mahoney
SuSE Labs
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org
iD8DBQFBGSwaLPWxlyuTD7IRAkwWAJ4uPTDcvSAgKpJm6KA4KMcSDb5iKwCcDjKN
cdMveSkc/zsVdGzsZvx5SsM=
=l45w
-----END PGP SIGNATURE-----
next prev parent reply other threads:[~2004-08-10 20:12 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-08-09 20:19 Odd Block allocation behavior on Reiser3 Sonny Rao
2004-08-09 20:30 ` Chris Mason
2004-08-09 22:04 ` Sonny Rao
2004-08-10 7:16 ` Hans Reiser
2004-08-10 15:45 ` Sonny Rao
2004-08-10 17:52 ` Hans Reiser
2004-08-10 18:25 ` Chris Mason
2004-08-10 18:50 ` Chris Mason
2004-08-10 19:42 ` Hans Reiser
2004-08-10 20:29 ` Chris Mason
2004-08-10 21:47 ` Hans Reiser
2004-08-10 23:12 ` Sonny Rao
2004-08-11 1:31 ` Jeff Mahoney
2004-08-10 19:40 ` Hans Reiser
2004-08-10 23:00 ` Sonny Rao
2004-08-10 20:12 ` Jeff Mahoney [this message]
2004-09-09 17:04 ` Why larger extent counts aren't necessarily bad (was Re: Odd Block allocation behavior on Reiser3) Hans Reiser
2004-08-10 12:53 ` Odd Block allocation behavior on Reiser3 Chris Mason
2004-08-10 16:12 ` Sonny Rao
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=41192C1A.6080909@suse.com \
--to=jeffm@suse.com \
--cc=reiser@namesys.com \
--cc=reiserfs-list@namesys.com \
--cc=sonny@burdell.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.