From: Edward Shishkin <edward@namesys.com>
To: PFC <lists@peufeu.com>
Cc: ReiserFS List <reiserfs-list@namesys.com>
Subject: Re: Reiser4 und LZO compression
Date: Wed, 30 Aug 2006 20:50:31 +0400 [thread overview]
Message-ID: <44F5C1D7.5060904@namesys.com> (raw)
In-Reply-To: <op.te3aq1uwcigqcu@apollo13>
PFC wrote:
>
>> Maybe, but Reiser4 is supposed to be a general purpose filesystem
>> talking about its advantages/disadvantages wrt. gaming makes sense,
>
>
> I don't see a lot of gamers using Linux ;)
> But yes, gaming is what pushes hardware development these days, at
> least on the desktop.
>
> Also, as you said, gamers (like many others) reinvent filesystems
> and generally use the Big Zip File paradigm, which is not that stupid
> for a read only FS (if you cache all file offsets, reading can be
> pretty fast). However when you start storing ogg-compressed sound and
> JPEG images inside a zip file, it starts to stink.
>
> ***************************
>
>> Does the CPU power necessary to do the compression cost more or less
>> than another drive?
>
>
> ***************************
>
> It depends, you have to consider several distinct scenarios.
> For instance, on a big Postgres database server, the rule is to have
> as many spindles as you can.
> - If you are doing a lot of full table scans (like data mining etc),
> more spindles means reads can be parallelized ; of course this will
> mean more data will have to be decompressed.
> - If you are doing a lot of little transactions (web sites), it
> means seeks can be distributed around the various disks. In this case
> compression would be a big win because there is free CPU to use ;
> besides, it would virtually double the RAM cache size.
>
> You have to ponder cost (in CPU $) of compression versus the cost
> in "virtual RAM" saved for caching and the cost in disks not bought.
>
> ***************************
>
>> Do the two processors have separate caches, and thus being overly fined
>> grained makes you memory transfer bound or?
>
>
> It depends on which dual core system you use ; future systems (like
> Core) will definitely share cache as this is the best option.
>
> ***************************
>
> If we analyze the results of my little compression benchmarks, we
> find that :
> - gzip is way too slow.
> - lzo and lzf are pretty close.
>
> LZF is faster than LZO (especially on decompression) but compresses
> worse.
> So, when we are disk-bound, LZF will be slower.
> When we are CPU-bound, LZF will be faster.
>
> The differences are not that huge, though, so it might be worthwile
> to weight this against the respective code cleanliness, of which I have
> no idea.
>
> However my compression benchmarks mean nothing because I'm
> compressing whole files whereas reiser4 will be compressing little
> blocks of files. We must therefore evaluate the performance of
> compressors on little blocks, which is very different from 300
> megabytes files.
> For instance, the setup time of the compressor will be important
> (wether some huffman table needs to be constructed etc), and the
> compression ratios will be worse.
>
> Let's redo a benchmark then.
> For that I need to know if a compression block in reiser4 will be
> either :
> - a FS block containing several files (ie. a block will contain
> several small files)
> - a part of a file (ie. a small file will be 1 block)
>
> I think it's the second option, right ?
(Plain) file is considered as a set of logical clusters (64K by
default). Minimal unit occupied in memory by (plain) file is one
page. Compressed logical cluster is stored on disk in so-called
"disk clusters". Disk cluster is a set of special items (aka "ctails",
or "compressed bodies"), so that one block can contain (compressed)
data of many files and everything is packed tightly on disk.
next prev parent reply other threads:[~2006-08-30 16:50 UTC|newest]
Thread overview: 47+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-08-27 0:34 Reiser4 und LZO compression Alexey Dobriyan
2006-08-27 8:04 ` Andrew Morton
2006-08-27 8:49 ` Ray Lee
2006-08-27 9:42 ` David Masover
2006-08-28 17:34 ` Jindrich Makovicka
2006-08-28 18:05 ` Edward Shishkin
2006-08-28 12:42 ` Jörn Engel
2006-08-29 13:14 ` PFC
2006-08-29 17:38 ` David Masover
2006-08-28 17:06 ` Hans Reiser
2006-08-28 17:37 ` Stefan Traby
2006-08-28 18:15 ` Edward Shishkin
2006-08-28 21:48 ` Nigel Cunningham
2006-08-28 23:32 ` Hans Reiser
2006-08-29 4:05 ` Jan Engelhardt
2006-08-29 5:41 ` Nigel Cunningham
2006-08-29 8:23 ` David Masover
2006-08-29 9:57 ` Nigel Cunningham
2006-08-29 11:09 ` Ray Lee
2006-08-29 11:38 ` Edward Shishkin
2006-08-29 22:03 ` Nigel Cunningham
2006-08-29 4:59 ` Paul Mundt
2006-08-29 5:47 ` Nigel Cunningham
2006-08-29 13:45 ` PFC
2006-08-29 14:38 ` Stefan Traby
2006-08-29 15:55 ` PFC
2006-08-29 17:56 ` Hans Reiser
2006-08-29 18:31 ` David Masover
2006-08-29 18:36 ` Gregory Maxwell
2006-08-29 19:11 ` David Masover
2006-08-29 19:38 ` Hans Reiser
2006-08-29 20:03 ` David Masover
2006-08-29 22:15 ` Toby Thain
2006-08-29 22:42 ` David Masover
2006-08-30 9:17 ` PFC
2006-08-30 10:45 ` David Masover
2006-08-30 16:50 ` Edward Shishkin [this message]
2006-08-30 16:55 ` Hans Reiser
2006-08-31 9:32 ` Clemens Eisserer
2006-08-31 12:00 ` Edward Shishkin
2006-08-31 15:14 ` Clemens Eisserer
2006-08-31 16:55 ` Hans Reiser
2006-08-31 18:08 ` Edward Shishkin
2006-08-31 19:22 ` David Masover
2006-08-29 15:41 ` Gregory Maxwell
2006-08-29 17:42 ` Hans Reiser
2006-08-29 9:29 ` Edward Shishkin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=44F5C1D7.5060904@namesys.com \
--to=edward@namesys.com \
--cc=lists@peufeu.com \
--cc=reiserfs-list@namesys.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.