All of lore.kernel.org
 help / color / mirror / Atom feed
From: PFC <lists@peufeu.com>
To: ReiserFS List <reiserfs-list@namesys.com>
Cc: ReiserFS List <reiserfs-list@namesys.com>
Subject: Re: Reiser4 und LZO compression
Date: Wed, 30 Aug 2006 11:17:15 +0200	[thread overview]
Message-ID: <op.te3aq1uwcigqcu@apollo13> (raw)
In-Reply-To: <44F4C2C9.3070101@slaphack.com>


> Maybe, but Reiser4 is supposed to be a general purpose filesystem
> talking about its advantages/disadvantages wrt. gaming makes sense,

	I don't see a lot of gamers using Linux ;)
	But yes, gaming is what pushes hardware development these days, at least  
on the desktop.

	Also, as you said, gamers (like many others) reinvent filesystems and  
generally use the Big Zip File paradigm, which is not that stupid for a  
read only FS (if you cache all file offsets, reading can be pretty fast).  
However when you start storing ogg-compressed sound and JPEG images inside  
a zip file, it starts to stink.

	***************************

> Does the CPU power necessary to do the compression cost more or less  
> than another drive?

	***************************

	It depends, you have to consider several distinct scenarios.
	For instance, on a big Postgres database server, the rule is to have as  
many spindles as you can.
	- If you are doing a lot of full table scans (like data mining etc), more  
spindles means reads can be parallelized ; of course this will mean more  
data will have to be decompressed.
	- If you are doing a lot of little transactions (web sites), it means  
seeks can be distributed around the various disks. In this case  
compression would be a big win because there is free CPU to use ; besides,  
it would virtually double the RAM cache size.

	You have to ponder cost (in CPU $) of compression versus the cost in  
"virtual RAM" saved for caching and the cost in disks not bought.

	***************************

> Do the two processors have separate caches, and thus being overly fined
> grained makes you memory transfer bound or?

	It depends on which dual core system you use ; future systems (like Core)  
will definitely share cache as this is the best option.

	***************************

	If we analyze the results of my little compression benchmarks, we find  
that :
	- gzip is way too slow.
	- lzo and lzf are pretty close.

	LZF is faster than LZO (especially on decompression) but compresses worse.
	So, when we are disk-bound, LZF will be slower.
	When we are CPU-bound, LZF will be faster.

	The differences are not that huge, though, so it might be worthwile to  
weight this against the respective code cleanliness, of which I have no  
idea.

	However my compression benchmarks mean nothing because I'm compressing  
whole files whereas reiser4 will be compressing little blocks of files. We  
must therefore evaluate the performance of compressors on little blocks,  
which is very different from 300 megabytes files.
	For instance, the setup time of the compressor will be important (wether  
some huffman table needs to be constructed etc), and the compression  
ratios will be worse.

	Let's redo a benchmark then.
	For that I need to know if a compression block in reiser4 will be either :
	- a FS block containing several files (ie. a block will contain several  
small files)
	- a part of a file (ie. a small file will be 1 block)

	I think it's the second option, right ?


	

  reply	other threads:[~2006-08-30  9:17 UTC|newest]

Thread overview: 47+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-08-27  0:34 Reiser4 und LZO compression Alexey Dobriyan
2006-08-27  8:04 ` Andrew Morton
2006-08-27  8:49   ` Ray Lee
2006-08-27  9:42   ` David Masover
2006-08-28 17:34     ` Jindrich Makovicka
2006-08-28 18:05       ` Edward Shishkin
2006-08-28 12:42   ` Jörn Engel
2006-08-29 13:14   ` PFC
2006-08-29 17:38     ` David Masover
2006-08-28 17:06 ` Hans Reiser
2006-08-28 17:37   ` Stefan Traby
2006-08-28 18:15     ` Edward Shishkin
2006-08-28 21:48       ` Nigel Cunningham
2006-08-28 23:32         ` Hans Reiser
2006-08-29  4:05         ` Jan Engelhardt
2006-08-29  5:41           ` Nigel Cunningham
2006-08-29  8:23             ` David Masover
2006-08-29  9:57               ` Nigel Cunningham
2006-08-29 11:09                 ` Ray Lee
2006-08-29 11:38                 ` Edward Shishkin
2006-08-29 22:03                   ` Nigel Cunningham
2006-08-29  4:59         ` Paul Mundt
2006-08-29  5:47           ` Nigel Cunningham
2006-08-29 13:45           ` PFC
2006-08-29 14:38             ` Stefan Traby
2006-08-29 15:55               ` PFC
2006-08-29 17:56                 ` Hans Reiser
2006-08-29 18:31                   ` David Masover
2006-08-29 18:36                     ` Gregory Maxwell
2006-08-29 19:11                       ` David Masover
2006-08-29 19:38                         ` Hans Reiser
2006-08-29 20:03                           ` David Masover
2006-08-29 22:15                             ` Toby Thain
2006-08-29 22:42                               ` David Masover
2006-08-30  9:17                                 ` PFC [this message]
2006-08-30 10:45                                   ` David Masover
2006-08-30 16:50                                   ` Edward Shishkin
2006-08-30 16:55                                     ` Hans Reiser
2006-08-31  9:32                                       ` Clemens Eisserer
2006-08-31 12:00                                         ` Edward Shishkin
2006-08-31 15:14                                           ` Clemens Eisserer
2006-08-31 16:55                                           ` Hans Reiser
2006-08-31 18:08                                             ` Edward Shishkin
2006-08-31 19:22                                         ` David Masover
2006-08-29 15:41             ` Gregory Maxwell
2006-08-29 17:42             ` Hans Reiser
2006-08-29  9:29         ` Edward Shishkin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=op.te3aq1uwcigqcu@apollo13 \
    --to=lists@peufeu.com \
    --cc=reiserfs-list@namesys.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.