All of lore.kernel.org
 help / color / mirror / Atom feed
From: Hans Reiser <reiser@namesys.com>
To: Mike Benoit <ipso@snappymail.ca>
Cc: David Masover <ninja@slaphack.com>,
	reiserfs-list@namesys.com,
	Alexander Zarochentcev <zam@namesys.com>,
	vs <vs@thebsh.namesys.com>
Subject: Re: reiser4 status (correction)
Date: Sun, 23 Jul 2006 00:19:47 -0600	[thread overview]
Message-ID: <44C31503.9000005@namesys.com> (raw)
In-Reply-To: <1153598173.6659.219.camel@ipso.snappymail.ca>

Mike Benoit wrote:

>On Sat, 2006-07-22 at 07:34 -0500, David Masover wrote:
>
>  
>
>>The compression will probably mostly be about speed.  Remember, if
>>we're 
>>talking about people who want to see tangible, visceral results,
>>we're 
>>probably also talking about end-users.  And trust me, the vast
>>majority 
>>of most of my data (as an end-user) is not very compressible.
>>
>>    
>>
>
>Sure, mine too. Between the many gigs of MP3s and movies I have stored
>on my HD, only about 10-20GB is the OS, Email, and documents/source
>code. Even just compressing that small portion though I could probably
>save between 5-10GB. The difference is though I can do a df before, and
>a df after, and I can instantly see I got my moneys worth. Same with
>encryption. 
>  
>
I am looking forward to the first user email complaining that he
compressed a file stored on reiser4 and it didn't save space (and then
someday maybe even an email saying that he compressed a file and it took
up more space but the user space compressor is sure that it saved space
and he does not understand....).:)

>With the repacker it is much more difficult (for average users) to time
>how long a program takes to load some file (or launch), before the
>repacker and after.
>
I think you are confusing repacking and compression?  Repacking, by
removing seeks, will make it more predictable not less.

> Especially since caching comes in to play. 
>
>Also, according to this little poll on one of the compressed FUSE sites
>you linked to, more people are looking to compression for space saving,
>then for speed:
>http://parallel.vub.ac.be/~johan/compFUSEd/index.php?option=polls&task=results&pollid=31
>
>
>  
>
>>No, mostly we're talking about things like office documents, the 
>>majority of which fit in less than a gigabyte, and multimedia (music, 
>>movies, games) which will gain very little from compression.  If 
>>anything, the benefit would be mostly in compressing software.
>>
>>    
>>
>>>less tangible like fragmentation percentages and minor I/O throughput
>>>improvements. I used to work at a large, world wide web hosting company
>>>and I could see making a case to management for purchasing Reiser4
>>>compression would be pretty easy for our shared servers. Instantly
>>>freeing up large amounts of disk space (where .html/.php files were the
>>>vast majority) would save huge amounts of money on disk drives,
>>>especially since most of the servers used RAID1 and adding new drives
>>>was a huge pain in the neck. Making a case to purchase a repacker would
>>>be much, much more difficult.
>>>      
>>>
>>Hmm, the problem is, if storage space is really the big deal, it's been 
>>done before, and some of these efforts are still usable and free:
>>
>>http://parallel.vub.ac.be/~johan/compFUSEd/
>>http://www.miio.net/fusecompress/
>>http://north.one.pl/~kazik/pub/LZOlayer/
>>http://apfs.humorgraficojr.com/apfs_ingles.html
>>
>>And while we're on the topic, here's an FS that does unpacking of 
>>archives, probably about the same way we imagined it in Reiser4 
>>pseudofiles/magic:
>>
>>http://www.nongnu.org/unpackfs/
>>
>>But regardless, as far as I can tell, the only real, tangible benefit of 
>>using Reiser4 compression instead of one of those four FUSE filesystems 
>>is speed.  Reiser4 would compress/decompress when actually hitting the 
>>disk, not just the FS, and it would also probably use in-kernel 
>>compression, rather than calling out to userspace on every FS operation.
>>    
>>
I think that compressing only on flush is a big issue.  It was a lot
harder to code it, but it really makes a difference.  You don't want a
machine with a working set that fits into RAM to be compressing, that
would be lethal to performance, and it is a very important case.

Hans

  parent reply	other threads:[~2006-07-23  6:19 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-07-20 21:59 reiser4 status (correction) Hans Reiser
2006-07-21  3:02 ` David Masover
2006-07-21  8:44   ` Hans Reiser
2006-07-21 10:17     ` Sarath Menon
2006-07-21 19:13     ` David Masover
2006-07-21 20:41     ` Mike Benoit
2006-07-21 21:06       ` David Masover
2006-07-21 21:37         ` Mike Benoit
2006-07-21 22:29           ` Andreas Schäfer
2006-07-21 22:45             ` David Masover
2006-07-21 23:06               ` Andreas Schäfer
2006-07-22 20:07                 ` Maciej Sołtysiak
2006-07-21 22:40           ` David Masover
2006-07-21 23:53             ` Mike Benoit
2006-07-22  2:48               ` David Masover
2006-07-22  5:53                 ` Hans Reiser
2006-07-22  8:55                   ` Mike Benoit
2006-07-22 12:34                     ` David Masover
2006-07-22 19:56                       ` Mike Benoit
2006-07-22 20:37                         ` David Masover
2006-07-23  6:19                         ` Hans Reiser [this message]
2006-07-22 15:40                 ` portage tree (Was: Re: reiser4 status (correction)) Christian Trefzer
2006-07-23  5:50                   ` Hans Reiser
2006-07-24 15:12                     ` wiki entry (Was: Re: portage tree) Christian Trefzer
2006-07-22  0:49       ` reiser4 status (correction) Hans Reiser

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=44C31503.9000005@namesys.com \
    --to=reiser@namesys.com \
    --cc=ipso@snappymail.ca \
    --cc=ninja@slaphack.com \
    --cc=reiserfs-list@namesys.com \
    --cc=vs@thebsh.namesys.com \
    --cc=zam@namesys.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.