All of lore.kernel.org
 help / color / mirror / Atom feed
From: David Masover <ninja@slaphack.com>
To: Hans Reiser <reiser@namesys.com>
Cc: PFC <lists@peufeu.com>,
	"reiserfs-list@namesys.com" <reiserfs-list@namesys.com>
Subject: Re: Reiser4 und LZO compression
Date: Tue, 29 Aug 2006 13:31:40 -0500	[thread overview]
Message-ID: <44F4880C.3050507@slaphack.com> (raw)
In-Reply-To: <44F47FEB.9060005@namesys.com>

Hans Reiser wrote:

> PFC wrote:

>>     A big ass RAID will not get much benefit unless :
>>     - the buffer cache stores compressed pages, so compression
>> virtually doubles the RAM cache
>>     - or the CPU is really fast
>>     - or you put one of these neat FPGA modules in a free Opteron
>> socket and upload a soft-hardware LZF in it with a few gigabytes/s
>> throughput
> Or you look the sysadmin in the eyes, and say, your file servers have
> more out of disk space problems than load problems, yes?

I'd look at the IO-Wait number, also.

Compression makes sense if:
   - You spend a lot of time waiting for the disk.
   - You need disk space, and either:
      - You already have enough spare CPU to do compression
      - It's cheaper to buy enough CPU than to buy the space compression 
would save you.

Conversely, compression does NOT make sense if:
   - You spend a lot of time with the CPU busy and the disk idle.
   - You have more than enough disk space.
   - Disk space is cheaper than buying enough CPU to handle compression.
   - You've tried compression, and the CPU requirements slowed you more 
than you saved in disk access.

After a certain amount of RAID -- really, after the second or third disk 
in a mirrored array, or the third or fourth disk in RAID 5 -- at that 
point, I don't think adding more disks is really doing a huge amount to 
increase reliability, which means you're either trying to increase speed 
or space.  You can increase both of these by using compression, if you 
have the spare CPU, so the question becomes:  Does the CPU power 
necessary to do the compression cost more or less than another drive?

Especially in a big-ass RAID, you'll also want to be thinking about heat 
and power consumption, too.

There are still cases where compression loses, but they seem 
pathological enough that you'd want to benchmark to see if they really 
apply to you.  For instance, if you're dealing with lots of quick, 
read-only access to very tiny amounts of data, compression will likely 
slow you down, whereas adding another disk can speed you up.  If your 
data isn't very compressible, then you're just burning cycles for no 
point.  And, of course, the price/performance ratio (CPUs) and price/gig 
ratio (disk space) changes all the time.

And all of this is ignoring the very real possibility of a dedicated 
hardware compressor -- at which point, we could afford pretty much any 
algorithm you like, as long as the hardware can do it quickly enough. 
This is an advantage to using cryptoapi for the cryptocompress plugin, 
by the way -- it's one place where we could call out to the hardware later.

  reply	other threads:[~2006-08-29 18:31 UTC|newest]

Thread overview: 47+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-08-27  0:34 Reiser4 und LZO compression Alexey Dobriyan
2006-08-27  8:04 ` Andrew Morton
2006-08-27  8:49   ` Ray Lee
2006-08-27  9:42   ` David Masover
2006-08-28 17:34     ` Jindrich Makovicka
2006-08-28 18:05       ` Edward Shishkin
2006-08-28 12:42   ` Jörn Engel
2006-08-29 13:14   ` PFC
2006-08-29 17:38     ` David Masover
2006-08-28 17:06 ` Hans Reiser
2006-08-28 17:37   ` Stefan Traby
2006-08-28 18:15     ` Edward Shishkin
2006-08-28 21:48       ` Nigel Cunningham
2006-08-28 23:32         ` Hans Reiser
2006-08-29  4:05         ` Jan Engelhardt
2006-08-29  5:41           ` Nigel Cunningham
2006-08-29  8:23             ` David Masover
2006-08-29  9:57               ` Nigel Cunningham
2006-08-29 11:09                 ` Ray Lee
2006-08-29 11:38                 ` Edward Shishkin
2006-08-29 22:03                   ` Nigel Cunningham
2006-08-29  4:59         ` Paul Mundt
2006-08-29  5:47           ` Nigel Cunningham
2006-08-29 13:45           ` PFC
2006-08-29 14:38             ` Stefan Traby
2006-08-29 15:55               ` PFC
2006-08-29 17:56                 ` Hans Reiser
2006-08-29 18:31                   ` David Masover [this message]
2006-08-29 18:36                     ` Gregory Maxwell
2006-08-29 19:11                       ` David Masover
2006-08-29 19:38                         ` Hans Reiser
2006-08-29 20:03                           ` David Masover
2006-08-29 22:15                             ` Toby Thain
2006-08-29 22:42                               ` David Masover
2006-08-30  9:17                                 ` PFC
2006-08-30 10:45                                   ` David Masover
2006-08-30 16:50                                   ` Edward Shishkin
2006-08-30 16:55                                     ` Hans Reiser
2006-08-31  9:32                                       ` Clemens Eisserer
2006-08-31 12:00                                         ` Edward Shishkin
2006-08-31 15:14                                           ` Clemens Eisserer
2006-08-31 16:55                                           ` Hans Reiser
2006-08-31 18:08                                             ` Edward Shishkin
2006-08-31 19:22                                         ` David Masover
2006-08-29 15:41             ` Gregory Maxwell
2006-08-29 17:42             ` Hans Reiser
2006-08-29  9:29         ` Edward Shishkin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=44F4880C.3050507@slaphack.com \
    --to=ninja@slaphack.com \
    --cc=lists@peufeu.com \
    --cc=reiser@namesys.com \
    --cc=reiserfs-list@namesys.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.