* Auto-sparseifying
@ 2008-12-11 10:05 Oliver Mattos
2008-12-11 13:54 ` Auto-sparseifying jim owens
2008-12-11 14:57 ` Auto-sparseifying Chris Mason
From: Oliver Mattos @ 2008-12-11 10:05 UTC
To: linux-btrfs
Hi,
I've noticed many files have blocks of plain nulls up to a few kB long,
even files you wouldn't normally expect to, like ELF executables. I
know that with compression enabled these will compress down to almost
nothing, but compression has a noticeable performance cost. How much
overhead would it be to check each checksummed file extent against the
checksum of a blank (null-filled) extent, and if it matches, not store
that data at all? You may not even need the checksums - just reading
the first few bytes of data and checking for "nullness" would tell you
whether the block is null or not. (If the first 4 bytes are null, the
whole block is likely to be all nulls, so it's worth the overhead of
scanning the rest of the block.)
This seems like a particularly low-overhead space and performance
tweak (performance because read/write speed would increase for
"average" files that contain a few null blocks).
Any thoughts?
Oliver.
* Re: Auto-sparseifying
2008-12-11 10:05 Auto-sparseifying Oliver Mattos
@ 2008-12-11 13:54 ` jim owens
2008-12-11 14:57 ` Auto-sparseifying Chris Mason
From: jim owens @ 2008-12-11 13:54 UTC
To: Oliver Mattos; +Cc: linux-btrfs
... and also Data De-duplication...
A reality check before people go off the deep end here on these two
space-saving methods. It is interesting to know about "duplicate 512
byte blocks" and "null sequences" from a statistical point of view,
but it is not practical to sparsify/de-dup at such a small granularity
in the FS.
The trade-off everyone is missing is that each sparse/duplicate region
is an *extent* that must be tracked in the FS, and to do a read you
must issue a new *I/O for each disk extent*. So we blow the metadata
structures up to unwieldy sizes and we beat the crap out of the disk.
Even with an SSD we add tremendous traffic to the I/O pipeline.
Sparse/de-dup at VM page size may work OK for small files but is still
not efficient for large files.
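To put rough numbers on it (back-of-envelope, assuming 4k granularity,
not measured): a 1 GB file whose data and zero blocks are interleaved
could need on the order of 1 GB / 4 KB = 262,144 extent records to
describe it instead of a handful, and a sequential read of it could
fan out into a comparable number of separate I/O requests.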
jim
* Re: Auto-sparseifying
2008-12-11 10:05 Auto-sparseifying Oliver Mattos
2008-12-11 13:54 ` Auto-sparseifying jim owens
@ 2008-12-11 14:57 ` Chris Mason
From: Chris Mason @ 2008-12-11 14:57 UTC
To: Oliver Mattos; +Cc: linux-btrfs
On Thu, 2008-12-11 at 10:05 +0000, Oliver Mattos wrote:
> Hi,
>
> I've noticed many files have blocks of plain nulls up to a few kB long,
> even files you wouldn't normally expect to, like ELF executables. I
> know that with compression enabled these will compress down to almost
> nothing, but compression has a noticeable performance cost. How much
> overhead would it be to check each checksummed file extent against the
> checksum of a blank (null-filled) extent, and if it matches, not store
> that data at all? You may not even need the checksums - just reading
> the first few bytes of data and checking for "nullness" would tell you
> whether the block is null or not. (If the first 4 bytes are null, the
> whole block is likely to be all nulls, so it's worth the overhead of
> scanning the rest of the block.)
>
> This seems like a particularly low-overhead space and performance
> tweak (performance because read/write speed would increase for
> "average" files that contain a few null blocks).
>
> Any thoughts?
The first comment is that it won't be as fast as you expect ;) Most
disks read 64k of data about as fast as they read 4k of data, so if
you have a file with zeros sprinkled around, the disk will end up
reading the zeros and just not sending them back to you.
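A rough illustration with made-up but typical drive numbers: with
~10ms of seek plus rotational latency and ~60MB/s of sequential
transfer, a 4k read costs about 10.1ms and a 64k read about 11ms, so
skipping the zero blocks inside an extent saves almost nothing.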
Jim is definitely right about the cost of metadata for smaller extents.
Putting pointers to the zero extent into the file will greatly increase
the number of extents needed to describe a single file.
Traditional filesystems usually don't detect zeros and skip them
because userland will often write zeros to preallocate a file. Unless
btrfs is in nodatacow mode, that preallocation step doesn't really
affect layout, and we could map the zeros to a virtual extent that is
never written or read.
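For context, the old-style userland preallocation pattern looks
roughly like this (illustrative sketch only, the helper is made up;
modern code would use posix_fallocate() instead):

#include <string.h>
#include <unistd.h>

/* Reserve 'len' bytes of file space the old-fashioned way: by
 * writing real zeros.  This is the traffic the filesystem sees. */
static int preallocate_with_zeros(int fd, size_t len)
{
        char buf[4096];
        ssize_t ret;

        memset(buf, 0, sizeof(buf));
        while (len > 0) {
                size_t chunk = len < sizeof(buf) ? len : sizeof(buf);

                ret = write(fd, buf, chunk);
                if (ret < 0)
                        return -1;
                len -= ret;
        }
        return 0;
}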
But at the end of the day, the main place that zeros come from is
benchmarking programs. I would prefer to use compression or dedup and
get larger benefits than to optimize away 4k at a time here and there.
-chris