From: Boyd Waters <waters.boyd@gmail.com>
To: "linux-btrfs@vger.kernel.org" <linux-btrfs@vger.kernel.org>
Subject: Re: Content based storage
Date: Sun, 21 Mar 2010 02:55:28 -0400 [thread overview]
Message-ID: <-4111028596887385687@unknownmsgid> (raw)
In-Reply-To: <4BA54FC8.60806@redhat.com>
I realize that I've posted some dumb things in this thread so here's a
re-cast summary:
1) In the past, I experimented with fikesystem backups, using my own
file-level checksumming that would detect when a file was already in
the backup repository, and add a hard link rather than allocate new
blocks. You can do that today on any [posix] fikesystem that supports
hard links, by using rsync.
But you are far, far better off using snapshots.
2) I said that I got 7-to-1 "deduplication" using my hard-link system.
That's a meaningless statement, but anyway I was able to save twelve
or so backups of a 100GB dataset on a 160GB hard disk.
You would almost certainly see much better results by using snapshots
on ZFS or btrfs, where a snapshot takes almost no storage to create,
and only uses extra space for any changed blocks. Snapshots are block-
level.
3) Another meaningless statement was my subjective notion that ZFS
dedup led to performance degradation. Forget I said that, as actually
I have no idea. My system was operating with failing drives at the time.
Some people report better performace with ZFS dedup, as it decreases
the number of disk writes.
next prev parent reply other threads:[~2010-03-21 6:55 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-03-16 9:21 Content based storage David Brown
2010-03-16 22:45 ` Fabio
2010-03-17 8:21 ` David Brown
2010-03-17 0:45 ` Hubert Kario
2010-03-17 8:27 ` David Brown
2010-03-17 8:48 ` Heinz-Josef Claes
2010-03-17 15:25 ` Hubert Kario
2010-03-17 15:33 ` Leszek Ciesielski
2010-03-17 19:43 ` Hubert Kario
2010-03-20 2:46 ` Boyd Waters
2010-03-20 13:05 ` Ric Wheeler
2010-03-20 21:24 ` Boyd Waters
2010-03-20 22:16 ` Ric Wheeler
2010-03-20 22:44 ` Ric Wheeler
2010-03-21 6:55 ` Boyd Waters [this message]
2010-03-18 23:33 ` create debian package of btrfs kernel from git tree rk
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=-4111028596887385687@unknownmsgid \
--to=waters.boyd@gmail.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).