All of lore.kernel.org
 help / color / mirror / Atom feed
From: jim owens <owens6336@gmail.com>
To: Hubert Kario <hka@qbs.com.pl>
Cc: linux-btrfs@vger.kernel.org, Robert Collins <robertc@robertcollins.net>
Subject: Re: BackupPC, per-dir hard link limit, Debian packaging
Date: Tue, 02 Mar 2010 18:22:31 -0500	[thread overview]
Message-ID: <4B8D9DB7.7090207@gmail.com> (raw)
In-Reply-To: <201003021409.22441.hka@qbs.com.pl>

Hubert Kario wrote:
> On Tuesday 02 March 2010 03:29:05 Robert Collins wrote:
>> As I say, I realise this is queued to get addressed anyway, but it seems
>> like a realistic thing for people to do (use BackupPC on btrfs) - even
>> if something better still can be written to replace the BackupPC store
>> in the future. I will note though, that simple snapshots won't achieve
>> the deduplication level that BackupPC does, because the fils don't start
>> out as the same: they are identified as being identical post-backup.
> 
> Isn't the main idea behind deduplication to merge identical parts of files 
> together using cow? This way you could have many very similar images of 
> virtual machines, run the deduplication process and reduce massively the space 
> used while maintaining the differences between images.
> 
> If memory serves me right, the plan is to do it in userland on a post-fact 
> filesystem, not when the data is being saved. If such a daemon or program was 
> available you would run it on the system after rsyncing the workstations.
> 
> Though the question remains which system would reduce space usage more in your 
> use case. From my experience, hardlinks take less space on disk, I don't know 
> whatever it could be possible to optimise btrfs cow system for files that are 
> exactly the same.

Space use is not the key difference between these methods.
The btrfs COW makes data sharing safe.  The hard link method
means changing a file invalidates the content of all linked files.

So a BackupPC output should be read-only.

jim

  reply	other threads:[~2010-03-02 23:22 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-03-02  2:29 BackupPC, per-dir hard link limit, Debian packaging Robert Collins
2010-03-02 13:09 ` Hubert Kario
2010-03-02 23:22   ` jim owens [this message]
2010-03-03  0:05     ` Hubert Kario

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4B8D9DB7.7090207@gmail.com \
    --to=owens6336@gmail.com \
    --cc=hka@qbs.com.pl \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=robertc@robertcollins.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.