* announcement of deduplication in Sun ZFS
From: David Nicol @ 2009-11-02 20:25 UTC
To: BTRFS MAILING LIST
Is it inappropriate to post a relevant news item to this list? Please
correct me off-list if so.
http://blogs.sun.com/bonwick/en_US/entry/zfs_dedup
* Re: announcement of deduplication in Sun ZFS
From: Alex Dedul @ 2009-11-02 20:51 UTC
To: David Nicol; +Cc: BTRFS MAILING LIST
Hi!
> Is it inappropriate to post a relevant news item to this list? Please
> correct me off-list if so.
>
> http://blogs.sun.com/bonwick/en_US/entry/zfs_dedup
Either way, that's a nice and interesting feature, thanks! :)
With best regards from the Soul, Alex.
* Re: announcement of deduplication in Sun ZFS
From: Chris Samuel @ 2009-11-02 23:04 UTC
To: linux-btrfs
On Tue, 3 Nov 2009 07:51:52 am Alex Dedul wrote:
> Either way, that's a nice and interesting feature, thanks! :)
My concern would be that it increases the impact of a corruption of the block
that has been de-dup'd - in other words if the block that now represents the
same data in lots of files gets trashed then all those files have corrupt
data. This implies that you would want to keep around at least one other copy
of that data to be resilient in the face of corruption (checksums will let you
detect it, but not necessarily recover from it without duplicate copies).
Given that ZFS prides itself on detecting errors it would be strange if they
hadn't considered this in the implementation but I couldn't see any mention of
it.
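
To make the concern concrete, here is a minimal Python sketch of a toy
content-addressed store. It is not how ZFS or btrfs implement
deduplication; the class name DedupStore, its methods, and the 4-byte
block size are all made up for illustration. It only shows that a single
damaged shared block is detected by its checksum, yet every file that
references it is affected when no second copy exists.

```python
import hashlib


class DedupStore:
    """Toy deduplicating store (hypothetical, for illustration only)."""

    def __init__(self):
        self.blocks = {}  # checksum -> the single stored copy of the block
        self.files = {}   # file name -> ordered list of block checksums

    def write(self, name, data, block_size=4):
        """Split data into blocks and store each unique block only once."""
        digests = []
        for i in range(0, len(data), block_size):
            chunk = data[i:i + block_size]
            digest = hashlib.sha256(chunk).hexdigest()
            # Identical blocks share one stored copy.
            self.blocks.setdefault(digest, bytearray(chunk))
            digests.append(digest)
        self.files[name] = digests

    def damaged_files(self):
        """Return the files that reference a block failing its checksum."""
        bad = {
            digest
            for digest, block in self.blocks.items()
            if hashlib.sha256(block).hexdigest() != digest
        }
        return sorted(
            name
            for name, digests in self.files.items()
            if any(d in bad for d in digests)
        )


store = DedupStore()
for name in ("a.txt", "b.txt", "c.txt"):
    store.write(name, b"AAAA")   # three files, one shared block
store.write("d.txt", b"BBBB")    # unrelated file

# Flip bits in the one stored copy of the shared block.
shared = hashlib.sha256(b"AAAA").hexdigest()
store.blocks[shared][0] ^= 0xFF

print(store.damaged_files())     # the three files sharing the block
```

The checksum pinpoints the bad block, but with only one stored copy
there is nothing to repair it from, which is the case for keeping extra
copies of heavily referenced blocks.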
cheers,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Melbourne, VIC
This email may come with a PGP signature as a file. Do not panic.
For more info see: http://en.wikipedia.org/wiki/OpenPGP
* Re: announcement of deduplication in Sun ZFS
From: John Dong @ 2009-11-02 23:26 UTC
To: Chris Samuel; +Cc: linux-btrfs
I don't think ZFS aims to recover from errors without backups or
redundant disks. Errors would still be detected, though of course
more data would potentially be unavailable.
On Nov 2, 2009, at 6:04 PM, Chris Samuel wrote:
> On Tue, 3 Nov 2009 07:51:52 am Alex Dedul wrote:
>
>> Either way, that's a nice and interesting feature, thanks! :)
>
> My concern would be that it increases the impact of a corruption of the
> block that has been de-dup'd - in other words if the block that now
> represents the same data in lots of files gets trashed then all those
> files have corrupt data. This implies that you would want to keep around
> at least one other copy of that data to be resilient in the face of
> corruption (checksums will let you detect it, but not necessarily
> recover from it without duplicate copies).
>
> Given that ZFS prides itself on detecting errors it would be strange if
> they hadn't considered this in the implementation but I couldn't see any
> mention of it.
>
> cheers,
> Chris
* Re: announcement of deduplication in Sun ZFS
From: Freddie Cash @ 2009-11-02 23:26 UTC
To: linux-btrfs
On Mon, Nov 2, 2009 at 3:04 PM, Chris Samuel <chris@csamuel.org> wrote:
> On Tue, 3 Nov 2009 07:51:52 am Alex Dedul wrote:
>
> > Either way, that's a nice and interesting feature, thanks! :)
>
> My concern would be that it increases the impact of a corruption of the block
> that has been de-dup'd - in other words if the block that now represents the
> same data in lots of files gets trashed then all those files have corrupt
> data. This implies that you would want to keep around at least one other copy
> of that data to be resilient in the face of corruption (checksums will let you
> detect it, but not necessarily recover from it without duplicate copies).
>
> Given that ZFS prides itself on detecting errors it would be strange if they
> hadn't considered this in the implementation but I couldn't see any mention of
> it.
In the PSARC description and mailing list thread, there's mention of a
setting for how many references a block can accumulate before another
copy of it is kept. The default was going to be 100. So after 100
references to the same block, a second copy of the block would be
stored on disk. After 200 references, a third copy would be stored on
disk. And so on.
There was some confusion on whether or not the 100 would be the
minimum or the maximum, though.
Either way, the admin sets the policy for how many extra copies to
keep around. It would also play nicely with the "copies=X" setting
(i.e. if copies=2, you'd start with 2 copies of each deduped block,
and after 100 references you'd store 2 more copies on disk, and so
on).
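
The arithmetic described above can be sketched in a few lines of Python.
This models only the policy as summarised in this message, not the
actual ZFS code. The function name copies_on_disk and its parameters are
hypothetical, and treating the extra copy as appearing exactly at the
100th reference is an assumption, since the wording leaves open whether
it happens at or just past the threshold.

```python
def copies_on_disk(references, threshold=100, base_copies=1):
    """Copies kept for a deduped block under the policy described above.

    Assumption: one more set of `base_copies` copies is added each time
    the reference count reaches another multiple of `threshold`.
    """
    if references < 1:
        raise ValueError("a stored block has at least one reference")
    if threshold < 1 or base_copies < 1:
        raise ValueError("threshold and base_copies must be positive")
    return base_copies * (1 + references // threshold)


for refs in (1, 99, 100, 200):
    print(refs, copies_on_disk(refs), copies_on_disk(refs, base_copies=2))
```

With the defaults this gives 1 copy below 100 references, 2 copies from
100 and 3 copies from 200; with base_copies=2 (the copies=2 case) it
gives 2, then 4, then 6.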
--
Freddie Cash
fjwcash@gmail.com