* announcement of deduplication in Sun ZFS
From: David Nicol @ 2009-11-02 20:25 UTC
To: BTRFS MAILING LIST

Is it inappropriate to post a relevant news item to this list? Please
correct me off-list if so.

http://blogs.sun.com/bonwick/en_US/entry/zfs_dedup
* Re: announcement of deduplication in Sun ZFS
From: Alex Dedul @ 2009-11-02 20:51 UTC
To: David Nicol; +Cc: BTRFS MAILING LIST

Hi!

> Is it inappropriate to post a relevant news item to this list? Please
> correct me off-list if so.
>
> http://blogs.sun.com/bonwick/en_US/entry/zfs_dedup

In any way thats a nice and interesting feature, thanks! :)

With best regards from the Soul,
Alex.
* Re: announcement of deduplication in Sun ZFS
From: Chris Samuel @ 2009-11-02 23:04 UTC
To: linux-btrfs

On Tue, 3 Nov 2009 07:51:52 am Alex Dedul wrote:
> In any way thats a nice and interesting feature, thanks! :)

My concern would be that it increases the impact of a corruption of the
block that has been de-dup'd - in other words if the block that now
represents the same data in lots of files gets trashed then all those
files have corrupt data. This implies that you would want to keep around
at least one other copy of that data to be resilient in the face of
corruption (checksums will let you detect it, but not necessarily
recover from it without duplicate copies).

Given that ZFS prides itself on detecting errors it would be strange if
they hadn't considered this in the implementation but I couldn't see any
mention of it.

cheers,
Chris
--
Chris Samuel : http://www.csamuel.org/ : Melbourne, VIC

This email may come with a PGP signature as a file. Do not panic.
For more info see: http://en.wikipedia.org/wiki/OpenPGP
* Re: announcement of deduplication in Sun ZFS
From: John Dong @ 2009-11-02 23:26 UTC
To: Chris Samuel; +Cc: linux-btrfs

I don't think ZFS has any goals of being resilient to errors as far as
recovering from them without backups / redundant disks.... Errors would
still be detected, though of course more data would potentially be
unavailable.

On Nov 2, 2009, at 6:04 PM, Chris Samuel wrote:

> On Tue, 3 Nov 2009 07:51:52 am Alex Dedul wrote:
>
>> In any way thats a nice and interesting feature, thanks! :)
>
> My concern would be that it increases the impact of a corruption of
> the block that has been de-dup'd - in other words if the block that
> now represents the same data in lots of files gets trashed then all
> those files have corrupt data. This implies that you would want to
> keep around at least one other copy of that data to be resilient in
> the face of corruption (checksums will let you detect it, but not
> necessarily recover from it without duplicate copies).
>
> Given that ZFS prides itself on detecting errors it would be strange
> if they hadn't considered this in the implementation but I couldn't
> see any mention of it.
>
> cheers,
> Chris
* Re: announcement of deduplication in Sun ZFS
From: Freddie Cash @ 2009-11-02 23:26 UTC
To: linux-btrfs

On Mon, Nov 2, 2009 at 3:04 PM, Chris Samuel <chris@csamuel.org> wrote:
> On Tue, 3 Nov 2009 07:51:52 am Alex Dedul wrote:
>
> > In any way thats a nice and interesting feature, thanks! :)
>
> My concern would be that it increases the impact of a corruption of the
> block that has been de-dup'd - in other words if the block that now
> represents the same data in lots of files gets trashed then all those
> files have corrupt data. This implies that you would want to keep
> around at least one other copy of that data to be resilient in the face
> of corruption (checksums will let you detect it, but not necessarily
> recover from it without duplicate copies).
>
> Given that ZFS prides itself on detecting errors it would be strange if
> they hadn't considered this in the implementation but I couldn't see
> any mention of it.

In the PSARC description and mailing list thread, there's mention of a
setting for how many references before another copy of the block is
kept. The default was going to be 100. So after 100 references to the
same block, a second copy of the block would be stored on disk. After
200 references, a third copy of the block would be stored on disk. And
so on.

There was some confusion on whether or not the 100 would be the minimum
or the maximum, though.

Either way, the admin sets the policy for how many extra copies to keep
around. And it would play nicely with the "copies=X" setting as well
(i.e., if copies=2, then you'd start with 2 copies of each deduped
block, and after 100 references, you'd store 2 more copies on disk, and
so on).

--
Freddie Cash
fjwcash@gmail.com
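
For readers who want the arithmetic spelled out, here is a minimal Python sketch of the policy as Freddie describes it. It is not ZFS code. The function name `copies_to_store` and its parameters are made up for illustration, and it assumes one reading of the description: every full multiple of the threshold adds another set of `base_copies` copies, with the extra set appearing at exactly 100 references rather than at 101. The thread itself notes there was confusion about whether 100 is a minimum or a maximum, so treat that boundary as a guess.

```python
def copies_to_store(references: int, base_copies: int = 1, threshold: int = 100) -> int:
    """Return how many on-disk copies of a deduplicated block to keep.

    Hypothetical helper illustrating the policy described in the thread,
    not the actual ZFS implementation.

    references  -- number of files/blocks pointing at the deduplicated block
    base_copies -- stands in for the "copies=X" setting (assumed name)
    threshold   -- references per extra set of copies (100 in the thread)
    """
    if references < 1 or base_copies < 1 or threshold < 1:
        raise ValueError("references, base_copies and threshold must be >= 1")
    # Each full multiple of `threshold` references adds one more set of copies.
    extra_sets = references // threshold
    return base_copies * (1 + extra_sets)


if __name__ == "__main__":
    for refs in (1, 99, 100, 200):
        print(refs, copies_to_store(refs), copies_to_store(refs, base_copies=2))
```

With the defaults this keeps one copy below 100 references, two from 100, and three from 200. With `base_copies=2` it starts at two copies and goes to four at 100 references, which matches the "copies=2" example in the message above.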