All of lore.kernel.org
 help / color / mirror / Atom feed
* Btrfs offline deduplication
@ 2014-07-31 23:54 Timofey Titovets
  2014-08-01 10:17 ` Austin S Hemmelgarn
  0 siblings, 1 reply; 7+ messages in thread
From: Timofey Titovets @ 2014-07-31 23:54 UTC (permalink / raw)
  To: linux-btrfs

Good time of day.
I have several questions about data deduplication on btrfs.
Sorry if i ask stupid questions or waste you time %)

What about implementation of offline data deduplication? I don't see
any activity on this place, may be i need to ask a particular person?
Where the problem? May be a can i try to help (testing as example)?

I could be wrong, but as i understand btrfs store crc32 checksum one
per file, if this is true, may be make a sense to create small worker
for dedup files? Like worker for autodefrag?
With simple logic like:
if sum1 == sum2 && file_size1 == file_size2; then
if (bit_to_bit_identical(file1,2)); then merge(file1, file2);
This can be first attempt to implement per file offline dedup
What you think about it? could i be wrong? or this is a horrible crutch?
(as i understand it not change format of fs)

(bedup and other tools, its cool, but have several problem with these
tools and i think, what kernel implementation can work better).

-- 
Best regards,
Timofey.

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2014-08-01 20:18 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-07-31 23:54 Btrfs offline deduplication Timofey Titovets
2014-08-01 10:17 ` Austin S Hemmelgarn
2014-08-01 13:23   ` David Sterba
2014-08-01 14:16     ` Austin S Hemmelgarn
2014-08-01 18:55       ` Mark Fasheh
2014-08-01 19:18         ` Austin S Hemmelgarn
2014-08-01 20:18           ` Mark Fasheh

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.