From: "Austin S. Hemmelgarn" <ahferroin7@gmail.com>
To: bo.li.liu@oracle.com
Cc: James Hogarth <james.hogarth@gmail.com>, linux-btrfs@vger.kernel.org
Subject: Re: Query about proposed dedup patches and behaviours
Date: Thu, 14 Jan 2016 14:41:27 -0500 [thread overview]
Message-ID: <5697F9E7.1020004@gmail.com> (raw)
In-Reply-To: <20160114192647.GB24567@localhost.localdomain>
On 2016-01-14 14:26, Liu Bo wrote:
> On Thu, Jan 14, 2016 at 11:46:33AM -0500, Austin S. Hemmelgarn wrote:
>> On 2016-01-14 11:13, James Hogarth wrote:
>>> Hi,
>>>
>>> The duperemove[1] tool is in the process for packaging for Fedora at
>>> present but I was wondering what future this may have with the 4.5
>>> dedup patches being proposed.
>>>
>>> WIll the btrfs command have the ability to out-of-line dedup files
>>> similar to duperemove (thus negating the need for it) or will this
>>> only control in-line dedup with a tool like duperemove still being
>>> required for periodic only (or restricted path) dedup?
>> Unless I'm horribly misreading the code, the regular btrfs-progs will not be
>> adding the ability to do out-of-band deduplication. It may at some point
>> add a shortcut for the required ioctl to be used from scripts, but that's
>> probably unlikely.
>>>
>>> To avoid memory usage bloat if the btrfs command can order dedup of X
>>> files on the path correctly can it be passed a path to carry the hash
>>> map in some form (similar to how dupeemeove can use sqlite for this)
>>> or is this another use case for the external tool?
>> This shouldn't be an issue for in-line deduplication, as that's handled in
>> the kernel.
>>>
>>> Finally what's the present situation with regards to defragmentation
>>> and deduplication? Is it safe to turn on autodefrag now when using
>>> snapshots and duperemove? What should the behaviour be with the
>>> proposed 4.5 dedup patches if both inline dedup and autodefrag are
>>> enabled as mount options?
>> I'm not entirely certain how deduplication would interact with any form of
>> defragmentation. I'm pretty certain though that autodefrag does properly
>> handle snapshots, such that the reflinks aren't broken, and it's the
>> original copy that gets any shared extents defragmented into it.
>
> If it refers to snapshot-aware defrag, it's been disabled, so now btrfs
> will not maintain reflinks between snapshots.
>
I was under the impression that autodefrag had been done separately from
the snapshot-aware manually triggered defrag, and that it's always been
snapshot aware.
next prev parent reply other threads:[~2016-01-14 19:42 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-01-14 16:13 Query about proposed dedup patches and behaviours James Hogarth
2016-01-14 16:46 ` Austin S. Hemmelgarn
2016-01-14 19:26 ` Liu Bo
2016-01-14 19:41 ` Austin S. Hemmelgarn [this message]
2016-01-15 1:47 ` Duncan
2016-01-15 9:33 ` James Hogarth
2016-01-15 12:18 ` Duncan
2016-01-20 15:33 ` Interjection: autodefrag mount option aye, nae? Al
2016-01-20 15:39 ` Austin S. Hemmelgarn
2016-01-20 18:39 ` Duncan
2016-01-21 20:59 ` Kai Krakow
2016-01-22 12:14 ` Austin S. Hemmelgarn
2016-01-22 19:43 ` Kai Krakow
2016-01-23 22:11 ` Query about proposed dedup patches and behaviours Mark Fasheh
2016-01-24 5:12 ` Duncan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5697F9E7.1020004@gmail.com \
--to=ahferroin7@gmail.com \
--cc=bo.li.liu@oracle.com \
--cc=james.hogarth@gmail.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).