From: Duncan <1i5t5.duncan@cox.net>
To: linux-btrfs@vger.kernel.org
Subject: Re: btrfs fi defrag does not defrag files >256kB?
Date: Thu, 28 Jul 2016 05:14:03 +0000 (UTC) [thread overview]
Message-ID: <pan$aef9a$f7db9e1f$4c68e09$be53f063@cox.net> (raw)
In-Reply-To: CAD=QJKjoSR8R_hDnr4sgWizMugp+jvmHQwT-SMDdNJH=G9HEkw@mail.gmail.com
Nicholas D Steeves posted on Wed, 27 Jul 2016 13:19:01 -0400 as excerpted:
> Is there any reason why defrag without -t cannot detect and default to
> the data chunk size, or why it does not default to 1 GiB?
I don't know the answer, but have wondered that myself. 256 KiB seems a
rather small default, to me. I'd expect something in the MiB range, at
least, maybe the same 2 MiB that modern partitioners tend to use for
alignment, for the same reason, that tends to be a reasonable whole
multiple of most erase-block sizes, and if the partition is aligned,
should prevent unnecessary read-modify-write cycles on ssd, and help with
tiled zones on SMR drives as well.
As to the question in the subject line, AFAIK btrfs fi defrag works on
extents, not filesize per-se, so using the default 256 KiB target, yes
it'll defrag files larger than that, but only for extents that are
smaller than that. If all the extents are 256 KiB plus, defrag won't do
anything with it without a larger target option or unless the compress
option is also used, in which case it rewrites everything it is pointed
at, in ordered to recompress it.
Talking about compression, it's worth mentioning that filefrag doesn't
understand btrfs compression either, and will count each 128 KiB
(uncompressed size) compression block as a separate extent. To get the
true picture using filefrag, you need to use verbose and either eyeball
the results manually or feed them into a script that processes the
numbers and combines "extents" if they are reported as immediately
consecutive on the filesystem.
As such, filefrag, given the regular opportunistic compression, turns out
to be a good method of determining whether a file (of over 128 KiB in
size) is actually compressed or not, since if it is filefrag will report
multiple 128 KiB extents, while if it's not, extent size should be much
less regular, likely with larger extents unless the file is often
modified and rewritten in-place.
--
Duncan - List replies preferred. No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master." Richard Stallman
next prev parent reply other threads:[~2016-07-28 5:14 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-07-26 23:03 btrfs fi defrag -r -t 32M? What is actually happening? Nicholas D Steeves
2016-07-27 1:10 ` Duncan
2016-07-27 17:19 ` btrfs fi defrag does not defrag files >256kB? Nicholas D Steeves
2016-07-28 5:14 ` Duncan [this message]
2016-07-28 10:55 ` David Sterba
2016-07-28 17:25 ` Duncan
2016-07-28 17:53 ` Nicholas D Steeves
2016-07-29 3:56 ` Duncan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='pan$aef9a$f7db9e1f$4c68e09$be53f063@cox.net' \
--to=1i5t5.duncan@cox.net \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).