From: "Holger Hoffstätte" <holger@applied-asynchrony.com>
To: jeffm@suse.com, linux-btrfs@vger.kernel.org
Subject: Re: [PATCH 0/3] btrfs-progs: check improve 'checking extents' scalability
Date: Thu, 23 Jun 2016 23:24:50 +0200 [thread overview]
Message-ID: <576C53A2.3090704@applied-asynchrony.com> (raw)
In-Reply-To: <1466709966-31506-1-git-send-email-jeffm@suse.com>
On 06/23/16 21:26, jeffm@suse.com wrote:
> From: Jeff Mahoney <jeffm@suse.com>
>
> While running xfstests generic/291, which creates a single file populated
> with reflinks to the same extent, I found that fsck had been running for
> hours. perf top lead me to find_data_backref as the culprit, and a litte
> more digging made it clear: For every extent record we add, we iterate
> the entire list first. My test case had ~2M records. That math doesn't
> go well.
>
> This patchset converts the extent_backref list to an rbtree. The test
> that used to run for more than 8 hours without completing now takes
> less than 20 seconds.
Very interesting. Time for science!
unpatched:
holger>time btrfs check /dev/sdc1
Checking filesystem on /dev/sdc1
..
btrfs check /dev/sdc1 17.03s user 3.79s system 25% cpu 1:22.82 total
patched:
holger>time btrfs check /dev/sdc1
Checking filesystem on /dev/sdc1
..
btrfs check /dev/sdc1 17.03s user 3.74s system 24% cpu 1:23.24 total
This is a 1TB disk with ~850GB data in 4 subvolumes, ~2 snapshots each.
I guess it only starts to matter (relative to the necessary I/O cost per
extent) when the level of sharing is higher, i.e. many more snapshots?
OTOH it doesn't explode, so that's good. :)
cheers
Holger
next prev parent reply other threads:[~2016-06-23 21:24 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-06-23 19:26 [PATCH 0/3] btrfs-progs: check improve 'checking extents' scalability jeffm
2016-06-23 19:26 ` [PATCH 1/3] btrfs-progs: check: add helpers for converting between structures jeffm
2016-06-23 19:26 ` [PATCH 2/3] btrfs-progs: check: supplement extent backref list with rbtree jeffm
2016-06-23 19:26 ` [PATCH 3/3] btrfs-progs: check: switch to iterating over the backref_tree jeffm
2016-06-23 21:24 ` Holger Hoffstätte [this message]
2016-06-23 21:38 ` [PATCH 0/3] btrfs-progs: check improve 'checking extents' scalability Jeff Mahoney
2016-07-04 12:20 ` David Sterba
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=576C53A2.3090704@applied-asynchrony.com \
--to=holger@applied-asynchrony.com \
--cc=jeffm@suse.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).