From: Josef Bacik <josef@toxicpanda.com>
To: Qu Wenruo <wqu@suse.com>
Cc: linux-btrfs@vger.kernel.org
Subject: Re: [PATCH v3 3/3] btrfs: Introduce new incompat feature, BG_TREE, to speed up mount time
Date: Fri, 11 Oct 2019 09:23:36 -0400
Message-ID: <20191011132335.mo2zsapmmjhftezh@MacBook-Pro-91.local>
In-Reply-To: <20191010023928.24586-4-wqu@suse.com>

On Thu, Oct 10, 2019 at 10:39:28AM +0800, Qu Wenruo wrote:
> The overall idea of the new BG_TREE is pretty simple:
> Put BLOCK_GROUP_ITEMS into a separate tree.
>
> This brings one obvious enhancement:
> - Reduce mount time of large fs
>
> Although it would be possible to accept BLOCK_GROUP_ITEMS in either
> tree (extent root or bg root), I'll leave that in-kernel conversion, as an
> alternative to offline conversion, as a next step if there is a lot of
> interest in it.
>
> So for now, if an existing fs wants to take advantage of the BG_TREE feature,
> btrfs-progs will provide an offline conversion tool.
>
> [[Benchmark]]
> Physical device: NVMe SSD
> VM device: VirtIO block device, backed by sparse file
> Nodesize: 4K (to bump up tree height)
> Extent data size: 4M
> Fs size used: 1T
>
> All file extents on disk are 4M in size, preallocated to reduce space usage
> (as the VM uses a loopback block device backed by a sparse file)
>
> Without patchset:
> Use ftrace function graph:
>
>  7)               |  open_ctree [btrfs]() {
>  7)               |    btrfs_read_block_groups [btrfs]() {
>  7) @ 805851.8 us |    }
>  7) @ 911890.2 us |  }
>
> btrfs_read_block_groups() takes 88% of the total mount time.
>
> With patchset, using the -O bg-tree mkfs option:
>
>  6)               |  open_ctree [btrfs]() {
>  6)               |    btrfs_read_block_groups [btrfs]() {
>  6) * 91204.69 us |    }
>  6) @ 192039.5 us |  }
>
> open_ctree() time is only 21% of original mount time.
> And btrfs_read_block_groups() only takes 47% of total open_ctree()
> execution time.
>
> The reason is pretty obvious when considering how many tree blocks need
> to be read from disk:
> - Original extent tree:
>   nodes:	55
>   leaves:	1025
>   total:	1080
> - Block group tree:
>   nodes:	1
>   leaves:	13
>   total:	14
>
> Not to mention that tree block readahead works well for the bg
> tree, as we will read every item.
> Readahead for the extent tree, by contrast, is a disaster, as the block
> groups are scattered across the whole extent tree.
>
> The reduction of mount time is already obvious even on super fast NVMe
> disk with memory cache.
> It would be even more obvious if the fs is on spinning rust.
>
> Signed-off-by: Qu Wenruo <wqu@suse.com>

You need to add

	fs_info->bg_root->block_rsv = &fs_info->delayed_refs_rsv;

to btrfs_init_global_block_rsv(), otherwise bad things will happen. Thanks,

Josef
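
For illustration, here is a minimal user-space sketch of where the suggested assignment would sit. It is not the kernel code: the struct definitions are simplified stand-ins that carry only the fields used here, `init_global_block_rsv_sketch` is a hypothetical name, and the NULL check on `bg_root` assumes that the root only exists when BG_TREE is enabled. The `extent_root` line is included on the assumption that the existing function points the extent root at `delayed_refs_rsv` in the same way.

```c
#include <stddef.h>

/* Simplified stand-ins for the kernel structures (assumption: only the
 * fields needed for this illustration are modelled). */
struct btrfs_block_rsv {
	unsigned long long reserved;
};

struct btrfs_root {
	struct btrfs_block_rsv *block_rsv;
};

struct btrfs_fs_info {
	struct btrfs_block_rsv delayed_refs_rsv;
	struct btrfs_root *extent_root;
	struct btrfs_root *bg_root;	/* NULL when BG_TREE is not enabled (assumption) */
};

/* Hypothetical stand-in for btrfs_init_global_block_rsv(): point each
 * root at the reservation its metadata updates should draw from. */
static void init_global_block_rsv_sketch(struct btrfs_fs_info *fs_info)
{
	/* Assumed to mirror the existing extent root setup. */
	fs_info->extent_root->block_rsv = &fs_info->delayed_refs_rsv;

	/* The line requested in the review, guarded for filesystems
	 * that have no block group tree. */
	if (fs_info->bg_root)
		fs_info->bg_root->block_rsv = &fs_info->delayed_refs_rsv;
}
```

The design point is that block group items move from the extent tree to the bg tree, so the new root is given the same reservation the extent root uses rather than being left with none.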