From: Boris Burkov <boris@bur.io>
To: Naohiro Aota <naota@elisp.net>
Cc: linux-btrfs@vger.kernel.org, Naohiro Aota <naohiro.aota@wdc.com>
Subject: Re: [PATCH 1/4] btrfs: delete unused BGs while reclaiming BGs
Date: Wed, 7 Jun 2023 11:48:37 -0700 [thread overview]
Message-ID: <20230607184837.GA3191631@zen> (raw)
In-Reply-To: <06288ff9c090a5bfe76e0a2080eb1fbd640cdd62.1686028197.git.naohiro.aota@wdc.com>
On Tue, Jun 06, 2023 at 02:36:33PM +0900, Naohiro Aota wrote:
> The reclaiming process only starts after the FS volumes are allocated to a
> certain level (75% by default). Thus, the list of reclaiming target block
> groups can build up so huge at the time the reclaim process kicks in. On a
> test run, there were over 1000 BGs in the reclaim list.
>
> As the reclaim involves rewriting the data, it takes really long time to
> reclaim the BGs. While the reclaim is running, btrfs_delete_unused_bgs()
> won't proceed because the reclaim side is holding
> fs_info->reclaim_bgs_lock. As a result, we will have a large number of unused
> BGs kept in the unused list. On my test run, I got 1057 unused BGs.
>
> Since deleting a block group is relatively easy and fast work, we can call
> btrfs_delete_unused_bgs() while it reclaims BGs, to avoid building up
> unused BGs.
>
> Fixes: 18bb8bbf13c1 ("btrfs: zoned: automatically reclaim zones")
> CC: stable@vger.kernel.org # 5.15+
> Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com>
> ---
> fs/btrfs/block-group.c | 14 ++++++++++++++
> 1 file changed, 14 insertions(+)
>
> diff --git a/fs/btrfs/block-group.c b/fs/btrfs/block-group.c
> index 618ba7670e66..c5547da0f6eb 100644
> --- a/fs/btrfs/block-group.c
> +++ b/fs/btrfs/block-group.c
> @@ -1824,10 +1824,24 @@ void btrfs_reclaim_bgs_work(struct work_struct *work)
>
> next:
> btrfs_put_block_group(bg);
> +
> + mutex_unlock(&fs_info->reclaim_bgs_lock);
> + /*
> + * Reclaiming all the BGs in the list can take really long.
> + * Prioritize cleanning up unused BGs.
> + */
> + btrfs_delete_unused_bgs(fs_info);
> + /*
> + * If we are interrupted by a balance, we can just bail out. The
> + * cleaner thread call me again if necessary.
> + */
> + if (!mutex_trylock(&fs_info->reclaim_bgs_lock))
> + goto end;
I agree that this fix makes sense and a lot of reclaim should not block
deleting unused bgs.
However, it feels quite hacky to me, because by the current design, we
are explicitly calling btrfs_delete_unused_bgs as a first class cleanup
action from the cleaner thread, before calling reclaim. I think a little
more cleanup to integrate the two together would reduce the "throw
things at the wall" feel here.
I would propose either:
1. Run them in parallel and make sure they release locks appropriately
so that everyone makes good forward progress. I think it's a good
model, but kind of a departure from the single cleaner thread so
maybe risky/a pain to code.
2. Just get rid of the explicit delete unused from cleaner and
integrate it as a first class component of this reclaim loop. This
loop becomes *the* delete unused work except shutdown type cases.
FWIW, not a NAK, just my 2c.
Thanks,
Boris
> spin_lock(&fs_info->unused_bgs_lock);
> }
> spin_unlock(&fs_info->unused_bgs_lock);
> mutex_unlock(&fs_info->reclaim_bgs_lock);
> +end:
> btrfs_exclop_finish(fs_info);
> sb_end_write(fs_info->sb);
> }
> --
> 2.40.1
>
next prev parent reply other threads:[~2023-06-07 18:49 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-06-06 5:36 [PATCH 0/4] btrfs: fixes for reclaim Naohiro Aota
2023-06-06 5:36 ` [PATCH 1/4] btrfs: delete unused BGs while reclaiming BGs Naohiro Aota
2023-06-06 9:54 ` Filipe Manana
2023-06-06 11:32 ` Johannes Thumshirn
2023-06-07 18:48 ` Boris Burkov [this message]
2023-06-12 12:50 ` Naohiro Aota
2023-06-06 5:36 ` [PATCH 2/4] btrfs: move out now unused BG from the reclaim list Naohiro Aota
2023-06-06 10:06 ` Filipe Manana
2023-06-06 11:33 ` Johannes Thumshirn
2023-06-06 5:36 ` [PATCH 3/4] btrfs: bail out reclaim process if filesystem is read-only Naohiro Aota
2023-06-06 10:16 ` Filipe Manana
2023-06-06 11:34 ` Johannes Thumshirn
2023-06-06 5:36 ` [PATCH 4/4] btrfs: reinsert BGs failed to reclaim Naohiro Aota
2023-06-06 10:25 ` Filipe Manana
2023-06-06 16:23 ` David Sterba
2023-06-06 23:55 ` Naohiro Aota
2023-06-08 11:48 ` David Sterba
2023-06-08 11:50 ` [PATCH 0/4] btrfs: fixes for reclaim David Sterba
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20230607184837.GA3191631@zen \
--to=boris@bur.io \
--cc=linux-btrfs@vger.kernel.org \
--cc=naohiro.aota@wdc.com \
--cc=naota@elisp.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).