From: Nikolay Borisov <nborisov@suse.com>
To: Josef Bacik <josef@toxicpanda.com>,
linux-btrfs@vger.kernel.org, kernel-team@fb.com
Subject: Re: [PATCH 8/8] btrfs: reserve extra space during evict()
Date: Fri, 14 Dec 2018 10:20:57 +0200
Message-ID: <63d819a6-93c8-fcbe-9e12-a09097b53d92@suse.com>
In-Reply-To: <20181203152459.21630-9-josef@toxicpanda.com>
On 3.12.18 at 17:24, Josef Bacik wrote:
> We could generate a lot of delayed refs in evict but never have any left
> over space from our block rsv to make up for that fact. So reserve some
> extra space and give it to the transaction so it can be used to refill
> the delayed refs rsv every loop through the truncate path.
>
> Signed-off-by: Josef Bacik <josef@toxicpanda.com>
> ---
> fs/btrfs/inode.c | 25 +++++++++++++++++++++++--
> 1 file changed, 23 insertions(+), 2 deletions(-)
>
> diff --git a/fs/btrfs/inode.c b/fs/btrfs/inode.c
> index 623a71d871d4..8ac7abe2ae9b 100644
> --- a/fs/btrfs/inode.c
> +++ b/fs/btrfs/inode.c
> @@ -5258,13 +5258,15 @@ static struct btrfs_trans_handle *evict_refill_and_join(struct btrfs_root *root,
> {
> struct btrfs_fs_info *fs_info = root->fs_info;
> struct btrfs_block_rsv *global_rsv = &fs_info->global_block_rsv;
> + u64 delayed_refs_extra = btrfs_calc_trans_metadata_size(fs_info, 1);
> int failures = 0;
>
> for (;;) {
> struct btrfs_trans_handle *trans;
> int ret;
>
> - ret = btrfs_block_rsv_refill(root, rsv, rsv->size,
> + ret = btrfs_block_rsv_refill(root, rsv,
> + rsv->size + delayed_refs_extra,
> BTRFS_RESERVE_FLUSH_LIMIT);
Rather than playing these tricks, why not just modify the call in
btrfs_evict_inode from:
rsv->size = btrfs_calc_trunc_metadata_size(fs_info, 1);
to:
rsv->size = btrfs_calc_trunc_metadata_size(fs_info, 2);
and add a comment explaining what the 2 means, of course?
>
> if (ret && ++failures > 2) {
> @@ -5273,9 +5275,28 @@ static struct btrfs_trans_handle *evict_refill_and_join(struct btrfs_root *root,
> return ERR_PTR(-ENOSPC);
> }
>
> + /*
> + * Evict can generate a large amount of delayed refs without
> + * having a way to add space back since we exhaust our temporary
> + * block rsv. We aren't allowed to do FLUSH_ALL in this case
> + * because we could deadlock with so many things in the flushing
> + * code, so we have to try and hold some extra space to
> + * compensate for our delayed ref generation. If we can't get
> + * that space then we need see if we can steal our minimum from
> + * the global reserve. We will be ratelimited by the amount of
> + * space we have for the delayed refs rsv, so we'll end up
> + * committing and trying again.
> + */
> trans = btrfs_join_transaction(root);
> - if (IS_ERR(trans) || !ret)
> + if (IS_ERR(trans) || !ret) {
> + if (!IS_ERR(trans)) {
> + trans->block_rsv = &fs_info->trans_block_rsv;
This line is redundant, since evict_refill_and_join is called before
trans->block_rsv is modified.
> + trans->bytes_reserved = delayed_refs_extra;
Is this even correct? Since we join a transaction, it might already
have some bytes reserved. So in any case, shouldn't this line be:
trans->bytes_reserved += delayed_refs_extra ?
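To make the concern concrete, here is a minimal userspace sketch of the difference between the two forms. It is a toy model only: struct toy_trans_handle, toy_set_reserved() and toy_add_reserved() are hypothetical names invented for illustration and are not the kernel's struct btrfs_trans_handle or any btrfs helper. Whether a joined handle can really carry a prior reservation at this point is exactly the open question; the sketch only shows what each form would do if it did.

```c
#include <stdint.h>

/* Toy stand-in for the handle's accounting field; NOT the kernel struct. */
struct toy_trans_handle {
	uint64_t bytes_reserved;
};

/* Plain assignment, as in the patch: any earlier reservation is overwritten. */
static void toy_set_reserved(struct toy_trans_handle *trans, uint64_t extra)
{
	trans->bytes_reserved = extra;
}

/* Accumulating form, as suggested above: the earlier reservation is kept. */
static void toy_add_reserved(struct toy_trans_handle *trans, uint64_t extra)
{
	trans->bytes_reserved += extra;
}
```

If the handle starts with zero bytes reserved the two forms are equivalent; they only diverge when something was already accounted on it.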
> + btrfs_block_rsv_migrate(rsv, trans->block_rsv,
> + delayed_refs_extra, 1);
> + }
> return trans;
> + }
>
> /*
> * Try to steal from the global reserve if there is space for
>