Linux Btrfs filesystem development
 help / color / mirror / Atom feed
From: Omar Sandoval <osandov@osandov.com>
To: Alex Lyakas <alex.lyakas@zadara.com>
Cc: linux-btrfs@vger.kernel.org, kernel-team@fb.com
Subject: Re: [PATCH 2/2] btrfs: get rid of block group caching progress logic
Date: Thu, 20 Feb 2025 13:59:42 -0800	[thread overview]
Message-ID: <Z7elzqQqFb_YUO_J@telecaster> (raw)
In-Reply-To: <CAOcd+r1BGKYVRZK5iDdK6N0sr7CuCxvmmBjzNDXhZrv2o4cRYA@mail.gmail.com>

On Thu, Feb 20, 2025 at 09:26:47PM +0200, Alex Lyakas wrote:
> Hi Omar,
> 
> On Wed, Aug 17, 2022 at 2:13 AM Omar Sandoval <osandov@osandov.com> wrote:
> >
> > From: Omar Sandoval <osandov@fb.com>
> >
> > struct btrfs_caching_ctl::progress and struct
> > btrfs_block_group::last_byte_to_unpin were previously needed to ensure
> > that unpin_extent_range() didn't return a range to the free space cache
> > before the caching thread had a chance to cache that range. However, the
> > previous commit made it so that we always synchronously cache the block
> > group at the time that we pin the extent, so this machinery is no longer
> > necessary.
> >
> > Signed-off-by: Omar Sandoval <osandov@fb.com>
> > ---
> >  fs/btrfs/block-group.c     | 13 ------------
> >  fs/btrfs/block-group.h     |  2 --
> >  fs/btrfs/extent-tree.c     |  9 ++-------
> >  fs/btrfs/free-space-tree.c |  8 --------
> >  fs/btrfs/transaction.c     | 41 --------------------------------------
> >  fs/btrfs/zoned.c           |  1 -
> >  6 files changed, 2 insertions(+), 72 deletions(-)
> >
> > diff --git a/fs/btrfs/block-group.c b/fs/btrfs/block-group.c
> > index 1af6fc395a52..68992ad9ff2a 100644
> > --- a/fs/btrfs/block-group.c
> > +++ b/fs/btrfs/block-group.c
> > @@ -593,8 +593,6 @@ static int load_extent_tree_free(struct btrfs_caching_control *caching_ctl)
> >
> >                         if (need_resched() ||
> >                             rwsem_is_contended(&fs_info->commit_root_sem)) {
> > -                               if (wakeup)
> > -                                       caching_ctl->progress = last;
> >                                 btrfs_release_path(path);
> >                                 up_read(&fs_info->commit_root_sem);
> >                                 mutex_unlock(&caching_ctl->mutex);
> > @@ -618,9 +616,6 @@ static int load_extent_tree_free(struct btrfs_caching_control *caching_ctl)
> >                         key.objectid = last;
> >                         key.offset = 0;
> >                         key.type = BTRFS_EXTENT_ITEM_KEY;
> > -
> > -                       if (wakeup)
> > -                               caching_ctl->progress = last;
> >                         btrfs_release_path(path);
> >                         goto next;
> >                 }
> > @@ -655,7 +650,6 @@ static int load_extent_tree_free(struct btrfs_caching_control *caching_ctl)
> >
> >         total_found += add_new_free_space(block_group, last,
> >                                 block_group->start + block_group->length);
> > -       caching_ctl->progress = (u64)-1;
> >
> >  out:
> >         btrfs_free_path(path);
> > @@ -725,8 +719,6 @@ static noinline void caching_thread(struct btrfs_work *work)
> >         }
> >  #endif
> >
> > -       caching_ctl->progress = (u64)-1;
> > -
> >         up_read(&fs_info->commit_root_sem);
> >         btrfs_free_excluded_extents(block_group);
> >         mutex_unlock(&caching_ctl->mutex);
> > @@ -755,7 +747,6 @@ int btrfs_cache_block_group(struct btrfs_block_group *cache, bool wait)
> >         mutex_init(&caching_ctl->mutex);
> >         init_waitqueue_head(&caching_ctl->wait);
> >         caching_ctl->block_group = cache;
> > -       caching_ctl->progress = cache->start;
> >         refcount_set(&caching_ctl->count, 2);
> >         btrfs_init_work(&caching_ctl->work, caching_thread, NULL, NULL);
> >
> > @@ -2076,11 +2067,9 @@ static int read_one_block_group(struct btrfs_fs_info *info,
> >                 /* Should not have any excluded extents. Just in case, though. */
> >                 btrfs_free_excluded_extents(cache);
> >         } else if (cache->length == cache->used) {
> > -               cache->last_byte_to_unpin = (u64)-1;
> >                 cache->cached = BTRFS_CACHE_FINISHED;
> >                 btrfs_free_excluded_extents(cache);
> >         } else if (cache->used == 0) {
> > -               cache->last_byte_to_unpin = (u64)-1;
> >                 cache->cached = BTRFS_CACHE_FINISHED;
> >                 add_new_free_space(cache, cache->start,
> >                                    cache->start + cache->length);
> > @@ -2136,7 +2125,6 @@ static int fill_dummy_bgs(struct btrfs_fs_info *fs_info)
> >                 /* Fill dummy cache as FULL */
> >                 bg->length = em->len;
> >                 bg->flags = map->type;
> > -               bg->last_byte_to_unpin = (u64)-1;
> >                 bg->cached = BTRFS_CACHE_FINISHED;
> >                 bg->used = em->len;
> >                 bg->flags = map->type;
> > @@ -2482,7 +2470,6 @@ struct btrfs_block_group *btrfs_make_block_group(struct btrfs_trans_handle *tran
> >         set_free_space_tree_thresholds(cache);
> >         cache->used = bytes_used;
> >         cache->flags = type;
> > -       cache->last_byte_to_unpin = (u64)-1;
> >         cache->cached = BTRFS_CACHE_FINISHED;
> >         cache->global_root_id = calculate_global_root_id(fs_info, cache->start);
> >
> > diff --git a/fs/btrfs/block-group.h b/fs/btrfs/block-group.h
> > index 9dba28bb1806..59a86e82a28e 100644
> > --- a/fs/btrfs/block-group.h
> > +++ b/fs/btrfs/block-group.h
> > @@ -63,7 +63,6 @@ struct btrfs_caching_control {
> >         wait_queue_head_t wait;
> >         struct btrfs_work work;
> >         struct btrfs_block_group *block_group;
> > -       u64 progress;
> >         refcount_t count;
> >  };
> >
> > @@ -115,7 +114,6 @@ struct btrfs_block_group {
> >         /* Cache tracking stuff */
> >         int cached;
> >         struct btrfs_caching_control *caching_ctl;
> > -       u64 last_byte_to_unpin;
> >
> >         struct btrfs_space_info *space_info;
> >
> > diff --git a/fs/btrfs/extent-tree.c b/fs/btrfs/extent-tree.c
> > index 86ac953c69ac..bcd0e72cded3 100644
> > --- a/fs/btrfs/extent-tree.c
> > +++ b/fs/btrfs/extent-tree.c
> > @@ -2686,13 +2686,8 @@ static int unpin_extent_range(struct btrfs_fs_info *fs_info,
> >                 len = cache->start + cache->length - start;
> >                 len = min(len, end + 1 - start);
> >
> > -               down_read(&fs_info->commit_root_sem);
> > -               if (start < cache->last_byte_to_unpin && return_free_space) {
> > -                       u64 add_len = min(len, cache->last_byte_to_unpin - start);
> > -
> > -                       btrfs_add_free_space(cache, start, add_len);
> > -               }
> > -               up_read(&fs_info->commit_root_sem);
> > +               if (return_free_space)
> > +                       btrfs_add_free_space(cache, start, len);
> >
> >                 start += len;
> >                 total_unpinned += len;
> > diff --git a/fs/btrfs/free-space-tree.c b/fs/btrfs/free-space-tree.c
> > index 1bf89aa67216..367bcfcf68f5 100644
> > --- a/fs/btrfs/free-space-tree.c
> > +++ b/fs/btrfs/free-space-tree.c
> > @@ -1453,8 +1453,6 @@ static int load_free_space_bitmaps(struct btrfs_caching_control *caching_ctl,
> >                 ASSERT(key.type == BTRFS_FREE_SPACE_BITMAP_KEY);
> >                 ASSERT(key.objectid < end && key.objectid + key.offset <= end);
> >
> > -               caching_ctl->progress = key.objectid;
> > -
> >                 offset = key.objectid;
> >                 while (offset < key.objectid + key.offset) {
> >                         bit = free_space_test_bit(block_group, path, offset);
> > @@ -1490,8 +1488,6 @@ static int load_free_space_bitmaps(struct btrfs_caching_control *caching_ctl,
> >                 goto out;
> >         }
> >
> > -       caching_ctl->progress = (u64)-1;
> > -
> >         ret = 0;
> >  out:
> >         return ret;
> > @@ -1531,8 +1527,6 @@ static int load_free_space_extents(struct btrfs_caching_control *caching_ctl,
> >                 ASSERT(key.type == BTRFS_FREE_SPACE_EXTENT_KEY);
> >                 ASSERT(key.objectid < end && key.objectid + key.offset <= end);
> >
> > -               caching_ctl->progress = key.objectid;
> > -
> >                 total_found += add_new_free_space(block_group, key.objectid,
> >                                                   key.objectid + key.offset);
> >                 if (total_found > CACHING_CTL_WAKE_UP) {
> > @@ -1552,8 +1546,6 @@ static int load_free_space_extents(struct btrfs_caching_control *caching_ctl,
> >                 goto out;
> >         }
> >
> > -       caching_ctl->progress = (u64)-1;
> > -
> >         ret = 0;
> >  out:
> >         return ret;
> > diff --git a/fs/btrfs/transaction.c b/fs/btrfs/transaction.c
> > index 6e3b2cb6a04a..4c87bf2abc14 100644
> > --- a/fs/btrfs/transaction.c
> > +++ b/fs/btrfs/transaction.c
> > @@ -161,7 +161,6 @@ static noinline void switch_commit_roots(struct btrfs_trans_handle *trans)
> >         struct btrfs_transaction *cur_trans = trans->transaction;
> >         struct btrfs_fs_info *fs_info = trans->fs_info;
> >         struct btrfs_root *root, *tmp;
> > -       struct btrfs_caching_control *caching_ctl, *next;
> >
> >         /*
> >          * At this point no one can be using this transaction to modify any tree
> > @@ -196,46 +195,6 @@ static noinline void switch_commit_roots(struct btrfs_trans_handle *trans)
> >         }
> >         spin_unlock(&cur_trans->dropped_roots_lock);
> >
> > -       /*
> > -        * We have to update the last_byte_to_unpin under the commit_root_sem,
> > -        * at the same time we swap out the commit roots.
> > -        *
> > -        * This is because we must have a real view of the last spot the caching
> > -        * kthreads were while caching.  Consider the following views of the
> > -        * extent tree for a block group
> > -        *
> > -        * commit root
> > -        * +----+----+----+----+----+----+----+
> > -        * |\\\\|    |\\\\|\\\\|    |\\\\|\\\\|
> > -        * +----+----+----+----+----+----+----+
> > -        * 0    1    2    3    4    5    6    7
> > -        *
> > -        * new commit root
> > -        * +----+----+----+----+----+----+----+
> > -        * |    |    |    |\\\\|    |    |\\\\|
> > -        * +----+----+----+----+----+----+----+
> > -        * 0    1    2    3    4    5    6    7
> > -        *
> > -        * If the cache_ctl->progress was at 3, then we are only allowed to
> > -        * unpin [0,1) and [2,3], because the caching thread has already
> > -        * processed those extents.  We are not allowed to unpin [5,6), because
> > -        * the caching thread will re-start it's search from 3, and thus find
> > -        * the hole from [4,6) to add to the free space cache.
> > -        */
> > -       write_lock(&fs_info->block_group_cache_lock);
> > -       list_for_each_entry_safe(caching_ctl, next,
> > -                                &fs_info->caching_block_groups, list) {
> > -               struct btrfs_block_group *cache = caching_ctl->block_group;
> > -
> > -               if (btrfs_block_group_done(cache)) {
> > -                       cache->last_byte_to_unpin = (u64)-1;
> > -                       list_del_init(&caching_ctl->list);
> > -                       btrfs_put_caching_control(caching_ctl);
> Now the caching_ctl is not removed from fs_info->caching_block_groups,
> and remains there until close_ctree(). So eventually it will be
> removed, but for now just occupying memory. Is this intended?

No, I don't think this was intentional.

Thanks,
Omar

  reply	other threads:[~2025-02-20 21:59 UTC|newest]

Thread overview: 29+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-08-16 23:12 [PATCH 0/2] btrfs: fix filesystem corruption caused by space cache race Omar Sandoval
2022-08-16 23:12 ` [PATCH 1/2] btrfs: fix space cache corruption and potential double allocations Omar Sandoval
2022-08-17  6:21   ` Andrea Gelmini
2022-08-17 16:53     ` Christoph Anton Mitterer
2022-08-17 23:59       ` Omar Sandoval
2022-08-18  0:22         ` Christoph Anton Mitterer
2022-08-18  0:30           ` Omar Sandoval
2022-08-19  0:16             ` Christoph Anton Mitterer
2022-09-01 16:59               ` Christoph Anton Mitterer
2022-09-01 18:18                 ` Omar Sandoval
2022-09-01 18:52                   ` Holger Hoffstätte
2022-09-01 18:57                     ` Omar Sandoval
2022-09-02 15:43                   ` Christoph Anton Mitterer
2022-09-02 21:25                   ` Christoph Anton Mitterer
2022-08-18  6:21         ` Andrea Gelmini
2022-08-18  6:40           ` Omar Sandoval
2022-08-17  9:47   ` Filipe Manana
2022-08-17 15:32     ` Omar Sandoval
2022-08-17 15:46       ` Filipe Manana
2022-08-22 13:26         ` David Sterba
2022-08-16 23:12 ` [PATCH 2/2] btrfs: get rid of block group caching progress logic Omar Sandoval
2022-08-17  9:47   ` Filipe Manana
2025-02-20 19:26   ` Alex Lyakas
2025-02-20 21:59     ` Omar Sandoval [this message]
2025-02-23 11:31       ` Alex Lyakas
2022-08-22 13:36 ` [PATCH 0/2] btrfs: fix filesystem corruption caused by space cache race David Sterba
2022-08-23 17:27 ` David Sterba
2022-08-23 19:20   ` Christoph Anton Mitterer
2022-08-23 20:29     ` David Sterba

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=Z7elzqQqFb_YUO_J@telecaster \
    --to=osandov@osandov.com \
    --cc=alex.lyakas@zadara.com \
    --cc=kernel-team@fb.com \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox