linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Qu Wenruo <quwenruo.btrfs@gmx.com>
To: Anand Jain <anand.jain@oracle.com>, Qu Wenruo <wqu@suse.com>
Cc: linux-btrfs@vger.kernel.org
Subject: Re: [PATCH v3 3/5] btrfs-progs: separate block group tree from extent tree v2
Date: Tue, 4 Oct 2022 08:05:52 +0800	[thread overview]
Message-ID: <7ca45e26-1ab5-bc4c-80bf-3e22857f27c9@gmx.com> (raw)
In-Reply-To: <2741b7c3-4036-50d7-26ea-aa32dd8ae466@gmx.com>



On 2022/10/4 07:28, Qu Wenruo wrote:
> 
> 
> On 2022/10/3 22:48, Anand Jain wrote:
>>
>> This patch is causing regressions; now can't mkfs with extent-tree-v2.
> 
> I'm already looking at it.

It's a more complex thing, not just a simple regression.

Firstly, commit "btrfs-progs: prepare merging compat feature lists" 
tries to merge the -O and -R options, which is a good idea.

The problem is, we're still just using the initial u64 numbers for 
btrfs_parse_fs_feaetures(), which we expect to get a simple U64 bit flags.

But unfortunately this means the u64 will have conflicting bits for 
compat_ro and incompat flags.

And for block group tree case, it's 1<<2 in compat_ro, while 1<<2 in 
incompat it's mixed bg.

Thus we trigger the problem.

I'll rework the merge patch to avoid the problem.

Thanks,
Qu

> 
>>
>>
>> $ mkfs.btrfs -f -O block-group-tree  /dev/nvme0n1
>> btrfs-progs v5.19.1
>> See http://btrfs.wiki.kernel.org for more information.
>>
>> ERROR: superblock magic doesn't match
>> ERROR: illegal nodesize 16384 (not equal to 4096 for mixed block group)
>>
>>
>>
>> $ mkfs.btrfs -f -O extent-tree-v2  /dev/nvme0n1
>> btrfs-progs v5.19.1
>> See http://btrfs.wiki.kernel.org for more information.
>>
>> ERROR: superblock magic doesn't match
>> NOTE: several default settings have changed in version 5.15, please 
>> make sure
>>        this does not affect your deployments:
>>        - DUP for metadata (-m dup)
>>        - enabled no-holes (-O no-holes)
>>        - enabled free-space-tree (-R free-space-tree)
>>
>> Unable to find block group for 0
>> Unable to find block group for 0
>> Unable to find block group for 0
>> ERROR: no space to allocate metadata chunk
>> ERROR: failed to create default block groups: -28
>>
>>
>>
>>
>> On 09/08/2022 14:03, Qu Wenruo wrote:
>>> Block group tree feature is completely a standalone feature, and it has
>>> been over 5 years before the initial introduction to solve the long
>>> mount time.
>>>
>>> I don't really want to waste another 5 years waiting for a feature which
>>> may or may not work, but definitely not properly reviewed for its
>>> preparation patches.
>>>
>>> So this patch will separate the block group tree feature into a
>>> standalone compat RO feature.
>>>
>>> There is a catch, in mkfs create_block_group_tree(), current
>>> tree-checker only accepts block group item with valid chunk_objectid,
>>> but the existing code from extent-tree-v2 didn't properly initialize it.
>>>
>>> This patch will also fix above mentioned problem so kernel can mount it
>>> correctly.
>>>
>>> Now mkfs/fsck should be able to handle the fs with block group tree.
>>>
>>> Signed-off-by: Qu Wenruo <wqu@suse.com>
>>> ---
>>>   check/main.c               |  8 ++------
>>>   common/fsfeatures.c        |  8 ++++++++
>>>   common/fsfeatures.h        |  2 ++
>>>   kernel-shared/ctree.h      |  9 ++++++++-
>>>   kernel-shared/disk-io.c    |  4 ++--
>>>   kernel-shared/disk-io.h    |  2 +-
>>>   kernel-shared/print-tree.c |  5 ++---
>>>   mkfs/common.c              | 31 ++++++++++++++++++++++++-------
>>>   mkfs/main.c                |  3 ++-
>>>   9 files changed, 51 insertions(+), 21 deletions(-)
>>>
>>> diff --git a/check/main.c b/check/main.c
>>> index 4f7ab8b29309..02abbd5289f9 100644
>>> --- a/check/main.c
>>> +++ b/check/main.c
>>> @@ -6293,7 +6293,7 @@ static int check_type_with_root(u64 rootid, u8 
>>> key_type)
>>>               goto err;
>>>           break;
>>>       case BTRFS_BLOCK_GROUP_ITEM_KEY:
>>> -        if (btrfs_fs_incompat(gfs_info, EXTENT_TREE_V2)) {
>>> +        if (btrfs_fs_compat_ro(gfs_info, BLOCK_GROUP_TREE)) {
>>>               if (rootid != BTRFS_BLOCK_GROUP_TREE_OBJECTID)
>>>                   goto err;
>>>           } else if (rootid != BTRFS_EXTENT_TREE_OBJECTID) {
>>> @@ -9071,10 +9071,6 @@ again:
>>>       ret = load_super_root(&normal_trees, gfs_info->chunk_root);
>>>       if (ret < 0)
>>>           goto out;
>>> -    ret = load_super_root(&normal_trees, gfs_info->block_group_root);
>>> -    if (ret < 0)
>>> -        goto out;
>>> -
>>>       ret = parse_tree_roots(&normal_trees, &dropping_trees);
>>>       if (ret < 0)
>>>           goto out;
>>> @@ -9574,7 +9570,7 @@ again:
>>>        * If we are extent tree v2 then we can reint the block group 
>>> root as
>>>        * well.
>>>        */
>>> -    if (btrfs_fs_incompat(gfs_info, EXTENT_TREE_V2)) {
>>> +    if (btrfs_fs_compat_ro(gfs_info, BLOCK_GROUP_TREE)) {
>>>           ret = btrfs_fsck_reinit_root(trans, 
>>> gfs_info->block_group_root);
>>>           if (ret) {
>>>               fprintf(stderr, "block group initialization failed\n");
>>> diff --git a/common/fsfeatures.c b/common/fsfeatures.c
>>> index 23a92c21a2cc..90704959b13b 100644
>>> --- a/common/fsfeatures.c
>>> +++ b/common/fsfeatures.c
>>> @@ -172,6 +172,14 @@ static const struct btrfs_feature 
>>> runtime_features[] = {
>>>           VERSION_TO_STRING2(safe, 4,9),
>>>           VERSION_TO_STRING2(default, 5,15),
>>>           .desc        = "free space tree (space_cache=v2)"
>>> +    }, {
>>> +        .name        = "block-group-tree",
>>> +        .flag        = BTRFS_RUNTIME_FEATURE_BLOCK_GROUP_TREE,
>>> +        .sysfs_name = "block_group_tree",
>>> +        VERSION_TO_STRING2(compat, 6,0),
>>> +        VERSION_NULL(safe),
>>> +        VERSION_NULL(default),
>>> +        .desc        = "block group tree to reduce mount time"
>>>       },
>>>       /* Keep this one last */
>>>       {
>>> diff --git a/common/fsfeatures.h b/common/fsfeatures.h
>>> index 9e39c667b900..a8d77fd4da05 100644
>>> --- a/common/fsfeatures.h
>>> +++ b/common/fsfeatures.h
>>> @@ -45,6 +45,8 @@
>>>   #define BTRFS_RUNTIME_FEATURE_QUOTA        (1ULL << 0)
>>>   #define BTRFS_RUNTIME_FEATURE_FREE_SPACE_TREE    (1ULL << 1)
>>> +#define BTRFS_RUNTIME_FEATURE_BLOCK_GROUP_TREE    (1ULL << 2)
>>> +
>>>   void btrfs_list_all_fs_features(u64 mask_disallowed);
>>>   void btrfs_list_all_runtime_features(u64 mask_disallowed);
>>> diff --git a/kernel-shared/ctree.h b/kernel-shared/ctree.h
>>> index c12076202577..d8909b3fdf20 100644
>>> --- a/kernel-shared/ctree.h
>>> +++ b/kernel-shared/ctree.h
>>> @@ -479,6 +479,12 @@ BUILD_ASSERT(sizeof(struct btrfs_super_block) == 
>>> BTRFS_SUPER_INFO_SIZE);
>>>    */
>>>   #define BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE_VALID    (1ULL << 1)
>>> +/*
>>> + * Save all block group items into a dedicated block group tree, to 
>>> greatly
>>> + * reduce mount time for large fs.
>>> + */
>>> +#define BTRFS_FEATURE_COMPAT_RO_BLOCK_GROUP_TREE    (1ULL << 5)
>>> +
>>>   #define BTRFS_FEATURE_INCOMPAT_MIXED_BACKREF    (1ULL << 0)
>>>   #define BTRFS_FEATURE_INCOMPAT_DEFAULT_SUBVOL    (1ULL << 1)
>>>   #define BTRFS_FEATURE_INCOMPAT_MIXED_GROUPS    (1ULL << 2)
>>> @@ -508,7 +514,8 @@ BUILD_ASSERT(sizeof(struct btrfs_super_block) == 
>>> BTRFS_SUPER_INFO_SIZE);
>>>    */
>>>   #define BTRFS_FEATURE_COMPAT_RO_SUPP            \
>>>       (BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE |    \
>>> -     BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE_VALID)
>>> +     BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE_VALID| \
>>> +     BTRFS_FEATURE_COMPAT_RO_BLOCK_GROUP_TREE)
>>>   #if EXPERIMENTAL
>>>   #define BTRFS_FEATURE_INCOMPAT_SUPP            \
>>> diff --git a/kernel-shared/disk-io.c b/kernel-shared/disk-io.c
>>> index 80db5976cc3f..6eeb5ecd1d59 100644
>>> --- a/kernel-shared/disk-io.c
>>> +++ b/kernel-shared/disk-io.c
>>> @@ -1203,7 +1203,7 @@ static int load_important_roots(struct 
>>> btrfs_fs_info *fs_info,
>>>           backup = sb->super_roots + index;
>>>       }
>>> -    if (!btrfs_fs_incompat(fs_info, EXTENT_TREE_V2)) {
>>> +    if (!btrfs_fs_compat_ro(fs_info, BLOCK_GROUP_TREE)) {
>>>           free(fs_info->block_group_root);
>>>           fs_info->block_group_root = NULL;
>>>           goto tree_root;
>>> @@ -1256,7 +1256,7 @@ int btrfs_setup_all_roots(struct btrfs_fs_info 
>>> *fs_info, u64 root_tree_bytenr,
>>>       if (ret)
>>>           return ret;
>>> -    if (btrfs_fs_incompat(fs_info, EXTENT_TREE_V2)) {
>>> +    if (btrfs_fs_compat_ro(fs_info, BLOCK_GROUP_TREE)) {
>>>           ret = find_and_setup_root(root, fs_info,
>>>                   BTRFS_BLOCK_GROUP_TREE_OBJECTID,
>>>                   fs_info->block_group_root);
>>> diff --git a/kernel-shared/disk-io.h b/kernel-shared/disk-io.h
>>> index bba97fc1a814..6c8eaa2bd13d 100644
>>> --- a/kernel-shared/disk-io.h
>>> +++ b/kernel-shared/disk-io.h
>>> @@ -232,7 +232,7 @@ int btrfs_global_root_insert(struct btrfs_fs_info 
>>> *fs_info,
>>>   static inline struct btrfs_root *btrfs_block_group_root(
>>>                           struct btrfs_fs_info *fs_info)
>>>   {
>>> -    if (btrfs_fs_incompat(fs_info, EXTENT_TREE_V2))
>>> +    if (btrfs_fs_compat_ro(fs_info, BLOCK_GROUP_TREE))
>>>           return fs_info->block_group_root;
>>>       return btrfs_extent_root(fs_info, 0);
>>>   }
>>> diff --git a/kernel-shared/print-tree.c b/kernel-shared/print-tree.c
>>> index bffe30b405c7..b2ee77c2fb73 100644
>>> --- a/kernel-shared/print-tree.c
>>> +++ b/kernel-shared/print-tree.c
>>> @@ -1668,6 +1668,7 @@ struct readable_flag_entry {
>>>   static struct readable_flag_entry compat_ro_flags_array[] = {
>>>       DEF_COMPAT_RO_FLAG_ENTRY(FREE_SPACE_TREE),
>>>       DEF_COMPAT_RO_FLAG_ENTRY(FREE_SPACE_TREE_VALID),
>>> +    DEF_COMPAT_RO_FLAG_ENTRY(BLOCK_GROUP_TREE),
>>>   };
>>>   static const int compat_ro_flags_num = sizeof(compat_ro_flags_array) /
>>>                          sizeof(struct readable_flag_entry);
>>> @@ -1754,9 +1755,7 @@ static void print_readable_compat_ro_flag(u64 
>>> flag)
>>>        */
>>>       return __print_readable_flag(flag, compat_ro_flags_array,
>>>                        compat_ro_flags_num,
>>> -                     BTRFS_FEATURE_COMPAT_RO_SUPP |
>>> -                     BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE |
>>> -                     BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE_VALID);
>>> +                     BTRFS_FEATURE_COMPAT_RO_SUPP);
>>>   }
>>>   static void print_readable_incompat_flag(u64 flag)
>>> diff --git a/mkfs/common.c b/mkfs/common.c
>>> index b72338551dfb..cb616f13ef9b 100644
>>> --- a/mkfs/common.c
>>> +++ b/mkfs/common.c
>>> @@ -75,6 +75,8 @@ static int btrfs_create_tree_root(int fd, struct 
>>> btrfs_mkfs_config *cfg,
>>>       int blk;
>>>       int i;
>>>       u8 uuid[BTRFS_UUID_SIZE];
>>> +    bool block_group_tree = !!(cfg->runtime_features &
>>> +                   BTRFS_RUNTIME_FEATURE_BLOCK_GROUP_TREE);
>>>       memset(buf->data + sizeof(struct btrfs_header), 0,
>>>           cfg->nodesize - sizeof(struct btrfs_header));
>>> @@ -101,6 +103,9 @@ static int btrfs_create_tree_root(int fd, struct 
>>> btrfs_mkfs_config *cfg,
>>>           if (blk == MKFS_ROOT_TREE || blk == MKFS_CHUNK_TREE)
>>>               continue;
>>> +        if (!block_group_tree && blk == MKFS_BLOCK_GROUP_TREE)
>>> +            continue;
>>> +
>>>           btrfs_set_root_bytenr(&root_item, cfg->blocks[blk]);
>>>           btrfs_set_disk_key_objectid(&disk_key,
>>>               reference_root_table[blk]);
>>> @@ -216,7 +221,8 @@ static int create_block_group_tree(int fd, struct 
>>> btrfs_mkfs_config *cfg,
>>>       memset(buf->data + sizeof(struct btrfs_header), 0,
>>>           cfg->nodesize - sizeof(struct btrfs_header));
>>> -    write_block_group_item(buf, 0, bg_offset, bg_size, bg_used, 0,
>>> +    write_block_group_item(buf, 0, bg_offset, bg_size, bg_used,
>>> +                   BTRFS_FIRST_CHUNK_TREE_OBJECTID,
>>>                      cfg->leaf_data_size -
>>>                      sizeof(struct btrfs_block_group_item));
>>>       btrfs_set_header_bytenr(buf, cfg->blocks[MKFS_BLOCK_GROUP_TREE]);
>>> @@ -357,6 +363,7 @@ int make_btrfs(int fd, struct btrfs_mkfs_config 
>>> *cfg)
>>>       u32 array_size;
>>>       u32 item_size;
>>>       u64 total_used = 0;
>>> +    u64 ro_flags = 0;
>>>       int skinny_metadata = !!(cfg->features &
>>>                    BTRFS_FEATURE_INCOMPAT_SKINNY_METADATA);
>>>       u64 num_bytes;
>>> @@ -365,6 +372,8 @@ int make_btrfs(int fd, struct btrfs_mkfs_config 
>>> *cfg)
>>>       bool add_block_group = true;
>>>       bool free_space_tree = !!(cfg->runtime_features &
>>>                     BTRFS_RUNTIME_FEATURE_FREE_SPACE_TREE);
>>> +    bool block_group_tree = !!(cfg->runtime_features &
>>> +                   BTRFS_RUNTIME_FEATURE_BLOCK_GROUP_TREE);
>>>       bool extent_tree_v2 = !!(cfg->features &
>>>                    BTRFS_FEATURE_INCOMPAT_EXTENT_TREE_V2);
>>> @@ -372,8 +381,13 @@ int make_btrfs(int fd, struct btrfs_mkfs_config 
>>> *cfg)
>>>              sizeof(enum btrfs_mkfs_block) * 
>>> ARRAY_SIZE(default_blocks));
>>>       blocks_nr = ARRAY_SIZE(default_blocks);
>>> -    /* Extent tree v2 needs an extra block for block group tree.*/
>>> -    if (extent_tree_v2) {
>>> +    /*
>>> +     * Add one new block for block group tree.
>>> +     * And for block group tree, we don't need to add block group item
>>> +     * into extent tree, the item will be handled in block group tree
>>> +     * initialization.
>>> +     */
>>> +    if (block_group_tree) {
>>>           mkfs_blocks_add(blocks, &blocks_nr, MKFS_BLOCK_GROUP_TREE);
>>>           add_block_group = false;
>>>       }
>>> @@ -433,12 +447,15 @@ int make_btrfs(int fd, struct btrfs_mkfs_config 
>>> *cfg)
>>>           btrfs_set_super_cache_generation(&super, -1);
>>>       btrfs_set_super_incompat_flags(&super, cfg->features);
>>>       if (free_space_tree) {
>>> -        u64 ro_flags = BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE |
>>> -            BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE_VALID;
>>> +        ro_flags |= (BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE |
>>> +                 BTRFS_FEATURE_COMPAT_RO_FREE_SPACE_TREE_VALID);
>>> -        btrfs_set_super_compat_ro_flags(&super, ro_flags);
>>>           btrfs_set_super_cache_generation(&super, 0);
>>>       }
>>> +    if (block_group_tree)
>>> +        ro_flags |= BTRFS_FEATURE_COMPAT_RO_BLOCK_GROUP_TREE;
>>> +    btrfs_set_super_compat_ro_flags(&super, ro_flags);
>>> +
>>>       if (extent_tree_v2)
>>>           btrfs_set_super_nr_global_roots(&super, 1);
>>> @@ -695,7 +712,7 @@ int make_btrfs(int fd, struct btrfs_mkfs_config 
>>> *cfg)
>>>               goto out;
>>>       }
>>> -    if (extent_tree_v2) {
>>> +    if (block_group_tree) {
>>>           ret = create_block_group_tree(fd, cfg, buf,
>>>                             system_group_offset,
>>>                             system_group_size, total_used);
>>> diff --git a/mkfs/main.c b/mkfs/main.c
>>> index ce096d362171..518ce0fd7523 100644
>>> --- a/mkfs/main.c
>>> +++ b/mkfs/main.c
>>> @@ -299,7 +299,8 @@ static int recow_roots(struct btrfs_trans_handle 
>>> *trans,
>>>       ret = __recow_root(trans, info->dev_root);
>>>       if (ret)
>>>           return ret;
>>> -        if (btrfs_fs_incompat(info, EXTENT_TREE_V2)) {
>>> +
>>> +    if (btrfs_fs_compat_ro(info, BLOCK_GROUP_TREE)) {
>>>           ret = __recow_root(trans, info->block_group_root);
>>>           if (ret)
>>>               return ret;
>>

  reply	other threads:[~2022-10-04  0:06 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-08-09  6:03 [PATCH v3 0/5] btrfs-progs: separate BLOCK_GROUP_TREE feature from extent-tree-v2 Qu Wenruo
2022-08-09  6:03 ` [PATCH v3 1/5] btrfs-progs: mkfs: dynamically modify mkfs blocks array Qu Wenruo
2022-08-09  6:03 ` [PATCH v3 2/5] btrfs-progs: don't save block group root into super block Qu Wenruo
2022-08-09  6:03 ` [PATCH v3 3/5] btrfs-progs: separate block group tree from extent tree v2 Qu Wenruo
2022-08-31 19:14   ` David Sterba
2022-08-31 21:43     ` Qu Wenruo
2022-09-01 12:15       ` Qu Wenruo
2022-09-02  9:21         ` David Sterba
2022-09-02  9:37           ` Qu Wenruo
2022-09-02 12:10             ` David Sterba
2022-10-03 14:48   ` Anand Jain
2022-10-03 23:28     ` Qu Wenruo
2022-10-04  0:05       ` Qu Wenruo [this message]
2022-08-09  6:03 ` [PATCH v3 4/5] btrfs-progs: btrfstune: add the ability to convert to block group tree feature Qu Wenruo
2022-08-09  6:03 ` [PATCH v3 5/5] btrfs-progs: mkfs: add artificial dependency for block group tree Qu Wenruo
2022-08-31 18:26 ` [PATCH v3 0/5] btrfs-progs: separate BLOCK_GROUP_TREE feature from extent-tree-v2 David Sterba

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=7ca45e26-1ab5-bc4c-80bf-3e22857f27c9@gmx.com \
    --to=quwenruo.btrfs@gmx.com \
    --cc=anand.jain@oracle.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=wqu@suse.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).