Linux Btrfs filesystem development
 help / color / mirror / Atom feed
From: Qu Wenruo <quwenruo.btrfs@gmx.com>
To: Wang Yugui <wangyugui@e16-tech.com>, linux-btrfs@vger.kernel.org
Subject: Re: [PATCH v3]btrfs: round down stripe size and chunk size to pow of 2
Date: Thu, 18 Aug 2022 14:25:57 +0800	[thread overview]
Message-ID: <c082f1f0-db71-4f16-43ee-08fd9fb3398f@gmx.com> (raw)
In-Reply-To: <20220818135316.E5CA.409509F4@e16-tech.com>



On 2022/8/18 13:53, Wang Yugui wrote:
> Hi,
>
>> On 2022/8/17 22:58, Wang Yugui wrote:
>>> In decide_stripe_size_regular(), when new disk is added to RAID0/RAID10/RAID56,
>>> it is better to free-then-reuse the free space if stripe size is kept or
>>> changed to 1/2. so stripe size of pow of 2 will be more friendly. Although
>>> roundup_pow_of_two() match better with orig round_up(), but
>>> rounddown_pow_of_two() is better to make sure <=ctl->max_chunk_size here.
>>
>> Why insist on round*_pow_of_two()?
>>
>> I see no reason that a power of 2 sized chunk has any benefit to btrfs.
>
> For stripe size, in some case,
> decide_stripe_size_regular()
>      if (ctl->stripe_size * data_stripes > ctl->max_chunk_size) {
>          /*
>           * Reduce stripe_size, round it up to a 16MB boundary again and
>           * then use it, unless it ends up being even bigger than the
>           * previous value we had already.
>           */
>          ctl->stripe_size = min(round_up(div_u64(ctl->max_chunk_size,
>                              data_stripes), SZ_16M),
>                         ctl->stripe_size);
>      }
>
> For example, RAID0/3 disk/max_chunk_size=1G,
> then stripe_size = about 1/3G

It's not correct already.

Firstly, @data_stripes for 3 disks RAID0 we should not have
max_chunk_size as 1G.

This is already a behavior change and should be investigated first.

>
> If another disk is added, RAID0/4 disk/max_chunk_size=1G,
> then stripe_size = 1/4G

Then your assumption on reduced stripe_size is already based on the
incorrect max chunk size behavior.

You're fixing a problem with wrong root cause.
>
> the mix of 1/3G and 1/4G stripe_size is difficult to manage alloc/free
> the space than the mix of 1/2G and 1/4G .
>
> For chunk size, it is more complex because of raid profile.
> decide_stripe_size_regular()
>      ctl->chunk_size = ctl->stripe_size * data_stripes;
> '
> for some raid proflie  such as single/RAID1,
> because stripe size is already set to a power of 2,
> if we set max_chunk_size to  a power of 2, then max_chunk_size will have
> a value same to chunk size/ stripe size in some case. it will be more
> easy to understand.
>
> Best Regards
> Wang Yugui (wangyugui@e16-tech.com)
> 2022/08/18
>
>
>>>
>>> In another rare case that file system is quite small, we calc max chunk size
>>> in pow of 2 too, so that max chunk size / chunk size /stripe size are same or
>>> match easy in some case.
>>>
>>> Signed-off-by: Wang Yugui <wangyugui@e16-tech.com>
>>> ---
>>> changes since v2:
>>> 	restore to rounddown_pow_of_two() from roundup_pow_of_two()
>>> changes since v1:
>>>    - change rounddown_pow_of_two() to roundup_pow_of_two() to match better with
>>>      orig roundup().
>>>
>>>    fs/btrfs/volumes.c | 20 +++++++++-----------
>>>    1 file changed, 9 insertions(+), 11 deletions(-)
>>>
>>> diff --git a/fs/btrfs/volumes.c b/fs/btrfs/volumes.c
>>> index 6595755..fab9765 100644
>>> --- a/fs/btrfs/volumes.c
>>> +++ b/fs/btrfs/volumes.c
>>> @@ -5083,9 +5083,9 @@ static void init_alloc_chunk_ctl_policy_regular(
>>>    	if (ctl->type & BTRFS_BLOCK_GROUP_SYSTEM)
>>>    		ctl->devs_max = min_t(int, ctl->devs_max, BTRFS_MAX_DEVS_SYS_CHUNK);
>>>
>>> -	/* We don't want a chunk larger than 10% of writable space */
>>> -	ctl->max_chunk_size = min(div_factor(fs_devices->total_rw_bytes, 1),
>>> -				  ctl->max_chunk_size);
>>> +	/* We don't want a chunk larger than 1/8 of writable space */
>>> +	ctl->max_chunk_size = min_t(u64, ctl->max_chunk_size,
>>> +			rounddown_pow_of_two(fs_devices->total_rw_bytes >> 3));
>>>    	ctl->dev_extent_min = BTRFS_STRIPE_LEN * ctl->dev_stripes;
>>>    }
>>>
>>> @@ -5143,10 +5143,9 @@ static void init_alloc_chunk_ctl_policy_zoned(
>>>    		BUG();
>>>    	}
>>>
>>> -	/* We don't want a chunk larger than 10% of writable space */
>>> -	limit = max(round_down(div_factor(fs_devices->total_rw_bytes, 1),
>>> -			       zone_size),
>>> -		    min_chunk_size);
>>> +	/* We don't want a chunk larger than 1/8 of writable space */
>>> +	limit = max_t(u64, min_chunk_size,
>>> +		rounddown_pow_of_two(fs_devices->total_rw_bytes >> 3));
>>>    	ctl->max_chunk_size = min(limit, ctl->max_chunk_size);
>>>    	ctl->dev_extent_min = zone_size * ctl->dev_stripes;
>>>    }
>>> @@ -5284,13 +5283,12 @@ static int decide_stripe_size_regular(struct alloc_chunk_ctl *ctl,
>>>    	 */
>>>    	if (ctl->stripe_size * data_stripes > ctl->max_chunk_size) {
>>>    		/*
>>> -		 * Reduce stripe_size, round it up to a 16MB boundary again and
>>> +		 * Reduce stripe_size, round it down to pow of 2 boundary again and
>>>    		 * then use it, unless it ends up being even bigger than the
>>>    		 * previous value we had already.
>>>    		 */
>>> -		ctl->stripe_size = min(round_up(div_u64(ctl->max_chunk_size,
>>> -							data_stripes), SZ_16M),
>>> -				       ctl->stripe_size);
>>> +		ctl->stripe_size = min_t(u64, ctl->stripe_size,
>>> +			rounddown_pow_of_two(div_u64(ctl->max_chunk_size, data_stripes)));
>>>    	}
>>>
>>>    	/* Align to BTRFS_STRIPE_LEN */
>
>

      reply	other threads:[~2022-08-18  6:26 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-08-17 14:58 [PATCH v3]btrfs: round down stripe size and chunk size to pow of 2 Wang Yugui
2022-08-18  5:30 ` Qu Wenruo
2022-08-18  5:53   ` Wang Yugui
2022-08-18  6:25     ` Qu Wenruo [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=c082f1f0-db71-4f16-43ee-08fd9fb3398f@gmx.com \
    --to=quwenruo.btrfs@gmx.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=wangyugui@e16-tech.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox