linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Qu Wenruo <quwenruo@cn.fujitsu.com>
To: <linux-btrfs@vger.kernel.org>
Subject: Re: [PATCH v2] btrfs: Enhance btrfs chunk allocation algorithm to reduce ENOSPC caused by unbalanced data/metadata allocation.
Date: Tue, 23 Dec 2014 10:33:58 +0800	[thread overview]
Message-ID: <5498D496.7070503@cn.fujitsu.com> (raw)
In-Reply-To: <1418894805-3439-1-git-send-email-quwenruo@cn.fujitsu.com>

Please ignore the patch, since __btrfs_alloc_chun() has other problems, 
mainly a mixed calculation on
on-disk space and logical space.

So I will send a patch first fixing the on-disk space and logical space 
things and implement the half-half
on-disk space allocation algorithm on the base of that patch.

Thanks,
Qu
-------- Original Message --------
Subject: [PATCH v2] btrfs: Enhance btrfs chunk allocation algorithm to 
reduce ENOSPC caused by unbalanced data/metadata allocation.
From: Qu Wenruo <quwenruo@cn.fujitsu.com>
To: <linux-btrfs@vger.kernel.org>
Date: 2014年12月18日 17:26
> When btrfs allocate a chunk, it will try to alloc up to 1G for data and
> 256M for metadata, or 10% of all the writeable space if there is enough
> space for the stripe on device.
>
> However, when we run out of space, this allocation may cause unbalanced
> chunk allocation.
> For example, there are only 1G unallocated space, and request for
> allocate DATA chunk is sent, and all the space will be allocated as data
> chunk, making later metadata chunk alloc request unable to handle, which
> will cause ENOSPC.
> This is the one of the common complains from end users about why ENOSPC
> happens but there is still available space.
>
> This patch will try not to alloc chunk which is more than half of the
> unallocated space, making the last space more balanced at a small cost
> of more fragmented chunk at the last 1G.
>
> Some easy example:
> Preallocate 17.5G on a 20G empty btrfs fs:
> [Before]
>   # btrfs fi show /mnt/test
> Label: none  uuid: da8741b1-5d47-4245-9e94-bfccea34e91e
> 	Total devices 1 FS bytes used 17.50GiB
> 	devid    1 size 20.00GiB used 20.00GiB path /dev/sdb
> All space is allocated. No space later metadata space.
>
> [After]
>   # btrfs fi show /mnt/test
> Label: none  uuid: e6935aeb-a232-4140-84f9-80aab1f23d56
> 	Total devices 1 FS bytes used 17.50GiB
> 	devid    1 size 20.00GiB used 19.77GiB path /dev/sdb
> About 230M is still available for later metadata allocation.
>
> Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
> ---
> Changelog:
> v2:
>     Fix a dead judgement which will cause last 16~32M size unavailable.
> ---
>   fs/btrfs/volumes.c | 17 +++++++++++++++++
>   1 file changed, 17 insertions(+)
>
> diff --git a/fs/btrfs/volumes.c b/fs/btrfs/volumes.c
> index 0144790..1cd0256 100644
> --- a/fs/btrfs/volumes.c
> +++ b/fs/btrfs/volumes.c
> @@ -4236,6 +4236,7 @@ static int __btrfs_alloc_chunk(struct btrfs_trans_handle *trans,
>   	int ret;
>   	u64 max_stripe_size;
>   	u64 max_chunk_size;
> +	u64 total_avail_space = 0;
>   	u64 stripe_size;
>   	u64 num_bytes;
>   	u64 raid_stripe_len = BTRFS_STRIPE_LEN;
> @@ -4348,10 +4349,26 @@ static int __btrfs_alloc_chunk(struct btrfs_trans_handle *trans,
>   		devices_info[ndevs].max_avail = max_avail;
>   		devices_info[ndevs].total_avail = total_avail;
>   		devices_info[ndevs].dev = device;
> +		total_avail_space += total_avail;
>   		++ndevs;
>   	}
>   
>   	/*
> +	 * Try not to occupy more than half of the unallocated space.
> +	 * When run short of space and alloc all the space to
> +	 * data/metadata will cause ENOSPC to be triggered more easily.
> +	 *
> +	 * And since the minimum chunk size is 16M, the half-half will cause
> +	 * 16M allocated from 20M available space and reset 4M will not be
> +	 * used ever. In that case(16~32M), allocate all directly.
> +	 */
> +	if (total_avail_space < 32 * 1024 * 1024 &&
> +	    total_avail_space > 16 * 1024 * 1024)
> +		max_chunk_size = total_avail_space;
> +	else
> +		max_chunk_size = min(total_avail_space / 2, max_chunk_size);
> +
> +	/*
>   	 * now sort the devices by hole size / available space
>   	 */
>   	sort(devices_info, ndevs, sizeof(struct btrfs_device_info),


      reply	other threads:[~2014-12-23  3:00 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-12-18  9:26 [PATCH v2] btrfs: Enhance btrfs chunk allocation algorithm to reduce ENOSPC caused by unbalanced data/metadata allocation Qu Wenruo
2014-12-23  2:33 ` Qu Wenruo [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5498D496.7070503@cn.fujitsu.com \
    --to=quwenruo@cn.fujitsu.com \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).