From: Josef Bacik <jbacik@fb.com>
To: Chandan Rajendra <chandan@linux.vnet.ibm.com>,
<linux-btrfs@vger.kernel.org>
Cc: <clm@fb.com>, <bo.li.liu@oracle.com>, <dsterba@suse.cz>,
<quwenruo@cn.fujitsu.com>, <chandan@mykolab.com>
Subject: Re: [PATCH V5 01/13] Btrfs: __btrfs_buffered_write: Reserve/release extents aligned to block size
Date: Thu, 1 Oct 2015 10:37:52 -0400 [thread overview]
Message-ID: <560D4540.9090101@fb.com> (raw)
In-Reply-To: <1443608912-31667-2-git-send-email-chandan@linux.vnet.ibm.com>
On 09/30/2015 06:28 AM, Chandan Rajendra wrote:
> Currently, the code reserves/releases extents in multiples of PAGE_CACHE_SIZE
> units. Fix this by doing reservation/releases in block size units.
>
> Signed-off-by: Chandan Rajendra <chandan@linux.vnet.ibm.com>
> ---
> fs/btrfs/file.c | 44 +++++++++++++++++++++++++++++++-------------
> 1 file changed, 31 insertions(+), 13 deletions(-)
>
> diff --git a/fs/btrfs/file.c b/fs/btrfs/file.c
> index b823fac..12ce401 100644
> --- a/fs/btrfs/file.c
> +++ b/fs/btrfs/file.c
> @@ -499,7 +499,7 @@ int btrfs_dirty_pages(struct btrfs_root *root, struct inode *inode,
> loff_t isize = i_size_read(inode);
>
> start_pos = pos & ~((u64)root->sectorsize - 1);
> - num_bytes = ALIGN(write_bytes + pos - start_pos, root->sectorsize);
> + num_bytes = round_up(write_bytes + pos - start_pos, root->sectorsize);
>
> end_of_last_block = start_pos + num_bytes - 1;
> err = btrfs_set_extent_delalloc(inode, start_pos, end_of_last_block,
> @@ -1362,16 +1362,19 @@ fail:
> static noinline int
> lock_and_cleanup_extent_if_need(struct inode *inode, struct page **pages,
> size_t num_pages, loff_t pos,
> + size_t write_bytes,
> u64 *lockstart, u64 *lockend,
> struct extent_state **cached_state)
> {
> + struct btrfs_root *root = BTRFS_I(inode)->root;
> u64 start_pos;
> u64 last_pos;
> int i;
> int ret = 0;
>
> - start_pos = pos & ~((u64)PAGE_CACHE_SIZE - 1);
> - last_pos = start_pos + ((u64)num_pages << PAGE_CACHE_SHIFT) - 1;
> + start_pos = round_down(pos, root->sectorsize);
> + last_pos = start_pos
> + + round_up(pos + write_bytes - start_pos, root->sectorsize) - 1;
>
> if (start_pos < inode->i_size) {
> struct btrfs_ordered_extent *ordered;
> @@ -1489,6 +1492,7 @@ static noinline ssize_t __btrfs_buffered_write(struct file *file,
>
> while (iov_iter_count(i) > 0) {
> size_t offset = pos & (PAGE_CACHE_SIZE - 1);
> + size_t sector_offset;
> size_t write_bytes = min(iov_iter_count(i),
> nrptrs * (size_t)PAGE_CACHE_SIZE -
> offset);
> @@ -1497,6 +1501,8 @@ static noinline ssize_t __btrfs_buffered_write(struct file *file,
> size_t reserve_bytes;
> size_t dirty_pages;
> size_t copied;
> + size_t dirty_sectors;
> + size_t num_sectors;
>
> WARN_ON(num_pages > nrptrs);
>
> @@ -1509,8 +1515,12 @@ static noinline ssize_t __btrfs_buffered_write(struct file *file,
> break;
> }
>
> - reserve_bytes = num_pages << PAGE_CACHE_SHIFT;
> + sector_offset = pos & (root->sectorsize - 1);
> + reserve_bytes = round_up(write_bytes + sector_offset,
> + root->sectorsize);
> +
> ret = btrfs_check_data_free_space(inode, reserve_bytes, write_bytes);
> +
> if (ret == -ENOSPC &&
> (BTRFS_I(inode)->flags & (BTRFS_INODE_NODATACOW |
> BTRFS_INODE_PREALLOC))) {
> @@ -1523,7 +1533,10 @@ static noinline ssize_t __btrfs_buffered_write(struct file *file,
> */
> num_pages = DIV_ROUND_UP(write_bytes + offset,
> PAGE_CACHE_SIZE);
> - reserve_bytes = num_pages << PAGE_CACHE_SHIFT;
> + reserve_bytes = round_up(write_bytes
> + + sector_offset,
> + root->sectorsize);
> +
> ret = 0;
> } else {
> ret = -ENOSPC;
> @@ -1558,8 +1571,8 @@ again:
> break;
>
> ret = lock_and_cleanup_extent_if_need(inode, pages, num_pages,
> - pos, &lockstart, &lockend,
> - &cached_state);
> + pos, write_bytes, &lockstart,
> + &lockend, &cached_state);
> if (ret < 0) {
> if (ret == -EAGAIN)
> goto again;
> @@ -1595,9 +1608,14 @@ again:
> * we still have an outstanding extent for the chunk we actually
> * managed to copy.
> */
> - if (num_pages > dirty_pages) {
> - release_bytes = (num_pages - dirty_pages) <<
> - PAGE_CACHE_SHIFT;
> + num_sectors = reserve_bytes >> inode->i_blkbits;
> + dirty_sectors = round_up(copied + sector_offset,
> + root->sectorsize);
> + dirty_sectors >>= inode->i_blkbits;
> +
> + if (num_sectors > dirty_sectors) {
> + release_bytes = (write_bytes - copied)
> + & ~((u64)root->sectorsize - 1);
> if (copied > 0) {
> spin_lock(&BTRFS_I(inode)->lock);
> BTRFS_I(inode)->outstanding_extents++;
> @@ -1611,7 +1629,8 @@ again:
> release_bytes);
> }
>
> - release_bytes = dirty_pages << PAGE_CACHE_SHIFT;
> + release_bytes = round_up(copied + sector_offset,
> + root->sectorsize);
>
> if (copied > 0)
> ret = btrfs_dirty_pages(root, inode, pages,
> @@ -1632,8 +1651,7 @@ again:
>
> if (only_release_metadata && copied > 0) {
> lockstart = round_down(pos, root->sectorsize);
> - lockend = lockstart +
> - (dirty_pages << PAGE_CACHE_SHIFT) - 1;
> + lockend = round_up(pos + copied, root->sectorsize) - 1;
>
> set_extent_bit(&BTRFS_I(inode)->io_tree, lockstart,
> lockend, EXTENT_NORESERVE, NULL,
>
Reviewed-by: Josef Bacik <jbacik@fb.com>
Thanks,
Josef
next prev parent reply other threads:[~2015-10-01 14:41 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-09-30 10:28 [PATCH V5 00/13] Btrfs: Pre subpagesize-blocksize cleanups Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 01/13] Btrfs: __btrfs_buffered_write: Reserve/release extents aligned to block size Chandan Rajendra
2015-10-01 14:37 ` Josef Bacik [this message]
2015-09-30 10:28 ` [PATCH V5 02/13] Btrfs: Compute and look up csums based on sectorsized blocks Chandan Rajendra
2015-10-01 14:39 ` Josef Bacik
2015-10-02 12:20 ` Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 03/13] Btrfs: Direct I/O read: Work " Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 04/13] Btrfs: fallocate: Work with " Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 05/13] Btrfs: btrfs_page_mkwrite: Reserve space in sectorsized units Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 06/13] Btrfs: Search for all ordered extents that could span across a page Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 07/13] Btrfs: Use (eb->start, seq) as search key for tree modification log Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 08/13] Btrfs: btrfs_submit_direct_hook: Handle map_length < bio vector length Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 09/13] Btrfs: Limit inline extents to root->sectorsize Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 10/13] Btrfs: Fix block size returned to user space Chandan Rajendra
2015-10-01 14:58 ` Josef Bacik
2015-09-30 10:28 ` [PATCH V5 11/13] Btrfs: Clean pte corresponding to page straddling i_size Chandan Rajendra
2015-10-01 14:57 ` Josef Bacik
2015-10-02 16:34 ` Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 12/13] Btrfs: prepare_pages: Retry adding a page to the page cache Chandan Rajendra
2015-10-01 14:50 ` Josef Bacik
2015-10-02 12:24 ` Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 13/13] Btrfs: Return valid delalloc range when the page does not have PG_Dirty flag set or has been invalidated Chandan Rajendra
2015-10-01 14:48 ` Josef Bacik
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=560D4540.9090101@fb.com \
--to=jbacik@fb.com \
--cc=bo.li.liu@oracle.com \
--cc=chandan@linux.vnet.ibm.com \
--cc=chandan@mykolab.com \
--cc=clm@fb.com \
--cc=dsterba@suse.cz \
--cc=linux-btrfs@vger.kernel.org \
--cc=quwenruo@cn.fujitsu.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.