From: Josef Bacik <jbacik@fb.com>
To: Chandan Rajendra <chandan@linux.vnet.ibm.com>,
<linux-btrfs@vger.kernel.org>
Cc: <clm@fb.com>, <bo.li.liu@oracle.com>, <dsterba@suse.cz>,
<quwenruo@cn.fujitsu.com>, <chandan@mykolab.com>
Subject: Re: [PATCH V5 01/13] Btrfs: __btrfs_buffered_write: Reserve/release extents aligned to block size
Date: Thu, 1 Oct 2015 10:37:52 -0400 [thread overview]
Message-ID: <560D4540.9090101@fb.com> (raw)
In-Reply-To: <1443608912-31667-2-git-send-email-chandan@linux.vnet.ibm.com>
On 09/30/2015 06:28 AM, Chandan Rajendra wrote:
> Currently, the code reserves/releases extents in multiples of PAGE_CACHE_SIZE
> units. Fix this by doing reservation/releases in block size units.
>
> Signed-off-by: Chandan Rajendra <chandan@linux.vnet.ibm.com>
> ---
> fs/btrfs/file.c | 44 +++++++++++++++++++++++++++++++-------------
> 1 file changed, 31 insertions(+), 13 deletions(-)
>
> diff --git a/fs/btrfs/file.c b/fs/btrfs/file.c
> index b823fac..12ce401 100644
> --- a/fs/btrfs/file.c
> +++ b/fs/btrfs/file.c
> @@ -499,7 +499,7 @@ int btrfs_dirty_pages(struct btrfs_root *root, struct inode *inode,
> loff_t isize = i_size_read(inode);
>
> start_pos = pos & ~((u64)root->sectorsize - 1);
> - num_bytes = ALIGN(write_bytes + pos - start_pos, root->sectorsize);
> + num_bytes = round_up(write_bytes + pos - start_pos, root->sectorsize);
>
> end_of_last_block = start_pos + num_bytes - 1;
> err = btrfs_set_extent_delalloc(inode, start_pos, end_of_last_block,
> @@ -1362,16 +1362,19 @@ fail:
> static noinline int
> lock_and_cleanup_extent_if_need(struct inode *inode, struct page **pages,
> size_t num_pages, loff_t pos,
> + size_t write_bytes,
> u64 *lockstart, u64 *lockend,
> struct extent_state **cached_state)
> {
> + struct btrfs_root *root = BTRFS_I(inode)->root;
> u64 start_pos;
> u64 last_pos;
> int i;
> int ret = 0;
>
> - start_pos = pos & ~((u64)PAGE_CACHE_SIZE - 1);
> - last_pos = start_pos + ((u64)num_pages << PAGE_CACHE_SHIFT) - 1;
> + start_pos = round_down(pos, root->sectorsize);
> + last_pos = start_pos
> + + round_up(pos + write_bytes - start_pos, root->sectorsize) - 1;
>
> if (start_pos < inode->i_size) {
> struct btrfs_ordered_extent *ordered;
> @@ -1489,6 +1492,7 @@ static noinline ssize_t __btrfs_buffered_write(struct file *file,
>
> while (iov_iter_count(i) > 0) {
> size_t offset = pos & (PAGE_CACHE_SIZE - 1);
> + size_t sector_offset;
> size_t write_bytes = min(iov_iter_count(i),
> nrptrs * (size_t)PAGE_CACHE_SIZE -
> offset);
> @@ -1497,6 +1501,8 @@ static noinline ssize_t __btrfs_buffered_write(struct file *file,
> size_t reserve_bytes;
> size_t dirty_pages;
> size_t copied;
> + size_t dirty_sectors;
> + size_t num_sectors;
>
> WARN_ON(num_pages > nrptrs);
>
> @@ -1509,8 +1515,12 @@ static noinline ssize_t __btrfs_buffered_write(struct file *file,
> break;
> }
>
> - reserve_bytes = num_pages << PAGE_CACHE_SHIFT;
> + sector_offset = pos & (root->sectorsize - 1);
> + reserve_bytes = round_up(write_bytes + sector_offset,
> + root->sectorsize);
> +
> ret = btrfs_check_data_free_space(inode, reserve_bytes, write_bytes);
> +
> if (ret == -ENOSPC &&
> (BTRFS_I(inode)->flags & (BTRFS_INODE_NODATACOW |
> BTRFS_INODE_PREALLOC))) {
> @@ -1523,7 +1533,10 @@ static noinline ssize_t __btrfs_buffered_write(struct file *file,
> */
> num_pages = DIV_ROUND_UP(write_bytes + offset,
> PAGE_CACHE_SIZE);
> - reserve_bytes = num_pages << PAGE_CACHE_SHIFT;
> + reserve_bytes = round_up(write_bytes
> + + sector_offset,
> + root->sectorsize);
> +
> ret = 0;
> } else {
> ret = -ENOSPC;
> @@ -1558,8 +1571,8 @@ again:
> break;
>
> ret = lock_and_cleanup_extent_if_need(inode, pages, num_pages,
> - pos, &lockstart, &lockend,
> - &cached_state);
> + pos, write_bytes, &lockstart,
> + &lockend, &cached_state);
> if (ret < 0) {
> if (ret == -EAGAIN)
> goto again;
> @@ -1595,9 +1608,14 @@ again:
> * we still have an outstanding extent for the chunk we actually
> * managed to copy.
> */
> - if (num_pages > dirty_pages) {
> - release_bytes = (num_pages - dirty_pages) <<
> - PAGE_CACHE_SHIFT;
> + num_sectors = reserve_bytes >> inode->i_blkbits;
> + dirty_sectors = round_up(copied + sector_offset,
> + root->sectorsize);
> + dirty_sectors >>= inode->i_blkbits;
> +
> + if (num_sectors > dirty_sectors) {
> + release_bytes = (write_bytes - copied)
> + & ~((u64)root->sectorsize - 1);
> if (copied > 0) {
> spin_lock(&BTRFS_I(inode)->lock);
> BTRFS_I(inode)->outstanding_extents++;
> @@ -1611,7 +1629,8 @@ again:
> release_bytes);
> }
>
> - release_bytes = dirty_pages << PAGE_CACHE_SHIFT;
> + release_bytes = round_up(copied + sector_offset,
> + root->sectorsize);
>
> if (copied > 0)
> ret = btrfs_dirty_pages(root, inode, pages,
> @@ -1632,8 +1651,7 @@ again:
>
> if (only_release_metadata && copied > 0) {
> lockstart = round_down(pos, root->sectorsize);
> - lockend = lockstart +
> - (dirty_pages << PAGE_CACHE_SHIFT) - 1;
> + lockend = round_up(pos + copied, root->sectorsize) - 1;
>
> set_extent_bit(&BTRFS_I(inode)->io_tree, lockstart,
> lockend, EXTENT_NORESERVE, NULL,
>
Reviewed-by: Josef Bacik <jbacik@fb.com>
Thanks,
Josef
next prev parent reply other threads:[~2015-10-01 14:41 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-09-30 10:28 [PATCH V5 00/13] Btrfs: Pre subpagesize-blocksize cleanups Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 01/13] Btrfs: __btrfs_buffered_write: Reserve/release extents aligned to block size Chandan Rajendra
2015-10-01 14:37 ` Josef Bacik [this message]
2015-09-30 10:28 ` [PATCH V5 02/13] Btrfs: Compute and look up csums based on sectorsized blocks Chandan Rajendra
2015-10-01 14:39 ` Josef Bacik
2015-10-02 12:20 ` Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 03/13] Btrfs: Direct I/O read: Work " Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 04/13] Btrfs: fallocate: Work with " Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 05/13] Btrfs: btrfs_page_mkwrite: Reserve space in sectorsized units Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 06/13] Btrfs: Search for all ordered extents that could span across a page Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 07/13] Btrfs: Use (eb->start, seq) as search key for tree modification log Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 08/13] Btrfs: btrfs_submit_direct_hook: Handle map_length < bio vector length Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 09/13] Btrfs: Limit inline extents to root->sectorsize Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 10/13] Btrfs: Fix block size returned to user space Chandan Rajendra
2015-10-01 14:58 ` Josef Bacik
2015-09-30 10:28 ` [PATCH V5 11/13] Btrfs: Clean pte corresponding to page straddling i_size Chandan Rajendra
2015-10-01 14:57 ` Josef Bacik
2015-10-02 16:34 ` Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 12/13] Btrfs: prepare_pages: Retry adding a page to the page cache Chandan Rajendra
2015-10-01 14:50 ` Josef Bacik
2015-10-02 12:24 ` Chandan Rajendra
2015-09-30 10:28 ` [PATCH V5 13/13] Btrfs: Return valid delalloc range when the page does not have PG_Dirty flag set or has been invalidated Chandan Rajendra
2015-10-01 14:48 ` Josef Bacik
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=560D4540.9090101@fb.com \
--to=jbacik@fb.com \
--cc=bo.li.liu@oracle.com \
--cc=chandan@linux.vnet.ibm.com \
--cc=chandan@mykolab.com \
--cc=clm@fb.com \
--cc=dsterba@suse.cz \
--cc=linux-btrfs@vger.kernel.org \
--cc=quwenruo@cn.fujitsu.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).