linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] btrfs: Remove BTRFS_INODE_IN_DELALLOC_LIST flag
@ 2020-05-27 10:11 Nikolay Borisov
  2020-05-27 14:03 ` David Sterba
  2020-05-27 19:48 ` Filipe Manana
  0 siblings, 2 replies; 5+ messages in thread
From: Nikolay Borisov @ 2020-05-27 10:11 UTC (permalink / raw)
  To: linux-btrfs; +Cc: Nikolay Borisov

The flag simply replicates whether btrfs_inode::delallocs_inodes list
is empty or not. Just defer this check to the list management functions
(btrfs_add_delalloc_inodes/__btrfs_del_delalloc_inode) which are
always called under btrfs_root::delalloc_lock.

Signed-off-by: Nikolay Borisov <nborisov@suse.com>
---
 fs/btrfs/btrfs_inode.h |  1 -
 fs/btrfs/inode.c       | 11 ++---------
 2 files changed, 2 insertions(+), 10 deletions(-)

diff --git a/fs/btrfs/btrfs_inode.h b/fs/btrfs/btrfs_inode.h
index aeff56a0e105..da6743c70412 100644
--- a/fs/btrfs/btrfs_inode.h
+++ b/fs/btrfs/btrfs_inode.h
@@ -27,7 +27,6 @@ enum {
 	BTRFS_INODE_HAS_ASYNC_EXTENT,
 	BTRFS_INODE_NEEDS_FULL_SYNC,
 	BTRFS_INODE_COPY_EVERYTHING,
-	BTRFS_INODE_IN_DELALLOC_LIST,
 	BTRFS_INODE_HAS_PROPS,
 	BTRFS_INODE_SNAPSHOT_FLUSH,
 };
diff --git a/fs/btrfs/inode.c b/fs/btrfs/inode.c
index 7d2f6e55a234..3e87a6644e09 100644
--- a/fs/btrfs/inode.c
+++ b/fs/btrfs/inode.c
@@ -1865,8 +1865,6 @@ static void btrfs_add_delalloc_inodes(struct btrfs_root *root,
 	if (list_empty(&BTRFS_I(inode)->delalloc_inodes)) {
 		list_add_tail(&BTRFS_I(inode)->delalloc_inodes,
 			      &root->delalloc_inodes);
-		set_bit(BTRFS_INODE_IN_DELALLOC_LIST,
-			&BTRFS_I(inode)->runtime_flags);
 		root->nr_delalloc_inodes++;
 		if (root->nr_delalloc_inodes == 1) {
 			spin_lock(&fs_info->delalloc_root_lock);
@@ -1887,8 +1885,6 @@ void __btrfs_del_delalloc_inode(struct btrfs_root *root,

 	if (!list_empty(&inode->delalloc_inodes)) {
 		list_del_init(&inode->delalloc_inodes);
-		clear_bit(BTRFS_INODE_IN_DELALLOC_LIST,
-			  &inode->runtime_flags);
 		root->nr_delalloc_inodes--;
 		if (!root->nr_delalloc_inodes) {
 			ASSERT(list_empty(&root->delalloc_inodes));
@@ -1944,8 +1940,7 @@ void btrfs_set_delalloc_extent(struct inode *inode, struct extent_state *state,
 		BTRFS_I(inode)->delalloc_bytes += len;
 		if (*bits & EXTENT_DEFRAG)
 			BTRFS_I(inode)->defrag_bytes += len;
-		if (do_list && !test_bit(BTRFS_INODE_IN_DELALLOC_LIST,
-					 &BTRFS_I(inode)->runtime_flags))
+		if (do_list)
 			btrfs_add_delalloc_inodes(root, inode);
 		spin_unlock(&BTRFS_I(inode)->lock);
 	}
@@ -2014,9 +2009,7 @@ void btrfs_clear_delalloc_extent(struct inode *vfs_inode,
 					 fs_info->delalloc_batch);
 		spin_lock(&inode->lock);
 		inode->delalloc_bytes -= len;
-		if (do_list && inode->delalloc_bytes == 0 &&
-		    test_bit(BTRFS_INODE_IN_DELALLOC_LIST,
-					&inode->runtime_flags))
+		if (do_list && inode->delalloc_bytes == 0)
 			btrfs_del_delalloc_inode(root, inode);
 		spin_unlock(&inode->lock);
 	}
--
2.17.1


^ permalink raw reply related	[flat|nested] 5+ messages in thread

* Re: [PATCH] btrfs: Remove BTRFS_INODE_IN_DELALLOC_LIST flag
  2020-05-27 10:11 [PATCH] btrfs: Remove BTRFS_INODE_IN_DELALLOC_LIST flag Nikolay Borisov
@ 2020-05-27 14:03 ` David Sterba
  2020-05-27 19:48 ` Filipe Manana
  1 sibling, 0 replies; 5+ messages in thread
From: David Sterba @ 2020-05-27 14:03 UTC (permalink / raw)
  To: Nikolay Borisov; +Cc: linux-btrfs

On Wed, May 27, 2020 at 01:11:04PM +0300, Nikolay Borisov wrote:
> The flag simply replicates whether btrfs_inode::delallocs_inodes list
> is empty or not. Just defer this check to the list management functions
> (btrfs_add_delalloc_inodes/__btrfs_del_delalloc_inode) which are
> always called under btrfs_root::delalloc_lock.
> 
> Signed-off-by: Nikolay Borisov <nborisov@suse.com>

Nice. Added to misc-next, thanks.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] btrfs: Remove BTRFS_INODE_IN_DELALLOC_LIST flag
  2020-05-27 10:11 [PATCH] btrfs: Remove BTRFS_INODE_IN_DELALLOC_LIST flag Nikolay Borisov
  2020-05-27 14:03 ` David Sterba
@ 2020-05-27 19:48 ` Filipe Manana
  2020-05-27 21:24   ` David Sterba
  2020-05-28  7:09   ` Nikolay Borisov
  1 sibling, 2 replies; 5+ messages in thread
From: Filipe Manana @ 2020-05-27 19:48 UTC (permalink / raw)
  To: Nikolay Borisov; +Cc: linux-btrfs

On Wed, May 27, 2020 at 12:42 PM Nikolay Borisov <nborisov@suse.com> wrote:
>
> The flag simply replicates whether btrfs_inode::delallocs_inodes list
> is empty or not. Just defer this check to the list management functions
> (btrfs_add_delalloc_inodes/__btrfs_del_delalloc_inode) which are
> always called under btrfs_root::delalloc_lock.

The flag is there to avoid taking the root's delalloc_lock spinlock
everytime a range is marked for delalloc for any inode of the
subvolume.
Have you measured performance with very high concurrency of buffered
writes against files in the same subvolume?

Thanks.

>
> Signed-off-by: Nikolay Borisov <nborisov@suse.com>
> ---
>  fs/btrfs/btrfs_inode.h |  1 -
>  fs/btrfs/inode.c       | 11 ++---------
>  2 files changed, 2 insertions(+), 10 deletions(-)
>
> diff --git a/fs/btrfs/btrfs_inode.h b/fs/btrfs/btrfs_inode.h
> index aeff56a0e105..da6743c70412 100644
> --- a/fs/btrfs/btrfs_inode.h
> +++ b/fs/btrfs/btrfs_inode.h
> @@ -27,7 +27,6 @@ enum {
>         BTRFS_INODE_HAS_ASYNC_EXTENT,
>         BTRFS_INODE_NEEDS_FULL_SYNC,
>         BTRFS_INODE_COPY_EVERYTHING,
> -       BTRFS_INODE_IN_DELALLOC_LIST,
>         BTRFS_INODE_HAS_PROPS,
>         BTRFS_INODE_SNAPSHOT_FLUSH,
>  };
> diff --git a/fs/btrfs/inode.c b/fs/btrfs/inode.c
> index 7d2f6e55a234..3e87a6644e09 100644
> --- a/fs/btrfs/inode.c
> +++ b/fs/btrfs/inode.c
> @@ -1865,8 +1865,6 @@ static void btrfs_add_delalloc_inodes(struct btrfs_root *root,
>         if (list_empty(&BTRFS_I(inode)->delalloc_inodes)) {
>                 list_add_tail(&BTRFS_I(inode)->delalloc_inodes,
>                               &root->delalloc_inodes);
> -               set_bit(BTRFS_INODE_IN_DELALLOC_LIST,
> -                       &BTRFS_I(inode)->runtime_flags);
>                 root->nr_delalloc_inodes++;
>                 if (root->nr_delalloc_inodes == 1) {
>                         spin_lock(&fs_info->delalloc_root_lock);
> @@ -1887,8 +1885,6 @@ void __btrfs_del_delalloc_inode(struct btrfs_root *root,
>
>         if (!list_empty(&inode->delalloc_inodes)) {
>                 list_del_init(&inode->delalloc_inodes);
> -               clear_bit(BTRFS_INODE_IN_DELALLOC_LIST,
> -                         &inode->runtime_flags);
>                 root->nr_delalloc_inodes--;
>                 if (!root->nr_delalloc_inodes) {
>                         ASSERT(list_empty(&root->delalloc_inodes));
> @@ -1944,8 +1940,7 @@ void btrfs_set_delalloc_extent(struct inode *inode, struct extent_state *state,
>                 BTRFS_I(inode)->delalloc_bytes += len;
>                 if (*bits & EXTENT_DEFRAG)
>                         BTRFS_I(inode)->defrag_bytes += len;
> -               if (do_list && !test_bit(BTRFS_INODE_IN_DELALLOC_LIST,
> -                                        &BTRFS_I(inode)->runtime_flags))
> +               if (do_list)
>                         btrfs_add_delalloc_inodes(root, inode);
>                 spin_unlock(&BTRFS_I(inode)->lock);
>         }
> @@ -2014,9 +2009,7 @@ void btrfs_clear_delalloc_extent(struct inode *vfs_inode,
>                                          fs_info->delalloc_batch);
>                 spin_lock(&inode->lock);
>                 inode->delalloc_bytes -= len;
> -               if (do_list && inode->delalloc_bytes == 0 &&
> -                   test_bit(BTRFS_INODE_IN_DELALLOC_LIST,
> -                                       &inode->runtime_flags))
> +               if (do_list && inode->delalloc_bytes == 0)
>                         btrfs_del_delalloc_inode(root, inode);
>                 spin_unlock(&inode->lock);
>         }
> --
> 2.17.1
>


-- 
Filipe David Manana,

“Whether you think you can, or you think you can't — you're right.”

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] btrfs: Remove BTRFS_INODE_IN_DELALLOC_LIST flag
  2020-05-27 19:48 ` Filipe Manana
@ 2020-05-27 21:24   ` David Sterba
  2020-05-28  7:09   ` Nikolay Borisov
  1 sibling, 0 replies; 5+ messages in thread
From: David Sterba @ 2020-05-27 21:24 UTC (permalink / raw)
  To: Filipe Manana; +Cc: Nikolay Borisov, linux-btrfs

On Wed, May 27, 2020 at 08:48:31PM +0100, Filipe Manana wrote:
> On Wed, May 27, 2020 at 12:42 PM Nikolay Borisov <nborisov@suse.com> wrote:
> >
> > The flag simply replicates whether btrfs_inode::delallocs_inodes list
> > is empty or not. Just defer this check to the list management functions
> > (btrfs_add_delalloc_inodes/__btrfs_del_delalloc_inode) which are
> > always called under btrfs_root::delalloc_lock.
> 
> The flag is there to avoid taking the root's delalloc_lock spinlock
> everytime a range is marked for delalloc for any inode of the
> subvolume.

I overlooked that not all uses of the bit are under delalloc_lock, which
would make it redundant, but both test_bit are inside inode lock, not
the delalloc lock.

> Have you measured performance with very high concurrency of buffered
> writes against files in the same subvolume?

I'll remove the patch for now unless the performance is verified to be
ok.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] btrfs: Remove BTRFS_INODE_IN_DELALLOC_LIST flag
  2020-05-27 19:48 ` Filipe Manana
  2020-05-27 21:24   ` David Sterba
@ 2020-05-28  7:09   ` Nikolay Borisov
  1 sibling, 0 replies; 5+ messages in thread
From: Nikolay Borisov @ 2020-05-28  7:09 UTC (permalink / raw)
  To: fdmanana; +Cc: linux-btrfs



On 27.05.20 г. 22:48 ч., Filipe Manana wrote:
> On Wed, May 27, 2020 at 12:42 PM Nikolay Borisov <nborisov@suse.com> wrote:
>>
>> The flag simply replicates whether btrfs_inode::delallocs_inodes list
>> is empty or not. Just defer this check to the list management functions
>> (btrfs_add_delalloc_inodes/__btrfs_del_delalloc_inode) which are
>> always called under btrfs_root::delalloc_lock.
> 
> The flag is there to avoid taking the root's delalloc_lock spinlock
> everytime a range is marked for delalloc for any inode of the
> subvolume.
> Have you measured performance with very high concurrency of buffered
> writes against files in the same subvolume?
> 
> Thanks.


I performed the following test on a 16-core VM 
(physical cores are 12 on the host): 

fio --direct=0 --ioengine=sync --thread --directory=/media/scratch/ --invalidate=1 --group_reporting=1 \
--fallocate=posix --name=RandomWrites-async-64512-4k-4 --new_group --rw=randwrite --size=50m --numjobs=200 \
--bs=4k --fsync_on_close=0 --fallocate=none --end_fsync=0 --filename_format=FioWorkloads.\$jobnum

And here's what /proc/lock_stat report: 

With BTRFS_INODE_IN_DELALLOC_LIST: 

class name                     con-bounces    contentions   waittime-min   waittime-max waittime-total   waittime-avg    acq-bounces   acquisitions   holdtime-min   holdtime-max holdtime-total   holdtime-avg
 &root->delalloc_lock:           245              245             0.08          4.14          88.10          0.36            62168         122055         0.05          60.41        32721.41           0.27

Fio output:
 WRITE: bw=43.9MiB/s (45.0MB/s), 43.9MiB/s-43.9MiB/s (45.0MB/s-45.0MB/s), io=9.77GiB (10.5GB), run=228044-228044msec


Without BTRFS_INODE_IN_DELALLOC_LIST:
class name                           con-bounces    contentions   waittime-min   waittime-max waittime-total   waittime-avg    acq-bounces   acquisitions   holdtime-min   holdtime-max holdtime-total   holdtime-avg
 &root->delalloc_lock:                   8824          8838           0.05           210.92        3451.03           0.39        2542011        2685019           0.03         301.63      451369.98           0.17

  WRITE: bw=33.8MiB/s (35.5MB/s), 33.8MiB/s-33.8MiB/s (35.5MB/s-35.5MB/s), io=9.77GiB (10.5GB), run=295770-295770msec


So yeah, it does have noticeable effect, and massively reduces lock contentions on the delalloc_lock but 
it increases the critical section, due to the added avg times. But the improvement in performance 
in terms of throughput and reduced acquires/contentions is indisputable. So yeah, this patch 
should be dropped. 

Thanks for spotting this. 

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2020-05-28  7:09 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2020-05-27 10:11 [PATCH] btrfs: Remove BTRFS_INODE_IN_DELALLOC_LIST flag Nikolay Borisov
2020-05-27 14:03 ` David Sterba
2020-05-27 19:48 ` Filipe Manana
2020-05-27 21:24   ` David Sterba
2020-05-28  7:09   ` Nikolay Borisov

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).