public inbox for linux-btrfs@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH] btrfs: don't bother stripe length if the profile is not stripe based
@ 2021-11-19  6:19 Qu Wenruo
  2021-11-19  8:37 ` Johannes Thumshirn
                   ` (2 more replies)
  0 siblings, 3 replies; 5+ messages in thread
From: Qu Wenruo @ 2021-11-19  6:19 UTC (permalink / raw)
  To: linux-btrfs

[BUG]
When debugging calc_bio_boundaries(), I found that even for RAID1
metadata, we're following stripe length to calculate stripe boundary.

 # mkfs.btrfs -m raid1 -d raid1 /dev/test/scratch[12]
 # mount /dev/test/scratch /mnt/btrfs
 # xfs_io -f -c "pwrite 0 64K" /mnt/btrfs/file
 # umount

Above very basic operations will make calc_bio_boundaries() to report
the following result:

 submit_extent_page: r/i=1/1 file_offset=22036480 len_to_stripe_boundary=49152
 submit_extent_page: r/i=1/1 file_offset=30474240 len_to_stripe_boundary=65536
 ...
 submit_extent_page: r/i=1/1 file_offset=30523392 len_to_stripe_boundary=16384
 submit_extent_page: r/i=1/1 file_offset=30457856 len_to_stripe_boundary=16384
 submit_extent_page: r/i=5/257 file_offset=0 len_to_stripe_boundary=65536
 submit_extent_page: r/i=5/257 file_offset=65536 len_to_stripe_boundary=65536
 submit_extent_page: r/i=1/1 file_offset=30490624 len_to_stripe_boundary=49152
 submit_extent_page: r/i=1/1 file_offset=30507008 len_to_stripe_boundary=32768

Where "r/i" is the rootid and inode, 1/1 means they metadata.
The remaining names match the member used in kernel.

Even all data/metadata are using raid1, we're still following stripe
length.

[CAUSE]
This behavior is caused by a wrong condition in btrfs_get_io_geometry():

	if (map->type & BTRFS_BLOCK_GROUP_PROFILE_MASK) {
		/* Fill using stripe_len */
		len = min_t(u64, em->len - offset, max_len);
	} else {
		len = em->len - offset;
	}

This means, only for SINGLE we will not follow stripe_len.

However for profiles like RAID1*, DUP, they don't need to bother
stripe_len.

This can lead to unnecessry bio split for RAID1*/DUP profiles, and can
even be a blockage for future zoned RAID support.

[FIX]
Introduce one single-use macro, BTRFS_BLOCK_GROUP_STRIPE_MASK, and
change the condition to only calculate the length using stripe length
for stripe based profiles.

Signed-off-by: Qu Wenruo <wqu@suse.com>
---
 fs/btrfs/volumes.c | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/fs/btrfs/volumes.c b/fs/btrfs/volumes.c
index ae1a4f2a8af6..3dc1de376966 100644
--- a/fs/btrfs/volumes.c
+++ b/fs/btrfs/volumes.c
@@ -6296,6 +6296,10 @@ static bool need_full_stripe(enum btrfs_map_op op)
 	return (op == BTRFS_MAP_WRITE || op == BTRFS_MAP_GET_READ_MIRRORS);
 }
 
+
+#define BTRFS_BLOCK_GROUP_STRIPE_MASK	(BTRFS_BLOCK_GROUP_RAID0 |\
+					 BTRFS_BLOCK_GROUP_RAID10 |\
+					 BTRFS_BLOCK_GROUP_RAID56_MASK)
 /*
  * Calculate the geometry of a particular (address, len) tuple. This
  * information is used to calculate how big a particular bio can get before it
@@ -6345,7 +6349,8 @@ int btrfs_get_io_geometry(struct btrfs_fs_info *fs_info, struct extent_map *em,
 	stripe_offset = offset - stripe_offset;
 	data_stripes = nr_data_stripes(map);
 
-	if (map->type & BTRFS_BLOCK_GROUP_PROFILE_MASK) {
+	/* Only stripe based profiles needs to check against stripe length. */
+	if (map->type & BTRFS_BLOCK_GROUP_STRIPE_MASK) {
 		u64 max_len = stripe_len - stripe_offset;
 
 		/*
-- 
2.33.1


^ permalink raw reply related	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2021-11-22 18:29 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2021-11-19  6:19 [PATCH] btrfs: don't bother stripe length if the profile is not stripe based Qu Wenruo
2021-11-19  8:37 ` Johannes Thumshirn
2021-11-19 10:22 ` Anand Jain
2021-11-19 10:32   ` Qu Wenruo
2021-11-22 18:29 ` David Sterba

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox