linux-block.vger.kernel.org archive mirror
* [PATCHSET 0/2] Optimize block_device utilization
@ 2023-04-14 13:48 Jens Axboe
  2023-04-14 13:48 ` [PATCH 1/2] block: re-arrange the struct block_device fields for better layout Jens Axboe
  2023-04-14 13:48 ` [PATCH 2/2] block: store bdev->bd_disk->fops->submit_bio state in bdev Jens Axboe
  0 siblings, 2 replies; 9+ messages in thread
From: Jens Axboe @ 2023-04-14 13:48 UTC (permalink / raw)
  To: linux-block

Hi,

First is just a prep patch that moves struct device, which is quite the
pig, out of line in block_device, and then shuffles a few fields for
better layout. It also saves 16 bytes of space in there while at it.

The second patch caches the bdev->bd_disk->fops->submit_bio state in the bdev,
so we can avoid this long dependent memory load chain in the fast
path.

-- 
Jens Axboe




* [PATCH 1/2] block: re-arrange the struct block_device fields for better layout
  2023-04-14 13:48 [PATCHSET 0/2] Optimize block_device utilization Jens Axboe
@ 2023-04-14 13:48 ` Jens Axboe
  2023-04-16  5:51   ` Christoph Hellwig
  2023-04-14 13:48 ` [PATCH 2/2] block: store bdev->bd_disk->fops->submit_bio state in bdev Jens Axboe
  1 sibling, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2023-04-14 13:48 UTC (permalink / raw)
  To: linux-block; +Cc: Jens Axboe

This moves struct device out-of-line as it's just used at open/close
time, so we can keep some of the commonly used fields closer together.
On a standard setup, it also reduces the size from 864 bytes to 848
bytes. Yes, struct device is a pig...

Signed-off-by: Jens Axboe <axboe@kernel.dk>
---
 include/linux/blk_types.h | 16 ++++++++--------
 1 file changed, 8 insertions(+), 8 deletions(-)

diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
index 99be590f952f..d68d6e951fad 100644
--- a/include/linux/blk_types.h
+++ b/include/linux/blk_types.h
@@ -40,26 +40,25 @@ struct bio_crypt_ctx;
 struct block_device {
 	sector_t		bd_start_sect;
 	sector_t		bd_nr_sectors;
+	struct gendisk *	bd_disk;
+	struct request_queue *	bd_queue;
 	struct disk_stats __percpu *bd_stats;
 	unsigned long		bd_stamp;
 	bool			bd_read_only;	/* read-only policy */
+	u8			bd_partno;
+	bool			bd_write_holder;
 	dev_t			bd_dev;
 	atomic_t		bd_openers;
+	spinlock_t		bd_size_lock; /* for bd_inode->i_size updates */
 	struct inode *		bd_inode;	/* will die */
 	struct super_block *	bd_super;
 	void *			bd_claiming;
-	struct device		bd_device;
 	void *			bd_holder;
+	/* The counter of freeze processes */
+	int			bd_fsfreeze_count;
 	int			bd_holders;
-	bool			bd_write_holder;
 	struct kobject		*bd_holder_dir;
-	u8			bd_partno;
-	spinlock_t		bd_size_lock; /* for bd_inode->i_size updates */
-	struct gendisk *	bd_disk;
-	struct request_queue *	bd_queue;
 
-	/* The counter of freeze processes */
-	int			bd_fsfreeze_count;
 	/* Mutex for freeze */
 	struct mutex		bd_fsfreeze_mutex;
 	struct super_block	*bd_fsfreeze_sb;
@@ -68,6 +67,7 @@ struct block_device {
 #ifdef CONFIG_FAIL_MAKE_REQUEST
 	bool			bd_make_it_fail;
 #endif
+	struct device		bd_device;
 } __randomize_layout;
 
 #define bdev_whole(_bdev) \
-- 
2.39.2
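
To illustrate the layout argument outside the kernel, here is a small
user-space sketch. The structs (big_blob, scattered, grouped) are made-up
stand-ins, not kernel types; big_blob only plays the role of a large, rarely
touched member such as struct device. It shows how grouping the small fields
and moving the cold member last can shrink a struct and keep the hot fields
adjacent. Exact sizes depend on the ABI, so the sketch prints them instead of
assuming a number.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical stand-in for a large, rarely used embedded object. */
struct big_blob {
	uint64_t words[32];
};

/* Small fields interleaved with pointer-sized ones: each adds padding. */
struct scattered {
	uint64_t	start;
	uint8_t		read_only;
	uint64_t	stamp;
	uint8_t		partno;
	void		*disk;
	uint8_t		write_holder;
	struct big_blob	blob;	/* cold data sitting between hot fields */
	void		*queue;
};

/* Same members: hot fields first, small fields grouped, cold blob last. */
struct grouped {
	uint64_t	start;
	uint64_t	stamp;
	void		*disk;
	void		*queue;
	uint8_t		read_only;
	uint8_t		partno;
	uint8_t		write_holder;
	struct big_blob	blob;
};

/* Print size and the offsets of interest for both layouts. */
void print_layouts(void)
{
	printf("scattered: size %zu, queue at %zu, blob at %zu\n",
	       sizeof(struct scattered),
	       offsetof(struct scattered, queue),
	       offsetof(struct scattered, blob));
	printf("grouped:   size %zu, queue at %zu, blob at %zu\n",
	       sizeof(struct grouped),
	       offsetof(struct grouped, queue),
	       offsetof(struct grouped, blob));
}
```

Calling print_layouts() from a test program shows the difference on the
machine at hand. For real kernel structs, a tool such as pahole is the usual
way to inspect holes and padding.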



* [PATCH 2/2] block: store bdev->bd_disk->fops->submit_bio state in bdev
  2023-04-14 13:48 [PATCHSET 0/2] Optimize block_device utilization Jens Axboe
  2023-04-14 13:48 ` [PATCH 1/2] block: re-arrange the struct block_device fields for better layout Jens Axboe
@ 2023-04-14 13:48 ` Jens Axboe
  2023-04-15  2:43   ` Damien Le Moal
  2023-04-16  5:53   ` Christoph Hellwig
  1 sibling, 2 replies; 9+ messages in thread
From: Jens Axboe @ 2023-04-14 13:48 UTC (permalink / raw)
  To: linux-block; +Cc: Jens Axboe

We have a long chain of memory dereferencing just to check whether or not
this disk has a special submit_bio helper. As that's not necessarily
the common case, add a bd_submit_bio state in the bdev to avoid
traversing this memory dependency chain if we don't need to.

Signed-off-by: Jens Axboe <axboe@kernel.dk>
---
 block/bdev.c              | 1 +
 block/blk-core.c          | 8 ++++----
 block/genhd.c             | 4 ++++
 include/linux/blk_types.h | 1 +
 4 files changed, 10 insertions(+), 4 deletions(-)

diff --git a/block/bdev.c b/block/bdev.c
index 1795c7d4b99e..31a5d25b2b44 100644
--- a/block/bdev.c
+++ b/block/bdev.c
@@ -419,6 +419,7 @@ struct block_device *bdev_alloc(struct gendisk *disk, u8 partno)
 	bdev->bd_inode = inode;
 	bdev->bd_queue = disk->queue;
 	bdev->bd_stats = alloc_percpu(struct disk_stats);
+	bdev->bd_submit_bio = 0;
 	if (!bdev->bd_stats) {
 		iput(inode);
 		return NULL;
diff --git a/block/blk-core.c b/block/blk-core.c
index 269765d16cfd..ae7953539dc0 100644
--- a/block/blk-core.c
+++ b/block/blk-core.c
@@ -587,14 +587,14 @@ static inline blk_status_t blk_check_zone_append(struct request_queue *q,
 
 static void __submit_bio(struct bio *bio)
 {
-	struct gendisk *disk = bio->bi_bdev->bd_disk;
-
 	if (unlikely(!blk_crypto_bio_prep(&bio)))
 		return;
 
-	if (!disk->fops->submit_bio) {
+	if (!bio->bi_bdev->bd_submit_bio) {
 		blk_mq_submit_bio(bio);
 	} else if (likely(bio_queue_enter(bio) == 0)) {
+		struct gendisk *disk = bio->bi_bdev->bd_disk;
+
 		disk->fops->submit_bio(bio);
 		blk_queue_exit(disk->queue);
 	}
@@ -698,7 +698,7 @@ void submit_bio_noacct_nocheck(struct bio *bio)
 	 */
 	if (current->bio_list)
 		bio_list_add(&current->bio_list[0], bio);
-	else if (!bio->bi_bdev->bd_disk->fops->submit_bio)
+	else if (!bio->bi_bdev->bd_submit_bio)
 		__submit_bio_noacct_mq(bio);
 	else
 		__submit_bio_noacct(bio);
diff --git a/block/genhd.c b/block/genhd.c
index 02d9cfb9e077..07736c5db988 100644
--- a/block/genhd.c
+++ b/block/genhd.c
@@ -420,6 +420,10 @@ int __must_check device_add_disk(struct device *parent, struct gendisk *disk,
 	 */
 	elevator_init_mq(disk->queue);
 
+	/* Mark bdev as having a submit_bio, if needed */
+	if (disk->fops->submit_bio)
+		disk->part0->bd_submit_bio = 1;
+
 	/*
 	 * If the driver provides an explicit major number it also must provide
 	 * the number of minors numbers supported, and those will be used to
diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
index d68d6e951fad..c08e1c08b7ba 100644
--- a/include/linux/blk_types.h
+++ b/include/linux/blk_types.h
@@ -47,6 +47,7 @@ struct block_device {
 	bool			bd_read_only;	/* read-only policy */
 	u8			bd_partno;
 	bool			bd_write_holder;
+	bool			bd_submit_bio;
 	dev_t			bd_dev;
 	atomic_t		bd_openers;
 	spinlock_t		bd_size_lock; /* for bd_inode->i_size updates */
-- 
2.39.2
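
The idea in this patch can be sketched in plain C, assuming simplified
stand-in types. None of the names below (fops_sketch, disk_sketch,
bdev_sketch, bio_sketch, the PATH_* values) are kernel definitions; they only
mirror the shape of the bdev -> disk -> fops -> submit_bio chain. The flag is
computed once at registration time, so the common path reads one field from
the bdev and only follows the full chain when a driver hook exists.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

struct bio_sketch;

/* Hypothetical, simplified stand-ins for the structures in the chain. */
struct fops_sketch {
	void (*submit_bio)(struct bio_sketch *bio);
};

struct disk_sketch {
	const struct fops_sketch *fops;
};

struct bdev_sketch {
	struct disk_sketch *disk;
	bool has_submit_bio;	/* cached: does the driver have a hook? */
};

enum { PATH_NONE, PATH_MQ, PATH_DRIVER };

struct bio_sketch {
	struct bdev_sketch *bdev;
	int handled_by;		/* records which path was taken */
};

/* Example driver hook. */
void driver_submit(struct bio_sketch *bio)
{
	bio->handled_by = PATH_DRIVER;
}

/* Stand-in for the default multiqueue submission path. */
void mq_submit(struct bio_sketch *bio)
{
	bio->handled_by = PATH_MQ;
}

/* Done once, off the fast path: walk the chain and cache the answer. */
void bdev_register(struct bdev_sketch *bdev, struct disk_sketch *disk)
{
	bdev->disk = disk;
	bdev->has_submit_bio = disk->fops->submit_bio != NULL;
}

/* Fast path: one load from the bdev decides which way to go. */
void submit(struct bio_sketch *bio)
{
	if (!bio->bdev->has_submit_bio)
		mq_submit(bio);
	else
		bio->bdev->disk->fops->submit_bio(bio);
}
```

The cached flag is only valid as long as the fops pointer does not change
after registration, which this sketch simply assumes.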



* Re: [PATCH 2/2] block: store bdev->bd_disk->fops->submit_bio state in bdev
  2023-04-14 13:48 ` [PATCH 2/2] block: store bdev->bd_disk->fops->submit_bio state in bdev Jens Axboe
@ 2023-04-15  2:43   ` Damien Le Moal
  2023-04-15  3:41     ` Jens Axboe
  2023-04-16  5:53   ` Christoph Hellwig
  1 sibling, 1 reply; 9+ messages in thread
From: Damien Le Moal @ 2023-04-15  2:43 UTC (permalink / raw)
  To: Jens Axboe, linux-block

On 4/14/23 22:48, Jens Axboe wrote:
> We have a long chain of memory dereferencing just to check whether or not
> this disk has a special submit_bio helper. As that's not necessarily
> the common case, add a bd_submit_bio state in the bdev to avoid
> traversing this memory dependency chain if we don't need to.
> 
> Signed-off-by: Jens Axboe <axboe@kernel.dk>
> ---
>  block/bdev.c              | 1 +
>  block/blk-core.c          | 8 ++++----
>  block/genhd.c             | 4 ++++
>  include/linux/blk_types.h | 1 +
>  4 files changed, 10 insertions(+), 4 deletions(-)
> 
> diff --git a/block/bdev.c b/block/bdev.c
> index 1795c7d4b99e..31a5d25b2b44 100644
> --- a/block/bdev.c
> +++ b/block/bdev.c
> @@ -419,6 +419,7 @@ struct block_device *bdev_alloc(struct gendisk *disk, u8 partno)
>  	bdev->bd_inode = inode;
>  	bdev->bd_queue = disk->queue;
>  	bdev->bd_stats = alloc_percpu(struct disk_stats);
> +	bdev->bd_submit_bio = 0;

"= false;" would be better to match bd_submit_bio type.

[...]

> diff --git a/block/genhd.c b/block/genhd.c
> index 02d9cfb9e077..07736c5db988 100644
> --- a/block/genhd.c
> +++ b/block/genhd.c
> @@ -420,6 +420,10 @@ int __must_check device_add_disk(struct device *parent, struct gendisk *disk,
>  	 */
>  	elevator_init_mq(disk->queue);
>  
> +	/* Mark bdev as having a submit_bio, if needed */
> +	if (disk->fops->submit_bio)
> +		disk->part0->bd_submit_bio = 1;

"= true;" would be better to match the type.

Note that this could also be:

disk->part0->bd_submit_bio = disk->fops->submit_bio;

thus removing the if.




* Re: [PATCH 2/2] block: store bdev->bd_disk->fops->submit_bio state in bdev
  2023-04-15  2:43   ` Damien Le Moal
@ 2023-04-15  3:41     ` Jens Axboe
  0 siblings, 0 replies; 9+ messages in thread
From: Jens Axboe @ 2023-04-15  3:41 UTC (permalink / raw)
  To: Damien Le Moal, linux-block

On 4/14/23 8:43 PM, Damien Le Moal wrote:
> On 4/14/23 22:48, Jens Axboe wrote:
>> We have a long chain of memory dereferencing just to check whether or not
>> this disk has a special submit_bio helper. As that's not necessarily
>> the common case, add a bd_submit_bio state in the bdev to avoid
>> traversing this memory dependency chain if we don't need to.
>>
>> Signed-off-by: Jens Axboe <axboe@kernel.dk>
>> ---
>>  block/bdev.c              | 1 +
>>  block/blk-core.c          | 8 ++++----
>>  block/genhd.c             | 4 ++++
>>  include/linux/blk_types.h | 1 +
>>  4 files changed, 10 insertions(+), 4 deletions(-)
>>
>> diff --git a/block/bdev.c b/block/bdev.c
>> index 1795c7d4b99e..31a5d25b2b44 100644
>> --- a/block/bdev.c
>> +++ b/block/bdev.c
>> @@ -419,6 +419,7 @@ struct block_device *bdev_alloc(struct gendisk *disk, u8 partno)
>>  	bdev->bd_inode = inode;
>>  	bdev->bd_queue = disk->queue;
>>  	bdev->bd_stats = alloc_percpu(struct disk_stats);
>> +	bdev->bd_submit_bio = 0;
> 
> "= false;" would be better to match bd_submit_bio type.

Done

>> diff --git a/block/genhd.c b/block/genhd.c
>> index 02d9cfb9e077..07736c5db988 100644
>> --- a/block/genhd.c
>> +++ b/block/genhd.c
>> @@ -420,6 +420,10 @@ int __must_check device_add_disk(struct device *parent, struct gendisk *disk,
>>  	 */
>>  	elevator_init_mq(disk->queue);
>>  
>> +	/* Mark bdev as having a submit_bio, if needed */
>> +	if (disk->fops->submit_bio)
>> +		disk->part0->bd_submit_bio = 1;
> 
> "= true;" would be better to match the type.
> 
> Note that this could also be:
> 
> disk->part0->bd_submit_bio = disk->fops->submit_bio;
> 
> thus removing the if.

I made it:

disk->part0->bd_submit_bio = disk->fops->submit_bio != NULL;

instead to make it explicit, I don't think that assignment would be
happy otherwise.
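
For reference, a small user-space sketch of the two forms, using hypothetical
names (submit_fn, dummy_submit and the two helpers are made up for
illustration). As far as the C standard goes, converting a pointer to a C99
_Bool yields 0 or 1, so with a true bool type both forms should store the same
value. The explicit comparison mainly documents intent, and it stays correct
if the flag were ever an integer type, where a raw pointer assignment would
not be valid.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical handler type standing in for a submit_bio hook. */
typedef void (*submit_fn)(void *bio);

void dummy_submit(void *bio)
{
	(void)bio;
}

/* Implicit form: relies on the scalar-to-_Bool conversion (0 or 1). */
bool has_handler_implicit(submit_fn fn)
{
	bool flag = fn;

	return flag;
}

/* Explicit form: the comparison already produces 0 or 1. */
bool has_handler_explicit(submit_fn fn)
{
	return fn != NULL;
}
```

Both helpers are expected to agree for a NULL and a non-NULL handler. Whether
a given compiler or static checker warns about the implicit form is a separate
question that this sketch does not settle.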

-- 
Jens Axboe



* Re: [PATCH 1/2] block: re-arrange the struct block_device fields for better layout
  2023-04-14 13:48 ` [PATCH 1/2] block: re-arrange the struct block_device fields for better layout Jens Axboe
@ 2023-04-16  5:51   ` Christoph Hellwig
  2023-04-16 19:01     ` Jens Axboe
  0 siblings, 1 reply; 9+ messages in thread
From: Christoph Hellwig @ 2023-04-16  5:51 UTC (permalink / raw)
  To: Jens Axboe; +Cc: linux-block

On Fri, Apr 14, 2023 at 07:48:47AM -0600, Jens Axboe wrote:
> This moves struct device out-of-line as it's just used at open/close
> time, so we can keep some of the commonly used fields closer together.
> On a standard setup, it also reduces the size from 864 bytes to 848
> bytes. Yes, struct device is a pig...

Maybe add a comment about keeping struct device last and why?

Otherwise looks good:

Reviewed-by: Christoph Hellwig <hch@lst.de>


* Re: [PATCH 2/2] block: store bdev->bd_disk->fops->submit_bio state in bdev
  2023-04-14 13:48 ` [PATCH 2/2] block: store bdev->bd_disk->fops->submit_bio state in bdev Jens Axboe
  2023-04-15  2:43   ` Damien Le Moal
@ 2023-04-16  5:53   ` Christoph Hellwig
  2023-04-16 18:59     ` Jens Axboe
  1 sibling, 1 reply; 9+ messages in thread
From: Christoph Hellwig @ 2023-04-16  5:53 UTC (permalink / raw)
  To: Jens Axboe; +Cc: linux-block

On Fri, Apr 14, 2023 at 07:48:48AM -0600, Jens Axboe wrote:
> We have a long chain of memory dereferencing just to check whether or not
> this disk has a special submit_bio helper. As that's not necessarily
> the common case, add a bd_submit_bio state in the bdev to avoid
> traversing this memory dependency chain if we don't need to.

Do you have any numbers on how this helps?

> +	bdev->bd_submit_bio = 0;

bd_submit_bio sounds like a function call, so I'd name this
bd_has_submit_bio.

But maybe it might make more sense to just add a bit that this is
a blk-mq backed device into bd_state as that might be handy in other
places as well?
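
A rough sketch of what such a state bit could look like, purely as an
illustration. Every name here is hypothetical (BD_SKETCH_BLK_MQ,
BD_SKETCH_READ_ONLY, struct bdev_state_sketch and the helpers are not kernel
definitions), and it assumes that a device without a submit_bio hook is the
blk-mq backed case, as in the __submit_bio() hunk of the patch.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical state bits; neither name is taken from the kernel. */
enum {
	BD_SKETCH_BLK_MQ	= 1u << 0,	/* device is blk-mq backed */
	BD_SKETCH_READ_ONLY	= 1u << 1,	/* an unrelated example bit */
};

struct bdev_state_sketch {
	unsigned int state;
};

/* Set the blk-mq bit once, based on whether the driver has a hook. */
void bdev_state_init(struct bdev_state_sketch *bdev, bool has_submit_bio)
{
	bdev->state = 0;
	if (!has_submit_bio)
		bdev->state |= BD_SKETCH_BLK_MQ;
}

/* Any caller can test the bit without touching disk or fops. */
bool bdev_state_is_mq(const struct bdev_state_sketch *bdev)
{
	return (bdev->state & BD_SKETCH_BLK_MQ) != 0;
}
```

Compared with a dedicated bool, a shared state word lets further bits be
added without growing the struct, at the cost of a mask operation per test.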


* Re: [PATCH 2/2] block: store bdev->bd_disk->fops->submit_bio state in bdev
  2023-04-16  5:53   ` Christoph Hellwig
@ 2023-04-16 18:59     ` Jens Axboe
  0 siblings, 0 replies; 9+ messages in thread
From: Jens Axboe @ 2023-04-16 18:59 UTC (permalink / raw)
  To: Christoph Hellwig; +Cc: linux-block

On 4/15/23 11:53 PM, Christoph Hellwig wrote:
> On Fri, Apr 14, 2023 at 07:48:48AM -0600, Jens Axboe wrote:
>> We have a long chain of memory dereferencing just to check whether or not
>> this disk has a special submit_bio helper. As that's not necessarily
>> the common case, add a bd_submit_bio state in the bdev to avoid
>> traversing this memory dependency chain if we don't need to.
> 
> Do you have any numbers on how this helps?

I didn't run any numbers, but seems obvious to me that we don't want
to pull in 3 layers deep of pointer indirections when we can avoid
it.

>> +	bdev->bd_submit_bio = 0;
> 
> bd_submit_bio sounds like a function call, so I'd name this
> bd_has_submit_bio.

Good point, I'll rename it.

> But maybe it might make more sense to just add a bit that this is
> a blk-mq backed device into bd_state as that might be handy in other
> places as well?

I'd rather just do that if needed.

-- 
Jens Axboe




* Re: [PATCH 1/2] block: re-arrange the struct block_device fields for better layout
  2023-04-16  5:51   ` Christoph Hellwig
@ 2023-04-16 19:01     ` Jens Axboe
  0 siblings, 0 replies; 9+ messages in thread
From: Jens Axboe @ 2023-04-16 19:01 UTC (permalink / raw)
  To: Christoph Hellwig; +Cc: linux-block

On 4/15/23 11:51 PM, Christoph Hellwig wrote:
> On Fri, Apr 14, 2023 at 07:48:47AM -0600, Jens Axboe wrote:
>> This moves struct device out-of-line as it's just used at open/close
>> time, so we can keep some of the commonly used fields closer together.
>> On a standard setup, it also reduces the size from 864 bytes to 848
>> bytes. Yes, struct device is a pig...
> 
> Maybe add a comment about keeping struct device last and why?

Sure, done.

-- 
Jens Axboe



