From: Damien Le Moal <dlemoal@kernel.org>
To: Bart Van Assche <bvanassche@acm.org>, Jens Axboe <axboe@kernel.dk>
Cc: linux-block@vger.kernel.org, Christoph Hellwig <hch@lst.de>,
Damien Le Moal <damien.lemoal@wdc.com>
Subject: Re: [PATCH v2 2/5] block/mq-deadline: Only use zone locking if necessary
Date: Tue, 18 Jul 2023 15:38:43 +0900 [thread overview]
Message-ID: <98e2f100-76b0-c28f-bb02-4a41c1b71f5e@kernel.org> (raw)
In-Reply-To: <20230710180210.1582299-3-bvanassche@acm.org>
On 7/11/23 03:01, Bart Van Assche wrote:
> Measurements have shown that limiting the queue depth to one for zoned
> writes has a significant negative performance impact on zoned UFS devices.
> Hence this patch that disables zone locking from the mq-deadline scheduler
> for storage controllers that support pipelining zoned writes. This patch is
> based on the following assumptions:
> - Applications submit write requests to sequential write required zones
> in order.
> - It happens infrequently that zoned write requests are reordered by the
> block layer.
> - The storage controller does not reorder write requests that have been
> submitted to the same hardware queue. This is the case for UFS: the
> UFSHCI specification requires that UFS controllers process requests in
> order per hardware queue.
> - The I/O priority of all pipelined write requests is the same per zone.
> - Either no I/O scheduler is used or an I/O scheduler is used that
> submits write requests per zone in LBA order.
>
> Cc: Damien Le Moal <damien.lemoal@wdc.com>
> Signed-off-by: Bart Van Assche <bvanassche@acm.org>
> ---
> block/blk-zoned.c | 3 ++-
> block/mq-deadline.c | 14 +++++++++-----
> 2 files changed, 11 insertions(+), 6 deletions(-)
>
> diff --git a/block/blk-zoned.c b/block/blk-zoned.c
> index 0f9f97cdddd9..59560d1657e3 100644
> --- a/block/blk-zoned.c
> +++ b/block/blk-zoned.c
> @@ -504,7 +504,8 @@ static int blk_revalidate_zone_cb(struct blk_zone *zone, unsigned int idx,
> break;
> case BLK_ZONE_TYPE_SEQWRITE_REQ:
> case BLK_ZONE_TYPE_SEQWRITE_PREF:
> - if (!args->seq_zones_wlock) {
> + if (!blk_queue_pipeline_zoned_writes(q) &&
> + !args->seq_zones_wlock) {
I think that this change should go into the first patch, no ?
> args->seq_zones_wlock =
> blk_alloc_zone_bitmap(q->node, args->nr_zones);
> if (!args->seq_zones_wlock)
> diff --git a/block/mq-deadline.c b/block/mq-deadline.c
> index 6aa5daf7ae32..0bed2bdeed89 100644
> --- a/block/mq-deadline.c
> +++ b/block/mq-deadline.c
> @@ -353,7 +353,8 @@ deadline_fifo_request(struct deadline_data *dd, struct dd_per_prio *per_prio,
> return NULL;
>
> rq = rq_entry_fifo(per_prio->fifo_list[data_dir].next);
> - if (data_dir == DD_READ || !blk_queue_is_zoned(rq->q))
> + if (data_dir == DD_READ || !blk_queue_is_zoned(rq->q) ||
> + blk_queue_pipeline_zoned_writes(rq->q))
What about using blk_req_needs_zone_write_lock() ?
> return rq;
>
> /*
> @@ -398,7 +399,8 @@ deadline_next_request(struct deadline_data *dd, struct dd_per_prio *per_prio,
> if (!rq)
> return NULL;
>
> - if (data_dir == DD_READ || !blk_queue_is_zoned(rq->q))
> + if (data_dir == DD_READ || !blk_queue_is_zoned(rq->q) ||
> + blk_queue_pipeline_zoned_writes(rq->q))
Same.
> return rq;
>
> /*
> @@ -526,8 +528,9 @@ static struct request *__dd_dispatch_request(struct deadline_data *dd,
> }
>
> /*
> - * For a zoned block device, if we only have writes queued and none of
> - * them can be dispatched, rq will be NULL.
> + * For a zoned block device that requires write serialization, if we
> + * only have writes queued and none of them can be dispatched, rq will
> + * be NULL.
> */
> if (!rq)
> return NULL;
> @@ -933,7 +936,8 @@ static void dd_finish_request(struct request *rq)
>
> atomic_inc(&per_prio->stats.completed);
>
> - if (blk_queue_is_zoned(q)) {
> + if (blk_queue_is_zoned(rq->q) &&
> + !blk_queue_pipeline_zoned_writes(q)) {
And again here.
> unsigned long flags;
>
> spin_lock_irqsave(&dd->zone_lock, flags);
--
Damien Le Moal
Western Digital Research
next prev parent reply other threads:[~2023-07-18 6:38 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-07-10 18:01 [PATCH v2 0/5] Enable zoned write pipelining for UFS devices Bart Van Assche
2023-07-10 18:01 ` [PATCH v2 1/5] block: Introduce a request queue flag for pipelining zoned writes Bart Van Assche
2023-07-18 6:34 ` Damien Le Moal
2023-07-18 22:37 ` Bart Van Assche
2023-07-19 9:58 ` Damien Le Moal
2023-07-10 18:01 ` [PATCH v2 2/5] block/mq-deadline: Only use zone locking if necessary Bart Van Assche
2023-07-18 6:38 ` Damien Le Moal [this message]
2023-07-18 22:38 ` Bart Van Assche
2023-07-24 21:39 ` Bart Van Assche
2023-07-10 18:01 ` [PATCH v2 3/5] block/null_blk: Add support for pipelining zoned writes Bart Van Assche
2023-07-10 18:01 ` [PATCH v2 4/5] scsi: Retry unaligned " Bart Van Assche
2023-07-18 6:47 ` Damien Le Moal
2023-07-18 22:53 ` Bart Van Assche
2023-07-19 9:59 ` Damien Le Moal
2023-07-19 16:31 ` Bart Van Assche
2023-07-19 23:07 ` Damien Le Moal
2023-07-20 18:18 ` Bart Van Assche
2023-07-21 0:20 ` Damien Le Moal
2023-07-10 18:01 ` [PATCH v2 5/5] scsi: ufs: Enable zoned write pipelining Bart Van Assche
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=98e2f100-76b0-c28f-bb02-4a41c1b71f5e@kernel.org \
--to=dlemoal@kernel.org \
--cc=axboe@kernel.dk \
--cc=bvanassche@acm.org \
--cc=damien.lemoal@wdc.com \
--cc=hch@lst.de \
--cc=linux-block@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).