* [PATCH v3 0/4] SCSI core patches
@ 2023-05-17 23:09 Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 1/4] scsi: core: Use min() instead of open-coding it Bart Van Assche
` (3 more replies)
0 siblings, 4 replies; 12+ messages in thread
From: Bart Van Assche @ 2023-05-17 23:09 UTC (permalink / raw)
To: Martin K . Petersen; +Cc: Christoph Hellwig, linux-scsi, Bart Van Assche
Hi Martin,
Please consider these SCSI core patches for the next merge window.
Thanks,
Bart.
Changes compared to v2:
- Dropped patch "scsi: core: Update a source code comment".
- In patch "scsi: core: Trace SCSI sense data", changed the format for the
sense key and left out a superfluous parenthese.
- In patch "scsi: core: Only kick the requeue list if necessary", moved the
blk_mq_kick_requeue_list() from scsi_run_host_queues() into scsi_run_queue().
Changes compared to v1:
- Improved the SCSI tracing patch as requested by Steven Rostedt and
Niklas Cassel.
- Added patch "scsi: core: Delay running the queue if the host is blocked".
Bart Van Assche (4):
scsi: core: Use min() instead of open-coding it
scsi: core: Trace SCSI sense data
scsi: core: Only kick the requeue list if necessary
scsi: core: Delay running the queue if the host is blocked
drivers/scsi/scsi_common.c | 3 +--
drivers/scsi/scsi_lib.c | 15 +++++++++------
include/trace/events/scsi.h | 21 +++++++++++++++++++--
3 files changed, 29 insertions(+), 10 deletions(-)
^ permalink raw reply [flat|nested] 12+ messages in thread
* [PATCH v3 1/4] scsi: core: Use min() instead of open-coding it
2023-05-17 23:09 [PATCH v3 0/4] SCSI core patches Bart Van Assche
@ 2023-05-17 23:09 ` Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 2/4] scsi: core: Trace SCSI sense data Bart Van Assche
` (2 subsequent siblings)
3 siblings, 0 replies; 12+ messages in thread
From: Bart Van Assche @ 2023-05-17 23:09 UTC (permalink / raw)
To: Martin K . Petersen
Cc: Christoph Hellwig, linux-scsi, Bart Van Assche, Benjamin Block,
Douglas Gilbert
Use min() instead of open-coding it in scsi_normalize_sense().
Reviewed-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Benjamin Block <bblock@linux.ibm.com>
Cc: Douglas Gilbert <dgilbert@interlog.com>
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
---
drivers/scsi/scsi_common.c | 3 +--
1 file changed, 1 insertion(+), 2 deletions(-)
diff --git a/drivers/scsi/scsi_common.c b/drivers/scsi/scsi_common.c
index 6e50e81a8216..24dec80a6253 100644
--- a/drivers/scsi/scsi_common.c
+++ b/drivers/scsi/scsi_common.c
@@ -176,8 +176,7 @@ bool scsi_normalize_sense(const u8 *sense_buffer, int sb_len,
if (sb_len > 2)
sshdr->sense_key = (sense_buffer[2] & 0xf);
if (sb_len > 7) {
- sb_len = (sb_len < (sense_buffer[7] + 8)) ?
- sb_len : (sense_buffer[7] + 8);
+ sb_len = min(sb_len, sense_buffer[7] + 8);
if (sb_len > 12)
sshdr->asc = sense_buffer[12];
if (sb_len > 13)
^ permalink raw reply related [flat|nested] 12+ messages in thread
* [PATCH v3 2/4] scsi: core: Trace SCSI sense data
2023-05-17 23:09 [PATCH v3 0/4] SCSI core patches Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 1/4] scsi: core: Use min() instead of open-coding it Bart Van Assche
@ 2023-05-17 23:09 ` Bart Van Assche
2023-05-18 1:00 ` Ming Lei
2023-05-17 23:09 ` [PATCH v3 3/4] scsi: core: Only kick the requeue list if necessary Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked Bart Van Assche
3 siblings, 1 reply; 12+ messages in thread
From: Bart Van Assche @ 2023-05-17 23:09 UTC (permalink / raw)
To: Martin K . Petersen
Cc: Christoph Hellwig, linux-scsi, Bart Van Assche, Niklas Cassel,
Ming Lei, Hannes Reinecke, John Garry, Mike Christie
If a command fails, SCSI sense data is essential to determine why it
failed. Hence make the sense key, ASC and ASCQ codes available in the
ftrace output.
Reviewed-by: Niklas Cassel <niklas.cassel@wdc.com>
Reviewed-by: Christoph Hellwig <hch@lst.de>
Cc: Ming Lei <ming.lei@redhat.com>
Cc: Hannes Reinecke <hare@suse.de>
Cc: John Garry <john.g.garry@oracle.com>
Cc: Mike Christie <michael.christie@oracle.com>
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
---
include/trace/events/scsi.h | 21 +++++++++++++++++++--
1 file changed, 19 insertions(+), 2 deletions(-)
diff --git a/include/trace/events/scsi.h b/include/trace/events/scsi.h
index a2c7befd451a..512792317577 100644
--- a/include/trace/events/scsi.h
+++ b/include/trace/events/scsi.h
@@ -269,9 +269,14 @@ DECLARE_EVENT_CLASS(scsi_cmd_done_timeout_template,
__field( unsigned int, prot_sglen )
__field( unsigned char, prot_op )
__dynamic_array(unsigned char, cmnd, cmd->cmd_len)
+ __field( u8, sense_key )
+ __field( u8, asc )
+ __field( u8, ascq )
),
TP_fast_assign(
+ struct scsi_sense_hdr sshdr;
+
__entry->host_no = cmd->device->host->host_no;
__entry->channel = cmd->device->channel;
__entry->id = cmd->device->id;
@@ -285,11 +290,22 @@ DECLARE_EVENT_CLASS(scsi_cmd_done_timeout_template,
__entry->prot_sglen = scsi_prot_sg_count(cmd);
__entry->prot_op = scsi_get_prot_op(cmd);
memcpy(__get_dynamic_array(cmnd), cmd->cmnd, cmd->cmd_len);
+ if (cmd->sense_buffer && SCSI_SENSE_VALID(cmd) &&
+ scsi_command_normalize_sense(cmd, &sshdr)) {
+ __entry->sense_key = sshdr.sense_key;
+ __entry->asc = sshdr.asc;
+ __entry->ascq = sshdr.ascq;
+ } else {
+ __entry->sense_key = 0;
+ __entry->asc = 0;
+ __entry->ascq = 0;
+ }
),
TP_printk("host_no=%u channel=%u id=%u lun=%u data_sgl=%u prot_sgl=%u " \
"prot_op=%s driver_tag=%d scheduler_tag=%d cmnd=(%s %s raw=%s) " \
- "result=(driver=%s host=%s message=%s status=%s)",
+ "result=(driver=%s host=%s message=%s status=%s "
+ "sense_key=%#x asc=%#x ascq=%#x)",
__entry->host_no, __entry->channel, __entry->id,
__entry->lun, __entry->data_sglen, __entry->prot_sglen,
show_prot_op_name(__entry->prot_op), __entry->driver_tag,
@@ -299,7 +315,8 @@ DECLARE_EVENT_CLASS(scsi_cmd_done_timeout_template,
"DRIVER_OK",
show_hostbyte_name(((__entry->result) >> 16) & 0xff),
"COMMAND_COMPLETE",
- show_statusbyte_name(__entry->result & 0xff))
+ show_statusbyte_name(__entry->result & 0xff),
+ __entry->sense_key, __entry->asc, __entry->ascq)
);
DEFINE_EVENT(scsi_cmd_done_timeout_template, scsi_dispatch_cmd_done,
^ permalink raw reply related [flat|nested] 12+ messages in thread
* [PATCH v3 3/4] scsi: core: Only kick the requeue list if necessary
2023-05-17 23:09 [PATCH v3 0/4] SCSI core patches Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 1/4] scsi: core: Use min() instead of open-coding it Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 2/4] scsi: core: Trace SCSI sense data Bart Van Assche
@ 2023-05-17 23:09 ` Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked Bart Van Assche
3 siblings, 0 replies; 12+ messages in thread
From: Bart Van Assche @ 2023-05-17 23:09 UTC (permalink / raw)
To: Martin K . Petersen
Cc: Christoph Hellwig, linux-scsi, Bart Van Assche, Ming Lei,
Hannes Reinecke, John Garry, Mike Christie
Instead of running the request queue of each device associated with a
host every 3 ms (BLK_MQ_RESOURCE_DELAY) while host error handling is in
progress, run the request queue after error handling has finished.
Cc: Christoph Hellwig <hch@lst.de>
Cc: Ming Lei <ming.lei@redhat.com>
Cc: Hannes Reinecke <hare@suse.de>
Cc: John Garry <john.g.garry@oracle.com>
Cc: Mike Christie <michael.christie@oracle.com>
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
---
drivers/scsi/scsi_lib.c | 13 ++++++++-----
1 file changed, 8 insertions(+), 5 deletions(-)
diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c
index e59eb0cbfc83..e4f34217b879 100644
--- a/drivers/scsi/scsi_lib.c
+++ b/drivers/scsi/scsi_lib.c
@@ -122,11 +122,9 @@ static void scsi_mq_requeue_cmd(struct scsi_cmnd *cmd, unsigned long msecs)
WARN_ON_ONCE(true);
}
- if (msecs) {
- blk_mq_requeue_request(rq, false);
+ blk_mq_requeue_request(rq, false);
+ if (!scsi_host_in_recovery(cmd->device->host))
blk_mq_delay_kick_requeue_list(rq->q, msecs);
- } else
- blk_mq_requeue_request(rq, true);
}
/**
@@ -165,7 +163,8 @@ static void __scsi_queue_insert(struct scsi_cmnd *cmd, int reason, bool unbusy)
*/
cmd->result = 0;
- blk_mq_requeue_request(scsi_cmd_to_rq(cmd), true);
+ blk_mq_requeue_request(scsi_cmd_to_rq(cmd),
+ !scsi_host_in_recovery(cmd->device->host));
}
/**
@@ -449,6 +448,7 @@ static void scsi_run_queue(struct request_queue *q)
if (!list_empty(&sdev->host->starved_list))
scsi_starved_list_run(sdev->host);
+ blk_mq_kick_requeue_list(q);
blk_mq_run_hw_queues(q, false);
}
@@ -499,6 +499,9 @@ static void scsi_mq_uninit_cmd(struct scsi_cmnd *cmd)
static void scsi_run_queue_async(struct scsi_device *sdev)
{
+ if (scsi_host_in_recovery(sdev->host))
+ return;
+
if (scsi_target(sdev)->single_lun ||
!list_empty(&sdev->host->starved_list)) {
kblockd_schedule_work(&sdev->requeue_work);
^ permalink raw reply related [flat|nested] 12+ messages in thread
* [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked
2023-05-17 23:09 [PATCH v3 0/4] SCSI core patches Bart Van Assche
` (2 preceding siblings ...)
2023-05-17 23:09 ` [PATCH v3 3/4] scsi: core: Only kick the requeue list if necessary Bart Van Assche
@ 2023-05-17 23:09 ` Bart Van Assche
2023-05-18 1:16 ` Ming Lei
3 siblings, 1 reply; 12+ messages in thread
From: Bart Van Assche @ 2023-05-17 23:09 UTC (permalink / raw)
To: Martin K . Petersen
Cc: Christoph Hellwig, linux-scsi, Bart Van Assche, Ming Lei,
Hannes Reinecke, John Garry, Mike Christie
Tell the block layer to rerun the queue after a delay instead of
immediately if the SCSI LLD decided to block the host.
Note: neither scsi_run_host_queues() nor scsi_run_queue() are called
from the fast path.
Reviewed-by: Christoph Hellwig <hch@lst.de>
Cc: Ming Lei <ming.lei@redhat.com>
Cc: Hannes Reinecke <hare@suse.de>
Cc: John Garry <john.g.garry@oracle.com>
Cc: Mike Christie <michael.christie@oracle.com>
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
---
drivers/scsi/scsi_lib.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c
index e4f34217b879..b37718ebbbfc 100644
--- a/drivers/scsi/scsi_lib.c
+++ b/drivers/scsi/scsi_lib.c
@@ -1767,7 +1767,7 @@ static blk_status_t scsi_queue_rq(struct blk_mq_hw_ctx *hctx,
break;
case BLK_STS_RESOURCE:
case BLK_STS_ZONE_RESOURCE:
- if (scsi_device_blocked(sdev))
+ if (scsi_device_blocked(sdev) || shost->host_self_blocked)
ret = BLK_STS_DEV_RESOURCE;
break;
case BLK_STS_AGAIN:
^ permalink raw reply related [flat|nested] 12+ messages in thread
* Re: [PATCH v3 2/4] scsi: core: Trace SCSI sense data
2023-05-17 23:09 ` [PATCH v3 2/4] scsi: core: Trace SCSI sense data Bart Van Assche
@ 2023-05-18 1:00 ` Ming Lei
2023-05-18 19:22 ` Bart Van Assche
0 siblings, 1 reply; 12+ messages in thread
From: Ming Lei @ 2023-05-18 1:00 UTC (permalink / raw)
To: Bart Van Assche
Cc: Martin K . Petersen, Christoph Hellwig, linux-scsi, Niklas Cassel,
Hannes Reinecke, John Garry, Mike Christie, ming.lei
On Wed, May 17, 2023 at 04:09:25PM -0700, Bart Van Assche wrote:
> If a command fails, SCSI sense data is essential to determine why it
> failed. Hence make the sense key, ASC and ASCQ codes available in the
> ftrace output.
>
> Reviewed-by: Niklas Cassel <niklas.cassel@wdc.com>
> Reviewed-by: Christoph Hellwig <hch@lst.de>
> Cc: Ming Lei <ming.lei@redhat.com>
> Cc: Hannes Reinecke <hare@suse.de>
> Cc: John Garry <john.g.garry@oracle.com>
> Cc: Mike Christie <michael.christie@oracle.com>
> Signed-off-by: Bart Van Assche <bvanassche@acm.org>
> ---
> include/trace/events/scsi.h | 21 +++++++++++++++++++--
> 1 file changed, 19 insertions(+), 2 deletions(-)
>
> diff --git a/include/trace/events/scsi.h b/include/trace/events/scsi.h
> index a2c7befd451a..512792317577 100644
> --- a/include/trace/events/scsi.h
> +++ b/include/trace/events/scsi.h
> @@ -269,9 +269,14 @@ DECLARE_EVENT_CLASS(scsi_cmd_done_timeout_template,
> __field( unsigned int, prot_sglen )
> __field( unsigned char, prot_op )
> __dynamic_array(unsigned char, cmnd, cmd->cmd_len)
> + __field( u8, sense_key )
> + __field( u8, asc )
> + __field( u8, ascq )
> ),
>
> TP_fast_assign(
> + struct scsi_sense_hdr sshdr;
> +
> __entry->host_no = cmd->device->host->host_no;
> __entry->channel = cmd->device->channel;
> __entry->id = cmd->device->id;
> @@ -285,11 +290,22 @@ DECLARE_EVENT_CLASS(scsi_cmd_done_timeout_template,
> __entry->prot_sglen = scsi_prot_sg_count(cmd);
> __entry->prot_op = scsi_get_prot_op(cmd);
> memcpy(__get_dynamic_array(cmnd), cmd->cmnd, cmd->cmd_len);
> + if (cmd->sense_buffer && SCSI_SENSE_VALID(cmd) &&
> + scsi_command_normalize_sense(cmd, &sshdr)) {
> + __entry->sense_key = sshdr.sense_key;
> + __entry->asc = sshdr.asc;
> + __entry->ascq = sshdr.ascq;
> + } else {
> + __entry->sense_key = 0;
> + __entry->asc = 0;
> + __entry->ascq = 0;
> + }
> ),
>
> TP_printk("host_no=%u channel=%u id=%u lun=%u data_sgl=%u prot_sgl=%u " \
> "prot_op=%s driver_tag=%d scheduler_tag=%d cmnd=(%s %s raw=%s) " \
> - "result=(driver=%s host=%s message=%s status=%s)",
> + "result=(driver=%s host=%s message=%s status=%s "
> + "sense_key=%#x asc=%#x ascq=%#x)",
This way probably breaks userspace script or utility, maybe you can
just append "sense(sense_key=%#x asc=%#x ascq=%#x)" only.
thanks,
Ming
^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked
2023-05-17 23:09 ` [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked Bart Van Assche
@ 2023-05-18 1:16 ` Ming Lei
2023-05-18 2:34 ` Bart Van Assche
0 siblings, 1 reply; 12+ messages in thread
From: Ming Lei @ 2023-05-18 1:16 UTC (permalink / raw)
To: Bart Van Assche
Cc: Martin K . Petersen, Christoph Hellwig, linux-scsi,
Hannes Reinecke, John Garry, Mike Christie, ming.lei
On Wed, May 17, 2023 at 04:09:27PM -0700, Bart Van Assche wrote:
> Tell the block layer to rerun the queue after a delay instead of
> immediately if the SCSI LLD decided to block the host.
>
> Note: neither scsi_run_host_queues() nor scsi_run_queue() are called
> from the fast path.
>
> Reviewed-by: Christoph Hellwig <hch@lst.de>
> Cc: Ming Lei <ming.lei@redhat.com>
> Cc: Hannes Reinecke <hare@suse.de>
> Cc: John Garry <john.g.garry@oracle.com>
> Cc: Mike Christie <michael.christie@oracle.com>
> Signed-off-by: Bart Van Assche <bvanassche@acm.org>
> ---
> drivers/scsi/scsi_lib.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c
> index e4f34217b879..b37718ebbbfc 100644
> --- a/drivers/scsi/scsi_lib.c
> +++ b/drivers/scsi/scsi_lib.c
> @@ -1767,7 +1767,7 @@ static blk_status_t scsi_queue_rq(struct blk_mq_hw_ctx *hctx,
> break;
> case BLK_STS_RESOURCE:
> case BLK_STS_ZONE_RESOURCE:
> - if (scsi_device_blocked(sdev))
> + if (scsi_device_blocked(sdev) || shost->host_self_blocked)
> ret = BLK_STS_DEV_RESOURCE;
What if scsi_unblock_requests() is just called after the above check and
before returning to block layer core? Then this request is invisible to
scsi_run_host_queues()<-scsi_unblock_requests(), and io hang happens.
Thanks,
Ming
^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked
2023-05-18 1:16 ` Ming Lei
@ 2023-05-18 2:34 ` Bart Van Assche
2023-05-18 2:39 ` Ming Lei
2023-05-18 11:36 ` Hannes Reinecke
0 siblings, 2 replies; 12+ messages in thread
From: Bart Van Assche @ 2023-05-18 2:34 UTC (permalink / raw)
To: Ming Lei
Cc: Martin K . Petersen, Christoph Hellwig, linux-scsi,
Hannes Reinecke, John Garry, Mike Christie
On 5/17/23 18:16, Ming Lei wrote:
> On Wed, May 17, 2023 at 04:09:27PM -0700, Bart Van Assche wrote:
>> @@ -1767,7 +1767,7 @@ static blk_status_t scsi_queue_rq(struct blk_mq_hw_ctx *hctx,
>> break;
>> case BLK_STS_RESOURCE:
>> case BLK_STS_ZONE_RESOURCE:
>> - if (scsi_device_blocked(sdev))
>> + if (scsi_device_blocked(sdev) || shost->host_self_blocked)
>> ret = BLK_STS_DEV_RESOURCE;
>
> What if scsi_unblock_requests() is just called after the above check and
> before returning to block layer core? Then this request is invisible to
> scsi_run_host_queues()<-scsi_unblock_requests(), and io hang happens.
If returning BLK_STS_DEV_RESOURCE could cause an I/O hang, wouldn't that
be a bug in the block layer core? Isn't the block layer core expected to
rerun the queue after a delay if a block driver returns
BLK_STS_DEV_RESOURCE? See also blk_mq_dispatch_rq_list().
Thanks,
Bart.
^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked
2023-05-18 2:34 ` Bart Van Assche
@ 2023-05-18 2:39 ` Ming Lei
2023-05-18 14:59 ` Bart Van Assche
2023-05-18 11:36 ` Hannes Reinecke
1 sibling, 1 reply; 12+ messages in thread
From: Ming Lei @ 2023-05-18 2:39 UTC (permalink / raw)
To: Bart Van Assche
Cc: Martin K . Petersen, Christoph Hellwig, linux-scsi,
Hannes Reinecke, John Garry, Mike Christie, ming.lei
On Wed, May 17, 2023 at 07:34:38PM -0700, Bart Van Assche wrote:
> On 5/17/23 18:16, Ming Lei wrote:
> > On Wed, May 17, 2023 at 04:09:27PM -0700, Bart Van Assche wrote:
> > > @@ -1767,7 +1767,7 @@ static blk_status_t scsi_queue_rq(struct blk_mq_hw_ctx *hctx,
> > > break;
> > > case BLK_STS_RESOURCE:
> > > case BLK_STS_ZONE_RESOURCE:
> > > - if (scsi_device_blocked(sdev))
> > > + if (scsi_device_blocked(sdev) || shost->host_self_blocked)
> > > ret = BLK_STS_DEV_RESOURCE;
> >
> > What if scsi_unblock_requests() is just called after the above check and
> > before returning to block layer core? Then this request is invisible to
> > scsi_run_host_queues()<-scsi_unblock_requests(), and io hang happens.
>
> If returning BLK_STS_DEV_RESOURCE could cause an I/O hang, wouldn't that be
> a bug in the block layer core? Isn't the block layer core expected to rerun
> the queue after a delay if a block driver returns BLK_STS_DEV_RESOURCE? See
> also blk_mq_dispatch_rq_list().
Please see comment for BLK_STS_DEV_RESOURCE:
/*
* BLK_STS_DEV_RESOURCE is returned from the driver to the block layer if
* device related resources are unavailable, but the driver can guarantee
* that the queue will be rerun in the future once resources become
* available again. This is typically the case for device specific
* resources that are consumed for IO. If the driver fails allocating these
* resources, we know that inflight (or pending) IO will free these
* resource upon completion.
Basically it requires driver to re-run queue.
In reality, it can be full of race, maybe we can just remove
BLK_STS_DEV_RESOURCE.
Thanks,
Ming
^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked
2023-05-18 2:34 ` Bart Van Assche
2023-05-18 2:39 ` Ming Lei
@ 2023-05-18 11:36 ` Hannes Reinecke
1 sibling, 0 replies; 12+ messages in thread
From: Hannes Reinecke @ 2023-05-18 11:36 UTC (permalink / raw)
To: Bart Van Assche, Ming Lei
Cc: Martin K . Petersen, Christoph Hellwig, linux-scsi, John Garry,
Mike Christie
On 5/18/23 04:34, Bart Van Assche wrote:
> On 5/17/23 18:16, Ming Lei wrote:
>> On Wed, May 17, 2023 at 04:09:27PM -0700, Bart Van Assche wrote:
>>> @@ -1767,7 +1767,7 @@ static blk_status_t scsi_queue_rq(struct
>>> blk_mq_hw_ctx *hctx,
>>> break;
>>> case BLK_STS_RESOURCE:
>>> case BLK_STS_ZONE_RESOURCE:
>>> - if (scsi_device_blocked(sdev))
>>> + if (scsi_device_blocked(sdev) || shost->host_self_blocked)
>>> ret = BLK_STS_DEV_RESOURCE;
>>
>> What if scsi_unblock_requests() is just called after the above check and
>> before returning to block layer core? Then this request is invisible to
>> scsi_run_host_queues()<-scsi_unblock_requests(), and io hang happens.
>
> If returning BLK_STS_DEV_RESOURCE could cause an I/O hang, wouldn't that
> be a bug in the block layer core? Isn't the block layer core expected to
> rerun the queue after a delay if a block driver returns
> BLK_STS_DEV_RESOURCE? See also blk_mq_dispatch_rq_list().
>
DEV_RESOURCE is a tricky thing; it actually implies that the _device_ is
blocked, and no further I/O is possible.
And it's actually the responsibility of the device/driver to kick the
queue again once the contention/block/whatever is done.
So no, the block layer should not do anything here, relying on the
driver to do something.
Which is why returning DEV_RESOURCE is not recommended, seeing that it's
easy to get it wrong...
Cheers,
Hannes
--
Dr. Hannes Reinecke Kernel Storage Architect
hare@suse.de +49 911 74053 688
SUSE Software Solutions GmbH, Maxfeldstr. 5, 90409 Nürnberg
HRB 36809 (AG Nürnberg), Geschäftsführer: Ivo Totev, Andrew
Myers, Andrew McDonald, Martje Boudien Moerman
^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked
2023-05-18 2:39 ` Ming Lei
@ 2023-05-18 14:59 ` Bart Van Assche
0 siblings, 0 replies; 12+ messages in thread
From: Bart Van Assche @ 2023-05-18 14:59 UTC (permalink / raw)
To: Ming Lei
Cc: Martin K . Petersen, Christoph Hellwig, linux-scsi,
Hannes Reinecke, John Garry, Mike Christie
On 5/17/23 19:39, Ming Lei wrote:
> In reality, it can be full of race, maybe we can just remove
> BLK_STS_DEV_RESOURCE.
We need a better solution than changing BLK_STS_DEV_RESOURCE into
BLK_STS_RESOURCE. Otherwise power usage will go up in battery-powered
devices that use block drivers that return BLK_STS_DEV_RESOURCE today.
Thanks,
Bart.
^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: [PATCH v3 2/4] scsi: core: Trace SCSI sense data
2023-05-18 1:00 ` Ming Lei
@ 2023-05-18 19:22 ` Bart Van Assche
0 siblings, 0 replies; 12+ messages in thread
From: Bart Van Assche @ 2023-05-18 19:22 UTC (permalink / raw)
To: Ming Lei
Cc: Martin K . Petersen, Christoph Hellwig, linux-scsi, Niklas Cassel,
Hannes Reinecke, John Garry, Mike Christie
On 5/17/23 18:00, Ming Lei wrote:
> On Wed, May 17, 2023 at 04:09:25PM -0700, Bart Van Assche wrote:
>> TP_printk("host_no=%u channel=%u id=%u lun=%u data_sgl=%u prot_sgl=%u " \
>> "prot_op=%s driver_tag=%d scheduler_tag=%d cmnd=(%s %s raw=%s) " \
>> - "result=(driver=%s host=%s message=%s status=%s)",
>> + "result=(driver=%s host=%s message=%s status=%s "
>> + "sense_key=%#x asc=%#x ascq=%#x)",
>
> This way probably breaks userspace script or utility, maybe you can
> just append "sense(sense_key=%#x asc=%#x ascq=%#x)" only.
I will make this change.
Thanks,
Bart.
^ permalink raw reply [flat|nested] 12+ messages in thread
end of thread, other threads:[~2023-05-18 19:22 UTC | newest]
Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-05-17 23:09 [PATCH v3 0/4] SCSI core patches Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 1/4] scsi: core: Use min() instead of open-coding it Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 2/4] scsi: core: Trace SCSI sense data Bart Van Assche
2023-05-18 1:00 ` Ming Lei
2023-05-18 19:22 ` Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 3/4] scsi: core: Only kick the requeue list if necessary Bart Van Assche
2023-05-17 23:09 ` [PATCH v3 4/4] scsi: core: Delay running the queue if the host is blocked Bart Van Assche
2023-05-18 1:16 ` Ming Lei
2023-05-18 2:34 ` Bart Van Assche
2023-05-18 2:39 ` Ming Lei
2023-05-18 14:59 ` Bart Van Assche
2023-05-18 11:36 ` Hannes Reinecke
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).