[PATCH] Do not requeue requests if FAILFAST is set

public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed

* [PATCH] Do not requeue requests if FAILFAST is set
@ 2007-11-06  8:23 Hannes Reinecke
  2007-11-06 18:01 ` Mike Christie
  0 siblings, 1 reply; 2+ messages in thread
From: Hannes Reinecke @ 2007-11-06  8:23 UTC (permalink / raw)
  To: James Bottomley; +Cc: SCSI Mailing List, Mike Christie

[-- Attachment #1: Type: text/plain, Size: 663 bytes --]

Hi James,

the FAILFAST flag is currently not honored in scsi_request_fn().
So if scsi_dispatch_cmd() returns an error or the queue is not
ready, the request will always be requeued regardless of the
FAILFAST setting.
This can result in a delayed fallback for multipathing, as
these requests will only be catched once the timeout expires.

This patch avoids requeuing if the FAILFAST flag is set, and
terminates the request immediately.

Comments et al welcome.

Cheers,

Hannes
-- 
Dr. Hannes Reinecke		      zSeries & Storage
hare@suse.de			      +49 911 74053 688
SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg
GF: Markus Rex, HRB 16746 (AG Nürnberg)

[-- Attachment #2: scsi-kill-failfast-requests --]
[-- Type: text/plain, Size: 2016 bytes --]

Do not requeue requests if REQ_FAILFAST is set

Any requests with the REQ_FAILFAST flag set should not be requeued
to the requeust queue, but rather terminated directly.
Otherwise the multipath failover will stall until the command
timeout triggers.

Signed-off-by: Hannes Reinecke <hare@suse.de>

diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c
index 277f1b6..a12d2f1 100644
--- a/drivers/scsi/scsi_lib.c
+++ b/drivers/scsi/scsi_lib.c
@@ -1200,6 +1200,11 @@ static int scsi_prep_fn(struct request_queue *q, struct request *req)
 			 */
 			if (!(req->cmd_flags & REQ_PREEMPT))
 				ret = BLKPREP_DEFER;
+			/*
+			 * Return failfast requests immediately
+			 */
+			if (req->cmd_flags & REQ_FAILFAST)
+				ret = BLKPREP_KILL;
 			break;
 		default:
 			/*
@@ -1327,6 +1332,17 @@ static inline int scsi_host_queue_ready(struct request_queue *q,
 	return 1;
 }
 
+static void __scsi_kill_request(struct request *req)
+{
+	struct scsi_cmnd *cmd = req->special;
+	struct scsi_device *sdev = cmd->device;
+
+	cmd->result = DID_NO_CONNECT << 16;
+	atomic_inc(&cmd->device->iorequest_cnt);
+	sdev->device_busy--;
+	__scsi_done(cmd);
+}
+
 /*
  * Kill a request for a dead device
  */
@@ -1440,8 +1456,16 @@ static void scsi_request_fn(struct request_queue *q)
 		 * accept it.
 		 */
 		req = elv_next_request(q);
-		if (!req || !scsi_dev_queue_ready(q, sdev))
+		if (!req)
+			break;
+
+		if (!scsi_dev_queue_ready(q, sdev)) {
+			if (req->cmd_flags & REQ_FAILFAST) {
+				scsi_kill_request(req, q);
+				continue;
+			}
 			break;
+		}
 
 		if (unlikely(!scsi_device_online(sdev))) {
 			sdev_printk(KERN_ERR, sdev,
@@ -1522,8 +1546,12 @@ static void scsi_request_fn(struct request_queue *q)
 	 * later time.
 	 */
 	spin_lock_irq(q->queue_lock);
-	blk_requeue_request(q, req);
-	sdev->device_busy--;
+	if (unlikely(req->cmd_flags & REQ_FAILFAST))
+		__scsi_kill_request(req);
+	else {
+		blk_requeue_request(q, req);
+		sdev->device_busy--;
+	}
 	if(sdev->device_busy == 0)
 		blk_plug_device(q);
  out:

^ permalink raw reply related	[flat|nested] 2+ messages in thread

* Re: [PATCH] Do not requeue requests if FAILFAST is set
  2007-11-06  8:23 [PATCH] Do not requeue requests if FAILFAST is set Hannes Reinecke
@ 2007-11-06 18:01 ` Mike Christie
  0 siblings, 0 replies; 2+ messages in thread
From: Mike Christie @ 2007-11-06 18:01 UTC (permalink / raw)
  To: Hannes Reinecke; +Cc: James Bottomley, SCSI Mailing List

Hannes Reinecke wrote:
> Hi James,
> 
> the FAILFAST flag is currently not honored in scsi_request_fn().
> So if scsi_dispatch_cmd() returns an error or the queue is not
> ready, the request will always be requeued regardless of the
> FAILFAST setting.
> This can result in a delayed fallback for multipathing, as
> these requests will only be catched once the timeout expires.
> 
> This patch avoids requeuing if the FAILFAST flag is set, and
> terminates the request immediately.
> 
> Comments et al welcome.
> 

I think that might be better than my transport errors patch. If 
multipath is being used then we told people to set the recovery timeout, 
to a very low value. So that way commands would not sit in a blocked 
queue very long.

With your patch, if the queue was blocked because we are doing recovery, 
then the command is returned upwards as soon as the driver/class returns 
the command to the scsi layer. And that is nice since it fixes that 
issue where the fibre channel class/driver fails commands in the driver 
when fail fast tmo fires, but commands in the blocked queue sit around 
until dev loss tmo fires or the rport comes back.

Is there any case though where fail fast is set on the request, but the 
driver really does want to retry on the driver? Was there some other 
user of something like DID_IMM_RETRY that could be used with multipath 
or raid and when they return DID_IMM_RETRY they wanted to bypass the 
fail fast checks? If so that would be my only reason for splitting up 
the error values and relying on the transport class timers (fc fail fast 
tmo and iscsi recovery tmo) to decide when to fail commands (assuming 
that the fc class is also changed so when fail fast tmo fires blocked 
commands are failed).

Oh yeah since we are now doing dm-multipath with SAS, what is that class 
doing as far as failing commands when there is a transport issue? Does 
it have something like fc's fail fast tmo? Does it fail running commands 
right away so the scsi layer can requeue or does it try to do some lower 
level recovery and try to continue the command until some timeout fires?

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2007-11-06 18:01 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-11-06  8:23 [PATCH] Do not requeue requests if FAILFAST is set Hannes Reinecke
2007-11-06 18:01 ` Mike Christie

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox