[PATCH] FC transport & lpfc : Avoid device offline cases by stalling aborts until device unblocked

linux-scsi.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* [PATCH] FC transport & lpfc : Avoid device offline cases by stalling aborts until device unblocked
@ 2006-02-06 16:29 James Smart
  2006-02-06 16:33 ` Christoph Hellwig
  2006-02-06 19:11 ` Luben Tuikov
  0 siblings, 2 replies; 4+ messages in thread
From: James Smart @ 2006-02-06 16:29 UTC (permalink / raw)
  To: linux-scsi

The attached patch creates a FC transport service routine, which can
be specified in the LLDD's scsi_host_template. This allows the transport
to intercept io timeouts, check the state of rport, and if blocked,
request the midlayer to reschedule the timeout. This patch also patches
the lpfc driver to make use of this interface.

What we are trying to avoid is to have the abort handler kick in while the
rport is blocked. The rport is blocked because we have lost connectivity.
As we have no connectivity, the LLDD either fails it (most common) or busy
stalls (yech!) until rport state is resolved.  If the abort is failed, the
error handler escalates, and eventually attempts to send a TUR to the
device, which fails as we have no connectivity. As the TUR fails, the device
gets taken offline, requiring manual interaction to restore use.

The original suggestion was from James Bottomley:
http://marc.theaimsgroup.com/?l=linux-scsi&m=113682123403609&w=2
based on the thread started by Michael Reed:
http://marc.theaimsgroup.com/?l=linux-scsi&m=113477788309880&w=2

Please note:
   This does not resolve the cases where the error handlers are engaged when
   the rport is blocked.

   The need of the transport conflicted with the initial eh_timed_out design,
   which limited the rescheduling of the timeout to the number of retries
   allowed on the command plus 1.  In the case of the FC transport, we already
   have the dev_loss_tmo running to guarantee that this will not be an
   infinite stall, so penalizing the i/o (especially be incrementing the retry
   count) should be avoided.

   Also, as Christoph just recently removed the eh_timed_out use in the
   megaraid driver - he eliminated the only other (in-kernel) user of this
   interface. Perhaps we should consider changing the design and realign
   the EH_RESET_TIMER functionality to what EH_TRANSPORT_BLOCKED does.
   Thoughts ?

-- james s



diff -upNr a/drivers/scsi/scsi_error.c b/drivers/scsi/scsi_error.c
--- a/drivers/scsi/scsi_error.c	2006-02-06 06:01:12.000000000 -0500
+++ b/drivers/scsi/scsi_error.c	2006-02-06 09:15:24.000000000 -0500
@@ -153,6 +153,25 @@ int scsi_delete_timer(struct scsi_cmnd *
   * scsi_times_out - Timeout function for normal scsi commands.
   * @scmd:	Cmd that is timing out.
   *
+ * Allows transport or LLDD to intercept command timeout for additional
+ * processing. The handler returns a status for the timeout processing:
+ *  EH_HANDLED:
+ *    The transport/LLDD has completed the timeout processing. No further
+ *    error handling is necessary. Proceed with scsi command completion
+ *    processing.
+ *  EH_RESET_TIMER:
+ *    The LLDD is requesting further delay of the timeout handler
+ *    The midlayer reschedules the timeout, but it counts against the
+ *    retry limit for the i/o.
+ *  EH_TRANSPORT_BLOCKED;
+ *    The transport is reporting that the device is in a "blocked"
+ *    state wherein it cannot be contacted (typically due to a temporary
+ *    loss of connectivity). The transport ensures that this is not an
+ *    infinite condition. The timeout is rescheduled until the transport
+ *    state changes.
+ *  EH_NOT_HANDLED:
+ *    No processing. The midlayer performs its normal timeout functions.
+ *
   * Notes:
   *     We do not need to lock this.  There is the potential for a race
   *     only in that the normal completion handling might run, but if the
@@ -173,6 +192,8 @@ void scsi_times_out(struct scsi_cmnd *sc
  			 * with allowed == 0 */
  			if (scmd->retries++ > scmd->allowed)
  				break;
+			/* FALL THRU */
+		case EH_TRANSPORT_BLOCKED:
  			scsi_add_timer(scmd, scmd->timeout_per_command,
  				       scsi_times_out);
  			return;
diff -upNr a/drivers/scsi/scsi_transport_fc.c b/drivers/scsi/scsi_transport_fc.c
--- a/drivers/scsi/scsi_transport_fc.c	2006-02-06 06:01:12.000000000 -0500
+++ b/drivers/scsi/scsi_transport_fc.c	2006-02-06 10:45:53.000000000 -0500
@@ -31,6 +31,7 @@
  #include <scsi/scsi_host.h>
  #include <scsi/scsi_transport.h>
  #include <scsi/scsi_transport_fc.h>
+#include <scsi/scsi_cmnd.h>
  #include "scsi_priv.h"

  /*
@@ -1879,6 +1880,45 @@ restart_search:
  }


+/**
+ * fc_sdev_io_timeout - FC Transport I/O timeout intercept handler
+ *
+ * @scmd:	The SCSI command which timed out
+ *
+ * This routine can be used by a FC LLDD as its eh_timed_out function in
+ * the scsi host template.
+ *
+ * This routine protects against error handlers getting invoked while a
+ * rport is in a blocked state, typically due to a temporarily loss of
+ * connectivity. If the error handlers are allowed to proceed, requests
+ * to abort i/o, reset the target, etc will likely fail as there is no way
+ * to communicate with the device to perform the requested function. These
+ * failures may result in the midlayer taking the device offline, requiring
+ * manual intervention to restore operation.
+ *
+ * This routine, called whenever an i/o times out, validates the state of
+ * the underlying rport. If the rport is blocked, it returns
+ * EH_TRANSPORT_BLOCKED, which will continue to reschedule the timeout.
+ * Eventually, either the device will return, or devloss_tmo will fire,
+ * and when the timeout then fires, it will be handled normally.
+ * If the rport is not blocked, normal error handling continues.
+ *
+ * Notes:
+ *	This routine assumes no locks are held on entry.
+ **/
+enum scsi_eh_timer_return
+fc_sdev_io_timeout(struct scsi_cmnd *scmd)
+{
+	struct fc_rport *rport = starget_to_rport(scsi_target(scmd->device));
+
+	if (rport->port_state == FC_PORTSTATE_BLOCKED)
+		return EH_TRANSPORT_BLOCKED;
+
+	return EH_NOT_HANDLED;
+}
+EXPORT_SYMBOL(fc_sdev_io_timeout);
+
+
  MODULE_AUTHOR("Martin Hicks");
  MODULE_DESCRIPTION("FC Transport Attributes");
  MODULE_LICENSE("GPL");
diff -upNr a/include/scsi/scsi_host.h b/include/scsi/scsi_host.h
--- a/include/scsi/scsi_host.h	2006-02-06 06:01:41.000000000 -0500
+++ b/include/scsi/scsi_host.h	2006-02-06 09:09:20.000000000 -0500
@@ -38,6 +38,7 @@ enum scsi_eh_timer_return {
  	EH_NOT_HANDLED,
  	EH_HANDLED,
  	EH_RESET_TIMER,
+	EH_TRANSPORT_BLOCKED,
  };


diff -upNr a/include/scsi/scsi_transport_fc.h b/include/scsi/scsi_transport_fc.h
--- a/include/scsi/scsi_transport_fc.h	2006-02-06 06:01:41.000000000 -0500
+++ b/include/scsi/scsi_transport_fc.h	2006-02-06 09:10:31.000000000 -0500
@@ -483,6 +483,7 @@ struct fc_rport *fc_remote_port_add(stru
  void fc_remote_port_delete(struct fc_rport  *rport);
  void fc_remote_port_rolechg(struct fc_rport  *rport, u32 roles);
  int scsi_is_fc_rport(const struct device *);
+enum scsi_eh_timer_return fc_sdev_io_timeout(struct scsi_cmnd *scmd);

  static inline u64 wwn_to_u64(u8 *wwn)
  {
diff -upNr a/drivers/scsi/lpfc/lpfc_scsi.c b/drivers/scsi/lpfc/lpfc_scsi.c
--- a/drivers/scsi/lpfc/lpfc_scsi.c	2006-02-06 06:01:10.000000000 -0500
+++ b/drivers/scsi/lpfc/lpfc_scsi.c	2006-02-06 09:12:31.000000000 -0500
@@ -1288,6 +1288,7 @@ struct scsi_host_template lpfc_template
  	.name			= LPFC_DRIVER_NAME,
  	.info			= lpfc_info,
  	.queuecommand		= lpfc_queuecommand,
+	.eh_timed_out		= fc_sdev_io_timeout,
  	.eh_abort_handler	= lpfc_abort_handler,
  	.eh_device_reset_handler= lpfc_reset_lun_handler,
  	.eh_bus_reset_handler	= lpfc_reset_bus_handler,


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] FC transport & lpfc : Avoid device offline cases by stalling aborts until device unblocked
  2006-02-06 16:29 [PATCH] FC transport & lpfc : Avoid device offline cases by stalling aborts until device unblocked James Smart
@ 2006-02-06 16:33 ` Christoph Hellwig
  2006-02-06 19:11 ` Luben Tuikov
  1 sibling, 0 replies; 4+ messages in thread
From: Christoph Hellwig @ 2006-02-06 16:33 UTC (permalink / raw)
  To: James Smart; +Cc: linux-scsi

On Mon, Feb 06, 2006 at 11:29:36AM -0500, James Smart wrote:
>   Also, as Christoph just recently removed the eh_timed_out use in the
>   megaraid driver - he eliminated the only other (in-kernel) user of this
>   interface. Perhaps we should consider changing the design and realign
>   the EH_RESET_TIMER functionality to what EH_TRANSPORT_BLOCKED does.
>   Thoughts ?

I'd rather kill it completely and make eh_times out a transport callout,
instead of the host template.  Right now every FC driver would need an
update to call your routine, and this should really be handled transparently
by the transport class.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] FC transport & lpfc : Avoid device offline cases by stalling aborts until device unblocked
  2006-02-06 16:29 [PATCH] FC transport & lpfc : Avoid device offline cases by stalling aborts until device unblocked James Smart
  2006-02-06 16:33 ` Christoph Hellwig
@ 2006-02-06 19:11 ` Luben Tuikov
  2006-02-07 14:41   ` James Smart
  1 sibling, 1 reply; 4+ messages in thread
From: Luben Tuikov @ 2006-02-06 19:11 UTC (permalink / raw)
  To: James.Smart, linux-scsi

--- James Smart <James.Smart@Emulex.Com> wrote:
> The attached patch creates a FC transport service routine, which can
> be specified in the LLDD's scsi_host_template. This allows the transport
> to intercept io timeouts, check the state of rport, and if blocked,
> request the midlayer to reschedule the timeout. This patch also patches
> the lpfc driver to make use of this interface.

Yes, this is the intended purpose of this interface.

> What we are trying to avoid is to have the abort handler kick in while the
> rport is blocked. The rport is blocked because we have lost connectivity.
> As we have no connectivity, the LLDD either fails it (most common) or busy
> stalls (yech!) until rport state is resolved.  If the abort is failed, the
> error handler escalates, and eventually attempts to send a TUR to the
> device, which fails as we have no connectivity. As the TUR fails, the device
> gets taken offline, requiring manual interaction to restore use.

Is it possible that you add dev_loss_tmo to the scsi_dev timeout?
Say scsi_dev->timeout = dev_loss_tmo + X seconds?

This would mean that SCSI Core's command timer would fire X seconds _after_
the transport event, and presumably you've dealt with this issue
X seconds before (when the transport event fired).

> The original suggestion was from James Bottomley:
> http://marc.theaimsgroup.com/?l=linux-scsi&m=113682123403609&w=2
> based on the thread started by Michael Reed:
> http://marc.theaimsgroup.com/?l=linux-scsi&m=113477788309880&w=2
> 
> Please note:
>    This does not resolve the cases where the error handlers are engaged when
>    the rport is blocked.

Indeed, it doesn't.

>    The need of the transport conflicted with the initial eh_timed_out design,
>    which limited the rescheduling of the timeout to the number of retries
>    allowed on the command plus 1.  In the case of the FC transport, we already

Initially, returning EH_RESET_TIMER, did just what you want: reset the timer,
no questions asked.

>    have the dev_loss_tmo running to guarantee that this will not be an
>    infinite stall, so penalizing the i/o (especially be incrementing the retry
>    count) should be avoided.

So you can prove that it is bounded in time.  But this is very good!

So if you defined eh_strategy_handler() (in conjunction with setting the device
timeout to X seconds + dev_loss_tmo) you'll end up in your own eh_strategy_handler()
IFF the rport timeout fired and nothing could be done, then the SCSI Command
timeout fired (X seconds after) and nothing could be done, and you ended up
in your eh_strategy_handler().

Note: Defining eh_timed_out() doesn't imply that you should define
eh_strategy_handler(), but the opposite is not true.

>    Also, as Christoph just recently removed the eh_timed_out use in the
>    megaraid driver - he eliminated the only other (in-kernel) user of this
>    interface. Perhaps we should consider changing the design and realign
>    the EH_RESET_TIMER functionality to what EH_TRANSPORT_BLOCKED does.
>    Thoughts ?

Well, it only makes sense to return to the original meaning of EH_RESET_HANDLER,
i.e. that SCSI Core should just reset the timer and no questions asked.

Since, of course, EH_TRANSPORT_BLOCKED,
  1. Is equivalent to EH_RESET_TIMER, but circumvents SCSI Core's command retry
     attempts (as the diff patch shows below).
  2. Has no meaning to a timer timeout handler, i.e. what you really want is
     EH_RESET_TIMER.

So, since you need it, I'd say removing command retry counting in SCSI Core
for EH_RESET_TIMER can be removed to suit your needs.

> + *  EH_TRANSPORT_BLOCKED;
> + *    The transport is reporting that the device is in a "blocked"
> + *    state wherein it cannot be contacted (typically due to a temporary
> + *    loss of connectivity). The transport ensures that this is not an
> + *    infinite condition. The timeout is rescheduled until the transport
> + *    state changes.

Indeed, and as shown in your patch below this is just EH_RESET_TIMER sans
the retry counting.

> @@ -173,6 +192,8 @@ void scsi_times_out(struct scsi_cmnd *sc
>   			 * with allowed == 0 */
>   			if (scmd->retries++ > scmd->allowed)
>   				break;
> +			/* FALL THRU */
> +		case EH_TRANSPORT_BLOCKED:
>   			scsi_add_timer(scmd, scmd->timeout_per_command,
>   				       scsi_times_out);
>   			return;

Submit a patch which removes the retries counting from EH_RESET_TIMER
and let's see what happens.

    Luben


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] FC transport & lpfc : Avoid device offline cases by stalling aborts until device unblocked
  2006-02-06 19:11 ` Luben Tuikov
@ 2006-02-07 14:41   ` James Smart
  0 siblings, 0 replies; 4+ messages in thread
From: James Smart @ 2006-02-07 14:41 UTC (permalink / raw)
  To: ltuikov; +Cc: linux-scsi

> Is it possible that you add dev_loss_tmo to the scsi_dev timeout?
> Say scsi_dev->timeout = dev_loss_tmo + X seconds?

No. This would imply setting it in slave_configure, and which means commands
artificially have a longer timeout at all times, not just when we're in a
loss of connectivity.

I had considered having eh_timed_out return a time adder, so we timeout once
rather than periodically based on the io timeout value. However, the
calculations to come up with how much time remains on the dev_loss_tmo value,
coupled with adding a return value to this function, didn't seem worth it.

-- james s

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2006-02-07 14:41 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-02-06 16:29 [PATCH] FC transport & lpfc : Avoid device offline cases by stalling aborts until device unblocked James Smart
2006-02-06 16:33 ` Christoph Hellwig
2006-02-06 19:11 ` Luben Tuikov
2006-02-07 14:41   ` James Smart

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).