From: Vu Pham <vuhuong-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
To: Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org>
Cc: Roland Dreier <roland-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>,
David Dillow <dillowda-1Heg1YXhbW8@public.gmane.org>,
Sebastian Riemer
<sebastian.riemer-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>,
linux-rdma <linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
linux-scsi <linux-scsi-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
James Bottomley
<jbottomley-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
Subject: Re: [PATCH 07/14] scsi_transport_srp: Add transport layer error handling
Date: Thu, 13 Jun 2013 12:43:41 -0700 [thread overview]
Message-ID: <51BA20ED.6040200@mellanox.com> (raw)
In-Reply-To: <51B8777B.5050201-HInyCGIudOg@public.gmane.org>
Hello Bart,
>
> +What: /sys/class/srp_remote_ports/port-<h>:<n>/dev_loss_tmo
> +Date: September 1, 2013
> +KernelVersion: 3.11
> +Contact: linux-scsi-u79uwXL29TY76Z2rM5mHXA@public.gmane.org, linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> +Description: Number of seconds the SCSI layer will wait after a transport
> + layer error has been observed before removing a target port.
> + Zero means immediate removal.
>
A negative value will disable the target port removal.
> +
> +What: /sys/class/srp_remote_ports/port-<h>:<n>/fast_io_fail_tmo
> +Date: September 1, 2013
> +KernelVersion: 3.11
> +Contact: linux-scsi-u79uwXL29TY76Z2rM5mHXA@public.gmane.org, linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> +Description: Number of seconds the SCSI layer will wait after a transport
> + layer error has been observed before failing I/O. Zero means
> + immediate removal. A negative value will disable this
> + behavior.
>
>
<snip>
> +
> +/**
> + * srp_tmo_valid() - check timeout combination validity
> + *
> + * If no fast I/O fail timeout has been configured then the device loss timeout
> + * must be below SCSI_DEVICE_BLOCK_MAX_TIMEOUT. If a fast I/O fail timeout has
> + * been configured then it must be below the device loss timeout.
> + */
> +int srp_tmo_valid(int fast_io_fail_tmo, int dev_loss_tmo)
> +{
> + return (fast_io_fail_tmo < 0 && 1 <= dev_loss_tmo &&
> + dev_loss_tmo <= SCSI_DEVICE_BLOCK_MAX_TIMEOUT)
> + || (0 <= fast_io_fail_tmo &&
> + (dev_loss_tmo < 0 ||
> + (fast_io_fail_tmo < dev_loss_tmo &&
> + dev_loss_tmo < LONG_MAX / HZ))) ? 0 : -EINVAL;
> +}
> +EXPORT_SYMBOL_GPL(srp_tmo_valid);
>
fast_io_fail_tmo is off, one cannot turn off dev_loss_tmo with negative
value
dev_loss_tmo is off, one cannot turn off fast_io_fail_tmo with negative
value
<snip>
>
> + * srp_reconnect_rport - reconnect by invoking srp_function_template.reconnect()
> + *
> + * Blocks SCSI command queueing before invoking reconnect() such that the
> + * scsi_host_template.queuecommand() won't be invoked concurrently with
> + * reconnect(). This is important since a reconnect() implementation may
> + * reallocate resources needed by queuecommand(). Please note that this
> + * function neither waits until outstanding requests have finished nor tries
> + * to abort these. It is the responsibility of the reconnect() function to
> + * finish outstanding commands before reconnecting to the target port.
> + */
> +int srp_reconnect_rport(struct srp_rport *rport)
> +{
> + struct Scsi_Host *shost = rport_to_shost(rport);
> + struct srp_internal *i = to_srp_internal(shost->transportt);
> + struct scsi_device *sdev;
> + int res;
> +
> + pr_debug("SCSI host %s\n", dev_name(&shost->shost_gendev));
> +
> + res = mutex_lock_interruptible(&rport->mutex);
> + if (res) {
> + pr_debug("%s: mutex_lock_interruptible() returned %d\n",
> + dev_name(&shost->shost_gendev), res);
> + goto out;
> + }
> +
> + spin_lock_irq(shost->host_lock);
> + scsi_block_requests(shost);
> + spin_unlock_irq(shost->host_lock);
> +
>
In scsi_block_requests() definition, no locks are assumed held.
If rport's state is already SRP_RPORT_BLOCKED, I don't think we need to
do extra block with scsi_block_requests()
Shouldn't we avoid using both scsi_block/unblock_requests() and
scsi_target_block/unblock()?
ie. in srp_start_tl_fail_timers() call scsi_block_requests(), in
__rport_fast_io_fail_timedout() and rport_dev_loss_timedout() call
scsi_unblock_requests()
and using scsi_block/unblock_requests in this fuction.
> + while (scsi_request_fn_active(shost))
> + msleep(20);
> +
> + res = i->f->reconnect(rport);
> + scsi_unblock_requests(shost);
> + pr_debug("%s (state %d): transport.reconnect() returned %d\n",
> + dev_name(&shost->shost_gendev), rport->state, res);
> + if (res == 0) {
> + cancel_delayed_work(&rport->fast_io_fail_work);
> + cancel_delayed_work(&rport->dev_loss_work);
> + rport->failed_reconnects = 0;
> + scsi_target_unblock(&shost->shost_gendev, SDEV_RUNNING);
> + srp_rport_set_state(rport, SRP_RPORT_RUNNING);
> + /*
> + * It can occur that after fast_io_fail_tmo expired and before
> + * dev_loss_tmo expired that the SCSI error handler has
> + * offlined one or more devices. scsi_target_unblock() doesn't
> + * change the state of these devices into running, so do that
> + * explicitly.
> + */
> + spin_lock_irq(shost->host_lock);
> + __shost_for_each_device(sdev, shost)
> + if (sdev->sdev_state == SDEV_OFFLINE)
> + sdev->sdev_state = SDEV_RUNNING;
> + spin_unlock_irq(shost->host_lock);
> + } else if (rport->state == SRP_RPORT_RUNNING) {
> + srp_rport_set_state(rport, SRP_RPORT_FAIL_FAST);
> + scsi_target_unblock(&shost->shost_gendev,
> + SDEV_TRANSPORT_OFFLINE);
>
Would this unblock override the fast_io_fail_tmo functionality?
> + }
> + mutex_unlock(&rport->mutex);
> +
> +out:
> + return res;
> +}
> +EXPORT_SYMBOL(srp_reconnect_rport);
> +
> +static void srp_reconnect_work(struct work_struct *work)
> +{
> + struct srp_rport *rport = container_of(to_delayed_work(work),
> + struct srp_rport, reconnect_work);
> + struct Scsi_Host *shost = rport_to_shost(rport);
> + int delay, res;
> +
> + res = srp_reconnect_rport(rport);
> + if (res != 0) {
> + shost_printk(KERN_ERR, shost,
> + "reconnect attempt %d failed (%d)\n",
> + ++rport->failed_reconnects, res);
> + delay = rport->reconnect_delay *
> + min(100, max(1, rport->failed_reconnects - 10));
> + if (delay > 0)
> + queue_delayed_work(system_long_wq,
> + &rport->reconnect_work, delay * HZ);
> + }
> +}
> +
> +static void __rport_fast_io_fail_timedout(struct srp_rport *rport)
> +{
> + struct Scsi_Host *shost = rport_to_shost(rport);
> + struct srp_internal *i;
> +
> + lockdep_assert_held(&rport->mutex);
> +
> + if (srp_rport_set_state(rport, SRP_RPORT_FAIL_FAST))
> + return;
> + scsi_target_unblock(rport->dev.parent, SDEV_TRANSPORT_OFFLINE);
> +
> + /* Involve the LLDD if possible to terminate all io on the rport. */
> + i = to_srp_internal(shost->transportt);
> + if (i->f->terminate_rport_io)
> + i->f->terminate_rport_io(rport);
> +}
> +
> +/**
> + * rport_fast_io_fail_timedout() - fast I/O failure timeout handler
> + *
> + * Unblocks the SCSI host.
> + */
> +static void rport_fast_io_fail_timedout(struct work_struct *work)
> +{
> + struct srp_rport *rport = container_of(to_delayed_work(work),
> + struct srp_rport, fast_io_fail_work);
> + struct Scsi_Host *shost = rport_to_shost(rport);
> +
> + pr_debug("fast_io_fail_tmo expired for %s.\n",
> + dev_name(&shost->shost_gendev));
> +
> + mutex_lock(&rport->mutex);
> + __rport_fast_io_fail_timedout(rport);
> + mutex_unlock(&rport->mutex);
> +}
> +
> +/**
> + * rport_dev_loss_timedout() - device loss timeout handler
> + *
> + * Note: rport->ft->rport_delete must either unblock the SCSI host or schedule
> + * SCSI host removal.
> + */
> +static void rport_dev_loss_timedout(struct work_struct *work)
> +{
> + struct srp_rport *rport = container_of(to_delayed_work(work),
> + struct srp_rport, dev_loss_work);
> + struct Scsi_Host *shost = rport_to_shost(rport);
> + struct srp_internal *i = to_srp_internal(shost->transportt);
> +
> + pr_err("dev_loss_tmo expired for %s.\n",
> + dev_name(&shost->shost_gendev));
> +
> + mutex_lock(&rport->mutex);
> + WARN_ON(srp_rport_set_state(rport, SRP_RPORT_LOST) != 0);
> + scsi_target_unblock(rport->dev.parent, SDEV_TRANSPORT_OFFLINE);
> + mutex_unlock(&rport->mutex);
> +
> + i->f->rport_delete(rport);
> +}
> +
> +/**
> + * srp_start_tl_fail_timers() - start the transport layer failure timers
> + *
> + * Start the transport layer fast I/O failure and device loss timers. Do not
> + * modify a timer that was already started.
> + */
> +void srp_start_tl_fail_timers(struct srp_rport *rport)
> +{
> + struct Scsi_Host *shost = rport_to_shost(rport);
> + int delay;
> +
> + mutex_lock(&rport->mutex);
> + pr_debug("%s current state: %d\n", dev_name(&shost->shost_gendev),
> + rport->state);
> + if (srp_rport_set_state(rport, SRP_RPORT_BLOCKED) != 0)
> + goto out_unlock;
> + pr_debug("%s new state: %d\n", dev_name(&shost->shost_gendev),
> + rport->state);
> + scsi_target_block(&shost->shost_gendev);
> +
> + if (rport->fast_io_fail_tmo >= 0)
> + queue_delayed_work(system_long_wq, &rport->fast_io_fail_work,
> + 1UL * rport->fast_io_fail_tmo * HZ);
> + if (rport->dev_loss_tmo >= 0)
> + queue_delayed_work(system_long_wq, &rport->dev_loss_work,
> + 1UL * rport->dev_loss_tmo * HZ);
> + delay = rport->reconnect_delay;
> + if (delay > 0)
> + queue_delayed_work(system_long_wq, &rport->reconnect_work,
> + 1UL * delay * HZ);
> +out_unlock:
> + mutex_unlock(&rport->mutex);
> +}
> +EXPORT_SYMBOL(srp_start_tl_fail_timers);
> +
> +/**
> + * srp_timed_out - SRP transport intercept of the SCSI timeout EH
> + *
> + * If a timeout occurs while an rport is in the blocked state, ask the SCSI
> + * EH to continue waiting (BLK_EH_RESET_TIMER). Otherwise let the SCSI core
> + * handle the timeout (BLK_EH_NOT_HANDLED).
> + *
> + * Note: This function is called from soft-IRQ context and with the request
> + * queue lock held.
> + */
> +static enum blk_eh_timer_return srp_timed_out(struct scsi_cmnd *scmd)
> +{
> + struct scsi_device *sdev = scmd->device;
> +
> + pr_debug("timeout for sdev %s\n", dev_name(&sdev->sdev_gendev));
> + return scsi_device_blocked(sdev) ? BLK_EH_RESET_TIMER :
> + BLK_EH_NOT_HANDLED;
>
if one turn off fast_io_fail_tmo, rport_fast_io_fail_timedout() won't be
called, and reconnect is failed, scsi_device_blocked remains true, error
recovery cannot happen
I think it should be
return (scsi_device_blocked(sdev) && rport->fast_io_fail_tmo > 0) ?
BLK_EH_RESET_TIMER : BLK_EH_NOT_HANDLED;
-vu
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2013-06-13 19:43 UTC|newest]
Thread overview: 69+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-06-12 13:17 [PATCH 0/14] IB SRP initiator patches for kernel 3.11 Bart Van Assche
[not found] ` <51B87501.4070005-HInyCGIudOg@public.gmane.org>
2013-06-12 13:20 ` [PATCH 01/14] IB/srp: Fix remove_one crash due to resource exhaustion Bart Van Assche
[not found] ` <51B875A4.7040903-HInyCGIudOg@public.gmane.org>
2013-06-12 13:38 ` Bart Van Assche
[not found] ` <51B879CF.1080802-HInyCGIudOg@public.gmane.org>
2013-06-12 14:24 ` Sebastian Riemer
2013-06-27 21:01 ` David Dillow
[not found] ` <1372366870.32164.30.camel-zHLflQxYYDO4Hhoo1DtQwJ9G+ZOsUmrO@public.gmane.org>
2013-06-27 23:45 ` Roland Dreier
[not found] ` <CAL1RGDWVgAKSL-GNZCkP1FEt9r_y5QWp+74NzDcga6+tcvWpXw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-06-28 7:41 ` Sebastian Riemer
2013-06-12 13:21 ` [PATCH 02/14] IB/srp: Fix race between srp_queuecommand() and srp_claim_req() Bart Van Assche
[not found] ` <51B875EE.3030702-HInyCGIudOg@public.gmane.org>
2013-06-12 14:58 ` Sebastian Riemer
[not found] ` <51B88C7C.4030209-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>
2013-06-12 15:14 ` Bart Van Assche
[not found] ` <51B8903E.3000609-HInyCGIudOg@public.gmane.org>
2013-06-27 21:02 ` David Dillow
[not found] ` <1372366945.32164.32.camel-zHLflQxYYDO4Hhoo1DtQwJ9G+ZOsUmrO@public.gmane.org>
2013-06-28 7:36 ` Bart Van Assche
2013-06-12 13:23 ` [PATCH 03/14] IB/srp: Avoid that srp_reset_host() is skipped after a TL error Bart Van Assche
[not found] ` <51B87638.50102-HInyCGIudOg@public.gmane.org>
2013-06-13 9:30 ` Sebastian Riemer
[not found] ` <51B99120.9000503-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>
2013-06-13 9:57 ` Bart Van Assche
2013-06-27 21:03 ` David Dillow
2013-06-12 13:24 ` [PATCH 04/14] IB/srp: Skip host settle delay Bart Van Assche
[not found] ` <51B87689.8030806-HInyCGIudOg@public.gmane.org>
2013-06-13 9:53 ` Sebastian Riemer
[not found] ` <51B996A1.6080604-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>
2013-06-13 13:06 ` Or Gerlitz
2013-06-27 21:04 ` David Dillow
2013-06-12 13:25 ` [PATCH 05/14] IB/srp: Maintain a single connection per I_T nexus Bart Van Assche
[not found] ` <51B876BF.4070400-HInyCGIudOg@public.gmane.org>
2013-06-13 13:57 ` Sebastian Riemer
[not found] ` <51B9CFC3.8080008-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>
2013-06-13 15:07 ` Bart Van Assche
[not found] ` <51B9E046.3030008-HInyCGIudOg@public.gmane.org>
2013-06-13 15:35 ` Sebastian Riemer
2013-06-13 17:50 ` Vu Pham
[not found] ` <51BA0655.6090707-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2013-06-13 18:25 ` Bart Van Assche
[not found] ` <51BA0E8F.3030104-HInyCGIudOg@public.gmane.org>
2013-06-13 23:27 ` Vu Pham
[not found] ` <51BA555F.9060807-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2013-06-14 9:38 ` Sebastian Riemer
[not found] ` <51BAE482.1050304-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>
2013-06-14 17:07 ` Vu Pham
[not found] ` <51BB4DBB.4070800-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2013-06-17 9:41 ` Sebastian Riemer
2013-06-27 21:10 ` David Dillow
[not found] ` <1372367432.32164.36.camel-zHLflQxYYDO4Hhoo1DtQwJ9G+ZOsUmrO@public.gmane.org>
2013-06-28 7:40 ` Bart Van Assche
2013-06-12 13:26 ` [PATCH 06/14] IB/srp: Keep rport as long as the IB transport layer Bart Van Assche
2013-06-12 13:29 ` [PATCH 08/14] IB/srp: Add srp_terminate_io() Bart Van Assche
2013-06-12 13:30 ` [PATCH 09/14] IB/srp: Use SRP transport layer error recovery Bart Van Assche
2013-06-12 13:31 ` [PATCH 10/14] IB/srp: Start timers if a transport layer error occurs Bart Van Assche
2013-06-12 13:33 ` [PATCH 11/14] IB/srp: Fail SCSI commands silently Bart Van Assche
2013-06-12 13:35 ` [PATCH 12/14] IB/srp: Make HCA completion vector configurable Bart Van Assche
[not found] ` <51B87904.1070803-HInyCGIudOg@public.gmane.org>
2013-06-27 21:24 ` David Dillow
[not found] ` <1372368256.32164.41.camel-zHLflQxYYDO4Hhoo1DtQwJ9G+ZOsUmrO@public.gmane.org>
2013-06-28 8:18 ` Bart Van Assche
[not found] ` <51CD46F0.60301-HInyCGIudOg@public.gmane.org>
2013-06-28 12:04 ` David Dillow
[not found] ` <1372421041.28740.14.camel-a7a0dvSY7KqLUyTwlgNVppKKF0rrzTr+@public.gmane.org>
2013-06-28 12:29 ` Bart Van Assche
2013-06-12 13:36 ` [PATCH 13/14] IB/srp: Make transport layer retry count configurable Bart Van Assche
[not found] ` <51B8794F.6050003-HInyCGIudOg@public.gmane.org>
2013-06-27 21:22 ` David Dillow
[not found] ` <1372368138.32164.40.camel-zHLflQxYYDO4Hhoo1DtQwJ9G+ZOsUmrO@public.gmane.org>
2013-06-28 8:28 ` Bart Van Assche
[not found] ` <51CD4933.5080709-HInyCGIudOg@public.gmane.org>
2013-06-28 12:07 ` David Dillow
[not found] ` <1372421227.28740.17.camel-a7a0dvSY7KqLUyTwlgNVppKKF0rrzTr+@public.gmane.org>
2013-06-28 12:30 ` Bart Van Assche
2013-06-12 13:37 ` [PATCH 14/14] IB/srp: Bump driver version and release date Bart Van Assche
2013-06-12 13:28 ` [PATCH 07/14] scsi_transport_srp: Add transport layer error handling Bart Van Assche
[not found] ` <51B8777B.5050201-HInyCGIudOg@public.gmane.org>
2013-06-13 19:43 ` Vu Pham [this message]
2013-06-14 13:19 ` Bart Van Assche
[not found] ` <51BB1857.7040802-HInyCGIudOg@public.gmane.org>
2013-06-14 17:59 ` Vu Pham
[not found] ` <51BB5A04.3080901-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2013-06-15 9:52 ` Bart Van Assche
[not found] ` <51BC3945.9030900-HInyCGIudOg@public.gmane.org>
2013-06-17 6:18 ` Hannes Reinecke
2013-06-17 7:04 ` Bart Van Assche
2013-06-17 7:14 ` Hannes Reinecke
2013-06-17 7:29 ` Bart Van Assche
[not found] ` <51BEBAEA.4080202-HInyCGIudOg@public.gmane.org>
2013-06-17 8:10 ` Hannes Reinecke
2013-06-17 10:13 ` Sebastian Riemer
2013-06-18 16:59 ` Vu Pham
[not found] ` <51C09202.2040503-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2013-06-19 13:00 ` Bart Van Assche
2013-06-23 21:13 ` Mike Christie
[not found] ` <51C764FB.6070207-hcNo3dDEHLuVc3sceRu5cw@public.gmane.org>
2013-06-24 7:37 ` Bart Van Assche
-- strict thread matches above, loose matches on Subject: below --
2013-06-19 13:44 Jack Wang
2013-06-19 15:27 ` Bart Van Assche
2013-06-21 12:17 ` Jack Wang
2013-06-24 13:48 Jack Wang
[not found] ` <51C84E39.80806-EIkl63zCoXaH+58JC4qpiA@public.gmane.org>
2013-06-24 15:50 ` Bart Van Assche
[not found] ` <51C86AB4.1000906-HInyCGIudOg@public.gmane.org>
2013-06-24 16:05 ` Jack Wang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51BA20ED.6040200@mellanox.com \
--to=vuhuong-vpraknaxozvwk0htik3j/w@public.gmane.org \
--cc=bvanassche-HInyCGIudOg@public.gmane.org \
--cc=dillowda-1Heg1YXhbW8@public.gmane.org \
--cc=jbottomley-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org \
--cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=linux-scsi-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=roland-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org \
--cc=sebastian.riemer-EIkl63zCoXaH+58JC4qpiA@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox