From: Hannes Reinecke <hare@suse.de>
To: wenxiong@linux.vnet.ibm.com
Cc: James.Bottomley@HansenPartnership.com,
linux-scsi@vger.kernel.org, brking@linux.vnet.ibm.com
Subject: Re: [PATCH V2 1/1] scsi: Handle MLQUEUE busy response in scsi_send_eh_cmnd
Date: Tue, 23 Apr 2013 16:52:57 +0200 [thread overview]
Message-ID: <5176A049.3090407@suse.de> (raw)
In-Reply-To: <20130416203310.517593092@linux.vnet.ibm.com>
On 04/16/2013 10:26 PM, wenxiong@linux.vnet.ibm.com wrote:
> We discussed James's concern. We intergated James's patch and generated
> this updated patch.
>
> Fix scsi_send_eh_cmnd to check the return code of queuecommand when
> sending commands and retry for a bit if the LLDD returns a busy response.
> This fixes an issue seen with the ipr driver where an ipr initiated reset
> immediately following an eh_host_reset caused EH initiated commands to fail,
> resulting in devices being taken offline. This patch resolves the issue.
>
>
> Signed-off-by: Wen Xiong <wenxiong@linux.vnet.ibm.com>
> Signed-off-by: Brian King <brking@linux.vnet.ibm.com>
> ---
> drivers/scsi/scsi_error.c | 34 +++++++++++++++++++++++++---------
> 1 file changed, 25 insertions(+), 9 deletions(-)
>
> Index: b/drivers/scsi/scsi_error.c
> ===================================================================
> --- a/drivers/scsi/scsi_error.c 2013-04-16 12:56:16.617857960 -0500
> +++ b/drivers/scsi/scsi_error.c 2013-04-16 12:57:00.279108838 -0500
> @@ -791,18 +791,21 @@ static int scsi_send_eh_cmnd(struct scsi
> struct scsi_device *sdev = scmd->device;
> struct Scsi_Host *shost = sdev->host;
> DECLARE_COMPLETION_ONSTACK(done);
> - unsigned long timeleft;
> + unsigned long timeleft = timeout;
> struct scsi_eh_save ses;
> + const int stall_for = min(HZ/10, 1);
> int rtn;
>
> scsi_eh_prep_cmnd(scmd, &ses, cmnd, cmnd_size, sense_bytes);
> +retry:
> shost->eh_action = &done;
>
> scsi_log_send(scmd);
> scmd->scsi_done = scsi_eh_done;
> - shost->hostt->queuecommand(shost, scmd);
> + rtn = shost->hostt->queuecommand(shost, scmd);
>
> - timeleft = wait_for_completion_timeout(&done, timeout);
> + if (!rtn)
> + timeleft = wait_for_completion_timeout(&done, timeout);
>
> shost->eh_action = NULL;
>
Hmm. This seems to be a generic bug fix here; it is perfectly
ok for queuecommand() to return a non-zero value without calling
->scsi_done. At which point ->complete is pointless to try as no
completion ever would be invoked.
Mind separating that out as a separate patch?
> @@ -819,10 +822,19 @@ static int scsi_send_eh_cmnd(struct scsi
> * about this command.
> */
> if (timeleft) {
> - rtn = scsi_eh_completed_normally(scmd);
> - SCSI_LOG_ERROR_RECOVERY(3,
> - printk("%s: scsi_eh_completed_normally %x\n",
> - __func__, rtn));
> + switch (rtn) {
> + case 0:
> + rtn = scsi_eh_completed_normally(scmd);
> + SCSI_LOG_ERROR_RECOVERY(3,
> + printk("%s: scsi_eh_completed_normally %x\n",
> + __func__, rtn));
> + break;
> + case FAILED:
> + break;
> + default:
> + rtn = ADD_TO_MLQUEUE;
> + break;
> + }
>
> switch (rtn) {
> case SUCCESS:
Bzzt.
'FAILED' is a valid response for scsi_eh_completed_normally, but not
for ->queuecommand.
> @@ -831,8 +843,12 @@ static int scsi_send_eh_cmnd(struct scsi
> case TARGET_ERROR:
> break;
> case ADD_TO_MLQUEUE:
> - rtn = NEEDS_RETRY;
> - break;
> + if (timeleft > stall_for) {
> + timeout = timeleft - stall_for;
> + msleep(stall_for);
> + goto retry;
> + }
> + /* fall through */
> default:
> rtn = FAILED;
> break;
>
We're already calling 'wait_for_completion_timeout'.
So normally the 'msleep' wouldn't be necessary.
It's only required for the case when ->queuecommand
returns non-zero.
So I'd rather see to have the msleep into section
where the return command from queuecommand is evaluated;
here we'll actually decrease responsiveness as
we're always waiting for X seconds, even if the command
would've been completed during that time.
Cheers,
Hannes
--
Dr. Hannes Reinecke zSeries & Storage
hare@suse.de +49 911 74053 688
SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg
GF: J. Hawn, J. Guild, F. Imendörffer, HRB 16746 (AG Nürnberg)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
prev parent reply other threads:[~2013-04-23 14:52 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-04-16 20:26 [PATCH V2 0/1] scsi: Handle MLQUEUE busy response in scsi_send_eh_cmnd wenxiong
2013-04-16 20:26 ` [PATCH V2 1/1] " wenxiong
2013-04-23 14:52 ` Hannes Reinecke [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5176A049.3090407@suse.de \
--to=hare@suse.de \
--cc=James.Bottomley@HansenPartnership.com \
--cc=brking@linux.vnet.ibm.com \
--cc=linux-scsi@vger.kernel.org \
--cc=wenxiong@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.