From: Hannes Reinecke <hare@suse.de>
To: Brian King <brking@linux.vnet.ibm.com>,
James Bottomley <James.Bottomley@HansenPartnership.com>
Cc: linux-scsi <linux-scsi@vger.kernel.org>,
Bart Van Assche <bart.vanassche@sandisk.com>
Subject: Re: [PATCH] SCSI: Increase REPORT_LUNS timeout
Date: Sun, 13 Sep 2015 10:43:28 +0200 [thread overview]
Message-ID: <55F53730.6070609@suse.de> (raw)
In-Reply-To: <55E9F53A.9090108@linux.vnet.ibm.com>
On 09/04/2015 09:47 PM, Brian King wrote:
> On 09/04/2015 11:15 AM, James Bottomley wrote:
>> On Fri, 2015-09-04 at 10:47 -0500, Brian King wrote:
>>> On 09/04/2015 10:36 AM, James Bottomley wrote:
>>>> On Wed, 2015-09-02 at 09:31 -0500, Brian King wrote:
>>>>> This patch fixes an issue seen with an IBM 2145 (SVC) where, following an error
>>>>> injection test which results in paths going offline, when they came
>>>>> back online, the path would timeout the REPORT_LUNS issued during the
>>>>> scan. This timeout situation continued until retries were expired, resulting in
>>>>> falling back to a sequential LUN scan. Then, since the target responds
>>>>> with PQ=1, PDT=0 for all possible LUNs, due to the way the sequential
>>>>> LUN scan code works, we end up adding 512 LUNs for each target, when there
>>>>> is really only a small handful of LUNs that are actually present.
>>>>>
>>>>> This patch doubles the timeout used on the REPORT_LUNS for each retry
>>>>> after a timeout is seen on a REPORT_LUNS. This patch solves the issue
>>>>> of 512 non existent LUNs showing up after this event. Running the test
>>>>> with this patch still showed that we were regularly hitting two timeouts,
>>>>> but the third, and final, REPORT_LUNS was always successful.
>>>>>
>>>>> Signed-off-by: Brian King <brking@linux.vnet.ibm.com>
>>>>> ---
>>>>>
>>>>> drivers/scsi/scsi_scan.c | 5 ++++-
>>>>> 1 file changed, 4 insertions(+), 1 deletion(-)
>>>>>
>>>>> diff -puN drivers/scsi/scsi_scan.c~scsi_report_luns_timeout_escalate drivers/scsi/scsi_scan.c
>>>>> --- linux/drivers/scsi/scsi_scan.c~scsi_report_luns_timeout_escalate 2015-09-02 08:49:07.268243497 -0500
>>>>> +++ linux-bjking1/drivers/scsi/scsi_scan.c 2015-09-02 08:49:07.272243461 -0500
>>>>> @@ -1304,6 +1304,7 @@ static int scsi_report_lun_scan(struct s
>>>>> struct scsi_device *sdev;
>>>>> struct Scsi_Host *shost = dev_to_shost(&starget->dev);
>>>>> int ret = 0;
>>>>> + int timeout = SCSI_TIMEOUT + 4 * HZ;
>>>>>
>>>>> /*
>>>>> * Only support SCSI-3 and up devices if BLIST_NOREPORTLUN is not set.
>>>>> @@ -1383,7 +1384,7 @@ retry:
>>>>>
>>>>> result = scsi_execute_req(sdev, scsi_cmd, DMA_FROM_DEVICE,
>>>>> lun_data, length, &sshdr,
>>>>> - SCSI_TIMEOUT + 4 * HZ, 3, NULL);
>>>>> + timeout, 3, NULL);
>>>>>
>>>>> SCSI_LOG_SCAN_BUS(3, sdev_printk (KERN_INFO, sdev,
>>>>> "scsi scan: REPORT LUNS"
>>>>> @@ -1392,6 +1393,8 @@ retry:
>>>>> retries, result));
>>>>> if (result == 0)
>>>>> break;
>>>>> + else if (host_byte(result) == DID_TIME_OUT)
>>>>> + timeout = timeout * 2;
>>>>> else if (scsi_sense_valid(&sshdr)) {
>>>>> if (sshdr.sense_key != UNIT_ATTENTION)
>>>>
>>>> Actually, this is a bit pointless, isn't it; why retry, why not just set
>>>> the initial timeout? ... I could understand if retrying and printing a
>>>> message gave important or useful information, but it doesn't. How long
>>>> do you actually need? ... we can just up the initial timeout to that.
>>>> Currently we have a hacked 6s which looks arbitrary. Would 15s be
>>>> better? Nothing really times out anyway, so everything else will still
>>>> reply within the original 6s giving zero impact in the everyday case.
>>>
>>> 12 seconds definitely isn't long enough, but 24 seconds seems to work, at least
>>> after we go through both a 6 and 12 second timeout. Anyone opposed to using 30 seconds?
>>> 15 seconds is likely to be right on the edge in this scenario.
>>
>> 30s is fine by me. I think the initial 2s was from the sequential
>> inquiry scan so as not to wait too long. The extra 4s was added because
>> that was too short for report luns on some devices; I suspect some
>> larger arrays take a while just to gather all the data.
>>
>> 30s is also the traditional rq_timeout, so it may be possible to re-use
>> this parameter. Currently it's set up in the ULD, so it's zero unless
>> the slave_configure requested a special value. Traditionally, it's the
>> timeout for _READ and _WRITE, not special commands, but it feels like
>> REPORT_LUNS should follow this timeout as well and it would give you a
>> configurable way of updating it in your driver. If we do it this way,
>> you'd have to set it in slave_alloc, because slave_configure is too
>> late.
>
> I think we may just need to hard code it like the patch below. Here is the current flow for
> setting this today:
>
> slave_alloc
> scsi scan: inquiry / report LUNs
> slave_configure
> sd attach
>
> Some LLDDs set a default timeout in slave_configure today, so sd.c only sets a default timeout
> if its not already set. It uses 30 seconds for disks and 75 seconds for optical devices.
> If we start setting rq_timeout earlier, then the ULD will never know when it can set it.
>
> Additionally, in this particular scenario, its not so much a case of behavior tied to the LLDD, its more tied
> to the SCSI target. If there is concern about increasing the default to 30 seconds, we could
> use a blist attribute for this.
>
> -Brian
>
> 8<
>
> This patch fixes an issue seen with an IBM 2145 (SVC) where, following an error
> injection test which results in paths going offline, when they came
> back online, the path would timeout the REPORT_LUNS issued during the
> scan. This timeout situation continued until retries were expired, resulting in
> falling back to a sequential LUN scan. Then, since the target responds
> with PQ=1, PDT=0 for all possible LUNs, due to the way the sequential
> LUN scan code works, we end up adding 512 LUNs for each target, when there
> is really only a small handful of LUNs that are actually present.
>
> This patch increases the timeout used on the REPORT_LUNS to 30 seconds.
> This patch solves the issue of 512 non existent LUNs showing up after
> this event.
>
> Signed-off-by: Brian King <brking@linux.vnet.ibm.com>
> ---
>
> drivers/scsi/scsi_scan.c | 3 ++-
> 1 file changed, 2 insertions(+), 1 deletion(-)
>
> diff -puN drivers/scsi/scsi_scan.c~scsi_report_luns_30secs drivers/scsi/scsi_scan.c
> --- linux/drivers/scsi/scsi_scan.c~scsi_report_luns_30secs 2015-09-04 14:38:47.890757391 -0500
> +++ linux-bjking1/drivers/scsi/scsi_scan.c 2015-09-04 14:39:28.891459147 -0500
> @@ -55,6 +55,7 @@
> * Default timeout
> */
> #define SCSI_TIMEOUT (2*HZ)
> +#define SCSI_REPORT_LUNS_TIMEOUT (30*HZ)
>
> /*
> * Prefix values for the SCSI id's (stored in sysfs name field)
> @@ -1383,7 +1384,7 @@ retry:
>
> result = scsi_execute_req(sdev, scsi_cmd, DMA_FROM_DEVICE,
> lun_data, length, &sshdr,
> - SCSI_TIMEOUT + 4 * HZ, 3, NULL);
> + SCSI_REPORT_LUNS_TIMEOUT, 3, NULL);
>
> SCSI_LOG_SCAN_BUS(3, sdev_printk (KERN_INFO, sdev,
> "scsi scan: REPORT LUNS"
> _
>
>
That's far better.
Reviewed-by: Hannes Reinecke <hare@suse.de>
Cheers,
Hannes
--
Dr. Hannes Reinecke zSeries & Storage
hare@suse.de +49 911 74053 688
SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg
GF: J. Hawn, J. Guild, F. Imendörffer, HRB 16746 (AG Nürnberg)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2015-09-13 8:43 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-08-20 22:11 SCSI scanning behavior Brian King
2015-08-26 13:02 ` Hannes Reinecke
2015-08-26 13:52 ` Bart Van Assche
2015-09-02 14:31 ` [PATCH] SCSI: Scale up REPORT_LUNS timeout on failure Brian King
2015-09-04 15:00 ` Bart Van Assche
2015-09-04 15:28 ` Brian King
2015-09-04 15:36 ` James Bottomley
2015-09-04 15:47 ` Brian King
2015-09-04 16:15 ` James Bottomley
2015-09-04 19:47 ` [PATCH] SCSI: Increase REPORT_LUNS timeout Brian King
2015-09-13 8:43 ` Hannes Reinecke [this message]
2015-11-03 4:03 ` Martin K. Petersen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=55F53730.6070609@suse.de \
--to=hare@suse.de \
--cc=James.Bottomley@HansenPartnership.com \
--cc=bart.vanassche@sandisk.com \
--cc=brking@linux.vnet.ibm.com \
--cc=linux-scsi@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).