From: David Jeffery <djeffery@redhat.com>
To: linux-scsi@vger.kernel.org
Subject: [PATCH RESEND] sd: disk offlined prematurely from media access timeout
Date: Tue, 24 Sep 2013 15:42:44 -0400 [thread overview]
Message-ID: <20130924194244.GA26428@fury.redhat.com> (raw)
There is an error with the medium access timeout feature of the sd driver. The
sdkp->medium_access_timed_out value is set to zero in sd_done() in the wrong
place. It is set to zero only if a command returns sense data. If an I/O
command times out, error handling succeeds, and the I/O commands complete, the
value won't be reset if nothing responds with a sense buffer. Then, another
timeout (no matter how far in the future) can increment it again, causing the
device to be errantly set offline.
The resetting of sdkp->medium_access_timed_out should occur before the check for
sense data.
Signed-off-by: David Jeffery <djeffery@redhat.com>
---
It can be reproduced using scsi_debug and using SCSI_DEBUG_OPT_MAC_TIMEOUT to
force some I/O to timeout once. This small script assumes /dev/sdb as
scsi_debug's disk, causes a timeout, completes 2MB of I/O successfully including
the timed out I/O command, then repeats. Without the patch, the device is
offlined on the second loop. All loops will successfully complete I/O
with the patch.
echo "-1" >/sys/bus/pseudo/drivers/scsi_debug/every_nth
for i in `seq 1 4`; do
echo starting loop $i
echo "128" >/sys/bus/pseudo/drivers/scsi_debug/opts
dd if=/dev/sdb of=/dev/null bs=1M iflag=direct count=1 &
sleep 5
echo "0" >/sys/bus/pseudo/drivers/scsi_debug/opts
wait
dd if=/dev/sdb of=/dev/null bs=1M iflag=direct count=1
echo ending loop $i
done
diff --git a/drivers/scsi/sd.c b/drivers/scsi/sd.c
index 86fcf2c..2779e6b 100644
--- a/drivers/scsi/sd.c
+++ b/drivers/scsi/sd.c
@@ -1669,12 +1669,12 @@ static int sd_done(struct scsi_cmnd *SCpnt)
sshdr.ascq));
}
#endif
+ sdkp->medium_access_timed_out = 0;
+
if (driver_byte(result) != DRIVER_SENSE &&
(!sense_valid || sense_deferred))
goto out;
- sdkp->medium_access_timed_out = 0;
-
switch (sshdr.sense_key) {
case HARDWARE_ERROR:
case MEDIUM_ERROR:
next reply other threads:[~2013-09-24 19:44 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-09-24 19:42 David Jeffery [this message]
2013-10-23 8:36 ` [PATCH RESEND] sd: disk offlined prematurely from media access timeout Martin K. Petersen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20130924194244.GA26428@fury.redhat.com \
--to=djeffery@redhat.com \
--cc=linux-scsi@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).