From: James Bottomley <James.Bottomley@HansenPartnership.com>
To: "Rhine, Jay (Jay)" <jrhine@alcatel-lucent.com>
Cc: linux-scsi@vger.kernel.org
Subject: RE: Devices going offline on Adaptec 29320 using driver AIC79XXafter messages "Attempting to queue an ABORT message:CDB"
Date: Tue, 25 Nov 2008 16:14:54 -0600 [thread overview]
Message-ID: <1227651294.3415.36.camel@localhost.localdomain> (raw)
In-Reply-To: <CA39D769C8FE694788CE376026984BAB0117D26F@ILEXC1U01.ndc.lucent.com>
On Tue, 2008-11-25 at 16:08 -0600, Rhine, Jay (Jay) wrote:
> > The slight problem here is that no-one has a sequencer manual which
> tells us what all this means. However, it's
> > completely normal since the driver has a dump_card_state() call in the
> abort routine.
> >
> > Why the abort was called in the first place is anyone's guess, but it
> > probably came from a command timing out. The timeout could either be
> a
> > sequencer error or simply a normal problem because you're hammering
> the device hard and it took longer to get to the
> > command to process.
> >
> > You can test this latter quite easily by doubling the command
> timeouts:
> >
> > echo 60 > /sys/class/scsi_disk/*/device/timeout
> >
> > And seeing if the trouble occurs with the same frequency. If it does,
> there's likely some sequencer issue; if the
> > frequency decreases, it's device related and you can probably throttle
> the device by reducing the queue depth to avoid
> > the situation.
> >
> > James
>
> James,
>
> That sounds like a good idea. I will try to adjust the timeout.
> However, I have to ask about the "completely normal part". I can see
> the abort message occasionaly occurring normally if the drives always
> recovered after the abort. However, is it normal that the devices will
> go offline the second time this situation occurs? I'm afraid my
> knowledge of SCSI does not go to this level of detail. If it is normal,
> and I can substancially reduce the frequency by some tweaking I can live
> with that. However, if this there is a real bug I would like to get it
> fixed.
Completely normal as in some disk arrays can take 60-120s to process
commands under heavy load ... this depends on disk array though. The
classic one to do this is the EMC symmetrix: It has such a massive
cache that it can accept I/O at cable rates while spitting it out to the
platters at less than this. It's like a sink filling up until you reach
the overflow. By the time this happens, it can take minutes to get data
from the cable across the cache to the platters causing command timeouts
unless the O/S is tuned to accept far longer timeout intervals.
If it's a bug in the sequencer, it's going to be very hard to fix
without documentation, so I'd hope for the former.
James
next prev parent reply other threads:[~2008-11-25 22:14 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-11-25 20:54 Devices going offline on Adaptec 29320 using driver AIC79XX after messages "Attempting to queue an ABORT message:CDB" Rhine, Jay (Jay)
2008-11-25 21:53 ` James Bottomley
2008-11-25 22:08 ` Devices going offline on Adaptec 29320 using driver AIC79XXafter " Rhine, Jay (Jay)
2008-11-25 22:14 ` James Bottomley [this message]
2008-11-25 23:22 ` Devices going offline on Adaptec 29320 using driverAIC79XXafter " Rhine, Jay (Jay)
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1227651294.3415.36.camel@localhost.localdomain \
--to=james.bottomley@hansenpartnership.com \
--cc=jrhine@alcatel-lucent.com \
--cc=linux-scsi@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox