public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
From: Andrew Vasquez <andrew.vasquez@qlogic.com>
To: gregsurbey@hotmail.com
Cc: Andrew Morton <akpm@osdl.org>,
	linux-scsi@vger.kernel.org, Mike Anderson <andmike@us.ibm.com>
Subject: Re: Fw: [Bugme-new] [Bug 4473] New: QLogic 2100: SCSI timeouts, device resets, and crashes kernel
Date: Mon, 11 Apr 2005 15:15:08 -0700	[thread overview]
Message-ID: <20050411221508.GH4449@plap.qlogic.org> (raw)
In-Reply-To: <20050411211829.GA822@us.ibm.com>

On Mon, 11 Apr 2005, Mike Anderson wrote:

> I will add the same comment to the bug.
> 
> Did this work on a previous version of the kernel? Just checking to
> understand if your connectivity to the storage unit or the unit itself
> could be an issue.
> 
> If appears we are receiving timeouts, but on abort the qla is indicating
> that the IO has already been completed. We could have IOs that are taking
> near max timeout and then the error handler races with the completion of
> the IO.
> 
> A debug step you could try is to raise the default timeout from 30 to
> something like 60 seconds to see if this effects the error. To do this
> just echo "60" > /sys/block/sd${N}/device/timeout. Also you can run iostat
> during your testing to see what you IO times / queue depths look like.
> 
> Andrew Vasquez may be able to add more info.
> 

Greg,

The logs seem to indicate some (additional) problems with the ISP
after the device-reset completes:

	qla2100 0000:00:06.0: scsi(0:0:1:0): DEVICE RESET ISSUED.
	qla2100 0000:00:06.0: scsi(0:0:1:0): DEVICE RESET SUCCEEDED.
	qla2100 0000:00:06.0: ISP System Error - mbx1=7737h mbx2=dc5h mbx3=0h.
	qla2100 0000:00:06.0: Firmware dump saved to temp buffer (0/dcec0000).

there is a small tool availble (qla_dmp.sh):

ftp://ftp.qlogic.com/outgoing/linux/beta/8.x/test/qla_dmp.sh

which I'd like you to use if the machine is still in a somewhat usable
state and you see a message similar to the following:

	qla2100 0000:00:06.0: ISP System Error - mbx1=7737h mbx2=dc5h mbx3=0h.
	qla2100 0000:00:06.0: Firmware dump saved to temp buffer (0/dcec0000).

Execute the following command:

	# ./qla_dmp.sh 0

The value passed to qla_dmp.sh should be the same as the first integer
in the 'saved to temp buffer' string (in this example, 0).  If the
operation was successful, a message like to following should be logged
in the messages file.

        Firmware dumped to file fw_dump_20041217_023222.txt

compress the file (in this example):

        # bzip2 fw_dump_20041217_023222.txt

and forward over the compressed file in addition to the
var/log/messages file.

--
av

      reply	other threads:[~2005-04-11 22:15 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-04-11 20:51 Fw: [Bugme-new] [Bug 4473] New: QLogic 2100: SCSI timeouts, device resets, and crashes kernel Andrew Morton
2005-04-11 21:18 ` Mike Anderson
2005-04-11 22:15   ` Andrew Vasquez [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20050411221508.GH4449@plap.qlogic.org \
    --to=andrew.vasquez@qlogic.com \
    --cc=akpm@osdl.org \
    --cc=andmike@us.ibm.com \
    --cc=gregsurbey@hotmail.com \
    --cc=linux-scsi@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox