From: James Bottomley <James.Bottomley@HansenPartnership.com>
To: "Darrick J. Wong" <djwong@us.ibm.com>
Cc: Keith Hopkins <vger@hopnet.net>, Jan Sembera <jsembera@suse.cz>,
linux-scsi@vger.kernel.org, Alexis Bruemmer <alexisb@us.ibm.com>,
Peter Bogdanovic <pbog@us.ibm.com>,
Gilbert Wu <Gilbert_Wu@adaptec.com>
Subject: Re: [PATCH] aic94xx: Don't free ABORT_TASK SCBs that are timed out (Was: Re: aic94xx: failing on high load)
Date: Tue, 19 Feb 2008 12:52:26 -0600 [thread overview]
Message-ID: <1203447146.3103.32.camel@localhost.localdomain> (raw)
In-Reply-To: <20080219184359.GA5414@tree.beaverton.ibm.com>
On Tue, 2008-02-19 at 10:44 -0800, Darrick J. Wong wrote:
> If we send an ABORT_TASK ascb that doesn't return within the timeout period,
> we should not free that ascb because the sequencer is still holding onto it.
> Hopefully it will fix what James Bottomley describes below:
>
> On Tue, Feb 19, 2008 at 10:22:20AM -0600, James Bottomley wrote:
>
> > Unfortunately, there's a bug in TMF timeout handling in the driver, it
> > leaves the sequencer entry pending, but frees the ascb. If the
> > sequencer ever picks this up it will get very confused, as it does a
> > while down in the trace:
> >
> > > aic94xx: BUG:sequencer:dl:no ascb?!
> > > aic94xx: BUG:sequencer:dl:no ascb?!
> >
> > That's where the sequencer adds an ascb to the done list that we've
> > already freed. From this point on confusion reigns and the error
> > handler eventually offlines the device.
> >
> > I'll see if I can come up with patches to fix this ... or at least
> > mitigate the problems it causes.
>
> Signed-off-by: Darrick J. Wong <djwong@us.ibm.com>
Actually, unfortunately, this is only a tiny part of it. The message
that triggered all of this is
> sas: sas_scsi_find_task: aborting task 0xffff81033c3d3d80
> aic94xx: tmf timed out
> aic94xx: tmf came back
That's caused by a timeout at asd_enqueue_internal() further up in the
code base.
James
next prev parent reply other threads:[~2008-02-19 18:52 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <479FB3ED.3080401@hopnet.net>
[not found] ` <20080130091403.GA14887@alaris.suse.cz>
2008-01-30 10:59 ` aic94xx: failing on high load (another data point) Keith Hopkins
2008-01-30 19:29 ` Darrick J. Wong
2008-02-14 16:11 ` Keith Hopkins
2008-02-15 15:28 ` James Bottomley
2008-02-15 16:28 ` Keith Hopkins
2008-02-18 14:26 ` Keith Hopkins
2008-02-18 16:18 ` James Bottomley
2008-02-19 16:22 ` James Bottomley
2008-02-19 18:44 ` [PATCH] aic94xx: Don't free ABORT_TASK SCBs that are timed out (Was: Re: aic94xx: failing on high load) Darrick J. Wong
2008-02-19 18:52 ` James Bottomley [this message]
2008-02-28 14:56 ` Keith Hopkins
2008-02-28 16:10 ` James Bottomley
2008-02-20 3:48 ` aic94xx: failing on high load (another data point) James Bottomley
2008-02-20 9:54 ` Keith Hopkins
2008-02-20 16:22 ` James Bottomley
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1203447146.3103.32.camel@localhost.localdomain \
--to=james.bottomley@hansenpartnership.com \
--cc=Gilbert_Wu@adaptec.com \
--cc=alexisb@us.ibm.com \
--cc=djwong@us.ibm.com \
--cc=jsembera@suse.cz \
--cc=linux-scsi@vger.kernel.org \
--cc=pbog@us.ibm.com \
--cc=vger@hopnet.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).