public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Terry Barnaby <terry@beam.ltd.uk>
To: mmadore@aslab.com, gibbs@scsiguy.com, linux-kernel@vger.kernel.org
Subject: Re: Reproducible SCSI Error with Adaptec 7902
Date: Fri, 14 Mar 2003 10:59:53 +0000	[thread overview]
Message-ID: <3E71B629.60204@beam.ltd.uk> (raw)

Hi,

We may be experiencing the same problem.
In our case it results in the SEAGATE ST336607LW drive locking up solid 
with no hardware reset possible.

Our problem is that our 320MB/s SEAGATE ST336607LW drive will lockup 
after about 10mins to 2hours of serious activity (Copying disk partitions).
.
The primary error message we see is:
"Saw underflow (16384 of 20480 bytes). Treated as error"
followed by various SCSI error messages. The SCSI disks LED
remains on and it is impossible to access the SCSI disk. The system
will then hang. Reseting the system does not clear the SCSI disk LED and
the SCSI disk is not seen in the Adaptec BIOS on startup. A power off/on
cycle will clear the condition.

We have been trying to track down the problem for about two weeks now 
and we are still unsure where the problem lies: Disk, SCSI cable, SCSI 
controller or Linux driver.

Some info we do have though is:
1. Setting the SCSI bus speed from 320MB/s to 160MB/s does not affect 
the problem.
2. Switching off packetized mode fixes the problem (we think).
3. Using a non SMP kernel may fix the problem (we are testing at this 
moment).

Our system is:
System: Dual Xeon 2.4GHz system using SuperMicro X5DA8 Motherboard.
SCSI: Adaptec 7902 onboard dual channel SCSI controller
Disks: 2 off Quantum Atlas 10K2 18G (160LW), 1 of Quantum 9G (80LW)
Disks: 1 off Seagate ST336607LW 36G (320LW)
System: RedHat 7.3 with updates to 18/02/03
Kernel: 2.4.18-24.7.xsmp
Aic79xx Driver: versions 1.0.0 and 1.1.0

Our current view is that there are two problems:
1. There is a timing/SMP issue with the Linux AIC79XX SCSI driver in SMP 
systems that cause and incorect SCSI bus condition.
2. The SEAGATE ST336607LW responds to this condition by locking up and
cannot be reset. We have information from Seagate that it is possible 
for the ST336607LW to get in a condition where it cannot be reset !

We have had a lot of communications with Seagate on this so far to no
avail. We have quite a lot of information in terms of log files etc.

Is there a good contact for someone who knows about the Adaptec AIC79XX
driver that we could talk to ?

Any help would be appreciated.

Terry


> I am receiving the following messages in my system log when stress testing
> with Cerberus (http://sourceforge.net/projects/va-ctcs). This is with an
> onboard Adaptec 7902 Ultra 320 SCSI adapter. The messages are reproducible
> on two different systems. This is with the 1.1.0 aic79xx driver, on
> both the
> stock Redhat kernel, and with a kernel compiled from the 2.4.19 sources.
> The
> system does not seem to be harmed by the messages, but I would like to
> know if
> they point to a problem or not. Interestingly, if I put and Adaptec
> 29320 PCI
> card into the same machine, and use the same driver, the error is not
> reproducible.
> 
> Mike 


-- 
Dr Terry Barnaby                     BEAM Ltd
Phone: +44 1454 324512               Northavon Business Center, Dean Rd
Fax:   +44 1454 313172               Yate, Bristol, BS37 5NH, UK
Email: terry@beam.ltd.uk             Web: www.beam.ltd.uk
BEAM for: Visually Impaired X-Terminals, Parallel Processing, Software
                       "Tandems are twice the fun !"


             reply	other threads:[~2003-03-14 10:49 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2003-03-14 10:59 Terry Barnaby [this message]
2003-03-14 14:53 ` Reproducible SCSI Error with Adaptec 7902 Justin T. Gibbs
2003-03-14 15:48   ` Terry Barnaby
2003-03-14 17:34     ` Justin T. Gibbs
2003-03-18  9:50       ` Terry Barnaby
2003-03-19  2:15         ` Justin T. Gibbs
2003-03-20 10:07           ` Terry Barnaby
2003-03-14 16:18   ` Michael Madore
2003-03-14 16:17     ` Terry Barnaby
2003-03-14 17:35       ` Justin T. Gibbs
2003-03-15 13:11       ` Ingo Oeser
  -- strict thread matches above, loose matches on Subject: below --
2003-03-17 16:30 Cress, Andrew R
2003-03-18  9:37 ` Terry Barnaby
2003-01-07 16:49 Michael Madore
2003-01-07 19:32 ` Justin T. Gibbs

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3E71B629.60204@beam.ltd.uk \
    --to=terry@beam.ltd.uk \
    --cc=gibbs@scsiguy.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mmadore@aslab.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox