public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Thomas Fjellstrom <thomas@fjellstrom.ca>
To: Linux Kernel List <linux-kernel@vger.kernel.org>
Cc: linux-scsi@vger.kernel.org
Subject: Re: mvsas errors in 2.6.36
Date: Wed, 17 Nov 2010 00:53:31 -0700	[thread overview]
Message-ID: <201011170053.31174.thomas@fjellstrom.ca> (raw)
In-Reply-To: <201010310911.45738.thomas@fjellstrom.ca>

On October 31, 2010, Thomas Fjellstrom wrote:
> On October 29, 2010, Thomas Fjellstrom wrote:
> > Good news and bad news, the current mvsas driver in 2.6.36 seems to work
> > better than older kernels with my setup (2 port sas + 5 SATA disks). But
> > I gotten the following messages so far:
> > 
[snip]
> > I did not unplug a disk, the errors seem to be spurious.
> > 
> > Otherwise though things seem to be working. At least so far. The
> > mv_abort_task part is very familiar, the older version of this driver
> > would do it right after attempting to build/activate the md raid5 array
> > that lives on this controller. Except the controller would lock up, and
> > all drives would become inaccessible.
> > 
> > I'm going to attempt to grow this array today, so long as the xfs_fsr
> > that I started doesn't cause the array to fail.
> > 
> > If I keep getting mv_abort_task errors, I'll have to back down to the
> > copy of the driver I got from Andy Yan. I've managed to patch it up to
> > compile for 2.6.36 just now, I just hope it'll work at least as well as
> > it did with 2.6.34. At the very least I didn't get these errors.
> > 
> > Some background, the disks attached to the card are (5) Seagate 7200.12
> > 1TB disks, using SAS->SATA cables. Machine is a amd64 Phenom II X4 810
> > w/4G ram running debian sid and a vanila 2.6.36 kernel. The card is a
> > AOC-SASLP-MV8, according to lspci:
> > 
> > 04:00.0 SCSI storage controller: Marvell Technology Group Ltd.
> > MV64460/64461/64462 System Controller, Revision B (rev 01)
> > 
> > according to dmesg:
> > 
[snip]
> > I just hope the raid5 reshape I'm about to do doesn't crap its pants
> > because of the errors above.
> > 
> > I'd like to help test any fixes or changes if needed. Let me know.
> > 
> > Thanks again.
> 
> After a couple days of uptime, the messages are still happening:
> 
[snip]
> No fatal errors yet.

Still no fatal errors, but the problem is still happening regularly. It causes 
a pause in disk io of a couple seconds at least. Really quite annoying.

One thing thats got me wondering, is could this be a power issue? It almost 
seems like (from the messages) that a single drive (any drive) is freaking 
out, and returning an error that probably shouldn't happen (no CHS 0?), which 
could mean the drive is underpowered and the firmware is flipping out. I'm not 
entirely sure. The system has a 750w decent quality Antec power supply. The 
total power use of the system shouldn't come over half that (phenom II x4 810 
cpu, gigabyte ma790fxtud5p mb, low profile nvidia 9400GS gpu, 8 sata hdds, 3 
fans, etc). I'm /mostly/ sure the 12v rails are spread out evenly, but I have 
yet to make absolutely sure.

But then it doesn't seem as if the root drives are ever flipping out. Theres 
two 500GB Seagate 7200.12 drives md raid1'ed on the motherboard's (SB750) sata 
II controller. They work fine, no messages regarding them at all the entire 
time. However I get frequent and repeated messages from all drives on the 
mvsas based controller.

So color me stumped.

-- 
Thomas Fjellstrom
thomas@fjellstrom.ca

  parent reply	other threads:[~2010-11-17  7:53 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-10-29 12:50 mvsas errors in 2.6.36 Thomas Fjellstrom
2010-10-31 15:11 ` Thomas Fjellstrom
2010-11-02 17:02   ` Audio Haven
2010-11-17  7:53   ` Thomas Fjellstrom [this message]
2010-11-17  8:24     ` Andre Tomt
2010-12-02  6:29       ` Thomas Fjellstrom
2010-12-02  9:48         ` Thomas Fjellstrom
2010-12-03 16:39           ` Thomas Fjellstrom
2010-12-03 20:31             ` David Milburn
2010-12-04  6:57               ` Thomas Fjellstrom
     [not found]               ` <201012041550372348573@usish.com>
2010-12-04  8:37                 ` Thomas Fjellstrom
2010-12-04 11:52                 ` Thomas Fjellstrom
2010-12-04 12:33                 ` jack_wang
2010-12-04 12:54                   ` Thomas Fjellstrom
2010-12-04 15:44                     ` Thomas Fjellstrom
2010-12-04 18:22                       ` Thomas Fjellstrom
2010-12-05  2:08                       ` jack_wang
2010-12-05 20:01                         ` Thomas Fjellstrom

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=201011170053.31174.thomas@fjellstrom.ca \
    --to=thomas@fjellstrom.ca \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-scsi@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox