From: Thomas Fjellstrom <tfjellstrom@shaw.ca>
To: linux-raid@vger.kernel.org
Cc: linux-kernel@vger.kernel.org, linux-scsi <linux-scsi@vger.kernel.org>
Subject: Re: mdraid causing mvsas to lockup?
Date: Mon, 21 Sep 2009 10:16:45 -0600
Message-ID: <200909211016.45679.tfjellstrom@shaw.ca>
In-Reply-To: <200909181702.46109.tfjellstrom@shaw.ca>

On Fri September 18 2009, Thomas Fjellstrom wrote:
> On Fri September 18 2009, Thomas Fjellstrom wrote:
> > On Thu September 17 2009, Thomas Fjellstrom wrote:
> > > On Thu September 17 2009, Kristleifur Daðason wrote:
> > > > On Thu, Sep 17, 2009 at 11:02 PM, Thomas Fjellstrom
> > > > <tfjellstrom@shaw.ca> wrote:
> > > > > On Thu September 17 2009, John Bridges wrote:
> > > > >> I'm a fan of the SuperMicro AOC-SAT2-MV8, great card.
> > > > > >> http://www.supermicro.com/products/accessories/addon/AOC-SAT2-MV8.cfm
> > > > >>
> > > > >> It's an 8 port PCI-X card, works in both PCI and PCI-X slots.
> > > > >>
> > > > >> SATA2
> > > > >>
> > > > >> Drivers for Linux are stable, built in.
> > > > >
> > > > > Have you had any experience with the AOC-SASLP-MV8? I've got one
> > > > > and have been having no end of issues with it under linux.
> > > > >
> > > > > --
> > > > > Thomas Fjellstrom
> > > > > tfjellstrom@shaw.ca
> > > > > --
> > > >
> > > > I have,
> > > >
> > > > or rather, I've tried to get an AOC-SASLP-MV8 card going. I think I
> > > > can safely say that at least Linux kernel 2.6.31 is a requirement.
> > > > The card was basically useless with everything up to 2.6.30, then I
> > > > tried 2.6.31-rc5 on a whim and it kicked in. Built-in driver support,
> > > > that is. However it wasn't stable, it dropped disks when syncing a
> > > > large array. I've been meaning to test on 2.6.31 final, and am pretty
> > > > optimistic.
> > >
> > > Yeah, the driver didn't appear till .30. I have 2.6.31-git4 installed
> > > right now, and no matter what I do, the controller starts spewing
> > > errors:
> > >
> > > [ 1455.698186] drivers/scsi/mvsas/mv_sas.c 1669:mvs_abort_task:rc= 5
> > > [ 1455.698196] drivers/scsi/mvsas/mv_sas.c 1608:mvs_query_task:rc= 5
> > > ...
> > > [ 1424.708085] end_request: I/O error, dev sdh, sector 3072
> > > [ 1424.708106] sd 0:0:3:0: [sdh] Unhandled error code
> > > [ 1424.708111] sd 0:0:3:0: [sdh] Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
> > > [ 1424.708118] sd 0:0:3:0: [sdh] CDB: Read(10): 28 00 00 00 08 00 00 04 00 00
> > >
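For reference, the Read(10) CDB printed in the log above can be decoded by
hand. A minimal sketch, assuming the standard 10-byte Read(10) layout (opcode
0x28, big-endian LBA in bytes 2-5, big-endian transfer length in bytes 7-8);
the function name decode_read10 is made up for illustration:

```python
import struct


def decode_read10(cdb_hex):
    """Return (lba, block_count) for a Read(10) CDB given as a hex string."""
    cdb = bytes.fromhex(cdb_hex)  # whitespace between byte pairs is ignored
    if len(cdb) != 10:
        raise ValueError("a Read(10) CDB is exactly 10 bytes")
    if cdb[0] != 0x28:
        raise ValueError("opcode is not Read(10) (0x28)")
    (lba,) = struct.unpack(">I", cdb[2:6])      # bytes 2-5: logical block address
    (blocks,) = struct.unpack(">H", cdb[7:9])   # bytes 7-8: transfer length
    return lba, blocks
```

Applied to the bytes in the log, this gives a read starting at LBA 2048 for
1024 blocks.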
> > > And that's with perfectly good disks, and with smartd/hddtemp disabled
> > > (they were causing one of my disks to barf).
> > >
> > > All I have to do is start a read from any disk, and after a few
> > > minutes, the card starts erroring out, and then dies.
> > >
> > > It actually seems like it got more unstable from .30 to .31.
> > >
> > > I've been trying to get some help with it on the lkml/ide/scsi lists
> > > for a while now; one person has tried to help, but that's about it.
> >
> > Very strange. I've found that reading from all 4 drives currently
> > connected to the controller at once, works. I have 4 dd commands, one
> > reading off each drive, and so far no errors, the dd commands aren't
> > locking up, and they are going full speed (120MB/s per drive).
> >
> > If, however, I attempt to bring up the md raid0 array on top of these
> > disks, the controller locks up, and all of the disks become inaccessible.
> >
> > This may or may not be related, but just as the system is booting, I get
> > the following:
> >
> > ata_id[5183]: HDIO_GET_IDENTITY failed for '/dev/block/8:96'
> > ata_id[5188]: HDIO_GET_IDENTITY failed for '/dev/block/8:112'
> > ata_id[5184]: HDIO_GET_IDENTITY failed for '/dev/block/8:80'
> >
> > (those map to sdg, sdh, and sdf in that order, no report for sde, the
> > first disk in the controller)
> 
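The major:minor pairs in the ata_id messages above can be mapped to sd names
mechanically. A minimal sketch, assuming the usual Linux numbering for SCSI
disks under block major 8 (16 minors per disk, minor 0 of each group being the
whole disk); the helper name sd_name is made up for illustration:

```python
SD_MAJOR = 8          # first SCSI disk major number
MINORS_PER_DISK = 16  # whole disk plus up to 15 partitions


def sd_name(dev):
    """Map a 'major:minor' string such as '8:96' to a name such as 'sdg'."""
    major, minor = (int(part) for part in dev.split(":"))
    if major != SD_MAJOR or not 0 <= minor <= 255:
        raise ValueError("only block major 8 (sda..sdp) is handled here")
    index, partition = divmod(minor, MINORS_PER_DISK)
    name = "sd" + chr(ord("a") + index)
    # A non-zero remainder is a partition number on that disk.
    return name + (str(partition) if partition else "")
```

With this numbering, 8:96, 8:112 and 8:80 come out as sdg, sdh and sdf, which
matches the mapping given above; sde would be 8:64.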
> So I've let the controller and disks sit all day after finishing a full
>  read test (dd if=/dev/sd[efgh] of=/dev/null bs=8M) with all four 1TB
>  drives going at the same time, and I've had no errors at all. All four dd
>  commands finished without error, and went at full speed.
> 
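The parallel read test described above (one sequential reader per disk, 8M
reads, output discarded) could also be scripted. This is only a sketch of the
same idea, not the commands that were actually run; the function names are
made up, and reading real block devices this way normally needs root:

```python
import threading

CHUNK = 8 * 1024 * 1024  # 8 MiB per read, mirroring bs=8M in the dd test


def read_all(path, results):
    """Read path sequentially to EOF and record (bytes_read, error_or_None)."""
    total = 0
    try:
        with open(path, "rb", buffering=0) as f:
            while True:
                buf = f.read(CHUNK)
                if not buf:  # empty read means end of file/device
                    break
                total += len(buf)
        results[path] = (total, None)
    except OSError as exc:
        # Keep the byte count reached before the I/O error.
        results[path] = (total, exc)


def parallel_read(paths):
    """Start one reader thread per path and wait for all of them to finish."""
    results = {}
    threads = [threading.Thread(target=read_all, args=(p, results)) for p in paths]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

Calling parallel_read with the four device paths would return, per device, how
many bytes were read and the OSError (if any) that stopped the read.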
> If I attempt to activate an md raid0 array on top of any disks on this
> controller, the controller starts having a fit, and all disks are
> inaccessible until a hard reset (the machine won't fully reboot or turn
> off, as the "flushing scsi cache" or "shutting down LVM" steps hang
> waiting on drives on the wedged controller).
> 
> I would really like to get this fixed; if there's anything more I can do to
> help narrow down the problem further, I'll do my best.
> 

Does anyone have a clue what might be wrong? Is there something I could check
into? I have a couple of system migrations to do, and this is blocking them.
(My old array has been making "click" noises for a year now, and I'm afraid
it'll die at any time.)

-- 
Thomas Fjellstrom
tfjellstrom@shaw.ca
