public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
* aic94xx + ST3146855SS still failing under heavy load
@ 2008-04-14 17:03 Leonid Kalmankin
  2008-04-16 16:46 ` Raoul Bhatia [IPAX]
  2008-04-16 17:34 ` Petrakis, Peter
  0 siblings, 2 replies; 5+ messages in thread
From: Leonid Kalmankin @ 2008-04-14 17:03 UTC (permalink / raw)
  To: linux-scsi

Hello!

We have a system with:

vanilla 2.6.25-rc8 (2.6.23, 2.6.24 have the same behaviour)

Adaptec AIC-9410W SAS (Razor ASIC RAID) (rev 09)
aic94xx: Found sequencer Firmware version 1.1 (V30)
  (Firmware version 1.1 (V17/10c6) makes no difference)
scsi 2:0:0:0: Direct-Access  SEAGATE ST3146855SS 0002 PQ: 0 ANSI: 5


It reliably fails under heavy IO:

> sas: command 0xffff81022c5f5640, task 0xffff8101f6b0f000, timed out: EH_NOT_HANDLED
> sas: command 0xffff81022c5f5500, task 0xffff8101f6b0f1c0, timed out: EH_NOT_HANDLED
> ....
> sas: Enter sas_scsi_recover_host
> sas: trying to find task 0xffff8101f6b0f000
> sas: sas_scsi_find_task: aborting task 0xffff8101f6b0f000
> aic94xx: task 0xffff8101f6b0f000 done with opcode 0x1e resp 0x0 stat 0x8d but aborted by upper layer!
> aic94xx: tmf tasklet complete
> aic94xx: tmf came back
> aic94xx: asd_abort_task: task 0xffff8101f6b0f000 done
> aic94xx: task 0xffff8101f6b0f000 aborted, res: 0x0
> sas: sas_scsi_find_task: task 0xffff8101f6b0f000 is done
> sas: sas_eh_handle_sas_errors: task 0xffff8101f6b0f000 is done
> sas: --- Exit sas_scsi_recover_host

Sometimes it successfully recovers; sometimes the disk is lost until the reboot.

I've read http://archive.netbsd.se/?ml=linux-scsi&a=2008-01&t=6260524
Asked Seagate about firmware update; they told me they do not have any.

As I understood, the root of this problem is protocol errors in disk's firmware
(other disks, for example FUJITSU MBA3147RC work fine); however, that kind of errors
should be recoverable by sas/aic94xx drivers.

If that is true, I could test some patches/ideas, where should I start?


^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2008-04-17 15:52 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-04-14 17:03 aic94xx + ST3146855SS still failing under heavy load Leonid Kalmankin
2008-04-16 16:46 ` Raoul Bhatia [IPAX]
2008-04-16 17:34 ` Petrakis, Peter
2008-04-17 15:08   ` Leonid Kalmankin
2008-04-17 15:51   ` Petrakis, Peter

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox