* Re: Bug in aic94xx driver in 2.6.25-rc3
[not found] ` <1203992592.26232.79.camel@alexis>
@ 2008-02-27 22:39 ` peter
2008-02-27 23:37 ` James Bottomley
0 siblings, 1 reply; 2+ messages in thread
From: peter @ 2008-02-27 22:39 UTC
To: linux-scsi
Cc: Darrick J. Wong, James Bottomley, Wu, Gilbert, Alexis Bruemmer,
tom_white
My original post was user error and not actually a bug. I didn't
realize that there was another patch I needed to apply to rc3 to get the
latest scsi drivers and error handler code. Alexis clued me in. Now
the error handler appears to be working properly. I included a sample
at the bottom of this email.

I am still seeing the disk go offline if I run I/O performance tests on
SAS disks connected to the aic94xx (sequencer version 32). It doesn't
happen right away. The I/O tests will run for several hours before they
fail. Eventually you see the filesystem abort and then be remounted
read-only.
Peter Bogdanovic
IBM
sas: command 0xffff810142569200, task 0xffff81022ad27980, timed out: EH_NOT_HANDLED
sas: Enter sas_scsi_recover_host
sas: trying to find task 0xffff81022ad27980
sas: sas_scsi_find_task: aborting task 0xffff81022ad27980
aic94xx: tmf tasklet complete
aic94xx: tmf resp tasklet
aic94xx: tmf came back
aic94xx: task not done, clearing nexus
aic94xx: asd_clear_nexus_tag: PRE
aic94xx: asd_clear_nexus_tag: POST
aic94xx: asd_clear_nexus_tag: clear nexus posted, waiting...
aic94xx: task 0xffff81022ad27980 done with opcode 0x23 resp 0x0 stat 0x8d but aborted by upper layer!
aic94xx: asd_clear_nexus_tasklet_complete: here
aic94xx: asd_clear_nexus_tasklet_complete: opcode: 0x0
aic94xx: came back from clear nexus
aic94xx: task 0xffff81022ad27980 aborted, res: 0x0
sas: sas_scsi_find_task: task 0xffff81022ad27980 is done
sas: sas_eh_handle_sas_errors: task 0xffff81022ad27980 is done
sd 0:0:3:0: [sdd] Result: hostbyte=DID_ABORT driverbyte=DRIVER_OK,SUGGEST_OK
end_request: I/O error, dev sdd, sector 27106623
sas: --- Exit sas_scsi_recover_host
* Re: Bug in aic94xx driver in 2.6.25-rc3
From: James Bottomley @ 2008-02-27 23:37 UTC
To: pbog
Cc: linux-scsi, Darrick J. Wong, Wu, Gilbert, Alexis Bruemmer,
tom_white
On Wed, 2008-02-27 at 14:39 -0800, peter wrote:
> My original post was user error and not actually a bug. I didn't
> realize that there was another patch I needed to apply to rc3 to get the
> latest scsi drivers and error handler code. Alexis clued me in. Now
> the error handler appears to be working properly. I included a sample
> at the bottom of this email.
>
> I am still seeing the disk go offline if I run i/o performance tests on
> sas disks connected to the aic94xx (sequencer version 32). It doesn't
> happen right away. The i/o tests will run for several hours before it
> fails. Eventually you see the filesystem abort and then be remounted as
> read only.
Yes, I've seen this one too. In my case it's caused by error handling
tripping a flutter on the disk phy, so the HBA actually sees an
unplug/replug event, but that causes the disk to go offline (even though
it actually reappears almost immediately). I have that on my list of
things to fix ... probably by stealing the devloss tmo from fc.
James
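
For readers unfamiliar with the "devloss tmo" idea mentioned above, here is
a minimal sketch of the general technique: when a device disappears, it is
held in a blocked state for a grace period instead of being taken offline
at once, and a replug inside that period brings it straight back. This is a
toy model only, not libsas, aic94xx, or FC transport code. The class name,
the state names ("online", "blocked", "offline") and the explicit `now`
arguments are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Device:
    name: str
    state: str = "online"             # "online", "blocked", or "offline" (illustrative names)
    deadline: Optional[float] = None  # time at which a blocked device is given up on


class DevLossTracker:
    """Toy model of a device-loss timeout. Hypothetical, not kernel code."""

    def __init__(self, dev_loss_tmo: float) -> None:
        self.dev_loss_tmo = dev_loss_tmo
        self.devices: Dict[str, Device] = {}

    def add(self, name: str) -> Device:
        dev = Device(name)
        self.devices[name] = dev
        return dev

    def unplug(self, name: str, now: float) -> None:
        # Do not offline immediately: block the device and start the timer.
        dev = self.devices[name]
        if dev.state == "online":
            dev.state = "blocked"
            dev.deadline = now + self.dev_loss_tmo

    def replug(self, name: str, now: float) -> None:
        # Expire overdue timers first so a late replug cannot revive the device.
        self.tick(now)
        dev = self.devices[name]
        if dev.state == "blocked":
            dev.state = "online"
            dev.deadline = None

    def tick(self, now: float) -> None:
        # Take offline any device whose grace period has run out.
        for dev in self.devices.values():
            if dev.state == "blocked" and dev.deadline is not None and now >= dev.deadline:
                dev.state = "offline"
                dev.deadline = None


tracker = DevLossTracker(dev_loss_tmo=5.0)
tracker.add("sdd")
tracker.unplug("sdd", now=100.0)   # phy flutter: device vanishes
tracker.replug("sdd", now=100.2)   # and reappears almost immediately
print(tracker.devices["sdd"].state)
```

With a grace period of this kind, a brief unplug/replug such as the phy
flutter described above would leave the disk online, while a device that
stays away past the timeout would still be taken offline.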