linux-scsi.vger.kernel.org archive mirror
* Re: mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT)
  2008-04-13 20:54 ` Sebastian Wasilewski
@ 2008-04-20 13:14   ` Sebastian Wasilewski
  0 siblings, 0 replies; 9+ messages in thread
From: Sebastian Wasilewski @ 2008-04-20 13:14 UTC (permalink / raw)
  To: linux-scsi

Hi All,

The problem still exists, both with NCQ turned off and with NCQ turned on.

# dmesg  |grep 'Code={Abort}, SubCode(0x3000)'
mptbase: ioc0: LogInfo(0x31123000): Originator={PL}, Code={Abort}, SubCode(0x3000)
mptbase: ioc0: LogInfo(0x31123000): Originator={PL}, Code={Abort}, SubCode(0x3000)
mptbase: ioc0: LogInfo(0x31123000): Originator={PL}, Code={Abort}, SubCode(0x3000)
(...)

# dmesg  |grep 'Code={Abort}, SubCode(0x3000)' |wc -l
377
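
To see which LogInfo values dominate, rather than grepping for one value at a time, a small sketch along these lines may help. It assumes the messages have the form shown above. The name tally_loginfo and the SAMPLE text are illustrative only and are not part of any driver tooling.

```python
import re
from collections import Counter

# Matches the hex value inside "LogInfo(0x........)" as printed by mptbase.
LOGINFO_RE = re.compile(r"LogInfo\((0x[0-9a-fA-F]+)\)")


def tally_loginfo(log_text):
    """Count how often each LogInfo value appears in kernel log text."""
    return Counter(m.group(1).lower() for m in LOGINFO_RE.finditer(log_text))


# Sample input copied from the dmesg excerpt above (one message per line).
SAMPLE = """\
mptbase: ioc0: LogInfo(0x31123000): Originator={PL}, Code={Abort}, SubCode(0x3000)
mptbase: ioc0: LogInfo(0x31123000): Originator={PL}, Code={Abort}, SubCode(0x3000)
mptbase: ioc0: LogInfo(0x31123000): Originator={PL}, Code={Abort}, SubCode(0x3000)
"""

if __name__ == "__main__":
    # Most frequent values first.
    for value, count in tally_loginfo(SAMPLE).most_common():
        print(value, count)
```

On a live system the same function could be fed the saved output of dmesg instead of SAMPLE.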

Does anyone know what these messages mean?

Regards,
Sebastian



^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT)
@ 2008-04-20 19:14 Richard Scobie
  0 siblings, 0 replies; 9+ messages in thread
From: Richard Scobie @ 2008-04-20 19:14 UTC (permalink / raw)
  To: linux-scsi

I have a case open with LSI at the moment on this. I'll let you know 
what they tell me.

Regards,

Richard

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT)
@ 2008-04-22 19:28 Richard Scobie
  2008-04-22 20:02 ` Moore, Eric
  2008-04-22 23:21 ` Sebastian Wasilewski
  0 siblings, 2 replies; 9+ messages in thread
From: Richard Scobie @ 2008-04-22 19:28 UTC (permalink / raw)
  To: linux-scsi; +Cc: sebastian

I received a reply from LSI:

"It seems there is some bad cable or connectivity or bad device in the
topology. The loginfo 0x31123000 says the I/O is aborted due to a Stuck
Link. The loginfo 0x31120403 specifies the I/O is aborted due to frame
transfer errors."

So far, doing large dd reads and writes, I have been unable to generate 
any of these errors; they occur only during the initial md RAID5 sync-up, 
so I guess this is going to be a little tricky to pin down.

Regards,

Richard

^ permalink raw reply	[flat|nested] 9+ messages in thread

* RE: mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT)
  2008-04-22 19:28 mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT) Richard Scobie
@ 2008-04-22 20:02 ` Moore, Eric
  2008-04-22 20:16   ` Richard Scobie
  2008-04-22 23:21 ` Sebastian Wasilewski
  1 sibling, 1 reply; 9+ messages in thread
From: Moore, Eric @ 2008-04-22 20:02 UTC (permalink / raw)
  To: Richard Scobie, linux-scsi; +Cc: sebastian

On Tuesday, April 22, 2008 1:29 PM,  Richard Scobie wrote:
> I received a reply from LSI:
> 
> "It seems there is some bad cable or connectivity or bad device in the
> topology. The loginfo 0x31123000 says the I/O is aborted due to a Stuck
> Link. The loginfo 0x31120403 specifies the I/O is aborted due to frame
> transfer errors."
> 
> So far, doing large dd reads and writes, I have been unable to generate
> any of these errors; they occur only during the initial md RAID5 sync-up,
> so I guess this is going to be a little tricky to pin down.
> 

The loginfo bits are defined in a header inside the linux kernel
sources.  

Location:  /usr/src/linux/drivers/message/fusion/lsi/mpi_log_sas.h

If you search for 3000, you will find
PL_LOGINFO_CODE_WRONG_REL_OFF_OR_FRAME_LENGTH.

There's no match for 403, so I guess this is a new define not included
in my version of the headers.
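
As a rough aid for reading these values, the sketch below splits a SAS LogInfo word into its fields. The bit layout (bus type in bits 31-28, originator in bits 27-24, code in bits 23-16, sub-code in bits 15-0) and the originator names are assumptions here; they are consistent with the decoded dmesg line for 0x31123000 in this thread (Originator={PL}, Code={Abort}, SubCode(0x3000)) but should be checked against mpi_log_sas.h. The name decode_sas_loginfo is illustrative.

```python
# Assumed originator numbering; verify against your copy of mpi_log_sas.h.
ORIGINATORS = {0: "IOP", 1: "PL", 2: "IR"}


def decode_sas_loginfo(loginfo):
    """Split a 32-bit SAS LogInfo value into its assumed bit fields."""
    return {
        "bus_type": (loginfo >> 28) & 0xF,    # bits 31-28
        "originator": (loginfo >> 24) & 0xF,  # bits 27-24
        "code": (loginfo >> 16) & 0xFF,       # bits 23-16
        "subcode": loginfo & 0xFFFF,          # bits 15-0
    }


if __name__ == "__main__":
    # The two values quoted in LSI's reply.
    for value in (0x31123000, 0x31120403):
        f = decode_sas_loginfo(value)
        name = ORIGINATORS.get(f["originator"], "unknown")
        print("0x%08x: originator=%s code=0x%02x subcode=0x%04x"
              % (value, name, f["code"], f["subcode"]))
```

Under that layout both values share code 0x12 (printed as Abort by the driver) and differ only in the sub-code, 0x3000 versus 0x0403, so the sub-code defines would be the place to look for them.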

Eric

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT)
  2008-04-22 20:02 ` Moore, Eric
@ 2008-04-22 20:16   ` Richard Scobie
  0 siblings, 0 replies; 9+ messages in thread
From: Richard Scobie @ 2008-04-22 20:16 UTC (permalink / raw)
  To: Moore, Eric; +Cc: linux-scsi, sebastian

Moore, Eric wrote:

> The loginfo bits are defined in a header inside the linux kernel
> sources.  
> 
> Location:  /usr/src/linux/drivers/message/fusion/lsi/mpi_log_sas.h
> 
> If you search for 3000, you will find
> PL_LOGINFO_CODE_WRONG_REL_OFF_OR_FRAME_LENGTH.
> 
> There's no match for 403, so I guess this is a new define not included
> in my version of the headers.
> 
> Eric

Thanks Eric. I doubt I can source a new backplane to swap out, so I'll 
try to find out if there is a more recent firmware for it and try that.

Regards,

Richard

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT)
  2008-04-22 19:28 mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT) Richard Scobie
  2008-04-22 20:02 ` Moore, Eric
@ 2008-04-22 23:21 ` Sebastian Wasilewski
  1 sibling, 0 replies; 9+ messages in thread
From: Sebastian Wasilewski @ 2008-04-22 23:21 UTC (permalink / raw)
  To: Richard Scobie; +Cc: linux-scsi

Richard Scobie wrote:
> I received a reply from LSI:
> 
> "It seems there is some bad cable or connectivity or bad device in the
> topology. The loginfo 0x31123000 says the I/O is aborted due to a Stuck
> Link. The loginfo 0x31120403 specifies the I/O is aborted due to frame
> transfer errors."
> 
> So far, doing large dd reads and writes, I have been unable to generate 
> any of these errors; they occur only during the initial md RAID5 sync-up, 
> so I guess this is going to be a little tricky to pin down.
> 

Thanks for the info.

There is still one thing I do not understand. I ran a test: I 
connected three SATA hard drives directly to the controller using the 
cable supplied with the controller. The error still occurred. I then 
swapped the cable for the one supplied with another controller, and 
swapped the hard drives for others of the same model, and the error 
message still occurred.

I also changed the controller (I have a few of them), but again the 
problem persisted.

The only unusual thing I noticed during the test was that all 
ST31000340NS drives worked at 1.5Gbps, not 3.0Gbps link speed.
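
One way to check the negotiated rate and look for link-level error counts is to read the per-phy attributes under /sys/class/sas_phy. The sketch below does that. It assumes the driver registers its phys with the Linux SAS transport class and that the attribute names listed in PHY_ATTRS are present on the running kernel; whether that holds for a given mptsas version is not confirmed in this thread. The name read_phy_attrs is illustrative.

```python
from pathlib import Path

# Attribute names assumed to be exposed by the SAS transport class.
PHY_ATTRS = (
    "negotiated_linkrate",
    "invalid_dword_count",
    "running_disparity_error_count",
    "loss_of_dword_sync_count",
    "phy_reset_problem_count",
)


def read_phy_attrs(base="/sys/class/sas_phy", attrs=PHY_ATTRS):
    """Return {phy_name: {attribute: value}} for every phy under base."""
    report = {}
    base_path = Path(base)
    if not base_path.is_dir():
        return report  # no SAS phys registered, or sysfs not mounted
    for phy in sorted(base_path.iterdir()):
        values = {}
        for name in attrs:
            try:
                values[name] = (phy / name).read_text().strip()
            except OSError:
                pass  # attribute missing or unreadable; skip it
        report[phy.name] = values
    return report


if __name__ == "__main__":
    for phy, values in read_phy_attrs().items():
        print(phy, values)
```

Error counters that keep rising on one phy during the RAID sync would point at that link rather than at the controller.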

I have run plenty of dd tests and all were fine (both with NCQ and 
without).


Regards,
Sebastian

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT)
@ 2008-05-02  4:26 Richard Scobie
  2008-05-02 22:37 ` Moore, Eric
  0 siblings, 1 reply; 9+ messages in thread
From: Richard Scobie @ 2008-05-02  4:26 UTC (permalink / raw)
  To: linux-scsi

Sebastian Wasilewski said:

There is still one thing I do not understand. I ran a test: I
connected three SATA hard drives directly to the controller using the
cable supplied with the controller. The error still occurred. I then
swapped the cable for the one supplied with another controller, and
swapped the hard drives for others of the same model, and the error
message still occurred.

I also changed the controller (I have a few of them), but again the
problem persisted.

------------------------------

If anyone else can replicate these errors using Sebastian's setup above, 
I would be interested in hearing about it.

I have a case open with LSI but apparently my HBA is a "prototype" board 
and this is complicating things. I have had no indication whether they 
can replicate this issue or not.

If anyone else from LSI is on this list and can wade through SR# 
2-209998499, I'd be interested in any suggestions. Time is running out 
for me and I will probably have to go with someone else's HBA.

Apologies for this mail not threading correctly, as I am not subscribed. 
The start of the thread can be found here:

http://marc.info/?l=linux-scsi&m=120805128416791&w=2

and carries on here:

http://marc.info/?l=linux-scsi&m=120869748422095&w=2

Regards,

Richard

^ permalink raw reply	[flat|nested] 9+ messages in thread

* RE: mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT)
  2008-05-02  4:26 Richard Scobie
@ 2008-05-02 22:37 ` Moore, Eric
  2008-05-02 22:49   ` Richard Scobie
  0 siblings, 1 reply; 9+ messages in thread
From: Moore, Eric @ 2008-05-02 22:37 UTC (permalink / raw)
  To: Richard Scobie, linux-scsi


> If anyone else from LSI is on this list and can wade through SR#
> 2-209998499, I'd be interested in any suggestions. Time is running out
> for me and I will probably have to go with someone else's HBA.
> 

Richard, those SR issues are not accessible to developers at LSI.
They are created by the tech support group and used internally within
that group. The developers are who you're going to find on the
linux-scsi@ forum, like me and Sathya. We use ClearCase, which
is a different database. If you give me the name of the contact who
assigned you that SR number, I will see what I can do.

Eric

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT)
  2008-05-02 22:37 ` Moore, Eric
@ 2008-05-02 22:49   ` Richard Scobie
  0 siblings, 0 replies; 9+ messages in thread
From: Richard Scobie @ 2008-05-02 22:49 UTC (permalink / raw)
  To: Moore, Eric; +Cc: linux-scsi

Moore, Eric wrote:

> Richard, those SR issues are not accessible to developers at LSI.
> They are created by the tech support group and used internally within
> that group. The developers are who you're going to find on the
> linux-scsi@ forum, like me and Sathya. We use ClearCase, which
> is a different database. If you give me the name of the contact who
> assigned you that SR number, I will see what I can do.
> 
> Eric

Hi Eric,

Many thanks. Ken Stewart is the man to talk to; otherwise I can forward 
you some long, back-and-forth emails.

Regards,

Richard

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2008-05-02 22:45 UTC | newest]

Thread overview: 9+ messages
2008-04-22 19:28 mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT) Richard Scobie
2008-04-22 20:02 ` Moore, Eric
2008-04-22 20:16   ` Richard Scobie
2008-04-22 23:21 ` Sebastian Wasilewski
  -- strict thread matches above, loose matches on Subject: below --
2008-05-02  4:26 Richard Scobie
2008-05-02 22:37 ` Moore, Eric
2008-05-02 22:49   ` Richard Scobie
2008-04-20 19:14 Richard Scobie
2008-04-13 20:47 mptbase Code={Abort} Richard Scobie
2008-04-13 20:54 ` Sebastian Wasilewski
2008-04-20 13:14   ` mptbase Code={Abort} (LSI SAS1068 PCI-X Fusion-MPT) Sebastian Wasilewski
