public inbox for linux-scsi@vger.kernel.org
* Adaptec 29320LPE errors on boot
@ 2008-03-05 15:42 Rodrigo Severo
  2008-03-05 15:55 ` James Bottomley
       [not found] ` <532ABFBDAAC3A34EB12EBA6CEC2838F4831D6BEF@ADPE2K703.adaptec.com>
  0 siblings, 2 replies; 10+ messages in thread
From: Rodrigo Severo @ 2008-03-05 15:42 UTC (permalink / raw)
  To: linux-scsi

Hi,


I have this Adaptec 29320LPE SCSI board (using the aic79xx driver)
which gives several errors at boot time, just after it recognizes
the HDs connected to it.

If I remove the SCSI HD there are no SCSI errors.

The very first error messages have scrolled up and I don't know how to
get to them (is there a way?) but here is the transcript of what I can
see:

scsi0: hwerrint, Sequencer Parity Error
scsi0: hwerrint, Data-path Parity Error
scsi0: hwerrint, Scratch or SCB Memory Parity Error
scsi0: hwerrint, CIOBUS Parity Error
>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<
scsi0: Dumping Card State at program address 0xffff Mode 0x77
Card was paused
INTSTAT[0xff] SELOID[0xff] SELID[0xff] HS_MAILBOX[0xff]
INTCTL[0xff] SEQINTSTAT[0xff] SAVED_MODE[0xff]
DFFSTAT[0xff] SCSISIGI[0xff] SCSIPHASE[0xff] SCSIBUS[0xff]
LASTPHASE[0xff] SCSISEQ0[0xff] SCSISEQ1[0xff] SEQCTL0[0xff]
SEQINTCTL[0xff] SEQ_FLAGS[0xff] SEQ_FLAGS2[0xff]
QFREEZE_COUNT[0xffff] KERNEL_QFREEZE_COUNT[0xffff]
MK_MESSAGE_SCB[0xffff] MK_MESSAGE_SCSIID[0xff]
SSTAT0[0xff] SSTAT1[0xff] SSTAT2[0xff] SSTAT3[0xff]
PERRDIAG[0xff] SIMODE1[0xff] LQISTAT0[0xff] LQISTAT1[0xff]
LQSTAT2[0xff] LQOSTAT0[0xff] LQOSTAT1[0xff] LQOSTAT2[0xff]

SCB Count = 4 CMDS_PENDING = 65535 LASTSCB 0xffff CURRSCB 0xffff NEXTSCB 0xffff
CPU 0: Machine Check Exception: 0000000000000004
CPU 0: Bank 4: 3200000000070f0f
Kernel panic - not syncing: CPU context corrupt
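
Every register in the dump above reads back as all ones (0xff or 0xffff). One possible reading, offered only as an assumption and not confirmed anywhere in this thread, is that register reads are no longer reaching the chip at all, since reads from a device that has stopped responding on the bus typically return all ones. A minimal Python sketch that checks a pasted dump for this pattern follows; the names `parse_registers` and `all_ones` are illustrative and the excerpt is copied from the dump:

```python
import re

# Excerpt of the dump above; register names and values are copied from it.
DUMP = """\
INTSTAT[0xff] SELOID[0xff] SELID[0xff] HS_MAILBOX[0xff]
QFREEZE_COUNT[0xffff] KERNEL_QFREEZE_COUNT[0xffff]
SSTAT0[0xff] SSTAT1[0xff] SSTAT2[0xff] SSTAT3[0xff]
"""

# Matches tokens of the form NAME[0xHEX].
REGISTER = re.compile(r"(\w+)\[0x([0-9a-fA-F]+)\]")


def parse_registers(text):
    """Return {name: hex_digits} for every NAME[0x..] token in the text."""
    return {name: digits.lower() for name, digits in REGISTER.findall(text)}


def all_ones(registers):
    """True if at least one register was parsed and every value is all 'f' digits."""
    return bool(registers) and all(set(d) == {"f"} for d in registers.values())


regs = parse_registers(DUMP)
print(len(regs), "registers parsed; all ones:", all_ones(regs))
```

If a later dump shows a mix of values instead, this reading would not apply and the individual registers would be worth decoding.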

Some extra info on the machine:

# cat /proc/version
Linux version 2.6.25-rc4 (root@lab) (gcc version 4.1.2 (Gentoo 4.1.2 p1.0.2)) #3 Wed Mar 5 12:21:22 BRT 2008

# lspci
00:00.0 RAM memory: nVidia Corporation C51 Host Bridge (rev a2)
00:00.1 RAM memory: nVidia Corporation C51 Memory Controller 0 (rev a2)
00:00.2 RAM memory: nVidia Corporation C51 Memory Controller 1 (rev a2)
00:00.3 RAM memory: nVidia Corporation C51 Memory Controller 5 (rev a2)
00:00.4 RAM memory: nVidia Corporation C51 Memory Controller 4 (rev a2)
00:00.5 RAM memory: nVidia Corporation C51 Host Bridge (rev a2)
00:00.6 RAM memory: nVidia Corporation C51 Memory Controller 3 (rev a2)
00:00.7 RAM memory: nVidia Corporation C51 Memory Controller 2 (rev a2)
00:02.0 PCI bridge: nVidia Corporation C51 PCI Express Bridge (rev a1)
00:03.0 PCI bridge: nVidia Corporation C51 PCI Express Bridge (rev a1)
00:04.0 PCI bridge: nVidia Corporation C51 PCI Express Bridge (rev a1)
00:05.0 VGA compatible controller: nVidia Corporation C51G [GeForce 6100] (rev a2)
00:09.0 RAM memory: nVidia Corporation MCP51 Host Bridge (rev a2)
00:0a.0 ISA bridge: nVidia Corporation MCP51 LPC Bridge (rev a2)
00:0a.1 SMBus: nVidia Corporation MCP51 SMBus (rev a2)
00:0a.2 RAM memory: nVidia Corporation MCP51 Memory Controller 0 (rev a2)
00:0b.0 USB Controller: nVidia Corporation MCP51 USB Controller (rev a2)
00:0b.1 USB Controller: nVidia Corporation MCP51 USB Controller (rev a2)
00:0d.0 IDE interface: nVidia Corporation MCP51 IDE (rev a1)
00:10.0 PCI bridge: nVidia Corporation MCP51 PCI Bridge (rev a2)
00:10.2 Multimedia audio controller: nVidia Corporation MCP51 AC97 Audio Controller (rev a2)
00:14.0 Bridge: nVidia Corporation MCP51 Ethernet Controller (rev a1)
00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
02:00.0 PCI bridge: PLX Technology, Inc. PEX 8114 PCI Express-to-PCI/PCI-X Bridge (rev bc)
03:04.0 SCSI storage controller: Adaptec ASC-29320ALP U320 (rev 10)

Please let me know if there is any extra info needed. I can make any
modifications necessary as this is a test machine.


TIA,

Rodrigo Severo

-- 
---------------------------------------------------------------------------------------
Rodrigo Severo

Fábrica de Idéias
SBS Quadra 2 - Bloco S - Ed. Empire Center - Sala 1.301
Brasília - DF - CEP 70070-904
Tel. (61) 3321-1357       Fax (61) 3223-1712
---------------------------------------------------------------------------------------
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


* Re: Adaptec 29320LPE errors on boot
  2008-03-05 15:42 Adaptec 29320LPE errors on boot Rodrigo Severo
@ 2008-03-05 15:55 ` James Bottomley
  2008-03-05 16:26   ` Rodrigo Severo
       [not found] ` <532ABFBDAAC3A34EB12EBA6CEC2838F4831D6BEF@ADPE2K703.adaptec.com>
  1 sibling, 1 reply; 10+ messages in thread
From: James Bottomley @ 2008-03-05 15:55 UTC (permalink / raw)
  To: Rodrigo Severo; +Cc: linux-scsi

On Wed, 2008-03-05 at 12:42 -0300, Rodrigo Severo wrote:
> Hi,
> 
> 
> I have this Adaptec 29320LPE SCSI board (using the aic79xx driver)
> which gives several errors at boot time, just after it recognizes
> the HDs connected to it.
> 
> If I remove the SCSI HD there are no SCSI errors.
> 
> The very first error messages have scrolled up and I don't know how to
> get to them (is there a way?)

If you can get it to boot (say to a ramdisk with a shell) then dmesg
gives them ... otherwise you need a serial console.
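
Once the log has been captured, either from dmesg or over a serial console (typically enabled with a kernel parameter such as console=ttyS0,115200), the first hardware error can be pulled out with a few lines of Python. This is only a sketch: the function name `hardware_errors` is illustrative, and the sample text is the set of messages quoted in this thread.

```python
import re

# Matches kernel lines such as "scsi0: hwerrint, Sequencer Parity Error",
# with or without a leading timestamp or quote marker.
HWERR = re.compile(r"(scsi\d+): hwerrint, (.+)$")


def hardware_errors(log_text):
    """Return (host, message) pairs for hwerrint lines, in log order."""
    hits = []
    for line in log_text.splitlines():
        match = HWERR.search(line.strip())
        if match:
            hits.append((match.group(1), match.group(2)))
    return hits


# Messages as reported in the thread; a real run would read a saved log instead.
SAMPLE = """\
scsi0: hwerrint, Sequencer Parity Error
scsi0: hwerrint, Data-path Parity Error
scsi0: hwerrint, Scratch or SCB Memory Parity Error
scsi0: hwerrint, CIOBUS Parity Error
"""

errors = hardware_errors(SAMPLE)
print("first hardware error:", errors[0])
```

Because the pairs come back in log order, the first entry is the earliest error in whatever portion of the log was captured, which is the one most likely to be the root cause.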

>  but here is the transcript of what I can
> see:
> 
> scsi0: hwerrint, Sequencer Parity Error
> scsi0: hwerrint, Data-path Parity Error
> scsi0: hwerrint, Scratch or SCB Memory Parity Error
> scsi0: hwerrint, CIOBUS Parity Error

That's a pretty comprehensive sequence of parity errors.  Assuming
they're the root cause rather than something caused by the first error:
best case, you just need a better cable or termination; worst case, your
card is actually hosed in its internal data path.

James




* Re: Adaptec 29320LPE errors on boot
       [not found] ` <532ABFBDAAC3A34EB12EBA6CEC2838F4831D6BEF@ADPE2K703.adaptec.com>
@ 2008-03-05 16:26   ` Rodrigo Severo
  2008-03-05 16:32     ` James Bottomley
  0 siblings, 1 reply; 10+ messages in thread
From: Rodrigo Severo @ 2008-03-05 16:26 UTC (permalink / raw)
  To: linux-scsi

On 3/5/08, Salyzyn, Mark <Mark_Salyzyn@adaptec.com> wrote:
> Sounds like you have a bad SCSI cable or drive.
>
>  Do any of these drives work in any other OS environment?

This same drive with this same cable and this same terminator works
well with a 39160 (which uses another driver, aic7xxx) in the
same OS environment, so I don't think this is a cable or drive issue.

Besides, I tested with four different cables (different models and
manufacturers, all claiming to be U320 compatible). The problem is
unchanged when I use different cables and terminators.

I also tested with several different disks (different models and
manufacturers). The problem is still unchanged.


Regards,

Rodrigo Severo


* Re: Adaptec 29320LPE errors on boot
  2008-03-05 15:55 ` James Bottomley
@ 2008-03-05 16:26   ` Rodrigo Severo
  0 siblings, 0 replies; 10+ messages in thread
From: Rodrigo Severo @ 2008-03-05 16:26 UTC (permalink / raw)
  To: linux-scsi

On 3/5/08, James Bottomley <James.Bottomley@hansenpartnership.com> wrote:
> On Wed, 2008-03-05 at 12:42 -0300, Rodrigo Severo wrote:
>  > I have this Adaptec 29320LPE SCSI board (using the aic79xx driver)
>  > which gives several errors at boot time, just after it recognizes
>  > the HDs connected to it.
>  >
>  > If I remove the SCSI HD there are no SCSI errors.
>  >
>  > The very first error messages have scrolled up and I don't know how to
>  > get to them (is there a way?)
>
>
> If you can get it to boot (say to a ramdisk with a shell) then dmesg
>  gives them ... otherwise you need a serial console.

I managed to get the first error messages:

Attempting to Queue an Abort message: CDB 0x3c 0xa 0x0 0x0 0x0 0x0 0x0 0xfc 0x0
scsi0: hwerrint, Discard Timer has timed out
scsi0: hwerrint, Illegal Opcode in sequencer program

then it gets to the messages I included in my previous message.

>  >  but here is the transcript of what I can
>  > see:
>  >
>  > scsi0: hwerrint, Sequencer Parity Error
>  > scsi0: hwerrint, Data-path Parity Error
>  > scsi0: hwerrint, Scratch or SCB Memory Parity Error
>  > scsi0: hwerrint, CIOBUS Parity Error
>
>
> That's a pretty comprehensive sequence of parity errors.  Assuming
>  they're the root cause rather than something caused by the first error:
>  best case, you just need a better cable or termination;

I don't think so (please see my answer to Mark Salyzyn's message).

> worst case, your
>  card is actually hosed in its internal data path.

I also don't think so, because I get this exact same error if I try
this SCSI board on three different motherboard models, but there is a
fourth model where I don't get these errors. Some info on the "working
motherboard":

# lspci
00:00.0 RAM memory: nVidia Corporation MCP61 Memory Controller (rev a1)
00:01.0 ISA bridge: nVidia Corporation MCP61 LPC Bridge (rev a2)
00:01.1 SMBus: nVidia Corporation MCP61 SMBus (rev a2)
00:01.2 RAM memory: nVidia Corporation MCP61 Memory Controller (rev a2)
00:02.0 USB Controller: nVidia Corporation MCP61 USB Controller (rev a2)
00:04.0 PCI bridge: nVidia Corporation MCP61 PCI bridge (rev a1)
00:06.0 IDE interface: nVidia Corporation MCP61 IDE (rev a2)
00:08.0 IDE interface: nVidia Corporation MCP61 SATA Controller (rev a2)
00:09.0 PCI bridge: nVidia Corporation MCP61 PCI Express bridge (rev a2)
00:0b.0 PCI bridge: nVidia Corporation MCP61 PCI Express bridge (rev a2)
00:0c.0 PCI bridge: nVidia Corporation MCP61 PCI Express bridge (rev a2)
00:0d.0 VGA compatible controller: nVidia Corporation GeForce 6100 nForce 430 (rev a2)
00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
01:06.0 SCSI storage controller: Adaptec AHA-3960D / AIC-7899A U160/m (rev 01)
01:06.1 SCSI storage controller: Adaptec AHA-3960D / AIC-7899A U160/m (rev 01)
01:07.0 Ethernet controller: 3Com Corporation 3c905C-TX/TX-M [Tornado] (rev 78)
04:00.0 Ethernet controller: Attansic Technology Corp. L1 Gigabit Ethernet Adapter (rev b0)

In light of this varying behaviour on different motherboards and of
the info in my answer to Mark Salyzyn's message, do you still think
this can be a cable/driver/terminator/adapter issue?

I ask because it seems unlikely to me, but I don't have that much
experience with SCSI issues.


Regards,

Rodrigo Severo


* Re: Adaptec 29320LPE errors on boot
  2008-03-05 16:26   ` Rodrigo Severo
@ 2008-03-05 16:32     ` James Bottomley
  2008-03-05 16:52       ` Rodrigo Severo
  0 siblings, 1 reply; 10+ messages in thread
From: James Bottomley @ 2008-03-05 16:32 UTC (permalink / raw)
  To: Rodrigo Severo; +Cc: linux-scsi

On Wed, 2008-03-05 at 13:26 -0300, Rodrigo Severo wrote:
> On 3/5/08, Salyzyn, Mark <Mark_Salyzyn@adaptec.com> wrote:
> > Sounds like you have a bad SCSI cable or drive.
> >
> >  Do any of these drives work in any other OS environment?
> 
> This same drive with this same cable and this same terminator works
> well with a 39160 (which uses another driver, aic7xxx) in the
> same OS environment, so I don't think this is a cable or drive issue.

That's only ultra 160 capable.  What happens if you take the speeds for
the drives down in the bios of the 29320?

> Besides, I tested with four different cables (different models and
> manufacturers, all claiming to be U320 compatible). The problem is
> unchanged when I use different cables and terminators.
> 
> I also tested with several different disks (different models and
> manufacturers). The problem is still unchanged.

It's sounding a bit more like a damaged board ... but try reducing the
speed in the bios.

James




* Re: Adaptec 29320LPE errors on boot
  2008-03-05 16:32     ` James Bottomley
@ 2008-03-05 16:52       ` Rodrigo Severo
  2008-03-05 17:10         ` James Bottomley
  2008-03-05 17:11         ` Matthew Wilcox
  0 siblings, 2 replies; 10+ messages in thread
From: Rodrigo Severo @ 2008-03-05 16:52 UTC (permalink / raw)
  To: linux-scsi

On 3/5/08, James Bottomley <James.Bottomley@hansenpartnership.com> wrote:
> On Wed, 2008-03-05 at 13:26 -0300, Rodrigo Severo wrote:
>  > On 3/5/08, Salyzyn, Mark <Mark_Salyzyn@adaptec.com> wrote:
>  > > Sounds like you have a bad SCSI cable or drive.
>  > >
>  > >  Do any of these drives work in any other OS environment?
>  >
>  > This same drive with this same cable and this same terminator works
>  > well with a 39160 (which uses another driver, aic7xxx) in the
>  > same OS environment, so I don't think this is a cable or drive issue.
>
> That's only ultra 160 capable.  What happens if you take the speeds for
>  the drives down in the bios of the 29320?

I changed the following settings on the 29320LPE from their previous defaults:

Sync Transfer Rate: 160
Packetized: No
QAS: No

The error messages are unchanged.

>  > Besides, I tested with four different cables (different models and
>  > manufacturers, all claiming to be U320 compatible). The problem is
>  > unchanged when I use different cables and terminators.
>  >
>  > I also tested with several different disks (different models and
>  > manufacturers). The problem is still unchanged.
>
> It's sounding a bit more like a damaged board ... but try reducing the
>  speed in the bios.

As I said before, you have much more experience with SCSI issues, but I
still can't see how a damaged SCSI board would work on one motherboard
model. Do you really think this is possible?


Rodrigo Severo


* Re: Adaptec 29320LPE errors on boot
  2008-03-05 16:52       ` Rodrigo Severo
@ 2008-03-05 17:10         ` James Bottomley
  2008-03-05 19:14           ` Rodrigo Severo
  2008-03-05 17:11         ` Matthew Wilcox
  1 sibling, 1 reply; 10+ messages in thread
From: James Bottomley @ 2008-03-05 17:10 UTC (permalink / raw)
  To: Rodrigo Severo; +Cc: linux-scsi

On Wed, 2008-03-05 at 13:52 -0300, Rodrigo Severo wrote:
> On 3/5/08, James Bottomley <James.Bottomley@hansenpartnership.com> wrote:
> > On Wed, 2008-03-05 at 13:26 -0300, Rodrigo Severo wrote:
> >  > On 3/5/08, Salyzyn, Mark <Mark_Salyzyn@adaptec.com> wrote:
> >  > > Sounds like you have a bad SCSI cable or drive.
> >  > >
> >  > >  Do any of these drives work in any other OS environment?
> >  >
> >  > This same drive with this same cable and this same terminator works
> >  > well with a 39160 (which uses another driver, aic7xxx) in the
> >  > same OS environment, so I don't think this is a cable or drive issue.
> >
> > That's only ultra 160 capable.  What happens if you take the speeds for
> >  the drives down in the bios of the 29320?
> 
> I changed the following settings on the 29320LPE from their previous defaults:
> 
> Sync Transfer Rate: 160
> Packetized: No
> QAS: No
> 
> The error messages are unchanged.

That does point further to card problems.

> >  > Besides, I tested with four different cables (different models and
> >  > manufacturers, all claiming to be U320 compatible). The problem is
> >  > unchanged when I use different cables and terminators.
> >  >
> >  > I also tested with several different disks (different models and
> >  > manufacturers). The problem is still unchanged.
> >
> > It's sounding a bit more like a damaged board ... but try reducing the
> >  speed in the bios.
> 
> As I said before, you have much more experience with SCSI issues, but I
> still can't see how a damaged SCSI board would work on one motherboard
> model. Do you really think this is possible?

Certainly ... depends on the problem ... for instance, you could have a
damaged trace in the high 32 bits and not see the problem when you put
it in a 32 bit slot.  Likewise, it could be a capacitance problem on the
traces and your motherboard that works has a lower PCI bus speed.

The fact that it works in one system tends to rule out software driver
problems.  If you've tried different cables and reducing the speed, that
tends to eliminate the interconnect, so all that's left is the card (and
the motherboards, I suppose, although three failing would tend to rule
them out).

James






* Re: Adaptec 29320LPE errors on boot
  2008-03-05 16:52       ` Rodrigo Severo
  2008-03-05 17:10         ` James Bottomley
@ 2008-03-05 17:11         ` Matthew Wilcox
  2008-03-05 19:14           ` Rodrigo Severo
  1 sibling, 1 reply; 10+ messages in thread
From: Matthew Wilcox @ 2008-03-05 17:11 UTC (permalink / raw)
  To: Rodrigo Severo; +Cc: linux-scsi

On Wed, Mar 05, 2008 at 01:52:50PM -0300, Rodrigo Severo wrote:
> I changed the following settings on the 29320LPE from their previous defaults:
> 
> Sync Transfer Rate: 160

Ah, you've been fooled by SCSI terminology.  U160 is 80 million
transfers per second, but each transfer is 16-bit wide, so they call it
U160.  Can you try turning that down to 80 (and just for fun, if that
fails, can you try 40?)
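
To make the arithmetic above concrete, here is a small Python sketch relating transfer rate and bus width to throughput in MB/s. The function name `bus_throughput_mb_s` is illustrative and not taken from any driver, and the list of rates is just the set of values mentioned in this thread:

```python
def bus_throughput_mb_s(mega_transfers_per_s, bus_width_bits):
    """Return peak throughput in MB/s for a transfer rate (MT/s) and bus width (bits)."""
    bytes_per_transfer = bus_width_bits // 8
    return mega_transfers_per_s * bytes_per_transfer


# A wide (16-bit) bus at several transfer rates.
for rate in (160, 80, 40, 20):
    print(f"{rate:>3} MT/s x 16 bit = {bus_throughput_mb_s(rate, 16)} MB/s")
```

On a 16-bit bus, 80 million transfers per second works out to 160 MB/s, which is where the U160 name comes from; whether a given BIOS menu lists its options in transfers per second or in MB/s has to be checked on the screen itself.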

-- 
Intel are signing my paycheques ... these opinions are still mine
"Bill, look, we understand that you're interested in selling us this
operating system, but compare it to ours.  We can't possibly take such
a retrograde step."


* Re: Adaptec 29320LPE errors on boot
  2008-03-05 17:10         ` James Bottomley
@ 2008-03-05 19:14           ` Rodrigo Severo
  0 siblings, 0 replies; 10+ messages in thread
From: Rodrigo Severo @ 2008-03-05 19:14 UTC (permalink / raw)
  To: linux-scsi

On 3/5/08, James Bottomley <James.Bottomley@hansenpartnership.com> wrote:
> On Wed, 2008-03-05 at 13:52 -0300, Rodrigo Severo wrote:
>  > On 3/5/08, James Bottomley <James.Bottomley@hansenpartnership.com> wrote:
>  > > On Wed, 2008-03-05 at 13:26 -0300, Rodrigo Severo wrote:
>  > >  > On 3/5/08, Salyzyn, Mark <Mark_Salyzyn@adaptec.com> wrote:
>  > >  > > Sounds like you have a bad SCSI cable or drive.
>  > >  > >
>  > >  > >  Do any of these drives work in any other OS environment?
>  > >  >
>  > >  > This same drive with this same cable and this same terminator works
>  > >  > well with a 39160 (which uses another driver, aic7xxx) in the
>  > >  > same OS environment, so I don't think this is a cable or drive issue.
>  > >
>  > > That's only ultra 160 capable.  What happens if you take the speeds for
>  > >  the drives down in the bios of the 29320?
>  >
>  > I changed the following settings on the 29320LPE from their previous defaults:
>  >
>  > Sync Transfer Rate: 160
>  > Packetized: No
>  > QAS: No
>  >
>  > The error messages are unchanged.
>
> That does point further to card problems.
>
>  > >  > Besides, I tested with four different cables (different models and
>  > >  > manufacturers, all claiming to be U320 compatible). The problem is
>  > >  > unchanged when I use different cables and terminators.
>  > >  >
>  > >  > I also tested with several different disks (different models and
>  > >  > manufacturers). The problem is still unchanged.
>  > >
>  > > It's sounding a bit more like a damaged board ... but try reducing the
>  > >  speed in the bios.
>  >
>  > As I said before, you have much more experience with SCSI issues, but I
>  > still can't see how a damaged SCSI board would work on one motherboard
>  > model. Do you really think this is possible?
>
>
> Certainly ... depends on the problem ... for instance, you could have a
>  damaged trace in the high 32 bits and not see the problem when you put
>  it in a 32 bit slot.  Likewise, it could be a capacitance problem on the
>  traces and your motherboard that works has a lower PCI bus speed.

I had not mentioned that this board, the 29320LPE, is an x1 PCI
Express card, not a PCI-X one; please forgive me.

Back to the issue at hand: being an x1 PCI Express board, there can be
no lines that are used when it is installed on one motherboard and
unused on another, AFAICT.

Also, I am not aware of any different bus speeds for the PCI Express
bus, but I'm not a PCI Express specialist, so I may be wrong here.

>  The fact that it works in one system tends to rule out software driver
>  problems.  If you've tried different cables and reducing the speed, that
>  tends to eliminate the interconnect, so all that's left is the card (and
>  the motherboards, I suppose, although three failing would tend to rule
>  them out).

I understand your line of reasoning.

But I think this is not the case, as I gathered several reports on the
net from users with this same SCSI board who had problems with several
chipsets and managed to make it work on a few specific motherboards.
This is what's happening to me.

As far as I can tell, none of them managed to get a useful statement
from Adaptec on the subject, so I really can't tell if this is a SCSI
board problem, a chipset problem or even a driver problem. Several
tried. I did. I got no answer at all from Adaptec, only dead silence.

Anyway, I might have a broken SCSI board, but the very same fact
that convinces you that the board is malfunctioning (it works in one
motherboard and not in three others) is the kind of argument that I'm
sure Adaptec will use to argue the opposite: the SCSI board must be
working well if it works in one motherboard.

Here are some tests I already did:

On the motherboard where the PCI-e 29320LPE card worked, I managed to
copy several (>70) gigabytes of data onto disks connected to it and
then start the MySQL server that used this data. It worked for less
than an hour before I got a kernel panic. At this point I just changed
the SCSI board to a PCI-X 39160. I got another kernel panic in less
than an hour. I was using plain kernel 2.6.24.3. I went back to a
Gentoo-specific kernel 2.6.22 and the same 39160 board has been working
perfectly for more than 48 hours. The PCI-e 29320LPE panics after 15
minutes or so with this last kernel.

I didn't take note of the details of the kernel panics, but they looked
quite the same to me on both the PCI-e 29320LPE and the PCI-X 39160.
This makes me believe there are other things wrong. I would like to
iron these out before I decide the new PCI-e 29320LPE is broken. I'm
just not sure how to do it.


Rodrigo Severo


* Re: Adaptec 29320LPE errors on boot
  2008-03-05 17:11         ` Matthew Wilcox
@ 2008-03-05 19:14           ` Rodrigo Severo
  0 siblings, 0 replies; 10+ messages in thread
From: Rodrigo Severo @ 2008-03-05 19:14 UTC (permalink / raw)
  To: linux-scsi

On 3/5/08, Matthew Wilcox <matthew@wil.cx> wrote:
> On Wed, Mar 05, 2008 at 01:52:50PM -0300, Rodrigo Severo wrote:
>  > I changed the following settings on the 29320LPE from their previous defaults:
>  >
>  > Sync Transfer Rate: 160
>
>
> Ah, you've been fooled by SCSI terminology.

I don't think so.

>  U160 is 80 million
>  transfers per second,

Yes I know.

>  but each transfer is 16-bit wide,

Also know that.

>  so they call it
>  U160.

Yes I know that too.

I changed the Sync Transfer Rate from the 29320LPE default of 320
MB/Second to its second option: 160 MB/Second.

>  Can you try turning that down to 80 (and just for fun, if that
>  fails, can you try 40?)

Just for fun, I did. I tried 40 MB/Second. It took some more time
(around 30 extra seconds) but then I got the same error messages.


Rodrigo Severo

