public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
* kernel panics and errors on lsi 1064e
@ 2009-03-20 10:44 Dimitris Zilaskos
  2009-03-20 15:27 ` James Bottomley
  0 siblings, 1 reply; 12+ messages in thread
From: Dimitris Zilaskos @ 2009-03-20 10:44 UTC (permalink / raw)
  To: linux-scsi


Hi,

I was having problems with two nodes rhel4 x86_64 compatible nodes with
this:

08:00.0 SCSI storage controller: LSI Logic / Symbios Logic SAS1064E
PCI-Express Fusion-MPT SAS (rev 04)

the nodes would panic after doing some task (download a few gigabytes 
from net and run a few computations)

screenshots of two panics

http://img10.imageshack.us/img10/3184/camxgemspanic.jpg
http://img10.imageshack.us/img10/174/wn024.jpg


Prior to the panic the systems would be up for couple of hours to a couple
of days and log this when say a gzip was running:

Mar  5 16:19:30 wn023.grid.auth.gr kernel: mptscsi: ioc0: attempting task
abort! (sc=000001019199d4c0)
Mar  5 16:19:30 wn023.grid.auth.gr kernel: scsi7 : destination target 11,
lun 0
Mar  5 16:19:30 wn023.grid.auth.gr kernel:         command = Write (10) 00
01 cd ab d3 00 01 40 00
Mar  5 16:19:30 wn023.grid.auth.gr kernel: mptbase: ioc0: IOCStatus=8000
LogInfo=31120403 Originator={PL}, Code={Abort}, SubCode(0x0403)
Mar  5 16:19:30 wn023.grid.auth.gr kernel: mptbase: ioc0: IOCStatus=8048
LogInfo=31140000 Originator={PL}, Code={IO Executed}, SubCode(0x0000)
Mar  5 16:19:30 wn023.grid.auth.gr kernel: mptscsi: ioc0: task abort:
SUCCESS (sc=000001019199d4c0)
Mar  5 16:19:30 wn023.grid.auth.gr kernel: mptbase: ioc0: IOCStatus=804b
LogInfo=31120403 Originator={PL}, Code={Abort}, SubCode(0x0403)
Mar  5 16:19:30 wn023.grid.auth.gr kernel: mptscsi: ioc0: attempting task
abort! (sc=0000010024283d00)
Mar  5 16:19:30 wn023.grid.auth.gr kernel: scsi7 : destination target 11,
lun 0
Mar  5 16:19:30 wn023.grid.auth.gr kernel:         command = Write (10) 00
01 cd ad 13 00 01 40 00
Mar  5 16:19:30 wn023.grid.auth.gr kernel: mptscsi: ioc0: attempting task
abort! (sc=0000010102db4ac0)
Mar  5 16:19:30 wn023.grid.auth.gr kernel: scsi7 : destination target 11,
lun 0
Mar  5 16:19:30 wn023.grid.auth.gr kernel:         command = Write (10) 00
01 cd ae 53 00 01 40 00
Mar  5 16:19:30 wn023.grid.auth.gr kernel: mptscsi: ioc0: attempting task
abort! (sc=0000010102db4cc0)
Mar  5 16:19:30 wn023.grid.auth.gr kernel: scsi7 : destination target 11,
lun 0
Mar  5 16:19:30 wn023.grid.auth.gr kernel:         command = Write (10) 00
01 cd af 93 00 01 40 00
Mar  5 16:19:30 wn023.grid.auth.gr kernel: mptscsi: ioc0: attempting task
abort! (sc=0000010102db40c0)


Memtest for days was running ok.

I found this: https://bugzilla.redhat.com/show_bug.cgi?id=208033

and I upgraded my firmware from
http://downloadcenter.intel.com/filter_results.aspx?strTypes=all&ProductID=2
487&OSFullName=OS+Independent&lang=eng&strOSs=38&submit=Go

After the upgrade the systems don't seem to panic. But they log this


mptbase: ioc0: IOCStatus=8000 LogInfo=31123000 Originator={PL},
Code={Abort}, SubCode(0x3000)
mptbase: ioc0: IOCStatus=804b LogInfo=31123000 Originator={PL},
Code={Abort}, SubCode(0x3000)
mptbase: ioc0: IOCStatus=804b LogInfo=31123000 Originator={PL},
Code={Abort}, SubCode(0x3000)
mptbase: ioc0: IOCStatus=804b LogInfo=31123000 Originator={PL},
Code={Abort}, SubCode(0x3000)
mptbase: ioc0: IOCStatus=804b LogInfo=31123000 Originator={PL},
Code={Abort}, SubCode(0x3000)

lspci
00:00.0 Host bridge: Intel Corporation 5000V Chipset Memory Controller Hub
(rev b1)
00:02.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express x8
Port 2-3 (rev b1)
00:03.0 PCI bridge: Intel Corporation 5000 Series Chipset PCI Express x4
Port 3 (rev b1)
00:08.0 System peripheral: Intel Corporation 5000 Series Chipset DMA Engine
(rev b1)
00:10.0 Host bridge: Intel Corporation 5000 Series Chipset FSB Registers
(rev b1)
00:10.1 Host bridge: Intel Corporation 5000 Series Chipset FSB Registers
(rev b1)
00:10.2 Host bridge: Intel Corporation 5000 Series Chipset FSB Registers
(rev b1)
00:11.0 Host bridge: Intel Corporation 5000 Series Chipset Reserved
Registers (rev b1)
00:13.0 Host bridge: Intel Corporation 5000 Series Chipset Reserved
Registers (rev b1)
00:15.0 Host bridge: Intel Corporation 5000 Series Chipset FBD Registers
(rev b1)
00:16.0 Host bridge: Intel Corporation 5000 Series Chipset FBD Registers
(rev b1)
00:1c.0 PCI bridge: Intel Corporation 631xESB/632xESB/3100 Chipset PCI
Express Root Port 1 (rev 09)
00:1d.0 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI
USB Controller #1 (rev 09)
00:1d.1 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI
USB Controller #2 (rev 09)
00:1d.2 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI
USB Controller #3 (rev 09)
00:1d.3 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI
USB Controller #4 (rev 09)
00:1d.7 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset EHCI
USB2 Controller (rev 09)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d9)
00:1f.0 ISA bridge: Intel Corporation 631xESB/632xESB/3100 Chipset LPC
Interface Controller (rev 09)
00:1f.1 IDE interface: Intel Corporation 631xESB/632xESB IDE Controller (rev
09)
00:1f.2 SATA controller: Intel Corporation 631xESB/632xESB SATA AHCI
Controller (rev 09)
00:1f.3 SMBus: Intel Corporation 631xESB/632xESB/3100 Chipset SMBus
Controller (rev 09)
01:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Upstream
Port (rev 01)
01:00.3 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express to PCI-X
Bridge (rev 01)
02:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Downstream
Port E1 (rev 01)
02:01.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Downstream
Port E2 (rev 01)
02:02.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Downstream
Port E3 (rev 01)
05:00.0 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit Ethernet
Controller (Copper) (rev 01)
05:00.1 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit Ethernet
Controller (Copper) (rev 01)
08:00.0 SCSI storage controller: LSI Logic / Symbios Logic SAS1064E
PCI-Express Fusion-MPT SAS (rev 04)
09:0c.0 VGA compatible controller: ATI Technologies Inc ES1000 (rev 02)

[root@wn024 log]# cat /proc/mpt/summary
ioc0: LSISAS1064E, FwRev=011b0000h, Ports=1, MaxQ=268, IRQ=169
[root@wn024 log]# cat /proc/mpt/version
mptlinux-3.12.19.00rh
   Fusion MPT base driver
   Fusion MPT SPI host driver
   Fusion MPT SAS host driver

uname -an
Linux wn024.grid.auth.gr 2.6.9-78.0.13.ELlargesmp #1 SMP Wed Jan 14 14:20:39
CST 2009 x86_64 x86_64 x86_64 GNU/Linux


Is something broken here? I am close to ask for the systems to be replaced.


Cheers,

-- 
=============================================================================
Dimitris Zilaskos
GridAUTH Operations Centre @ Aristotle University of Thessaloniki , Greece
Tel: +302310998988 Fax: +302310994309
http://www.grid.auth.gr
=============================================================================

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2009-04-29 12:15 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-03-20 10:44 kernel panics and errors on lsi 1064e Dimitris Zilaskos
2009-03-20 15:27 ` James Bottomley
2009-03-20 20:11   ` Dimitris Zilaskos
2009-03-23  8:55     ` Desai, Kashyap
2009-03-23 18:01       ` Dimitris Zilaskos
2009-03-23 19:33       ` Dimitris Zilaskos
2009-03-23 19:48         ` Desai, Kashyap
2009-03-23 20:07           ` Dimitris Zilaskos
2009-03-24  9:10           ` Dimitris Zilaskos
2009-03-24  9:34             ` Desai, Kashyap
2009-04-07 10:56               ` Dimitris Zilaskos
2009-04-29 12:13               ` Dimitris Zilaskos

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox