* PROBLEM: RedHat 7.2 Stock SMP Install
@ 2002-05-22 14:38 Rose, Billy
2002-05-22 15:31 ` Doug McNaught
0 siblings, 1 reply; 4+ messages in thread
From: Rose, Billy @ 2002-05-22 14:38 UTC
To: linux-kernel
I have an HP LPr Dual P-III 550 with 2 18G SCSI drives that locked up. No
response at the console, ssh failed, and ping never answered. Looking in
/var/log/messages shows no entries at or even near the time of lock up.
Some info from ver_linux and the /proc fs is included below. Production use:
Tux web server for static content, Apache/PHP for simple variable
substitution in .php pages. System runs Samba 2.2.1a for connection from
clients to store web content. How can I find more info on the lockup?
[root@warnock root]# ./ver_linux
If some fields are empty or look unusual you may have an old version.
Compare to the current minimal requirements in Documentation/Changes.
Linux warnock 2.4.7-10smp #1 SMP Thu Sep 6 17:09:31 EDT 2001 i686 unknown
Gnu C 2.96
Gnu make 3.79.1
binutils 2.11.90.0.8
util-linux 2.11f
mount 2.11g
modutils 2.4.6
e2fsprogs 1.23
reiserfsprogs 3.x.0j
Linux C Library 2.2.4
Dynamic linker (ldd) 2.2.4
Procps 2.0.7
Net-tools 1.60
Console-tools 0.3.3
Sh-utils 2.0.11
Modules Loaded ide-scsi nfsd lockd sunrpc tux autofs eepro100
usb-uhci usbcore ext3 jbd sym53c8xx sd_mod scsi_mod
[root@warnock root]# cat /proc/cpuinfo
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 7
model name : Pentium III (Katmai)
stepping : 3
cpu MHz : 549.074
cache size : 512 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca
cmov pat pse36 mmx fxsr sse
bogomips : 1094.45
processor : 1
vendor_id : GenuineIntel
cpu family : 6
model : 7
model name : Pentium III (Katmai)
stepping : 3
cpu MHz : 549.074
cache size : 512 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca
cmov pat pse36 mmx fxsr sse
bogomips : 1097.72
[root@warnock root]# cat /proc/modules
ide-scsi 8352 0
nfsd 71808 8 (autoclean)
lockd 53744 1 (autoclean) [nfsd]
sunrpc 70000 1 (autoclean) [nfsd lockd]
tux 80432 3
autofs 12096 0 (autoclean) (unused)
eepro100 18144 1
usb-uhci 22880 0 (unused)
usbcore 54528 1 [usb-uhci]
ext3 67728 3
jbd 44480 3 [ext3]
sym53c8xx 57920 4
sd_mod 11584 4
scsi_mod 98512 3 [ide-scsi sym53c8xx sd_mod]
[root@warnock root]# cat /proc/ioports
0000-001f : dma1
0020-003f : pic1
0040-005f : timer
0060-006f : keyboard
0070-007f : rtc
0080-008f : dma page reg
00a0-00bf : pic2
00c0-00df : dma2
00f0-00ff : fpu
01f0-01f7 : ide0
02f8-02ff : serial(auto)
03c0-03df : vga+
03f6-03f6 : ide0
03f8-03ff : serial(auto)
0cf8-0cff : PCI conf1
1000-101f : Intel Corporation 82371AB PIIX4 USB
1000-101f : usb-uhci
1020-102f : Intel Corporation 82371AB PIIX4 IDE
1020-1027 : ide0
1040-105f : Intel Corporation 82371AB PIIX4 ACPI
8000-803f : Intel Corporation 82371AB PIIX4 ACPI
9000-9fff : PCI Bus #01
9000-90ff : Symbios Logic Inc. (formerly NCR) 53c895
9000-907f : sym53c8xx
9400-941f : Intel Corporation 82557 [Ethernet Pro 100]
9400-941f : eepro100
[root@warnock root]# cat /proc/iomem
00000000-0009ebff : System RAM
0009ec00-0009ffff : reserved
000a0000-000bffff : Video RAM area
000c0000-000c7fff : Video ROM
000c8000-000cb7ff : Extension ROM
000f0000-000fffff : System ROM
00100000-1ffeffff : System RAM
00100000-0025d2c7 : Kernel code
0025d2c8-00276bff : Kernel data
1fff0000-1ffffbff : ACPI Tables
1ffffc00-1fffffff : ACPI Non-volatile Storage
fa000000-fa000fff : Cirrus Logic GD 5446
fa100000-fa2fffff : PCI Bus #01
fa100000-fa1fffff : Intel Corporation 82557 [Ethernet Pro 100]
fa200000-fa200fff : Symbios Logic Inc. (formerly NCR) 53c895
fa201000-fa2010ff : Symbios Logic Inc. (formerly NCR) 53c895
fa500000-fa5fffff : PCI Bus #01
fa500000-fa500fff : Intel Corporation 82557 [Ethernet Pro 100]
fa500000-fa500fff : eepro100
fc000000-fdffffff : Cirrus Logic GD 5446
fec00000-fec0ffff : reserved
fee00000-fee00fff : reserved
fffe9800-ffffffff : reserved
[root@warnock root]# lspci -vvv
00:00.0 Host bridge: Intel Corporation 440BX/ZX - 82443BX/ZX Host bridge
(AGP disabled) (rev 03)
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort+ >SERR- <PERR-
Latency: 64
Region 0: Memory at <unassigned> (32-bit, prefetchable) [size=256M]
00:04.0 ISA bridge: Intel Corporation 82371AB PIIX4 ISA (rev 02)
Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Latency: 0
00:04.1 IDE interface: Intel Corporation 82371AB PIIX4 IDE (rev 01) (prog-if
80 [Master])
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Latency: 32
Region 4: I/O ports at 1020 [size=16]
00:04.2 USB Controller: Intel Corporation 82371AB PIIX4 USB (rev 01)
(prog-if 00 [UHCI])
Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Latency: 32
Interrupt: pin D routed to IRQ 19
Region 4: I/O ports at 1000 [size=32]
00:04.3 Bridge: Intel Corporation 82371AB PIIX4 ACPI (rev 02)
Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Interrupt: pin ? routed to IRQ 9
00:07.0 PCI bridge: Digital Equipment Corporation DECchip 21152 (rev 03)
(prog-if 00 [Normal decode])
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+
Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Latency: 57, cache line size 08
Bus: primary=00, secondary=01, subordinate=01, sec-latency=249
I/O behind bridge: 00009000-00009fff
Memory behind bridge: fa100000-fa2fffff
Prefetchable memory behind bridge: 00000000fa500000-00000000fa500000
BridgeCtl: Parity+ SERR+ NoISA+ VGA- MAbort- >Reset- FastB2B-
Capabilities: [dc] Power Management version 1
Flags: PMEClk- DSI- D1- D2- AuxCurrent=220mA
PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
Bridge: PM- B3+
00:0d.0 VGA compatible controller: Cirrus Logic GD 5446 (rev 45) (prog-if 00
[VGA])
Subsystem: Hewlett-Packard Company: Unknown device 0001
Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Region 0: Memory at fc000000 (32-bit, prefetchable) [size=32M]
Region 1: Memory at fa000000 (32-bit, non-prefetchable) [size=4K]
Expansion ROM at <unassigned> [disabled] [size=32K]
01:03.0 Ethernet controller: Intel Corporation 82557 [Ethernet Pro 100] (rev 05)
Subsystem: Hewlett-Packard Company Ethernet Pro 10/100TX
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr+
Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Latency: 66 (2000ns min, 14000ns max), cache line size 08
Interrupt: pin A routed to IRQ 19
Region 0: Memory at fa500000 (32-bit, prefetchable) [size=4K]
Region 1: I/O ports at 9400 [size=32]
Region 2: Memory at fa100000 (32-bit, non-prefetchable) [size=1M]
Expansion ROM at <unassigned> [disabled] [size=1M]
Capabilities: [dc] Power Management version 1
Flags: PMEClk- DSI+ D1+ D2+ AuxCurrent=0mA
PME(D0+,D1+,D2+,D3hot+,D3cold-)
Status: D0 PME-Enable+ DSel=0 DScale=0 PME-
01:04.0 SCSI storage controller: Symbios Logic Inc. (formerly NCR) 53c895
(rev 01)
Subsystem: Hewlett-Packard Company: Unknown device 1000
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr+
Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR-
Latency: 247 (7500ns min, 16000ns max), cache line size 08
Interrupt: pin A routed to IRQ 18
Region 0: I/O ports at 9000 [size=256]
Region 1: Memory at fa201000 (32-bit, non-prefetchable) [size=256]
Region 2: Memory at fa200000 (32-bit, non-prefetchable) [size=4K]
[root@warnock root]# cat /proc/scsi/scsi
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: HP Model: 18.2GB C 80-8C42 Rev:
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 01 Lun: 00
Vendor: HP Model: 18.2GB C 80-8C42 Rev:
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi1 Channel: 00 Id: 00 Lun: 00
Vendor: TEAC Model: CD-224E Rev: 1.5A
Type: CD-ROM ANSI SCSI revision: 02
[root@warnock root]# cat /proc/mounts
/dev/root / ext3 rw 0 0
/proc /proc proc rw 0 0
usbdevfs /proc/bus/usb usbdevfs rw 0 0
/dev/sda1 /boot ext3 rw 0 0
none /dev/pts devpts rw 0 0
none /dev/shm tmpfs rw 0 0
/dev/sdb1 /var ext3 rw 0 0
[root@warnock root]# cat /proc/slabinfo
slabinfo - version: 1.1 (SMP)
kmem_cache 85 85 232 5 5 1 : 252 126
ip_fib_hash 226 226 32 2 2 1 : 252 126
urb_priv 0 0 64 0 0 1 : 252 126
journal_head 276 780 48 8 10 1 : 252 126
revoke_table 252 253 12 1 1 1 : 252 126
revoke_record 113 113 32 1 1 1 : 252 126
clip_arp_cache 0 0 128 0 0 1 : 252 126
ip_mrt_cache 0 0 96 0 0 1 : 252 126
tcp_tw_bucket 594 720 128 24 24 1 : 252 126
tcp_bind_bucket 226 226 32 2 2 1 : 252 126
tcp_open_request 154 280 96 6 7 1 : 252 126
inet_peer_cache 118 118 64 2 2 1 : 252 126
ip_dst_cache 834 960 192 48 48 1 : 252 126
arp_cache 60 60 128 2 2 1 : 252 126
blkdev_requests 5438 6320 96 141 158 1 : 252 126
dnotify cache 126 169 20 1 1 1 : 252 126
file lock cache 126 126 92 3 3 1 : 252 126
fasync cache 202 202 16 1 1 1 : 252 126
uid_cache 226 226 32 2 2 1 : 252 126
skbuff_head_cache 504 504 160 21 21 1 : 252 126
sock 144 174 1280 58 58 1 : 60 30
sigqueue 261 261 132 9 9 1 : 252 126
cdev_cache 236 236 64 4 4 1 : 252 126
bdev_cache 177 177 64 3 3 1 : 252 126
mnt_cache 80 80 96 2 2 1 : 252 126
inode_cache 11493 11493 416 1277 1277 1 : 124 62
dentry_cache 12780 12780 128 426 426 1 : 252 126
dquot 0 0 128 0 0 1 : 252 126
filp 440 440 96 11 11 1 : 252 126
names_cache 23 23 4096 23 23 1 : 60 30
buffer_head 38680 38680 96 967 967 1 : 252 126
mm_struct 72 72 160 3 3 1 : 252 126
vm_area_struct 1054 1180 64 20 20 1 : 252 126
fs_cache 118 118 64 2 2 1 : 252 126
files_cache 81 81 416 9 9 1 : 124 62
signal_act 93 93 1312 31 31 1 : 60 30
size-131072(DMA) 0 0 131072 0 0 32 : 0 0
size-131072 0 0 131072 0 0 32 : 0 0
size-65536(DMA) 0 0 65536 0 0 16 : 0 0
size-65536 1 1 65536 1 1 16 : 0 0
size-32768(DMA) 0 0 32768 0 0 8 : 0 0
size-32768 0 1 32768 0 1 8 : 0 0
size-16384(DMA) 0 0 16384 0 0 4 : 0 0
size-16384 8 9 16384 8 9 4 : 0 0
size-8192(DMA) 0 0 8192 0 0 2 : 0 0
size-8192 2 3 8192 2 3 2 : 0 0
size-4096(DMA) 0 0 4096 0 0 1 : 60 30
size-4096 93 93 4096 93 93 1 : 60 30
size-2048(DMA) 0 0 2048 0 0 1 : 60 30
size-2048 236 236 2048 118 118 1 : 60 30
size-1024(DMA) 0 0 1024 0 0 1 : 124 62
size-1024 280 280 1024 70 70 1 : 124 62
size-512(DMA) 0 0 512 0 0 1 : 124 62
size-512 216 216 512 27 27 1 : 124 62
size-256(DMA) 0 0 256 0 0 1 : 252 126
size-256 360 360 256 24 24 1 : 252 126
size-128(DMA) 30 30 128 1 1 1 : 252 126
size-128 1200 1200 128 40 40 1 : 252 126
size-64(DMA) 0 0 64 0 0 1 : 252 126
size-64 295 295 64 5 5 1 : 252 126
size-32(DMA) 113 113 32 1 1 1 : 252 126
size-32 4520 4520 32 40 40 1 : 252 126
Billy Rose
wrose@loislaw.com
* Re: PROBLEM: RedHat 7.2 Stock SMP Install
2002-05-22 14:38 PROBLEM: RedHat 7.2 Stock SMP Install Rose, Billy
@ 2002-05-22 15:31 ` Doug McNaught
0 siblings, 0 replies; 4+ messages in thread
From: Doug McNaught @ 2002-05-22 15:31 UTC
To: Rose, Billy; +Cc: linux-kernel
"Rose, Billy" <wrose@loislaw.com> writes:
> I have an HP LPr Dual P-III 550 with 2 18G SCSI drives that locked up.
> Linux warnock 2.4.7-10smp #1 SMP Thu Sep 6 17:09:31 EDT 2001 i686 unknown
^^^^^^^^^^^
Try the latest errata kernel for 7.2 and see if it still happens.
There's a reason they release kernel update packages...
-Doug
* Re: PROBLEM: RedHat 7.2 Stock SMP Install
@ 2002-05-22 15:41 Ron Niles
0 siblings, 0 replies; 4+ messages in thread
From: Ron Niles @ 2002-05-22 15:41 UTC
To: linux-kernel
"Rose, Billy" <wrose@loislaw.com> wrote:
>I have an HP LPr Dual P-III 550 with 2 18G SCSI drives that locked up. No
>response at the console, ssh failed, and ping never answered.
>How can I find more info on the lockup?
When getting solid lockups like this, the NMI watchdog (see
Documentation/nmi_watchdog.txt in the kernel source) can sometimes break
you out and give an Oops to work from.
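As a rough sketch of how one might confirm that the NMI watchdog is actually ticking (for example after booting with the `nmi_watchdog=1` option described in Documentation/nmi_watchdog.txt), the Python helpers below parse the `NMI:` line of /proc/interrupts-style text. The function names and the sample text are illustrative assumptions, not something from the original post. Counters that increase on every CPU between two reads suggest the watchdog is active.

```python
def nmi_counts(interrupts_text):
    """Return the per-CPU NMI counters from /proc/interrupts-style text."""
    for line in interrupts_text.splitlines():
        fields = line.split()
        if fields and fields[0] == "NMI:":
            counts = []
            # Take the leading numeric columns (one per CPU) and stop at
            # the first non-numeric field, such as a trailing label.
            for field in fields[1:]:
                if not field.isdigit():
                    break
                counts.append(int(field))
            return counts
    return []


def watchdog_ticking(before, after):
    """True if every CPU's NMI counter increased between two samples."""
    return (
        bool(before)
        and len(before) == len(after)
        and all(b < a for b, a in zip(before, after))
    )


# Hypothetical samples standing in for two reads of /proc/interrupts
# taken a few seconds apart on a dual-CPU machine.
FIRST = "           CPU0       CPU1\nNMI:       1000       1002\n"
SECOND = "           CPU0       CPU1\nNMI:       1500       1498\n"

print(watchdog_ticking(nmi_counts(FIRST), nmi_counts(SECOND)))
```

On a live system the two samples would come from reading /proc/interrupts twice with a short sleep in between. If the counters stay at zero, the watchdog is probably not enabled.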
>Attached devices:
>Host: scsi0 Channel: 00 Id: 00 Lun: 00
> Vendor: HP Model: 18.2GB C 80-8C42 Rev:
> Type: Direct-Access ANSI SCSI revision: 02
>Host: scsi0 Channel: 00 Id: 01 Lun: 00
> Vendor: HP Model: 18.2GB C 80-8C42 Rev:
> Type: Direct-Access ANSI SCSI revision: 02
Multiple SCSI devices on the same bus often have termination problems, which
can result in SCSI bus lock-up or even data integrity problems. Make sure
you understand the finer points of SCSI device termination. You might want
to try running just one drive off the card and see if the system stabilizes.
>Modules Loaded ide-scsi nfsd lockd sunrpc tux autofs eepro100
>usb-uhci usbcore ext3 jbd sym53c8xx sd_mod scsi_mod
Although it's probably not related to your lockup, I have had problems with
the eepro100 driver, most notably memory leaks. It is worth using the updated
e100 Linux driver from the Intel website instead. Note that the driver name
changes from eepro100 to e100.
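To check which of the two Intel NIC drivers a machine is currently using, something like the following Python sketch could be run against the contents of /proc/modules. The helper names are hypothetical. The sample lines are copied from the /proc/modules listing in the original post.

```python
def loaded_modules(proc_modules_text):
    """Return the set of module names from /proc/modules-style text."""
    names = set()
    for line in proc_modules_text.splitlines():
        fields = line.split()
        if fields:
            # The module name is always the first column.
            names.add(fields[0])
    return names


def intel_nic_drivers(proc_modules_text):
    """List which of eepro100 / e100 appear among the loaded modules."""
    names = loaded_modules(proc_modules_text)
    return [m for m in ("eepro100", "e100") if m in names]


# Sample taken from the /proc/modules output quoted in this thread.
SAMPLE = """\
tux 80432 3
autofs 12096 0 (autoclean) (unused)
eepro100 18144 1
usb-uhci 22880 0 (unused)
usbcore 54528 1 [usb-uhci]
"""

print(intel_nic_drivers(SAMPLE))
```

On a real system, pass the result of `open("/proc/modules").read()` instead of `SAMPLE`. After switching drivers one would expect `e100` to be listed in place of `eepro100`.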
* RE: PROBLEM: RedHat 7.2 Stock SMP Install
@ 2002-05-22 16:10 Rose, Billy
0 siblings, 0 replies; 4+ messages in thread
From: Rose, Billy @ 2002-05-22 16:10 UTC
To: 'Doug McNaught'; +Cc: linux-kernel
> -----Original Message-----
> From: Doug McNaught [mailto:doug@wireboard.com]
> Sent: Wednesday, May 22, 2002 10:32 AM
> To: Rose, Billy
> Cc: linux-kernel@vger.kernel.org
> Subject: Re: PROBLEM: RedHat 7.2 Stock SMP Install
>
>
> "Rose, Billy" <wrose@loislaw.com> writes:
>
> > I have an HP LPr Dual P-III 550 with 2 18G SCSI drives that locked up.
>
> > Linux warnock 2.4.7-10smp #1 SMP Thu Sep 6 17:09:31 EDT 2001 i686 unknown
> ^^^^^^^^^^^
> Try the latest errata kernel for 7.2 and see if it still happens.
> There's a reason they release kernel update packages...
I looked on RH before sending the original post. There's a reason you don't
upgrade a live RedHat machine running Tux for (what is listed to be)
security fixes that do not pertain to a particular machine, without more
knowledge of the crash. I have taken your advice (and Mr. Benjamin
LaHaise's) and upgraded a _sandbox_ machine to see how Tux will behave
during and after the upgrade. With Tux, simply upgrading/installing a new
kernel is not easy as there are incompatibilities between different versions
of the userspace and kernel space portions when version numbers are in a
mixed state (i.e. the upgrade hoses everything and you fall back to the
known kernel). Installing/upgrading to the latest errata kernel requires
Tux's userspace stuff to be upgraded first... If the new kernel dies, my old
kernel will still boot, but Tux may not work (the primary job of said
server) because the userspace stuff is now incompatible. FYI, the machine in
question typically handles 4X the requests of its brethren NT machines next
to it -- hats off to all you guys!
Billy Rose
wrose@loislaw.com
> -Doug
>