* megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
@ 2004-02-27 14:46 Manfred.Herrmann
2004-02-27 16:11 ` Ian Pratt
` (2 more replies)
0 siblings, 3 replies; 18+ messages in thread
From: Manfred.Herrmann @ 2004-02-27 14:46 UTC (permalink / raw)
To: xen-devel
[-- Attachment #1: Type: text/plain, Size: 836 bytes --]
Test results from the XEN1.0 and XEN1.2 democd show the following:
XEN1.0 democd default boot
o.k. ... but no smp
XEN1.2 democd default boot 2.4.25
stop at ... megaraid scanning raidchannel with error:
scsi0: scanning virtual channel0 for scsi devices.
scsi_wait_req: still waiting...!
scsi_wait_req: still waiting...!
scsi_wait_req: still waiting...!
scsi_wait_req: still waiting...!
(... endless)
XEN1.2 democd safemode
o.k. ... but no smp
XEN1.2 democd default boot 2.4.25
o.k. with xen-kernel parameter ignorebiostables ... but no smp
XEN1.2 democd ... linux 2.4.22
o.k. (with smp)
XEN1.2 democd ... linux 2.4.24
XEN1.2 democd ... linux 2.4.25
stop at ... VFS: Cannot open root device "ram0" or 01:00
i prepared two LH6000/4CPU/4GB for XEN-production :o(
so what next?
Manni
[-- Attachment #2: Type: text/html, Size: 1802 bytes --]
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 14:46 megaraid problem on hp netserver lh6000 - ignorebiostables o.k Manfred.Herrmann
@ 2004-02-27 16:11 ` Ian Pratt
2004-02-27 16:46 ` Manfred.Herrmann
2004-02-27 16:57 ` Keir Fraser
2004-02-28 3:13 ` Stephen Evanchik
2 siblings, 1 reply; 18+ messages in thread
From: Ian Pratt @ 2004-02-27 16:11 UTC (permalink / raw)
To: Manfred.Herrmann; +Cc: xen-devel, Ian.Pratt

> i prepared two LH6000/4CPU/4GB for XEN-production :o(
> so what next?

> XEN1.2 democd ... linux 2.4.24

Please boot linux and send us the output of dmesg, "lspci -v" and
"cat /proc/cpuinfo"

It sounds like the BIOS is passing in some weird stuff in the
tables. dmesg output may help us pin this down. It might be worth
doing a BIOS upgrade if one is available.

We run Xen on HP DL360's but have never seen a netserver lh6000.

Ian

-------------------------------------------------------
SF.Net is sponsored by: Speed Start Your Linux Apps Now.
Build and deploy apps & Web services for Linux with
a free DVD software kit from IBM. Click Now!
http://ads.osdn.com/?ad_id=1356&alloc_id=3438&op=click
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 16:11 ` Ian Pratt
@ 2004-02-27 16:46 ` Manfred.Herrmann
2004-02-27 17:46 ` Ian Pratt
0 siblings, 1 reply; 18+ messages in thread
From: Manfred.Herrmann @ 2004-02-27 16:46 UTC (permalink / raw)
Cc: Ian.Pratt, xen-devel
[-- Attachment #1.1: Type: text/plain, Size: 671 bytes --]

>
> > i prepared two LH6000/4CPU/4GB for XEN-production :o(
> > so what next?
>
> > XEN1.2 democd ... linux 2.4.24
>
> Please boot linux and send us the output of dmesg, "lspci -v" and
> "cat /proc/cpuinfo"
>
> It sounds like the BIOS is passing in some weird stuff in the
> tables. dmesg output may help us pin this down. It might be worth
> doing a BIOS upgrade if one is available.

the latest BIOS/Firmware-upgrades are done
BIOS 4.06.45S
MngmtCtrl. E.10.44
Netraid B.02.04 E.01.10

>
> We run Xen on HP DL360's but have never seen a netserver lh6000.
>
>
> Ian

one more debug info: same error for Adaptec7880 and megaraid

[-- Attachment #1.2: Type: text/html, Size: 1200 bytes --]
[-- Attachment #2: proc-cpuinfo.txt --]
[-- Type: text/plain, Size: 1988 bytes --]

processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 7
model name : Pentium III (Katmai)
stepping : 3
cpu MHz : 550.084
cache size : 1024 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips : 1097.72

processor : 1
vendor_id : GenuineIntel
cpu family : 6
model : 7
model name : Pentium III (Katmai)
stepping : 3
cpu MHz : 550.084
cache size : 1024 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips : 1097.72

processor : 2
vendor_id : GenuineIntel
cpu family : 6
model : 7
model name : Pentium III (Katmai)
stepping : 3
cpu MHz : 550.084
cache size : 1024 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips : 1097.72

processor : 3
vendor_id : GenuineIntel
cpu family : 6
model : 7
model name : Pentium III (Katmai)
stepping : 3
cpu MHz : 550.084
cache size : 1024 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips : 1097.72

processor : 4
vendor_id : GenuineIntel
cpu family : 6
model : 7
model name : Pentium III (Katmai)
stepping : 3
cpu MHz : 550.084
cache size : 1024 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips : 1097.72

[-- Attachment #3: lspci-v.txt --]
[-- Type: text/plain, Size: 4533 bytes --]

00:00.0 Host bridge: ServerWorks CNB20HE Host Bridge (rev 21)
	Flags: fast devsel
00:00.1 Host bridge: ServerWorks CNB20HE Host Bridge (rev 01)
	Flags: bus master, medium devsel, latency 64
00:00.2 Host bridge: ServerWorks: Unknown device 0006
	Flags: medium devsel
00:00.3 Host bridge: ServerWorks: Unknown device 0006
	Flags: medium devsel
00:06.0 Ethernet controller: Intel Corp.
82557/8/9 [Ethernet Pro 100] (rev 08) Subsystem: Hewlett-Packard Company NetServer 10/100TX Flags: bus master, medium devsel, latency 64, IRQ 18 Memory at f0801000 (32-bit, non-prefetchable) [size=4K] I/O ports at 1800 [size=64] Memory at f0900000 (32-bit, non-prefetchable) [size=1M] Capabilities: [dc] Power Management version 2 00:07.0 VGA compatible controller: ATI Technologies Inc 3D Rage IIC (rev 7a) (prog-if 00 [VGA]) Subsystem: Hewlett-Packard Company: Unknown device 10c4 Flags: bus master, stepping, medium devsel, latency 66 Memory at f1000000 (32-bit, non-prefetchable) [size=16M] I/O ports at 1400 [size=256] Memory at f0802000 (32-bit, non-prefetchable) [size=4K] Expansion ROM at <unassigned> [disabled] [size=128K] Capabilities: [5c] Power Management version 1 00:0f.0 ISA bridge: ServerWorks OSB4 South Bridge (rev 4f) Subsystem: ServerWorks OSB4 South Bridge Flags: bus master, medium devsel, latency 0 00:0f.1 IDE interface: ServerWorks OSB4 IDE Controller (prog-if 8a [Master SecP PriP]) Flags: bus master, medium devsel, latency 32 I/O ports at 1840 [size=16] 04:02.0 System peripheral: Hewlett-Packard Company NetServer PCI Hot-Plug Controller (rev 0d) Subsystem: Hewlett-Packard Company: Unknown device 0001 Flags: slow devsel, IRQ 16 I/O ports at 4000 [size=256] 04:02.1 InfiniBand: Hewlett-Packard Company NetServer SMIC Controller (rev 09) Subsystem: Hewlett-Packard Company: Unknown device 0001 Flags: slow devsel, IRQ 17 I/O ports at 4400 [size=256] 04:03.0 PCI bridge: Intel Corp. 80960RP [i960 RP Microprocessor/Bridge] (rev 01) (prog-if 00 [Normal decode]) Flags: bus master, medium devsel, latency 64 Bus: primary=04, secondary=05, subordinate=05, sec-latency=64 I/O behind bridge: 00005000-00005fff Memory behind bridge: f5100000-f51fffff Capabilities: [68] Power Management version 2 04:03.1 I2O: Intel Corp. 
80960RP [i960RP Microprocessor] (rev 01) (prog-if 01) Subsystem: Hewlett-Packard Company MegaRAID, Integrated HP NetRAID Flags: bus master, fast Back2Back, medium devsel, latency 64, IRQ 20 Memory at f8000000 (32-bit, prefetchable) [size=32M] Expansion ROM at <unassigned> [disabled] [size=32K] Capabilities: [80] Power Management version 2 04:06.0 Ethernet controller: 3Com Corporation 3c905C-TX/TX-M [Tornado] (rev 74) Subsystem: 3Com Corporation 3C905C-TX Fast Etherlink for PC Management NIC Flags: bus master, medium devsel, latency 64, IRQ 21 I/O ports at 4800 [size=128] Memory at f5000000 (32-bit, non-prefetchable) [size=128] Expansion ROM at <unassigned> [disabled] [size=128K] Capabilities: [dc] Power Management version 2 05:02.0 Ethernet controller: 3Com Corporation 3c905C-TX/TX-M [Tornado] (rev 74) Subsystem: 3Com Corporation 3C905C-TX Fast Etherlink for PC Management NIC Flags: bus master, medium devsel, latency 64, IRQ 22 I/O ports at 5000 [size=128] Memory at f5108000 (32-bit, non-prefetchable) [size=128] Expansion ROM at <unassigned> [disabled] [size=128K] Capabilities: [dc] Power Management version 2 05:03.0 Ethernet controller: 3Com Corporation 3c905C-TX/TX-M [Tornado] (rev 74) Subsystem: 3Com Corporation 3C905C-TX Fast Etherlink for PC Management NIC Flags: bus master, medium devsel, latency 64, IRQ 24 I/O ports at 5080 [size=128] Memory at f5108400 (32-bit, non-prefetchable) [size=128] Expansion ROM at <unassigned> [disabled] [size=128K] Capabilities: [dc] Power Management version 2 05:04.0 Ethernet controller: 3Com Corporation 3c905C-TX/TX-M [Tornado] (rev 74) Subsystem: 3Com Corporation 3C905C-TX Fast Etherlink for PC Management NIC Flags: bus master, medium devsel, latency 64, IRQ 23 I/O ports at 5400 [size=128] Memory at f5108800 (32-bit, non-prefetchable) [size=128] Expansion ROM at <unassigned> [disabled] [size=128K] Capabilities: [dc] Power Management version 2 05:08.0 System peripheral: Hewlett-Packard Company NetServer Smart IRQ Router (rev a0) 
Subsystem: Hewlett-Packard Company: Unknown device 0001 Flags: slow devsel Memory at f5100000 (32-bit, non-prefetchable) [size=32K] [-- Attachment #4: dmesg.txt --] [-- Type: text/plain, Size: 15645 bytes --] /O APIC #6 Version 17 at 0xFEC01000. Enabling APIC mode: Flat. Using 2 I/O APICs Processors: 5 Kernel command line: root=/dev/ram0 rw init=/linuxrc mem=3940352K Initializing CPU#0 Detected 550.084 MHz processor. Console: colour VGA+ 80x25 Calibrating delay loop... 1097.72 BogoMIPS Memory: 901288k/917504k available (2832k kernel code, 15824k reserved, 947k data, 392k init, 0k highmem) Dentry cache hash table entries: 131072 (order: 8, 1048576 bytes) Inode cache hash table entries: 65536 (order: 7, 524288 bytes) Mount cache hash table entries: 512 (order: 0, 4096 bytes) Buffer cache hash table entries: 65536 (order: 6, 262144 bytes) Page-cache hash table entries: 262144 (order: 8, 1048576 bytes) CPU: L1 I cache: 16K, L1 D cache: 16K CPU: L2 cache: 1024K CPU serial number disabled. Intel machine check architecture supported. Intel machine check reporting enabled on CPU#0. CPU: After generic, caps: 0383fbff 00000000 00000000 00000000 CPU: Common caps: 0383fbff 00000000 00000000 00000000 Enabling fast FPU save and restore... done. Enabling unmasked SIMD FPU exception support... done. Checking 'hlt' instruction... OK. POSIX conformance testing by UNIFIX CPU: L1 I cache: 16K, L1 D cache: 16K CPU: L2 cache: 1024K Intel machine check reporting enabled on CPU#0. CPU: After generic, caps: 0383fbff 00000000 00000000 00000000 CPU: Common caps: 0383fbff 00000000 00000000 00000000 CPU0: Intel Pentium III (Katmai) stepping 03 per-CPU timeslice cutoff: 2925.11 usecs. enabled ExtINT on CPU#0 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Booting processor 1/0 eip 2000 Initializing CPU#1 masked ExtINT on CPU#1 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Calibrating delay loop... 
1097.72 BogoMIPS CPU: L1 I cache: 16K, L1 D cache: 16K CPU: L2 cache: 1024K CPU serial number disabled. Intel machine check reporting enabled on CPU#1. CPU: After generic, caps: 0383fbff 00000000 00000000 00000000 CPU: Common caps: 0383fbff 00000000 00000000 00000000 CPU1: Intel Pentium III (Katmai) stepping 03 Booting processor 2/1 eip 2000 Initializing CPU#2 masked ExtINT on CPU#2 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Calibrating delay loop... 1097.72 BogoMIPS CPU: L1 I cache: 16K, L1 D cache: 16K CPU: L2 cache: 1024K CPU serial number disabled. Intel machine check reporting enabled on CPU#2. CPU: After generic, caps: 0383fbff 00000000 00000000 00000000 CPU: Common caps: 0383fbff 00000000 00000000 00000000 CPU2: Intel Pentium III (Katmai) stepping 03 Booting processor 3/2 eip 2000 Initializing CPU#3 masked ExtINT on CPU#3 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Calibrating delay loop... 1097.72 BogoMIPS CPU: L1 I cache: 16K, L1 D cache: 16K CPU: L2 cache: 1024K CPU serial number disabled. Intel machine check reporting enabled on CPU#3. CPU: After generic, caps: 0383fbff 00000000 00000000 00000000 CPU: Common caps: 0383fbff 00000000 00000000 00000000 CPU3: Intel Pentium III (Katmai) stepping 03 Booting processor 4/4 eip 2000 Initializing CPU#4 masked ExtINT on CPU#4 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Calibrating delay loop... 1097.72 BogoMIPS CPU: L1 I cache: 16K, L1 D cache: 16K CPU: L2 cache: 1024K CPU serial number disabled. Intel machine check reporting enabled on CPU#4. CPU: After generic, caps: 0383fbff 00000000 00000000 00000000 CPU: Common caps: 0383fbff 00000000 00000000 00000000 CPU4: Intel Pentium III (Katmai) stepping 03 Total of 5 processors activated (5488.64 BogoMIPS). ENABLING IO-APIC IRQs Setting 3 in the phys_id_present_map ...changing IO-APIC physical APIC ID to 3 ... ok. 
Setting 6 in the phys_id_present_map ...changing IO-APIC physical APIC ID to 6 ... ok. init IO_APIC IRQs IO-APIC (apicid-pin) 3-0, 6-3, 6-13, 6-14, 6-15 not connected. ..TIMER: vector=0x31 pin1=-1 pin2=0 ...trying to set up timer (IRQ0) through the 8259A ... ..... (found pin 0) ...works. number of MP IRQ sources: 40. number of IO-APIC #3 registers: 16. number of IO-APIC #6 registers: 16. testing the IO APIC....................... IO APIC #3...... .... register #00: 03000000 ....... : physical APIC id: 03 ....... : Delivery Type: 0 ....... : LTS : 0 .... register #01: 000F0011 ....... : max redirection entries: 000F ....... : PRQ implemented: 0 ....... : IO APIC version: 0011 .... register #02: 00000000 ....... : arbitration: 00 .... IRQ redirection table: NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect: 00 01F 0F 0 0 0 0 0 1 1 31 01 01F 0F 0 0 0 0 0 1 1 39 02 000 00 1 0 0 0 0 0 0 00 03 01F 0F 0 0 0 0 0 1 1 41 04 01F 0F 0 0 0 0 0 1 1 49 05 01F 0F 1 1 0 1 0 1 1 51 06 01F 0F 0 0 0 0 0 1 1 59 07 01F 0F 0 0 0 0 0 1 1 61 08 01F 0F 0 0 0 0 0 1 1 69 09 01F 0F 1 1 0 1 0 1 1 71 0a 01F 0F 1 1 0 1 0 1 1 79 0b 01F 0F 1 1 0 1 0 1 1 81 0c 01F 0F 0 0 0 0 0 1 1 89 0d 01F 0F 0 0 0 0 0 1 1 91 0e 01F 0F 0 0 0 0 0 1 1 99 0f 01F 0F 0 0 0 0 0 1 1 A1 IO APIC #6...... .... register #00: 06000000 ....... : physical APIC id: 06 ....... : Delivery Type: 0 ....... : LTS : 0 .... register #01: 000F0011 ....... : max redirection entries: 000F ....... : PRQ implemented: 0 ....... : IO APIC version: 0011 .... register #02: 01000000 ....... : arbitration: 01 .... 
IRQ redirection table: NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect: 00 01F 0F 1 1 0 1 0 1 1 A9 01 01F 0F 1 1 0 1 0 1 1 B1 02 01F 0F 1 1 0 1 0 1 1 B9 03 000 00 1 0 0 0 0 0 0 00 04 01F 0F 1 1 0 1 0 1 1 C1 05 01F 0F 1 1 0 1 0 1 1 C9 06 01F 0F 1 1 0 1 0 1 1 D1 07 01F 0F 1 1 0 1 0 1 1 D9 08 01F 0F 1 1 0 1 0 1 1 E1 09 01F 0F 1 1 0 1 0 1 1 E9 0a 01F 0F 1 1 0 1 0 1 1 32 0b 01F 0F 1 1 0 1 0 1 1 3A 0c 01F 0F 1 1 0 1 0 1 1 42 0d 000 00 1 0 0 0 0 0 0 00 0e 000 00 1 0 0 0 0 0 0 00 0f 000 00 1 0 0 0 0 0 0 00 IRQ to pin mappings: IRQ0 -> 0:0 IRQ1 -> 0:1 IRQ2 -> 0:2 IRQ3 -> 0:3 IRQ4 -> 0:4 IRQ5 -> 0:5 IRQ6 -> 0:6 IRQ7 -> 0:7 IRQ8 -> 0:8 IRQ9 -> 0:9 IRQ10 -> 0:10 IRQ11 -> 0:11 IRQ12 -> 0:12 IRQ13 -> 0:13 IRQ14 -> 0:14 IRQ15 -> 0:15 IRQ16 -> 1:0 IRQ17 -> 1:1 IRQ18 -> 1:2 IRQ20 -> 1:4 IRQ21 -> 1:5 IRQ22 -> 1:6 IRQ23 -> 1:7 IRQ24 -> 1:8 IRQ25 -> 1:9 IRQ26 -> 1:10 IRQ27 -> 1:11 IRQ28 -> 1:12 .................................... done. Using local APIC timer interrupts. calibrating APIC timer ... ..... CPU clock speed is 550.0540 MHz. ..... host bus clock speed is 100.0097 MHz. cpu: 0, clocks: 1000097, slice: 166682 CPU0<T0:1000096,T1:833408,D:6,S:166682,C:1000097> cpu: 1, clocks: 1000097, slice: 166682 cpu: 2, clocks: 1000097, slice: 166682 cpu: 3, clocks: 1000097, slice: 166682 cpu: 4, clocks: 1000097, slice: 166682 CPU2<T0:1000096,T1:500048,D:2,S:166682,C:1000097> CPU1<T0:1000096,T1:666720,D:12,S:166682,C:1000097> CPU3<T0:1000096,T1:333360,D:8,S:166682,C:1000097> CPU4<T0:1000096,T1:166672,D:14,S:166682,C:1000097> checking TSC synchronization across CPUs: passed. 
Waiting on wait_init_idle (map = 0x1e) All processors have done init_idle PCI: PCI BIOS revision 2.10 entry at 0xfd989, last bus=7 PCI: Using configuration type 1 PCI: Probing PCI hardware PCI: Probing PCI hardware (bus 00) PCI: Discovered primary peer bus 01 [IRQ] PCI: Discovered primary peer bus 04 [IRQ] PCI: Using IRQ router ServerWorks [1166/0200] at 00:0f.0 PCI->APIC IRQ transform: (B0,I6,P0) -> 18 PCI->APIC IRQ transform: (B4,I2,P0) -> 16 PCI->APIC IRQ transform: (B4,I2,P1) -> 17 PCI->APIC IRQ transform: (B4,I3,P0) -> 20 PCI->APIC IRQ transform: (B4,I6,P0) -> 21 PCI->APIC IRQ transform: (B5,I2,P0) -> 22 PCI->APIC IRQ transform: (B5,I3,P0) -> 24 PCI->APIC IRQ transform: (B5,I4,P0) -> 23 isapnp: Scanning for PnP cards... isapnp: No Plug & Play device found Linux NET4.0 for Linux 2.4 Based upon Swansea University Computer Society NET3.039 Initializing RT netlink socket Starting kswapd Journalled Block Device driver loaded Installing knfsd (copyright (C) 1996 okir@monad.swb.de). pty: 256 Unix98 ptys configured Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI ISAPNP enabled ttyS00 at 0x03f8 (irq = 4) is a 16550A ttyS01 at 0x02f8 (irq = 3) is a 16550A Floppy drive(s): fd0 is 1.44M FDC 0 is a National Semiconductor PC87306 RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize loop: loaded (max 8 devices) Intel(R) PRO/1000 Network Driver - version 5.1.13-k1 Copyright (c) 1999-2003 Intel Corporation. 3c59x: Donald Becker and others. www.scyld.com/network/vortex.html See Documentation/networking/vortex.txt 04:06.0: 3Com PCI 3c905C Tornado at 0x4800. Vers LK1.1.18-ac 00:50:da:07:d7:41, IRQ 21 product code 5957 rev 00.13 date 07-17-99 Internal config register is 1800000, transceivers 0xa. 8K byte-wide RAM 5:3 Rx:Tx split, autoselect/Autonegotiate interface. MII transceiver found at address 24, status 782d. Enabling bus-master transmits and whole-frame receives. 04:06.0: scatter/gather enabled. 
h/w checksums enabled See Documentation/networking/vortex.txt 05:02.0: 3Com PCI 3c905C Tornado at 0x5000. Vers LK1.1.18-ac 00:50:da:4f:e5:01, IRQ 22 product code 5957 rev 00.13 date 12-23-99 Internal config register is 1800000, transceivers 0xa. 8K byte-wide RAM 5:3 Rx:Tx split, autoselect/Autonegotiate interface. MII transceiver found at address 24, status 7809. Enabling bus-master transmits and whole-frame receives. 05:02.0: scatter/gather enabled. h/w checksums enabled See Documentation/networking/vortex.txt 05:03.0: 3Com PCI 3c905C Tornado at 0x5080. Vers LK1.1.18-ac 00:50:da:07:d8:29, IRQ 24 product code 5957 rev 00.13 date 07-17-99 Internal config register is 1800000, transceivers 0xa. 8K byte-wide RAM 5:3 Rx:Tx split, autoselect/Autonegotiate interface. MII transceiver found at address 24, status 7809. Enabling bus-master transmits and whole-frame receives. 05:03.0: scatter/gather enabled. h/w checksums enabled See Documentation/networking/vortex.txt 05:04.0: 3Com PCI 3c905C Tornado at 0x5400. Vers LK1.1.18-ac 00:50:da:df:86:05, IRQ 23 product code 5957 rev 00.13 date 01-14-00 Internal config register is 1800000, transceivers 0xa. 8K byte-wide RAM 5:3 Rx:Tx split, autoselect/Autonegotiate interface. MII transceiver found at address 24, status 782d. Enabling bus-master transmits and whole-frame receives. 05:04.0: scatter/gather enabled. h/w checksums enabled pcnet32.c:v1.27a 10.02.2002 tsbogend@alpha.franken.de Intel(R) PRO/100 Network Driver - version 2.3.18-k1 Copyright (c) 2003 Intel Corporation e100: selftest OK. e100: eth4: Intel(R) PRO/100 Network Connection Hardware receive checksums enabled cpu cycle saver enabled No adapter found. Universal TUN/TAP device driver 1.5 (C)1999-2002 Maxim Krasnyansky Linux agpgart interface v0.99 (c) Jeff Hartmann agpgart: Maximum main memory to use for agp memory: 816M agpgart: no supported devices found. 
[drm] Initialized tdfx 1.0.0 20010216 on minor 0 [drm] Initialized radeon 1.1.1 20010405 on minor 1 [drm:drm_init] *ERROR* Cannot initialize the agpgart module. Uniform Multi-Platform E-IDE driver Revision: 7.00beta4-2.4 ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx hda: CD-532E-B, ATAPI CD/DVD-ROM drive ide0 at 0x1f0-0x1f7,0x3f6 on irq 14 hda: attached ide-cdrom driver. hda: ATAPI 32X CD-ROM drive, 128kB Cache Uniform CD-ROM driver Revision: 3.12 SCSI subsystem driver Revision: 1.00 Red Hat/Adaptec aacraid driver (1.1.2 Sep 25 2003 16:31:15) megaraid: v1.18f (Release Date: Tue Dec 10 09:54:39 EST 2002) megaraid: found 0x8086:0x1960:idx 0:bus 4:slot 3:func 1 scsi0 : Found a MegaRAID controller at 0xf880f000, IRQ: 20 megaraid: [\x10\x01E :\x04\x02B ] detected 2 logical drives megaraid: channel[1] is raid. megaraid: channel[2] is raid. scsi0 : LSI Logic MegaRAID \x10\x01E 254 commands 15 targs 5 chans 7 luns scsi0: scanning virtual channel 0 for logical drives. Vendor: MegaRAID Model: LD0 RAID1 34730R Rev: E Type: Direct-Access ANSI SCSI revision: 02 Vendor: MegaRAID Model: LD1 RAID1 70086R Rev: E Type: Direct-Access ANSI SCSI revision: 02 scsi0: scanning physical channel 0 for devices. Vendor: HP Model: SAFTE; U160/M BP Rev: 1023 Type: Processor ANSI SCSI revision: 02 scsi0: scanning physical channel 1 for devices. Vendor: HP Model: SAFTE; U160/M BP Rev: 1023 Type: Processor ANSI SCSI revision: 02 3ware Storage Controller device driver for Linux v1.02.00.036. 3w-xxxx: No cards found. 
Attached scsi disk sda at scsi0, channel 0, id 0, lun 0 Attached scsi disk sdb at scsi0, channel 0, id 1, lun 0 SCSI device sda: 71127040 512-byte hdwr sectors (36417 MB) Partition check: sda: sda1 sda2 sda3 sda4 < sda5 > SCSI device sdb: 143536128 512-byte hdwr sectors (73490 MB) sdb: sdb1 sdb2 sdb3 sdb4 < sdb5 > Linux Kernel Card Services 3.1.22 options: [pci] [cardbus] [pm] usb.c: registered new driver hub host/uhci.c: USB Universal Host Controller Interface driver v1.1 usb.c: registered new driver hid hid-core.c: v1.8.1 Andreas Gal, Vojtech Pavlik <vojtech@suse.cz> hid-core.c: USB HID support drivers Initializing USB Mass Storage driver... usb.c: registered new driver usb-storage USB Mass Storage support registered. md: linear personality registered as nr 1 md: raid0 personality registered as nr 2 md: raid1 personality registered as nr 3 md: raid5 personality registered as nr 4 raid5: measuring checksumming speed 8regs : 1016.000 MB/sec 32regs : 518.000 MB/sec pIII_sse : 1100.000 MB/sec pII_mmx : 1220.400 MB/sec p5_mmx : 1280.800 MB/sec raid5: using function: pIII_sse (1100.000 MB/sec) md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27 md: Autodetecting RAID arrays. md: autorun ... md: ... autorun DONE. LVM version 1.0.5+(22/07/2002) NET4: Linux TCP/IP 1.0 for NET4.0 IP Protocols: ICMP, UDP, TCP, IGMP IP: routing cache hash table of 8192 buckets, 64Kbytes TCP: Hash tables configured (established 262144 bind 65536) ip_conntrack version 2.1 (7168 buckets, 57344 max) - 292 bytes per conntrack ip_tables: (C) 2000-2002 Netfilter core team NET4: Unix domain sockets 1.0/SMP for Linux NET4.0. ds: no socket drivers loaded! RAMDISK: Compressed image found at block 0 Freeing initrd memory: 1274k freed VFS: Mounted root (ext2 filesystem). 
Freeing unused kernel memory: 392k freed ISO 9660 Extensions: Microsoft Joliet Level 3 ISO 9660 Extensions: RRIP_1991A ISO 9660 Extensions: Microsoft Joliet Level 3 ISO 9660 Extensions: RRIP_1991A ^ permalink raw reply [flat|nested] 18+ messages in thread
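A quick way to double-check how many CPUs the kernel enumerated is to count the "processor" entries in the /proc/cpuinfo attachment above. The following is only a minimal sketch in Python: the function name `count_processors` and the `SAMPLE` text are made up for illustration, and the sample is a shortened stand-in, not the real attachment.

```python
def count_processors(cpuinfo_text: str) -> int:
    """Count 'processor : N' entries in /proc/cpuinfo-style text."""
    count = 0
    for line in cpuinfo_text.splitlines():
        # Split on the first colon only; values may contain colons too.
        key, sep, _value = line.partition(":")
        if sep and key.strip() == "processor":
            count += 1
    return count


# Shortened, hypothetical sample in the same "key : value" layout.
SAMPLE = """\
processor : 0
model name : Pentium III (Katmai)

processor : 1
model name : Pentium III (Katmai)
"""

print(count_processors(SAMPLE))
```

On a running Linux system the same function can be fed the contents of /proc/cpuinfo instead of the sample string.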
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 16:46 ` Manfred.Herrmann
@ 2004-02-27 17:46 ` Ian Pratt
2004-02-27 18:14 ` Manfred.Herrmann
0 siblings, 1 reply; 18+ messages in thread
From: Ian Pratt @ 2004-02-27 17:46 UTC (permalink / raw)
To: Manfred.Herrmann; +Cc: Ian Pratt, xen-devel

> > > i prepared two LH6000/4CPU/4GB for XEN-production :o(
> > > so what next?

Judging from the Linux boot output, this looks like a 5 (!) CPU
system. Is this correct?

You're definitely outside the tested envelope with this system...

A few things to try:

1) Remove half the RAM (2GB). We have some 4GB systems that work
fine, but it's possible there's a clash with IO memory space.

2) Remove all but one of the Ethernet cards. Multiple Ethernet
cards shouldn't cause problems, but we've never used multiple of
these particular cards. (You can't currently use more than one
card anyhow, but this will be fixed relatively soon)

3) Boot a 2.4.16 version of Linux (e.g. from the 1.0
XenDemoCD). If that fails, step forward versions until you find
the one where it starts working. (It's probably a BIOS table or
weird APIC issue)

4) Boot Xen with a serial line connected so that you can capture
the serial output to send to us.

Ian
^ permalink raw reply [flat|nested] 18+ messages in thread
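The "step forward versions until it starts working" suggestion in the message above amounts to a linear search over kernel versions, oldest first. A minimal Python sketch of that bookkeeping follows; the helper name `first_working` is hypothetical, and the outcomes in `PLACEHOLDER_RESULTS` are placeholders for illustration, not results reported in this thread. Each entry would be filled in by hand after a real boot attempt.

```python
from typing import Callable, Optional, Sequence


def first_working(versions: Sequence[str],
                  boots_ok: Callable[[str], bool]) -> Optional[str]:
    """Walk versions oldest-first; return the first one that boots, else None."""
    for version in versions:
        if boots_ok(version):
            return version
    return None


def version_key(version: str):
    """Sort '2.4.16' before '2.4.22' numerically rather than as text."""
    return tuple(int(part) for part in version.split("."))


# Placeholder outcomes for illustration only; NOT measurements from this thread.
PLACEHOLDER_RESULTS = {"2.4.22": True, "2.4.16": False, "2.4.20": False}

candidates = sorted(PLACEHOLDER_RESULTS, key=version_key)
print(first_working(candidates, PLACEHOLDER_RESULTS.__getitem__))
```

The version just before the one returned is then the most recent failing kernel, which is the one worth diffing against.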
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 17:46 ` Ian Pratt
@ 2004-02-27 18:14 ` Manfred.Herrmann
2004-02-27 18:42 ` Ian Pratt
0 siblings, 1 reply; 18+ messages in thread
From: Manfred.Herrmann @ 2004-02-27 18:14 UTC (permalink / raw)
Cc: Ian Pratt, xen-devel
[-- Attachment #1: Type: text/plain, Size: 1494 bytes --]

>
> > > > i prepared two LH6000/4CPU/4GB for XEN-production :o(
> > > > so what next?
>
> Judging from the Linux boot output, this looks like a 5 (!) CPU
> system. Is this correct?
>

yes ... LH6000 is up to 6 CPU, I thought it's good for QOS resource
sharing if I can pin domains to a "free" cpu.

> You're definitely outside the tested envelope with this system...
>

not so good for production use :o)

>
> A few things to try:
>
> 1) Remove half the RAM (2GB). We have some 4GB systems that work
> fine, but it's possible there's a clash with IO memory space.
>

ack

>
> 2) Remove all but one of the Ethernet cards. Multiple Ethernet
> cards shouldn't cause problems, but we've never used multiple of
> these particular cards. (You can't currently use more than one
> card anyhow, but this will be fixed relatively soon)
>

ack

> 3) Boot a 2.4.16 version of Linux (e.g. from the 1.0
> XenDemoCD). If that fails, step forward versions until you find
> the one where it starts working. (It's probably a BIOS table or
> weird APIC issue)
>

? sorry ... what to do in detail ?
On my XenDemoCD 1.0 download/22.01.2004 is only 2.4.22 as a boot config.
This XenoLinux config booting is o.k.!
In dmesg only one cpu.
With mii-tools no network interface is accessible.

> 4) Boot Xen with a serial line connected so that you can capture
> the serial output to send to us.
>

ack ... need equipment for it ... next week

>
> Ian

Manni

[-- Attachment #2: Type: text/html, Size: 2350 bytes --]
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 18:14 ` Manfred.Herrmann
@ 2004-02-27 18:42 ` Ian Pratt
2004-02-28 17:18 ` Manfred.Herrmann
0 siblings, 1 reply; 18+ messages in thread
From: Ian Pratt @ 2004-02-27 18:42 UTC (permalink / raw)
To: Manfred.Herrmann; +Cc: Ian Pratt, xen-devel

> > 3) Boot a 2.4.16 version of Linux (e.g. from the 1.0
> > XenDemoCD). If that fails, step forward versions until you find
> > the one where it starts working. (It's probably a BIOS table or
> > weird APIC issue)
> >
> ? sorry ... what to do in detail ?
> On my XenDemoCD 1.0 download/22.01.2004 is only 2.4.22 as a boot
> config.
> This XenoLinux config booting is o.k.!
> In dmesg only one cpu.
> With mii-tools no network interface is accessible.

Hang-on, you mean the Linux 2.4.22 kernel is only finding one
CPU? I thought an earlier message said it was OK SMP i.e. 'top'
shows 5 CPUs etc.

If 2.4.22 does work, please can you try compiling up a 2.4.20 to
see if that breaks. (there's a 2.4.16 on the *1.0* version of the
demoCD.)

If we can find the most recent version that fails to work, that
will give us a clue as to what we're missing.

Having the Xen boot output will be helpful too. Has this machine got
built-in remote console facilities that you could cut and paste
from instead of setting up a serial line?

Cheers,
Ian
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 18:42 ` Ian Pratt
@ 2004-02-28 17:18 ` Manfred.Herrmann
2004-02-28 18:27 ` Ian Pratt
0 siblings, 1 reply; 18+ messages in thread
From: Manfred.Herrmann @ 2004-02-28 17:18 UTC (permalink / raw)
Cc: Ian Pratt, xen-devel
[-- Attachment #1: Type: text/plain, Size: 2811 bytes --]

XenDemoCD 1.2 - The problem exists with more than 1 CPU.

Following configurations are o.k. with 1 CPU:
4GByte RAM o.k.
Adaptec AIC7880 o.k.
Netraid o.k.
5 NICs (4 3C905 + 1 EEPRO100) o.k.
Multiple domains are o.k. (ssh login possible)

XenDemoCD 1.0 - No boot problem, dom0 o.k.

4 CPUs

[root@xendemo0 root]# xenctl domain list
id: 0 (Domain-0)
processor: 0
has cpu: true
state: 0 running
mcu advance: 10
total pages: 25000
id: 1 (XenoLinux)
processor: 1
has cpu: false <--- what is "false" ?
state: 8 suspended
mcu advance: 10
total pages: 24576
id: 2 (XenoLinux)
processor: 2
has cpu: false
state: 8 suspended
mcu advance: 10
total pages: 24576
id: 3 (XenoLinux)
processor: 3
has cpu: false
state: 8 suspended
mcu advance: 10
total pages: 24576
id: 4 (XenoLinux)
processor: 0
has cpu: false
state: 8 suspended
mcu advance: 10
total pages: 24576
id: 5 (XenoLinux)
processor: 1
has cpu: false
state: 8 suspended
mcu advance: 10
total pages: 24576
id: 6 (XenoLinux)
processor: 2
has cpu: false
state: 8 suspended
mcu advance: 10
total pages: 24576
id: 7 (XenoLinux)
processor: 3
has cpu: false
state: 8 suspended
mcu advance: 10
total pages: 24576
id: 8 (XenoLinux)
processor: 0
has cpu: false
state: 8 suspended
mcu advance: 10
total pages: 24576
[root@xendemo0 root]#

>
> > > 3) Boot a 2.4.16 version of Linux (e.g. from the 1.0
> > > XenDemoCD). If that fails, step forward versions until you find
> > > the one where it starts working. (It's probably a BIOS table or
> > > weird APIC issue)
> > >
> > ? sorry ... what to do in detail ?
> > On my XenDemoCD 1.0 download/22.01.2004 is only 2.4.22 as a boot
> > config.
> > This XenoLinux config booting is o.k.!
> > In dmesg only one cpu.
> > With mii-tools no network interface is accessible.
>
> Hang-on, you mean the Linux 2.4.22 kernel is only finding one
> CPU? I thought an earlier message said it was OK SMP i.e. 'top'
> shows 5 CPUs etc.
>
> If 2.4.22 does work, please can you try compiling up a 2.4.20 to
> see if that breaks. (there's a 2.4.16 on the *1.0* version of the
> demoCD.)
>

Native Linux 2.4.22 from XenDemoCD 1.2 is o.k.
Do you mean a standard Linux as debian 2.4.18?
Give me a detailed instruction what and how ... and I will try it.

> If we can find the most recent version that fails to work, that
> will give us a clue as to what we're missing.
>
> Having the Xen boot output will be helpful too. Has this machine got
> built-in remote console facilities that you could cut and paste
> from instead of setting up a serial line?
>

sorry, I don't know about the remote console features

> Cheers,
> Ian

[-- Attachment #2: Type: text/html, Size: 5736 bytes --]
^ permalink raw reply [flat|nested] 18+ messages in thread
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-28 17:18 ` Manfred.Herrmann
@ 2004-02-28 18:27 ` Ian Pratt
2004-02-29 10:06 ` Manfred.Herrmann
0 siblings, 1 reply; 18+ messages in thread
From: Ian Pratt @ 2004-02-28 18:27 UTC
To: Manfred.Herrmann; +Cc: Ian Pratt, xen-devel

> XenDemoCD 1.2 - The problem exists with more than 1 CPU.
>
> Following configurations are o.k. with 1 CPU:
> 4GByte RAM o.k.
> Adaptec AIC7880 o.k.
> Netraid o.k.
> 5 NICs (4 3C905 + 1 EEPRO100) o.k.
> Multiple domains are o.k. (ssh login possible)
>
> XenDemoCD 1.0 - No boot problem, dom0 o.k.
>
> 4 CPUs

So, just to make sure I understand:

XenDemoCD 1.0 works fine with 4 CPUs installed. What about 5?

XenDemoCD 1.2 works OK with one CPU, but fails with 4 and
5. What's the output when it fails? What about 2 CPUs?

> [root@xendemo0 root]# xenctl domain list
> id: 1 (XenoLinux)
> processor: 1
> has cpu: false <--- what is "false" ?

It just means that the domain happens not to be using the CPU at
this instant (i.e. it's idle). Totally normal.

Ian
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-28 18:27 ` Ian Pratt
@ 2004-02-29 10:06 ` Manfred.Herrmann
2004-03-01 23:15 ` Ian Pratt
2004-03-04 14:58 ` Steven Hand
0 siblings, 2 replies; 18+ messages in thread
From: Manfred.Herrmann @ 2004-02-29 10:06 UTC
Cc: Ian Pratt, xen-devel
[-- Attachment #1: Type: text/plain, Size: 1807 bytes --]

> > XenDemoCD 1.2 - The problem exists with more than 1 CPU.
> >
> > Following configurations are o.k. with 1 CPU:
> > 4GByte RAM o.k.
> > Adaptec AIC7880 o.k.
> > Netraid o.k.
> > 5 NICs (4 3C905 + 1 EEPRO100) o.k.
> > Multiple domains are o.k. (ssh login possible)
>
I will refer to this as TEST1
> >
> > XenDemoCD 1.0 - No boot problem, dom0 o.k.
> >
> > 4 CPUs
I will refer to this as TEST2
>
> So, just to make sure I understand:
>
> XenDemoCD 1.0 works fine with 4 CPUs installed. What about 5?
>
Yes, XenDemoCD 1.0 (TEST2) boot is o.k.; no more tests done.
If it makes sense I will test domain starting, ssh ...

More info about 5 CPUs:
1. Only "server1" has 5 CPUs (TEST1 1 CPU o.k., 2 CPUs fail, 5 CPUs fail)
2. TEST2 was on "server2" (4 CPUs)

> XenDemoCD 1.2 works OK with one CPU, but fails with 4 and
> 5. What's the output when it fails? What about 2 CPUs?
>
With 2 CPUs it fails.

Output when it fails:
megaraid scanning raidchannel with error:
scsi0: scanning virtual channel0 for scsi devices.
scsi_wait_req: still waiting...!
scsi_wait_req: still waiting...!
scsi_wait_req: still waiting...!
(... endless)

Helpful info from a previous TESTx:
BIOS AIC-7880 enabled + Netraid (megaraid) enabled
Bootsequence 1. AIC-7880, 2. Netraid

Output when it fails:
... "adaptec scanning" ... (this text is not copy/paste)
scsi_wait_req: still waiting...!
scsi_wait_req: still waiting...!
scsi_wait_req: still waiting...!
(... endless)

> > [root@xendemo0 root]# xenctl domain list
> > id: 1 (XenoLinux)
> > processor: 1
> > has cpu: false <--- what is "false" ?
>
> It just means that the domain happens not to be using the CPU at
> this instant (i.e. it's idle). Totally normal.
>
very good :-)
[-- Attachment #2: Type: text/html, Size: 3273 bytes --]
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-29 10:06 ` Manfred.Herrmann
@ 2004-03-01 23:15 ` Ian Pratt
2004-03-02 8:26 ` Manfred.Herrmann
2004-03-04 14:58 ` Steven Hand
1 sibling, 1 reply; 18+ messages in thread
From: Ian Pratt @ 2004-03-01 23:15 UTC
To: Manfred.Herrmann; +Cc: Ian Pratt, xen-devel

> More Info about 5 CPUs:
> 1. Only "server1" has 5 CPUs (TEST1 1 CPU o.k., 2CPUs fail, 5CPUs fail)

OK, so whenever you get a fail it's a problem with the megaraid
driver looping?

Can you bring the machine up using either NFS root or a spare IDE
drive? I'd like to know whether interrupt routeing is totally
screwed with >1 CPU, or whether it's just a problem with the
megaraid driver and the network drivers are OK.

Judging from Stephen's comment, it sounds like there's a known
problem with the megaraid driver.

A sensible first move would be to upgrade the driver from the
current version 1.18d to the latest 1.18k from the 2.4.25 tree.

Ian
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-03-01 23:15 ` Ian Pratt
@ 2004-03-02 8:26 ` Manfred.Herrmann
0 siblings, 0 replies; 18+ messages in thread
From: Manfred.Herrmann @ 2004-03-02 8:26 UTC
Cc: Ian Pratt, xen-devel
[-- Attachment #1: Type: text/plain, Size: 983 bytes --]

> > More Info about 5 CPUs:
> > 1. Only "server1" has 5 CPUs (TEST1 1 CPU o.k., 2CPUs fail, 5CPUs fail)
>
> OK, so whenever you get a fail it's a problem with the megaraid
> driver looping?
>
Not only megaraid; with the built-in aic7880 enabled I see the same behaviour.

More test results with XenDemoCD 1.0:
All "pass"
- CPU o.k. (dom0=CPU0, dom1=CPU1, dom2=CPU2)
- networking o.k. (scp from dom2 to dom0 /dev/sdb1)
- megaraid drive access o.k.

> Can you bring the machine up using either NFS root or a spare IDE
> drive? I'd like to know whether interrupt routeing is totally
> screwed with >1 CPU, or whether it's just a problem with the
> megaraid driver and the network drivers are OK.
>
I will try it.
>
> Judging from Stephen's comment, it sounds like there's a known
> problem with the megaraid driver.
>
> A sensible first move would be to upgrade the driver from the
> current version 1.18d to the latest 1.18k from the 2.4.25 tree.
>
I will try it.

Thanks a lot!
Manni
[-- Attachment #2: Type: text/html, Size: 1561 bytes --]
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-29 10:06 ` Manfred.Herrmann
2004-03-01 23:15 ` Ian Pratt
@ 2004-03-04 14:58 ` Steven Hand
2004-03-04 15:04 ` Steven Hand
1 sibling, 1 reply; 18+ messages in thread
From: Steven Hand @ 2004-03-04 14:58 UTC
To: Manfred.Herrmann; +Cc: Ian Pratt, xen-devel

> > XenDemoCD 1.2 works OK with one CPU, but fails with 4 and
> > 5. What's the output when it fails? What about 2 CPUs?
> >
> With 2 CPUs it fails.
>
> Output when it fails:
> megaraid scanning raidchannel with error:
> scsi0: scanning virtual channel0 for scsi devices.
> scsi_wait_req: still waiting...!
> scsi_wait_req: still waiting...!
> scsi_wait_req: still waiting...!
> (... endless)
>
> Helpful info from a previous TESTx:
> BIOS AIC-7880 enabled + Netraid (megaraid) enabled
> Bootsequence 1. AIC-7880, 2. Netraid
>
> Output when it fails:
> ... "adaptec scanning" ... (this text is not copy/paste)
> scsi_wait_req: still waiting...!
> scsi_wait_req: still waiting...!
> scsi_wait_req: still waiting...!
> (... endless)

Can you try applying the following minor patch (against 1.2) and
retrying? It should essentially just terminate the endless printks
and allow us to see when (if?) it falls over later on.

cheers,

S.

-------------------------------------------------------------------------
# This is a BitKeeper generated patch for the following project:
# Project Name: Xen Virtual Machine Monitor source code
# This patch format is intended for GNU patch command version 2.5 or higher.
# This patch includes the following deltas:
#   ChangeSet                 1.742 -> 1.743
#   xen/drivers/scsi/scsi.c   1.10  -> 1.11
#
# The following is the BitKeeper ChangeSet Log
# --------------------------------------------
# 04/03/04 smh22@tempest.cl.cam.ac.uk 1.743
# debugging tweaks
# --------------------------------------------
#
diff -Nru a/xen/drivers/scsi/scsi.c b/xen/drivers/scsi/scsi.c
--- a/xen/drivers/scsi/scsi.c Thu Mar 4 14:57:22 2004
+++ b/xen/drivers/scsi/scsi.c Thu Mar 4 14:57:22 2004
@@ -244,8 +244,8 @@
     }
 #else
     /* XXX SMH: just use a flag to signal completion; caller spins */
-    if (*(int *)(req->waiting) != 0) {
-//        printk("scsi_wait_done: flipping wait status on req %p\n", req);
+    if (*(volatile int *)(req->waiting) != 0) {
+        printk("scsi_wait_done: flipping wait status on req %p\n", req);
         *(int *)(req->waiting) = 0;
     }
 #endif
@@ -812,6 +812,7 @@
 #else
     int wait = 1;
     int usecs = 0;
+    int nsecs = 0;
 #endif
@@ -845,8 +846,16 @@
         usecs += 500;
         if(usecs > 1000000) {
             printk("scsi_wait_req: still waiting...!\n");
+            nsecs = nsecs++;
             usecs = 0;
         }
+
+        if(nsecs > 10) {
+            printk("scsi_wait_req: bored of waiting (wait is %d, &wait %p)\n",
+                   wait, &wait);
+            wait = 0;
+        }
+
     }
 #endif
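The patch above turns an unbounded spin on a completion flag into a spin that gives up after roughly ten "still waiting" messages. A minimal user-space sketch of the same idea follows. It is not the Xen scsi.c code: `wait_for_flag`, `delay_fn` and `no_delay` are hypothetical names, the delay is supplied by the caller, and the sketch returns an error code on timeout where the patch simply clears `wait`. The flag is read through a `volatile` pointer, as in the patch, so that the compiler re-reads it on every iteration; `volatile` on its own does not guarantee ordering between CPUs.

```c
#include <stdio.h>

/* Hypothetical delay hook standing in for the 500 us pause per iteration. */
typedef void (*delay_fn)(unsigned usecs);

/* Stub delay for demonstration: returns immediately. */
void no_delay(unsigned usecs)
{
    (void)usecs;
}

/* Spins until *flag becomes 0 or about max_secs seconds of delay have
 * accumulated. Returns 0 if the flag was cleared, -1 on timeout. */
int wait_for_flag(volatile int *flag, unsigned max_secs, delay_fn delay)
{
    unsigned usecs = 0;   /* microseconds since the last message */
    unsigned nsecs = 0;   /* whole seconds waited so far */

    while (*flag != 0) {
        delay(500);
        usecs += 500;
        if (usecs >= 1000000) {
            nsecs = nsecs + 1;
            usecs = 0;
            fprintf(stderr, "still waiting (%u s)\n", nsecs);
        }
        if (nsecs >= max_secs)
            return -1;    /* give up instead of looping forever */
    }
    return 0;
}
```

Returning a distinct timeout value lets the caller tell "request completed" apart from "gave up", which is the information this debugging exercise is after.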
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-03-04 14:58 ` Steven Hand
@ 2004-03-04 15:04 ` Steven Hand
0 siblings, 0 replies; 18+ messages in thread
From: Steven Hand @ 2004-03-04 15:04 UTC
To: Steven Hand; +Cc: Manfred.Herrmann, Ian Pratt, xen-devel

> Can you try applying the following minor patch (against 1.2) and
> retrying? It should essentially just terminate the endless printks
> and allow us to see when (if?) it falls over later on.

Oh, and please fix the typo ("nsecs = nsecs++;" should of course be
"nsecs = nsecs + 1;") :-)

S.
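For context on why that line needs fixing: in C, `nsecs = nsecs++;` modifies `nsecs` twice with no sequence point in between, which the language standard leaves undefined, so a compiler may well leave the counter unchanged and the ten-second cutoff would never trigger. The small sketch below shows three well-defined ways to write the increment; the function names are illustrative only.

```c
/* Each helper returns the counter after one well-defined increment. */

int bump_explicit(int nsecs)
{
    nsecs = nsecs + 1;   /* the form suggested in the message above */
    return nsecs;
}

int bump_postfix(int nsecs)
{
    nsecs++;             /* postfix increment used as a statement */
    return nsecs;
}

int bump_compound(int nsecs)
{
    nsecs += 1;          /* compound assignment */
    return nsecs;
}
```

All three are equivalent here; the only form to avoid is assigning the result of `nsecs++` back to `nsecs`.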
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 14:46 megaraid problem on hp netserver lh6000 - ignorebiostables o.k Manfred.Herrmann
2004-02-27 16:11 ` Ian Pratt
@ 2004-02-27 16:57 ` Keir Fraser
2004-02-27 17:44 ` Manfred.Herrmann
2004-02-28 3:13 ` Stephen Evanchik
2 siblings, 1 reply; 18+ messages in thread
From: Keir Fraser @ 2004-02-27 16:57 UTC
To: Manfred.Herrmann; +Cc: xen-devel

> i prepared two LH6000/4CPU/4GB for XEN-production :o(
> so what next?

Well, we have never booted that chipset, we have never booted with
that many I/O devices configured, we have never booted that many CPUs,
and we have never booted with that amount of memory. So it's fair to
say that you're outside the tested operational envelope. :-)

I'd also guess that this may be non-trivial to debug.

-- Keir
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 16:57 ` Keir Fraser
@ 2004-02-27 17:44 ` Manfred.Herrmann
2004-02-27 17:47 ` Keir Fraser
0 siblings, 1 reply; 18+ messages in thread
From: Manfred.Herrmann @ 2004-02-27 17:44 UTC
To: Keir Fraser; +Cc: xen-devel
[-- Attachment #1: Type: text/plain, Size: 916 bytes --]

> > i prepared two LH6000/4CPU/4GB for XEN-production :o(
> > so what next?
>
> Well, we have never booted that chipset, we have never booted with
> that many I/O devices configured, we have never booted that many CPUs,
> and we have never booted with that amount of memory. So it's fair to
> say that you're outside the tested operational envelope. :-)
>
uuupppsss :-)

> I'd also guess that this may be non-trivial to debug.
>
> -- Keir

So what would you suggest?

1. Sell these servers and get smaller and newer systems.
or
2. These systems are very good for quality assurance. You can test
many CPUs, NICs and other I/O.

If "2." then:
I own 3 LH6000 systems. Server1 with 10 HDs, Server2 with 6 HDs and
Server3 with 2 HDs.
Server1 is for production.
Server2 can be used for reliability and load tests for a while.
Server3 is free for testing new configs/changes.

best regards
Manni
[-- Attachment #2: Type: text/html, Size: 1520 bytes --]
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 17:44 ` Manfred.Herrmann
@ 2004-02-27 17:47 ` Keir Fraser
2004-02-27 18:22 ` Manfred.Herrmann
0 siblings, 1 reply; 18+ messages in thread
From: Keir Fraser @ 2004-02-27 17:47 UTC
To: Manfred.Herrmann; +Cc: Keir Fraser, xen-devel
[-- Attachment #1: Type: text/plain, Size: 1048 bytes --]

> So what would you suggest?
>
> 1. Sell these servers and get smaller and newer systems.
> or
> 2. These systems are very good for quality assurance. You can test
> many CPUs, NICs and other I/O.
>
> If "2." then:
> I own 3 LH6000 systems. Server1 with 10 HDs, Server2 with 6 HDs and
> Server3 with 2 HDs.
> Server1 is for production.
> Server2 can be used for reliability and load tests for a while.
> Server3 is free for testing new configs/changes.

I think Ian is going to mail you some suggestions of his own.

I would suggest pulling some of the hardware out of one of the boxes
and seeing if a more 'minimal' config gets you anywhere:
e.g., one CPU, one network card, one disc, 1GB memory.

If we can get you to a working configuration it makes debugging much
easier!

-- Keir
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 17:47 ` Keir Fraser
@ 2004-02-27 18:22 ` Manfred.Herrmann
0 siblings, 0 replies; 18+ messages in thread
From: Manfred.Herrmann @ 2004-02-27 18:22 UTC
Cc: Keir Fraser, xen-devel
[-- Attachment #1: Type: text/plain, Size: 357 bytes --]

> I think Ian is going to mail you some suggestions of his own.
>
> I would suggest pulling some of the hardware out of one of the boxes
> and seeing if a more 'minimal' config gets you anywhere:
> e.g., one CPU, one network card, one disc, 1GB memory.
>
> If we can get you to a working configuration it makes debugging much
> easier!
>
ack ...

Manni
[-- Attachment #2: Type: text/html, Size: 506 bytes --]
* Re: megaraid problem on hp netserver lh6000 - ignorebiostables o.k.
2004-02-27 14:46 megaraid problem on hp netserver lh6000 - ignorebiostables o.k Manfred.Herrmann
2004-02-27 16:11 ` Ian Pratt
2004-02-27 16:57 ` Keir Fraser
@ 2004-02-28 3:13 ` Stephen Evanchik
2 siblings, 0 replies; 18+ messages in thread
From: Stephen Evanchik @ 2004-02-28 3:13 UTC
To: Manfred.Herrmann; +Cc: xen-devel

On Fri, 2004-02-27 at 09:46, Manfred.Herrmann@zipptec.de wrote:
> XEN1.2 democd default boot 2.4.25
> stop at ... megaraid scanning raidchannel with error:
>
> scsi0: scanning virtual channel0 for scsi devices.
> scsi_wait_req: still waiting...!
> scsi_wait_req: still waiting...!
> scsi_wait_req: still waiting...!
> scsi_wait_req: still waiting...!
> (... endless)

I can also confirm this behavior with a Dell PowerEdge, dual Xeon, 2GB
system with the MPT Fusion (LSI1030) SCSI controller. This happens from
'time to time' with the driver I ported way back in November. I brushed
it off as a driver problem as it's awful code. Of course, we do have
reports of the thing working quite well with a domain configured
sufficiently. My machine is well within the specs too ;)

This machine has 3 SCSI controllers (aic79xx, sym83cxx_2 and the
mptfusion) and I think I only see the above when there is a drive
plugged in to the mptfusion controller.

I do have a spark of a memory of creating a patch that timed out in
scsi.c:scsi_wait_req because this appeared with _NO_ drives attached and
the driver enabled.

Actually, the more I think about it the more I remember that when this
happened something strange was going on with the DMA for the
request/reply frames, and a quick check of some debug output confirms
that.

When this happens with the MPT Fusion controller you see this in the
initialization and subsequent commands:

http://www.clarkson.edu/~evanchsa/xen/debug/dma.txt

and then later that day I made changes to get this:

http://www.clarkson.edu/~evanchsa/xen/debug/dma.111003.txt

I now recall that I got rid of some of their PCI handling code as it was
no longer needed in the newer kernels. I think that was the change I
mentioned above.

I could be completely wrong of course ;) I'm not a pro at this by any
means. The machine is currently being used so I'll have to schedule some
time to check in to this more.

Stephen
end of thread

Thread overview: 18+ messages
2004-02-27 14:46 megaraid problem on hp netserver lh6000 - ignorebiostables o.k Manfred.Herrmann
2004-02-27 16:11 ` Ian Pratt
2004-02-27 16:46 ` Manfred.Herrmann
2004-02-27 17:46 ` Ian Pratt
2004-02-27 18:14 ` Manfred.Herrmann
2004-02-27 18:42 ` Ian Pratt
2004-02-28 17:18 ` Manfred.Herrmann
2004-02-28 18:27 ` Ian Pratt
2004-02-29 10:06 ` Manfred.Herrmann
2004-03-01 23:15 ` Ian Pratt
2004-03-02 8:26 ` Manfred.Herrmann
2004-03-04 14:58 ` Steven Hand
2004-03-04 15:04 ` Steven Hand
2004-02-27 16:57 ` Keir Fraser
2004-02-27 17:44 ` Manfred.Herrmann
2004-02-27 17:47 ` Keir Fraser
2004-02-27 18:22 ` Manfred.Herrmann
2004-02-28 3:13 ` Stephen Evanchik