From mboxrd@z Thu Jan 1 00:00:00 1970 From: Lyle Worthington Subject: Unexpected IO-APIC Date: Tue, 24 Feb 2004 09:07:21 -0600 Sender: linux-smp-owner@vger.kernel.org Message-ID: Mime-Version: 1.0 Content-Transfer-Encoding: 7BIT Return-path: List-Id: Content-Type: text/plain; charset="us-ascii"; format="flowed" To: linux-smp@vger.kernel.org I am running RedHat 7.3 on a dual P4 3.2G Xeons with 6G RAM and linux kernel 2.4.18-19.7.xbigmem. This sytem is the only one report this IO-APIC issue and has crashed 4 times in the past 6 days. I have searched all over the web trying to find replies to messages people have sent in regarding this issue but have found none so I am emailing hoping you can help. Here is the output from dmesg, let me know if you need any more information: 00 00000000 Intel machine check architecture supported. Intel machine check reporting enabled on CPU#0. CPU: After generic, caps: bfebfbff 00000000 00000000 00000000 CPU: Common caps: bfebfbff 00000000 00000000 00000000 Enabling fast FPU save and restore... done. Enabling unmasked SIMD FPU exception support... done. Checking 'hlt' instruction... OK. POSIX conformance testing by UNIFIX mtrr: v1.40 (20010327) Richard Gooch (rgooch@atnf.csiro.au) mtrr: detected mtrr type: Intel CPU: Before vendor init, caps: bfebfbff 00000000 00000000, vendor = 0 CPU: L1 I cache: 0K, L1 D cache: 8K CPU: L2 cache: 512K CPU: L3 cache: 1024K CPU: Physical Processor ID: 0 CPU: After vendor init, caps: bfebfbff 00000000 00000000 00000000 Intel machine check reporting enabled on CPU#0. CPU: After generic, caps: bfebfbff 00000000 00000000 00000000 CPU: Common caps: bfebfbff 00000000 00000000 00000000 CPU0: Intel(R) Xeon(TM) CPU 3.20GHz stepping 05 per-CPU timeslice cutoff: 1462.73 usecs. task migration cache decay timeout: 1 msecs. enabled ExtINT on CPU#0 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Booting processor 1/1 eip 2000 Initializing CPU#1 masked ExtINT on CPU#1 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Calibrating delay loop... 6379.68 BogoMIPS CPU: Before vendor init, caps: bfebfbff 00000000 00000000, vendor = 0 CPU: L1 I cache: 0K, L1 D cache: 8K CPU: L2 cache: 512K CPU: L3 cache: 1024K CPU: Physical Processor ID: 0 CPU: After vendor init, caps: bfebfbff 00000000 00000000 00000000 Intel machine check reporting enabled on CPU#1. CPU: After generic, caps: bfebfbff 00000000 00000000 00000000 CPU: Common caps: bfebfbff 00000000 00000000 00000000 CPU1: Intel(R) Xeon(TM) CPU 3.20GHz stepping 05 Booting processor 2/6 eip 2000 Initializing CPU#2 masked ExtINT on CPU#2 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Calibrating delay loop... 6379.68 BogoMIPS CPU: Before vendor init, caps: bfebfbff 00000000 00000000, vendor = 0 CPU: L1 I cache: 0K, L1 D cache: 8K CPU: L2 cache: 512K CPU: L3 cache: 1024K CPU: Physical Processor ID: 3 CPU: After vendor init, caps: bfebfbff 00000000 00000000 00000000 Intel machine check reporting enabled on CPU#2. CPU: After generic, caps: bfebfbff 00000000 00000000 00000000 CPU: Common caps: bfebfbff 00000000 00000000 00000000 CPU2: Intel(R) Xeon(TM) CPU 3.20GHz stepping 05 Booting processor 3/7 eip 2000 Initializing CPU#3 masked ExtINT on CPU#3 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Calibrating delay loop... 6379.68 BogoMIPS CPU: Before vendor init, caps: bfebfbff 00000000 00000000, vendor = 0 CPU: L1 I cache: 0K, L1 D cache: 8K CPU: L2 cache: 512K CPU: L3 cache: 1024K CPU: Physical Processor ID: 3 CPU: After vendor init, caps: bfebfbff 00000000 00000000 00000000 Intel machine check reporting enabled on CPU#3. CPU: After generic, caps: bfebfbff 00000000 00000000 00000000 CPU: Common caps: bfebfbff 00000000 00000000 00000000 CPU3: Intel(R) Xeon(TM) CPU 3.20GHz stepping 05 Total of 4 processors activated (25465.14 BogoMIPS). cpu_sibling_map[0] = 1 cpu_sibling_map[1] = 0 cpu_sibling_map[2] = 3 cpu_sibling_map[3] = 2 ENABLING IO-APIC IRQs Setting 2 in the phys_id_present_map ...changing IO-APIC physical APIC ID to 2 ... ok. Setting 3 in the phys_id_present_map ...changing IO-APIC physical APIC ID to 3 ... ok. Setting 4 in the phys_id_present_map ...changing IO-APIC physical APIC ID to 4 ... ok. init IO_APIC IRQs IO-APIC (apicid-pin) 2-0, 2-5, 2-10, 2-11, 2-17, 2-20, 2-23, 3-0, 3-1, 3-2, 3-3, 3-5, 3-6, 3-7, 3-8, 3-9, 3-10, 3-11, 3-12, 3-13, 3-14, 3-15, 3-16, 3-17, 3-18, 3-19, 3-20, 3-21, 3-22, 3-23, 4-0, 4-1, 4-2, 4-3, 4-4, 4-5, 4-7, 4-8, 4-9, 4-10, 4-11, 4-12, 4-13, 4-14, 4-15, 4-16, 4-17, 4-18, 4-19, 4-20, 4-21, 4-22, 4-23 not connected. ..TIMER: vector=0x31 pin1=2 pin2=0 number of MP IRQ sources: 20. number of IO-APIC #2 registers: 24. number of IO-APIC #3 registers: 24. number of IO-APIC #4 registers: 24. testing the IO APIC....................... IO APIC #2...... .... register #00: 02008000 ....... : physical APIC id: 02 WARNING: unexpected IO-APIC, please mail to linux-smp@vger.kernel.org .... register #01: 00178020 ....... : max redirection entries: 0017 ....... : PRQ implemented: 1 ....... : IO APIC version: 0020 .... register #02: 00000000 ....... : arbitration: 00 .... IRQ redirection table: NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect: 00 000 00 1 0 0 0 0 0 0 00 01 00F 0F 0 0 0 0 0 1 1 39 02 008 08 0 0 0 0 0 1 1 31 03 00F 0F 0 0 0 0 0 1 1 41 04 00F 0F 0 0 0 0 0 1 1 49 05 000 00 1 0 0 0 0 0 0 00 06 00F 0F 0 0 0 0 0 1 1 51 07 00F 0F 0 0 0 0 0 1 1 59 08 00F 0F 0 0 0 0 0 1 1 61 09 00F 0F 0 0 0 0 0 1 1 69 0a 000 00 1 0 0 0 0 0 0 00 0b 000 00 1 0 0 0 0 0 0 00 0c 00F 0F 0 0 0 0 0 1 1 71 0d 00F 0F 0 0 0 0 0 1 1 79 0e 00F 0F 0 0 0 0 0 1 1 81 0f 00F 0F 0 0 0 0 0 1 1 89 10 00F 0F 1 1 0 1 0 1 1 91 11 000 00 1 0 0 0 0 0 0 00 12 00F 0F 1 1 0 1 0 1 1 99 13 00F 0F 1 1 0 1 0 1 1 A1 14 000 00 1 0 0 0 0 0 0 00 15 00F 0F 1 1 0 1 0 1 1 A9 16 00F 0F 1 1 0 1 0 1 1 B1 17 000 00 1 0 0 0 0 0 0 00 IO APIC #3...... .... register #00: 03000000 ....... : physical APIC id: 03 .... register #01: 00178020 ....... : max redirection entries: 0017 ....... : PRQ implemented: 1 ....... : IO APIC version: 0020 .... register #02: 03000000 ....... : arbitration: 03 .... IRQ redirection table: NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect: 00 000 00 1 0 0 0 0 0 0 00 01 000 00 1 0 0 0 0 0 0 00 02 000 00 1 0 0 0 0 0 0 00 03 000 00 1 0 0 0 0 0 0 00 04 00F 0F 1 1 0 1 0 1 1 B9 05 000 00 1 0 0 0 0 0 0 00 06 000 00 1 0 0 0 0 0 0 00 07 000 00 1 0 0 0 0 0 0 00 08 000 00 1 0 0 0 0 0 0 00 09 000 00 1 0 0 0 0 0 0 00 0a 000 00 1 0 0 0 0 0 0 00 0b 000 00 1 0 0 0 0 0 0 00 0c 000 00 1 0 0 0 0 0 0 00 0d 000 00 1 0 0 0 0 0 0 00 0e 000 00 1 0 0 0 0 0 0 00 0f 000 00 1 0 0 0 0 0 0 00 10 000 00 1 0 0 0 0 0 0 00 11 000 00 1 0 0 0 0 0 0 00 12 000 00 1 0 0 0 0 0 0 00 13 000 00 1 0 0 0 0 0 0 00 14 000 00 1 0 0 0 0 0 0 00 15 000 00 1 0 0 0 0 0 0 00 16 000 00 1 0 0 0 0 0 0 00 17 000 00 1 0 0 0 0 0 0 00 IO APIC #4...... .... register #00: 04000000 ....... : physical APIC id: 04 .... register #01: 00178020 ....... : max redirection entries: 0017 ....... : PRQ implemented: 1 ....... : IO APIC version: 0020 .... register #02: 04000000 ....... : arbitration: 04 .... IRQ redirection table: NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect: 00 000 00 1 0 0 0 0 0 0 00 01 000 00 1 0 0 0 0 0 0 00 02 000 00 1 0 0 0 0 0 0 00 03 000 00 1 0 0 0 0 0 0 00 04 000 00 1 0 0 0 0 0 0 00 05 000 00 1 0 0 0 0 0 0 00 06 00F 0F 1 1 0 1 0 1 1 C1 07 000 00 1 0 0 0 0 0 0 00 08 000 00 1 0 0 0 0 0 0 00 09 000 00 1 0 0 0 0 0 0 00 0a 000 00 1 0 0 0 0 0 0 00 0b 000 00 1 0 0 0 0 0 0 00 0c 000 00 1 0 0 0 0 0 0 00 0d 000 00 1 0 0 0 0 0 0 00 0e 000 00 1 0 0 0 0 0 0 00 0f 000 00 1 0 0 0 0 0 0 00 10 000 00 1 0 0 0 0 0 0 00 11 000 00 1 0 0 0 0 0 0 00 12 000 00 1 0 0 0 0 0 0 00 13 000 00 1 0 0 0 0 0 0 00 14 000 00 1 0 0 0 0 0 0 00 15 000 00 1 0 0 0 0 0 0 00 16 000 00 1 0 0 0 0 0 0 00 17 000 00 1 0 0 0 0 0 0 00 IRQ to pin mappings: IRQ0 -> 0:2 IRQ1 -> 0:1 IRQ3 -> 0:3 IRQ4 -> 0:4 IRQ6 -> 0:6 IRQ7 -> 0:7 IRQ8 -> 0:8 IRQ9 -> 0:9 IRQ12 -> 0:12 IRQ13 -> 0:13 IRQ14 -> 0:14 IRQ15 -> 0:15 IRQ16 -> 0:16 IRQ18 -> 0:18 IRQ19 -> 0:19 IRQ21 -> 0:21 IRQ22 -> 0:22 IRQ28 -> 1:4 IRQ54 -> 2:6 .................................... done. Using local APIC timer interrupts. calibrating APIC timer ... ..... CPU clock speed is 3189.0347 MHz. ..... host bus clock speed is 132.1723 MHz. cpu: 0, clocks: 259519, slice: 51903 CPU0 cpu: 1, clocks: 259519, slice: 51903 cpu: 3, clocks: 259519, slice: 51903 cpu: 2, clocks: 259519, slice: 51903 CPU3 CPU1 CPU2 checking TSC synchronization across CPUs: passed. migration_task 0 on cpu=0 migration_task 1 on cpu=1 migration_task 2 on cpu=2 migration_task 3 on cpu=3 PCI: PCI BIOS revision 2.10 entry at 0xfd8b5, last bus=4 PCI: Using configuration type 1 PCI: Probing PCI hardware Transparent bridge - Intel Corp. 82801BA/CA/DB PCI Bridge PCI: Discovered primary peer bus 10 [IRQ] PCI: Discovered primary peer bus 11 [IRQ] PCI: Discovered primary peer bus 12 [IRQ] PCI: Using IRQ router PIIX [8086/2480] at 00:1f.0 PCI->APIC IRQ transform: (B0,I29,P0) -> 16 PCI->APIC IRQ transform: (B0,I29,P1) -> 19 PCI->APIC IRQ transform: (B0,I29,P2) -> 18 PCI->APIC IRQ transform: (B2,I3,P0) -> 54 PCI->APIC IRQ transform: (B3,I3,P0) -> 28 PCI->APIC IRQ transform: (B4,I4,P0) -> 21 PCI->APIC IRQ transform: (B4,I5,P0) -> 22 isapnp: Scanning for PnP cards... isapnp: No Plug & Play device found speakup: initialized device: /dev/synth, node (MAJOR 10, MINOR 25) Linux NET4.0 for Linux 2.4 Based upon Swansea University Computer Society NET3.039 Initializing RT netlink socket apm: BIOS version 1.2 Flags 0x03 (Driver version 1.16) apm: disabled - APM is not SMP safe. Starting kswapd allocated 256 pages and 256 bhs reserved for the highmem bounces VFS: Diskquotas version dquot_6.5.0 initialized pty: 2048 Unix98 ptys configured Serial driver version 5.05c (2001-07-08) with MANY_PORTS MULTIPORT SHARE_IRQ SERIAL_PCI ISAPNP enabled pc_keyb: controller jammed (0x1D). pc_keyb: controller jammed (0x1D). ttyS0 at 0x03f8 (irq = 4) is a 16550A ttyS1 at 0x02f8 (irq = 3) is a 16550A Real Time Clock Driver v1.10e block: 1024 slots per queue, batch=256 Uniform Multi-Platform E-IDE driver Revision: 6.31 ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx PIIX4: IDE controller on PCI bus 00 dev f9 PCI: No IRQ known for interrupt pin A of device 00:1f.1. Probably buggy MP table. PIIX4: chipset revision 2 PIIX4: not 100% native mode: will probe irqs later ide0: BM-DMA at 0x2060-0x2067, BIOS settings: hda:pio, hdb:pio ide1: BM-DMA at 0x2068-0x206f, BIOS settings: hdc:pio, hdd:pio hdc: FX54++M, ATAPI CD/DVD-ROM drive ide1 at 0x170-0x177,0x376 on irq 15 ide-floppy driver 0.99.newide Floppy drive(s): fd0 is 1.44M FDC 0 is a post-1991 82077 NET4: Frame Diverter 0.46 RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize ide-floppy driver 0.99.newide md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27 md: Autodetecting RAID arrays. md: autorun ... md: ... autorun DONE. pci_hotplug: PCI Hot Plug PCI Core version: 0.4 NET4: Linux TCP/IP 1.0 for NET4.0 IP Protocols: ICMP, UDP, TCP, IGMP IP: routing cache hash table of 65536 buckets, 512Kbytes TCP: Hash tables configured (established 262144 bind 65536) Linux IP multicast router 0.06 plus PIM-SM NET4: Unix domain sockets 1.0/SMP for Linux NET4.0. RAMDISK: Compressed image found at block 0 Freeing initrd memory: 203k freed VFS: Mounted root (ext2 filesystem). SCSI subsystem driver Revision: 1.00 kmod: failed to exec /sbin/modprobe -s -k scsi_hostadapter, errno = 2 Red Hat/Adaptec aacraid driver, Dec 12 2002 AAC0: kernel 4.0.4 build 6008 AAC0: monitor 4.0.4 build 6008 AAC0: bios 4.0.0 build 6008 AAC0: serial 0b9aef1 scsi0 : aacraid Vendor: ADAPTEC Model: Adaptec RAID10 Rev: V1.0 Type: Direct-Access ANSI SCSI revision: 02 Attached scsi removable disk sda at scsi0, channel 0, id 0, lun 0 SCSI device sda: 433001088 512-byte hdwr sectors (221697 MB) sda: Write Protect is off Partition check: sda: sda1 sda2 sda3 sda4 < sda5 sda6 sda7 > Journalled Block Device driver loaded kjournald starting. Commit interval 5 seconds EXT3-fs: mounted filesystem with ordered data mode. Freeing unused kernel memory: 188k freed Adding Swap: 2048276k swap-space (priority -1) Adding Swap: 2048248k swap-space (priority -2) Adding Swap: 2048248k swap-space (priority -3) usb.c: registered new driver usbdevfs usb.c: registered new driver hub usb-uhci.c: $Revision: 1.275 $ time 07:47:30 Dec 12 2002 usb-uhci.c: High bandwidth mode enabled PCI: Setting latency timer of device 00:1d.0 to 64 usb-uhci.c: USB UHCI at I/O 0x2000, IRQ 16 usb-uhci.c: Detected 2 ports usb.c: new USB bus registered, assigned bus number 1 hub.c: USB hub found hub.c: 2 ports detected PCI: Setting latency timer of device 00:1d.1 to 64 usb-uhci.c: USB UHCI at I/O 0x2020, IRQ 19 usb-uhci.c: Detected 2 ports usb.c: new USB bus registered, assigned bus number 2 hub.c: USB hub found hub.c: 2 ports detected PCI: Setting latency timer of device 00:1d.2 to 64 usb-uhci.c: USB UHCI at I/O 0x2040, IRQ 18 usb-uhci.c: Detected 2 ports usb.c: new USB bus registered, assigned bus number 3 hub.c: USB hub found hub.c: 2 ports detected usb-uhci.c: v1.275:USB Universal Host Controller Interface driver EXT3 FS 2.4-0.9.18, 14 May 2002 on sd(8,2), internal journal kjournald starting. Commit interval 5 seconds EXT3 FS 2.4-0.9.18, 14 May 2002 on sd(8,1), internal journal EXT3-fs: mounted filesystem with ordered data mode. kjournald starting. Commit interval 5 seconds EXT3 FS 2.4-0.9.18, 14 May 2002 on sd(8,7), internal journal EXT3-fs: mounted filesystem with ordered data mode. eepro100.c:v1.09j-t 9/29/99 Donald Becker http://www.scyld.com/network/eepro100.html eepro100.c: $Revision: 1.36 $ 2000/11/17 Modified by Andrey V. Savochkin and others divert: allocating divert_blk for eth0 eth0: OEM i82557/i82558 10/100 Ethernet, 00:30:48:29:8F:B3, IRQ 22. Board assembly 000000-000, Physical connectors present: RJ45 Primary interface chip i82555 PHY #1. General self-test: passed. Serial sub-system self-test: passed. Internal registers self-test: passed. ROM checksum self-test: passed (0xd0a6c714). -- / Lyle Worthington | Operations Manager | SKYLIST, Inc. | lyle@skylist.net \ (512) 857-7322 <\------------------------------|>