From mboxrd@z Thu Jan 1 00:00:00 1970 From: Jeremy Fitzhardinge Subject: Re: Current xen-unstable xen crashing in boot Date: Fri, 14 Aug 2009 13:48:32 -0700 Message-ID: <4A85CDA0.2010905@goop.org> References: <4A85B8D0.7090206@goop.org> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: <4A85B8D0.7090206@goop.org> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Keir Fraser Cc: Xen-devel , Wei Gang List-Id: xen-devel@lists.xenproject.org On 08/14/09 12:19, Jeremy Fitzhardinge wrote: > I updated from hg today, and xen is crashing fairly early. I haven't t= ried bisecting yet, but the last > working version was a week or two ago. > > _____ ____ _ _ _ =20 > \ \/ /___ _ __ |___ / | ___| _ _ _ __ ___| |_ __ _| |__ | | __= _=20 > \ // _ \ '_ \ |_ \ |___ \ __| | | | '_ \/ __| __/ _` | '_ \| |/ _= \ > / \ __/ | | | ___) | ___) |__| |_| | | | \__ \ || (_| | |_) | | _= _/ > /_/\_\___|_| |_| |____(_)____/ \__,_|_| |_|___/\__\__,_|_.__/|_|\__= _| > = =20 > (XEN) Xen version 3.5-unstable (jeremy@eng.hq.xensource.com) (gcc versi= on 4.4.0 20090506 (Red Hat 4.4.0-4) (GCC) ) Fri Aug 14 10:48:57 PDT 2009 > (XEN) Latest ChangeSet: Fri Aug 14 17:26:23 2009 +0100 20067:5619bed51e= c4 > (XEN) Command line: loglvl=3Dall guest_loglvl=3Dall com1=3D115200,8n1 c= onsole=3Dcom1 noreboot dom0_mem=3D768m > (XEN) Video information: > (XEN) VGA is text mode 80x25, font 8x16 > (XEN) VBE/DDC methods: V2; EDID transfer time: 2 seconds > (XEN) Disc information: > (XEN) Found 1 MBR signatures > (XEN) Found 1 EDD information structures > (XEN) Xen-e820 RAM map: > (XEN) 0000000000000000 - 00000000000a0000 (usable) > (XEN) 0000000000100000 - 000000007ffc0000 (usable) > (XEN) 000000007ffc0000 - 000000007ffcfc00 (ACPI data) > (XEN) 000000007ffcfc00 - 000000007ffff000 (reserved) > (XEN) 00000000e0000000 - 00000000fec90000 (reserved) > (XEN) 00000000fed00000 - 00000000fed00400 (reserved) > (XEN) 00000000fee00000 - 00000000fee10000 (reserved) > (XEN) 00000000ffb00000 - 0000000100000000 (reserved) > (XEN) System RAM: 2047MB (2096512kB) > (XEN) ACPI: RSDP 000FD650, 0014 (r0 DELL ) > (XEN) ACPI: RSDT 000FD664, 0038 (r1 DELL PE BKC 1 MSFT 1000= 00A) > (XEN) ACPI: FACP 000FD6B0, 0074 (r1 DELL PE BKC 1 MSFT 1000= 00A) > (XEN) ACPI: DSDT 7FFC0000, 388D (r1 DELL PE BKC 1 MSFT 1000= 00E) > (XEN) ACPI: FACS 7FFCFC00, 0040 > (XEN) ACPI: APIC 000FD724, 009C (r1 DELL PE BKC 1 MSFT 1000= 00A) > (XEN) ACPI: SPCR 000FD7CC, 0050 (r1 DELL PE BKC 1 MSFT 1000= 00A) > (XEN) ACPI: HPET 000FD81C, 0038 (r1 DELL PE BKC 1 MSFT 1000= 00A) > (XEN) ACPI: MCFG 000FD854, 003C (r1 DELL PE BKC 1 MSFT 1000= 00A) > (XEN) NUMA turned off > (XEN) Faking a node at 0000000000000000-000000007ffc0000 > (XEN) Domain heap initialised > (XEN) found SMP MP-table at 000fe710 > (XEN) DMI 2.3 present. > (XEN) Using APIC driver default > (XEN) ACPI: PM-Timer IO Port: 0x808 > (XEN) ACPI: ACPI SLEEP INFO: pm1x_cnt[804,0], pm1x_evt[800,0] > (XEN) ACPI: wakeup_vec[7ffcfc0c], vec_size[20] > (XEN) ACPI: Local APIC address 0xfee00000 > (XEN) ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled) > (XEN) Processor #0 15:4 APIC version 20 > (XEN) ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled) > (XEN) Processor #1 15:4 APIC version 20 > (XEN) ACPI: LAPIC (acpi_id[0x03] lapic_id[0x06] disabled) > (XEN) ACPI: LAPIC (acpi_id[0x04] lapic_id[0x07] disabled) > (XEN) ACPI: LAPIC_NMI (acpi_id[0x01] high edge lint[0x1]) > (XEN) ACPI: LAPIC_NMI (acpi_id[0x02] high edge lint[0x1]) > (XEN) ACPI: LAPIC_NMI (acpi_id[0x03] high edge lint[0x1]) > (XEN) ACPI: LAPIC_NMI (acpi_id[0x04] high edge lint[0x1]) > (XEN) ACPI: IOAPIC (id[0x02] address[0xfec00000] gsi_base[0]) > (XEN) IOAPIC[0]: apic_id 2, version 32, address 0xfec00000, GSI 0-23 > (XEN) ACPI: IOAPIC (id[0x03] address[0xfec80000] gsi_base[32]) > (XEN) IOAPIC[1]: apic_id 3, version 32, address 0xfec80000, GSI 32-55 > (XEN) ACPI: IOAPIC (id[0x04] address[0xfec83000] gsi_base[64]) > (XEN) IOAPIC[2]: apic_id 4, version 32, address 0xfec83000, GSI 64-87 > (XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl) > (XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level) > (XEN) ACPI: IRQ0 used by override. > (XEN) ACPI: IRQ2 used by override. > (XEN) ACPI: IRQ9 used by override. > (XEN) Enabling APIC mode: Flat. Using 3 I/O APICs > (XEN) ACPI: HPET id: 0xffffffff base: 0xfed00000 > (XEN) PCI: Found Intel Corporation E7520 Memory Controller Hub with MMC= ONFIG support. > (XEN) Using ACPI (MADT) for SMP configuration information > (XEN) Early fatal page fault at e008:ffff828c8015415c (cr2=3Dffff83007f= c75008, ec=3D0000) > (XEN) Stack dump: 000000000007fc76 000000000000000c ffff83007fc77ff0 ff= ff83007fc76000 ffff828c8026fe58 0000000000000262 0000000000000020 0180000= 000000000 000000000000010c ffff828c802a6380 ffff83007fc75008 000000000000= 0001 ffff830000000000 ffff828c80219b0c ffff83007fc76000 0000000000000000 = ffff828c8015415c 000000000000e008 0000000000010082 ffff828c8026fdd8 00000= 00000000000 ffff828c80154158 ffff828c8026fde8 0000000000000262 0000000000= 0002e2 0000000000000086 0000000000000262 0000000000000000 000000008026fe1= 8 ffff83007fc75008 ffff828c00000027 ffff83007fc76000 ffff828c8021dd60 000= 0000000000002 000000007fc42000 ffff83000008bfc0 ffff83000008bf40 00000000= 00002000 ffff828c8026fe68 ffff828c80154f85 ffff828c8026fe78 ffff828c80154= fa5 ffff828c8026ff18 ffff828c802343ef 0000000000000000 0000000000000000 0= 000000000a1e000 0000000000aca3e0 ffff83000008bf40 0000000000000000 000000= 00ffffffff ffff83000000000c 0000000800000000 000000010000006e 00000000000= 00003 00000000000002f8 0000000000000000 0000000000000000 0000000000000000= 0000000000000000 0000000000000000 0000000000000000 0000000000067e8c ffff= 828c801000b5 0000000000000000 0000000000000000 0000000000000000 000000000= 0000000 0000000000000000 0000000000000000 0000000000000000 00000000000000= 00 0000000000000000 0000000000000000 0000000000000000 0000000000000000 00= 00000000000000 0000000000000000 0000000000000000 0000000000000000 0000000= 000000000 0000000000000000 0000000000000000 0000000000000000 000000000000= 0000 0000000000000000 0000000000000000 0000000000000000 0000000000000000 = 0000000000000000 00000000fffff000=20 > > This appears to be: > (gdb) x/i 0xffff828c8015415c=20 > 0xffff828c8015415c : mov (%rax),%rax > (gdb) x/i 0xffff828c80154f85=20 > 0xffff828c80154f85 <__memguard_change_range+237>: leaveq=20 > > but I can't find a good enclosing frame from there (is there a tool to = pretty-print these stack traces?). > =20 I bisected it down to: The first bad revision is: changeset: 20038:1197585e32b7 user: Keir Fraser date: Fri Aug 07 17:29:50 2009 +0100 summary: x86: Increase default max CPUs to 64. J