From mboxrd@z Thu Jan 1 00:00:00 1970
From: Anthony Wright
Subject: Re: Kernel bug from 3.0 (was phy disks and vifs timing out in DomU)
Date: Thu, 25 Aug 2011 22:11:44 +0100
Message-ID: <4E56BA90.3050907@overnetdata.com>
References: <29902981.10.1311837224851.JavaMail.root@zimbra.overnetdata.com>
 <24093349.14.1311837878822.JavaMail.root@zimbra.overnetdata.com>
 <4E31820C.5030200@overnetdata.com>
 <1311870512.24408.153.camel@cthulhu.hellion.org.uk>
 <4E3266DE.9000606@overnetdata.com>
 <20110803152841.GA2860@dumpdata.com>
 <4E4E3957.1040007@overnetdata.com>
 <20110819125615.GA26558@dumpdata.com>
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="------------030500070604010105050306"
In-Reply-To: <20110819125615.GA26558@dumpdata.com>
Sender: xen-devel-bounces@lists.xensource.com
Errors-To: xen-devel-bounces@lists.xensource.com
To: Konrad Rzeszutek Wilk
Cc: Ian Campbell, Todd Deshane, "xen-devel@lists.xensource.com"
List-Id: xen-devel@lists.xenproject.org

This is a multi-part message in MIME format.
--------------030500070604010105050306
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit

On 19/08/2011 13:56, Konrad Rzeszutek Wilk wrote:
> On Fri, Aug 19, 2011 at 11:22:15AM +0100, Anthony Wright wrote:
>> On 03/08/2011 16:28, Konrad Rzeszutek Wilk wrote:
>>> On Fri, Jul 29, 2011 at 08:53:02AM +0100, Anthony Wright wrote:
>>>> I've just upgraded to xen 4.1.1 with a stock 3.0 kernel on dom0 (with
>>>> the vga-support patch backported). I can't get my DomU's to work due to
>>>> the phy disks and vifs timing out in DomU and looking through my logs
>>>> this morning I'm getting a consistent kernel bug report with xen
>>>> mentioned at the top of the stack trace and vifdisconnect mentioned on
>>> Yikes! Ian any ideas what to try?
>>>
>>> Anthony, can you compile the kernel with debug=y and when this happens
>>> see what 'xl dmesg' gives?
>>> Also there is also the 'xl debug-keys g' which
>>> should dump the grants in use.. that might help a bit.
>> I've compiled a 3.0.1 kernel with CONFIG_DEBUG=Y (a number of other
>> config values appeared at this point, and I took defaults for them).
>>
>> The output from /var/log/messages & 'xl dmesg' is attached. There was no
>> output from 'xl debug-keys g'.
> Ok, so I am hitting this too - I was hoping that the patch from Stefano
> would have fixed the issue, but sadly it did not.
>
> Let me (I am traveling right now) see if I can come up with an interim
> solution until Ian comes with the right fix.
>
On different hardware with the same software I'm also getting problems
starting DomUs, but this time the error is different. I've attached a
copy of the xl console output, but basically the server hangs at
"Mount-cache hash table entries: 512". Again the VM is paravirtualised,
and again I get a qemu-dm process for it. The references to this message
are normally related to memory issues, but the server has only 1000M of
RAM, so I can't see it causing too much of a problem. Is this related to
the other problems I'm seeing or completely separate?
thanks,

Anthony

--------------030500070604010105050306
Content-Type: text/plain; name="domU.log"
Content-Transfer-Encoding: 7bit
Content-Disposition: attachment; filename="domU.log"

[ 3.226308] Reserving virtual address space above 0xf5800000
[ 3.226308] Linux version 2.6.30.1 (root@deb-builder) (gcc version 4.3.2 (GCC) ) #2 SMP Mon Jul 18 12:06:12 GMT 2011
[ 3.226308] KERNEL supported cpus:
[ 3.226308] Intel GenuineIntel
[ 3.226308] AMD AuthenticAMD
[ 3.226308] NSC Geode by NSC
[ 3.226308] Cyrix CyrixInstead
[ 3.226308] Centaur CentaurHauls
[ 3.226308] Transmeta GenuineTMx86
[ 3.226308] Transmeta TransmetaCPU
[ 3.226308] UMC UMC UMC UMC
[ 3.226308] BIOS-provided physical RAM map:
[ 3.226308] Xen: 0000000000000000 - 00000000000a0000 (usable)
[ 3.226308] Xen: 00000000000a0000 - 0000000000100000 (reserved)
[ 3.226308] Xen: 0000000000100000 - 000000000065c000 (usable)
[ 3.226308] Xen: 000000000065c000 - 0000000000853000 (reserved)
[ 3.226308] Xen: 0000000000853000 - 000000007d000000 (usable)
[ 3.226308] DMI not present or invalid.
[ 3.226308] last_pfn = 0x7d000 max_arch_pfn = 0x1000000
[ 3.226308] init_memory_mapping: 0000000000000000-00000000229fe000
[ 3.226308] NX (Execute Disable) protection: active
[ 3.226308] 1446MB HIGHMEM available.
[ 3.226308] 553MB LOWMEM available.
[ 3.226308] mapped low ram: 0 - 229fe000
[ 3.226308] low ram: 0 - 229fe000
[ 3.226308] node 0 low ram: 00000000 - 229fe000
[ 3.226308] node 0 bootmap 00007000 - 0000b540
[ 3.226308] (7 early reservations) ==> bootmem [0000000000 - 00229fe000]
[ 3.226308] #0 [0000000000 - 0000001000] BIOS data page ==> [0000000000 - 0000001000]
[ 3.226308] #1 [0000853000 - 000085b000] XEN PAGETABLES ==> [0000853000 - 000085b000]
[ 3.226308] #2 [0000001000 - 0000002000] EX TRAMPOLINE ==> [0000001000 - 0000002000]
[ 3.226308] #3 [0000006000 - 0000007000] TRAMPOLINE ==> [0000006000 - 0000007000]
[ 3.226308] #4 [0000100000 - 00005366f4] TEXT DATA BSS ==> [0000100000 - 00005366f4]
[ 3.226308] #5 [0000537000 - 0000643000] PGTABLE ==> [0000537000 - 0000643000]
[ 3.226308] #6 [0000007000 - 000000c000] BOOTMAP ==> [0000007000 - 000000c000]
[ 3.226308] Zone PFN ranges:
[ 3.226308] DMA 0x00000000 -> 0x00001000
[ 3.226308] Normal 0x00001000 -> 0x000229fe
[ 3.226308] HighMem 0x000229fe -> 0x0007d000
[ 3.226308] Movable zone start PFN for each node
[ 3.226308] early_node_map[3] active PFN ranges
[ 3.226308] 0: 0x00000000 -> 0x000000a0
[ 3.226308] 0: 0x00000100 -> 0x0000065c
[ 3.226308] 0: 0x00000853 -> 0x0007d000
[ 3.226308] Using APIC driver default
[ 3.226308] SMP: Allowing 8 CPUs, 0 hotplug CPUs
[ 3.226308] Local APIC disabled by BIOS -- you can enable it with "lapic"
[ 3.226308] Allocating PCI resources starting at 80000000 (gap: 7d000000:83000000)
[ 3.226308] NR_CPUS:8 nr_cpumask_bits:8 nr_cpu_ids:8 nr_node_ids:1
[ 3.226308] PERCPU: Allocated 6 4k pages, static data 22940 bytes
[ 3.810521] Xen: using vcpu_info placement
[ 0.000000] Built 1 zonelists in Zone order, mobility grouping on. Total pages: 507400
[ 0.000000] Kernel command line: root=/dev/xvda1
[ 0.000000] Enabling fast FPU save and restore... done.
[ 0.000000] Enabling unmasked SIMD FPU exception support... done.
[ 0.000000] Initializing CPU#0
[ 0.000000] NR_IRQS:512
[ 0.000000] PID hash table entries: 4096 (order: 12, 16384 bytes)
[ 0.000000] Detected 2533.462 MHz processor.
[ 0.010000] Console: colour dummy device 80x25
[ 0.010000] console [tty0] enabled
[ 0.010000] console [hvc0] enabled
[ 0.010000] Dentry cache hash table entries: 131072 (order: 7, 524288 bytes)
[ 0.010000] Inode-cache hash table entries: 65536 (order: 6, 262144 bytes)
[ 0.010000] Initializing HighMem for node 0 (000229fe:0007d000)
[ 0.010000] Memory: 2022248k/2048000k available (2539k kernel code, 22480k reserved, 1067k data, 244k init, 1480712k highmem)
[ 0.010000] virtual kernel memory layout:
[ 0.010000] fixmap : 0xf574f000 - 0xf57ff000 ( 704 kB)
[ 0.010000] pkmap : 0xf5200000 - 0xf5400000 (2048 kB)
[ 0.010000] vmalloc : 0xe31fe000 - 0xf51fe000 ( 288 MB)
[ 0.010000] lowmem : 0xc0000000 - 0xe29fe000 ( 553 MB)
[ 0.010000] .init : 0xc0490000 - 0xc04cd000 ( 244 kB)
[ 0.010000] .data : 0xc037ae1d - 0xc0485e18 (1067 kB)
[ 0.010000] .text : 0xc0100000 - 0xc037ae1d (2539 kB)
[ 0.010000] Checking if this processor honours the WP bit even in supervisor mode...Ok.
[ 0.010000] installing Xen timer for CPU 0
[ 0.010000] Calibrating delay loop (skipped), value calculated using timer frequency.. 5066.92 BogoMIPS (lpj=25334620)
[ 0.010000] Mount-cache hash table entries: 512

--------------030500070604010105050306
Content-Type: text/plain; charset="us-ascii"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

--------------030500070604010105050306--