From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from ozlabs.org (ozlabs.org [103.22.144.67]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by lists.ozlabs.org (Postfix) with ESMTPS id 3rVXjh65KNzDqR3 for ; Thu, 16 Jun 2016 15:58:00 +1000 (AEST) Received: from aserp1040.oracle.com (aserp1040.oracle.com [141.146.126.69]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ozlabs.org (Postfix) with ESMTPS id 3rVXjg23BXz9t1c for ; Thu, 16 Jun 2016 15:57:59 +1000 (AEST) Date: Wed, 15 Jun 2016 22:57:46 -0700 From: "Darrick J. Wong" To: Michael Ellerman Cc: "Aneesh Kumar K.V" , linuxppc-dev@ozlabs.org Subject: Re: kernel bug in "Drop WIMG in favour of new constants"? Message-ID: <20160616055746.GC22590@birch.djwong.org> References: <20160616043340.GB22590@birch.djwong.org> <1466054627.5400.5.camel@ellerman.id.au> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="8t9RHnE3ZwKMSgU+" In-Reply-To: <1466054627.5400.5.camel@ellerman.id.au> List-Id: Linux on PowerPC Developers Mail List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , --8t9RHnE3ZwKMSgU+ Content-Type: text/plain; charset=us-ascii Content-Disposition: inline On Thu, Jun 16, 2016 at 03:23:47PM +1000, Michael Ellerman wrote: > On Wed, 2016-06-15 at 21:33 -0700, Darrick J. Wong wrote: > > > Hi Aneesh, > > > > I noticed when trying out 4.7-rc3 on qemu-2.5 that the kernel no longer > > boots. 4.6 booted just fine, so I bisected the kernel to the commit > > 30bda41aba4efb2370c97e2cbe7385de93ccc372, which is "powerpc/mm: Drop WIMG in > > favour of new constants". The changelog suggests that the KVM changes need > > closer review, and here's an actual crash: > > > > (I can send libvirt's machine xml, .config, and full dmesg if that helps.) > > Yes please. > > I'm successfully booting 4.7-rc's on qemu (2.5.0 (Debian 1:2.5+dfsg-5ubuntu10.1)). Ok, see attached. I also sent along the dpkg --status output for qemu and qemu-slof; looks like we're running the same Ubuntu packages... ...my host kernel is 4.6.0 on x64. --D --8t9RHnE3ZwKMSgU+ Content-Type: text/plain; charset=us-ascii Content-Disposition: attachment; filename=dmesg Connected to domain mtrp0 Escape character is ^] Populating /vdevice methods Populating /vdevice/vty@30000000 Populating /vdevice/nvram@71000000 Populating /pci@800000020000000 00 4000 (D) : 1af4 1005 unknown-legacy-device* 00 3800 (D) : 1af4 1000 virtio [ net ] 00 3000 (D) : 1af4 1000 virtio [ net ] 00 2800 (D) : 1af4 1002 unknown-legacy-device* 00 2000 (D) : 1af4 1003 communication-controller* 00 1800 (D) : 1af4 1004 virtio [ scsi ] Populating /pci@800000020000000/scsi@3 SCSI: Looking for devices 100000100000000 DISK : "QEMU QEMU HARDDISK 2.5+" 100000000000000 DISK : "QEMU QEMU HARDDISK 2.5+" 00 1000 (D) : 1033 0194 serial bus [ usb-xhci ] 00 0800 (D) : 1234 1111 qemu vga Installing QEMU fb Scanning USB XHCI: Initializing USB Keyboard USB mouse USB HUB No console specified using screen & keyboard Detected RAM kernel at 400000 (103dad8 bytes) Welcome to Open Firmware Copyright (c) 2004, 2011 IBM Corporation All rights reserved. This program and the accompanying materials are made available under the terms of the BSD License available at http://www.opensource.org/licenses/bsd-license.php Booting from memory... [ 0.000000] bootconsole [udbg0] enabled -> early_setup(), dt_ptr: 0x1942000 [ 0.000000] Allocated 4096 bytes for 1 pacas at c00000000ffff000 [ 0.000000] pSeries detected, looking for LPAR capability... [ 0.000000] -> fw_vec5_feature_init() [ 0.000000] <- fw_vec5_feature_init() [ 0.000000] -> fw_hypertas_feature_init() [ 0.000000] <- fw_hypertas_feature_init() [ 0.000000] Machine is LPAR ! [ 0.000000] Using pSeries machine description Found, Initializing memory management... [ 0.000000] Page sizes from device-tree: [ 0.000000] base_shift=12: shift=12, sllp=0x0000, avpnm=0x00000000, tlbiel=1, penc=0 [ 0.000000] base_shift=24: shift=24, sllp=0x0100, avpnm=0x00000001, tlbiel=0, penc=0 [ 0.000000] Page orders: linear mapping = 24, virtual = 12, io = 12, vmemmap = 24 [ 0.000000] Using 1TB segments <- early_setup() -> setup_system() -> initialize_cache_info() <- initialize_cache_info() [ 0.000000] Found initrd at 0xc000000001450000:0xc0000000018d8cd3 [ 0.000000] -> pSeries_init_early() [ 0.000000] -> fw_cmo_feature_init() [ 0.000000] CMO not available [ 0.000000] <- fw_cmo_feature_init() [ 0.000000] <- pSeries_init_early() [ 0.000000] Starting Linux ppc64 #11 PREEMPT Wed Jun 15 18:03:32 PDT 2016 [ 0.000000] ----------------------------------------------------- [ 0.000000] ppc64_pft_size = 0x18 [ 0.000000] phys_mem_size = 0x80000000 [ 0.000000] cpu_features = 0x0b7e7a6418500049 [ 0.000000] possible = 0x3f7fffff18500649 [ 0.000000] always = 0x0000000018100040 [ 0.000000] cpu_user_features = 0xdc0065c2 0x20000000 [ 0.000000] mmu_features = 0x7c000001 [ 0.000000] firmware_features = 0x00000001405a440b [ 0.000000] htab_hash_mask = 0x1ffff [ 0.000000] ----------------------------------------------------- <- setup_system() [ 0.000000] Linux version 4.7.0-rc3-pcsum (djwong@alder.djwong.org) (gcc version 5.3.1 20160413 (Ubuntu 5.3.1-14ubuntu2) ) #11 PREEMPT Wed Jun 15 18:03:32 PDT 2016 [ 0.000000] PCI host bridge /pci@800000020000000 ranges: [ 0.000000] IO 0x0000010080000000..0x000001008000ffff -> 0x0000000000000000 [ 0.000000] MEM 0x00000100a0000000..0x000001101fffffff -> 0x0000000080000000 [ 0.000000] PPC64 nvram contains 131072 bytes [ 0.000000] Relocation on exceptions not supported [ 0.000000] Top of RAM: 0x80000000, Total RAM: 0x80000000 [ 0.000000] Memory hole size: 0MB [ 0.000000] Zone ranges: [ 0.000000] DMA [mem 0x0000000000000000-0x000000007fffffff] [ 0.000000] DMA32 empty [ 0.000000] Normal empty [ 0.000000] Movable zone start for each node [ 0.000000] Early memory node ranges [ 0.000000] node 0: [mem 0x0000000000000000-0x000000007fffffff] [ 0.000000] Initmem setup node 0 [mem 0x0000000000000000-0x000000007fffffff] [ 0.000000] On node 0 totalpages: 524288 [ 0.000000] DMA zone: 8192 pages used for memmap [ 0.000000] DMA zone: 0 pages reserved [ 0.000000] DMA zone: 524288 pages, LIFO batch:31 [ 0.000000] pcpu-alloc: s0 r0 d32768 u32768 alloc=1*32768 [ 0.000000] pcpu-alloc: [0] 0 [ 0.000000] Built 1 zonelists in Zone order, mobility grouping on. Total pages: 516096 [ 0.000000] Kernel command line: ro rootdelay=60 panic=0 noresume root=/dev/nfs nfsroot=192.168.122.1:/storage/mtr/ppc64/ ip=dhcp hostname=alder-mtrp0 BOOTIF=01-52-54-00-26-f3-ee console=hvc0 debug systemd.show_status=false [ 0.000000] PID hash table entries: 4096 (order: 3, 32768 bytes) [ 0.000000] Dentry cache hash table entries: 262144 (order: 9, 2097152 bytes) [ 0.000000] Inode-cache hash table entries: 131072 (order: 8, 1048576 bytes) [ 0.000000] Sorting __ex_table... [ 0.000000] Memory: 1973740K/2097152K available (7212K kernel code, 1432K rwdata, 2308K rodata, 3588K init, 2062K bss, 123412K reserved, 0K cma-reserved) [ 0.000000] SLUB: HWalign=128, Order=0-3, MinObjects=0, CPUs=1, Nodes=1 [ 0.000000] Preemptible hierarchical RCU implementation. [ 0.000000] Build-time adjustment of leaf fanout to 64. [ 0.000000] NR_IRQS:512 nr_irqs:512 16 [ 0.000000] pic: no ISA interrupt controller [ 0.000000] time_init: decrementer frequency = 512.000000 MHz [ 0.000000] time_init: processor frequency = 1000.000000 MHz [ 0.000210] clocksource: timebase: mask: 0xffffffffffffffff max_cycles: 0x761537d007, max_idle_ns: 440795202126 ns [ 0.000587] clocksource: timebase mult[1f40000] shift[24] registered [ 0.000923] clockevent: decrementer mult[83126e98] shift[32] cpu[0] [ 0.004297] Console: colour dummy device 80x25 [ 0.004943] console [hvc0] enabled [ 0.004943] console [hvc0] enabled [ 0.005364] bootconsole [udbg0] disabled [ 0.005364] bootconsole [udbg0] disabled [ 0.008176] pid_max: default: 32768 minimum: 301 [ 0.010985] Mount-cache hash table entries: 4096 (order: 3, 32768 bytes) [ 0.011182] Mountpoint-cache hash table entries: 4096 (order: 3, 32768 bytes) [ 0.023245] ftrace: allocating 18922 entries in 112 pages [ 0.104384] EEH: pSeries platform initialized [ 0.106314] POWER7 performance monitor hardware support registered [ 0.129177] devtmpfs: initialized [ 0.143863] EEH: devices created [ 0.144887] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 7645041785100000 ns [ 0.149669] NET: Registered protocol family 16 [ 0.154090] EEH: No capable adapters found [ 0.168883] cpuidle: using governor ladder [ 0.180984] cpuidle: using governor menu Linux ppc64 #11 PREEMPT Wed [ 0.184645] rtas_msi: Registering RTAS MSI callbacks. [ 0.197451] PCI: Probing PCI hardware [ 0.199166] no ibm,pcie-link-speed-stats property [ 0.200470] PCI host bridge to bus 0000:00 [ 0.201075] pci_bus 0000:00: root bus resource [io 0x10000-0x1ffff] (bus address [0x0000-0xffff]) [ 0.201548] pci_bus 0000:00: root bus resource [mem 0x100a0000000-0x1101fffffff] (bus address [0x80000000-0xfffffffff]) [ 0.201994] pci_bus 0000:00: root bus resource [bus 00-ff] [ 0.212257] pci_dma_bus_setup_pSeriesLP: setting up bus /pci@800000020000000 [ 0.212571] parent is /pci@800000020000000, iommu_table: 0x (null) [ 0.216849] IOMMU table initialized, virtual merging enabled [ 0.217357] created table: c00000007f9ba3c0 [ 0.217581] pci_dma_dev_setup_pSeriesLP: 0000:00:08.0 [ 0.217723] node is /pci@800000020000000/unknown-legacy-device@8 [ 0.217920] parent is /pci@800000020000000 [ 0.218034] found DMA window, table: c00000007f9ba3c0 [ 0.218381] iommu: Adding device 0000:00:08.0 to group 0 [ 0.221509] pci 0000:00:08.0: rtas_msi: disabling existing MSI. [ 0.221833] rtas_msi: ibm,change_msi(func=3,num=0), got 0 rc = -1 [ 0.222044] rtas_msi: ibm,change_msi(func=1,num=0), got 0 rc = -1 [ 0.222208] rtas_msi: Setting MSIs to 0 failed! [ 0.222346] pci_dma_dev_setup_pSeriesLP: 0000:00:07.0 [ 0.222474] node is /pci@800000020000000/ethernet@7 [ 0.222600] parent is /pci@800000020000000 [ 0.222707] found DMA window, table: c00000007f9ba3c0 [ 0.222885] iommu: Adding device 0000:00:07.0 to group 0 [ 0.223361] pci 0000:00:07.0: rtas_msi: disabling existing MSI. [ 0.223531] rtas_msi: ibm,change_msi(func=3,num=0), got 0 rc = -1 [ 0.223698] rtas_msi: ibm,change_msi(func=1,num=0), got 0 rc = -1 [ 0.223855] rtas_msi: Setting MSIs to 0 failed! [ 0.223978] pci_dma_dev_setup_pSeriesLP: 0000:00:06.0 [ 0.224094] node is /pci@800000020000000/ethernet@6 [ 0.224213] parent is /pci@800000020000000 [ 0.224317] found DMA window, table: c00000007f9ba3c0 [ 0.224495] iommu: Adding device 0000:00:06.0 to group 0 [ 0.225012] pci 0000:00:06.0: rtas_msi: disabling existing MSI. [ 0.225178] rtas_msi: ibm,change_msi(func=3,num=0), got 0 rc = -1 [ 0.225338] rtas_msi: ibm,change_msi(func=1,num=0), got 0 rc = -1 [ 0.225504] rtas_msi: Setting MSIs to 0 failed! [ 0.225627] pci_dma_dev_setup_pSeriesLP: 0000:00:05.0 [ 0.225749] node is /pci@800000020000000/unknown-legacy-device@5 [ 0.225890] parent is /pci@800000020000000 [ 0.225991] found DMA window, table: c00000007f9ba3c0 [ 0.226174] iommu: Adding device 0000:00:05.0 to group 0 [ 0.226565] pci 0000:00:05.0: rtas_msi: disabling existing MSI. [ 0.226731] rtas_msi: ibm,change_msi(func=3,num=0), got 0 rc = -1 [ 0.226894] rtas_msi: ibm,change_msi(func=1,num=0), got 0 rc = -1 [ 0.227042] rtas_msi: Setting MSIs to 0 failed! [ 0.227161] pci_dma_dev_setup_pSeriesLP: 0000:00:04.0 [ 0.227278] node is /pci@800000020000000/communication-controller@4 [ 0.227429] parent is /pci@800000020000000 [ 0.227537] found DMA window, table: c00000007f9ba3c0 [ 0.227706] iommu: Adding device 0000:00:04.0 to group 0 [ 0.228047] pci 0000:00:04.0: rtas_msi: disabling existing MSI. [ 0.228221] rtas_msi: ibm,change_msi(func=3,num=0), got 0 rc = -1 [ 0.228385] rtas_msi: ibm,change_msi(func=1,num=0), got 0 rc = -1 [ 0.228599] rtas_msi: Setting MSIs to 0 failed! [ 0.228734] pci_dma_dev_setup_pSeriesLP: 0000:00:03.0 [ 0.228867] node is /pci@800000020000000/scsi@3 [ 0.229137] parent is /pci@800000020000000 [ 0.229263] found DMA window, table: c00000007f9ba3c0 [ 0.229466] iommu: Adding device 0000:00:03.0 to group 0 [ 0.229768] pci 0000:00:03.0: rtas_msi: disabling existing MSI. [ 0.229944] rtas_msi: ibm,change_msi(func=3,num=0), got 0 rc = -1 [ 0.230126] rtas_msi: ibm,change_msi(func=1,num=0), got 0 rc = -1 [ 0.230284] rtas_msi: Setting MSIs to 0 failed! [ 0.230413] pci_dma_dev_setup_pSeriesLP: 0000:00:02.0 [ 0.230546] node is /pci@800000020000000/usb@2 [ 0.230669] parent is /pci@800000020000000 [ 0.230783] found DMA window, table: c00000007f9ba3c0 [ 0.230966] iommu: Adding device 0000:00:02.0 to group 0 [ 0.231235] pci 0000:00:02.0: rtas_msi: disabling existing MSI. [ 0.231418] rtas_msi: ibm,change_msi(func=3,num=0), got 0 rc = -1 [ 0.231598] rtas_msi: ibm,change_msi(func=1,num=0), got 0 rc = -1 [ 0.231752] rtas_msi: Setting MSIs to 0 failed! [ 0.231874] pci_dma_dev_setup_pSeriesLP: 0000:00:01.0 [ 0.231996] node is /pci@800000020000000/vga@1 [ 0.232122] parent is /pci@800000020000000 [ 0.232231] found DMA window, table: c00000007f9ba3c0 [ 0.232416] iommu: Adding device 0000:00:01.0 to group 0 [ 0.232694] pci 0000:00:01.0: rtas_msi: no LSI, nothing to do. [ 0.235387] PCI: Probing PCI hardware done [ 0.277880] HugeTLB registered 16 MB page size, pre-allocated 0 pages [ 0.289910] SCSI subsystem initialized [ 0.291318] usbcore: registered new interface driver usbfs [ 0.292508] usbcore: registered new interface driver hub [ 0.293151] usbcore: registered new device driver usb [ 0.305298] clocksource: Switched to clocksource timebase [ 0.443753] VFS: Disk quotas dquot_6.6.0 [ 0.444154] VFS: Dquot-cache hash table entries: 512 (order 0, 4096 bytes) [ 0.478299] NET: Registered protocol family 2 [ 0.484695] TCP established hash table entries: 16384 (order: 5, 131072 bytes) [ 0.488221] TCP bind hash table entries: 16384 (order: 5, 131072 bytes) [ 0.488806] TCP: Hash tables configured (established 16384 bind 16384) [ 0.489946] UDP hash table entries: 1024 (order: 3, 32768 bytes) [ 0.490270] UDP-Lite hash table entries: 1024 (order: 3, 32768 bytes) [ 0.491928] NET: Registered protocol family 1 [ 0.494500] RPC: Registered named UNIX socket transport module. [ 0.494717] RPC: Registered udp transport module. [ 0.494879] RPC: Registered tcp transport module. [ 0.495017] RPC: Registered tcp NFSv4.1 backchannel transport module. [ 0.497440] mm: Hashing failure ! EA=0xd000080080000010 access=0x800000000000000c current=swapper [ 0.498039] trap=0x300 vsid=0x13d349c ssize=1 base psize=0 psize 0 pte=0xc0000101200001ae [ 0.498398] mm: Hashing failure ! EA=0xd000080080000010 access=0x800000000000000c current=swapper [ 0.498785] trap=0x300 vsid=0x13d349c ssize=1 base psize=0 psize 0 pte=0xc0000101200001ae [ 0.499282] Unable to handle kernel paging request for data at address 0xd000080080000010 [ 0.499575] Faulting instruction address: 0xc00000000053b4bc [ 0.499997] Oops: Kernel access of bad area, sig: 7 [#1] [ 0.500211] PREEMPT pSeries [ 0.500565] Modules linked in: [ 0.500879] CPU: 0 PID: 1 Comm: swapper Not tainted 4.7.0-rc3-pcsum #11 [ 0.501250] task: c00000007f950000 ti: c00000007f94c000 task.ti: c00000007f94c000 [ 0.501526] NIP: c00000000053b4bc LR: c00000000053b4a8 CTR: 0000000000000000 [ 0.501783] REGS: c00000007f94f740 TRAP: 0300 Not tainted (4.7.0-rc3-pcsum) [ 0.502031] MSR: 8000000002009032 CR: 84000242 XER: 00000000 [ 0.502526] CFAR: c0000000000083b4 DAR: d000080080000010 DSISR: 40000000 SOFTE: 1 GPR00: c00000000053b4a8 c00000007f94f9c0 c000000000e24000 d000080080000010 GPR04: d000080080003000 c00000007f8d4018 4000000000000000 000000007f9f0000 GPR08: 000000007f8d4000 c0000101200031ae 00000000000001ff 0000000000000300 GPR12: 0000000084000242 c00000000ffff000 c00000000000b2a0 0000000000000000 GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 GPR20: 0000000000000000 0000000000000000 0000000000000000 c00000000094a3b8 GPR24: c000000000d952a8 c00000000099a098 c00000000102c250 c000000000e51088 GPR28: 000000000000ffff d000080080000000 0000000000004000 c00000007f81b000 [ 0.505368] NIP [c00000000053b4bc] .quirk_usb_early_handoff+0x48c/0xd00 [ 0.505626] LR [c00000000053b4a8] .quirk_usb_early_handoff+0x478/0xd00 [ 0.505896] Call Trace: [ 0.506077] [c00000007f94f9c0] [c00000000053b4a8] .quirk_usb_early_handoff+0x478/0xd00 (unreliable) [ 0.506477] [c00000007f94fab0] [c0000000004312ac] .pci_do_fixups+0xdc/0x140 [ 0.506751] [c00000007f94fb60] [c00000000098427c] .pci_apply_final_quirks+0xb0/0x194 [ 0.507043] [c00000007f94fc10] [c00000000000aa48] .do_one_initcall+0x68/0x1e0 [ 0.507311] [c00000007f94fcf0] [c000000000957278] .kernel_init_freeable+0x238/0x320 [ 0.507597] [c00000007f94fdb0] [c00000000000b2c4] .kernel_init+0x24/0x160 [ 0.507872] [c00000007f94fe30] [c000000000009230] .ret_from_kernel_thread+0x58/0xa8 [ 0.508172] Instruction dump: [ 0.508385] 7d3e07b4 4bfff9e1 2fa30000 41befc8c e87f0320 7fc4f378 4bb093fd 60000000 [ 0.508794] 7c7d1b79 4182fc74 387d0010 7c0004ac <7d201c2c> 0c090000 4c00012c 2f89ffff [ 0.510503] ---[ end trace b7e54f4dcb78b6ca ]--- [ 0.537662] [ 1.540840] Kernel panic - not syncing: Attempted to kill init! exitcode=0x00000007 [ 1.540840] --8t9RHnE3ZwKMSgU+ Content-Type: application/xml Content-Disposition: attachment; filename="libvirt.xml" Content-Transfer-Encoding: quoted-printable =0A=0A=0A mtrp0=0A cf559133-7988-42cd-bef2-= 26f3ee8e385b=0A =0A 2097152=0A 2097152= =0A 4=0A =0A hvm=0A /storage/mtr/ppc64/boot/= vmlinux-4.7.0-rc3-pcsum=0A /storage/mtr/ppc64/boot/init= rd.img-4.7.0-rc3-pcsum=0A ro rootdelay=3D60 panic=3D0 = noresume root=3D/dev/nfs nfsroot=3D192.168.122.1:/storage/mtr/ppc64/ ip=3Dd= hcp hostname=3Dalder-mtrp0 BOOTIF=3D01-52-54-00-26-f3-ee console=3Dhvc0 deb= ug systemd.show_status=3Dfalse=0A =0A = =0A =0A destroy=0A= restart=0A restart=0A =0A /usr/bin/qemu-system-ppc64=0A =0A =0A = =0A =0A
=0A =0A = =0A =0A =0A =0A
=0A =0A =0A = =0A
= =0A =0A =0A
=0A =0A =0A
=0A =0A = =0A = =0A =0A =0A =
=0A =0A =0A =0A =0A= =0A
=0A =0A = =0A =0A
=0A =0A =0A =0A
=0A =0A =0A =0A =
=0A= =0A =0A =0A =0A