* free_irq_vector on ia64
@ 2007-08-30 10:48 Herbert Xu
2007-08-30 14:34 ` Alex Williamson
0 siblings, 1 reply; 5+ messages in thread
From: Herbert Xu @ 2007-08-30 10:48 UTC (permalink / raw)
To: Alex Williamson; +Cc: Xen Development Mailing List
Hi Alex:
I was looking at an ia64 bug report and noticed that we don't
actually free IRQs in the free_irq_vector hypercall. This
would eventually lead to alloc_irq_vector failing. Unless I'm
mistaken something like calling pci_disable_device and
pci_enable_device can lead to this situation.
So I'm wondering what the original problem was and how could
we resolve it without leaking the IRQ. Any ideas?
Thanks,
--
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu ~{PmV>HI~} <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt
^ permalink raw reply [flat|nested] 5+ messages in thread* Re: free_irq_vector on ia64 2007-08-30 10:48 free_irq_vector on ia64 Herbert Xu @ 2007-08-30 14:34 ` Alex Williamson 2007-09-03 7:27 ` [Xen-devel] " Duan, Ronghui 0 siblings, 1 reply; 5+ messages in thread From: Alex Williamson @ 2007-08-30 14:34 UTC (permalink / raw) To: Herbert Xu; +Cc: Xen Development Mailing List, xen-ia64-devel On Thu, 2007-08-30 at 18:48 +0800, Herbert Xu wrote: > Hi Alex: > > I was looking at an ia64 bug report and noticed that we don't > actually free IRQs in the free_irq_vector hypercall. This > would eventually lead to alloc_irq_vector failing. Unless I'm > mistaken something like calling pci_disable_device and > pci_enable_device can lead to this situation. > > So I'm wondering what the original problem was and how could > we resolve it without leaking the IRQ. Any ideas? Hi Herbert, I don't think we ever investigated this any further, though there's obviously something wrong there. I believe you're referring to this cset: http://xenbits.xensource.com/xen-unstable.hg?rev/968caf47b548 Unfortunately, the comment is still true. This is fairly simply to reproduce, hide a PCI device from dom0 using something like pciback.hide=(0000:01:02.1) on the dom0 append line (this is hiding function 1 or a 2 port/function e1000 card). Simply boot dom0, reboot, badness... Unable to handle kernel paging request at virtual address 0000007366627375 reboot[2703]: Oops 8813272891392 [1] Modules linked in: Pid: 2703, CPU 1, comm: reboot psr : 00001010085a6010 ifs : 800000000000038a ip : [<a0000001000acab0>] Not tainted ip is at notifier_call_chain+0x30/0xc0 unat: 0000000000000000 pfs : 400000000000038a rsc : 0000000000000007 rnat: 0000000000000000 bsps: 0000000000000000 pr : 000000000055a959 ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c0270033f csd : 0000000000000000 ssd : 0000000000000000 b0 : a0000001000aed00 b6 : a000000100018610 b7 : a000000100018570 f6 : 000000000000000000000 f7 : 000000000000000000000 f8 : 000000000000000000000 f9 : 000000000000000000000 f10 : 000000000000000000000 f11 : 000000000000000000000 r1 : a0000001011225a0 r2 : 4000000000000792 r3 : a000000100f3e368 r8 : 0000007366627375 r9 : 60000fffff4bfc90 r10 : 0000000000000000 r11 : 0000000000000008 r12 : e0000001b7d57d30 r13 : e0000001b7d50000 r14 : e0000001be05a880 r15 : 0000000100000000 r16 : 0000000000000000 r17 : 0000000001234567 r18 : 0000000000200000 r19 : 0000000000000008 r20 : 2000000000244200 r21 : 0009804c8a70033f r22 : e0000001b7d50f70 r23 : 60000fff7fffc0c8 r24 : 0000000000000000 r25 : 0000000000000000 r26 : c00000000000010a r27 : 0000000000000000 r28 : fffffffffff00031 r29 : 00001213085a6010 r30 : 0000000000000000 r31 : a000000100d0b080 Call Trace: [<a00000010001d520>] show_stack+0x40/0xa0 sp=e0000001b7d578e0 bsp=e0000001b7d511a8 [<a00000010001e180>] show_regs+0x840/0x880 sp=e0000001b7d57ab0 bsp=e0000001b7d51150 [<a000000100042900>] die+0x1c0/0x380 sp=e0000001b7d57ab0 bsp=e0000001b7d51108 [<a000000100066970>] ia64_do_page_fault+0x870/0x9a0 sp=e0000001b7d57ad0 bsp=e0000001b7d510b8 [<a000000100069140>] xen_leave_kernel+0x0/0x3e0 sp=e0000001b7d57b60 bsp=e0000001b7d510b8 [<a0000001000acab0>] notifier_call_chain+0x30/0xc0 sp=e0000001b7d57d30 bsp=e0000001b7d51068 [<a0000001000aed00>] blocking_notifier_call_chain+0x40/0x80 sp=e0000001b7d57d30 bsp=e0000001b7d51030 [<a0000001000afb10>] kernel_restart+0x30/0x120 sp=e0000001b7d57d30 bsp=e0000001b7d51010 [<a0000001000afff0>] sys_reboot+0x3b0/0x480 sp=e0000001b7d57d30 bsp=e0000001b7d50f90 [<a000000100014560>] ia64_ret_from_syscall+0x0/0x40 sp=e0000001b7d57e30 bsp=e0000001b7d50f90 [<a0000000000108e0>] __kernel_syscall_via_break+0x0/0x20 sp=e0000001b7d58000 bsp=e0000001b7d50f90 /etc/rc6.d/S90reboot: line 17: 2703 Segmentation fault reboot -d -f -i I'd guess this is some kind of double free that we need to track down. Thanks, Alex -- Alex Williamson HP Open Source & Linux Org. ^ permalink raw reply [flat|nested] 5+ messages in thread
* RE: [Xen-devel] Re: free_irq_vector on ia64 2007-08-30 14:34 ` Alex Williamson @ 2007-09-03 7:27 ` Duan, Ronghui 2007-09-04 16:08 ` Alex Williamson 2007-09-04 19:44 ` Alex Williamson 0 siblings, 2 replies; 5+ messages in thread From: Duan, Ronghui @ 2007-09-03 7:27 UTC (permalink / raw) To: Alex Williamson, Herbert Xu; +Cc: Xen Development Mailing List, xen-ia64-devel Hi Alex: I follow your steps to hide e1000 from domain0, adding "pciback.hide== (0000:01:00.0)" in append line, then reboot .It seem that all right. The NAT consumption fault may have fixed by this patch: http://xenbits.xensource.com/xen-unstable.hg?rev/7158623a1b3d Could you please have a check? BTW: It seems that we don't free irq_handler of e1000, so when rebooting, there may be some warning message printing in the serial port! Thank you! Best regards Duan Ronghui -----Original Message----- From: xen-devel-bounces@lists.xensource.com [mailto:xen-devel-bounces@lists.xensource.com] On Behalf Of Alex Williamson Sent: Thursday, August 30, 2007 10:35 PM To: Herbert Xu Cc: Xen Development Mailing List; xen-ia64-devel Subject: [Xen-devel] Re: free_irq_vector on ia64 On Thu, 2007-08-30 at 18:48 +0800, Herbert Xu wrote: > Hi Alex: > > I was looking at an ia64 bug report and noticed that we don't > actually free IRQs in the free_irq_vector hypercall. This > would eventually lead to alloc_irq_vector failing. Unless I'm > mistaken something like calling pci_disable_device and > pci_enable_device can lead to this situation. > > So I'm wondering what the original problem was and how could > we resolve it without leaking the IRQ. Any ideas? Hi Herbert, I don't think we ever investigated this any further, though there's obviously something wrong there. I believe you're referring to this cset: http://xenbits.xensource.com/xen-unstable.hg?rev/968caf47b548 Unfortunately, the comment is still true. This is fairly simply to reproduce, hide a PCI device from dom0 using something like pciback.hide=(0000:01:02.1) on the dom0 append line (this is hiding function 1 or a 2 port/function e1000 card). Simply boot dom0, reboot, badness... Unable to handle kernel paging request at virtual address 0000007366627375 reboot[2703]: Oops 8813272891392 [1] Modules linked in: Pid: 2703, CPU 1, comm: reboot psr : 00001010085a6010 ifs : 800000000000038a ip : [<a0000001000acab0>] Not tainted ip is at notifier_call_chain+0x30/0xc0 unat: 0000000000000000 pfs : 400000000000038a rsc : 0000000000000007 rnat: 0000000000000000 bsps: 0000000000000000 pr : 000000000055a959 ldrs: 0000000000000000 ccv : 0000000000000000 fpsr: 0009804c0270033f csd : 0000000000000000 ssd : 0000000000000000 b0 : a0000001000aed00 b6 : a000000100018610 b7 : a000000100018570 f6 : 000000000000000000000 f7 : 000000000000000000000 f8 : 000000000000000000000 f9 : 000000000000000000000 f10 : 000000000000000000000 f11 : 000000000000000000000 r1 : a0000001011225a0 r2 : 4000000000000792 r3 : a000000100f3e368 r8 : 0000007366627375 r9 : 60000fffff4bfc90 r10 : 0000000000000000 r11 : 0000000000000008 r12 : e0000001b7d57d30 r13 : e0000001b7d50000 r14 : e0000001be05a880 r15 : 0000000100000000 r16 : 0000000000000000 r17 : 0000000001234567 r18 : 0000000000200000 r19 : 0000000000000008 r20 : 2000000000244200 r21 : 0009804c8a70033f r22 : e0000001b7d50f70 r23 : 60000fff7fffc0c8 r24 : 0000000000000000 r25 : 0000000000000000 r26 : c00000000000010a r27 : 0000000000000000 r28 : fffffffffff00031 r29 : 00001213085a6010 r30 : 0000000000000000 r31 : a000000100d0b080 Call Trace: [<a00000010001d520>] show_stack+0x40/0xa0 sp=e0000001b7d578e0 bsp=e0000001b7d511a8 [<a00000010001e180>] show_regs+0x840/0x880 sp=e0000001b7d57ab0 bsp=e0000001b7d51150 [<a000000100042900>] die+0x1c0/0x380 sp=e0000001b7d57ab0 bsp=e0000001b7d51108 [<a000000100066970>] ia64_do_page_fault+0x870/0x9a0 sp=e0000001b7d57ad0 bsp=e0000001b7d510b8 [<a000000100069140>] xen_leave_kernel+0x0/0x3e0 sp=e0000001b7d57b60 bsp=e0000001b7d510b8 [<a0000001000acab0>] notifier_call_chain+0x30/0xc0 sp=e0000001b7d57d30 bsp=e0000001b7d51068 [<a0000001000aed00>] blocking_notifier_call_chain+0x40/0x80 sp=e0000001b7d57d30 bsp=e0000001b7d51030 [<a0000001000afb10>] kernel_restart+0x30/0x120 sp=e0000001b7d57d30 bsp=e0000001b7d51010 [<a0000001000afff0>] sys_reboot+0x3b0/0x480 sp=e0000001b7d57d30 bsp=e0000001b7d50f90 [<a000000100014560>] ia64_ret_from_syscall+0x0/0x40 sp=e0000001b7d57e30 bsp=e0000001b7d50f90 [<a0000000000108e0>] __kernel_syscall_via_break+0x0/0x20 sp=e0000001b7d58000 bsp=e0000001b7d50f90 /etc/rc6.d/S90reboot: line 17: 2703 Segmentation fault reboot -d -f -i I'd guess this is some kind of double free that we need to track down. Thanks, Alex -- Alex Williamson HP Open Source & Linux Org. _______________________________________________ Xen-devel mailing list Xen-devel@lists.xensource.com http://lists.xensource.com/xen-devel ^ permalink raw reply [flat|nested] 5+ messages in thread
* RE: Re: free_irq_vector on ia64 2007-09-03 7:27 ` [Xen-devel] " Duan, Ronghui @ 2007-09-04 16:08 ` Alex Williamson 2007-09-04 19:44 ` Alex Williamson 1 sibling, 0 replies; 5+ messages in thread From: Alex Williamson @ 2007-09-04 16:08 UTC (permalink / raw) To: Duan, Ronghui Cc: Xen Development Mailing List, Herbert Xu, Zhang, Xing Z, Xu, Anthony, Zhang, Xiantao, xen-ia64-devel On Mon, 2007-09-03 at 15:27 +0800, Duan, Ronghui wrote: > Hi Alex: > > I follow your steps to hide e1000 from domain0, adding > "pciback.hide== > (0000:01:00.0)" in append line, then reboot .It seem that all right. The > NAT consumption fault may have fixed by this patch: > http://xenbits.xensource.com/xen-unstable.hg?rev/7158623a1b3d > > Could you please have a check? That could be why I don't get a NaT consumption like the comment indicated, but I still get the Oops. It doesn't seem like there's anything platform specific here, so I'm not sure why you don't see it. Note that I'm hiding function 1 of a two port, two function e1000, while you're hiding function 0. I don't know if that matters, but perhaps a variable to play with. > BTW: It seems that we don't free irq_handler of e1000, so when > rebooting, there may be some warning message printing in the serial > port! > Thank you! Yup, that's a bug in our base 2.6.18 kernel which should go away next time we re-base. However, that's a separate issue. Thanks, Alex -- Alex Williamson HP Open Source & Linux Org. ^ permalink raw reply [flat|nested] 5+ messages in thread
* RE: Re: free_irq_vector on ia64 2007-09-03 7:27 ` [Xen-devel] " Duan, Ronghui 2007-09-04 16:08 ` Alex Williamson @ 2007-09-04 19:44 ` Alex Williamson 1 sibling, 0 replies; 5+ messages in thread From: Alex Williamson @ 2007-09-04 19:44 UTC (permalink / raw) To: Duan, Ronghui Cc: Xen Development Mailing List, Herbert Xu, Zhang, Xing Z, Xu, Anthony, Zhang, Xiantao, xen-ia64-devel Here's what I see on dom0 bootup if I revert xen-unstable.hg cset 12884 (plus a little debug code): ... pciback 0000:01:02.1: seizing device [port 1 of e1000] pciback 0000:02:05.0: seizing device [a tulip nic] pciback 0000:16:05.0: seizing device [another tulip nic] ... (XEN) assign_irq_vector(-1)... (XEN) assign_irq_vector(-1) = 59 GSI 76 (level, low) -> CPU 2 (0x0200) vector 59 ACPI: PCI Interrupt 0000:16:05.0[A] -> GSI 76 (level, low) -> IRQ 59 ACPI: PCI interrupt for device 0000:16:05.0 disabled GSI 76 (level, low) -> CPU 2 (0x0200) vector 59 unregistered (XEN) free_irq_vector(59) ... (XEN) assign_irq_vector(-1)... (XEN) assign_irq_vector(-1) = 59 GSI 28 (level, low) -> CPU 3 (0x0300) vector 59 ACPI: PCI Interrupt 0000:02:05.0[A] -> GSI 28 (level, low) -> IRQ 59 ACPI: PCI interrupt for device 0000:02:05.0 disabled GSI 28 (level, low) -> CPU 3 (0x0300) vector 59 unregistered (XEN) free_irq_vector(59) ... (XEN) assign_irq_vector(-1)... (XEN) assign_irq_vector(-1) = 59 GSI 32 (level, low) -> CPU 0 (0x0000) vector 59 ACPI: PCI Interrupt 0000:01:02.1[B] -> GSI 32 (level, low) -> IRQ 59 ACPI: PCI interrupt for device 0000:01:02.1 disabled GSI 32 (level, low) -> CPU 0 (0x0000) vector 59 unregistered (XEN) free_irq_vector(59) ... (XEN) assign_irq_vector(-1)... (XEN) assign_irq_vector(-1) = 59 GSI 19 (level, low) -> CPU 1 (0x0100) vector 59 ACPI: PCI Interrupt 0000:00:02.2[C] -> GSI 19 (level, low) -> IRQ 59 ehci_hcd 0000:00:02.2: EHCI Host Controller ehci_hcd 0000:00:02.2: new USB bus registered, assigned bus number 1 ehci_hcd 0000:00:02.2: request interrupt 59 failed ehci_hcd 0000:00:02.2: USB bus 1 deregistered ACPI: PCI interrupt for device 0000:00:02.2 disabled GSI 19 (level, low) -> CPU 1 (0x0100) vector 59 unregistered (XEN) free_irq_vector(59) So, I think each of the assigns of the hidden devices is from the pci_enable_device() call in pcistub_init_device(). This then immediately calls pciback_reset_device() which frees the irq. Note how vector 59 gets tossed around the hidden devices, then ends up being re-used for the USB device and it doesn't work (at least request_irq() failed). The order of device startup might have more to do with the Oops on shutdown than anything else (maybe a bad error path in the usb shutdown notifier chain). There's a slightly scary comment in pci_stub.c that we likely run afoul of if we reuse the interrupt vector: /* HACK: Force device (& ACPI) to determine what IRQ it's on - we * must do this here because pcibios_enable_device may specify * the pci device's true irq (and possibly its other resources) * if they differ from what's in the configuration space. * This makes the assumption that the device's resources won't * change after this point (otherwise this code may break!) */ When I run lspci on these devices, they all show up on IRQ 59. So, not calling free_irq_vector() is a bad hack, but it makes sure the interrupt vector assigned to the hidden doesn't get recycled somewhere else. Perhaps we need make sure pciback_reset_device() doesn't release the vector, but it's not obvious how to do that. Thanks, Alex -- Alex Williamson HP Open Source & Linux Org. ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2007-09-04 19:44 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2007-08-30 10:48 free_irq_vector on ia64 Herbert Xu 2007-08-30 14:34 ` Alex Williamson 2007-09-03 7:27 ` [Xen-devel] " Duan, Ronghui 2007-09-04 16:08 ` Alex Williamson 2007-09-04 19:44 ` Alex Williamson
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.