From mboxrd@z Thu Jan 1 00:00:00 1970 From: Jeremy Fitzhardinge Subject: Re: pv_ops dom0 kernel failure with ata_piix Date: Thu, 01 Jan 2009 08:36:33 +1100 Message-ID: <495BE5E1.40602@goop.org> References: <20081230143440.GC15052@edu.joroinen.fi> <20081230144422.GD15052@edu.joroinen.fi> <1e16a9ed0812300803i325f3f02qa8d501fe13456e96@mail.gmail.com> <20081230161939.GG15052@edu.joroinen.fi> <20081230162835.GH15052@edu.joroinen.fi> <1e16a9ed0812300847s5dd1c2afj4fabcd4169258e40@mail.gmail.com> <20081230165211.GI15052@edu.joroinen.fi> <20081230181845.GK15052@edu.joroinen.fi> <20081230190356.GL15052@edu.joroinen.fi> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: <20081230190356.GL15052@edu.joroinen.fi> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: =?ISO-8859-1?Q?Pasi_K=E4rkk=E4inen?= Cc: Todd Deshane , xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org Pasi K=E4rkk=E4inen wrote: > On Tue, Dec 30, 2008 at 08:18:45PM +0200, Pasi K=E4rkk=E4inen wrote: > =20 >> On Tue, Dec 30, 2008 at 06:52:11PM +0200, Pasi K=E4rkk=E4inen wrote: >> =20 >>> On Tue, Dec 30, 2008 at 11:47:40AM -0500, Todd Deshane wrote: >>> =20 >>>> On Tue, Dec 30, 2008 at 11:28 AM, Pasi K=E4rkk=E4inen = wrote: >>>> =20 >>>>>>> Try changing the SATA mode of the disk in the BIOS to either lega= cy or >>>>>>> ATA, depending >>>>>>> on what you bios supports. >>>>>>> >>>>>>> There is still a known issue with some of the AHCI stuff. >>>>>>> >>>>>>> =20 >>>>>> I'm not using AHCI at all.. AHCI (SATA) controller does not have a= ny disks >>>>>> plugged in. >>>>>> >>>>>> My (only) disk is attached to IDE/PATA ICH6 controller.. >>>>>> >>>>>> =20 >>>>> I tried disabling AHCI anyway (even when I'm not using the AHCI SAT= A >>>>> controller), but it didn't help. >>>>> >>>>> Booting the pv_ops dom0 kernel still fails.. because the IDE/PATA c= ontroller >>>>> still gets disabled (just like in the logs above) for some unknown = reason. >>>>> >>>>> -- Pasi >>>>> =20 >>>> So from the log it looks like you are getting a hard crash? >>>> >>>> =20 >>> No, it's not a hard crash.. I'm able to reboot the box with ctrl-alt-= del.=20 >>> >>> =20 >>>> Did you try adding noreboot to the xen command line and then trying >>>> to get any more information with sysreq stuff? >>>> >>>> Another thing that I noticed in the log is: >>>> >>>> irq 12: nobody cared (try booting with the "irqpoll" option) >>>> >>>> Is it an smp kernel? I had some weird problems when passing nosmp at >>>> one point. >>>> >>>> =20 >>> Yep, it is SMP kernel..=20 >>> =20 >>> =20 >>>> I don't recognize this crash, those module traces, at least for me w= ere >>>> non-fatal and didn't cause a crash. I was normally just dropped to a >>>> initramfs prompt in the case when the disk was not detected properly= . >>>> >>>> I was then able to at least look around and check that modules were >>>> loaded correctly etc. >>>> >>>> =20 >>> The problem seems to be the IDE controller/disk is not detected at al= l.. >>> because the device gets disabled for some reason during ata_piix >>> loading/probing..=20 >>> >>> That is the only disk I have, so the root partition is on it. >>> >>> Same kernel works just fine on baremetal without Xen. >>> >>> =20 >> So I guess the subject should be 'failure with ata_piix' instead of 'c= rash'..=20 >> >> Anyway, there seems to be problems with ata_piix too.. in addition to = AHCI. >> >> Jeremy: Can you think of anything to try?=20 >> >> =20 > > I tried booting with "nosmp" dom0 kernel parameter: > =20 Yes, nosmp is known to make things worse. It has the side-effect of=20 disabling io_apic support, so all the interrupt stuff fails in the way=20 you've noticed. J