From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Yinghai Lu" Subject: Re: [Bug #10872] x86_64 boot hang when CONFIG_NUMA=n Date: Tue, 8 Jul 2008 00:07:03 -0700 Message-ID: <86802c440807080007n19cd3025mde6a3b082bb9baef@mail.gmail.com> References: <20080706111750.a6d77e81.randy.dunlap@oracle.com> <20080707063218.GC23583@elte.hu> <20080707113917.6fda3f7d.randy.dunlap@oracle.com> <20080707154042.04bb492e.randy.dunlap@oracle.com> <86802c440807071724i1a186879h479460ada28e1b26@mail.gmail.com> <20080707222829.d79b6e2f.randy.dunlap@oracle.com> Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Return-path: DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:cc:in-reply-to:mime-version:content-type :content-transfer-encoding:content-disposition:references; bh=yWfgvxj7BW2bjdkp3KcGmYbZGTL1AC22NCQnlhbVOes=; b=XHwAEb6NDV53ngospKnL2IKsMYrBP/ILs808MkiRIGgV8cU2iUJHb1jk2Z08t7WMzN VgnA7wAYmxF2rR34fL4pB4ITrDyqUITQjhlOmkv4MLZidcLr7CHxz/XWSPHxUp8TCqYt JVMGaU3sVi7xml46uOCIRAfP4GTx0Jejv6EDg= In-Reply-To: <20080707222829.d79b6e2f.randy.dunlap-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org> Content-Disposition: inline Sender: kernel-testers-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org List-ID: Content-Type: text/plain; charset="us-ascii" To: Randy Dunlap Cc: lkml , Ingo Molnar , Linus Torvalds , "Rafael J. Wysocki" , Kernel Testers List On Mon, Jul 7, 2008 at 10:28 PM, Randy Dunlap wrote: > On Mon, 7 Jul 2008 17:24:16 -0700 Yinghai Lu wrote: > >> On Mon, Jul 7, 2008 at 3:40 PM, Randy Dunlap wrote: >> > On Mon, 7 Jul 2008 11:39:17 -0700 Randy Dunlap wrote: >> > >> >> On Mon, 7 Jul 2008 08:32:18 +0200 Ingo Molnar wrote: >> >> >> >> > >> >> > * Linus Torvalds wrote: >> >> > >> >> > > > This still happens with 2.6.26-rc9. Using CONFIG_NUMA=y boots OK. >> >> > > >> >> > > Ok, then it wasn't the nr_zones thing. >> >> > > >> >> > > Since it seems to be repeatable for you, can you bisect it? >> >> > >> >> > one guess would be: >> >> > >> >> > | commit e8ee6f0ae5cd860e8e6c02807edfa3c1fa01bcb5 >> >> > | Author: Yinghai Lu >> >> > | Date: Sun Apr 13 01:41:58 2008 -0700 >> >> > | >> >> > | x86: work around io allocation overlap of HT links >> >> > >> >> > but ... since CONFIG_NUMA makes it work, i'm not sure about that. >> >> > >> >> > Randy, could you post the full CONFIG_NUMA bootlog as well, does it show >> >> > any difference in resource allocations? >> >> >> >> Good and bad boot logs are attached. There are several differences, but I don't >> >> see any that are significant. >> >> >> >> I've started bisecting with: >> >> >> >> $ git bisect start >> >> $ git bisect bad v2.6.26-rc1 >> >> $ git bisect good v2.6.25 >> >> >> >> That's only about 1.29M lines of changes. >> > >> > git bisect and normal rebooting did not find a problem. >> > >> > I'll repeat this using kexec to boot the new kernel and see if that >> > locates any issues... since I normally use kexec to load/test new kernels >> > and that was how the failure occurred (occurs). >> > >> >> same NON-NUMA kernel kexec NON-NUMA kernel? >> >> or other kernel kexex it? > > Ah. Good question. I hadn't noticed that. > NUMA kernel kexec-ing a non-NUMA kernel now fails, but it worked in 2.6.25. > can you resend out that two config? YH