From: Jack Steiner <steiner@sgi.com>
To: Yinghai Lu <yhlu.kernel@gmail.com>
Cc: Ingo Molnar <mingo@elte.hu>,
tglx@linutronix.de, linux-mm@kvack.org,
linux-kernel@vger.kernel.org, "H. Peter Anvin" <hpa@zytor.com>,
mike travis <travis@sgi.com>
Subject: Re: [PATCH] - Increase MAX_APICS for large configs
Date: Fri, 18 Apr 2008 19:01:44 -0500 [thread overview]
Message-ID: <20080419000144.GA24486@sgi.com> (raw)
In-Reply-To: <86802c440804181501w4e9563f2oe154c0744076e91e@mail.gmail.com>
On Fri, Apr 18, 2008 at 03:01:43PM -0700, Yinghai Lu wrote:
> On Fri, Apr 18, 2008 at 2:14 PM, Jack Steiner <steiner@sgi.com> wrote:
> > On Thu, Apr 17, 2008 at 01:07:27PM +0200, Ingo Molnar wrote:
> > >
> > > * Jack Steiner <steiner@sgi.com> wrote:
> > >
> > > > Increase the maximum number of apics when running very large
> > > > configurations. This patch has no affect on most systems.
> > >
> > > x86.git overnight random-qa testing found a boot crash and i bisected it
> > > down to this patch. The config is:
> > >
> > > http://redhat.com/~mingo/misc/config-Thu_Apr_17_10_17_14_CEST_2008.bad
> > >
> > > the failure is attached below. (I needed the exact boot parameters
> > > listed in that bootup log to see this failure.)
> > >
> > > it seems to be CONFIG_MAXSMP=y triggers the new more-apic-ids code and
> > > that causes some breakage elsewhere. [btw., this again shows how useful
> > > the CONFIG_MAXSMP debug feature is!]
> > >
> > > Ingo
> > >
> > > [ 0.000000] Linux version 2.6.25-rc9-sched-devel.git-x86-latest.git (mingo@dione) (gcc version 4.2.3) #260 SMP Thu Apr 17 10:58:11 CEST 2008
> > > [ 0.000000] Command line: root=/dev/sda6 console=ttyS0,115200 earlyprintk=serial,ttyS0,115200 debug initcall_debug apic=verbose sysrq_always_enabled ignore_loglevel selinux=0 nmi_watchdog=2 profile=0 nosmp highres=0 nolapic_timer hpet=disable idle=poll highmem=512m nopat acpi=off
> > > [ 0.000000] BIOS-provided physical RAM map:
> >
> > Has anyone seen this failure?? (Using git://git.kernel.org/pub/scm/linux/kernel/git/x86/linux-2.6-x86.git
> > from 4/18 AM).
> >
> > I tried to reproduce the above failure on a small system & was not successful.
> >
> > Switched to a larger system (XE310 Intel-based 8p, 6GB). All attempts to boot fail
> > with the following. I backed out the MAX_APIC change, & changed NR_CPUS=8. Still fails.
> >
> > ...
> > [ 32.010000] ehci_hcd 0000:00:1d.7: port 6 high speed
> > [ 32.010000] ehci_hcd 0000:00:1d.7: GetStatus port 6 status 001005 POWER sig=se0 PE CONNECT
> > [ 32.054003] usb usb2: New USB device found, idVendor=1d6b, idProduct=0001
> > [ 32.058003] usb usb2: New USB device strings: Mfr=3, Product=2, SerialNumber=1
> > [ 32.062003] usb usb2: Product: UHCI Host Controller
> > [ 32.066004] usb usb2: Manufacturer: Linux 2.6.25-x86-latest.git uhci_hcd
> > [ 32.070004] usb usb2: SerialNumber: 0000:00:1d.0
> > [ 32.074004] PCI: Found IRQ 10 for device 0000:00:1d.1
> > [ 32.078004] PCI: Sharing IRQ 10 with 0000:00:1f.2
> > [ 32.082005] PCI: Sharing IRQ 10 with 0000:00:1f.3
> > [ 32.086005] PCI: Sharing IRQ 10 with 0000:04:00.1
> > [ 32.090005] PCI: Setting latency timer of device 0000:00:1d.1 to 64
> > [ 32.094005] uhci_hcd 0000:00:1d.1: UHCI Host Controller
> > [ 32.098006] usb 1-6: new high speed USB device using ehci_hcd and address 2
> > [ 32.102006] nommu_map_single: overflow 1af757720+8
> >
> > Full log:
> >
> >
> > [ 0.000000] Linux version 2.6.25-x86-latest.git (root@cleopatra1) (gcc version 4.1.1 20070105 (Red Hat 4.1.1-52)) #2 SMP Fri Apr 18 09:36:33 CDT 2008
> > [ 0.000000] Command line: root=/dev/sda2 console=ttyS1,38400n8 debug initcall_debug apic=verbose sysrq_always_enabled ignore_loglevel selinux=0 nmi_watchdog=2 profile=0 nosmp highres=0 nolapic_timer hpet=disable idle=poll highmem=512m nopat acpi=off
>
> how about without acpi=off?
>
> can you make sure acpi=off works with previous kernel in that box?
Old kernels work with & without "acpi=off".
New kernels fail with & without "acpi=off".
I'll start to narrow down the exact config/boot options that are causing the
failure.
--- jack
WARNING: multiple messages have this Message-ID (diff)
From: Jack Steiner <steiner@sgi.com>
To: Yinghai Lu <yhlu.kernel@gmail.com>
Cc: Ingo Molnar <mingo@elte.hu>,
tglx@linutronix.de, linux-mm@kvack.org,
linux-kernel@vger.kernel.org, "H. Peter Anvin" <hpa@zytor.com>,
mike travis <travis@sgi.com>
Subject: Re: [PATCH] - Increase MAX_APICS for large configs
Date: Fri, 18 Apr 2008 19:01:44 -0500 [thread overview]
Message-ID: <20080419000144.GA24486@sgi.com> (raw)
In-Reply-To: <86802c440804181501w4e9563f2oe154c0744076e91e@mail.gmail.com>
On Fri, Apr 18, 2008 at 03:01:43PM -0700, Yinghai Lu wrote:
> On Fri, Apr 18, 2008 at 2:14 PM, Jack Steiner <steiner@sgi.com> wrote:
> > On Thu, Apr 17, 2008 at 01:07:27PM +0200, Ingo Molnar wrote:
> > >
> > > * Jack Steiner <steiner@sgi.com> wrote:
> > >
> > > > Increase the maximum number of apics when running very large
> > > > configurations. This patch has no affect on most systems.
> > >
> > > x86.git overnight random-qa testing found a boot crash and i bisected it
> > > down to this patch. The config is:
> > >
> > > http://redhat.com/~mingo/misc/config-Thu_Apr_17_10_17_14_CEST_2008.bad
> > >
> > > the failure is attached below. (I needed the exact boot parameters
> > > listed in that bootup log to see this failure.)
> > >
> > > it seems to be CONFIG_MAXSMP=y triggers the new more-apic-ids code and
> > > that causes some breakage elsewhere. [btw., this again shows how useful
> > > the CONFIG_MAXSMP debug feature is!]
> > >
> > > Ingo
> > >
> > > [ 0.000000] Linux version 2.6.25-rc9-sched-devel.git-x86-latest.git (mingo@dione) (gcc version 4.2.3) #260 SMP Thu Apr 17 10:58:11 CEST 2008
> > > [ 0.000000] Command line: root=/dev/sda6 console=ttyS0,115200 earlyprintk=serial,ttyS0,115200 debug initcall_debug apic=verbose sysrq_always_enabled ignore_loglevel selinux=0 nmi_watchdog=2 profile=0 nosmp highres=0 nolapic_timer hpet=disable idle=poll highmem=512m nopat acpi=off
> > > [ 0.000000] BIOS-provided physical RAM map:
> >
> > Has anyone seen this failure?? (Using git://git.kernel.org/pub/scm/linux/kernel/git/x86/linux-2.6-x86.git
> > from 4/18 AM).
> >
> > I tried to reproduce the above failure on a small system & was not successful.
> >
> > Switched to a larger system (XE310 Intel-based 8p, 6GB). All attempts to boot fail
> > with the following. I backed out the MAX_APIC change, & changed NR_CPUS=8. Still fails.
> >
> > ...
> > [ 32.010000] ehci_hcd 0000:00:1d.7: port 6 high speed
> > [ 32.010000] ehci_hcd 0000:00:1d.7: GetStatus port 6 status 001005 POWER sig=se0 PE CONNECT
> > [ 32.054003] usb usb2: New USB device found, idVendor=1d6b, idProduct=0001
> > [ 32.058003] usb usb2: New USB device strings: Mfr=3, Product=2, SerialNumber=1
> > [ 32.062003] usb usb2: Product: UHCI Host Controller
> > [ 32.066004] usb usb2: Manufacturer: Linux 2.6.25-x86-latest.git uhci_hcd
> > [ 32.070004] usb usb2: SerialNumber: 0000:00:1d.0
> > [ 32.074004] PCI: Found IRQ 10 for device 0000:00:1d.1
> > [ 32.078004] PCI: Sharing IRQ 10 with 0000:00:1f.2
> > [ 32.082005] PCI: Sharing IRQ 10 with 0000:00:1f.3
> > [ 32.086005] PCI: Sharing IRQ 10 with 0000:04:00.1
> > [ 32.090005] PCI: Setting latency timer of device 0000:00:1d.1 to 64
> > [ 32.094005] uhci_hcd 0000:00:1d.1: UHCI Host Controller
> > [ 32.098006] usb 1-6: new high speed USB device using ehci_hcd and address 2
> > [ 32.102006] nommu_map_single: overflow 1af757720+8
> >
> > Full log:
> >
> >
> > [ 0.000000] Linux version 2.6.25-x86-latest.git (root@cleopatra1) (gcc version 4.1.1 20070105 (Red Hat 4.1.1-52)) #2 SMP Fri Apr 18 09:36:33 CDT 2008
> > [ 0.000000] Command line: root=/dev/sda2 console=ttyS1,38400n8 debug initcall_debug apic=verbose sysrq_always_enabled ignore_loglevel selinux=0 nmi_watchdog=2 profile=0 nosmp highres=0 nolapic_timer hpet=disable idle=poll highmem=512m nopat acpi=off
>
> how about without acpi=off?
>
> can you make sure acpi=off works with previous kernel in that box?
Old kernels work with & without "acpi=off".
New kernels fail with & without "acpi=off".
I'll start to narrow down the exact config/boot options that are causing the
failure.
--- jack
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2008-04-19 0:01 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-04-16 16:39 [PATCH] - Increase MAX_APICS for large configs Jack Steiner
2008-04-16 16:39 ` Jack Steiner
2008-04-16 18:45 ` Ingo Molnar
2008-04-16 18:45 ` Ingo Molnar
2008-04-16 19:22 ` Jack Steiner
2008-04-16 19:22 ` Jack Steiner
2008-04-17 11:07 ` Ingo Molnar
2008-04-17 11:07 ` Ingo Molnar
2008-04-17 18:02 ` Yinghai Lu
2008-04-17 18:02 ` Yinghai Lu
2008-04-18 21:14 ` Jack Steiner
2008-04-18 21:14 ` Jack Steiner
2008-04-18 22:01 ` Yinghai Lu
2008-04-18 22:01 ` Yinghai Lu
2008-04-19 0:01 ` Jack Steiner [this message]
2008-04-19 0:01 ` Jack Steiner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20080419000144.GA24486@sgi.com \
--to=steiner@sgi.com \
--cc=hpa@zytor.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mingo@elte.hu \
--cc=tglx@linutronix.de \
--cc=travis@sgi.com \
--cc=yhlu.kernel@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.