From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757475AbYDHWDe (ORCPT ); Tue, 8 Apr 2008 18:03:34 -0400
Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752602AbYDHWDZ (ORCPT ); Tue, 8 Apr 2008 18:03:25 -0400
Received: from relay1.sgi.com ([192.48.171.29]:53058 "EHLO relay.sgi.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1752469AbYDHWDY (ORCPT ); Tue, 8 Apr 2008 18:03:24 -0400
Message-ID: <47FBEBA9.30005@sgi.com>
Date: Tue, 08 Apr 2008 15:03:21 -0700
From: Mike Travis
User-Agent: Thunderbird 2.0.0.6 (X11/20070801)
MIME-Version: 1.0
To: Yinghai Lu
CC: Ingo Molnar, Thomas Gleixner, "H. Peter Anvin", Andrew Morton, linux-kernel@vger.kernel.org
Subject: Re: [PATCH 0/2] NR_CPUS: increase maximum NR_CPUS to 4096
References: <20080405013014.478571000@polaris-admin.engr.sgi.com> <86802c440804081453y1a0a8e1ds8aac6074b993c476@mail.gmail.com>
In-Reply-To: <86802c440804081453y1a0a8e1ds8aac6074b993c476@mail.gmail.com>
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

Yinghai Lu wrote:
> On Fri, Apr 4, 2008 at 6:30 PM, Mike Travis wrote:
>> * Increases the limit of NR_CPUS to 4096 and introduces a
>> boolean called "MAXSMP" which when set (e.g. "allyesconfig")
>> will set NR_CPUS = 4096 and NODES_SHIFT = 9 (512).
>>
>> I've been running this config (4k NR_CPUS, 512 Max Nodes)
>> on an AMD box with 2 dual-cores and 4gb memory as well as an
>> Intel box with 4 single-core cpus and 8Mb. I've also
>> successfully booted it in a simulated 2cpus/1Gb environment.
>>
>> Based on:
>> git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux-2.6.git
>> + x86/latest .../x86/linux-2.6-x86.git
>> + sched-devel/latest .../mingo/linux-2.6-sched-devel.git
>>
>> Signed-off-by: Mike Travis
>
> got

Hi Yinghai,

Thanks for the feedback!

Would you send me your config file and other details (CPU type, memory
size, etc.)? I'll then attempt to reproduce the failure. (My problem is
that only the AMD box is a real "workstation"; the Intel box is a dual
quad-cpu server, so it's really deficient in I/O.)

Thanks,
Mike

>
> ------------[ cut here ]------------
> WARNING: at kernel/sched_fair.c:815 hrtick_start_fair+0x69/0x156()
> Modules linked in:
> Pid: 1, comm: swapper Not tainted
> 2.6.25-rc8-x86-latest.git-smp-01033-ga39ae31-dirty #77
>
> Call Trace:
> [] warn_on_slowpath+0x67/0x8e
> [] hrtick_start_fair+0x69/0x156
> [] ? dequeue_entity+0x2a/0xf8
> [] dequeue_task_fair+0x5f/0x7e
> [] dequeue_task+0x22/0x44
> [] deactivate_task+0x39/0x69
> [] schedule+0x1b9/0x5c5
> [] ? autoremove_wake_function+0x20/0x5e
> [] schedule_timeout+0x31/0xd7
> [] ? __wake_up+0x52/0x75
> [] wait_for_common+0x103/0x189
> [] ? default_wake_function+0x0/0x36
> [] wait_for_completion+0x2b/0x41
> [] call_usermodehelper_exec+0x87/0xe5
> [] kobject_uevent_env+0x3d0/0x424
> [] kobject_uevent+0x1e/0x34
> [] device_add+0x2f9/0x494
> [] device_register+0x28/0x43
> [] pcie_port_device_register+0x3f1/0x43e
> [] ? pcibios_set_master+0x8d/0xa8
> [] pcie_portdrv_probe+0x79/0xbb
> [] pci_call_probe+0xe5/0x146
> [] pci_device_probe+0x64/0xa2
> [] driver_probe_device+0xcf/0x16d
> [] ? sysfs_addrm_finish+0x2f/0x22b
> [] ? __driver_attach+0x0/0xbe
> [] __driver_attach+0x6e/0xbe
> [] bus_for_each_dev+0x5e/0xa2
> [] driver_attach+0x2f/0x45
> [] bus_add_driver+0xc6/0x226
> [] ? bus_put+0x29/0x3f
> [] driver_register+0x6d/0xfc
> [] __pci_register_driver+0x62/0xb0
> [] pcie_portdrv_init+0x4a/0x72
> [] kernel_init+0x1b4/0x340
> [] child_rip+0xa/0x12
> [] ? kernel_init+0x0/0x340
> [] ? child_rip+0x0/0x12
>
> ---[ end trace e26645195698f5cf ]---
> BUG: unable to handle kernel NULL pointer dereference at 0000000000000148
> IP: [] pick_next_task_fair+0x7c/0xbb
> PGD 0
> Oops: 0000 [1] SMP
> CPU 28
> Modules linked in:
> Pid: 1, comm: swapper Not tainted
> 2.6.25-rc8-x86-latest.git-smp-01033-ga39ae31-dirty #77
> RIP: 0010:[] []
> pick_next_task_fair+0x7c/0xbb
> RSP: 0018:ffff81081cc5cd70 EFLAGS: 00010046
> RAX: 0000000000000000 RBX: ffff81383c21a280 RCX: 0000000000000000
> RDX: ffff81383c224080 RSI: ffff81383c224080 RDI: 0000000063e15417
> RBP: ffff81081cc5cda0 R08: 0000000000000000 R09: ffff81383c224108
> R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000000
> R13: ffff81383c224080 R14: ffff81383c224080 R15: 000000000000001c
> FS: 0000000000000000(0000) GS:ffff81401cc3c600(0000) knlGS:0000000000000000
> CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> CR2: 0000000000000148 CR3: 0000000000201000 CR4: 00000000000006e0
> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> Process swapper (pid: 1, threadinfo ffff81081cc5c000, task ffff81401cc52000)
> Stack: ffff81081cc5cda0 0000000063e15417 ffffffff80a81840 0000000000000000
> 00000000fffeecfd ffff81383c224080 ffff81081cc5ce70 ffffffff80a57d28
> ffff81081cc5ce00 ffff81081cc5ce20 ffffffff81963080 ffffffff81963080
> Call Trace:
> [] schedule+0x2b0/0x5c5
> [] ? autoremove_wake_function+0x20/0x5e
> [] schedule_timeout+0x31/0xd7
> [] ? __wake_up+0x52/0x75
> [] wait_for_common+0x103/0x189
> [] ? default_wake_function+0x0/0x36
> [] wait_for_completion+0x2b/0x41
> [] call_usermodehelper_exec+0x87/0xe5
> [] kobject_uevent_env+0x3d0/0x424
> [] kobject_uevent+0x1e/0x34
> [] device_add+0x2f9/0x494
> [] device_register+0x28/0x43
> [] pcie_port_device_register+0x3f1/0x43e
> [] ? pcibios_set_master+0x8d/0xa8
> [] pcie_portdrv_probe+0x79/0xbb
> [] pci_call_probe+0xe5/0x146
> [] pci_device_probe+0x64/0xa2
> [] driver_probe_device+0xcf/0x16d
> [] ? sysfs_addrm_finish+0x2f/0x22b
> [] ? __driver_attach+0x0/0xbe
> [] __driver_attach+0x6e/0xbe
> [] bus_for_each_dev+0x5e/0xa2
> [] driver_attach+0x2f/0x45
> [] bus_add_driver+0xc6/0x226
> [] ? bus_put+0x29/0x3f
> [] driver_register+0x6d/0xfc
> [] __pci_register_driver+0x62/0xb0
> [] pcie_portdrv_init+0x4a/0x72
> [] kernel_init+0x1b4/0x340
> [] child_rip+0xa/0x12
> [] ? kernel_init+0x0/0x340
> [] ? child_rip+0x0/0x12
>
>
> Code: 24 40 78 1c 8b 3d 36 05 b3 00 48 89 da be 00 04 00 00 e8 7a eb
> ff ff 49 39 c6 7f 04 4c 8b 63 48 4c 89 e6 4
> 8 89 df e8 29 f1 ff ff <49> 8b 9c 24 48 01 00 00 48 85 db 75 a5 49 8d
> 5c 24 c8 4c 89 ef
> RIP [] pick_next_task_fair+0x7c/0xbb
> RSP
> CR2: 0000000000000148
> ---[ end trace e26645195698f5cf ]---
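
As a reading aid for the oops quoted above, the faulting instruction can be decoded by hand from the "Code:" line, where the byte marked <49> is the start of the instruction at the reported RIP. The sketch below is not from the original thread. It is a minimal decoder that handles only this one x86-64 encoding (REX.W, opcode 8B, a SIB byte with no index register, and a 32-bit displacement). The helper name decode_mov_load and the constants R12 and CR2 are illustrative, with the values copied from the register dump.

```python
import struct

# Eight bytes starting at the <49> marker in the "Code:" line of the oops.
FAULTING = bytes.fromhex("49 8b 9c 24 48 01 00 00")

# Values copied from the register dump in the oops.
R12 = 0x0000000000000000
CR2 = 0x0000000000000148

REGS = ["rax", "rcx", "rdx", "rbx", "rsp", "rbp", "rsi", "rdi",
        "r8", "r9", "r10", "r11", "r12", "r13", "r14", "r15"]


def decode_mov_load(code):
    """Decode only 'REX.W 8B /r' with a SIB byte, no index, and a disp32.

    Returns (destination register, base register, displacement).
    This is a hypothetical helper, not a general x86 decoder.
    """
    rex, opcode, modrm, sib = code[0], code[1], code[2], code[3]
    assert rex & 0xF0 == 0x40 and rex & 0x08, "expected a REX.W prefix"
    assert opcode == 0x8B, "expected MOV r64, r/m64"
    mod, reg, rm = modrm >> 6, (modrm >> 3) & 7, modrm & 7
    assert mod == 2 and rm == 4, "expected disp32 addressing with a SIB byte"
    index, base = (sib >> 3) & 7, sib & 7
    assert index == 4 and not (rex & 0x02), "expected no index register"
    dest = REGS[reg | ((rex & 0x04) << 1)]      # REX.R extends the reg field
    base_reg = REGS[base | ((rex & 0x01) << 3)]  # REX.B extends the base field
    (disp,) = struct.unpack_from("<i", code, 4)  # little-endian signed disp32
    return dest, base_reg, disp


dest, base_reg, disp = decode_mov_load(FAULTING)
print(f"mov {disp:#x}(%{base_reg}),%{dest}")
print(f"effective address with R12={R12:#x}: {R12 + disp:#x} (CR2={CR2:#x})")
```

Under these assumptions the instruction is a load through R12 at offset 0x148, and R12 is zero in the dump, which matches the reported fault address in CR2. That suggests a NULL structure pointer being dereferenced in pick_next_task_fair, but which field sits at that offset cannot be determined from the trace alone.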