From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S265383AbUHWPci (ORCPT ); Mon, 23 Aug 2004 11:32:38 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S265354AbUHWPcB (ORCPT ); Mon, 23 Aug 2004 11:32:01 -0400 Received: from mx1.redhat.com ([66.187.233.31]:1165 "EHLO mx1.redhat.com") by vger.kernel.org with ESMTP id S265144AbUHWP3l (ORCPT ); Mon, 23 Aug 2004 11:29:41 -0400 From: David Howells In-Reply-To: <200408221620.00692.rjw@sisk.pl> References: <200408221620.00692.rjw@sisk.pl> <20040822013402.5917b991.akpm@osdl.org> To: "R. J. Wysocki" Cc: Andrew Morton , linux-kernel@vger.kernel.org Subject: Re: 2.6.8.1-mm4 (strange behavior on dual Opteron w/ NUMA) User-Agent: EMH/1.14.1 SEMI/1.14.5 (Awara-Onsen) FLIM/1.14.5 (Demachiyanagi) APEL/10.6 Emacs/21.3 (i386-redhat-linux-gnu) MULE/5.0 (SAKAKI) MIME-Version: 1.0 (generated by SEMI 1.14.5 - "Awara-Onsen") Content-Type: text/plain; charset=US-ASCII Date: Mon, 23 Aug 2004 16:29:33 +0100 Message-ID: <798.1093274973@redhat.com> Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Rafael J. Wysocki wrote: > It has the same problems that I've reported for 2.6.8.1-mm3: > > 1) ALT-SysRq- does not work, although "echo > > /proc/sysrq-trigger" does (may be specific to x86-64). I'm seeing this on my Dual PPro testbox too. I'm running 2.6.8.1-mm4 using the i386 arch. I get an oops (which I've attached) from SysRq+B and from normal reboot. > 2) After issuing: > > # rmmod snd_seq_oss > > the kernel goes into a strange state: And that too; except that I'm seeing it with NFS and modules of my own devising. I've attached an excerpt of a SysRq trace of this. Note that it doesn't start executing the module exit function as far as I can tell. Both these problems are 100% reproducible. David =========== REBOOT OOPS =========== nfsd: last server has exited nfsd: unexporting all filesystems Restarting system. Unable to handle kernel paging request at virtual address c041a3e0 printing eip: c041a3e0 *pde = 00463027 *pte = 0041a000 Oops: 0000 [#1] SMP DEBUG_PAGEALLOC Modules linked in: nfs cachefs CPU: 0 EIP: 0060:[] Not tainted VLI EFLAGS: 00010282 (2.6.8.1-mm4) EIP is at find_isa_irq_pin+0x0/0x70 eax: 00000003 ebx: 00000000 ecx: 000100fe edx: 00000000 esi: 01234567 edi: c4d6c000 ebp: c4d6c000 esp: c4d6de60 ds: 007b es: 007b ss: 0068 Process reboot (pid: 2751, threadinfo=c4d6c000 task=c5527a40) Stack: c011227f 00000000 00000003 00000000 00000001 00000000 c010ebc8 00000000 c4d6c000 c010ebe9 01234567 c4d6c000 c011c167 c0351071 00000000 c012ba1d 00000000 00000001 00000000 c10bb7c0 00000001 00000000 c01452ef c10bb7c0 Call Trace: [] disable_IO_APIC+0x1f/0x140 [] machine_shutdown+0x48/0x60 [] machine_restart+0x9/0x90 [] printk+0x17/0x20 [] sys_reboot+0x19d/0x400 [] cache_free_debugcheck+0x17f/0x2b0 [] sock_destroy_inode+0x1b/0x20 [] sock_destroy_inode+0x1b/0x20 [] destroy_inode+0x35/0x60 [] generic_forget_inode+0x14e/0x1c0 [] cache_free_debugcheck+0x17f/0x2b0 [] iput+0x63/0x90 [] dput+0x33/0x340 [] __fput+0xa8/0x110 [] __fput+0xb0/0x110 [] filp_close+0x59/0x90 [] sys_close+0x7f/0x100 [] do_page_fault+0x0/0x5c0 [] syscall_call+0x7/0xb Code: Bad EIP value. Badness in send_IPI_mask_bitmask at arch/i386/kernel/smp.c:167 [] send_IPI_mask_bitmask+0x74/0x80 [] smp_send_reschedule+0x1f/0x30 [] try_to_wake_up+0x2b0/0x2e0 [] __wake_up_common+0x41/0x70 [] __wake_up+0x4b/0xb0 [] __queue_work+0x5f/0xc0 [] queue_work+0x55/0x70 [] call_usermodehelper+0xc9/0xd6 [] __call_usermodehelper+0x0/0x70 [] sprintf+0x1f/0x30 [] kset_hotplug+0x1f6/0x290 [] kobject_hotplug+0x67/0x70 [] kobject_del+0x1b/0x40 [] class_device_del+0x99/0xc0 [] class_device_unregister+0x13/0x30 [] class_simple_device_remove+0xa3/0x127 [] vcs_remove_devfs+0x1c/0x39 [] con_close+0x85/0x90 [] release_dev+0x6ad/0x6d0 [] cache_free_debugcheck+0x131/0x2b0 [] remove_vm_struct+0x8a/0xd0 [] tty_release+0x45/0xc0 [] flush_tlb_mm+0x49/0xa0 [] __fput+0xfe/0x110 [] filp_close+0x59/0x90 [] put_files_struct+0x64/0xd0 [] do_exit+0x1f0/0x590 [] sys_read+0x51/0x80 [] sys_exit+0x13/0x20 [] syscall_call+0x7/0xb ===================== RMMOD SYSRQ+T EXCERPT ===================== rmmod D C0341C03 0 2010 1932 (NOTLB) c4737e80 00000082 00000002 c0341c03 c4737ec4 c012e505 c4737e64 0dfec5b1 00000007 3e5a438a 00015b43 c41fddb0 15b433e8 00000000 c1104f60 c3c72bec c01150a5 c3d95a40 c11054ac c4737f04 c4737f08 c4737ec0 c4737ee8 c0341c03 Call Trace: [] wait_for_completion+0xb3/0x1a0 [] queue_work+0x55/0x70 [] activate_task+0xf5/0x120 [] wait_for_completion+0xb3/0x1a0 [] default_wake_function+0x0/0x20 [] default_wake_function+0x0/0x20 [] wake_up_process+0x1d/0x30 [] __stop_machine_run+0x9c/0xc0 [] __try_stop_module+0x0/0x41 [] stop_machine_run+0x1f/0x38 [] __try_stop_module+0x0/0x41 [] try_stop_module+0x38/0x40 [] __try_stop_module+0x0/0x41 [] sys_delete_module+0x122/0x180 [] __do_softirq+0xba/0xd0 [] smp_apic_timer_interrupt+0x8d/0x100 [] system_call+0x0/0x2c [] syscall_call+0x7/0xb kstopmachine R running 0 2011 6 2012 36 (L-TLB) kstopmachine R running 0 2012 2011 (L-TLB)