From mboxrd@z Thu Jan 1 00:00:00 1970 From: Gertjan Oude Lohuis Subject: Re: Kernel (2.6.24) crash on nfsd (BUG: soft lockup) Date: Thu, 28 Feb 2008 12:08:52 +0100 Message-ID: <47C69644.7010606@byte.nl> References: <47C434D2.80601@byte.nl> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="------------090806010409000803040207" Cc: rune.nilssen-FJFKJQU35qU@public.gmane.org To: linux-nfs@vger.kernel.org Return-path: Received: from gw.c1.byte.nl ([82.94.214.64]:47003 "EHLO smtp.byte.nl" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1759629AbYB1LIz (ORCPT ); Thu, 28 Feb 2008 06:08:55 -0500 In-Reply-To: <47C434D2.80601-DW70C6hi67U@public.gmane.org> Sender: linux-nfs-owner@vger.kernel.org List-ID: This is a multi-part message in MIME format. --------------090806010409000803040207 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Over the past few days, our fileserver has crashed with this bug a couple of times. We downgraded the kernel to 2.6.23.17 last night, but about an hour ago the machine crashed again, this time with a sligthly different stacktrace (attached). This is driving us nuts: kernel 2.6.17 to 2.6.22 are not possible, because of the lockd issue, and 2.6.23 and 2.6.24 are not possible because it crashes even more often. I noticed that Rune Nilssen reported the same issue a few days ago to this list, but he too hasn't received any response yet (http://article.gmane.org/gmane.linux.nfs/19105). Any more people here that suffer from this issue? Can I get/give more information to make debugging easier? -- Met vriendelijke groet, Gertjan Oude Lohuis Byte Internet W www.byte.nl E support-DW70C6hi67U@public.gmane.org F 020 6255 922 --------------090806010409000803040207 Content-Type: text/plain; name="stacktrace3.txt" Content-Transfer-Encoding: 7bit Content-Disposition: inline; filename="stacktrace3.txt" BUG: soft lockup - CPU#3 stuck for 11s! [nfsd:2643] Pid: 2643, comm: nfsd EIP: 0060:[] CPU: 3 EIP is at __generic_file_splice_read+0x12c/0x418 EFLAGS: 00000206 Not tainted (2.6.23.17-fwsh-byte #3) EAX: f6e9dddc EBX: 00001000 ECX: 00000001 EDX: 00000000 ESI: 00000000 EDI: f6e9dcd0 EBP: 00000095 DS: 007b ES: 007b FS: 00d8 CR0: 8005003b CR2: b7e72cc0 CR3: 00622000 CR4: 000006f0 DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000 DR6: ffff0ff0 DR7: 00000400 [] __check_preempt_curr_fair+0x4b/0x7d [] entity_tick+0x47/0x54 [] getnstimeofday+0x37/0x111 [] clockevents_program_event+0xac/0xcc [] run_timer_softirq+0x30/0x184 [] hrtimer_interrupt+0x132/0x1c4 [] __do_softirq+0xba/0xcf [] smp_apic_timer_interrupt+0x2c/0x35 [] apic_timer_interrupt+0x28/0x30 [] generic_file_splice_read+0x81/0xd5 [] do_splice_to+0x75/0x97 [] splice_direct_to_actor+0x9f/0x166 [] nfsd_acceptable+0x0/0xd1 [nfsd] [] nfsd_direct_splice_actor+0x0/0xa [nfsd] [] nfsd_vfs_read+0x399/0x3bd [nfsd] [] dentry_open+0x34/0x64 [] nfsd_read+0xee/0xfb [nfsd] [] nfsd3_proc_read+0xfe/0x186 [nfsd] [] nfs3svc_decode_readargs+0x0/0xeb [nfsd] [] nfsd_dispatch+0xc5/0x1ca [nfsd] [] svcauth_unix_set_client+0x116/0x165 [] svc_process+0x4fb/0x6d4 [] default_wake_function+0x0/0xc [] nfsd+0x16a/0x282 [nfsd] [] nfsd+0x0/0x282 [nfsd] [] kernel_thread_helper+0x7/0x10 ======================= Pid: 2643, comm: nfsd EIP: 0060:[] CPU: 3 EIP is at __generic_file_splice_read+0xdf/0x418 EFLAGS: 00000206 Not tainted (2.6.23.17-fwsh-byte #3) EAX: 00000095 EBX: f6e9de50 ECX: 00000001 EDX: 00000000 ESI: 00000001 EDI: f6e9dcd0 EBP: 00000096 DS: 007b ES: 007b FS: 00d8 CR0: 8005003b CR2: b7e72cc0 CR3: 00622000 CR4: 000006f0 DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000 DR6: ffff0ff0 DR7: 00000400 [] __check_preempt_curr_fair+0x4b/0x7d [] entity_tick+0x47/0x54 [] getnstimeofday+0x37/0x111 [] clockevents_program_event+0xac/0xcc [] run_timer_softirq+0x30/0x184 [] hrtimer_interrupt+0x132/0x1c4 [] __do_softirq+0xba/0xcf [] smp_apic_timer_interrupt+0x2c/0x35 [] apic_timer_interrupt+0x28/0x30 [] find_inode_fast+0x26/0x46 [] generic_file_splice_read+0x81/0xd5 [] do_splice_to+0x75/0x97 [] splice_direct_to_actor+0x9f/0x166 [] nfsd_acceptable+0x0/0xd1 [nfsd] [] nfsd_direct_splice_actor+0x0/0xa [nfsd] [] nfsd_vfs_read+0x399/0x3bd [nfsd] [] dentry_open+0x34/0x64 [] nfsd_read+0xee/0xfb [nfsd] [] nfsd3_proc_read+0xfe/0x186 [nfsd] [] nfs3svc_decode_readargs+0x0/0xeb [nfsd] [] nfsd_dispatch+0xc5/0x1ca [nfsd] [] svcauth_unix_set_client+0x116/0x165 [] svc_process+0x4fb/0x6d4 [] default_wake_function+0x0/0xc [] nfsd+0x16a/0x282 [nfsd] [] nfsd+0x0/0x282 [nfsd] [] kernel_thread_helper+0x7/0x10 ======================= --------------090806010409000803040207--