From mboxrd@z Thu Jan 1 00:00:00 1970 From: Chuck Lever Subject: Re: nfsd restart attempt ended in nfsd hang Date: Thu, 22 Apr 2010 11:51:04 -0400 Message-ID: <4BD07068.2000700@oracle.com> References: Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII; format=flowed Cc: linux-nfs@vger.kernel.org To: Jan Engelhardt Return-path: Received: from rcsinet10.oracle.com ([148.87.113.121]:31230 "EHLO rcsinet10.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755215Ab0DVPvQ (ORCPT ); Thu, 22 Apr 2010 11:51:16 -0400 In-Reply-To: Sender: linux-nfs-owner@vger.kernel.org List-ID: On 04/22/2010 05:06 AM, Jan Engelhardt wrote: > Hi, > > > I am observing this hanging nfsd kernel thread: > > Linux 2.6.33.2 > 0404] nfsd D ffff88033b29d100 0 26090 2 0x00000000 > 0410] ffff880191b45c60 0000000000000046 0000000000000010 ffff880191b45fd8 > 0415] ffff880101e943c0 ffff880191b45fd8 0000000000012300 0000000000012300 > 0420] 0000000000012300 0000000000012300 0000000000012300 ffff880191b45fd8 > 0426] Call Trace: > 0442] [] rpc_wait_bit_killable+0x30/0x34 [sunrpc] > 0459] [] __wait_on_bit+0x3e/0x6f > 0465] [] out_of_line_wait_on_bit+0x6e/0x77 > 0479] [] __rpc_execute+0xf9/0x1b1 [sunrpc] > 0501] [] rpc_run_task+0x4f/0x57 [sunrpc] > 0515] [] rpc_call_sync+0x3d/0x5a [sunrpc] > 0535] [] rpcb_register_call+0x18/0x4b [sunrpc] > 0574] [] rpcb_v4_register+0xb4/0x16c [sunrpc] > 0612] [] svc_unregister+0x53/0xe1 [sunrpc] > 0643] [] svc_destroy+0x10b/0x123 [sunrpc] > 0667] [] nfsd+0x117/0x131 [nfsd] > 0674] [] kthread+0x75/0x7d > 0680] [] kernel_thread_helper+0x4/0x10 > > Is it going to timeout, if so when? I saw this sort of hang at Connectathon, but didn't have a chance to track it down. To verify, rpcbind is still running when the hang occurs? Or are you running a distribution that uses portmap instead? Is there any other loopback activity on this system, like a mount, or a misconfigured NFSv4 callback? -- chuck[dot]lever[at]oracle[dot]com