linux-nfs.vger.kernel.org archive mirror
From: Colin Hudler <chudler@cs.uchicago.edu>
To: "J. Bruce Fields" <bfields@fieldses.org>, linux-nfs@vger.kernel.org
Subject: Re: kernel not recovering from statd port change
Date: Thu, 04 Sep 2014 15:42:30 -0500
Message-ID: <5408CEB6.7040000@cs.uchicago.edu>
In-Reply-To: <20140821213421.GA5474@fieldses.org>

I've been debugging the same thing on an Ubuntu 12.04 server running 
3.8.0-44, and ended up in the same place you are.  Did you find out 
anything more? I have carefully inserted an rpc_force_rebind() call near
nlm_client_get, but I don't think it is a good general fix.
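
To make the failure mode concrete, here is a toy userspace model of a
cached port versus a rebind on failure. It is only a sketch and is not
the sunrpc code: every name in it (toy_rpcbind, toy_clnt, toy_call and
the helpers) is hypothetical, and the "autobind" field merely stands in
for the behaviour that RPC_CLNT_CREATE_AUTOBIND or a forced rebind is
expected to give.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy stand-in for rpcbind: the port statd is currently registered on. */
struct toy_rpcbind {
	int statd_port;
	int lookups;		/* how many times the client asked */
};

/* Toy stand-in for a cached RPC client (hypothetical, not struct rpc_clnt). */
struct toy_clnt {
	int cached_port;	/* 0 means "not bound yet" */
	bool autobind;		/* forget the port and look it up again on failure */
};

/* Ask the toy rpcbind where statd lives right now. */
static int toy_rpcbind_lookup(struct toy_rpcbind *rb)
{
	rb->lookups++;
	return rb->statd_port;
}

/* A send succeeds only if it targets the port statd is listening on. */
static bool toy_send(const struct toy_rpcbind *rb, int port)
{
	return port != 0 && port == rb->statd_port;
}

/*
 * Make one call. Without autobind, a stale cached port fails for good.
 * With autobind, the cached port is dropped and looked up once more.
 */
bool toy_call(struct toy_clnt *c, struct toy_rpcbind *rb)
{
	assert(c != NULL && rb != NULL);

	for (int attempt = 0; attempt < 2; attempt++) {
		if (c->cached_port == 0)
			c->cached_port = toy_rpcbind_lookup(rb);
		if (toy_send(rb, c->cached_port))
			return true;
		if (!c->autobind)
			return false;
		c->cached_port = 0;	/* the "force rebind" step */
	}
	return false;
}
```

In this model, a client created with autobind = false keeps failing
after statd_port changes, while one with autobind = true recovers after
a single extra lookup.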

On production servers, I am starting rpc.statd with a fixed
"--port #####", which does seem to avoid the problem. NLM (as opposed
to NSM) apparently doesn't suffer from it.

One thing that puzzles me is that, of several hundred NFS clients, only
a handful have a problem getting a lock. The problem clients are running
3.2 and 2.6.26. The unaffected clients are mostly 3.8. This is all NFSv3.

On 08/21/2014 04:34 PM, J. Bruce Fields wrote:
> While testing server restart somebody noticed that knfsd can't recover
> from statd restarting with a new port.
>
>  From only a very quick skim of the code it looked like creating the nsm
> client with RPC_CLNT_CREATE_AUTOBIND should cause us to call rpcbind
> again on connection failures, but that doesn't seem to be working.
>
> Any ideas?  I'll keep looking....
>
> --b.
>
> commit 2c9fb5570fe2
> Author: J. Bruce Fields <bfields@redhat.com>
> Date:   Wed Aug 20 17:21:32 2014 -0400
>
>      lockd: allow rebinding to statd
>
>      During normal operation statd isn't restarted, but it may be if, for
>      example, the server is shut down and restarted to simulate a shutdown or
>      perform some kind of failover.  In that case the kernel may need to
>      query rpcbind again to get statd's new port number.
>
>      Symptoms were locking failures after a manual server restart (without
>      rebooting the machine), and loopback network traces showing the new
>      kernel nfsd attempting to contact statd at its old port number.
>
>      This was probably introduced by cb7323fffa85, which first allowed
>      reusing the statd rpc client, but it looks like a reference count may
>      typically have prevented any symptoms until e498daa81295 "LOCKD: Clear
>      ln->nsm_clnt only when ln->nsm_users is zero".
>
>      Fixes: cb7323fffa85 "lockd: create and use per-net NSM RPC clients on MON/UNMON requests"
>      Signed-off-by: J. Bruce Fields <bfields@redhat.com>
>
> diff --git a/fs/lockd/mon.c b/fs/lockd/mon.c
> index 1812f026960c..3bce1d318435 100644
> --- a/fs/lockd/mon.c
> +++ b/fs/lockd/mon.c
> @@ -80,7 +80,8 @@ static struct rpc_clnt *nsm_create(struct net *net)
>   		.program		= &nsm_program,
>   		.version		= NSM_VERSION,
>   		.authflavor		= RPC_AUTH_NULL,
> -		.flags			= RPC_CLNT_CREATE_NOPING,
> +		.flags			= RPC_CLNT_CREATE_NOPING|
> +			                  RPC_CLNT_CREATE_AUTOBIND,
>   	};
>
>   	return rpc_create(&args);
> --
> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>

Thread overview: 3+ messages
2014-08-21 21:34 kernel not recovering from statd port change J. Bruce Fields
2014-09-04 20:42 ` Colin Hudler [this message]
2014-09-04 21:01 ` Trond Myklebust
