From: David Flynn <davidf@rd.bbc.co.uk>
To: "Myklebust, Trond" <Trond.Myklebust@netapp.com>
Cc: Chuck Lever <chuck.lever@oracle.com>,
	"J. Bruce Fields" <bfields@fieldses.org>,
	David Flynn <davidf@rd.bbc.co.uk>,
	linux-nfs@vger.kernel.org
Subject: Re: NFS4ERR_STALE_CLIENTID loop
Date: Sat, 29 Oct 2011 19:52:49 +0000
Message-ID: <20111029195249.GE2011@rd.bbc.co.uk>
In-Reply-To: <2E1EB2CF9ED1CB4AA966F0EB76EAB4430BDE7479@SACMVEXC2-PRD.hq.netapp.com>

* Myklebust, Trond (Trond.Myklebust@netapp.com) wrote:
> > -----Original Message-----
> > From: Chuck Lever [mailto:chuck.lever@oracle.com]
> > On Oct 29, 2011, at 2:47 PM, J. Bruce Fields wrote:
> > > Yes, and it's not something I care that strongly about, really, my
> > > only observation is that this sort of failure (an implementation
> > > bug on one side or another resulting in a loop) seems to have been
> > > common (based on no hard data, just my vague memories of list
> > > threads), and the results fairly obnoxious (possibly even for
> > > unrelated hosts on the network).
> > > So if there's some simple way to fail more gracefully it might be
> > > helpful.
> >
> > For what it's worth, I agree that client implementations should
> > attempt to behave more gracefully in the face of server problems, be
> > they the result of bugs or the result of other issues specific to
> > that server.  Problems like this make NFSv4 as a protocol look bad.
> 
> I can't see what a client can do in this situation except possibly just
> give up after a while and throw a SERVER_BROKEN error (which means data
> loss). That still won't make NFSv4 look good...

Indeed, it is quite the dilemma.

I agree that giving up and guaranteeing unattended data loss is bad (data
loss at the behest of an operator is ok; after all, they can always fence
a broken machine).

Looking at some of the logs again, even going back to the very original
case, it appears to be about 600us between retries (RTT=400us).  Is
there any way to make that less aggressive, e.g. 1s?  That would reduce the
impact by roughly three orders of magnitude.  What would be the downside?  How
often do you expect to get a BAD_STATEID error?
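
To put rough numbers on that, here is a minimal Python sketch (not taken
from any client code) comparing the observed ~600us retry interval with a
fixed 1s delay.  It also shows a capped exponential backoff as one possible
middle ground.  The function names and the backoff parameters (1ms initial
delay, 1s cap) are illustrative assumptions, not existing NFS client
behaviour or tunables.

```python
import math


def retry_rate(interval_s):
    """Requests per second sent by a client retrying at a fixed interval."""
    return 1.0 / interval_s


def reduction_factor(old_interval_s, new_interval_s):
    """How many times fewer retries are sent after lengthening the interval."""
    return new_interval_s / old_interval_s


def backoff_delays(initial_s, cap_s, attempts):
    """Hypothetical capped exponential backoff: double each delay up to cap_s."""
    delays = []
    delay = initial_s
    for _ in range(attempts):
        delays.append(min(delay, cap_s))
        delay *= 2
    return delays


if __name__ == "__main__":
    observed = 600e-6   # ~600us between retries, as seen in the logs
    proposed = 1.0      # the 1s interval suggested above

    factor = reduction_factor(observed, proposed)
    print("observed rate: %.0f retries/s" % retry_rate(observed))
    print("proposed rate: %.0f retries/s" % retry_rate(proposed))
    print("reduction: %.0fx (10^%.1f)" % (factor, math.log10(factor)))

    # Assumed parameters: start at 1ms, never wait longer than 1s.
    print("backoff delays (s):", backoff_delays(0.001, 1.0, 12))
```

With the backoff variant, the first few retries still go out quickly, so a
transient error recovers about as fast as today, while a persistent loop
settles at one retry per second.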

..david


Thread overview: 28+ messages
2011-10-24 10:40 NFS4 BAD_STATEID loop (kernel 3.0) David Flynn
2011-10-24 11:22 ` Trond Myklebust
2011-10-24 13:17   ` David Flynn
2011-10-24 13:32     ` Trond Myklebust
2011-10-24 14:50       ` NFS4 BAD_STATEID loop (kernel 3.0.4) David Flynn
2011-10-24 15:31         ` Trond Myklebust
2011-10-24 15:55           ` David Flynn
2011-10-27 22:17           ` David Flynn
2011-10-29  0:25             ` NFS4ERR_STALE_CLIENTID loop David Flynn
2011-10-29 17:29               ` Trond Myklebust
2011-10-29 18:02                 ` David Flynn
2011-10-29 18:22                   ` Myklebust, Trond
2011-10-29 18:23                     ` Chuck Lever
2011-10-29 18:26                       ` Myklebust, Trond
2011-10-29 18:29                         ` David Flynn
2011-10-29 18:15                 ` J. Bruce Fields
2011-10-29 18:21                   ` Myklebust, Trond
2011-10-29 18:47                     ` J. Bruce Fields
2011-10-29 18:50                       ` Chuck Lever
2011-10-29 19:19                         ` Myklebust, Trond
2011-10-29 19:52                           ` David Flynn [this message]
2011-10-29 20:42                             ` Myklebust, Trond
2011-10-29 21:07                               ` David Flynn
2011-10-29 21:12                                 ` Myklebust, Trond
2011-10-31 13:07                             ` Chuck Lever
2011-10-31 13:21                               ` David Flynn
2011-10-31 13:39                                 ` Chuck Lever
2011-10-24 13:43 ` NFS4 BAD_STATEID loop (kernel 3.0) Chuck Lever
