Linux NFS development
 help / color / mirror / Atom feed
From: "J. Bruce Fields" <bfields@fieldses.org>
To: Andrew W Elble <aweits@rit.edu>
Cc: linux-nfs@vger.kernel.org
Subject: Re: [PATCH RFC v2] nfsd: don't revoke delegations that a client has stated it doesn't have
Date: Tue, 20 Oct 2015 17:10:17 -0400	[thread overview]
Message-ID: <20151020211017.GA22828@fieldses.org> (raw)
In-Reply-To: <m2twplxuc8.fsf@discipline.rit.edu>

On Tue, Oct 20, 2015 at 02:34:15PM -0400, Andrew W Elble wrote:
> 
> > Are clients really allowed to just lose a delegation?  (Have you seen
> > such a case, other than the duplicate-delegation case which you already
> > fixed?)
> 
> In short, yes, we're still seeing it. We have also been seeing increasing
> stability from the work that has been done (which also increases the
> time between replication).
> 
> The reason for v2 was I got coverage on the destruction path in
> testing and discovered my mistake in v1. This one is extremely
> frustrating to chase down (I've been 14+ hours deep in
> packet captures to try and find the allocation to the recall - I
> keep running out of disk space).

Ugh.  Maybe patching in some well-chosen printk's could help.

Or doing some filtering as you capture.   You could capture just
operations that might have a delegation stateid--OPEN, DELEGRETURN,
CB_RECALL.  And probably SEQUENCE replies with any flags set.

> > Another thing we could do here is finally implement the server-side
> > support for referring triples (I think the client's done?):
> >
> > 	http://wiki.linux-nfs.org/wiki/index.php/Server_4.0_and_4.1_issues#Referring_triples
> > 	https://tools.ietf.org/html/rfc5661#section-2.10.6.3
> >
> > That would eliminate the need for the recall retries.
> >
> > Though that would still leave open the question of how to handle those
> > errors on a recall.  We still not be able to conclude that it's safe for
> > the server to destroy the delegation.
> 
> Would this be more appropriately "fixed" by supporting DELEGPURGE in a
> limited fashion to clear out cl_revoked? (I'm not quite sure
> that's a valid interpretation of RFC5561) to clear out any "lost"
> delegations?

I haven't thought it through, but that's clearly not what's intended
for, so I'm pessimistic.  Note also we'd need to implement it first.

There may be other ways the client could better recover, but we'd still
rather avoid such situations in the first place.  The recovery logic is
already complicated enough, it would be worse if it also needed to
handle a lot of cases that could only occur due to outright bugs.

--b.

      reply	other threads:[~2015-10-20 21:10 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-10-20 14:21 [PATCH RFC v2] nfsd: don't revoke delegations that a client has stated it doesn't have Andrew Elble
2015-10-20 17:29 ` J. Bruce Fields
2015-10-20 18:34   ` Andrew W Elble
2015-10-20 21:10     ` J. Bruce Fields [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20151020211017.GA22828@fieldses.org \
    --to=bfields@fieldses.org \
    --cc=aweits@rit.edu \
    --cc=linux-nfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox