From: Jeff Layton <jlayton@redhat.com>
To: Benjamin Coddington <bcodding@redhat.com>
Cc: List Linux NFS Mailing <linux-nfs@vger.kernel.org>
Subject: Re: CLOSE/OPEN race
Date: Sat, 12 Nov 2016 16:16:00 -0500 [thread overview]
Message-ID: <1478985360.2442.29.camel@redhat.com> (raw)
In-Reply-To: <98C04570-5E22-4F6D-80AF-FA6EE48ED489@redhat.com>
On Sat, 2016-11-12 at 13:03 -0500, Benjamin Coddington wrote:
>
> On 12 Nov 2016, at 11:52, Jeff Layton wrote:
>
> >
> > On Sat, 2016-11-12 at 10:31 -0500, Benjamin Coddington wrote:
> > >
> > > On 12 Nov 2016, at 7:54, Jeff Layton wrote:
> > >
> > > >
> > > >
> > > > On Sat, 2016-11-12 at 06:08 -0500, Benjamin Coddington wrote:
> > > > >
> > > > >
> > > > > I've been seeing the following on a modified version of generic/089
> > > > > that gets the client stuck sending LOCK with NFS4ERR_OLD_STATEID.
> > > > >
> > > > > 1. Client has open stateid A, sends a CLOSE
> > > > > 2. Client sends OPEN with same owner
> > > > > 3. Client sends another OPEN with same owner
> > > > > 4. Client gets a reply to OPEN in 3, stateid is B.2 (stateid B
> > > > > sequence 2)
> > > > > 5. Client does LOCK,LOCKU,FREE_STATEID from B.2
> > > > > 6. Client gets a reply to CLOSE in 1
> > > > > 7. Client gets reply to OPEN in 2, stateid is B.1
> > > > > 8. Client sends LOCK with B.1 - OLD_STATEID, now stuck in a loop
> > > > >
> > > > > The CLOSE response in 6 causes us to clear NFS_OPEN_STATE, so that
> > > > > the OPEN
> > > > > response in 7 is able to update the open_stateid even though it has a
> > > > > lower
> > > > > sequence number.
> > > > >
> > > > > I think this case could be handled by never updating the open_stateid
> > > > > if the
> > > > > stateids match but the sequence number of the new state is less than
> > > > > the
> > > > > current open_state.
> > > > >
> > > >
> > > > What kernel is this on?
> > >
> > > On v4.9-rc2 with a couple fixups. Without them, I can't test long
> > > enough to
> > > reproduce this race. I don't think any of those are involved in this
> > > problem, though.
> > >
> > > >
> > > >
> > > > Yes, that seems wrong. The client should be picking B.2 for the open
> > > > stateid to use. I think that decision of whether to take a seqid is
> > > > made
> > > > in nfs_need_update_open_stateid. The logic in there looks correct to
> > > > me
> > > > at first glance though.
> > >
> > > nfs_need_update_open_stateid() will return true if NFS_OPEN_STATE is
> > > unset.
> > > That's the precondition set up by steps 1-6. Perhaps it should not
> > > update
> > > the stateid if they match but the sequence number is less, and still set
> > > NFS_OPEN_STATE once more. That will fix _this_ case. Are there other
> > > cases
> > > where that would be a problem?
> > >
> > > Ben
> >
> > That seems wrong.
>
> I'm not sure what you mean: what seems wrong?
>
Sorry, it seems wrong that the client would issue the LOCK with B.1
there.
> >
> > The only close was sent in step 1, and that was for a
> > completely different stateid (A rather than B). It seems likely that
> > that is where the bug is.
>
> I'm still not sure what point you're trying to make..
>
> Even though the close was sent in step 1, the response wasn't processed
> until step 6..
Not really a point per-se, I was just saying where I think the bug might
be...
When you issue a CLOSE, you issue it vs. a particular stateid (stateid
"A" in this case). Once the open stateid has been superseded by "B", the
closing of "A" should have no effect.
Perhaps nfs_clear_open_stateid needs to check and see whether the open
stateid has been superseded before doing its thing?
--
Jeff Layton <jlayton@redhat.com>
next prev parent reply other threads:[~2016-11-12 21:16 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-11-12 11:08 CLOSE/OPEN race Benjamin Coddington
2016-11-12 12:54 ` Jeff Layton
2016-11-12 15:31 ` Benjamin Coddington
2016-11-12 16:52 ` Jeff Layton
2016-11-12 18:03 ` Benjamin Coddington
2016-11-12 21:16 ` Jeff Layton [this message]
2016-11-13 2:56 ` Jeff Layton
2016-11-13 13:34 ` Benjamin Coddington
2016-11-13 14:22 ` Benjamin Coddington
2016-11-13 14:33 ` Jeff Layton
2016-11-13 14:47 ` Trond Myklebust
2016-11-14 14:53 ` Benjamin Coddington
2016-11-14 16:29 ` Trond Myklebust
2016-11-14 18:40 ` Benjamin Coddington
2016-11-12 18:16 ` Trond Myklebust
2016-11-12 18:46 ` Benjamin Coddington
2016-11-13 3:09 ` Jeff Layton
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1478985360.2442.29.camel@redhat.com \
--to=jlayton@redhat.com \
--cc=bcodding@redhat.com \
--cc=linux-nfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).