Re: Temporary hangs when using locking with apache+nfsv4

All of lore.kernel.org
 help / color / mirror / Atom feed

From: "J. Bruce Fields" <bfields@fieldses.org>
To: Jeff Layton <jlayton@redhat.com>
Cc: Trond Myklebust <trond.myklebust@primarydata.com>,
	Dennis Jacobfeuerborn <dennisml@conversis.de>,
	linux-nfs@vger.kernel.org
Subject: Re: Temporary hangs when using locking with apache+nfsv4
Date: Mon, 3 Mar 2014 17:35:37 -0500	[thread overview]
Message-ID: <20140303223537.GA12805@fieldses.org> (raw)
In-Reply-To: <20140303172921.19aecbab@tlielax.poochiereds.net>

On Mon, Mar 03, 2014 at 05:29:21PM -0500, Jeff Layton wrote:
> On Mon, 3 Mar 2014 15:41:54 -0500
> "J. Bruce Fields" <bfields@fieldses.org> wrote:
> 
> > On Mon, Mar 03, 2014 at 11:41:19AM -0500, Jeff Layton wrote:
> > > On Mon, 3 Mar 2014 10:46:37 -0500
> > > Trond Myklebust <trond.myklebust@primarydata.com> wrote:
> > > 
> > > > 
> > > > On Mar 3, 2014, at 10:43, Jeff Layton <jlayton@redhat.com> wrote:
> > > > 
> > > > > On Mon, 03 Mar 2014 06:47:52 +0100
> > > > > Dennis Jacobfeuerborn <dennisml@conversis.de> wrote:
> > > > > 
> > > > >> Hi,
> > > > >> I'm experimenting with using NFSv4 as storage for web servers and while 
> > > > >> regular file access seems to work fine as soon as I bring flock() into 
> > > > >> play things become more problematic.
> > > > >> I've create a tiny test php script that basically opens a file, locks it 
> > > > >> using flock(), writes that fact into a log file (on a local filesystem), 
> > > > >> performs a usleep(1000), writes into the log that it is about to unlock 
> > > > >> the file and finally unlocks it.
> > > > >> I invoke that script using ab with a concurrency of 20 for a few 
> > > > >> thousand requests.
> > > > >> 
> > > > > 
> > > > > Is all the activity from a single client, or are multiple clients
> > > > > contending for the lock?
> > > > > 
> > > > >> The result is that while 99% of the request respond quickly a few 
> > > > >> request seem to hang for up to 30 seconds. According to the log file 
> > > > >> they must eventually succeed since I see all expected entries and the 
> > > > >> locking seems to work as well since all entries are in the expected order.
> > > > >> 
> > > > >> Is it expected that these long delays happen? When I comment the locking 
> > > > >> function out these hangs disappear.
> > > > >> Are there some knobs to tune NFS and make it behave better in these 
> > > > >> situations?
> > > > >> 
> > > > > 
> > > > > NFSv4 locking is inherently unfair. If you're doing a blocking lock,
> > > > > then the client is expected to poll for it. So, long delays are
> > > > > possible if you just happen to be unlucky and keep missing the lock.
> > > > > 
> > > > > There's no knob to tune, but there probably is room for improvement in
> > > > > this code. In principle we could try to be more aggressive about
> > > > > getting the lock by trying to wake up one or more blocked tasks whenever
> > > > > a lock is released. You might still end up with delays, but it could
> > > > > help improve responsiveness.
> > > > 
> > > > …or you could implement the NFSv4.1 lock callback functionality. That would scale better than more aggressive polling.
> > > 
> > > I had forgotten about those. I wonder what servers actually implement
> > > them? I don't think Linux' knfsd does yet.
> > 
> > No.  How I'd imagined it would work:
> > 
> > 	- on a failed blocking lock request, insert a waiter.
> > 	- when the lock the waiter is blocking on is released or
> > 	  downgraded, apply the waiting lock as a "provisional" lock:
> > 	  add it to the i_flock list, but *don't* allow it to downgrade
> > 	  or merge with any existing locks.  Then send the callback.
> > 	- when the client resends the lock request, finish applying the
> > 	  lock.  This is when we downgrade, merge, or split as
> > 	  necessary.
> > 	- Alternatively, if some timeout passes without the client
> > 	  requesting the lock again, give up and remove the
> > 	  "provisional" lock.
> > 
> 
> Do we really need to do that?
> 
> RFC5667 seems to indicate that the server isn't required to hold the
> lock for the client when it sends the callback.
> 
> As a first step, we could just add the callbacks and not try to hold
> the lock for the client. That wouldn't be too hard to do -- maybe just
> add a blocking FL_ACCESS request to the i_flock list and then issue
> a CB_NOTIFY_LOCK when that returns.

Yes, you're right, something like that is probably a better first step.

--b.

next prev parent reply	other threads:[~2014-03-03 22:35 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-03-03  5:47 Temporary hangs when using locking with apache+nfsv4 Dennis Jacobfeuerborn
2014-03-03 15:43 ` Jeff Layton
2014-03-03 15:46   ` Trond Myklebust
2014-03-03 16:41     ` Jeff Layton
2014-03-03 18:22       ` Trond Myklebust
2014-03-03 18:34         ` Jeff Layton
2014-03-03 19:02           ` Trond Myklebust
2014-03-03 22:41             ` Jeff Layton
2014-03-03 20:41       ` J. Bruce Fields
2014-03-03 22:29         ` Jeff Layton
2014-03-03 22:35           ` J. Bruce Fields [this message]
2014-03-03 23:03   ` Dennis Jacobfeuerborn

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140303223537.GA12805@fieldses.org \
    --to=bfields@fieldses.org \
    --cc=dennisml@conversis.de \
    --cc=jlayton@redhat.com \
    --cc=linux-nfs@vger.kernel.org \
    --cc=trond.myklebust@primarydata.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.