From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from mta-102a.earthlink-vadesecure.net (mta-102b.earthlink-vadesecure.net [51.81.61.67])
	(using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits))
	(No client certificate requested)
	by smtp.subspace.kernel.org (Postfix) with ESMTPS id E66401EB5B
	for <linux-nfs@vger.kernel.org>; Thu, 21 Aug 2025 17:45:55 +0000 (UTC)
Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=51.81.61.67
ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116;
	t=1755798358; cv=none; b=LbADZuccAQ3eKZax0h6zha7R1YhCrxtJfX//KRs373PtEZ/yPLb1oDNBmXzZgKeGXJkkPFc5xFPvJjCTfxGox5Opi14ybPZo4Z5dK83uvoFoQaICBNMb3o0hB8ovScu5MFErM5OWmoLz+rb0+olWdajLTLiUvNhpAHwQiGO9q5w=
ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org;
	s=arc-20240116; t=1755798358; c=relaxed/simple;
	bh=LL7AzRdk9J0j1N9ND72k1/yxnZC5tA5+beiLE/kHY6g=;
	h=From:To:Cc:References:In-Reply-To:Subject:Date:Message-ID:
	 MIME-Version:Content-Type; b=SWVa7nKlEvbNKrw69ccZXYOv7A7at9j90f19xKVv9CQKoRCmKkBaEC/XXoKU/H0EsuhPhmshFkKnt1u+2T4gK+LjiuEpXh4ELn1B7tjt07be6mB38UX70obzGfE21eOUqdcSp4bLwBVpAECB6rAx48zu4dFZ3EuiyuVJ2XDvBgM=
ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=mindspring.com; spf=pass smtp.mailfrom=mindspring.com; dkim=pass (2048-bit key) header.d=earthlink.net header.i=@earthlink.net header.b=h+5zVhSe; arc=none smtp.client-ip=51.81.61.67
Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=mindspring.com
Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=mindspring.com
Authentication-Results: smtp.subspace.kernel.org;
	dkim=pass (2048-bit key) header.d=earthlink.net header.i=@earthlink.net header.b="h+5zVhSe"
Authentication-Results: earthlink-vadesecure.net;
 auth=pass smtp.auth=ffilzlnx@mindspring.com smtp.mailfrom=ffilzlnx@mindspring.com;
DKIM-Signature: v=1; a=rsa-sha256; bh=GGc6gNltF8egXcOi3vQd56Jm+0IThFClEVdMuk
 JLz5I=; c=relaxed/relaxed; d=earthlink.net; h=from:reply-to:subject:
 date:to:cc:resent-date:resent-from:resent-to:resent-cc:in-reply-to:
 references:list-id:list-help:list-unsubscribe:list-unsubscribe-post:
 list-subscribe:list-post:list-owner:list-archive; q=dns/txt;
 s=dk12062016; t=1755797438; x=1756402238; b=h+5zVhSe50Y3sDTcTFep0g/2OHw
 gut2kBhZHMIU7PLAV5FPNb+/xsXcot3rN7TZPbB9Jg83a6Iq/FfRIv5W3MZZ3j76jOaf8oM
 VIQELMe74HeMpuy5tgSqocOa9utbMkzFEkQlCzRk6CWpILGFQ70avHaHS5CVQvsiov4+tPL
 ZfJQpD/G5HijzfFdZ02+vkHH8+VysU/kv5UijdYJ31mdteX8H0RAcZBXpg+vebQoamPv8OZ
 uEX5yam4iPe4ys6bwXHm86Bx9DZgmI7t8pIWhbq31cZs7P4vxtSZMG3nhDRTiNoL6EwwA3N
 tTU7nJSs9KPf8aYkAxbM5z6up3K6HrA==
Received: from FRANKSTHINKPAD ([71.237.148.155])
 by vsel1nmtao02p.internal.vadesecure.com with ngmta
 id cc13e569-185dd8779355c456; Thu, 21 Aug 2025 17:30:38 +0000
From: "Frank Filz" <ffilzlnx@mindspring.com>
To: "'Calum Mackay'" <calum.mackay@oracle.com>,
	<linux-nfs@vger.kernel.org>
Cc: "'Ofir Vainshtein'" <ofirvins@google.com>,
	"'Chuck Lever'" <chuck.lever@oracle.com>
References: <01d001dc0ba9$e4cb0080$ae610180$@mindspring.com> <44d19311-7644-4f6e-8509-ff7312ba3ad9@oracle.com>
In-Reply-To: <44d19311-7644-4f6e-8509-ff7312ba3ad9@oracle.com>
Subject: RE: PYNFS LOCK20 Blocking Lock Test Case
Date: Thu, 21 Aug 2025 10:30:36 -0700
Message-ID: <009301dc12c1$4f9cb390$eed61ab0$@mindspring.com>
Precedence: bulk
X-Mailing-List: linux-nfs@vger.kernel.org
List-Id: <linux-nfs.vger.kernel.org>
List-Subscribe: <mailto:linux-nfs+subscribe@vger.kernel.org>
List-Unsubscribe: <mailto:linux-nfs+unsubscribe@vger.kernel.org>
MIME-Version: 1.0
Content-Type: text/plain;
	charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Mailer: Microsoft Outlook 15.0
Thread-Index: AQLX0cafepDmnK5On/trYyHnSs4j1gGuqD59smhofNA=
Content-Language: en-us

Ah, I see the logic the test case is expecting.

For Ganesha, we maintain the blocking lock so long as the clientid is =
being renewed, so we don't start the timer for claiming the lock until =
the lock becomes available which seems to be allowed per the RFC. Maybe =
we just need to not run that test case.

But it would be nice to have a similar test case that just takes too =
long after the lock is available to retry.

Part of the challenge is we share a lot of logic between 4.0 and 4.1 and =
with the actual callback in 4.1, there is no expectation of the client =
polling for the lock.

Let me mull this one over more.

Thanks

Frank

> -----Original Message-----
> From: Calum Mackay [mailto:calum.mackay@oracle.com]
> Sent: Wednesday, August 13, 2025 6:30 PM
> To: Frank Filz <ffilzlnx@mindspring.com>; linux-nfs@vger.kernel.org
> Cc: Calum Mackay <calum.mackay@oracle.com>; 'Ofir Vainshtein'
> <ofirvins@google.com>; Chuck Lever <chuck.lever@oracle.com>
> Subject: Re: PYNFS LOCK20 Blocking Lock Test Case
>=20
> On 12/08/2025 5:55 pm, Frank Filz wrote:
> > I believe this test case is wrong, relevant text from RFC:
> >
> > Some clients require the support of blocking locks. The NFSv4 =
protocol
> > must not rely on a callback mechanism and therefore is unable to
> > notify a client when a previously denied lock has been granted.
> > Clients have no choice but to continually poll for the lock. This
> > presents a fairness problem. Two new lock types are added, READW and
> > WRITEW, and are used to indicate to the server that the client is
> > requesting a blocking lock. The server should maintain an ordered =
list
> > of pending blocking locks. When the conflicting lock is released, =
the
> > server may wait the lease period for the first waiting client to
> > re-request the lock. After the lease period expires, the next =
waiting
> > client request is allowed the lock.
> >
> > Test case:
> >
> >      # Standard owner opens and locks a file
> >      fh1, stateid1 =3D c.create_confirm(t.word(),
> deny=3DOPEN4_SHARE_DENY_NONE)
> >      res1 =3D c.lock_file(t.word(), fh1, stateid1, type=3DWRITE_LT)
> >      check(res1, msg=3D"Locking file %s" % t.word())
> >      # Second owner is denied a blocking lock
> >      file =3D c.homedir + [t.word()]
> >      fh2, stateid2 =3D c.open_confirm(b"owner2", file,
> >                                     =
access=3DOPEN4_SHARE_ACCESS_BOTH,
> >                                     deny=3DOPEN4_SHARE_DENY_NONE)
> >      res2 =3D c.lock_file(b"owner2", fh2, stateid2,
> >                         type=3DWRITEW_LT, =
lockowner=3Db"lockowner2_LOCK20")
> >      check(res2, NFS4ERR_DENIED, msg=3D"Conflicting lock on %s" % =
t.word())
> >      sleeptime =3D c.getLeaseTime() // 2
> >      # Wait for queued lock to timeout
> >      for i in range(3):
> >          env.sleep(sleeptime, "Waiting for queued blocking lock to =
timeout")
> >          res =3D c.compound([op.renew(c.clientid)])
> >          check(res, [NFS4_OK, NFS4ERR_CB_PATH_DOWN])
> >      # Standard owner releases lock
> >      res1 =3D c.unlock_file(1, fh1, res1.lockid)
> >      check(res1)
> >      # Third owner tries to butt in and steal lock second owner is
> > waiting for
> >      # Should work, since second owner let his turn expire
> >      file =3D c.homedir + [t.word()]
> >      fh3, stateid3 =3D c.open_confirm(b"owner3", file,
> >                                     =
access=3DOPEN4_SHARE_ACCESS_BOTH,
> >                                     deny=3DOPEN4_SHARE_DENY_NONE)
> >      res3 =3D c.lock_file(b"owner3", fh3, stateid3,
> >                         type=3DWRITEW_LT, =
lockowner=3Db"lockowner3_LOCK20")
> >      check(res3, msg=3D"Grabbing lock after another owner let his =
'turn'
> > expire")
> >
> > Note that the RFC indicated the client has one lease period AFTER =
the
> > conflicting lock is released to retry while the test case waits 1.5
> > lease period after requesting the blocking lock before it releases =
the
> > conflicting lock...
> >
> > Am I reading things right?
>=20
> I see what you mean.
>=20
> But since a waiting blocking lock client obviously doesn't know when =
the lock-
> holding client is going to release its lock, the waiting client has to =
start polling
> regularly as soon as its initial blocking lock request is denied. It =
has to poll at
> least once per lease period.
>=20
> If the server notices that a waiting client hasn't polled once in a =
lease period,
> after its initial blocking lock request was denied, then it seems =
reasonable for
> the server to forget that waiting client's interest in the pending =
lock there and
> then. There's no need for the server to wait a further lease period =
after the lock
> is released.
>=20
>=20
> Looking at the current Linux nfsd code, that does seem to be what it =
does. I see
> that when the server adds the blocking lock request to its pending =
list, it adds the
> current timestamp to it, i.e. the time that the blocking lock was =
requested.
>=20
> The nfsd background clean-up thread (which runs at least once per =
lease
> period) removes any pending blocking lock requests if a lease period =
has passed
> since they were placed on the list (i.e. when the blocking lock was =
requested).
> There's a corresponding comment:
>=20
> 	/*
> 	 * It's possible for a client to try and acquire an already held lock
> 	 * that is being held for a long time, and then lose interest in it.
> 	 * So, we clean out any un-revisited request after a lease period
> 	 * under the assumption that the client is no longer interested.
>=20
> =
https://elixir.bootlin.com/linux/v6.16/source/fs/nfsd/nfs4state.c#L6824
>=20
>=20
> There's no pending locks action taken on lock release. The timing is =
based solely
> on when the blocking READW/WRITEW request occurred, i.e.
> the res2 WRITEW in the pynfs test, which is before the sleep.
>=20
> So, whilst the RFC may seem to suggest the timer should start at lock =
release, it
> doesn't seem unreasonable for the NFS server to start the timer =
earlier, at the
> blocking lock request, to avoid an unnecessary delay upon lock release =
if the
> client has lost interest in the lock, i.e. it isn't polling.
>=20
>=20
> Presumably, the pynfs test was originally written to match NFS server =
behaviour,
> rather than RFC wording. I'm not sure what other NFS servers do in =
this case.
> Waiting longer wouldn't change the test result in this case, I think.
>=20
>=20
> Does that seem reasonable to you?
>=20
> thanks,
> calum.