* [PATCH] nfsd: don't set a FL_LAYOUT lease for flexfiles layouts
@ 2016-08-11 17:36 Jeff Layton
2016-08-11 17:38 ` Christoph Hellwig
0 siblings, 1 reply; 4+ messages in thread
From: Jeff Layton @ 2016-08-11 17:36 UTC (permalink / raw)
To: bfields; +Cc: linux-nfs, Trond Myklebust, hch
We currently can hit a deadlock (of sorts) when trying to use flexfiles
layouts with XFS. XFS will call break_layout when something wants to
write to the file. In the case of the (super-simple) flexfiles layout
driver in knfsd, the MDS and DS are the same machine.
The client can get a layout and then issue a v3 write to do its I/O. XFS
will then call xfs_break_layouts, which will cause a CB_LAYOUTRECALL to
be issued to the client. The client however can't return the layout
until the v3 WRITE completes, but XFS won't allow the write to proceed
until the layout is returned.
Christoph says:
XFS only cares about block-like layouts where the client has direct
access to the file blocks. I'd need to look how to propagate the
flag into break_layout, but in principle we don't need to do any
recalls on truncate ever for file and flexfile layouts.
If we're never going to recall the layout, then we don't even need to
set the lease at all. Just skip doing so on flexfiles layouts by
adding a new flag to struct nfsd4_layout_ops and skipping the lease
setting and removal when that flag is true.
Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: Jeff Layton <jlayton@redhat.com>
---
fs/nfsd/flexfilelayout.c | 1 +
fs/nfsd/nfs4layouts.c | 6 +++++-
fs/nfsd/pnfs.h | 1 +
3 files changed, 7 insertions(+), 1 deletion(-)
diff --git a/fs/nfsd/flexfilelayout.c b/fs/nfsd/flexfilelayout.c
index df880e9fa71f..b67287383010 100644
--- a/fs/nfsd/flexfilelayout.c
+++ b/fs/nfsd/flexfilelayout.c
@@ -126,6 +126,7 @@ nfsd4_ff_proc_getdeviceinfo(struct super_block *sb, struct svc_rqst *rqstp,
const struct nfsd4_layout_ops ff_layout_ops = {
.notify_types =
NOTIFY_DEVICEID4_DELETE | NOTIFY_DEVICEID4_CHANGE,
+ .disable_recalls = true,
.proc_getdeviceinfo = nfsd4_ff_proc_getdeviceinfo,
.encode_getdeviceinfo = nfsd4_ff_encode_getdeviceinfo,
.proc_layoutget = nfsd4_ff_proc_layoutget,
diff --git a/fs/nfsd/nfs4layouts.c b/fs/nfsd/nfs4layouts.c
index 2be9602b0221..42aace4fc4c8 100644
--- a/fs/nfsd/nfs4layouts.c
+++ b/fs/nfsd/nfs4layouts.c
@@ -174,7 +174,8 @@ nfsd4_free_layout_stateid(struct nfs4_stid *stid)
list_del_init(&ls->ls_perfile);
spin_unlock(&fp->fi_lock);
- vfs_setlease(ls->ls_file, F_UNLCK, NULL, (void **)&ls);
+ if (!nfsd4_layout_ops[ls->ls_layout_type]->disable_recalls)
+ vfs_setlease(ls->ls_file, F_UNLCK, NULL, (void **)&ls);
fput(ls->ls_file);
if (ls->ls_recalled)
@@ -189,6 +190,9 @@ nfsd4_layout_setlease(struct nfs4_layout_stateid *ls)
struct file_lock *fl;
int status;
+ if (nfsd4_layout_ops[ls->ls_layout_type]->disable_recalls)
+ return 0;
+
fl = locks_alloc_lock();
if (!fl)
return -ENOMEM;
diff --git a/fs/nfsd/pnfs.h b/fs/nfsd/pnfs.h
index 0c2a716e8741..d27a5aa60022 100644
--- a/fs/nfsd/pnfs.h
+++ b/fs/nfsd/pnfs.h
@@ -19,6 +19,7 @@ struct nfsd4_deviceid_map {
struct nfsd4_layout_ops {
u32 notify_types;
+ bool disable_recalls;
__be32 (*proc_getdeviceinfo)(struct super_block *sb,
struct svc_rqst *rqstp,
--
2.7.4
^ permalink raw reply related [flat|nested] 4+ messages in thread
* Re: [PATCH] nfsd: don't set a FL_LAYOUT lease for flexfiles layouts
2016-08-11 17:36 [PATCH] nfsd: don't set a FL_LAYOUT lease for flexfiles layouts Jeff Layton
@ 2016-08-11 17:38 ` Christoph Hellwig
2016-08-11 17:41 ` Jeff Layton
0 siblings, 1 reply; 4+ messages in thread
From: Christoph Hellwig @ 2016-08-11 17:38 UTC (permalink / raw)
To: Jeff Layton; +Cc: bfields, linux-nfs, Trond Myklebust, hch
This looks fine to me, although it might be a good idea to just
have a flags field in the layout_ops structure for additional
quirks in the future instead of the boolean.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] nfsd: don't set a FL_LAYOUT lease for flexfiles layouts
2016-08-11 17:38 ` Christoph Hellwig
@ 2016-08-11 17:41 ` Jeff Layton
2016-08-11 18:14 ` J. Bruce Fields
0 siblings, 1 reply; 4+ messages in thread
From: Jeff Layton @ 2016-08-11 17:41 UTC (permalink / raw)
To: Christoph Hellwig; +Cc: bfields, linux-nfs, Trond Myklebust
On Thu, 2016-08-11 at 19:38 +0200, Christoph Hellwig wrote:
> This looks fine to me, although it might be a good idea to just
> have a flags field in the layout_ops structure for additional
> quirks in the future instead of the boolean.
Sure...that's reasonable. If and when the need arises for more flags we
can always do the conversion to a flags field then as well.
I'll Bruce make the call on which he'd rather see...
--
Jeff Layton <jlayton@redhat.com>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] nfsd: don't set a FL_LAYOUT lease for flexfiles layouts
2016-08-11 17:41 ` Jeff Layton
@ 2016-08-11 18:14 ` J. Bruce Fields
0 siblings, 0 replies; 4+ messages in thread
From: J. Bruce Fields @ 2016-08-11 18:14 UTC (permalink / raw)
To: Jeff Layton; +Cc: Christoph Hellwig, linux-nfs, Trond Myklebust
On Thu, Aug 11, 2016 at 01:41:58PM -0400, Jeff Layton wrote:
> On Thu, 2016-08-11 at 19:38 +0200, Christoph Hellwig wrote:
> > This looks fine to me, although it might be a good idea to just
> > have a flags field in the layout_ops structure for additional
> > quirks in the future instead of the boolean.
>
>
> Sure...that's reasonable. If and when the need arises for more flags we
> can always do the conversion to a flags field then as well.
>
> I'll Bruce make the call on which he'd rather see...
I don't really care, so... I'll take your version as is barring other
objections.
--b.
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2016-08-11 18:14 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-08-11 17:36 [PATCH] nfsd: don't set a FL_LAYOUT lease for flexfiles layouts Jeff Layton
2016-08-11 17:38 ` Christoph Hellwig
2016-08-11 17:41 ` Jeff Layton
2016-08-11 18:14 ` J. Bruce Fields
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.