From: "J. Bruce Fields" <bfields@fieldses.org>
To: Jeff Layton <jlayton@poochiereds.net>
Cc: linux-nfs@vger.kernel.org, hch@lst.de, kinglongmee@gmail.com
Subject: Re: [PATCH v3 14/20] nfsd: close cached files prior to a REMOVE or RENAME that would replace target
Date: Thu, 27 Aug 2015 09:38:06 -0400 [thread overview]
Message-ID: <20150827133806.GA10468@fieldses.org> (raw)
In-Reply-To: <20150826185331.74977119@synchrony.poochiereds.net>
On Wed, Aug 26, 2015 at 06:53:31PM -0400, Jeff Layton wrote:
> On Wed, 26 Aug 2015 16:00:32 -0400
> "J. Bruce Fields" <bfields@fieldses.org> wrote:
>
> > On Thu, Aug 20, 2015 at 07:17:14AM -0400, Jeff Layton wrote:
> > > It's not uncommon for some workloads to do a bunch of I/O to a file and
> > > delete it just afterward. If knfsd has a cached open file however, then
> > > the file may still be open when the dentry is unlinked. If the
> > > underlying filesystem is nfs, then that could trigger it to do a
> > > sillyrename.
> >
> > Possibly worth noting that situation doesn't currently occur upstream.
> >
> > (And, another justification worth noting: space used by a file should be
> > deallocated on last unlink or close. People do sometimes notice if it's
> > not, especially if the file is large.)
> >
>
> Good points.
>
> > > On a REMOVE or RENAME scan the nfsd_file cache for open files that
> > > correspond to the inode, and proactively unhash and put their
> > > references. This should prevent any delete-on-last-close activity from
> > > occurring, solely due to knfsd's open file cache.
> >
> > Is there anything here to prevent a new cache entry being added after
> > nfsd_file_close_inode and before the file is actually removed?
> >
>
> No, nothing -- it's strictly best effort.
Unfortunately I think this is something we really want to guarantee.
--b.
> What might make sense is to consider looking at the dentry associated
> with the struct file when putting the last reference to the nfsd_file.
> If it's unhashed, then we could unhash the nfsd_file and put the hash
> reference for it.
>
> That won't prevent silly renames in the case of NFS being reexported,
> of course, but it should ensure that we don't leave the thing open
> indefinitely in the case of such a race.
>
> I'll have to think about that one as well...
>
> > --b.
> >
> > >
> > > Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
> > > ---
> > > fs/nfsd/filecache.c | 25 +++++++++++++++++++++++++
> > > fs/nfsd/filecache.h | 1 +
> > > fs/nfsd/trace.h | 17 +++++++++++++++++
> > > fs/nfsd/vfs.c | 17 +++++++++++++++--
> > > 4 files changed, 58 insertions(+), 2 deletions(-)
> > >
> > > diff --git a/fs/nfsd/filecache.c b/fs/nfsd/filecache.c
> > > index e48b536762aa..4bd683f03b6e 100644
> > > --- a/fs/nfsd/filecache.c
> > > +++ b/fs/nfsd/filecache.c
> > > @@ -283,6 +283,31 @@ nfsd_file_find_locked(struct inode *inode, unsigned int may_flags,
> > > return NULL;
> > > }
> > >
> > > +/**
> > > + * nfsd_file_close_inode - attempt to forcibly close a nfsd_file
> > > + * @inode: inode of the file to attempt to remove
> > > + *
> > > + * Walk the whole hash bucket, looking for any files that correspond to "inode".
> > > + * If any do, then unhash them and put the hashtable reference to them.
> > > + */
> > > +void
> > > +nfsd_file_close_inode(struct inode *inode)
> > > +{
> > > + struct nfsd_file *nf;
> > > + struct hlist_node *tmp;
> > > + unsigned int hashval = (unsigned int)hash_ptr(inode, NFSD_FILE_HASH_BITS);
> > > + LIST_HEAD(dispose);
> > > +
> > > + spin_lock(&nfsd_file_hashtbl[hashval].nfb_lock);
> > > + hlist_for_each_entry_safe(nf, tmp, &nfsd_file_hashtbl[hashval].nfb_head, nf_node) {
> > > + if (inode == nf->nf_inode)
> > > + nfsd_file_unhash_and_release_locked(nf, &dispose);
> > > + }
> > > + spin_unlock(&nfsd_file_hashtbl[hashval].nfb_lock);
> > > + trace_nfsd_file_close_inode(hashval, inode, !list_empty(&dispose));
> > > + nfsd_file_dispose_list(&dispose);
> > > +}
> > > +
> > > __be32
> > > nfsd_file_acquire(struct svc_rqst *rqstp, struct svc_fh *fhp,
> > > unsigned int may_flags, struct nfsd_file **pnf)
> > > diff --git a/fs/nfsd/filecache.h b/fs/nfsd/filecache.h
> > > index debd558ef786..191cdb25aa66 100644
> > > --- a/fs/nfsd/filecache.h
> > > +++ b/fs/nfsd/filecache.h
> > > @@ -26,6 +26,7 @@ int nfsd_file_cache_init(void);
> > > void nfsd_file_cache_shutdown(void);
> > > void nfsd_file_put(struct nfsd_file *nf);
> > > struct nfsd_file *nfsd_file_get(struct nfsd_file *nf);
> > > +void nfsd_file_close_inode(struct inode *inode);
> > > __be32 nfsd_file_acquire(struct svc_rqst *rqstp, struct svc_fh *fhp,
> > > unsigned int may_flags, struct nfsd_file **nfp);
> > > #endif /* _FS_NFSD_FILECACHE_H */
> > > diff --git a/fs/nfsd/trace.h b/fs/nfsd/trace.h
> > > index 2dac872d31e8..95af3b9c7b66 100644
> > > --- a/fs/nfsd/trace.h
> > > +++ b/fs/nfsd/trace.h
> > > @@ -139,6 +139,23 @@ TRACE_EVENT(nfsd_file_acquire,
> > > show_nf_may(__entry->nf_may), __entry->nf_file,
> > > be32_to_cpu(__entry->status))
> > > );
> > > +
> > > +TRACE_EVENT(nfsd_file_close_inode,
> > > + TP_PROTO(unsigned int hash, struct inode *inode, int found),
> > > + TP_ARGS(hash, inode, found),
> > > + TP_STRUCT__entry(
> > > + __field(unsigned int, hash)
> > > + __field(struct inode *, inode)
> > > + __field(int, found)
> > > + ),
> > > + TP_fast_assign(
> > > + __entry->hash = hash;
> > > + __entry->inode = inode;
> > > + __entry->found = found;
> > > + ),
> > > + TP_printk("hash=0x%x inode=0x%p found=%d", __entry->hash,
> > > + __entry->inode, __entry->found)
> > > +);
> > > #endif /* _NFSD_TRACE_H */
> > >
> > > #undef TRACE_INCLUDE_PATH
> > > diff --git a/fs/nfsd/vfs.c b/fs/nfsd/vfs.c
> > > index 6cfd96adcc71..98d3b9d96480 100644
> > > --- a/fs/nfsd/vfs.c
> > > +++ b/fs/nfsd/vfs.c
> > > @@ -1583,6 +1583,15 @@ out_nfserr:
> > > goto out_unlock;
> > > }
> > >
> > > +static void
> > > +nfsd_close_cached_files(struct dentry *dentry)
> > > +{
> > > + struct inode *inode = d_inode(dentry);
> > > +
> > > + if (inode && S_ISREG(inode->i_mode))
> > > + nfsd_file_close_inode(inode);
> > > +}
> > > +
> > > /*
> > > * Rename a file
> > > * N.B. After this call _both_ ffhp and tfhp need an fh_put
> > > @@ -1652,6 +1661,7 @@ nfsd_rename(struct svc_rqst *rqstp, struct svc_fh *ffhp, char *fname, int flen,
> > > if (ffhp->fh_export->ex_path.dentry != tfhp->fh_export->ex_path.dentry)
> > > goto out_dput_new;
> > >
> > > + nfsd_close_cached_files(ndentry);
> > > host_err = vfs_rename(fdir, odentry, tdir, ndentry, NULL, 0);
> > > if (!host_err) {
> > > host_err = commit_metadata(tfhp);
> > > @@ -1721,10 +1731,13 @@ nfsd_unlink(struct svc_rqst *rqstp, struct svc_fh *fhp, int type,
> > > if (!type)
> > > type = d_inode(rdentry)->i_mode & S_IFMT;
> > >
> > > - if (type != S_IFDIR)
> > > + if (type != S_IFDIR) {
> > > + nfsd_close_cached_files(rdentry);
> > > host_err = vfs_unlink(dirp, rdentry, NULL);
> > > - else
> > > + } else {
> > > host_err = vfs_rmdir(dirp, rdentry);
> > > + }
> > > +
> > > if (!host_err)
> > > host_err = commit_metadata(fhp);
> > > dput(rdentry);
> > > --
> > > 2.4.3
>
>
> --
> Jeff Layton <jlayton@poochiereds.net>
next prev parent reply other threads:[~2015-08-27 13:38 UTC|newest]
Thread overview: 41+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-08-20 11:17 [PATCH v3 00/20] nfsd: open file caching Jeff Layton
2015-08-20 11:17 ` [PATCH v3 01/20] nfsd: allow more than one laundry job to run at a time Jeff Layton
2015-08-20 11:17 ` [PATCH v3 02/20] nfsd: add a new struct file caching facility to nfsd Jeff Layton
2015-08-20 23:11 ` Peng Tao
2015-08-20 23:43 ` Jeff Layton
2015-08-20 11:17 ` [PATCH v3 03/20] list_lru: add list_lru_rotate Jeff Layton
2015-08-20 11:17 ` Jeff Layton
2015-08-21 9:36 ` Vladimir Davydov
2015-08-21 9:36 ` Vladimir Davydov
2015-08-20 11:17 ` [PATCH v3 04/20] nfsd: add a LRU list for nfsd_files Jeff Layton
2015-08-20 11:17 ` [PATCH v3 05/20] nfsd: add a shrinker to the nfsd_file cache Jeff Layton
2015-08-20 11:17 ` [PATCH v3 06/20] locks/nfsd: create a new notifier chain for lease attempts Jeff Layton
2015-08-26 19:49 ` J. Bruce Fields
2015-08-26 22:39 ` Jeff Layton
2015-08-20 11:17 ` [PATCH v3 07/20] nfsd: hook up nfsd_write to the new nfsd_file cache Jeff Layton
2015-08-26 19:53 ` J. Bruce Fields
2015-08-26 22:40 ` Jeff Layton
2015-08-20 11:17 ` [PATCH v3 08/20] nfsd: hook up nfsd_read to the " Jeff Layton
2015-08-20 11:17 ` [PATCH v3 09/20] sunrpc: add a new cache_detail operation for when a cache is flushed Jeff Layton
2015-08-20 11:17 ` [PATCH v3 10/20] nfsd: handle NFSD_MAY_NOT_BREAK_LEASE in open file cache Jeff Layton
2015-08-20 11:17 ` [PATCH v3 11/20] nfsd: hook nfsd_commit up to the nfsd_file cache Jeff Layton
2015-08-20 11:17 ` [PATCH v3 12/20] nfsd: move include of state.h from trace.c to trace.h Jeff Layton
2015-08-20 11:17 ` [PATCH v3 13/20] nfsd: add new tracepoints for nfsd_file cache Jeff Layton
2015-08-20 11:17 ` [PATCH v3 14/20] nfsd: close cached files prior to a REMOVE or RENAME that would replace target Jeff Layton
2015-08-26 20:00 ` J. Bruce Fields
2015-08-26 22:53 ` Jeff Layton
2015-08-27 13:38 ` J. Bruce Fields [this message]
2015-08-28 12:19 ` Jeff Layton
2015-08-28 17:58 ` J. Bruce Fields
2015-08-31 16:50 ` Jeff Layton
2015-08-20 11:17 ` [PATCH v3 15/20] nfsd: call flush_delayed_fput from nfsd_file_close_fh Jeff Layton
2015-08-21 1:01 ` Peng Tao
2015-08-21 2:18 ` Peng Tao
2015-08-21 11:21 ` Jeff Layton
2015-08-20 11:17 ` [PATCH v3 16/20] nfsd: convert nfs4_file->fi_fds array to use nfsd_files Jeff Layton
2015-08-20 11:17 ` [PATCH v3 17/20] nfsd: have nfsd_test_lock use the nfsd_file cache Jeff Layton
2015-08-20 11:17 ` [PATCH v3 18/20] nfsd: convert fi_deleg_file and ls_file fields to nfsd_file Jeff Layton
2015-08-20 11:17 ` [PATCH v3 19/20] nfsd: hook up nfs4_preprocess_stateid_op to the nfsd_file cache Jeff Layton
2015-08-21 1:28 ` Peng Tao
2015-08-21 11:23 ` Jeff Layton
2015-08-20 11:17 ` [PATCH v3 20/20] nfsd: rip out the raparms cache Jeff Layton
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150827133806.GA10468@fieldses.org \
--to=bfields@fieldses.org \
--cc=hch@lst.de \
--cc=jlayton@poochiereds.net \
--cc=kinglongmee@gmail.com \
--cc=linux-nfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.