From: Trond Myklebust <trond.myklebust@fys.uio.no>
To: Neil Brown <neilb@suse.de>
Cc: Linux NFS Mailing List <nfs@lists.sourceforge.net>
Subject: Re: Can we flush directory data when the ctime changes?
Date: Tue, 08 Aug 2006 10:05:17 -0400 [thread overview]
Message-ID: <1155045918.5673.26.camel@localhost> (raw)
In-Reply-To: <17624.15554.514216.623190@cse.unsw.edu.au>
On Tue, 2006-08-08 at 17:26 +1000, Neil Brown wrote:
> Hi,
> We have a scenario where readdir in the linux client gets confused by
> responses from a Netapp fileserver.
>
> What happens is that a readdir collects and caches the directory.
> Then a subsequent readdir finds the first page is still in the cache
> but the second page is missing.
> The nfs client uses a cookie from the first page to request subsequent
> entries and received a list of entries starting from the beginning of
> the directory rather than from the point that it was up to. It seems
> that the cookies have changed.
>
> The GETATTR call that the client makes to validate the cache reports
> that the mtime hasn't change, *but the ctime has*.
Growl! If the directory contents change, then the mtime should too. Page
22 of RFC1813:
Mtime is the time when the file data was last modified. Ctime
is the time when the attributes of the file were last changed.
Directory cookies are not file or directory attributes!
In addition, NFSv3 has the cookie verifier mechanism for the server to
specifically inform the client that the cookies are invalid.
IOW: this really needs to be fixed on the _server_. I'm not happy taking
new patches that may cause the client GETATTR calls to skyrocket again.
Cheers,
Trond
> I patched the (SLES9) kernel to invalidate the cache for directories
> when the ctime changes and this fixed the problem (which was quite
> repeatable).
>
> While it seems that it should be necessary to flush the dir cache on a
> ctime change, it doesn't seem harmful either.
>
> So: would it be acceptable to do this. The following patch should
> achieve this in current -mm (which is quite different from the 2.6.5
> kernel where I tested this).
>
> In case it is interesting, I have this commentary from one of our
> support personnel. I'm not sure if the content comes from someone at
> Netapp or elsewhere, but it sounds plausible.
>
> Thanks,
> NeilBrown
>
> ......
>
> For example, NetApp filer support NFS but their file system is not
> simply a true unix/linux style file system. They emulate that for
> NFS purposes but they also handle the file system in special ways
> for account for CIFS and other access.
>
> Things go on like files name conversions to unicode, for other name
> spaces. These events can take place at much later times than the
> original file name creation through NFS. This results in changes in
> the ctime stamp, without changes to mtime, and it also results in
> new cookie indexes being generated for the NFS file system. If an
> NFS client does not invalidate it's cache upon such a condition, if
> can result in inconsistent cache information, as it has in the
> example for which this bug report was filed.
>
>
> Signed-off-by: Neil Brown <neilb@suse.de>
>
> ### Diffstat output
> ./fs/nfs/inode.c | 5 +++++
> 1 file changed, 5 insertions(+)
>
> diff .prev/fs/nfs/inode.c ./fs/nfs/inode.c
> --- .prev/fs/nfs/inode.c 2006-08-08 17:03:15.000000000 +1000
> +++ ./fs/nfs/inode.c 2006-08-08 17:04:48.000000000 +1000
> @@ -943,6 +943,11 @@ static int nfs_update_inode(struct inode
> /* If ctime has changed we should definitely clear access+acl caches */
> if (!timespec_equal(&inode->i_ctime, &fattr->ctime)) {
> invalid |= NFS_INO_INVALID_ACCESS|NFS_INO_INVALID_ACL;
> + if (S_ISDIR(inode->i_mode))
> + /* Some servers (netapp) can change cookies when the
> + * ctime changes, without changing mtime...
> + */
> + invalid |= NFS_INO_INVALID_DATA;
> memcpy(&inode->i_ctime, &fattr->ctime, sizeof(inode->i_ctime));
> nfsi->cache_change_attribute = jiffies;
> }
-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
NFS maillist - NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs
next prev parent reply other threads:[~2006-08-08 14:05 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-08-08 7:26 Can we flush directory data when the ctime changes? Neil Brown
2006-08-08 13:28 ` Chuck Lever
2006-08-08 14:05 ` Trond Myklebust [this message]
2006-08-08 14:27 ` Peter Staubach
2006-08-08 14:35 ` Trond Myklebust
2006-08-15 3:49 ` Neil Brown
2006-08-15 15:26 ` Peter Staubach
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1155045918.5673.26.camel@localhost \
--to=trond.myklebust@fys.uio.no \
--cc=neilb@suse.de \
--cc=nfs@lists.sourceforge.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.