From: Steve Dickson <SteveD@redhat.com>
To: nfs@lists.sourceforge.net
Subject: Re: [PATCH] NFS - Fix for Infinite loop during syncing
Date: Mon, 31 Jan 2005 12:44:38 -0500 [thread overview]
Message-ID: <41FE6E86.8060303@RedHat.com> (raw)
In-Reply-To: <41BDFA46.7070403@RedHat.com>
[-- Attachment #1: Type: text/plain, Size: 2474 bytes --]
Steve Dickson wrote:
>
> It was brought to my attention that following series of events
> would cause an infinite loop in the 2.4 nfs kernels.
>
> 1) Mount the fileystem with acregmin=1,acregmax=1 from two clients.
> 2) On client 1, create a process that continuously writes to a file.
> 3) On client 2, remove that file that is being written
> 4) On client 1, interrupted out of the writing process (which is failing
> with ESTALEs) and type sync
>
Here is an update patch to this problem. My original patch does
avoid the infinite loop but didn't address the actual cause of the loop.
The attached patch does... and here is what is happening:
A process is continuity writing to a broken (i.e ESTALE) fd which
is queuing up pages to be sent out.
A getattr happens (due a cache time out) which fails with ESTALE
so _nfs_revalidate_inode() removes the inode from the hash list:
if (status == -ESTALE) {
NFS_FLAGS(inode) |= NFS_INO_STALE;
if (inode != inode->i_sb->s_root->d_inode)
remove_inode_hash(inode);
}
Now when __sync_one() comes along and see the dirty pages, the inode
is added to the locked inode list, data is sync-ed out and then
__refile_inode() is called:
<>
list_add(&inode->i_list, &inode->i_sb->s_locked_inodes);
<> inode->i_state |= I_LOCK;
/* write out data */
inode->i_state &= ~I_LOCK;
if (!(inode->i_state & I_FREEING))
__refile_inode(inode);
Now here is the problem! Since the inode is has already been removed
from the i_hash list, the inode is never refiled
__refile_inode(inode):
if (inode->i_state & I_FREEING)
return;
if (list_empty(&inode->i_hash))
return;
which causes the infinite loop because the node is never removed from
the locked
inode list. Now my original patch avoid this loop because
__nfs_revalidate_inode()
saw the inode was stale before it removed the inode from the hash list.
The attached
patch still "breaks" the inode earlier (since is stop a bunch of
unnecessary i/o) but
it also it removes the call to remove_inode_hash() in
_nfs_revalidate_inode() which
is the real cause of the problem....
So code in question is:
if (inode != inode->i_sb->s_root->d_inode)
remove_inode_hash(inode);
and I hoping someone can shed some light on as to why the
inode is being removed from the i_hash list with an ESTALE failure.
Does it make sense to remove an inode from the i_hash when there
are dirty pages?
steved.
[-- Attachment #2: linux-2.4.29-nfs-syncloop.patch --]
[-- Type: text/x-patch, Size: 1366 bytes --]
--- linux-2.4.29/fs/nfs/write.c.orig 2004-04-14 09:05:40.000000000 -0400
+++ linux-2.4.29/fs/nfs/write.c 2005-01-31 10:51:45.979056000 -0500
@@ -1073,6 +1073,9 @@ nfs_writeback_done(struct rpc_task *task
SetPageError(page);
if (req->wb_file)
req->wb_file->f_error = task->tk_status;
+ if (task->tk_status == -ESTALE)
+ NFS_FLAGS(inode) |= NFS_INO_STALE;
+
nfs_inode_remove_request(req);
dprintk(", error = %d\n", task->tk_status);
goto next;
@@ -1223,6 +1226,9 @@ nfs_commit_done(struct rpc_task *task)
if (task->tk_status < 0) {
if (req->wb_file)
req->wb_file->f_error = task->tk_status;
+ if (task->tk_status == -ESTALE)
+ NFS_FLAGS(inode) |= NFS_INO_STALE;
+
nfs_inode_remove_request(req);
dprintk(", error = %d\n", task->tk_status);
goto next;
--- linux-2.4.29/fs/nfs/inode.c.orig 2004-04-14 09:05:40.000000000 -0400
+++ linux-2.4.29/fs/nfs/inode.c 2005-01-31 11:02:13.492190000 -0500
@@ -907,11 +907,9 @@ __nfs_revalidate_inode(struct nfs_server
if (status) {
dfprintk(PAGECACHE, "nfs_revalidate_inode: (%x/%Ld) getattr failed, error=%d\n",
inode->i_dev, (long long)NFS_FILEID(inode), status);
- if (status == -ESTALE) {
+ if (status == -ESTALE)
NFS_FLAGS(inode) |= NFS_INO_STALE;
- if (inode != inode->i_sb->s_root->d_inode)
- remove_inode_hash(inode);
- }
+
goto out;
}
prev parent reply other threads:[~2005-01-31 17:42 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-12-13 20:23 [PATCH] NFS - Fix for Infinite loop during syncing Steve Dickson
2004-12-13 20:45 ` Trond Myklebust
2004-12-13 21:20 ` Steve Dickson
2004-12-14 0:14 ` Steve Dickson
2005-01-31 17:44 ` Steve Dickson [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=41FE6E86.8060303@RedHat.com \
--to=steved@redhat.com \
--cc=nfs@lists.sourceforge.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.