All of lore.kernel.org
 help / color / mirror / Atom feed
From: Steve Dickson <SteveD@redhat.com>
To: nfs@lists.sourceforge.net
Subject: Re: [PATCH] NFS - Fix for Infinite loop during syncing
Date: Mon, 31 Jan 2005 12:44:38 -0500	[thread overview]
Message-ID: <41FE6E86.8060303@RedHat.com> (raw)
In-Reply-To: <41BDFA46.7070403@RedHat.com>

[-- Attachment #1: Type: text/plain, Size: 2474 bytes --]

Steve Dickson wrote:

>
> It was brought to my attention that following series of events
> would cause an infinite loop in the 2.4 nfs kernels.
>
> 1) Mount the fileystem with acregmin=1,acregmax=1 from two clients.
> 2) On client 1, create a process that continuously writes to a file.
> 3) On client 2, remove that file that is being written
> 4) On client 1, interrupted out of the writing process (which is failing
>     with ESTALEs) and type sync
>
Here is an update patch to this problem. My original patch does
avoid the infinite loop but didn't address the actual cause of the loop.
The attached patch does... and here is what is happening:

A process is continuity writing to a broken (i.e ESTALE) fd which
is queuing up pages to be sent out.

A getattr happens (due a cache time out) which fails with ESTALE
so _nfs_revalidate_inode() removes the inode from the hash list:

    if (status == -ESTALE) {
        NFS_FLAGS(inode) |= NFS_INO_STALE;
        if (inode != inode->i_sb->s_root->d_inode)
            remove_inode_hash(inode);
    }

Now when __sync_one() comes along and see the dirty pages, the inode
is added to the locked inode list,  data is sync-ed out and then
__refile_inode() is called:
<>
    list_add(&inode->i_list, &inode->i_sb->s_locked_inodes);
<>    inode->i_state |= I_LOCK;
    /* write out data */
    inode->i_state &= ~I_LOCK;
    if (!(inode->i_state & I_FREEING))
        __refile_inode(inode);

Now here is the problem! Since the inode is has already been removed
from the i_hash list, the inode is never refiled 
__refile_inode(inode):

    if (inode->i_state & I_FREEING)
        return;
    if (list_empty(&inode->i_hash))
        return;

which causes the infinite loop because the node is never removed from 
the locked
inode list. Now my original patch avoid this loop because 
__nfs_revalidate_inode()
saw the inode was stale before it removed the inode from the hash list. 
The attached
patch still "breaks" the inode earlier (since is stop a bunch of 
unnecessary i/o)  but
it also it removes the call to remove_inode_hash() in 
_nfs_revalidate_inode() which
is the real cause of the problem....

So code in question is:
    if (inode != inode->i_sb->s_root->d_inode)
        remove_inode_hash(inode);

and I hoping someone can shed some light on as to why the
inode is being removed from the i_hash list with an ESTALE failure.
Does it make sense to remove an inode from the i_hash when there
are dirty pages?

steved.



[-- Attachment #2: linux-2.4.29-nfs-syncloop.patch --]
[-- Type: text/x-patch, Size: 1366 bytes --]

--- linux-2.4.29/fs/nfs/write.c.orig	2004-04-14 09:05:40.000000000 -0400
+++ linux-2.4.29/fs/nfs/write.c	2005-01-31 10:51:45.979056000 -0500
@@ -1073,6 +1073,9 @@ nfs_writeback_done(struct rpc_task *task
 			SetPageError(page);
 			if (req->wb_file)
 				req->wb_file->f_error = task->tk_status;
+			if (task->tk_status == -ESTALE)
+				NFS_FLAGS(inode) |= NFS_INO_STALE;
+
 			nfs_inode_remove_request(req);
 			dprintk(", error = %d\n", task->tk_status);
 			goto next;
@@ -1223,6 +1226,9 @@ nfs_commit_done(struct rpc_task *task)
 		if (task->tk_status < 0) {
 			if (req->wb_file)
 				req->wb_file->f_error = task->tk_status;
+			if (task->tk_status == -ESTALE)
+				NFS_FLAGS(inode) |= NFS_INO_STALE;
+
 			nfs_inode_remove_request(req);
 			dprintk(", error = %d\n", task->tk_status);
 			goto next;
--- linux-2.4.29/fs/nfs/inode.c.orig	2004-04-14 09:05:40.000000000 -0400
+++ linux-2.4.29/fs/nfs/inode.c	2005-01-31 11:02:13.492190000 -0500
@@ -907,11 +907,9 @@ __nfs_revalidate_inode(struct nfs_server
 	if (status) {
 		dfprintk(PAGECACHE, "nfs_revalidate_inode: (%x/%Ld) getattr failed, error=%d\n",
 			 inode->i_dev, (long long)NFS_FILEID(inode), status);
-		if (status == -ESTALE) {
+		if (status == -ESTALE) 
 			NFS_FLAGS(inode) |= NFS_INO_STALE;
-			if (inode != inode->i_sb->s_root->d_inode)
-				remove_inode_hash(inode);
-		}
+
 		goto out;
 	}
 

      parent reply	other threads:[~2005-01-31 17:42 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-12-13 20:23 [PATCH] NFS - Fix for Infinite loop during syncing Steve Dickson
2004-12-13 20:45 ` Trond Myklebust
2004-12-13 21:20   ` Steve Dickson
2004-12-14  0:14     ` Steve Dickson
2005-01-31 17:44 ` Steve Dickson [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=41FE6E86.8060303@RedHat.com \
    --to=steved@redhat.com \
    --cc=nfs@lists.sourceforge.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.