From: David Flynn <davidf@rd.bbc.co.uk>
To: Trond Myklebust <Trond.Myklebust@netapp.com>
Cc: David Flynn <davidf@rd.bbc.co.uk>,
linux-nfs@vger.kernel.org, Chuck Lever <chuck.lever@oracle.com>
Subject: Re: NFS4 BAD_STATEID loop (kernel 3.0.4)
Date: Thu, 27 Oct 2011 22:17:42 +0000 [thread overview]
Message-ID: <20111027221742.GI32587@rd.bbc.co.uk> (raw)
In-Reply-To: <1319470302.2734.4.camel@lade.trondhjem.org>
* Trond Myklebust (Trond.Myklebust@netapp.com) wrote:
> Do you have an example of the stateid argument's value? Does it change
> at all between separate WRITE attempts?
Further to all this, i've just had a similar fault on another machine,
producing a huge amounts of:
[463795.630702] nfs4_reclaim_open_state: Lock reclaim failed!
[463795.637446] nfs4_reclaim_open_state: Lock reclaim failed!
[463795.643113] nfs4_reclaim_open_state: Lock reclaim failed!
A network capture is available:
ftp://ftp.kw.bbc.co.uk/davidf/priv/uekahrae.pcap
$ echo 0 | sudo tee /proc/sys/sunrpc/rpc_debug
[468024.010036] -pid- flgs status -client- --rqstp- -timeout ---ops--
[468024.010051] 6289 0801 0 ffff8801f3e37e00 (null) 0 ffffffffa0229d40 nfsv4 WRITE a:call_start q:NFS client
[468024.010057] 6290 0801 0 ffff8801f3e37e00 (null) 0 ffffffffa0229d40 nfsv4 WRITE a:call_start q:NFS client
blocked task:
[464304.799306] INFO: task rrdtool:28506 blocked for more than 120 seconds.
[464304.799309] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[464304.799311] rrdtool D 0000000000000001 0 28506 4189 0x00000000
[464304.799315] ffff880073bd5ca8 0000000000000082 ffff8804232c5408 0000000000012a40
[464304.799318] ffff880073bd5fd8 0000000000012a40 ffff880073bd4000 0000000000012a40
[464304.799320] 0000000000012a40 0000000000012a40 ffff880073bd5fd8 0000000000012a40
[464304.799322] Call Trace:
[464304.799332] [<ffffffff81110030>] ? __lock_page+0x70/0x70
[464304.799335] [<ffffffff8160007c>] io_schedule+0x8c/0xd0
[464304.799337] [<ffffffff8111003e>] sleep_on_page+0xe/0x20
[464304.799339] [<ffffffff816008ff>] __wait_on_bit+0x5f/0x90
[464304.799341] [<ffffffff81110203>] wait_on_page_bit+0x73/0x80
[464304.799345] [<ffffffff81085bf0>] ? autoremove_wake_function+0x40/0x40
[464304.799347] [<ffffffff8111c5e5>] ? pagevec_lookup_tag+0x25/0x40
[464304.799349] [<ffffffff81110436>] filemap_fdatawait_range+0xf6/0x1a0
[464304.799363] [<ffffffffa02037d0>] ? nfs_destroy_directcache+0x20/0x20 [nfs]
[464304.799365] [<ffffffff8111bae1>] ? do_writepages+0x21/0x40
[464304.799367] [<ffffffff811116bb>] ? __filemap_fdatawrite_range+0x5b/0x60
[464304.799368] [<ffffffff81111730>] filemap_write_and_wait_range+0x70/0x80
[464304.799371] [<ffffffff8119cc6a>] vfs_fsync_range+0x5a/0x90
[464304.799373] [<ffffffff8119cd0c>] vfs_fsync+0x1c/0x20
[464304.799377] [<ffffffffa01f7c74>] nfs_file_flush+0x54/0x80 [nfs]
[464304.799380] [<ffffffff8116ee7f>] filp_close+0x3f/0x90
[464304.799382] [<ffffffff8116f8a7>] sys_close+0xb7/0x120
[464304.799384] [<ffffffff8160a702>] system_call_fastpath+0x16/0x1b
Regards,
..david
next prev parent reply other threads:[~2011-10-27 22:18 UTC|newest]
Thread overview: 28+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-10-24 10:40 NFS4 BAD_STATEID loop (kernel 3.0) David Flynn
2011-10-24 11:22 ` Trond Myklebust
2011-10-24 13:17 ` David Flynn
2011-10-24 13:32 ` Trond Myklebust
2011-10-24 14:50 ` NFS4 BAD_STATEID loop (kernel 3.0.4) David Flynn
2011-10-24 15:31 ` Trond Myklebust
2011-10-24 15:55 ` David Flynn
2011-10-27 22:17 ` David Flynn [this message]
2011-10-29 0:25 ` NFS4ERR_STALE_CLIENTID loop David Flynn
2011-10-29 17:29 ` Trond Myklebust
2011-10-29 18:02 ` David Flynn
2011-10-29 18:22 ` Myklebust, Trond
2011-10-29 18:23 ` Chuck Lever
2011-10-29 18:26 ` Myklebust, Trond
2011-10-29 18:29 ` David Flynn
2011-10-29 18:15 ` J. Bruce Fields
2011-10-29 18:21 ` Myklebust, Trond
2011-10-29 18:47 ` J. Bruce Fields
2011-10-29 18:50 ` Chuck Lever
2011-10-29 19:19 ` Myklebust, Trond
2011-10-29 19:52 ` David Flynn
2011-10-29 20:42 ` Myklebust, Trond
2011-10-29 21:07 ` David Flynn
2011-10-29 21:12 ` Myklebust, Trond
2011-10-31 13:07 ` Chuck Lever
2011-10-31 13:21 ` David Flynn
2011-10-31 13:39 ` Chuck Lever
2011-10-24 13:43 ` NFS4 BAD_STATEID loop (kernel 3.0) Chuck Lever
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20111027221742.GI32587@rd.bbc.co.uk \
--to=davidf@rd.bbc.co.uk \
--cc=Trond.Myklebust@netapp.com \
--cc=chuck.lever@oracle.com \
--cc=linux-nfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.