From: Simon Kirby <sim@hostway.ca>
To: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: linux-nfs@vger.kernel.org
Subject: Re: NFS client/sunrpc getting stuck on 2.6.36
Date: Sat, 20 Nov 2010 22:43:01 -0800
Message-ID: <20101121064301.GB3285@hostway.ca>
In-Reply-To: <1290208645.3135.88.camel@heimdal.trondhjem.org>
On Fri, Nov 19, 2010 at 06:17:25PM -0500, Trond Myklebust wrote:
> On Fri, 2010-11-19 at 14:58 -0800, Simon Kirby wrote:
> > On Fri, Nov 19, 2010 at 05:17:19PM -0500, Trond Myklebust wrote:
> > > So what were all the
> > >
> > > 'lockd: server 10.10.52.xxx not responding, still trying'
> > >
> > > messages all about? There were quite a few of them for a number of
> > > different servers in the moments leading up to the hang. Could it be a
> > > problem with the switch these clients are attached to?
> >
> > If it were a switch problem, would we see port 2049 socket backlogs with
> > netstat -tan or ss -tan? I haven't seen this at all when the problem
> > occurs. All of the sockets are idle (and usually it seems to close them
> > all except the one server that all of the slots are stuck on). tcpdump
> > shows no problems, just very slow request rates that match the rpc/nfs
> > debugging.
>
> No retransmits that might indicate dropped packets at the switch? How
> fast are the tcp ACKs from the server being returned?

That tcpdump I sent included the ACKs, which all looked normal.
Unfortunately, we haven't seen the problem again yet. Is your "Fix an
infinite loop in call_refresh/call_refreshresult" patch possibly related?

> > If the rpc slots are stuck full, would that cause lockd to print those
> > timeouts?
>
> Yes. That would be the only kind of event that would trigger these
> messages.

And in this case, rpcinfo -t and -u should look normal, I would assume,
unless there is a switch/network issue?

Still waiting for it to occur again to try those commands.
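
A minimal sketch of the two checks discussed above, in case it helps when
the hang recurs. It assumes the usual "State Recv-Q Send-Q Local Peer"
column order of ss -tan output; the function names, and the choice of
nlockmgr as the program to probe, are illustrative assumptions and not
anything taken from this thread.

```python
NFS_PORT = "2049"


def backlogged_nfs_sockets(ss_output, port=NFS_PORT):
    """Return (state, recv_q, send_q, peer) for sockets whose peer port is
    `port` and whose Recv-Q or Send-Q is non-zero.

    Assumes the column order: State Recv-Q Send-Q Local:Port Peer:Port
    """
    hits = []
    for line in ss_output.splitlines():
        fields = line.split()
        # Skip the header and anything that does not look like a socket row.
        if len(fields) < 5 or not (fields[1].isdigit() and fields[2].isdigit()):
            continue
        state, recv_q, send_q, peer = fields[0], int(fields[1]), int(fields[2]), fields[4]
        # rsplit keeps this working for bracketed IPv6 peers such as [::1]:2049.
        if peer.rsplit(":", 1)[-1] == port and (recv_q or send_q):
            hits.append((state, recv_q, send_q, peer))
    return hits


def rpcinfo_probes(server, program="nlockmgr"):
    """Build the rpcinfo command lines that probe `program` on `server`
    over TCP (-t) and UDP (-u). The commands are only built, not run."""
    return [["rpcinfo", flag, server, program] for flag in ("-t", "-u")]
```

Feed backlogged_nfs_sockets() the saved text of ss -tan taken during a
hang; an empty result would match the "all of the sockets are idle"
observation. The lists from rpcinfo_probes() can be passed to
subprocess.run() as they are.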
Simon-