All of lore.kernel.org
 help / color / mirror / Atom feed
From: Trond Myklebust <trond.myklebust@fys.uio.no>
To: Filipe Brandenburger <branden@terra.com.br>
Cc: nfs@lists.sourceforge.net
Subject: Re: Processes hanging, directory hanging
Date: Fri, 04 Aug 2006 11:09:52 -0400	[thread overview]
Message-ID: <1154704193.4727.11.camel@localhost> (raw)
In-Reply-To: <20060804113838.26ebeed2@sup-ceu.wrk.terra.com.br>

On Fri, 2006-08-04 at 11:38 -0300, Filipe Brandenburger wrote:
> Hello,
> 
> Please, anybody has any hints on this? I'm still with this problem, and
> I still don't have any clues about what to do next...
> 
> Or should I try to post this on other list, like a kernel list? It
> seems to be that the problem is related to the NFS client, but I can't
> be 100% sure of that...

So you upgraded the server, and the clients started to hang. What makes
you think this is a client problem?

Have you tried comparing 'nfsstat' output on the client and server to
see if the server is processing the client requests. A tcpdump to see if
the client is receiving server replies would be useful too.

Also, check what software you upgraded on the server. If it was samba,
and you have oplock support enabled, then the problem could be related
to leases (IIRC there were a few kernel bugs w.r.t. leases that had to
be fixed recently).

Cheers,
 Trond

> Thanks a lot,
> Filipe Brandenburger
> 
> 
> On Tue, 1 Aug 2006 10:30:59 -0300, Filipe Brandenburger
> <branden@terra.com.br> wrote:
> > I'm facing a rather strange situation on a host of mine. I recently
> > upgraded one server software, and after a week running, several
> > processes hang, and including some directories hang.
> > 
> > The processes hang in "D" (disk wait) state. That way, I cannot strace
> > or gdb them to know what they were doing or where they were.
> > 
> > But the strangest thing are directories. Some directories in NFS start
> > to hang, in some way that if I try to "cd" to them or "ls" them
> > (sometimes even TAB complete hangs them) the process hangs, stays in
> > "disk wait" state, and there's no way I can get it back. If I try to
> > strace a process that changes directory to some of these hanged
> > directories, it goes up to the "getent32" and hangs.
> > 
> > I'm using RHEL4, but I tried to upgrade the kernel to the latest
> > release, and the problem happens as well on the latest kernel (which
> > at the time I upgraded was 2.6.17.6).
> > 
> > So I ask:
> > 
> > 1) Do you know of some bug currently unsolved that could cause this?
> > 
> > 2) It seems to me that the problem is in the kernel, but somehow it's
> > being induced by the new version of the application... What could the
> > application be doing wrong to cause such a problem?
> > 
> > 3) How could I try to see what's happening? Since strace and gdb
> > (which are the tools I know) don't work anymore, I couldn't find
> > anything to try to debug the problem... Should I try to dump something
> > from the kernel? Where exactly should I look?
> > 
> > Thanks in advance,
> > Filipe Brandenburger
> 
> -------------------------------------------------------------------------
> Take Surveys. Earn Cash. Influence the Future of IT
> Join SourceForge.net's Techsay panel and you'll get the chance to share your
> opinions on IT & business topics through brief surveys -- and earn cash
> http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
> _______________________________________________
> NFS maillist  -  NFS@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nfs


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

  reply	other threads:[~2006-08-04 15:10 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-08-01 13:30 Processes hanging, directory hanging Filipe Brandenburger
2006-08-04 14:38 ` Filipe Brandenburger
2006-08-04 15:09   ` Trond Myklebust [this message]
2006-08-04 16:51     ` Filipe Brandenburger
2006-08-04 22:22       ` Filipe Brandenburger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1154704193.4727.11.camel@localhost \
    --to=trond.myklebust@fys.uio.no \
    --cc=branden@terra.com.br \
    --cc=nfs@lists.sourceforge.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.