public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* NFS client problems in 2.4.18 to 2.4.20
@ 2003-09-05 20:44 Joshua Weage
  2003-09-05 21:51 ` Trond Myklebust
  0 siblings, 1 reply; 13+ messages in thread
From: Joshua Weage @ 2003-09-05 20:44 UTC (permalink / raw)
  To: linux-kernel

I hope this was not discussed previously, I couldn't find anything
relevant in the archives.

I am having problems with NFS clients getting stuck after reporting a
"nfs server not responding message".  The majority of the time the
mount starts working again when the nfs server load goes down. 
However, sometimes the mount on one client becomes completely
unresponsive, but all of the clients still work correctly.  Even after
letting it set for 2-3+ hours it still doesn't come back up.  I can
ping the server from the locked client and that works.  If I do a lazy
unmount and then remount the NFS disk it works again for awhile - but
tends to lock up again.  A standard umount doesn't work when the client
is hung.

This happens with all RedHat kernel releases 2.4.18 to 2.4.20.

I have tried tuning the NFS server by going to nfs utils 1.0.3 and by
increasing nfsd's and the socket buffer sizes.  I have also increased
the timeout on the clients to 2.0.  One thing that seems to help is to
enable async mode on the NFS server.  However, I've still seen the same
client hang with async turned on.

Machine Details:
12x Cluster nodes 2xAMD Athlon MP's, 100 MbEthernet
1x server 2xPentium III 1.13GHz, Adaptec 39160, Promise RM8000,
GigEthernet
1x Cisco 2924-T switch.

I'm running 8 CPU jobs, each cpu occasionally writes 120MB files to the
NFS disk.  The client lockup always occurs during these file writes.
The lockups have occured on several of the cluster nodes.

Any suggestions on what could be causing this?

Thanks,

Joshua Weage



=====


__________________________________
Do you Yahoo!?
Yahoo! SiteBuilder - Free, easy-to-use web site design software
http://sitebuilder.yahoo.com

^ permalink raw reply	[flat|nested] 13+ messages in thread

end of thread, other threads:[~2003-09-10 18:37 UTC | newest]

Thread overview: 13+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-09-05 20:44 NFS client problems in 2.4.18 to 2.4.20 Joshua Weage
2003-09-05 21:51 ` Trond Myklebust
2003-09-06 16:29   ` Joshua Weage
2003-09-06 17:09     ` Trond Myklebust
2003-09-06 21:22       ` Joshua Weage
2003-09-06 23:14         ` Jamie Lokier
2003-09-07  1:54           ` Trond Myklebust
2003-09-07  2:02           ` Trond Myklebust
2003-09-07 14:27             ` Jamie Lokier
2003-09-07 15:18               ` Trond Myklebust
2003-09-07 15:42                 ` Jamie Lokier
2003-09-07 16:03                   ` Trond Myklebust
2003-09-10 18:37     ` Wouter Vlothuizen

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox