From: Carsten Aulbert <carsten.aulbert-l1a6w7hxd2yELgA04lAiVw@public.gmane.org>
To: linux-nfs@vger.kernel.org
Cc: Henning Fehrmann
<henning.fehrmann-l1a6w7hxd2yELgA04lAiVw@public.gmane.org>,
Steffen Grunewald
<steffen.grunewald-l1a6w7hxd2yELgA04lAiVw@public.gmane.org>
Subject: Massive NFS problems on large cluster with large number of mounts
Date: Tue, 01 Jul 2008 10:19:55 +0200
Message-ID: <4869E8AB.4060905@aei.mpg.de>
Hi all (now to the right email list),
We are running a large cluster and do a lot of cross-mounting between
the nodes. To make this work we run a large number of nfsd threads (196)
and use mountd with 64 threads, in case a single node gets a massive
number of hits. All this is on Debian Etch with a recent 2.6.24
kernel, currently using autofs4 to do the automounts.
We run these two not very nice scripts:
$ cat test_mount
#!/bin/sh
n_node=1000
for i in `seq 1 $n_node`;do
# pick a random node name between n0001 and n1342
n=`echo $RANDOM%1342+10001 | bc| sed -e "s/1/n/"`
# start the per-node mount test in the background
$HOME/bin/mount.sh $n&
echo -n .
done
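$ cat mount.sh
#!/bin/sh
dir="/distributed/spray/data/EatH/S5R1"
# if the node answers a ping, pick the 50th entry of an unsorted
# listing of its data directory and checksum that file
ping -c1 -w1 $1 > /dev/null && file="/atlas/node/$1$dir/"`ls -f /atlas/node/$1$dir/ | head -n 50 | tail -n 1`
md5sum ${file}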
With that we run into several problems.
Running it produces the following in syslog:
Jul 1 07:37:19 n1312 rpc.idmapd[2309]: nfsopen: open(/var/lib/nfs/rpc_pipefs/nfs/clntaa58/idmap): Too many open files
Jul 1 07:37:19 n1312 rpc.idmapd[2309]: nfsopen: open(/var/lib/nfs/rpc_pipefs/nfs/clntaa58/idmap): Too many open files
Jul 1 07:37:19 n1312 rpc.idmapd[2309]: nfsopen: open(/var/lib/nfs/rpc_pipefs/nfs/clntaa5e/idmap): Too many open files
Jul 1 07:37:19 n1312 rpc.idmapd[2309]: nfsopen: open(/var/lib/nfs/rpc_pipefs/nfs/clntaa5e/idmap): Too many open files
Jul 1 07:37:19 n1312 rpc.idmapd[2309]: nfsopen: open(/var/lib/nfs/rpc_pipefs/nfs/clntaa9c/idmap): Too many open files
Which is not surprising to me. However, there are a few things I'm
wondering about.
(1) All our mounts use nfsvers=3, so why is rpc.idmapd involved at all?
(2) Why is this daemon growing so extremely large?
# ps aux|grep rpc.idmapd
root 2309 0.1 16.2 2037152 1326944 ? Ss Jun30 1:24 /usr/sbin/rpc.idmapd
NOTE: We are now disabling this one, but it would still be nice to
understand why there seems to be a memory leak.
(3) The script maxes out at about 340 concurrent mounts. Any idea how to
increase this number? We are already running all servers with the
insecure option, thus low ports should not be a restriction.
(4) After running this script /etc/mtab and /proc/mounts are out of
sync. Ian Kent of autofs fame suggested a broken local mount
implementation which does not lock mtab well enough. Any idea about that?
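To put a number on (4), the two mount tables can be compared directly. This is only a rough sketch: the function names mount_points and mtab_diff are made up for illustration, and it assumes that field 2 of each line is the mount point (true for both /etc/mtab and /proc/mounts) and that mktemp and comm are available.

```shell
#!/bin/sh
# Hypothetical helpers, not part of the scripts above.

# Print the sorted, de-duplicated mount points (field 2) of a mount table.
mount_points() {
    awk '{ print $2 }' "$1" | sort -u
}

# Show mount points that appear in only one of the two tables.
# comm -3 suppresses lines common to both: entries found only in the
# first file are printed unindented, entries found only in the second
# file are printed with a leading tab.
mtab_diff() {
    a=$(mktemp) && b=$(mktemp) || return 1
    mount_points "$1" > "$a"
    mount_points "$2" > "$b"
    comm -3 "$a" "$b"
    rm -f "$a" "$b"
}
```

Calling mtab_diff /etc/mtab /proc/mounts after a test run would then list the entries on which the two disagree.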
We are currently testing autofs5 and this is not giving these messages,
but we are still not using high/unprivileged ports.
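Regarding the "Too many open files" messages, one rough way to watch how many descriptors a daemon such as rpc.idmapd is holding is to count the entries under /proc/<pid>/fd. A minimal sketch, assuming Linux procfs and enough privilege to read that directory; the function name count_fds is made up for illustration:

```shell
#!/bin/sh
# Hypothetical helper, not part of the scripts above.

# Count the open file descriptors of the process with the given PID by
# listing /proc/<pid>/fd (one entry per descriptor). Prints 0 if the
# directory cannot be read.
count_fds() {
    ls "/proc/$1/fd" 2>/dev/null | wc -l
}
```

With the PID from the syslog lines above this would be count_fds 2309, to be compared against the open-files limit the daemon was started with (ulimit -n in the shell that launched it).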
TIA for any help you might give us.
Cheers
Carsten
-- 
Dr. Carsten Aulbert - Max Planck Institut für Gravitationsphysik
Callinstraße 38, 30167 Hannover, Germany
Fon: +49 511 762 17185, Fax: +49 511 762 17193
http://www.top500.org/system/9234 | http://www.top500.org/connfam/6/list/31