All of lore.kernel.org
 help / color / mirror / Atom feed
* Odd NFS hung mounts - stale mounts?
@ 2003-05-12 16:28 Heflin, Roger A.
  2003-05-12 16:59 ` James Pearson
  0 siblings, 1 reply; 3+ messages in thread
From: Heflin, Roger A. @ 2003-05-12 16:28 UTC (permalink / raw)
  To: nfs; +Cc: Weathers, Norman R., Rivera, Angel R, Wardrop, Mark A.,
	Glover, D W

Basic problem:
stale nfs file handles.

Conclusion:

It looks like when the automounter umounts and if the server does not =
register
a "rpc.mountd: authenticated unmount request from" we get into this =
situation,
at least on a unused file systems.  I am not exactly sure what is =
happening
on the used filesystems.  This is on a high traffic setup with lots of =
mounts
and umounts and many many nodes, so given the high volume of =
mount/umounts
I would expect some requests to be dropped.

It looks like when a umount is being done and the server is down or does =
not
confirm the umount that the client does not retry the umount and this=20
situation occurs, the situation is explained below.

Does the above seem plausable?


More information:

Basic information, client is 2.4.21pre4 NFSALL (and 2.4.19 NFSALL), =
nfsutils
1.0.1-1.

When doing a df command we get this message in the messages file:

nfs_statfs: statfs error =3D 116

And the df looks like:

hostname:/usr/applinux    0    1    0   0% /tmpmnt/usr/applinux

Doing a umount /tmpmnt/usr/applinux fixes the problem (automounter =
remounts
it correctly).  I have had the problem happen with both automounter and =
fstab=20
mounted file systems, and I have had it happen with a Solaris 8 machine =
as=20
the server, so that argues to me that this is a client problem and not a =

server problem. I have had it happen on  both 2.4.19 NFSALL and =
2.4.21pre4=20
NFSALL clients.

The problem seems to happen without the server or obvious network issues =
going
on, though the problem also happens if the server reboots.   The server =
in
this case would be 2.4.19 NFSALL, and the mount entry is:

hostname:/usr/applinux /tmpmnt/usr/applinux nfs rw,v3,
rsize=3D8192,wsize=3D8192,hard,intr,udp,lock,addr=3Dhostname 0 0

It seems to happen quite a lot if the server reboots (a few out of a lot
of nodes have the issue), with a umount being required to fix it.  It =
does
not happen on all nodes (that we can tell, but it may happen on all =
nodes
that try to umount the down filesystem), just on some of the nodes. =20
It also will happen with the  client and server both up and ok without =
any
warning and without the server  rebooting or having anything funny done =
on it,=20
and it will only affect some  (1 usually) node.  I have had it do this =
while
the filesystems is being actively used (process have the fs open), in =
this case
the processes have to be killed and  then I umount the filesystems.

It looks like a client problem of some sort, the network should be =
relatively=20
clean.



							Roger


-------------------------------------------------------
Enterprise Linux Forum Conference & Expo, June 4-6, 2003, Santa Clara
The only event dedicated to issues related to Linux enterprise solutions
www.enterpriselinuxforum.com

_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 3+ messages in thread
* RE: Odd NFS hung mounts - stale mounts?
@ 2003-05-12 17:20 Heflin, Roger A.
  0 siblings, 0 replies; 3+ messages in thread
From: Heflin, Roger A. @ 2003-05-12 17:20 UTC (permalink / raw)
  To: James Pearson; +Cc: nfs

This is a completly different problem, and this acutally causes
serious issues.  It is a matter of when the client does not get
that message back, then the client actually breaks, and needs
external intervention.

			Roger

> -----Original Message-----
> From:	James Pearson [SMTP:james-p@moving-picture.com]
> Sent:	Monday, May 12, 2003 11:59 AM
> To:	Heflin, Roger A.
> Cc:	nfs@lists.sourceforge.net; Weathers, Norman R.; Rivera, Angel R; =
Wardrop, Mark A.; Glover, D W
> Subject:	Re: [NFS] Odd NFS hung mounts - stale mounts?
>=20
> I recently sent a reply to another posting about a similar subject -
> see:
>=20
> http://marc.theaimsgroup.com/?l=3Dlinux-nfs&m=3D105232522611536&w=3D2
>=20
> My understanding is that it is not important that the server's
> rpc.mountd receives an umount request or not, more importantly, the
> server needs to know about existing client mounts when it reboots -
> which can get 'removed' in certain circumstances.
>=20
> James Pearson
>=20
> "Heflin, Roger A." wrote:
> >=20
> > Basic problem:
> > stale nfs file handles.
> >=20
> > Conclusion:
> >=20
> > It looks like when the automounter umounts and if the server does =
not register
> > a "rpc.mountd: authenticated unmount request from" we get into this =
situation,
> > at least on a unused file systems.  I am not exactly sure what is =
happening
> > on the used filesystems.  This is on a high traffic setup with lots =
of mounts
> > and umounts and many many nodes, so given the high volume of =
mount/umounts
> > I would expect some requests to be dropped.
> >=20
> > It looks like when a umount is being done and the server is down or =
does not
> > confirm the umount that the client does not retry the umount and =
this
> > situation occurs, the situation is explained below.
> >=20
> > Does the above seem plausable?
> >=20
> > More information:
> >=20
> > Basic information, client is 2.4.21pre4 NFSALL (and 2.4.19 NFSALL), =
nfsutils
> > 1.0.1-1.
> >=20
> > When doing a df command we get this message in the messages file:
> >=20
> > nfs_statfs: statfs error =3D 116
> >=20
> > And the df looks like:
> >=20
> > hostname:/usr/applinux    0    1    0   0% /tmpmnt/usr/applinux
> >=20
> > Doing a umount /tmpmnt/usr/applinux fixes the problem (automounter =
remounts
> > it correctly).  I have had the problem happen with both automounter =
and fstab
> > mounted file systems, and I have had it happen with a Solaris 8 =
machine as
> > the server, so that argues to me that this is a client problem and =
not a
> > server problem. I have had it happen on  both 2.4.19 NFSALL and =
2.4.21pre4
> > NFSALL clients.
> >=20
> > The problem seems to happen without the server or obvious network =
issues going
> > on, though the problem also happens if the server reboots.   The =
server in
> > this case would be 2.4.19 NFSALL, and the mount entry is:
> >=20
> > hostname:/usr/applinux /tmpmnt/usr/applinux nfs rw,v3,
> > rsize=3D8192,wsize=3D8192,hard,intr,udp,lock,addr=3Dhostname 0 0
> >=20
> > It seems to happen quite a lot if the server reboots (a few out of a =
lot
> > of nodes have the issue), with a umount being required to fix it.  =
It does
> > not happen on all nodes (that we can tell, but it may happen on all =
nodes
> > that try to umount the down filesystem), just on some of the nodes.
> > It also will happen with the  client and server both up and ok =
without any
> > warning and without the server  rebooting or having anything funny =
done on it,
> > and it will only affect some  (1 usually) node.  I have had it do =
this while
> > the filesystems is being actively used (process have the fs open), =
in this case
> > the processes have to be killed and  then I umount the filesystems.
> >=20
> > It looks like a client problem of some sort, the network should be =
relatively
> > clean.
> >=20
> >                                                         Roger
> >=20
> > -------------------------------------------------------
> > Enterprise Linux Forum Conference & Expo, June 4-6, 2003, Santa =
Clara>=20
> > The only event dedicated to issues related to Linux enterprise =
solutions
> > www.enterpriselinuxforum.com
> >=20
> > _______________________________________________
> > NFS maillist  -  NFS@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nfs


-------------------------------------------------------
Enterprise Linux Forum Conference & Expo, June 4-6, 2003, Santa Clara
The only event dedicated to issues related to Linux enterprise solutions
www.enterpriselinuxforum.com

_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2003-05-12 17:20 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-05-12 16:28 Odd NFS hung mounts - stale mounts? Heflin, Roger A.
2003-05-12 16:59 ` James Pearson
  -- strict thread matches above, loose matches on Subject: below --
2003-05-12 17:20 Heflin, Roger A.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.