All of lore.kernel.org
 help / color / mirror / Atom feed
* [chip@debian.org: Debian Bug#203918 - statd request on eth interface, not localhost?]
@ 2003-08-12 15:21 Chip Salzenberg
  2003-08-13  4:06 ` Neil Brown
  0 siblings, 1 reply; 15+ messages in thread
From: Chip Salzenberg @ 2003-08-12 15:21 UTC (permalink / raw)
  To: nfs; +Cc: Jordi Mallach, 203918-quiet

HEY NEIL.  HEY EVERYBODY ELSE TOO.

Could you please give me some reply to this issue?  The beginning of
the bug report looks boring but it's actually weird.  To me anyway.  :-/


----- Forwarded message from Chip Salzenberg <chip@debian.org> -----

Subject: [NFS] Debian Bug#203918 - statd request on eth interface, not localhost?
From: Chip Salzenberg <chip@debian.org>
To: nfs@lists.sourceforge.net
Cc: 
Date: Mon, 4 Aug 2003 20:48:53 -0400

OK, guys, I've got an interesting bug here.  The server kernel is
2.4.20, the client is 2.6.0 (I think test2).  The first mount attempt
succeeds, and things seem to work.  But unmount requests fail.  And
there's a statd anomaly along the way.

The server is "natura" (213.96.69.115) and the client is "nubol" (IP
unknown).

Here's the /etc/exports:
> /var/cache/apt/archives *.int.oskuro.net(rw,async,no_root_squash)
> /home *.int.oskuro.net(rw,async,no_root_squash)

And here's the intriguing server log message:

> Aug  3 18:22:42 natura rpc.statd[467]: Call to statd from non-local host 213.96.69.115
> Aug  3 18:22:42 natura rpc.statd[467]: STAT_FAIL to natura for SM_MON of 192.168.1.3
> 
> This is the first mount. If you remember, it worked ok the first time I
> apt-get'ed. 213.96.69.115 is the public ip of the nfs server.

So let's get this straight.  The server's kernel is apparently sending
an SM_MON request to its own statd, but not using the loopback address,
but rather the public ethernet address.

Can I get a "Huh?!" from the congregation?  How does that even happen?

Subsequent trouble is perhaps a result of the first problem, somehow?

> Bah, now it won't umount.
> Aug  3 18:29:58 nubol automount[457]: shutdown failed: filesystem still busy
> 
> But nothing is using it.
> 65567:jordi@nubol:~$ sudo lsof |grep autofs
> zsh: done       sudo lsof | 
> zsh: exit 1     grep autofs
> 65568:jordi@nubol:~$ sudo umount  /var/autofs/natura/archives
> umount: /var/autofs/natura/archives: device is busy
> 
> So I can't umount right now. Now, it stays mounted, and it works, but
> it'd be nice if I could get rid of it too :)
> 
> The other day, it would umount and I got the connection refused error.

adTHANKSvance for debugging info.
-- 
Chip Salzenberg               - a.k.a. -               <chip@pobox.com>
"I wanted to play hopscotch with the impenetrable mystery of existence,
    but he stepped in a wormhole and had to go in early."  // MST3K


-------------------------------------------------------
This SF.Net email sponsored by: Free pre-built ASP.NET sites including
Data Reports, E-commerce, Portals, and Forums are available now.
Download today and enter to win an XBOX or Visual Studio .NET.
http://aspnet.click-url.com/go/psa00100003ave/direct;at.aspnet_072303_01/01
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs



----- End forwarded message -----

-- 
Chip Salzenberg               - a.k.a. -               <chip@pobox.com>
"I wanted to play hopscotch with the impenetrable mystery of existence,
    but he stepped in a wormhole and had to go in early."  // MST3K


-------------------------------------------------------
This SF.Net email sponsored by: Free pre-built ASP.NET sites including
Data Reports, E-commerce, Portals, and Forums are available now.
Download today and enter to win an XBOX or Visual Studio .NET.
http://aspnet.click-url.com/go/psa00100003ave/direct;at.aspnet_072303_01/01
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 15+ messages in thread
* Re: Debian bug #165744 - 'Received erroneous SM_UNMON request'
@ 2003-09-13 20:02 Frans Pop
  2003-09-15  2:23 ` Chip Salzenberg
  0 siblings, 1 reply; 15+ messages in thread
From: Frans Pop @ 2003-09-13 20:02 UTC (permalink / raw)
  To: Chip Salzenberg, Neil Brown; +Cc: nfs

=2D----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hello Chip, Neil,

I think I may have found the source of this bug.
In Debian, rpc.statd is compiled with --enable-secure-statd; when I recompi=
le=20
with this option diabled, the error messages disapear!

I tried this after I took a look at the source for rpc.statd. I noticed tha=
t=20
in monitor.c there is the statement
   my_name =3D "127.0.0.1"
when servicing SM_MON requests with secure-statd enabled.
I think in my setup this causes the lookups in the run-time list (which use=
=20
my_name) to fail when servicing SM_UNMON requests if in my situation my_nam=
e=20
is equal to the real adresses of my boxes, causing the 'Received erroneous=
=20
SM_UNMON request' errors.

I think this would also explain the 'notify_host: failed to notify 127.0.0.=
1'=20
errors. This second error occurs when I reboot the server while a=20
NFS-connection was up: after restarting the server tries to notify the clie=
nt=20
it is back up, but can't because the client is not at localhost but at=20
10.19.66.21 and there is no NFS-client running on the server itself.

I think the reason the error appears in my setup could be because I have NF=
S=20
kernel-support compiled in both server and clients. In Debian, this means=20
rpc.lockd is not run from the init scripts. (In /etc/init.d/nfs-common 'gre=
p=20
=2D -q lockdctl /proc/ksyms' returns 1, so NEED_LOCKD is set to 'no'.)

I have (from ps alx):
  F   UID   PID  ... WCHAN  STAT ... COMMAND
140     1   129      poll   S        /sbin/portmap
040     0   135      rpciod SW       [rpciod]
040     0   136      svc_re SW       [lockd]
140     0   214      select S        /sbin/rpc.statd

I can easily reproduce the errors by restoring the original binary of=20
rpc.statd. Also, the error occurs on all - well both - my nfs-clients. The=
=20
error occurs very frequently when I am running with the original binaries.

Hope this information helps. I am willing to help any way I can to solve th=
is=20
bug.

Regards,

=46rans Pop

P.S. I already send a similar message to 165744@bugs.debian.org (CC Neil) o=
n=20
August 29, but as there was no reply to date I am trying again. I would=20
appreciate a reply.
=2D----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.1 (GNU/Linux)

iD8DBQE/Y3fkgm/Kwh6ICoQRApboAJ4gqgBY/KRL3WZg6QNu+jnni/5xwQCfTc6z
6/R4stFNt+a8EnlXjOp3NI8=3D
=3D/CN2
=2D----END PGP SIGNATURE-----



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 15+ messages in thread

end of thread, other threads:[~2003-09-22  3:24 UTC | newest]

Thread overview: 15+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-08-12 15:21 [chip@debian.org: Debian Bug#203918 - statd request on eth interface, not localhost?] Chip Salzenberg
2003-08-13  4:06 ` Neil Brown
2003-08-13  8:22   ` Jordi Mallach
2003-08-20 17:59     ` Chip Salzenberg
2003-08-21  7:58       ` Jordi Mallach
2003-08-21 14:48         ` Chip Salzenberg
2003-08-22 19:38         ` Jordi Mallach
2003-08-20 18:06   ` Debian bug #165744 - 'Received erroneous SM_UNMON request' Chip Salzenberg
2003-08-22  6:26     ` Neil Brown
2003-08-22 15:10       ` Chip Salzenberg
2003-08-25  6:37         ` Neil Brown
  -- strict thread matches above, loose matches on Subject: below --
2003-09-13 20:02 Frans Pop
2003-09-15  2:23 ` Chip Salzenberg
2003-09-15 16:46   ` Frans Pop
2003-09-22  3:24   ` Neil Brown

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.