All of lore.kernel.org
 help / color / mirror / Atom feed
* Two problems w/ autofs
@ 2009-02-17 23:52 Ahmon Dancy
  2009-02-18  2:30 ` Ian Kent
  0 siblings, 1 reply; 4+ messages in thread
From: Ahmon Dancy @ 2009-02-17 23:52 UTC (permalink / raw)
  To: autofs

Hello Ian, et al.

Using autofs-5.0.3-36 (built from Fedora 10 updates source RPM) on
2.6.26.6-49.fc8 kernel.  We run w/ LOGGING=debug.

We use autofs extensively for NFS multimounts and there are a few
issues that we're trying to get resolved:

Problem #1:

We us a TIMEOUT of 300 but there are often mounts that are never
expired, even if lsof shows no process is using the mount points in
question.   kill -USR1 doesn't help.

Example: /net/bb2 isn't expiring.

Relevant portion of /proc/mounts:

bb2:/ /net/bb2 nfs rw,nosuid,nodev,relatime,vers=3,rsize=32768,wsize=32768,namlen=255,hard,nointr,proto=tcp,timeo=600,retrans=2,sec=sys,mountproto=udp,addr=192.168.95.79 0 0
-hosts /net/bb2/acl autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
-hosts /net/bb2/home autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
-hosts /net/bb2/opt autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
-hosts /net/bb2/tmp autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
-hosts /net/bb2/usr autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
-hosts /net/bb2/var autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
bb2:/opt /net/bb2/opt nfs rw,nosuid,nodev,relatime,vers=3,rsize=32768,wsize=32768,namlen=255,hard,nointr,proto=tcp,timeo=600,retrans=2,sec=sys,mountproto=udp,addr=192.168.95.79 0 0

Sample log entry:

Feb 17 15:31:40 gemini automount[9397]: handle_packet_expire_indirect: token 67811, name bb2
Feb 17 15:31:40 gemini automount[9397]: expiring path /net/bb2
Feb 17 15:31:40 gemini automount[9397]: umount_multi: path /net/bb2 incl 1
Feb 17 15:31:40 gemini automount[9397]: some offset mounts still present under /net/bb2
Feb 17 15:31:40 gemini automount[9397]: couldn't complete expire of /net/bb2
Feb 17 15:31:40 gemini automount[9397]: send_fail: token = 67811

Interestingly, there's no mention of /net/bb2/opt, which it would need
to umount first.


Problem #2:

One of the multimounts is misbehaving and not lazy-mounting.  The
problem seems to stem from a failure to do a lazy umount earlier:

Log entries for the umount attempt:

Feb 17 13:32:54 gemini automount[9397]: handle_packet: type = 4
Feb 17 13:32:54 gemini automount[9397]: handle_packet_expire_indirect: token 67564, name cobweb
Feb 17 13:32:54 gemini automount[9397]: expiring path /net/cobweb
Feb 17 13:32:54 gemini automount[9397]: umount_multi: path /net/cobweb incl 1
Feb 17 13:32:54 gemini automount[9397]: umount_multi_triggers: umount offset /net/cobweb/home/ftp
Feb 17 13:32:54 gemini automount[9397]: umount_autofs_offset: offset /net/cobweb/home/ftp not mounted
Feb 17 13:32:54 gemini automount[9397]: umount_multi_triggers: umount offset /net/cobweb/home/patches
Feb 17 13:32:54 gemini automount[9397]: umount_autofs_offset: offset /net/cobweb/home/patches not mounted
Feb 17 13:32:54 gemini automount[9397]: umount_multi_triggers: umount offset /net/cobweb/www/sites/franzdownload
Feb 17 13:32:54 gemini automount[9397]: umount_autofs_offset: offset /net/cobweb/www/sites/franzdownload not mounted
Feb 17 13:32:54 gemini automount[9397]: some offset mounts still present under /net/cobweb
Feb 17 13:32:54 gemini automount[9397]: couldn't complete expire of /net/cobweb
Feb 17 13:32:54 gemini automount[9397]: send_fail: token = 67564

After this happened, accesses to /net/cobweb/home/ftp did not result
in an automatic mount, just an empty directory.


This is a production system so I can only do so much, but I can try
reasonable tests and supply debugging output.   Unfortunately I
haven't yet found a way to explicitly cause the broken scenario to
happen.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Two problems w/ autofs
  2009-02-17 23:52 Two problems w/ autofs Ahmon Dancy
@ 2009-02-18  2:30 ` Ian Kent
  2009-02-18 16:06   ` Ahmon Dancy
  0 siblings, 1 reply; 4+ messages in thread
From: Ian Kent @ 2009-02-18  2:30 UTC (permalink / raw)
  To: Ahmon Dancy; +Cc: autofs

On Tue, 2009-02-17 at 15:52 -0800, Ahmon Dancy wrote:
> Hello Ian, et al.
> 
> Using autofs-5.0.3-36 (built from Fedora 10 updates source RPM) on
> 2.6.26.6-49.fc8 kernel.  We run w/ LOGGING=debug.

You will need update your 2.6.26 based kernel with the latest kernel
module patch before we go further with this. It can be found in the
patches directory of the 5.0.4 tarball on kernel.org.

> 
> We use autofs extensively for NFS multimounts and there are a few
> issues that we're trying to get resolved:
> 
> Problem #1:
> 
> We us a TIMEOUT of 300 but there are often mounts that are never
> expired, even if lsof shows no process is using the mount points in
> question.   kill -USR1 doesn't help.
> 
> Example: /net/bb2 isn't expiring.
> 
> Relevant portion of /proc/mounts:
> 
> bb2:/ /net/bb2 nfs rw,nosuid,nodev,relatime,vers=3,rsize=32768,wsize=32768,namlen=255,hard,nointr,proto=tcp,timeo=600,retrans=2,sec=sys,mountproto=udp,addr=192.168.95.79 0 0
> -hosts /net/bb2/acl autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
> -hosts /net/bb2/home autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
> -hosts /net/bb2/opt autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
> -hosts /net/bb2/tmp autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
> -hosts /net/bb2/usr autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
> -hosts /net/bb2/var autofs rw,relatime,fd=12,pgrp=9397,timeout=300,minproto=5,maxproto=5,offset 0 0
> bb2:/opt /net/bb2/opt nfs rw,nosuid,nodev,relatime,vers=3,rsize=32768,wsize=32768,namlen=255,hard,nointr,proto=tcp,timeo=600,retrans=2,sec=sys,mountproto=udp,addr=192.168.95.79 0 0
> 
> Sample log entry:
> 
> Feb 17 15:31:40 gemini automount[9397]: handle_packet_expire_indirect: token 67811, name bb2
> Feb 17 15:31:40 gemini automount[9397]: expiring path /net/bb2
> Feb 17 15:31:40 gemini automount[9397]: umount_multi: path /net/bb2 incl 1
> Feb 17 15:31:40 gemini automount[9397]: some offset mounts still present under /net/bb2
> Feb 17 15:31:40 gemini automount[9397]: couldn't complete expire of /net/bb2
> Feb 17 15:31:40 gemini automount[9397]: send_fail: token = 67811
> 
> Interestingly, there's no mention of /net/bb2/opt, which it would need
> to umount first.
> 
> 
> Problem #2:
> 
> One of the multimounts is misbehaving and not lazy-mounting.  The
> problem seems to stem from a failure to do a lazy umount earlier:
> 
> Log entries for the umount attempt:
> 
> Feb 17 13:32:54 gemini automount[9397]: handle_packet: type = 4
> Feb 17 13:32:54 gemini automount[9397]: handle_packet_expire_indirect: token 67564, name cobweb
> Feb 17 13:32:54 gemini automount[9397]: expiring path /net/cobweb
> Feb 17 13:32:54 gemini automount[9397]: umount_multi: path /net/cobweb incl 1
> Feb 17 13:32:54 gemini automount[9397]: umount_multi_triggers: umount offset /net/cobweb/home/ftp
> Feb 17 13:32:54 gemini automount[9397]: umount_autofs_offset: offset /net/cobweb/home/ftp not mounted
> Feb 17 13:32:54 gemini automount[9397]: umount_multi_triggers: umount offset /net/cobweb/home/patches
> Feb 17 13:32:54 gemini automount[9397]: umount_autofs_offset: offset /net/cobweb/home/patches not mounted
> Feb 17 13:32:54 gemini automount[9397]: umount_multi_triggers: umount offset /net/cobweb/www/sites/franzdownload
> Feb 17 13:32:54 gemini automount[9397]: umount_autofs_offset: offset /net/cobweb/www/sites/franzdownload not mounted
> Feb 17 13:32:54 gemini automount[9397]: some offset mounts still present under /net/cobweb
> Feb 17 13:32:54 gemini automount[9397]: couldn't complete expire of /net/cobweb
> Feb 17 13:32:54 gemini automount[9397]: send_fail: token = 67564
> 
> After this happened, accesses to /net/cobweb/home/ftp did not result
> in an automatic mount, just an empty directory.
> 
> 
> This is a production system so I can only do so much, but I can try
> reasonable tests and supply debugging output.   Unfortunately I
> haven't yet found a way to explicitly cause the broken scenario to
> happen.
> 
> _______________________________________________
> autofs mailing list
> autofs@linux.kernel.org
> http://linux.kernel.org/mailman/listinfo/autofs

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Two problems w/ autofs
  2009-02-18  2:30 ` Ian Kent
@ 2009-02-18 16:06   ` Ahmon Dancy
  2009-02-18 23:56     ` Ian Kent
  0 siblings, 1 reply; 4+ messages in thread
From: Ahmon Dancy @ 2009-02-18 16:06 UTC (permalink / raw)
  To: Ian Kent; +Cc: autofs

>> On Tue, 2009-02-17 at 15:52 -0800, Ahmon Dancy wrote:
>> > Hello Ian, et al.
>> > 
>> > Using autofs-5.0.3-36 (built from Fedora 10 updates source RPM) on
>> > 2.6.26.6-49.fc8 kernel.  We run w/ LOGGING=debug.
>> 
>> You will need update your 2.6.26 based kernel with the latest kernel
>> module patch before we go further with this. It can be found in the
>> patches directory of the 5.0.4 tarball on kernel.org.

Is there any released fedora-supplied kernel (perhaps for Fedora 9
or 10) that has the necessary goods?  If the system in question needs
to be restarted for a kernel update, it may be a good time to update
the Fedora installation as well.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Two problems w/ autofs
  2009-02-18 16:06   ` Ahmon Dancy
@ 2009-02-18 23:56     ` Ian Kent
  0 siblings, 0 replies; 4+ messages in thread
From: Ian Kent @ 2009-02-18 23:56 UTC (permalink / raw)
  To: Ahmon Dancy; +Cc: autofs

On Wed, 2009-02-18 at 08:06 -0800, Ahmon Dancy wrote:
> >> On Tue, 2009-02-17 at 15:52 -0800, Ahmon Dancy wrote:
> >> > Hello Ian, et al.
> >> > 
> >> > Using autofs-5.0.3-36 (built from Fedora 10 updates source RPM) on
> >> > 2.6.26.6-49.fc8 kernel.  We run w/ LOGGING=debug.
> >> 
> >> You will need update your 2.6.26 based kernel with the latest kernel
> >> module patch before we go further with this. It can be found in the
> >> patches directory of the 5.0.4 tarball on kernel.org.
> 
> Is there any released fedora-supplied kernel (perhaps for Fedora 9
> or 10) that has the necessary goods?  If the system in question needs
> to be restarted for a kernel update, it may be a good time to update
> the Fedora installation as well.

F-10 has everything but one fix that relates to the expiration of a
specific type of mount. You might see that problem with the hosts map.

When the F-10 kernel goes to 2.6.28 it will have everything that is
needed but that could be a while.

But I wouldn't like to update the OS during scheduled downtime without
some testing before hand and a fallback plan.

Ian

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2009-02-18 23:56 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-02-17 23:52 Two problems w/ autofs Ahmon Dancy
2009-02-18  2:30 ` Ian Kent
2009-02-18 16:06   ` Ahmon Dancy
2009-02-18 23:56     ` Ian Kent

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.