From: jimc@math.ucla.edu (Jim Carter)
To: autofs@linux.kernel.org
Cc: Ian Kent <raven@themaw.net>
Subject: Re: clients suddenly start hanging (was: (no subject))
Date: Thu, 24 Apr 2008 09:52:47 -0700 (PDT)
Message-ID: <20080424165247.CEF7F2111B0@simba.math.ucla.edu>
In-Reply-To: <x4963u8ibhf.fsf@segfault.boston.devel.redhat.com> (Jeff Moyer "Wed, 23 Apr 2008 16:04:44 -0400")
On Wed, 23 Apr 2008 16:04:44 -0400 Jeff Moyer writes:
> jimc@math.ucla.edu (Jim Carter) writes:
> > This started immediately after we upgraded the server host from SuSE
> > 10.1 to SuSE 10.3; autofs version changed from 4.1.4 to 5.0.2.
> That's a big jump!
SuSE 10.1 is now 2 years old. We try to get 18 months of use out of
each release we put into production, and it typically takes 6 months
from when the distro is issued until we get it into full production.
> > =-- auto.net ---
> > * -rsize=8192,wsize=8192,retry=1,soft,fstype=autofs,-DSERVER=& \
> > file:/etc/auto.net.generic
> A ha! Submounts! We're currently chasing a couple of issues in this
> area.
And almost all of our automounts are in this form. Since the hanging
mode has not [yet] been seen on workstations or shared execution servers
[update: detected this morning on Koala, our Koolu, with the least
frequent automounting of all our machines due to its role as a kiosk
:-)], I'm guessing that the rate of getting messed up is proportional to
the square of the rate of automounting; in other words, a race condition
is involved: when a filesystem expires (and is unmounted) and
simultaneously a client refers to it causing automounting, something bad
happens.
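The square-law guess can be illustrated with a small calculation. This is only a sketch under stated assumptions, not a model of what autofs actually does: it treats expires and mount requests as independent Poisson processes running at the same rate, and it assumes a hypothetical "vulnerable window" of fixed length after each expire begins. The function name and the window length are made up for illustration.

```python
import math


def collision_rate(rate: float, window: float) -> float:
    """Rate (per second) of mount requests that arrive within `window`
    seconds after an expire has started.

    Assumes expires and mount requests are independent Poisson processes,
    both with `rate` events per second.
    """
    # Probability that at least one expire started in the preceding window.
    p_expire_in_window = 1.0 - math.exp(-rate * window)
    # Mount requests arrive at `rate`; each one collides with that probability.
    return rate * p_expire_in_window


if __name__ == "__main__":
    window = 1.0  # hypothetical width of the vulnerable window, in seconds
    for rate in (0.001, 0.002, 0.004):
        print(f"rate={rate:.3f}/s  collisions={collision_rate(rate, window):.3e}/s")
```

While rate times window is small, doubling the automount rate roughly quadruples the collision rate, which is the behavior a race between expire and mount would predict.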
> > =------------- Output from DEFAULT_LOGGING=debug -------
> [snip]
> Jim, I'm not sure I see anything out of the ordinary in this snippet of
> the debug log. Can you search your logs for a message that contains,
> "ask umount returned busy"? If you see that, then we're looking at the
> same problem. If you don't, well, we'll have to get more information
> from you.
Yes! These are seen on both machines that I ran tests on. They are
seen with DEFAULT_LOGGING=none -- none occurred when I had debug turned
on, though I believe that the test program was locked up and not
actually mounting anything at that time. Each one refers to the
per-host submount, not to an NFS-mounted filesystem. They are isolated
without preceding or following automount messages. They are seen both
when I was running the test program, and when I wasn't. My impression
is that the probability of having one of these messages is the same per
automount. Here are a few, happening during the test program.
debug.1:Apr 21 20:56:14 simba automount[12865]: umount_autofs_indirect: ask umount returned busy /net/nemo01
debug.1:Apr 21 22:18:26 simba automount[459]: umount_autofs_indirect: ask umount returned busy /net/naseberry
debug.1:Apr 21 22:20:08 simba automount[459]: umount_autofs_indirect: ask umount returned busy /net/bamboo33
debug.1:Apr 22 22:44:19 simba automount[3059]: umount_autofs_indirect: ask umount returned busy /net/daggett
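For anyone wanting to run the same search on their own logs, here is a minimal sketch that tallies "ask umount returned busy" messages per host and mount point. The regular expression is an assumption based only on the syslog lines quoted in this message (with or without a grep filename prefix such as "debug.1:"); other syslog formats may need a different pattern. The names BUSY_RE, tally_busy and SAMPLE are illustrative.

```python
import re
from collections import Counter

# Pattern inferred from the log lines quoted in this message.
BUSY_RE = re.compile(
    r"(?P<host>\S+) automount\[(?P<pid>\d+)\]: "
    r"(?P<func>\w+): ask umount returned busy (?P<path>\S+)"
)


def tally_busy(lines):
    """Count 'ask umount returned busy' messages per (host, mount point)."""
    counts = Counter()
    for line in lines:
        match = BUSY_RE.search(line)
        if match:
            counts[(match.group("host"), match.group("path"))] += 1
    return counts


# Log lines copied from this message, plus one unrelated line.
SAMPLE = [
    "debug.1:Apr 21 20:56:14 simba automount[12865]: umount_autofs_indirect: ask umount returned busy /net/nemo01",
    "debug.1:Apr 21 22:18:26 simba automount[459]: umount_autofs_indirect: ask umount returned busy /net/naseberry",
    "debug.1:Apr 21 22:20:08 simba automount[459]: umount_autofs_indirect: ask umount returned busy /net/bamboo33",
    "debug.1:Apr 22 22:44:19 simba automount[3059]: umount_autofs_indirect: ask umount returned busy /net/daggett",
    "Apr 20 17:51:51 serval sshd[3053]: Received signal 15; terminating.",
    "Apr 20 17:52:04 serval automount[2795]: umount_autofs_indirect: ask umount returned busy /home",
]

if __name__ == "__main__":
    for (host, path), count in sorted(tally_busy(SAMPLE).items()):
        print(host, path, count)
```

To scan a real file, pass an open file object instead of SAMPLE, for example tally_busy(open(path_to_log)) with whatever log path applies on the machine in question.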
Interesting: When I rebooted one of the machines, I got one of these
messages for the /home YP map (not involving submounts) during shutdown:
Apr 20 17:51:51 serval mountd[2843]: Caught signal 15, un-registering and exiting.
Apr 20 17:51:51 serval sshd[3053]: Received signal 15; terminating.
Apr 20 17:51:51 serval xinetd[3050]: Exiting...
Apr 20 17:52:04 serval automount[2795]: umount_autofs_indirect: ask umount returned busy /home
Apr 20 17:52:13 serval kernel: Kernel logging (proc) stopped.
etc.
On Thu, 24 Apr 2008 11:10:53 +0800 Ian Kent writes:
> I don't know if SuSE provide debuginfo packages but the thread trace is
> useless without debug info.
> The backtrace is the most effective way to identify a few known
> problems. It's really important.
I'm at work today and I'll make this happen. I think SuSE has debuginfo
packages in their archive, but if not I'll recompile autofs, setting the
-g switch in the spec file. I'll also provide the URL of the source RPM
and a list of applied patches.
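As a rough sketch of how the thread backtrace could be collected once debuginfo is installed: the helper below attaches gdb to a running automount process in batch mode and dumps a backtrace of every thread. It assumes gdb is installed and that the caller has permission to attach (normally root); that "thread apply all bt" is the trace wanted here is also an assumption. The function names are illustrative.

```python
import subprocess
from typing import List


def gdb_backtrace_cmd(pid: int) -> List[str]:
    """Build a gdb command line that attaches to `pid`, prints a
    backtrace for every thread, and then exits."""
    return ["gdb", "-p", str(pid), "-batch", "-ex", "thread apply all bt"]


def capture_backtrace(pid: int) -> str:
    """Run gdb against `pid` and return whatever it wrote to stdout.

    Requires gdb on the PATH and permission to attach to the process.
    """
    result = subprocess.run(
        gdb_backtrace_cmd(pid),
        capture_output=True,
        text=True,
        check=False,  # gdb may exit non-zero; the output is still returned
    )
    return result.stdout
```

Calling capture_backtrace with the PID of the hung automount daemon and saving the returned text would give something that can be attached to a reply on the list.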
James F. Carter Voice 310 825 2897 FAX 310 206 6673
UCLA-Mathnet; 6115 MSA; 520 Portola Plaza; Los Angeles, CA, USA 90095-1555
Email: jimc@math.ucla.edu http://www.math.ucla.edu/~jimc (q.v. for PGP key)