From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ian Kent Subject: Re: Autofs hangs Date: Tue, 15 Feb 2011 11:32:05 +0800 Message-ID: <1297740725.13007.11.camel@perseus> References: Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d=messagingengine.com; h=subject:from:to:cc:in-reply-to:references:content-type:date:message-id:mime-version:content-transfer-encoding; s=smtpout; bh=FEi039Uv4p0iKGlRZ6WQkcVg4Nk=; b=nisZ/iOHIk3pwEL+3cBLCZN9yhdjpAkq/h7IbHqcKauPGD+r/WdpLn/8kXUZ5W/fdyLh+JRSgwqlboL6R8pzVwp3ftRhhq5yy+otVwKnehbOxUO8A01D7vOIs1/uozBCj3s7gsbQr+Zgcp1cC20lFvAZX2iEkYKb3wS2qJu8FGc= In-Reply-To: List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: autofs-bounces@linux.kernel.org Errors-To: autofs-bounces@linux.kernel.org To: Steve Thompson Cc: autofs@linux.kernel.org On Mon, 2011-02-14 at 14:31 -0500, Steve Thompson wrote: > CentOS 5.5, 32- and 64-bit. Maps from LDAP. I upgraded recently (as part > of a system-wide yum update) from autofs-5.0.1-0.rc2.143.el5_5.4 to > autofs-5.0.1-0.rc2.143.el5_5.6, and ever since I have had a MASSIVE number > of automount hangs (several per day across 170 systems). Nothing is > logged, and the automount deamon is hung solid; only a SIGKILL and a > restart can recover the daemon (and, if I wasn't already logged in, a > power cycle is required - all accounts use automounted NFS home > directories). Reading the Changelog, I see these changes: > > - add mutex to serialize access to mount module handle in parse module > - dont check null cache on expire > - fix null cache race > - fix cache_init() on source re-read > - fix negative cache included map lookup > > I don't know which of these to point the finger at, but I have also > noticed many messages similar to: > > automount]: update_negative_cache: key "" not found in map. > > all of which are incorrect. Really need a fix for this asap - we're > almost dead in the water. I can't see any obvious problems Can you get a gdb backtrace of a couple of the hangs and post them? In particular I need the output of "thr a a bt". Ian