From mboxrd@z Thu Jan 1 00:00:00 1970 From: David Teigland Date: Wed, 1 Dec 2010 12:27:56 -0500 Subject: [Cluster-devel] Patch: making DLM more robust In-Reply-To: <4CF6140D.7060809@bull.net> References: <4CEA9ADD.2050109@bull.net> <20101122173442.GA21879@redhat.com> <4CEBD6A2.8090005@bull.net> <20101123171508.GC30147@redhat.com> <4CF52D0E.2020800@bull.net> <20101130173051.GB27123@redhat.com> <4CF6140D.7060809@bull.net> Message-ID: <20101201172756.GF26955@redhat.com> List-Id: To: cluster-devel.redhat.com MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit On Wed, Dec 01, 2010 at 10:23:25AM +0100, Menyhart Zoltan wrote: > If we cannot obtain a given resource within a limited time frame, > then it is a real error for the customer: s/he cannot mount an OCFS2 > volume, cannot issue a cluster command, etc. Matter of opinion and preference I suppose. > >2. The difficulty, even with EINTR, is correctly and cleanly unwinding the > >dlm_controld state. > > Let's take this example indlm/libdlm/libdlm.c: The problem is not backing out of libdlm, it's leaving the cpg group, etc in dlm_controld (when the join itself is not even complete). It should all be possible, but I've never viewed this as a problem worth fixing given the effort required. Dave