From mboxrd@z Thu Jan  1 00:00:00 1970
From: David Teigland <teigland@redhat.com>
Date: Wed, 1 Dec 2010 12:27:56 -0500
Subject: [Cluster-devel] Patch: making DLM more robust
In-Reply-To: <4CF6140D.7060809@bull.net>
References: <4CEA9ADD.2050109@bull.net> <20101122173442.GA21879@redhat.com>
	<4CEBD6A2.8090005@bull.net> <20101123171508.GC30147@redhat.com>
	<4CF52D0E.2020800@bull.net> <20101130173051.GB27123@redhat.com>
	<4CF6140D.7060809@bull.net>
Message-ID: <20101201172756.GF26955@redhat.com>
List-Id: <cluster-devel.redhat.com>
To: cluster-devel.redhat.com
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit

On Wed, Dec 01, 2010 at 10:23:25AM +0100, Menyhart Zoltan wrote:
> If we cannot obtain a given resource within a limited time frame,
> then it is a real error for the customer: s/he cannot mount an OCFS2
> volume, cannot issue a cluster command, etc.

That's a matter of opinion and preference, I suppose.

> >2. The difficulty, even with EINTR, is correctly and cleanly unwinding the
> >dlm_controld state.
> 
> Let's take this example in dlm/libdlm/libdlm.c:

The problem is not backing out of libdlm; it's leaving the cpg group, etc.,
in dlm_controld (when the join itself is not even complete).  It should
all be possible, but I've never viewed this as a problem worth fixing
given the effort required.

Dave