From mboxrd@z Thu Jan 1 00:00:00 1970 From: Christine Caulfield Date: Fri, 05 Sep 2014 09:47:00 +0100 Subject: [Cluster-devel] [PATCH] dlm: clear out addrs before calling into corosync Message-ID: <54097884.9010807@redhat.com> List-Id: To: cluster-devel.redhat.com MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit The corosync_cfg_get_node_addrs() call does not fill the whole of the addrs field passed in, specifically it only writes the the address family and IP address, leaving the port number untouched. If the port number contains junk, then that can get passed into the kernel by dlm_controld where it is subsequently used in the comparison that checks for valid cluster nodes in a connection. If this happens then an otherwise valid connection can be rejected and the dlm will hang. I've seen this quite often on s390 but I don't see any reason why it might not also be causing intermittent connection problems on other archs. Signed-Off-By: Christine Caulfield -------------- next part -------------- A non-text attachment was scrubbed... Name: dlm_members_clear_addr.patch Type: text/x-patch Size: 440 bytes Desc: not available URL: