* [Cluster-devel] [PATCH] [DLM] Bind connections from known local address when using TCP
@ 2007-10-25 22:49 Lon Hohberger
2007-10-25 22:51 ` Lon Hohberger
` (2 more replies)
0 siblings, 3 replies; 7+ messages in thread
From: Lon Hohberger @ 2007-10-25 22:49 UTC (permalink / raw)
To: cluster-devel.redhat.com
A common problem occurs when multiple IP addresses within the same
subnet are assigned to the same NIC. If we make a connection attempt to
another address on the same subnet as one of those addresses, the
connection attempt will not necessarily be routed from the address we
want.
In the case of the DLM, the other nodes will quickly drop the connection
attempt, causing problems.
This patch makes the DLM bind to the local address it acquired from the
cluster manager when using TCP prior to making a connection, obviating
the need for administrators to "fix" their systems or use clever routing
tricks.
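
The bind-before-connect idea described above can be sketched in user space. This is only a minimal Python illustration under stated assumptions, not the kernel patch itself (the attachment was scrubbed from the archive): the helper name `connect_from` and its `timeout` parameter are hypothetical, and IPv4 is assumed.

```python
import socket


def connect_from(local_addr: str, remote_addr: str, remote_port: int,
                 timeout: float = 5.0) -> socket.socket:
    """Open a TCP connection whose source address is pinned to local_addr.

    Hypothetical helper for illustration only; it mirrors the
    "bind, then connect" order described in the patch summary.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.settimeout(timeout)
        # Port 0 lets the kernel pick an ephemeral source port while the
        # source IP stays fixed to the address we chose.
        sock.bind((local_addr, 0))
        sock.connect((remote_addr, remote_port))
    except OSError:
        # Do not leak the socket if bind() or connect() fails.
        sock.close()
        raise
    return sock
```

For example, `connect_from("192.0.2.5", "192.0.2.7", 12345)` would ask the kernel to use 192.0.2.5 as the source address regardless of which address route selection would otherwise have picked; all three values are placeholders.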
-------------- next part --------------
A non-text attachment was scrubbed...
Name: dlm-lowcomms-tcp-src-upstream.patch
Type: text/x-patch
Size: 1099 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/cluster-devel/attachments/20071025/cc427c3d/attachment.bin>
^ permalink raw reply [flat|nested] 7+ messages in thread

* [Cluster-devel] [PATCH] [DLM] Bind connections from known local address when using TCP
2007-10-25 22:49 [Cluster-devel] [PATCH] [DLM] Bind connections from known local address when using TCP Lon Hohberger
@ 2007-10-25 22:51 ` Lon Hohberger
2007-10-25 22:51 ` Lon Hohberger
2007-10-29 9:22 ` [Cluster-devel] " Steven Whitehouse
2 siblings, 0 replies; 7+ messages in thread
From: Lon Hohberger @ 2007-10-25 22:51 UTC (permalink / raw)
To: cluster-devel.redhat.com

On Thu, 2007-10-25 at 18:49 -0400, Lon Hohberger wrote:
> A common problem occurs when multiple IP addresses within the same
> subnet are assigned to the same NIC. If we make a connection attempt to
> another address on the same subnet as one of those addresses, the
> connection attempt will not necessarily be routed from the address we
> want.
>
> In the case of the DLM, the other nodes will quickly drop the connection
> attempt, causing problems.
>
> This patch makes the DLM bind to the local address it acquired from the
> cluster manager when using TCP prior to making a connection, obviating
> the need for administrators to "fix" their systems or use clever routing
> tricks.
>

^ permalink raw reply [flat|nested] 7+ messages in thread
* [Cluster-devel] [PATCH] [DLM] Bind connections from known local address when using TCP
2007-10-25 22:49 [Cluster-devel] [PATCH] [DLM] Bind connections from known local address when using TCP Lon Hohberger
2007-10-25 22:51 ` Lon Hohberger
@ 2007-10-25 22:51 ` Lon Hohberger
2007-10-29 9:22 ` [Cluster-devel] " Steven Whitehouse
2 siblings, 0 replies; 7+ messages in thread
From: Lon Hohberger @ 2007-10-25 22:51 UTC (permalink / raw)
To: cluster-devel.redhat.com

Whoops; sorry for the spam there. Patch with fixed comments.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: dlm-lowcomms-tcp-src-upstream.patch
Type: text/x-patch
Size: 1064 bytes
Desc: not available
URL: <http://listman.redhat.com/archives/cluster-devel/attachments/20071025/eb7e9203/attachment.bin>

^ permalink raw reply [flat|nested] 7+ messages in thread
* [Cluster-devel] Re: [PATCH] [DLM] Bind connections from known local address when using TCP
2007-10-25 22:49 [Cluster-devel] [PATCH] [DLM] Bind connections from known local address when using TCP Lon Hohberger
2007-10-25 22:51 ` Lon Hohberger
2007-10-25 22:51 ` Lon Hohberger
@ 2007-10-29 9:22 ` Steven Whitehouse
2007-10-30 14:02 ` Patrick Caulfield
2 siblings, 1 reply; 7+ messages in thread
From: Steven Whitehouse @ 2007-10-29 9:22 UTC (permalink / raw)
To: cluster-devel.redhat.com

Hi,

On Thu, 2007-10-25 at 18:49 -0400, Lon Hohberger wrote:
> A common problem occurs when multiple IP addresses within the same
> subnet are assigned to the same NIC. If we make a connection attempt to
> another address on the same subnet as one of those addresses, the
> connection attempt will not necessarily be routed from the address we
> want.
>
> In the case of the DLM, the other nodes will quickly drop the connection
> attempt, causing problems.
>
> This patch makes the DLM bind to the local address it acquired from the
> cluster manager when using TCP prior to making a connection, obviating
> the need for administrators to "fix" their systems or use clever routing
> tricks.
>

I'm happy to apply the patch, but I wonder why not just set the prefsrc
for the route? That will fix it for all applications, not just the DLM.
Also, if there are multiple NICs in the system with the same IP address
set assigned to them, then this solution may fail, whereas the prefsrc
solution will continue to work, as that allows specification of a device
too (among other things),

Steve.

^ permalink raw reply [flat|nested] 7+ messages in thread
* [Cluster-devel] Re: [PATCH] [DLM] Bind connections from known local address when using TCP
2007-10-29 9:22 ` [Cluster-devel] " Steven Whitehouse
@ 2007-10-30 14:02 ` Patrick Caulfield
2007-10-31 13:33 ` Steven Whitehouse
0 siblings, 1 reply; 7+ messages in thread
From: Patrick Caulfield @ 2007-10-30 14:02 UTC (permalink / raw)
To: cluster-devel.redhat.com

Steven Whitehouse wrote:
> Hi,
>
> On Thu, 2007-10-25 at 18:49 -0400, Lon Hohberger wrote:
>> A common problem occurs when multiple IP addresses within the same
>> subnet are assigned to the same NIC. If we make a connection attempt to
>> another address on the same subnet as one of those addresses, the
>> connection attempt will not necessarily be routed from the address we
>> want.
>>
>> In the case of the DLM, the other nodes will quickly drop the connection
>> attempt, causing problems.
>>
>> This patch makes the DLM bind to the local address it acquired from the
>> cluster manager when using TCP prior to making a connection, obviating
>> the need for administrators to "fix" their systems or use clever routing
>> tricks.
>>
>
> I'm happy to apply the patch, but I wonder why not just set the prefsrc
> for the route? That will fix it for all applications not just the DLM.
> Also if there are multiple NICs in the system with the same IP address
> set assigned to them, then this solution may fail whereas the prefsrc
> solution will continue to work as that allows specification of a device
> too (among other things),

I think it's appropriate to fix this in the DLM code. DLM doesn't work without
it in this situation and there might (possibly!?) be applications that don't
want prefsrc setting.

--
Patrick

^ permalink raw reply [flat|nested] 7+ messages in thread
* [Cluster-devel] Re: [PATCH] [DLM] Bind connections from known local address when using TCP
2007-10-30 14:02 ` Patrick Caulfield
@ 2007-10-31 13:33 ` Steven Whitehouse
2007-10-31 13:54 ` Patrick Caulfield
0 siblings, 1 reply; 7+ messages in thread
From: Steven Whitehouse @ 2007-10-31 13:33 UTC (permalink / raw)
To: cluster-devel.redhat.com

Hi,

On Tue, 2007-10-30 at 14:02 +0000, Patrick Caulfield wrote:
> Steven Whitehouse wrote:
> > Hi,
> >
> > On Thu, 2007-10-25 at 18:49 -0400, Lon Hohberger wrote:
> >> A common problem occurs when multiple IP addresses within the same
> >> subnet are assigned to the same NIC. If we make a connection attempt to
> >> another address on the same subnet as one of those addresses, the
> >> connection attempt will not necessarily be routed from the address we
> >> want.
> >>
> >> In the case of the DLM, the other nodes will quickly drop the connection
> >> attempt, causing problems.
> >>
> >> This patch makes the DLM bind to the local address it acquired from the
> >> cluster manager when using TCP prior to making a connection, obviating
> >> the need for administrators to "fix" their systems or use clever routing
> >> tricks.
> >>
> >
> > I'm happy to apply the patch, but I wonder why not just set the prefsrc
> > for the route? That will fix it for all applications not just the DLM.
> > Also if there are multiple NICs in the system with the same IP address
> > set assigned to them, then this solution may fail whereas the prefsrc
> > solution will continue to work as that allows specification of a device
> > too (among other things),
>
> I think it's appropriate to fix this in the DLM code. DLM doesn't work without
> it in this situation and there might (possibly!?) be applications that don't
> want prefsrc setting.
>

Ok, well I'll put the patch in, but it still sounds very odd to me. If
they really need to do different things for different applications then
setting the fwmark and doing route by fwmark would be the sensible
solution,

Steve.

^ permalink raw reply [flat|nested] 7+ messages in thread
* [Cluster-devel] Re: [PATCH] [DLM] Bind connections from known local address when using TCP
2007-10-31 13:33 ` Steven Whitehouse
@ 2007-10-31 13:54 ` Patrick Caulfield
0 siblings, 0 replies; 7+ messages in thread
From: Patrick Caulfield @ 2007-10-31 13:54 UTC (permalink / raw)
To: cluster-devel.redhat.com

Steven Whitehouse wrote:
> Hi,
>
> On Tue, 2007-10-30 at 14:02 +0000, Patrick Caulfield wrote:
>> Steven Whitehouse wrote:
>>> Hi,
>>>
>>> On Thu, 2007-10-25 at 18:49 -0400, Lon Hohberger wrote:
>>>> A common problem occurs when multiple IP addresses within the same
>>>> subnet are assigned to the same NIC. If we make a connection attempt to
>>>> another address on the same subnet as one of those addresses, the
>>>> connection attempt will not necessarily be routed from the address we
>>>> want.
>>>>
>>>> In the case of the DLM, the other nodes will quickly drop the connection
>>>> attempt, causing problems.
>>>>
>>>> This patch makes the DLM bind to the local address it acquired from the
>>>> cluster manager when using TCP prior to making a connection, obviating
>>>> the need for administrators to "fix" their systems or use clever routing
>>>> tricks.
>>>>
>>> I'm happy to apply the patch, but I wonder why not just set the prefsrc
>>> for the route? That will fix it for all applications not just the DLM.
>>> Also if there are multiple NICs in the system with the same IP address
>>> set assigned to them, then this solution may fail whereas the prefsrc
>>> solution will continue to work as that allows specification of a device
>>> too (among other things),
>> I think it's appropriate to fix this in the DLM code. DLM doesn't work without
>> it in this situation and there might (possibly!?) be applications that don't
>> want prefsrc setting.
>>
> Ok, well I'll put the patch in, but it still sounds very odd to me. If
> they really need to do different things for different applications then
> setting the fwmark and doing route by fwmark would be the sensible
> solution,

I think the point is that the DLM explicitly checks the source address of
incoming connections. The purpose of this is to ensure (to some extent at
least!) that it is talking to a known cluster node.

If we didn't do this check (and I'm reluctant to take it out) then we could
also do without this bind code. All we're doing here, really, is matching the
two halves of the connection code up properly.

Patrick

^ permalink raw reply [flat|nested] 7+ messages in thread
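
The receiving half that Patrick describes, checking the source address of an incoming connection against the known cluster nodes, might look roughly like the following in user space. This is a hedged sketch and not the DLM's actual code: the function name `accept_known` and the plain set of address strings are assumptions made for illustration, and IPv4 is assumed.

```python
import socket
from typing import Iterable, Optional


def accept_known(listener: socket.socket,
                 known_addrs: Iterable[str]) -> Optional[socket.socket]:
    """Accept one connection; keep it only if the peer is a known node.

    Hypothetical helper: returns the connected socket when the peer's
    source IP is in known_addrs, otherwise closes it and returns None.
    """
    allowed = set(known_addrs)
    conn, peer = listener.accept()
    peer_ip = peer[0]  # first element of the address tuple is the IP
    if peer_ip not in allowed:
        # Unknown source address: drop the connection immediately.
        conn.close()
        return None
    return conn
```

With a check like this on the accepting side, a connection that leaves the sender from an unexpected address on the same subnet is dropped, which is why the connecting side pins its source address first.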