* issues with the rdma-cm server side mapping of IP to GID
@ 2014-02-25  8:18 Or Gerlitz
From: Or Gerlitz @ 2014-02-25  8:18 UTC (permalink / raw)
  To: Hefty, Sean
  Cc: linux-rdma (linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org),
	Yan Burman

Hi Sean,

We came across a pretty deadly situation with an rdma-cm based
client/server application, where the client set up its RC QP to send to
HCA X on the server node but the server app opened its QP on HCA Y.
The result was un-acked RC packets and RC session failure.

This happened because the mapping from destination IP to destination
GID as seen by the client was different from what was present in the
server IP stack at the time the connection request arrived -- the server
side rdma-cm IP --> GID mapping is established by the
cma_translate_addr() call in cma_new_conn_id(), which is done on the
destination IP taken from the RDMA-CM header in the CM REQ.

Such a situation can happen in the following cases:

1. net.ipv4.conf.default.arp_ignore equals 0 (the default)
2. server side bonding/teaming fail-over when the Gratuitous ARP sent
was lost
3. re-ordering of the ibM net-device to HCA PCI device mapping after
server boot/crash
4. possibly more

Basically, when the rdma-cm observes a difference between the
destination GID present in the IB path within the CM REQ and the one
resolved locally, we should at least print a warning. Perhaps we should
reject the connection request? (In that case, I wasn't sure what the
appropriate reject reason would be.) Any more ideas?
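
To make the proposed check concrete, here is a rough user-space sketch
of the comparison. It is not rdma-cm code: struct gid, the policy and
verdict enums, and check_req_dgid() are all hypothetical names made up
for illustration, and the reject reason is deliberately left open.

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for a 128-bit GID; not the kernel's union ib_gid. */
struct gid {
	uint8_t raw[16];
};

/* Hypothetical policy: only warn on a mismatch, or refuse the request. */
enum gid_policy { GID_POLICY_WARN, GID_POLICY_REJECT };

/* Outcome reported back to the (hypothetical) caller. */
enum gid_verdict { GID_OK, GID_WARNED, GID_REJECTED };

/*
 * Compare the destination GID carried in the IB path of the CM REQ with
 * the GID obtained by resolving the destination IP locally. On a mismatch
 * a warning is always printed; the policy decides whether the caller
 * should also reject the request.
 */
enum gid_verdict check_req_dgid(const struct gid *req_dgid,
				const struct gid *local_gid,
				enum gid_policy policy)
{
	assert(req_dgid != NULL && local_gid != NULL);

	if (memcmp(req_dgid->raw, local_gid->raw, sizeof(req_dgid->raw)) == 0)
		return GID_OK;

	fprintf(stderr,
		"sketch: REQ destination GID differs from locally resolved GID\n");

	return policy == GID_POLICY_REJECT ? GID_REJECTED : GID_WARNED;
}
```

In the real code the comparison would presumably sit right after the
local address translation on the server side, at the point where both
GIDs are known.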

Or.
