* RDMA Multicasting
@ 2015-04-08 18:17 Allen Andrews
[not found] ` <A1EC1DB6D8AEAF4CB8006B3653F61C5471F997F9-DWYeeINJQrxExQ8dmkPuX0M9+F4ksjoh@public.gmane.org>
0 siblings, 1 reply; 7+ messages in thread
From: Allen Andrews @ 2015-04-08 18:17 UTC (permalink / raw)
To: linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
All,
I am trying to find out if RDMA Multicasting is supported on RoCE in Linux.
If so, is it supported above the verbs interface?
Allen Andrews
Emulex
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread[parent not found: <A1EC1DB6D8AEAF4CB8006B3653F61C5471F997F9-DWYeeINJQrxExQ8dmkPuX0M9+F4ksjoh@public.gmane.org>]
* Re: RDMA Multicasting [not found] ` <A1EC1DB6D8AEAF4CB8006B3653F61C5471F997F9-DWYeeINJQrxExQ8dmkPuX0M9+F4ksjoh@public.gmane.org> @ 2015-04-09 14:56 ` Christoph Lameter [not found] ` <5526c0f8.2116430a.28f7.ffffd059SMTPIN_ADDED_BROKEN@mx.google.com> 0 siblings, 1 reply; 7+ messages in thread From: Christoph Lameter @ 2015-04-09 14:56 UTC (permalink / raw) To: Allen Andrews; +Cc: linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org On Wed, 8 Apr 2015, Allen Andrews wrote: > I am trying to find out if RDMA Multicasting is supported on RoCE in Linux. > If so, is it supported above the verbs interface? RDMA multicasting? You mean sending ud(or udp) messages via the RDMA API? That works even without RoCE. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 7+ messages in thread
[parent not found: <5526c0f8.2116430a.28f7.ffffd059SMTPIN_ADDED_BROKEN@mx.google.com>]
[parent not found: <5526c0f8.2116430a.28f7.ffffd059SMTPIN_ADDED_BROKEN-ATjtLOhZ0NVl57MIdRCFDg@public.gmane.org>]
* Re: RDMA Multicasting [not found] ` <5526c0f8.2116430a.28f7.ffffd059SMTPIN_ADDED_BROKEN-ATjtLOhZ0NVl57MIdRCFDg@public.gmane.org> @ 2015-04-09 18:35 ` Christoph Lameter [not found] ` <5526e18b.a50c450a.4b9f.77beSMTPIN_ADDED_BROKEN@mx.google.com> 0 siblings, 1 reply; 7+ messages in thread From: Christoph Lameter @ 2015-04-09 18:35 UTC (permalink / raw) To: Caitlin Bestler; +Cc: Allen Andrews, linux-rdma-u79uwXL29TY76Z2rM5mHXA On Thu, 9 Apr 2015, Caitlin Bestler wrote: > RDMA requires specific delivery semantics. Successful completion of an > untagged message (an RDMA Send) implies that all prior tagged > (Write/Read) packets have been successfully placed in user memory. That seems to be incompatible with a multicast delivery mechanism because there is state to all endpoints involved. > Just putting RDMA packets over an unreliable transport will not accomplish that. I am not sure what an "RDMA packet" is. RDMA is a memory to memory transfer action. A packet is information send on a medium. A packet that describes an RDMA action to be taken between certain endpoints? Infiniband is lossless and thus what "unreliable" means is also quite foggy. > You are probably better off using RDMA ideas over UDP/UD, and doing the > direct memory placement from your own code, instead. So send the memory transfer info via multicast datagram to the endpoints and then run the transfer from the endpoint. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 7+ messages in thread
[parent not found: <5526e18b.a50c450a.4b9f.77beSMTPIN_ADDED_BROKEN@mx.google.com>]
[parent not found: <5526e18b.a50c450a.4b9f.77beSMTPIN_ADDED_BROKEN-ATjtLOhZ0NVl57MIdRCFDg@public.gmane.org>]
* Re: RDMA Multicasting [not found] ` <5526e18b.a50c450a.4b9f.77beSMTPIN_ADDED_BROKEN-ATjtLOhZ0NVl57MIdRCFDg@public.gmane.org> @ 2015-04-10 15:14 ` Christoph Lameter [not found] ` <alpine.DEB.2.11.1504101011300.30374-gkYfJU5Cukgdnm+yROfE0A@public.gmane.org> 0 siblings, 1 reply; 7+ messages in thread From: Christoph Lameter @ 2015-04-10 15:14 UTC (permalink / raw) To: Caitlin Bestler; +Cc: Allen Andrews, linux-rdma-u79uwXL29TY76Z2rM5mHXA On Thu, 9 Apr 2015, Caitlin Bestler wrote: > > Infiniband is lossless and thus what "unreliable" means is also quite > > foggy. > > InfiniBand is not lossless. It does a superb job of avoiding drops caused > by congestion. But applications should not assume that all UD messages > have been received, but should add their own checking. That is why the > U stands for Unreliable. Well applications can screw up yes but the fabric *IS* lossless. > > > You are probably better off using RDMA ideas over UDP/UD, and doing the > > > direct memory placement from your own code, instead. > > > > So send the memory transfer info via multicast datagram to the > > endpoints and then run the transfer from the endpoint. > > Yes, possibly in a kermel module. Why? You can simply do this already from userspace with verbs messaging and RDMA tranfers. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 7+ messages in thread
[parent not found: <alpine.DEB.2.11.1504101011300.30374-gkYfJU5Cukgdnm+yROfE0A@public.gmane.org>]
* Re: RDMA Multicasting [not found] ` <alpine.DEB.2.11.1504101011300.30374-gkYfJU5Cukgdnm+yROfE0A@public.gmane.org> @ 2015-04-10 16:54 ` Atchley, Scott 2015-04-10 21:33 ` Bob Ciotti 1 sibling, 0 replies; 7+ messages in thread From: Atchley, Scott @ 2015-04-10 16:54 UTC (permalink / raw) To: Christoph Lameter Cc: Caitlin Bestler, Allen Andrews, linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org On Apr 10, 2015, at 11:14 AM, Christoph Lameter <cl-vYTEC60ixJUAvxtiuMwx3w@public.gmane.org> wrote: > On Thu, 9 Apr 2015, Caitlin Bestler wrote: > >>> Infiniband is lossless and thus what "unreliable" means is also quite >>> foggy. >> >> InfiniBand is not lossless. It does a superb job of avoiding drops caused >> by congestion. But applications should not assume that all UD messages >> have been received, but should add their own checking. That is why the >> U stands for Unreliable. > > Well applications can screw up yes but the fabric *IS* lossless. On a very rare bit flip, the switches or HCAs can drop a UD packet. Lossless, but not guaranteed. > >>>> You are probably better off using RDMA ideas over UDP/UD, and doing the >>>> direct memory placement from your own code, instead. >>> >>> So send the memory transfer info via multicast datagram to the >>> endpoints and then run the transfer from the endpoint. >> >> Yes, possibly in a kermel module. > > Why? You can simply do this already from userspace with verbs messaging > and RDMA tranfers. > -- > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org > More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: RDMA Multicasting [not found] ` <alpine.DEB.2.11.1504101011300.30374-gkYfJU5Cukgdnm+yROfE0A@public.gmane.org> 2015-04-10 16:54 ` Atchley, Scott @ 2015-04-10 21:33 ` Bob Ciotti [not found] ` <5528418E.9040403-NSQ8wuThN14@public.gmane.org> 1 sibling, 1 reply; 7+ messages in thread From: Bob Ciotti @ 2015-04-10 21:33 UTC (permalink / raw) To: Christoph Lameter, Caitlin Bestler Cc: Allen Andrews, linux-rdma-u79uwXL29TY76Z2rM5mHXA On 4/10/15 8:14 AM, Christoph Lameter wrote: > On Thu, 9 Apr 2015, Caitlin Bestler wrote: > >>> Infiniband is lossless and thus what "unreliable" means is also quite >>> foggy. >> >> InfiniBand is not lossless. It does a superb job of avoiding drops caused >> by congestion. But applications should not assume that all UD messages >> have been received, but should add their own checking. That is why the >> U stands for Unreliable. > > Well applications can screw up yes but the fabric *IS* lossless. This is a very bad assumption. Fabric *is not* lossless. Packet loss happens - for various reasons that are typically specific to the environment. We have found several broken pieces of SW that assumed RC connections are reliable, and do bad things when the qp hits its retry limit. So not only loosing 1 packet, but 7 in a row. Building SW that relied on RC - well OK, but assuming UD reliable in all environments? If HW never broke, and systems were never overloaded, room temperatures never fluctuated, IB cables were always reliable... Its not that uncommon to see packet loss on a perfectly functioning (large) system let alone one thats having HW issues... >>>> You are probably better off using RDMA ideas over UDP/UD, and doing the >>>> direct memory placement from your own code, instead. >>> >>> So send the memory transfer info via multicast datagram to the >>> endpoints and then run the transfer from the endpoint. >> >> Yes, possibly in a kermel module. > > Why? You can simply do this already from userspace with verbs messaging > and RDMA tranfers. > -- > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 7+ messages in thread
[parent not found: <5528418E.9040403-NSQ8wuThN14@public.gmane.org>]
* Re: RDMA Multicasting [not found] ` <5528418E.9040403-NSQ8wuThN14@public.gmane.org> @ 2015-04-11 2:07 ` Christoph Lameter 0 siblings, 0 replies; 7+ messages in thread From: Christoph Lameter @ 2015-04-11 2:07 UTC (permalink / raw) To: Bob Ciotti Cc: Caitlin Bestler, Allen Andrews, linux-rdma-u79uwXL29TY76Z2rM5mHXA On Fri, 10 Apr 2015, Bob Ciotti wrote: > > Well applications can screw up yes but the fabric *IS* lossless. > > This is a very bad assumption. Fabric *is not* lossless. I guess then we need to get our money back because it was advertised to be lossless.... > Packet loss happens - for various reasons that are typically specific to the > environment. Right. Even the most reliable transport will not work if someone goes postal and gets a big sledgehammer and starts hitting the equipment with that. > If HW never broke, and systems were never overloaded, room temperatures never > fluctuated, IB > cables were always reliable... Its not that uncommon to see packet loss on a > perfectly functioning > (large) system let alone one thats having HW issues... If things break then yes but otherwise the fabric is expected to behave as advertised. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2015-04-11 2:07 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-04-08 18:17 RDMA Multicasting Allen Andrews
[not found] ` <A1EC1DB6D8AEAF4CB8006B3653F61C5471F997F9-DWYeeINJQrxExQ8dmkPuX0M9+F4ksjoh@public.gmane.org>
2015-04-09 14:56 ` Christoph Lameter
[not found] ` <5526c0f8.2116430a.28f7.ffffd059SMTPIN_ADDED_BROKEN@mx.google.com>
[not found] ` <5526c0f8.2116430a.28f7.ffffd059SMTPIN_ADDED_BROKEN-ATjtLOhZ0NVl57MIdRCFDg@public.gmane.org>
2015-04-09 18:35 ` Christoph Lameter
[not found] ` <5526e18b.a50c450a.4b9f.77beSMTPIN_ADDED_BROKEN@mx.google.com>
[not found] ` <5526e18b.a50c450a.4b9f.77beSMTPIN_ADDED_BROKEN-ATjtLOhZ0NVl57MIdRCFDg@public.gmane.org>
2015-04-10 15:14 ` Christoph Lameter
[not found] ` <alpine.DEB.2.11.1504101011300.30374-gkYfJU5Cukgdnm+yROfE0A@public.gmane.org>
2015-04-10 16:54 ` Atchley, Scott
2015-04-10 21:33 ` Bob Ciotti
[not found] ` <5528418E.9040403-NSQ8wuThN14@public.gmane.org>
2015-04-11 2:07 ` Christoph Lameter
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox