* iboe - address resolution in userspace
@ 2010-07-29 17:09 Eli Cohen
2010-07-29 17:26 ` Jason Gunthorpe
0 siblings, 1 reply; 4+ messages in thread
From: Eli Cohen @ 2010-07-29 17:09 UTC (permalink / raw)
To: Roland Dreier; +Cc: linux-rdma
Roland,
This is the userspace patch implementing the outcome of the email
discussions about address resolution conducted over the last couple of
weeks.
We are planning to submit the full patch series based on your iboe
branch within the next few days with the hope that this can be pushed
upstream for 2.6.36.
>From ff1c4c562f91082126d53297e316437026fb8ac0 Mon Sep 17 00:00:00 2001
From: Eli Cohen <eli-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org>
Date: Wed, 28 Jul 2010 18:07:45 +0300
Subject: [PATCH] libibverbs: Add API to resolve GID to L2 address
Add a new API, resolve_eth_ll_gid(), which resolves a GID to layer 2 address
information. The GID resembles an IPv6 link local address and encodes the MAC
address and the VLAN tag within it. The function accepts the destination GID,
port number and source GID index and returns the MAC VLAN and indications
whether the remote address is tagged VLAN and whether the address is multicast.
Signed-off-by: Eli Cohen <eli-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org>
---
include/infiniband/driver.h | 7 ++++
src/verbs.c | 78 +++++++++++++++++++++++++++++++++++++++++++
2 files changed, 85 insertions(+), 0 deletions(-)
diff --git a/include/infiniband/driver.h b/include/infiniband/driver.h
index 9a81416..be4ec98 100644
--- a/include/infiniband/driver.h
+++ b/include/infiniband/driver.h
@@ -37,6 +37,8 @@
#include <infiniband/verbs.h>
#include <infiniband/kern-abi.h>
+#include <arpa/inet.h>
+#include <string.h>
#ifdef __cplusplus
# define BEGIN_C_DECLS extern "C" {
@@ -143,4 +145,9 @@ const char *ibv_get_sysfs_path(void);
int ibv_read_sysfs_file(const char *dir, const char *file,
char *buf, size_t size);
+int resolve_eth_gid(struct ibv_pd *pd, uint8_t port_num,
+ union ibv_gid *dgid, uint8_t sgid_index,
+ uint8_t mac[], uint16_t *vlan, uint8_t *tagged,
+ uint8_t *is_mcast);
+
#endif /* INFINIBAND_DRIVER_H */
diff --git a/src/verbs.c b/src/verbs.c
index ba3c0a4..a04e92f 100644
--- a/src/verbs.c
+++ b/src/verbs.c
@@ -148,6 +148,7 @@ struct ibv_pd *__ibv_alloc_pd(struct ibv_context *context)
}
default_symver(__ibv_alloc_pd, ibv_alloc_pd);
+
int __ibv_dealloc_pd(struct ibv_pd *pd)
{
return pd->context->ops.dealloc_pd(pd);
@@ -543,3 +544,80 @@ int __ibv_detach_mcast(struct ibv_qp *qp, const union ibv_gid *gid, uint16_t lid
return qp->context->ops.detach_mcast(qp, gid, lid);
}
default_symver(__ibv_detach_mcast, ibv_detach_mcast);
+
+static uint16_t get_vlan_id(union ibv_gid *dgid)
+{
+ uint16_t vid;
+
+ vid = dgid->raw[11] << 8 | dgid->raw[12];
+ return vid == 0xfffe ? 0 : vid & 0xfff;
+}
+
+static void get_ll_mac(union ibv_gid *gid, uint8_t *mac)
+{
+ memcpy(mac, &gid->raw[8], 3);
+ memcpy(mac + 3, &gid->raw[13], 3);
+ mac[0] ^= 2;
+}
+
+static int is_multicast_gid(union ibv_gid *gid)
+{
+ return gid->raw[0] == 0xff ? 1 : 0;
+}
+
+static void get_mcast_mac(union ibv_gid *gid, uint8_t *mac)
+{
+ int i;
+
+ mac[0] = 0x33;
+ mac[1] = 0x33;
+ for (i = 2; i < 6; ++i)
+ mac[i] = gid->raw[i + 10];
+}
+
+static int is_link_local_gid(union ibv_gid *gid)
+{
+ uint32_t hi = *(uint32_t *)(gid->raw);
+ uint32_t lo = *(uint32_t *)(gid->raw + 4);
+ if (hi == htonl(0xfe800000) && lo == 0)
+ return 1;
+
+ return 0;
+}
+
+static int resolve_grh(union ibv_gid *dgid, uint8_t *mac, uint8_t *is_mcast)
+{
+ if (is_link_local_gid(dgid))
+ get_ll_mac(dgid, mac);
+ else if (is_multicast_gid(dgid)) {
+ get_mcast_mac(dgid, mac);
+ *is_mcast = 1;
+ } else
+ return -EINVAL;
+
+ return 0;
+}
+
+int __resolve_eth_gid(struct ibv_pd *pd, uint8_t port_num,
+ union ibv_gid *dgid, uint8_t sgid_index,
+ uint8_t mac[], uint16_t *vlan, uint8_t *tagged,
+ uint8_t *is_mcast)
+{
+ int err;
+ union ibv_gid sgid;
+
+ err = resolve_grh(dgid, mac, is_mcast);
+ if (err)
+ return err;
+
+ err = ibv_query_gid(pd->context, port_num, sgid_index, &sgid);
+ if (err)
+ return err;
+
+ *vlan = get_vlan_id(&sgid);
+ *tagged = *vlan;
+
+ return 0;
+}
+default_symver(__resolve_eth_gid, resolve_eth_gid);
+
--
1.7.2
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply related [flat|nested] 4+ messages in thread* Re: iboe - address resolution in userspace
2010-07-29 17:09 iboe - address resolution in userspace Eli Cohen
@ 2010-07-29 17:26 ` Jason Gunthorpe
[not found] ` <AANLkTinp++TVkhz+vCnWX4kSU5o3rjh18C+e99Vq868b@mail.gmail.com>
0 siblings, 1 reply; 4+ messages in thread
From: Jason Gunthorpe @ 2010-07-29 17:26 UTC (permalink / raw)
To: Eli Cohen; +Cc: Roland Dreier, linux-rdma
On Thu, Jul 29, 2010 at 08:09:31PM +0300, Eli Cohen wrote:
> Add a new API, resolve_eth_ll_gid(), which resolves a GID to layer 2
> address information. The GID resembles an IPv6 link local address
> and encodes the MAC address and the VLAN tag within it. The function
> accepts the destination GID, port number and source GID index and
> returns the MAC VLAN and indications whether the remote address is
> tagged VLAN and whether the address is multicast.
Is it reasonable for the receiver to have an sgid table entry for
every vlan supported?
resolve_eth_gid should probably fail if dgid vlan != sgid vlan..
> +int resolve_eth_gid(struct ibv_pd *pd, uint8_t port_num,
> + union ibv_gid *dgid, uint8_t sgid_index,
> + uint8_t mac[], uint16_t *vlan, uint8_t *tagged,
> + uint8_t *is_mcast);
union ibv_gid * should be const (and const flowed down)
> +static int is_multicast_gid(union ibv_gid *gid)
> +{
> + return gid->raw[0] == 0xff ? 1 : 0;
> +}
This ternary if is kinda redundant?
> +static int resolve_grh(union ibv_gid *dgid, uint8_t *mac, uint8_t *is_mcast)
> +{
Funny name for the function? It does not touch GRHs?
> + *vlan = get_vlan_id(&sgid);
> + *tagged = *vlan;
It is hard to see what you are trying to do here, can you document
this GID address format you invented? vlan 0 is not special, you
shouldn't use it to mean untagged packets. I'm guessing the scheme
is if the GID has the 0xFFFE bit then it should be untagged
otherwise it should be tagged with the VID in bytes 11/12? If so
then this computation if *tagged is wrong, it prevents someone
creating tagged vlan id 0 packets, which is a valid thing to do.
Jason
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2010-08-04 17:24 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-07-29 17:09 iboe - address resolution in userspace Eli Cohen
2010-07-29 17:26 ` Jason Gunthorpe
[not found] ` <AANLkTinp++TVkhz+vCnWX4kSU5o3rjh18C+e99Vq868b@mail.gmail.com>
[not found] ` <AANLkTinp++TVkhz+vCnWX4kSU5o3rjh18C+e99Vq868b-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2010-08-01 16:43 ` Eli Cohen
2010-08-04 17:24 ` Jason Gunthorpe
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox