* RDMA Read sge errors
@ 2010-01-11 8:34 Håkon Bugge
[not found] ` <2AAC14CC-DACE-4CD8-833A-5E7406593732-U0mLk4xYmo8@public.gmane.org>
0 siblings, 1 reply; 3+ messages in thread
From: Håkon Bugge @ 2010-01-11 8:34 UTC (permalink / raw)
To: OFED mailing list
Hi,
I modified the perftest program ib_read_bw to use sge lists. The length of each sge is 1 (one) byte. I configured the QP to support up to 32 sges (aligned with the capabilities of the device).
Now, everything works correct up to 30 sges. Using 31 or 32 sges, I receive 29 bytes in both cases, but completion gives local length error.
I am running on CentOS 5.3, Intel E5540, OFED 1.4.1, and ConnectX w/fw 2.6.000.
Thanks, Håkon
--
Håkon Bugge
Haakon.bugge-U0mLk4xYmo8@public.gmane.org
+47 924 84 514
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: RDMA Read sge errors
[not found] ` <2AAC14CC-DACE-4CD8-833A-5E7406593732-U0mLk4xYmo8@public.gmane.org>
@ 2010-01-11 12:21 ` Or Gerlitz
[not found] ` <4B4B17DA.2070106-hKgKHo2Ms0FWk0Htik3J/w@public.gmane.org>
0 siblings, 1 reply; 3+ messages in thread
From: Or Gerlitz @ 2010-01-11 12:21 UTC (permalink / raw)
To: Jack Morgenstein; +Cc: Håkon Bugge, linux-rdma
Jack, I see now that commit cd155c1 "IB/mlx4: Fix creation of kernel QP with max number of send s/g entries" is mainstream but not ofed 1.4.x and that mlx4_0090_fix_sq_wrs.patch (below) is in ofed but not mainstream, was it rejected from the mainline kernel? why?
Or.
1. Limit qp resources accepted for ib_create_qp() to the limits reported
in ib_query_device(). In kernel space,make sure that the limits
returned to the caller following qp creation also lie within the
reported device limits. For userspace, report as before, and
do adjustment in libmlx4 (so as not to break ABI).
2. Limit max number of wqes per QP reported when querying the device,
so that ib_create_qp will never fail due to any additional headroom WQEs allocated.
Signed-off-by: Jack Morgenstein <jackm-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>
---
drivers/infiniband/hw/mlx4/main.c | 2 +-
drivers/infiniband/hw/mlx4/mlx4_ib.h | 7 +++++++
drivers/infiniband/hw/mlx4/qp.c | 25 +++++++++++++++++++------
3 files changed, 27 insertions(+), 7 deletions(-)
Index: ofed_kernel/drivers/infiniband/hw/mlx4/main.c
===================================================================
--- ofed_kernel.orig/drivers/infiniband/hw/mlx4/main.c
+++ ofed_kernel/drivers/infiniband/hw/mlx4/main.c
@@ -122,7 +122,7 @@ static int mlx4_ib_query_device(struct i
props->max_mr_size = ~0ull;
props->page_size_cap = dev->dev->caps.page_size_cap;
props->max_qp = dev->dev->caps.num_qps - dev->dev->caps.reserved_qps;
- props->max_qp_wr = dev->dev->caps.max_wqes;
+ props->max_qp_wr = dev->dev->caps.max_wqes - MLX4_IB_SQ_MAX_SPARE;
props->max_sge = min(dev->dev->caps.max_sq_sg,
dev->dev->caps.max_rq_sg);
props->max_cq = dev->dev->caps.num_cqs - dev->dev->caps.reserved_cqs;
Index: ofed_kernel/drivers/infiniband/hw/mlx4/mlx4_ib.h
===================================================================
--- ofed_kernel.orig/drivers/infiniband/hw/mlx4/mlx4_ib.h
+++ ofed_kernel/drivers/infiniband/hw/mlx4/mlx4_ib.h
@@ -44,6 +44,13 @@
#include <linux/mlx4/device.h>
#include <linux/mlx4/doorbell.h>
+enum {
+ MLX4_IB_SQ_MIN_WQE_SHIFT = 6
+};
+
+#define MLX4_IB_SQ_HEADROOM(shift) ((2048 >> (shift)) + 1)
+#define MLX4_IB_SQ_MAX_SPARE (MLX4_IB_SQ_HEADROOM(MLX4_IB_SQ_MIN_WQE_SHIFT))
+
struct mlx4_ib_ucontext {
struct ib_ucontext ibucontext;
struct mlx4_uar uar;
Index: ofed_kernel/drivers/infiniband/hw/mlx4/qp.c
===================================================================
--- ofed_kernel.orig/drivers/infiniband/hw/mlx4/qp.c
+++ ofed_kernel/drivers/infiniband/hw/mlx4/qp.c
@@ -289,8 +289,9 @@ static int set_rq_size(struct mlx4_ib_de
int is_user, int has_srq, struct mlx4_ib_qp *qp)
{
/* Sanity check RQ size before proceeding */
- if (cap->max_recv_wr > dev->dev->caps.max_wqes ||
- cap->max_recv_sge > dev->dev->caps.max_rq_sg)
+ if (cap->max_recv_wr > dev->dev->caps.max_wqes - MLX4_IB_SQ_MAX_SPARE ||
+ cap->max_recv_sge >
+ min(dev->dev->caps.max_sq_sg, dev->dev->caps.max_rq_sg))
return -EINVAL;
if (has_srq) {
@@ -309,8 +310,19 @@ static int set_rq_size(struct mlx4_ib_de
qp->rq.wqe_shift = ilog2(qp->rq.max_gs * sizeof (struct mlx4_wqe_data_seg));
}
- cap->max_recv_wr = qp->rq.max_post = qp->rq.wqe_cnt;
- cap->max_recv_sge = qp->rq.max_gs;
+ /* leave userspace return values as they were, so as not to break ABI */
+ if (is_user) {
+ cap->max_recv_wr = qp->rq.max_post = qp->rq.wqe_cnt;
+ cap->max_recv_sge = qp->rq.max_gs;
+ } else {
+ cap->max_recv_wr = qp->rq.max_post =
+ min(dev->dev->caps.max_wqes - MLX4_IB_SQ_MAX_SPARE, qp->rq.wqe_cnt);
+ cap->max_recv_sge = min(qp->rq.max_gs,
+ min(dev->dev->caps.max_sq_sg,
+ dev->dev->caps.max_rq_sg));
+ }
+ /* We don't support inline sends for kernel QPs (yet) */
+
return 0;
}
@@ -321,8 +333,9 @@ static int set_kernel_sq_size(struct mlx
int s;
/* Sanity check SQ size before proceeding */
- if (cap->max_send_wr > dev->dev->caps.max_wqes ||
- cap->max_send_sge > dev->dev->caps.max_sq_sg ||
+ if (cap->max_send_wr > (dev->dev->caps.max_wqes - MLX4_IB_SQ_MAX_SPARE) ||
+ cap->max_send_sge >
+ min(dev->dev->caps.max_sq_sg, dev->dev->caps.max_rq_sg) ||
cap->max_inline_data + send_wqe_overhead(type, qp->flags) +
sizeof (struct mlx4_wqe_inline_seg) > dev->dev->caps.max_sq_desc_sz)
return -EINVAL;
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 3+ messages in thread
* RE: RDMA Read sge errors
[not found] ` <4B4B17DA.2070106-hKgKHo2Ms0FWk0Htik3J/w@public.gmane.org>
@ 2010-01-12 12:59 ` Jack Morgenstein
0 siblings, 0 replies; 3+ messages in thread
From: Jack Morgenstein @ 2010-01-12 12:59 UTC (permalink / raw)
To: Or Gerlitz, rdreier-FYB4Gu1CFyUAvxtiuMwx3w; +Cc: linux-rdma
Adding our correspondence to the openfabrics list:
I think it just sort of got dropped. I submitted a 5-patch set to
Roland and the list on October 24, 2007.
See the thread beginning at:
http://lists.openfabrics.org/pipermail/general/2007-October/042351.html
We didn't pursue it at the time.
-Jack
so all the five patches are (A) needed and (B) non-merged upstream (mlx4
&& libmlx4)?!
if they aren't really needed is there any reason to keep them in ofed?
Or.
They are really needed -- that is why they have been kept in.
They are also used in the XRC implementation.
-Jack
not that I enjoy putting things in TODO for you, but its really bad
that such patches are floating around, e.g exist only in ofed.
Or.
> -----Original Message-----
> From: Or Gerlitz [mailto:ogerlitz-hKgKHo2Ms0FWk0Htik3J/w@public.gmane.org]
> Sent: Monday, January 11, 2010 2:22 PM
> To: Jack Morgenstein
> Cc: Haakon.Bugge-UdXhSnd/wVw@public.gmane.org; linux-rdma
> Subject: Re: RDMA Read sge errors
>
> Jack, I see now that commit cd155c1 "IB/mlx4: Fix creation of kernel
QP with
> max number of send s/g entries" is mainstream but not ofed 1.4.x and
that
> mlx4_0090_fix_sq_wrs.patch (below) is in ofed but not mainstream, was
it
> rejected from the mainline kernel? why?
>
> Or.
>
>
> 1. Limit qp resources accepted for ib_create_qp() to the limits
reported
> in ib_query_device(). In kernel space,make sure that the limits
> returned to the caller following qp creation also lie within the
> reported device limits. For userspace, report as before, and
> do adjustment in libmlx4 (so as not to break ABI).
>
> 2. Limit max number of wqes per QP reported when querying the device,
> so that ib_create_qp will never fail due to any additional headroom
WQEs
> allocated.
>
> Signed-off-by: Jack Morgenstein <jackm-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>
>
> ---
> drivers/infiniband/hw/mlx4/main.c | 2 +-
> drivers/infiniband/hw/mlx4/mlx4_ib.h | 7 +++++++
> drivers/infiniband/hw/mlx4/qp.c | 25 +++++++++++++++++++------
> 3 files changed, 27 insertions(+), 7 deletions(-)
>
> Index: ofed_kernel/drivers/infiniband/hw/mlx4/main.c
> ===================================================================
> --- ofed_kernel.orig/drivers/infiniband/hw/mlx4/main.c
> +++ ofed_kernel/drivers/infiniband/hw/mlx4/main.c
> @@ -122,7 +122,7 @@ static int mlx4_ib_query_device(struct i
> props->max_mr_size = ~0ull;
> props->page_size_cap = dev->dev->caps.page_size_cap;
> props->max_qp = dev->dev->caps.num_qps - dev->dev-
> >caps.reserved_qps;
> - props->max_qp_wr = dev->dev->caps.max_wqes;
> + props->max_qp_wr = dev->dev->caps.max_wqes -
MLX4_IB_SQ_MAX_SPARE;
> props->max_sge = min(dev->dev->caps.max_sq_sg,
> dev->dev->caps.max_rq_sg);
> props->max_cq = dev->dev->caps.num_cqs - dev->dev-
> >caps.reserved_cqs;
> Index: ofed_kernel/drivers/infiniband/hw/mlx4/mlx4_ib.h
> ===================================================================
> --- ofed_kernel.orig/drivers/infiniband/hw/mlx4/mlx4_ib.h
> +++ ofed_kernel/drivers/infiniband/hw/mlx4/mlx4_ib.h
> @@ -44,6 +44,13 @@
> #include <linux/mlx4/device.h>
> #include <linux/mlx4/doorbell.h>
>
> +enum {
> + MLX4_IB_SQ_MIN_WQE_SHIFT = 6
> +};
> +
> +#define MLX4_IB_SQ_HEADROOM(shift) ((2048 >> (shift)) + 1)
> +#define MLX4_IB_SQ_MAX_SPARE
(MLX4_IB_SQ_HEADROOM(MLX4_IB_SQ_MIN_WQE_SHIFT))
> +
> struct mlx4_ib_ucontext {
> struct ib_ucontext ibucontext;
> struct mlx4_uar uar;
> Index: ofed_kernel/drivers/infiniband/hw/mlx4/qp.c
> ===================================================================
> --- ofed_kernel.orig/drivers/infiniband/hw/mlx4/qp.c
> +++ ofed_kernel/drivers/infiniband/hw/mlx4/qp.c
> @@ -289,8 +289,9 @@ static int set_rq_size(struct mlx4_ib_de
> int is_user, int has_srq, struct mlx4_ib_qp *qp)
> {
> /* Sanity check RQ size before proceeding */
> - if (cap->max_recv_wr > dev->dev->caps.max_wqes ||
> - cap->max_recv_sge > dev->dev->caps.max_rq_sg)
> + if (cap->max_recv_wr > dev->dev->caps.max_wqes -
MLX4_IB_SQ_MAX_SPARE ||
> + cap->max_recv_sge >
> + min(dev->dev->caps.max_sq_sg, dev->dev->caps.max_rq_sg))
> return -EINVAL;
>
> if (has_srq) {
> @@ -309,8 +310,19 @@ static int set_rq_size(struct mlx4_ib_de
> qp->rq.wqe_shift = ilog2(qp->rq.max_gs * sizeof (struct
> mlx4_wqe_data_seg));
> }
>
> - cap->max_recv_wr = qp->rq.max_post = qp->rq.wqe_cnt;
> - cap->max_recv_sge = qp->rq.max_gs;
> + /* leave userspace return values as they were, so as not to break
ABI */
> + if (is_user) {
> + cap->max_recv_wr = qp->rq.max_post = qp->rq.wqe_cnt;
> + cap->max_recv_sge = qp->rq.max_gs;
> + } else {
> + cap->max_recv_wr = qp->rq.max_post =
> + min(dev->dev->caps.max_wqes - MLX4_IB_SQ_MAX_SPARE,
qp-
> >rq.wqe_cnt);
> + cap->max_recv_sge = min(qp->rq.max_gs,
> + min(dev->dev->caps.max_sq_sg,
> + dev->dev->caps.max_rq_sg));
> + }
> + /* We don't support inline sends for kernel QPs (yet) */
> +
>
> return 0;
> }
> @@ -321,8 +333,9 @@ static int set_kernel_sq_size(struct mlx
> int s;
>
> /* Sanity check SQ size before proceeding */
> - if (cap->max_send_wr > dev->dev->caps.max_wqes ||
> - cap->max_send_sge > dev->dev->caps.max_sq_sg ||
> + if (cap->max_send_wr > (dev->dev->caps.max_wqes -
> MLX4_IB_SQ_MAX_SPARE) ||
> + cap->max_send_sge >
> + min(dev->dev->caps.max_sq_sg, dev->dev->caps.max_rq_sg) ||
> cap->max_inline_data + send_wqe_overhead(type, qp->flags) +
> sizeof (struct mlx4_wqe_inline_seg) >
dev->dev->caps.max_sq_desc_sz)
> return -EINVAL;
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2010-01-12 12:59 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-01-11 8:34 RDMA Read sge errors Håkon Bugge
[not found] ` <2AAC14CC-DACE-4CD8-833A-5E7406593732-U0mLk4xYmo8@public.gmane.org>
2010-01-11 12:21 ` Or Gerlitz
[not found] ` <4B4B17DA.2070106-hKgKHo2Ms0FWk0Htik3J/w@public.gmane.org>
2010-01-12 12:59 ` Jack Morgenstein
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox