public inbox for linux-rdma@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH] DAPL v2.0: common: increase default IB ack timer from 16 to 20
@ 2011-11-02 21:35 Davis, Arlin R
       [not found] ` <54347E5A035A054EAE9D05927FB467F916E93128-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: Davis, Arlin R @ 2011-11-02 21:35 UTC (permalink / raw)
  To: linux-rdma; +Cc: ofw-ZwoEplunGu1OwGhvXhtEPSCwEArCW2h5@public.gmane.org

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain; charset="utf-8", Size: 1408 bytes --]

For larger, more congested fabrics, a larger ACK timer is needed.
Consumers can still change default with environment variable
DAPL_ACK_TIMER if they need to increase or decrease.

This applies to SCM and UCM providers only. The CMA provider, which
uses rdma_cm, has no way to control ack timer with current API.

Signed-off-by: Arlin Davis <arlin.r.davis@intel.com>
---
 dapl/openib_common/dapl_ib_common.h |    4 ++--
 1 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/dapl/openib_common/dapl_ib_common.h b/dapl/openib_common/dapl_ib_common.h
index a0dd772..e757b65 100644
--- a/dapl/openib_common/dapl_ib_common.h
+++ b/dapl/openib_common/dapl_ib_common.h
@@ -158,8 +158,8 @@ typedef uint16_t		ib_hca_port_t;
 #define DAT_UD_QKEY	0x78654321
 
 /* RC timer - retry count defaults */
-#define DCM_ACK_TIMER	16 /* 5 bits, 4.096us*2^ack_timer. 16== 268ms */
-#define DCM_ACK_RETRY	7  /* 3 bits, 7 * 268ms = 1.8 seconds */
+#define DCM_ACK_TIMER	20 /* 5 bits, 4.096us*2^ack_timer. 16== 268ms, 20==4.2s */
+#define DCM_ACK_RETRY	7  /* 3 bits, 7 * 4.2  == 30 seconds */
 #define DCM_RNR_TIMER	12 /* 5 bits, 12 =.64ms, 28 =163ms, 31 =491ms */
 #define DCM_RNR_RETRY	7  /* 3 bits, 7 == infinite */
 #define DCM_IB_MTU	2048
-- 
1.7.3



N‹§²æìr¸›yúèšØb²X¬¶Ç§vØ^–)Þº{.nÇ+‰·¥Š{±­ÙšŠ{ayº\x1dʇڙë,j\a­¢f£¢·hš‹»öì\x17/oSc¾™Ú³9˜uÀ¦æå‰È&jw¨®\x03(­éšŽŠÝ¢j"ú\x1a¶^[m§ÿïêäz¹Þ–Šàþf£¢·hšˆ§~ˆmš

^ permalink raw reply related	[flat|nested] 6+ messages in thread

* RE: [PATCH] DAPL v2.0: common: increase default IB ack timer from 16 to 20
       [not found] ` <54347E5A035A054EAE9D05927FB467F916E93128-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
@ 2011-11-02 22:40   ` Hefty, Sean
       [not found]     ` <1828884A29C6694DAF28B7E6B8A8237316E8CBCB-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: Hefty, Sean @ 2011-11-02 22:40 UTC (permalink / raw)
  To: Davis, Arlin R, linux-rdma
  Cc: ofw-ZwoEplunGu1OwGhvXhtEPSCwEArCW2h5@public.gmane.org

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain; charset="utf-8", Size: 628 bytes --]

> For larger, more congested fabrics, a larger ACK timer is needed.
> Consumers can still change default with environment variable
> DAPL_ACK_TIMER if they need to increase or decrease.
> 
> This applies to SCM and UCM providers only. The CMA provider, which
> uses rdma_cm, has no way to control ack timer with current API.

The rdma_cm uses the path lifetime to determine the QP timeout value.  The path record can be manipulated by the user when calling rdma_create_ep().

- Sean

N‹§²æìr¸›yúèšØb²X¬¶Ç§vØ^–)Þº{.nÇ+‰·¥Š{±­ÙšŠ{ayº\x1dʇڙë,j\a­¢f£¢·hš‹»öì\x17/oSc¾™Ú³9˜uÀ¦æå‰È&jw¨®\x03(­éšŽŠÝ¢j"ú\x1a¶^[m§ÿïêäz¹Þ–Šàþf£¢·hšˆ§~ˆmš

^ permalink raw reply	[flat|nested] 6+ messages in thread

* RE: [PATCH] DAPL v2.0: common: increase default IB ack timer from 16 to 20
       [not found]     ` <1828884A29C6694DAF28B7E6B8A8237316E8CBCB-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
@ 2011-11-02 23:09       ` Davis, Arlin R
       [not found]         ` <54347E5A035A054EAE9D05927FB467F916E931C5-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: Davis, Arlin R @ 2011-11-02 23:09 UTC (permalink / raw)
  To: Hefty, Sean, linux-rdma
  Cc: ofw-ZwoEplunGu1OwGhvXhtEPSCwEArCW2h5@public.gmane.org


> > For larger, more congested fabrics, a larger ACK timer is needed.
> > Consumers can still change default with environment variable
> > DAPL_ACK_TIMER if they need to increase or decrease.
> >
> > This applies to SCM and UCM providers only. The CMA provider, which
> > uses rdma_cm, has no way to control ack timer with current API.
> 
> The rdma_cm uses the path lifetime to determine the QP timeout value.
> The path record can be manipulated by the user when calling
> rdma_create_ep().

I stand corrected. I guess I should have said "has no direct way". 
Yes, a user can modify path lifetime via rdma_create_ep() but there 
is no way for the user to know how much that will be manipulated and 
increased in the IB CM driver. 



^ permalink raw reply	[flat|nested] 6+ messages in thread

* RE: [PATCH] DAPL v2.0: common: increase default IB ack timer from 16 to 20
       [not found]         ` <54347E5A035A054EAE9D05927FB467F916E931C5-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
@ 2011-11-02 23:19           ` Hefty, Sean
  2011-11-02 23:31             ` Davis, Arlin R
  0 siblings, 1 reply; 6+ messages in thread
From: Hefty, Sean @ 2011-11-02 23:19 UTC (permalink / raw)
  To: Davis, Arlin R, linux-rdma
  Cc: ofw-ZwoEplunGu1OwGhvXhtEPSCwEArCW2h5@public.gmane.org

> Yes, a user can modify path lifetime via rdma_create_ep() but there
> is no way for the user to know how much that will be manipulated and
> increased in the IB CM driver.

Sure there is.  It's an open source driver.  :)

The ib_cm calculates the "correct" timeout based on the packet lifetime provided by the SA and the CA ack timeout.  If user modifications to this value are necessary, then there is a bug in either the SA or HCA driver. 

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] DAPL v2.0: common: increase default IB ack timer from 16 to 20
  2011-11-02 23:19           ` Hefty, Sean
@ 2011-11-02 23:31             ` Davis, Arlin R
       [not found]               ` <54347E5A035A054EAE9D05927FB467F916E931E8-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: Davis, Arlin R @ 2011-11-02 23:31 UTC (permalink / raw)
  To: Hefty, Sean, linux-rdma; +Cc: ofw@lists.openfabrics.org

> > Yes, a user can modify path lifetime via rdma_create_ep() but there
> > is no way for the user to know how much that will be manipulated and
> > increased in the IB CM driver.
> 
> Sure there is.  It's an open source driver.  :)
> 
> The ib_cm calculates the "correct" timeout based on the packet lifetime
> provided by the SA and the CA ack timeout.  If user modifications to
> this value are necessary, then there is a bug in either the SA or HCA
> driver.

or there could be congestion on the fabric that the SA or HCA driver is unaware of. 

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] DAPL v2.0: common: increase default IB ack timer from 16 to 20
       [not found]               ` <54347E5A035A054EAE9D05927FB467F916E931E8-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
@ 2011-11-03  2:28                 ` Jason Gunthorpe
  0 siblings, 0 replies; 6+ messages in thread
From: Jason Gunthorpe @ 2011-11-03  2:28 UTC (permalink / raw)
  To: Davis, Arlin R
  Cc: Hefty, Sean, linux-rdma,
	ofw-ZwoEplunGu1OwGhvXhtEPSCwEArCW2h5@public.gmane.org

On Wed, Nov 02, 2011 at 11:31:38PM +0000, Davis, Arlin R wrote:
> > > Yes, a user can modify path lifetime via rdma_create_ep() but there
> > > is no way for the user to know how much that will be manipulated and
> > > increased in the IB CM driver.
> > 
> > Sure there is.  It's an open source driver.  :)
> > 
> > The ib_cm calculates the "correct" timeout based on the packet lifetime
> > provided by the SA and the CA ack timeout.  If user modifications to
> > this value are necessary, then there is a bug in either the SA or HCA
> > driver.
> 
> or there could be congestion on the fabric that the SA or HCA driver
> is unaware of.

That is what packet life time is supposed to model, and the policy
knob for it belongs in the SA, not every app..

Jason
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2011-11-03  2:28 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-11-02 21:35 [PATCH] DAPL v2.0: common: increase default IB ack timer from 16 to 20 Davis, Arlin R
     [not found] ` <54347E5A035A054EAE9D05927FB467F916E93128-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
2011-11-02 22:40   ` Hefty, Sean
     [not found]     ` <1828884A29C6694DAF28B7E6B8A8237316E8CBCB-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
2011-11-02 23:09       ` Davis, Arlin R
     [not found]         ` <54347E5A035A054EAE9D05927FB467F916E931C5-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
2011-11-02 23:19           ` Hefty, Sean
2011-11-02 23:31             ` Davis, Arlin R
     [not found]               ` <54347E5A035A054EAE9D05927FB467F916E931E8-P5GAC/sN6hmkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
2011-11-03  2:28                 ` Jason Gunthorpe

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox