public inbox for linux-rdma@vger.kernel.org
 help / color / mirror / Atom feed
* ucma question - backlog limit
@ 2010-08-09 14:34 Steve Wise
       [not found] ` <4C6011F7.3060209-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Steve Wise @ 2010-08-09 14:34 UTC (permalink / raw)
  To: Sean Hefty; +Cc: linux-rdma

Hey Sean,

What is the logic behind limiting the listen backlog to 128?

------

snipit from ucma.c:

enum {
         UCMA_MAX_BACKLOG        = 128
};


snipit from ucma_listen():

         ctx->backlog = cmd.backlog > 0 && cmd.backlog < UCMA_MAX_BACKLOG ?
                        cmd.backlog : UCMA_MAX_BACKLOG;
------



Thanks,

Steve.
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: ucma question - backlog limit
       [not found] ` <4C6011F7.3060209-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
@ 2010-08-09 17:06   ` Bernard Metzler
       [not found]     ` <OFD9132F2D.FEEEA35D-ONC125777A.005D920E-C125777A.005DF132-Xeyd2O9EBijQT0dZR+AlfA@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Bernard Metzler @ 2010-08-09 17:06 UTC (permalink / raw)
  To: Steve Wise
  Cc: linux-rdma, linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA, Sean Hefty

I think that could be relaxed.
Running OFED (and softiwarp) on larger installations,
we already had to set this to some KByte...

Bernard.

linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org wrote on 08/09/2010 04:34:31 PM:

> Hey Sean,
>
> What is the logic behind limiting the listen backlog to 128?
>
> ------
>
> snipit from ucma.c:
>
> enum {
>          UCMA_MAX_BACKLOG        = 128
> };
>
>
> snipit from ucma_listen():
>
>          ctx->backlog = cmd.backlog > 0 && cmd.backlog <
UCMA_MAX_BACKLOG ?
>                         cmd.backlog : UCMA_MAX_BACKLOG;
> ------
>
>
>
> Thanks,
>
> Steve.
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: ucma question - backlog limit
       [not found]     ` <OFD9132F2D.FEEEA35D-ONC125777A.005D920E-C125777A.005DF132-Xeyd2O9EBijQT0dZR+AlfA@public.gmane.org>
@ 2010-08-09 17:28       ` Steve Wise
       [not found]         ` <4C603AA6.6030409-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Steve Wise @ 2010-08-09 17:28 UTC (permalink / raw)
  To: Bernard Metzler
  Cc: linux-rdma, linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA, Sean Hefty

I agree.  I think it should default to say 1024.  Further, I propose we 
make this a sysctl variable such that admins can tweak it as needed.



On 08/09/2010 12:06 PM, Bernard Metzler wrote:
> I think that could be relaxed.
> Running OFED (and softiwarp) on larger installations,
> we already had to set this to some KByte...
>
> Bernard.
>
> linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org wrote on 08/09/2010 04:34:31 PM:
>
>    
>> Hey Sean,
>>
>> What is the logic behind limiting the listen backlog to 128?
>>
>> ------
>>
>> snipit from ucma.c:
>>
>> enum {
>>           UCMA_MAX_BACKLOG        = 128
>> };
>>
>>
>> snipit from ucma_listen():
>>
>>           ctx->backlog = cmd.backlog>  0&&  cmd.backlog<
>>      
> UCMA_MAX_BACKLOG ?
>    
>>                          cmd.backlog : UCMA_MAX_BACKLOG;
>> ------
>>
>>
>>
>> Thanks,
>>
>> Steve.
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
>> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>      
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>    

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: ucma question - backlog limit
       [not found]         ` <4C603AA6.6030409-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
@ 2010-08-09 17:51           ` Hefty, Sean
       [not found]             ` <CF9C39F99A89134C9CF9C4CCB68B8DDF25A954B79E-osO9UTpF0USkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Hefty, Sean @ 2010-08-09 17:51 UTC (permalink / raw)
  To: Steve Wise, Bernard Metzler
  Cc: linux-rdma,
	linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org

> I agree.  I think it should default to say 1024.  Further, I propose we
> make this a sysctl variable such that admins can tweak it as needed.

The default is based on:

#define SOMAXCONN       128

I have no objection to making it adjustable.

- Sean
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: ucma question - backlog limit
       [not found]             ` <CF9C39F99A89134C9CF9C4CCB68B8DDF25A954B79E-osO9UTpF0USkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
@ 2010-08-09 18:33               ` Steve Wise
       [not found]                 ` <4C6049EE.1020907-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
  0 siblings, 1 reply; 7+ messages in thread
From: Steve Wise @ 2010-08-09 18:33 UTC (permalink / raw)
  To: Hefty, Sean
  Cc: Bernard Metzler, linux-rdma,
	linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org

On 08/09/2010 12:51 PM, Hefty, Sean wrote:
>> I agree.  I think it should default to say 1024.  Further, I propose we
>> make this a sysctl variable such that admins can tweak it as needed.
>>      
> The default is based on:
>
> #define SOMAXCONN       128
>
> I have no objection to making it adjustable.
>
> - Sean
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>    


For TCP, a dropped SYN will cause the peer to retransmit the SYN, and 
the connection will be eventually established.  So a small backlog limit 
is probably ok for TCP.  With iWARP, the connect request is TCP payload 
an on established connection.  So the connection request is transmitted 
and acked at the TCP level by the time the connect request gets dropped 
in the ucma.  What finally happens is the peer times out (after many 
seconds typically) and kills the connection.  Further, a 32 node 256NP 
OpenMPI job will generate > 128 connect requests on some ranks.

So I think its reasonable to up default max value.   I'll post a patch 
to change the max and make it a sysctl variable as well.

Ok?

Steve.

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: ucma question - backlog limit
       [not found]                 ` <4C6049EE.1020907-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
@ 2010-08-09 18:45                   ` Hefty, Sean
  2010-08-09 20:05                   ` Bernard Metzler
  1 sibling, 0 replies; 7+ messages in thread
From: Hefty, Sean @ 2010-08-09 18:45 UTC (permalink / raw)
  To: Steve Wise
  Cc: Bernard Metzler, linux-rdma,
	linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org

> For TCP, a dropped SYN will cause the peer to retransmit the SYN, and
> the connection will be eventually established.  So a small backlog limit
> is probably ok for TCP.  With iWARP, the connect request is TCP payload
> an on established connection.  So the connection request is transmitted
> and acked at the TCP level by the time the connect request gets dropped
> in the ucma.  What finally happens is the peer times out (after many
> seconds typically) and kills the connection.  Further, a 32 node 256NP
> OpenMPI job will generate > 128 connect requests on some ranks.
> 
> So I think its reasonable to up default max value.   I'll post a patch
> to change the max and make it a sysctl variable as well.
> 
> Ok?

Yes - I think that makes sense.

- Sean
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: ucma question - backlog limit
       [not found]                 ` <4C6049EE.1020907-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
  2010-08-09 18:45                   ` Hefty, Sean
@ 2010-08-09 20:05                   ` Bernard Metzler
  1 sibling, 0 replies; 7+ messages in thread
From: Bernard Metzler @ 2010-08-09 20:05 UTC (permalink / raw)
  To: Steve Wise
  Cc: linux-rdma,
	linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	Hefty, Sean

Agreed.

Currently, if running out of backlog, the cm actively
rejects the connection with -ENOMEM back to the provider
(see ucma_event_handler(), and cm_conn_req_handler()).
That should allow the provider to zap the TCP connection,
but its not very productive ;)

Bernard.

Steve Wise <swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org> wrote on 08/09/2010 08:33:18 PM:

> On 08/09/2010 12:51 PM, Hefty, Sean wrote:
> >> I agree.  I think it should default to say 1024.  Further, I propose
we
> >> make this a sysctl variable such that admins can tweak it as needed.
> >>
> > The default is based on:
> >
> > #define SOMAXCONN       128
> >
> > I have no objection to making it adjustable.
> >
> > - Sean
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-rdma"
in
> > the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> > More majordomo info at  http://vger.kernel.org/majordomo-info.html
> >
>
>
> For TCP, a dropped SYN will cause the peer to retransmit the SYN, and
> the connection will be eventually established.  So a small backlog limit
> is probably ok for TCP.  With iWARP, the connect request is TCP payload
> an on established connection.  So the connection request is transmitted
> and acked at the TCP level by the time the connect request gets dropped
> in the ucma.  What finally happens is the peer times out (after many
> seconds typically) and kills the connection.  Further, a 32 node 256NP
> OpenMPI job will generate > 128 connect requests on some ranks.
>
> So I think its reasonable to up default max value.   I'll post a patch
> to change the max and make it a sysctl variable as well.
>
> Ok?
>
> Steve.
>

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2010-08-09 20:05 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-08-09 14:34 ucma question - backlog limit Steve Wise
     [not found] ` <4C6011F7.3060209-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2010-08-09 17:06   ` Bernard Metzler
     [not found]     ` <OFD9132F2D.FEEEA35D-ONC125777A.005D920E-C125777A.005DF132-Xeyd2O9EBijQT0dZR+AlfA@public.gmane.org>
2010-08-09 17:28       ` Steve Wise
     [not found]         ` <4C603AA6.6030409-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2010-08-09 17:51           ` Hefty, Sean
     [not found]             ` <CF9C39F99A89134C9CF9C4CCB68B8DDF25A954B79E-osO9UTpF0USkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
2010-08-09 18:33               ` Steve Wise
     [not found]                 ` <4C6049EE.1020907-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2010-08-09 18:45                   ` Hefty, Sean
2010-08-09 20:05                   ` Bernard Metzler

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox