* ucma question - backlog limit
@ 2010-08-09 14:34 Steve Wise
[not found] ` <4C6011F7.3060209-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
0 siblings, 1 reply; 7+ messages in thread
From: Steve Wise @ 2010-08-09 14:34 UTC (permalink / raw)
To: Sean Hefty; +Cc: linux-rdma
Hey Sean,
What is the logic behind limiting the listen backlog to 128?
------
snipit from ucma.c:
enum {
UCMA_MAX_BACKLOG = 128
};
snipit from ucma_listen():
ctx->backlog = cmd.backlog > 0 && cmd.backlog < UCMA_MAX_BACKLOG ?
cmd.backlog : UCMA_MAX_BACKLOG;
------
Thanks,
Steve.
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ucma question - backlog limit
[not found] ` <4C6011F7.3060209-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
@ 2010-08-09 17:06 ` Bernard Metzler
[not found] ` <OFD9132F2D.FEEEA35D-ONC125777A.005D920E-C125777A.005DF132-Xeyd2O9EBijQT0dZR+AlfA@public.gmane.org>
0 siblings, 1 reply; 7+ messages in thread
From: Bernard Metzler @ 2010-08-09 17:06 UTC (permalink / raw)
To: Steve Wise
Cc: linux-rdma, linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA, Sean Hefty
I think that could be relaxed.
Running OFED (and softiwarp) on larger installations,
we already had to set this to some KByte...
Bernard.
linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org wrote on 08/09/2010 04:34:31 PM:
> Hey Sean,
>
> What is the logic behind limiting the listen backlog to 128?
>
> ------
>
> snipit from ucma.c:
>
> enum {
> UCMA_MAX_BACKLOG = 128
> };
>
>
> snipit from ucma_listen():
>
> ctx->backlog = cmd.backlog > 0 && cmd.backlog <
UCMA_MAX_BACKLOG ?
> cmd.backlog : UCMA_MAX_BACKLOG;
> ------
>
>
>
> Thanks,
>
> Steve.
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ucma question - backlog limit
[not found] ` <OFD9132F2D.FEEEA35D-ONC125777A.005D920E-C125777A.005DF132-Xeyd2O9EBijQT0dZR+AlfA@public.gmane.org>
@ 2010-08-09 17:28 ` Steve Wise
[not found] ` <4C603AA6.6030409-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
0 siblings, 1 reply; 7+ messages in thread
From: Steve Wise @ 2010-08-09 17:28 UTC (permalink / raw)
To: Bernard Metzler
Cc: linux-rdma, linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA, Sean Hefty
I agree. I think it should default to say 1024. Further, I propose we
make this a sysctl variable such that admins can tweak it as needed.
On 08/09/2010 12:06 PM, Bernard Metzler wrote:
> I think that could be relaxed.
> Running OFED (and softiwarp) on larger installations,
> we already had to set this to some KByte...
>
> Bernard.
>
> linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org wrote on 08/09/2010 04:34:31 PM:
>
>
>> Hey Sean,
>>
>> What is the logic behind limiting the listen backlog to 128?
>>
>> ------
>>
>> snipit from ucma.c:
>>
>> enum {
>> UCMA_MAX_BACKLOG = 128
>> };
>>
>>
>> snipit from ucma_listen():
>>
>> ctx->backlog = cmd.backlog> 0&& cmd.backlog<
>>
> UCMA_MAX_BACKLOG ?
>
>> cmd.backlog : UCMA_MAX_BACKLOG;
>> ------
>>
>>
>>
>> Thanks,
>>
>> Steve.
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
>> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* RE: ucma question - backlog limit
[not found] ` <4C603AA6.6030409-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
@ 2010-08-09 17:51 ` Hefty, Sean
[not found] ` <CF9C39F99A89134C9CF9C4CCB68B8DDF25A954B79E-osO9UTpF0USkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
0 siblings, 1 reply; 7+ messages in thread
From: Hefty, Sean @ 2010-08-09 17:51 UTC (permalink / raw)
To: Steve Wise, Bernard Metzler
Cc: linux-rdma,
linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> I agree. I think it should default to say 1024. Further, I propose we
> make this a sysctl variable such that admins can tweak it as needed.
The default is based on:
#define SOMAXCONN 128
I have no objection to making it adjustable.
- Sean
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ucma question - backlog limit
[not found] ` <CF9C39F99A89134C9CF9C4CCB68B8DDF25A954B79E-osO9UTpF0USkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
@ 2010-08-09 18:33 ` Steve Wise
[not found] ` <4C6049EE.1020907-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
0 siblings, 1 reply; 7+ messages in thread
From: Steve Wise @ 2010-08-09 18:33 UTC (permalink / raw)
To: Hefty, Sean
Cc: Bernard Metzler, linux-rdma,
linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
On 08/09/2010 12:51 PM, Hefty, Sean wrote:
>> I agree. I think it should default to say 1024. Further, I propose we
>> make this a sysctl variable such that admins can tweak it as needed.
>>
> The default is based on:
>
> #define SOMAXCONN 128
>
> I have no objection to making it adjustable.
>
> - Sean
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
For TCP, a dropped SYN will cause the peer to retransmit the SYN, and
the connection will be eventually established. So a small backlog limit
is probably ok for TCP. With iWARP, the connect request is TCP payload
an on established connection. So the connection request is transmitted
and acked at the TCP level by the time the connect request gets dropped
in the ucma. What finally happens is the peer times out (after many
seconds typically) and kills the connection. Further, a 32 node 256NP
OpenMPI job will generate > 128 connect requests on some ranks.
So I think its reasonable to up default max value. I'll post a patch
to change the max and make it a sysctl variable as well.
Ok?
Steve.
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* RE: ucma question - backlog limit
[not found] ` <4C6049EE.1020907-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
@ 2010-08-09 18:45 ` Hefty, Sean
2010-08-09 20:05 ` Bernard Metzler
1 sibling, 0 replies; 7+ messages in thread
From: Hefty, Sean @ 2010-08-09 18:45 UTC (permalink / raw)
To: Steve Wise
Cc: Bernard Metzler, linux-rdma,
linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> For TCP, a dropped SYN will cause the peer to retransmit the SYN, and
> the connection will be eventually established. So a small backlog limit
> is probably ok for TCP. With iWARP, the connect request is TCP payload
> an on established connection. So the connection request is transmitted
> and acked at the TCP level by the time the connect request gets dropped
> in the ucma. What finally happens is the peer times out (after many
> seconds typically) and kills the connection. Further, a 32 node 256NP
> OpenMPI job will generate > 128 connect requests on some ranks.
>
> So I think its reasonable to up default max value. I'll post a patch
> to change the max and make it a sysctl variable as well.
>
> Ok?
Yes - I think that makes sense.
- Sean
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: ucma question - backlog limit
[not found] ` <4C6049EE.1020907-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2010-08-09 18:45 ` Hefty, Sean
@ 2010-08-09 20:05 ` Bernard Metzler
1 sibling, 0 replies; 7+ messages in thread
From: Bernard Metzler @ 2010-08-09 20:05 UTC (permalink / raw)
To: Steve Wise
Cc: linux-rdma,
linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
Hefty, Sean
Agreed.
Currently, if running out of backlog, the cm actively
rejects the connection with -ENOMEM back to the provider
(see ucma_event_handler(), and cm_conn_req_handler()).
That should allow the provider to zap the TCP connection,
but its not very productive ;)
Bernard.
Steve Wise <swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org> wrote on 08/09/2010 08:33:18 PM:
> On 08/09/2010 12:51 PM, Hefty, Sean wrote:
> >> I agree. I think it should default to say 1024. Further, I propose
we
> >> make this a sysctl variable such that admins can tweak it as needed.
> >>
> > The default is based on:
> >
> > #define SOMAXCONN 128
> >
> > I have no objection to making it adjustable.
> >
> > - Sean
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-rdma"
in
> > the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> > More majordomo info at http://vger.kernel.org/majordomo-info.html
> >
>
>
> For TCP, a dropped SYN will cause the peer to retransmit the SYN, and
> the connection will be eventually established. So a small backlog limit
> is probably ok for TCP. With iWARP, the connect request is TCP payload
> an on established connection. So the connection request is transmitted
> and acked at the TCP level by the time the connect request gets dropped
> in the ucma. What finally happens is the peer times out (after many
> seconds typically) and kills the connection. Further, a 32 node 256NP
> OpenMPI job will generate > 128 connect requests on some ranks.
>
> So I think its reasonable to up default max value. I'll post a patch
> to change the max and make it a sysctl variable as well.
>
> Ok?
>
> Steve.
>
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2010-08-09 20:05 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-08-09 14:34 ucma question - backlog limit Steve Wise
[not found] ` <4C6011F7.3060209-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2010-08-09 17:06 ` Bernard Metzler
[not found] ` <OFD9132F2D.FEEEA35D-ONC125777A.005D920E-C125777A.005DF132-Xeyd2O9EBijQT0dZR+AlfA@public.gmane.org>
2010-08-09 17:28 ` Steve Wise
[not found] ` <4C603AA6.6030409-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2010-08-09 17:51 ` Hefty, Sean
[not found] ` <CF9C39F99A89134C9CF9C4CCB68B8DDF25A954B79E-osO9UTpF0USkrb+BlOpmy7fspsVTdybXVpNB7YpNyf8@public.gmane.org>
2010-08-09 18:33 ` Steve Wise
[not found] ` <4C6049EE.1020907-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2010-08-09 18:45 ` Hefty, Sean
2010-08-09 20:05 ` Bernard Metzler
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox