From: Hal Rosenstock <hal-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>
To: Bob.Ciotti-NSQ8wuThN14@public.gmane.org
Cc: "linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
<linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>
Subject: Re: rdmacm issue
Date: Wed, 10 Jun 2015 09:35:29 -0400 [thread overview]
Message-ID: <55783D21.1050104@dev.mellanox.co.il> (raw)
In-Reply-To: <5577986B.7070702-NSQ8wuThN14@public.gmane.org>
On 6/9/2015 9:52 PM, Bob Ciotti wrote:
> We have an issue where lustre servers and clients cannot talk to each
> other.
> There are about 11,000 clients all trying to connect to a server that
> just been rebooted
> (nbp6-oss3 in this example)
>
> pfe21 is a lustre client thats trying to remount the filesystem from
> nbp6-oss3.
>
> running rping server on pfe21 hangs and waits until the client tried to
> connect, then it prints out
> debug information up to cq_thread started. and hangs there, for a minute
> or so until issuing the two UNREACHABLE errors:
>
> pfe21 ~ # rping -v -s -d -P -p2 -a 10.151.27.19
> port 2
> created cm_id 0x60e350
> rdma_bind_addr successful
> rdma_listen
> cma_event type RDMA_CM_EVENT_CONNECT_REQUEST cma_id 0x60b620 (child)
> child cma 0x60b620
> created pd 0x60bd80
> created channel 0x60bda0
> created cq 0x60bdc0
> created qp 0x60bf00
> rping_setup_buffers called on cb 0x60b8c0
> allocated & registered buffers...
> accepting client connection request
> cq_thread started.
> cma_event type RDMA_CM_EVENT_UNREACHABLE cma_id 0x60b620 (child)
> cma event RDMA_CM_EVENT_UNREACHABLE, error -110
>
>
> The rping client is started below. As soon as it starts, it runs up to
> the point
> of cq_thread started. Hangs there and eventually times out as well,
> issuing 4 error
> messages:
>
> nbp6-oss3 ~ # rping -c -vp-d -S 30 -p 2 -a 10.151.27.19
> size 30
> port 2
> created cm_id 0x60f640
> cma_event type RDMA_CM_EVENT_ADDR_RESOLVED cma_id 0x60f640 (parent)
> cma_event type RDMA_CM_EVENT_ROUTE_RESOLVED cma_id 0x60f640 (parent)
> rdma_resolve_addr - rdma_resolve_route successful
> created pd 0x60ab10
> created channel 0x60ab30
> created cq 0x60ab50
> created qp 0x60ac60
> rping_setup_buffers called on cb 0x6072e0
> allocated & registered buffers...
> cq_thread started.
> cma_event type RDMA_CM_EVENT_UNREACHABLE cma_id 0x60f640 (parent)
> cma event RDMA_CM_EVENT_UNREACHABLE, error -110
> wait for CONNECTED state 4
> connect error -1
>
>
> Any ideas? The neighbor entries on both side were in a reachable state
> before the test. And the two systems did manage to find one another.
> Keep in mind that when this
> is going on, 11,000+ clients are trying to connect to nbp6-oss3
>
> Normally lustre mounts fine and rping have no issues. We have noticed
> some neighbor resolution issues and are considering ucast_solicit,
> mcast_solicit, and unres_qlen changes
> because we also occationally experience issues with icmp ping. Its
> typically been the experience that changing these configs have little
> effect. Yes its probably the case that
> unicast arp refresh has long failed and 11,000+ clients may be
> multicasting for
> for arp.
>
> Any help or insight greatly appreciated.
RDMA_CM_EVENT_UNREACHABLE is indicated when there are timeouts in
underlying CM protocol exchange. I suspect that the server is really
busy and doesn't respond to the low level CM MADs in a timely manner.
RDMA CM (and other kernel ULPs like IPoIB and SRP use hard coded local
and remote response timeouts of 20 which is ~4.3 sec. This was discussed
back in 2006 in
http://comments.gmane.org/gmane.linux.drivers.openib/27664. In this
scenario, the response took more than 30 seconds. More recently, there
was proposal to base RDMA CM response timeout on subnet timeout
(http://permalink.gmane.org/gmane.linux.drivers.rdma/19969).
HTH,
Hal
>
> thx, bob
>
>
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2015-06-10 13:35 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-06-10 1:52 rdmacm issue Bob Ciotti
[not found] ` <5577986B.7070702-NSQ8wuThN14@public.gmane.org>
2015-06-10 13:35 ` Hal Rosenstock [this message]
[not found] ` <55783D21.1050104-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>
2015-06-10 15:45 ` Hefty, Sean
2015-06-10 16:51 ` Bob Ciotti
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=55783D21.1050104@dev.mellanox.co.il \
--to=hal-ldsdmyg8hgv8yrgs2mwiifqbs+8scbdb@public.gmane.org \
--cc=Bob.Ciotti-NSQ8wuThN14@public.gmane.org \
--cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox