From: Nicolas Morey-Chaisemartin <NMoreyChaisemartin-l3A5Bk7waGM@public.gmane.org>
To: "linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
<linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>
Subject: [BUG] SRP daemon and SM migration
Date: Mon, 4 Dec 2017 12:08:32 +0100 [thread overview]
Message-ID: <bd47faf6-eae0-d520-42ce-eb3d6bce4376@suse.de> (raw)
Hi
A bug was reported to SUSE concerning the srp_daemon.
When it's running and the matser SM changes host (host shutdown, or new higher priority SM started), srp_daemon outputs these errors at every scan:
srp_daemon[25394]: No response to inform info registration
srp_daemon[25394]: Fail to register to traps, maybe there is no opensm running on fabric or IB port is down
It seems this was introduced by this commit:
commit 4952e5f7df0c93d6f3972975106c5e06623a301d
Author: Roi Dayan <roid-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Date: Thu Mar 21 17:38:11 2013 +0200
Fix a memory leak
Avoid leaking one IB AH per rescan. Only allocate a new AH if the
port LID changed or after a LID has been assigned by the SM.
Signed-off-by: Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org>
Signed-off-by: Roi Dayan <roid-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
One of the side effect of the leak fix is that create_ah is only called when the local port lid changes.
And register_to_traps uses the sm_id from ud_res which is filled by create_ah.
Thus if the SM lid changes but not the local LID, it keeps trying to contact the previous LID.
I tried fixing it by getting get_port_lid to also return the SM lid and calling create_ah on local lid OR SM lid changes.
It seems to be working at first (at least the call is always done to the right lid).
But after a while (doing ping pong between 2 SM by changing the priority) I still end up getting the error above.
Even through the LID is right this time.
It may not be the same bug though. Is there some calls to do to unregister from the previous SM before registering to the new one ?
Any idea on what could cause this ? I don't seem to get any more infos in all the logs I've checked...
Nicolas
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next reply other threads:[~2017-12-04 11:08 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-12-04 11:08 Nicolas Morey-Chaisemartin [this message]
[not found] ` <bd47faf6-eae0-d520-42ce-eb3d6bce4376-l3A5Bk7waGM@public.gmane.org>
2017-12-04 13:50 ` [BUG] SRP daemon and SM migration Hal Rosenstock
[not found] ` <1978d0ad-909f-ed8b-5b54-e8c465d3641f-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>
2017-12-08 9:36 ` [RFC rdma-core] srp_daemon: handle SM lid change Nicolas Morey-Chaisemartin
[not found] ` <82bd1d9a-7e6a-324e-1e00-43fcf130faf0-IBi9RG/b67k@public.gmane.org>
2017-12-08 13:33 ` Hal Rosenstock
[not found] ` <82dc0912-3ac3-9548-2c3b-703c6e1c9c95-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>
2017-12-08 13:56 ` Dennis Dalessandro
[not found] ` <dce98958-a6e2-b289-2a58-609d1914affb-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
2017-12-08 16:41 ` Bart Van Assche
2017-12-08 17:35 ` Nicolas Morey-Chaisemartin
2017-12-11 14:46 ` [RFC rdma-core 2/2] srp_daemon: fix CQ handling Nicolas Morey-Chaisemartin
[not found] ` <f35bfdad-9027-d2be-9706-b5a0edd9a778-IBi9RG/b67k@public.gmane.org>
2017-12-11 16:30 ` Bart Van Assche
[not found] ` <1513009807.2747.11.camel-Sjgp3cTcYWE@public.gmane.org>
2017-12-11 16:38 ` Nicolas Morey-Chaisemartin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=bd47faf6-eae0-d520-42ce-eb3d6bce4376@suse.de \
--to=nmoreychaisemartin-l3a5bk7wagm@public.gmane.org \
--cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox