public inbox for linux-rdma@vger.kernel.org
 help / color / mirror / Atom feed
* libibverbs and versions
@ 2010-06-02 15:50 Steve Wise
       [not found] ` <4C067DE0.8020508-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: Steve Wise @ 2010-06-02 15:50 UTC (permalink / raw)
  To: Roland Dreier; +Cc: linux-rdma

Hey Roland, 

I have this library I'm writing that uses libibverbs.  Somehow it seems 
to be using the 1.0 version instead of the newer version (I'm 
guessing).  When I call ibv_create_cq()  I get a seg fault and the stack 
looks like this:

#0  0x000000319d80871d in pthread_mutex_lock () from /lib64/libpthread.so.0
#1  0x00007f60e93fac9a in __ibv_create_cq (context=0x7f60e8597c51, cqe=256,
    cq_context=0x0, channel=0x0, comp_vector=0) at src/verbs.c:278
#2  0x00007f60e93f727f in __ibv_create_cq_1_0 (context=0x61db30, cqe=256,
    cq_context=0x0, channel=0x0, comp_vector=0) at src/compat-1_0.c:649

At the  __ibv_create_cq_1_0 stack frame, context looks good:

(gdb) p *context
$3 = {device = 0x61ede0, ops = {
    query_device = 0x7f60e8596c98 <c4iw_query_device>,
    query_port = 0x7f60e8596d35 <c4iw_query_port>,
    alloc_pd = 0x7f60e8596d66 <c4iw_alloc_pd>,
    dealloc_pd = 0x7f60e8596dd9 <c4iw_free_pd>,
    reg_mr = 0x7f60e8596f77 <c4iw_reg_mr>, dereg_mr = 0,
    create_cq = 0x7f60e8596fae <c4iw_dereg_mr>, poll_cq = 0,
    req_notify_cq = 0, cq_event = 0,
<stuff truncated>

But context->real_context looks bad:

(gdb) p *context->real_context
$5 = {device = 0x30ec8348e5894855, ops = {query_device = 0xe0758948e87d8948,
    query_port = 0xe87d8b48dc558966, alloc_pd = 0x458948fffffdf2e8,
    dealloc_pd = 0xc78148f07d8b48f0, reg_mr = 0xffaa4ee800000130,
    rereg_mr = 0xc78148f07d8b48ff, dereg_mr = 0xfffddce8000000b0,
    alloc_mw = 0x7d8b480974c085ff, bind_mw = 0xb70fffffed72e8f0,
    dealloc_mw = 0x8b48e0758b48dc55, create_cq = 0x89ffffac70e8e87d,
    poll_cq = 0x358d48fc558bfc45, req_notify_cq = 0xb43d8d48000016a3,
<stuff truncated>


When I walk through a different program that does work, I don't see the 
__ibv_create_cq_1_0() call.   The program seems to call 
__ibv_create_cq() directly.

What am I doing wrong?  How does the loader determine which version is 
needed?

Thanks in advance,

Stevo

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: libibverbs and versions
       [not found] ` <4C067DE0.8020508-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
@ 2010-06-02 16:46   ` Roland Dreier
       [not found]     ` <adaeigptma8.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org>
  2010-06-02 17:32   ` Jason Gunthorpe
  1 sibling, 1 reply; 6+ messages in thread
From: Roland Dreier @ 2010-06-02 16:46 UTC (permalink / raw)
  To: Steve Wise; +Cc: linux-rdma

 > I have this library I'm writing that uses libibverbs.  Somehow it
 > seems to be using the 1.0 version instead of the newer version (I'm
 > guessing).  When I call ibv_create_cq()  I get a seg fault and the
 > stack looks like this:
 > 
 > #0  0x000000319d80871d in pthread_mutex_lock () from /lib64/libpthread.so.0
 > #1  0x00007f60e93fac9a in __ibv_create_cq (context=0x7f60e8597c51, cqe=256,
 >    cq_context=0x0, channel=0x0, comp_vector=0) at src/verbs.c:278
 > #2  0x00007f60e93f727f in __ibv_create_cq_1_0 (context=0x61db30, cqe=256,
 >    cq_context=0x0, channel=0x0, comp_vector=0) at src/compat-1_0.c:649

Code compiled against libibverbs 1.1 shouldn't ever hit the 1.0
compatibility code.  libibverbs is using versioned symbols, so if your
library is built/linked correctly, it should pick up the 1.1 versions.
How are you building your library?

 - R.
-- 
Roland Dreier <rolandd-FYB4Gu1CFyUAvxtiuMwx3w@public.gmane.org> || For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/index.html
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: libibverbs and versions
       [not found] ` <4C067DE0.8020508-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
  2010-06-02 16:46   ` Roland Dreier
@ 2010-06-02 17:32   ` Jason Gunthorpe
       [not found]     ` <20100602173245.GG15969-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org>
  1 sibling, 1 reply; 6+ messages in thread
From: Jason Gunthorpe @ 2010-06-02 17:32 UTC (permalink / raw)
  To: Steve Wise; +Cc: Roland Dreier, linux-rdma

On Wed, Jun 02, 2010 at 10:50:56AM -0500, Steve Wise wrote:

> I have this library I'm writing that uses libibverbs.  Somehow it seems  
> to be using the 1.0 version instead of the newer version (I'm guessing).  
> When I call ibv_create_cq()  I get a seg fault and the stack looks like 
> this:

Make sure you link your library to libibverbs and do not rely on
linking the app to libibverbs to resolve the symbols. Check with 
ldd foo.so

Jason
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: libibverbs and versions
       [not found]     ` <adaeigptma8.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org>
@ 2010-06-02 17:39       ` Steve Wise
  0 siblings, 0 replies; 6+ messages in thread
From: Steve Wise @ 2010-06-02 17:39 UTC (permalink / raw)
  To: Roland Dreier; +Cc: linux-rdma

Roland Dreier wrote:
>  > I have this library I'm writing that uses libibverbs.  Somehow it
>  > seems to be using the 1.0 version instead of the newer version (I'm
>  > guessing).  When I call ibv_create_cq()  I get a seg fault and the
>  > stack looks like this:
>  > 
>  > #0  0x000000319d80871d in pthread_mutex_lock () from /lib64/libpthread.so.0
>  > #1  0x00007f60e93fac9a in __ibv_create_cq (context=0x7f60e8597c51, cqe=256,
>  >    cq_context=0x0, channel=0x0, comp_vector=0) at src/verbs.c:278
>  > #2  0x00007f60e93f727f in __ibv_create_cq_1_0 (context=0x61db30, cqe=256,
>  >    cq_context=0x0, channel=0x0, comp_vector=0) at src/compat-1_0.c:649
>
> Code compiled against libibverbs 1.1 shouldn't ever hit the 1.0
> compatibility code.  libibverbs is using versioned symbols, so if your
> library is built/linked correctly, it should pick up the 1.1 versions.
> How are you building your library?
>
>  - R.
>   

Apparently I'm building it incorrectly. :)    I stole the Makefile.am 
and configure.in from one of my other libs.
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: libibverbs and versions
       [not found]     ` <20100602173245.GG15969-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org>
@ 2010-06-02 17:40       ` Steve Wise
       [not found]         ` <4C069774.4050402-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: Steve Wise @ 2010-06-02 17:40 UTC (permalink / raw)
  To: Jason Gunthorpe; +Cc: Roland Dreier, linux-rdma


Jason Gunthorpe wrote:
> On Wed, Jun 02, 2010 at 10:50:56AM -0500, Steve Wise wrote:
>
>   
>> I have this library I'm writing that uses libibverbs.  Somehow it seems  
>> to be using the 1.0 version instead of the newer version (I'm guessing).  
>> When I call ibv_create_cq()  I get a seg fault and the stack looks like 
>> this:
>>     
>
> Make sure you link your library to libibverbs and do not rely on
> linking the app to libibverbs to resolve the symbols. Check with 
> ldd foo.so
>
> Jason
>   

This was the issue.  My library wasn't explicitly linking via 
-libverbs.  When I added that, it worked.

But how did it manage to link to the old version I wonder?


--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: libibverbs and versions
       [not found]         ` <4C069774.4050402-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
@ 2010-06-02 17:59           ` Jason Gunthorpe
  0 siblings, 0 replies; 6+ messages in thread
From: Jason Gunthorpe @ 2010-06-02 17:59 UTC (permalink / raw)
  To: Steve Wise; +Cc: Roland Dreier, linux-rdma

On Wed, Jun 02, 2010 at 12:40:04PM -0500, Steve Wise wrote:
>>> I have this library I'm writing that uses libibverbs.  Somehow it 
>>> seems  to be using the 1.0 version instead of the newer version (I'm 
>>> guessing).  When I call ibv_create_cq()  I get a seg fault and the 
>>> stack looks like this:

>> Make sure you link your library to libibverbs and do not rely on
>> linking the app to libibverbs to resolve the symbols. Check with ldd 
>> foo.so

> This was the issue.  My library wasn't explicitly linking via -libverbs.  
> When I added that, it worked.
>
> But how did it manage to link to the old version I wonder?

If the linker doesn't have the library to look through an external
symbol is simply set as 'ibv_create_cq' in the symbol table. That is
the same name an app using libibverbs 1.0 would use to refer to the
old symbol. When linker has the library it sees that the proper name
for the symbol is now ibv_create_cq@1.1 and puts that in the symbol
table instead.

It is critical that all library are explicitly linked to all their
dependent libraries, otherwise a variety of bad things can happen.

Hmm.. Also, this reveals that it was a mistake to change the the
argument objects and use versioned symbols to cover that up -
versioned symbols cannot be used that way, it makes a really big mess
in real use cases. Curious that nobody has noticed this before..

Basically what Steve did is linked his library to verbs 1.0 and linked
the app to 1.1, and then passed an ibv_context across his library's
function call boundary.. Kablooie.

Makes me wonder if the versioned symbol bother is even worth the
trouble.

Jason
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2010-06-02 17:59 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-06-02 15:50 libibverbs and versions Steve Wise
     [not found] ` <4C067DE0.8020508-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2010-06-02 16:46   ` Roland Dreier
     [not found]     ` <adaeigptma8.fsf-BjVyx320WGW9gfZ95n9DRSW4+XlvGpQz@public.gmane.org>
2010-06-02 17:39       ` Steve Wise
2010-06-02 17:32   ` Jason Gunthorpe
     [not found]     ` <20100602173245.GG15969-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org>
2010-06-02 17:40       ` Steve Wise
     [not found]         ` <4C069774.4050402-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org>
2010-06-02 17:59           ` Jason Gunthorpe

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox