* failed test on bug 2027
@ 2010-04-27 10:34 Andrea Gozzelino
[not found] ` <5464727.1272364442704.SLOX.WebMail.wwwrun-XDIR3SKYeFbgKi2NxijLtw@public.gmane.org>
0 siblings, 1 reply; 5+ messages in thread
From: Andrea Gozzelino @ 2010-04-27 10:34 UTC (permalink / raw)
To: amirv-VPRAkNaXOzVS1MOuV/RT9w
Cc: swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW,
linux-rdma-u79uwXL29TY76Z2rM5mHXA
[-- Attachment #1: Type: text/plain, Size: 1908 bytes --]
Hi Amir,
I have checked the bug solution with OFED-1.5.2-20100425-0600 version
without success.
TEST LOG:
client machine
export LD_PRELOAD=/usr/lib64/libsdp.so
netperf -H address -c -C -l 60 -- -m 65536
tail -f /var/log/libsdp.log
OUTPUT
Tue Apr 27 10:08:38 2010 netperf[19051] libsdp Error connect: failed for
SDP
fd:6 with error:Cannot allocate memory
Tue Apr 27 11:47:55 2010 netperf[19081] libsdp Error connect: failed for
SDP
fd:6 with error:Cannot allocate memory
dmesg
OUTPUT
client_script[13876]: segfault at 00007fffef8de000 rip 000000387d07ad8a
rsp
00007fffef8d5648 error 6
C_copiato[20437]: segfault at 00007ffff0257b3c rip 00000000004009e3 rsp
00007ffff0257b30 error 6
client_20[1539]: segfault at 00007fffa2acd85c rip 0000000000400a80 rsp
00007fffa2acd850 error 6
sdp_init_qp:95 sdp_sock( 2100:2 55190:0): recv sge's. capability: 4
needed: 9
sdp_init_qp:95 sdp_sock( 2100:2 41003:0): recv sge's. capability: 4
needed: 9
sdp_init_qp:95 sdp_sock( 2100:2 55192:0): recv sge's. capability: 4
needed: 9
sdp_init_qp:95 sdp_sock( 2100:2 38542:0): recv sge's. capability: 4
needed: 9
------------------
server machine
export LD_PRELOAD=/usr/lib64/libsdp.so
netserver
Starting netserver at port 12865
Starting netserver at hostname 0.0.0.0 port 12865 and family AF_UNSPEC
------------------
Amir, I think that bug 2028 is also involved in NE020 SDP working.
Steve, is it possible?
If yes, I will check the bug 2027 and 2028 solutions in July 2010.
Is it correct?
I reopen the bug 2027 because I don't see any changes.
Please consider that the bug 2027 could be connected with the 2028 one
in Bugzilla.
Keep in touch.
Thank you very much.
Regards,
Andrea
Andrea Gozzelino
INFN - Laboratori Nazionali di Legnaro (LNL)
Viale dell'Universita' 2
I-35020 - Legnaro (PD)- ITALIA
Tel: +39 049 8068346
Fax: +39 049 641925
Mail: andrea.gozzelino-PK20h7lG/Rc1GQ1Ptb7lUw@public.gmane.org
[-- Attachment #2: test.txt --]
[-- Type: text/plain, Size: 1207 bytes --]
client machine
export LD_PRELOAD=/usr/lib64/libsdp.so
netperf -H address -c -C -l 60 -- -m 65536
tail -f /var/log/libsdp.log
OUTPUT
Tue Apr 27 10:08:38 2010 netperf[19051] libsdp Error connect: failed for SDP fd:6 with error:Cannot allocate memory
Tue Apr 27 11:47:55 2010 netperf[19081] libsdp Error connect: failed for SDP fd:6 with error:Cannot allocate memory
dmesg
OUTPUT
client_script[13876]: segfault at 00007fffef8de000 rip 000000387d07ad8a rsp 00007fffef8d5648 error 6
C_copiato[20437]: segfault at 00007ffff0257b3c rip 00000000004009e3 rsp 00007ffff0257b30 error 6
client_20[1539]: segfault at 00007fffa2acd85c rip 0000000000400a80 rsp 00007fffa2acd850 error 6
sdp_init_qp:95 sdp_sock( 2100:2 55190:0): recv sge's. capability: 4 needed: 9
sdp_init_qp:95 sdp_sock( 2100:2 41003:0): recv sge's. capability: 4 needed: 9
sdp_init_qp:95 sdp_sock( 2100:2 55192:0): recv sge's. capability: 4 needed: 9
sdp_init_qp:95 sdp_sock( 2100:2 38542:0): recv sge's. capability: 4 needed: 9
------------------
server machine
export LD_PRELOAD=/usr/lib64/libsdp.so
netserver
Starting netserver at port 12865
Starting netserver at hostname 0.0.0.0 port 12865 and family AF_UNSPEC
------------------
^ permalink raw reply [flat|nested] 5+ messages in thread[parent not found: <5464727.1272364442704.SLOX.WebMail.wwwrun-XDIR3SKYeFbgKi2NxijLtw@public.gmane.org>]
* Re: failed test on bug 2027 [not found] ` <5464727.1272364442704.SLOX.WebMail.wwwrun-XDIR3SKYeFbgKi2NxijLtw@public.gmane.org> @ 2010-04-27 11:25 ` Amir Vadai [not found] ` <4BD6C9C3.6070905-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org> 0 siblings, 1 reply; 5+ messages in thread From: Amir Vadai @ 2010-04-27 11:25 UTC (permalink / raw) To: Andrea Gozzelino Cc: swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org, linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org Andrea Hi, I am sorry - the commit missed the 24.4.2010 build which I asked you to test. Please test this build (the latest): OFED-1.5.2-20100426-0600 - Amir On 04/27/2010 01:34 PM, Andrea Gozzelino wrote: > Hi Amir, > I have checked the bug solution with OFED-1.5.2-20100425-0600 version > without success. > > TEST LOG: > client machine > export LD_PRELOAD=/usr/lib64/libsdp.so > netperf -H address -c -C -l 60 -- -m 65536 > tail -f /var/log/libsdp.log > > OUTPUT > Tue Apr 27 10:08:38 2010 netperf[19051] libsdp Error connect: failed for > SDP > fd:6 with error:Cannot allocate memory > Tue Apr 27 11:47:55 2010 netperf[19081] libsdp Error connect: failed for > SDP > fd:6 with error:Cannot allocate memory > > dmesg > > OUTPUT > client_script[13876]: segfault at 00007fffef8de000 rip 000000387d07ad8a > rsp > 00007fffef8d5648 error 6 > C_copiato[20437]: segfault at 00007ffff0257b3c rip 00000000004009e3 rsp > 00007ffff0257b30 error 6 > client_20[1539]: segfault at 00007fffa2acd85c rip 0000000000400a80 rsp > 00007fffa2acd850 error 6 > sdp_init_qp:95 sdp_sock( 2100:2 55190:0): recv sge's. capability: 4 > needed: 9 > sdp_init_qp:95 sdp_sock( 2100:2 41003:0): recv sge's. capability: 4 > needed: 9 > sdp_init_qp:95 sdp_sock( 2100:2 55192:0): recv sge's. capability: 4 > needed: 9 > sdp_init_qp:95 sdp_sock( 2100:2 38542:0): recv sge's. capability: 4 > needed: 9 > > ------------------ > > server machine > export LD_PRELOAD=/usr/lib64/libsdp.so > netserver > Starting netserver at port 12865 > Starting netserver at hostname 0.0.0.0 port 12865 and family AF_UNSPEC > > ------------------ > Amir, I think that bug 2028 is also involved in NE020 SDP working. > Steve, is it possible? > If yes, I will check the bug 2027 and 2028 solutions in July 2010. > Is it correct? > > I reopen the bug 2027 because I don't see any changes. > Please consider that the bug 2027 could be connected with the 2028 one > in Bugzilla. > > Keep in touch. > Thank you very much. > Regards, > Andrea > > > > > Andrea Gozzelino > > INFN - Laboratori Nazionali di Legnaro (LNL) > Viale dell'Universita' 2 > I-35020 - Legnaro (PD)- ITALIA > Tel: +39 049 8068346 > Fax: +39 049 641925 > Mail: andrea.gozzelino-PK20h7lG/Rc1GQ1Ptb7lUw@public.gmane.org > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 5+ messages in thread
[parent not found: <4BD6C9C3.6070905-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org>]
* SDP: test bug 2027 and new error [not found] ` <4BD6C9C3.6070905-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org> @ 2010-04-28 14:00 ` Andrea Gozzelino [not found] ` <2307255.1272463222662.SLOX.WebMail.wwwrun-XDIR3SKYeFbgKi2NxijLtw@public.gmane.org> 0 siblings, 1 reply; 5+ messages in thread From: Andrea Gozzelino @ 2010-04-28 14:00 UTC (permalink / raw) To: Amir Vadai Cc: swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org, linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org On Apr 27, 2010 01:25 PM, Amir Vadai <amirv-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org> wrote: > Andrea Hi, > > I am sorry - the commit missed the 24.4.2010 build which I asked you > to > test. > Please test this build (the latest): OFED-1.5.2-20100426-0600 > > - Amir > > > On 04/27/2010 01:34 PM, Andrea Gozzelino wrote: > > Hi Amir, > > I have checked the bug solution with OFED-1.5.2-20100425-0600 > > version > > without success. > > > > TEST LOG: > > client machine > > export LD_PRELOAD=/usr/lib64/libsdp.so > > netperf -H address -c -C -l 60 -- -m 65536 > > tail -f /var/log/libsdp.log > > > > OUTPUT > > Tue Apr 27 10:08:38 2010 netperf[19051] libsdp Error connect: failed > > for > > SDP > > fd:6 with error:Cannot allocate memory > > Tue Apr 27 11:47:55 2010 netperf[19081] libsdp Error connect: failed > > for > > SDP > > fd:6 with error:Cannot allocate memory > > > > dmesg > > > > OUTPUT > > client_script[13876]: segfault at 00007fffef8de000 rip > > 000000387d07ad8a > > rsp > > 00007fffef8d5648 error 6 > > C_copiato[20437]: segfault at 00007ffff0257b3c rip 00000000004009e3 > > rsp > > 00007ffff0257b30 error 6 > > client_20[1539]: segfault at 00007fffa2acd85c rip 0000000000400a80 > > rsp > > 00007fffa2acd850 error 6 > > sdp_init_qp:95 sdp_sock( 2100:2 55190:0): recv sge's. capability: 4 > > needed: 9 > > sdp_init_qp:95 sdp_sock( 2100:2 41003:0): recv sge's. capability: 4 > > needed: 9 > > sdp_init_qp:95 sdp_sock( 2100:2 55192:0): recv sge's. capability: 4 > > needed: 9 > > sdp_init_qp:95 sdp_sock( 2100:2 38542:0): recv sge's. capability: 4 > > needed: 9 > > > > ------------------ > > > > server machine > > export LD_PRELOAD=/usr/lib64/libsdp.so > > netserver > > Starting netserver at port 12865 > > Starting netserver at hostname 0.0.0.0 port 12865 and family > > AF_UNSPEC > > > > ------------------ > > Amir, I think that bug 2028 is also involved in NE020 SDP working. > > Steve, is it possible? > > If yes, I will check the bug 2027 and 2028 solutions in July 2010. > > Is it correct? > > > > I reopen the bug 2027 because I don't see any changes. > > Please consider that the bug 2027 could be connected with the 2028 > > one > > in Bugzilla. > > > > Keep in touch. > > Thank you very much. > > Regards, > > Andrea > > > > > > > > > > Andrea Gozzelino > > > > INFN - Laboratori Nazionali di Legnaro (LNL) > > Viale dell'Universita' 2 > > I-35020 - Legnaro (PD)- ITALIA > > Tel: +39 049 8068346 > > Fax: +39 049 641925 > > Mail: andrea.gozzelino-PK20h7lG/Rc1GQ1Ptb7lUw@public.gmane.org > > > Hi Amir, I have tested your solution of bug 2027 with OFED-1.5.2-20100426-0600. Netserver/netperf does not work with Socket Direct Protocol SDP. Now the error in the log file is: libsdp error connect: failed for SDP fd with error No such device sdp_init_qp:105 SDP not available on device nes0 I have a look in sdp_cma.c file: I see the message text near the previous one about # SGE capability. Amir, could you please comment these error? Probably I can close the bug 2027 because the solution is OK and I will open another one about "SDP not available on device nes0". Steve, is there a connession with the Fast Memory Region (bug 2028)? At the end, #SGE capability is now OK but SDP on NE020 cards - driver iw_nes does not work yet properly. I will wait your suggestions. Regards, Andrea Andrea Gozzelino INFN - Laboratori Nazionali di Legnaro (LNL) Viale dell'Universita' 2 I-35020 - Legnaro (PD)- ITALIA Tel: +39 049 8068346 Fax: +39 049 641925 Mail: andrea.gozzelino-PK20h7lG/Rc1GQ1Ptb7lUw@public.gmane.org -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 5+ messages in thread
[parent not found: <2307255.1272463222662.SLOX.WebMail.wwwrun-XDIR3SKYeFbgKi2NxijLtw@public.gmane.org>]
* Re: SDP: test bug 2027 and new error [not found] ` <2307255.1272463222662.SLOX.WebMail.wwwrun-XDIR3SKYeFbgKi2NxijLtw@public.gmane.org> @ 2010-04-28 14:15 ` Amir Vadai [not found] ` <3358583.1272465775631.SLOX.WebMail.wwwrun@imap.lnl.infn.it> 0 siblings, 1 reply; 5+ messages in thread From: Amir Vadai @ 2010-04-28 14:15 UTC (permalink / raw) To: Andrea Gozzelino Cc: swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW@public.gmane.org, linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org Hi, I think this is because sdp_add_device() require FMR to be supported. I need to put a fix to disable ZCopy and not create FMR pool if doesn't have support - I thought it would be part of BZ2027 fix. As a workaround - you could comment out all the calls to sdp_dev->fmr_pool. and make sure ZCopy is disabled by setting 0 in sdp_zcopy_thresh. - Amir On 04/28/2010 05:00 PM, Andrea Gozzelino wrote: > On Apr 27, 2010 01:25 PM, Amir Vadai <amirv-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org> wrote: > > >> Andrea Hi, >> >> I am sorry - the commit missed the 24.4.2010 build which I asked you >> to >> test. >> Please test this build (the latest): OFED-1.5.2-20100426-0600 >> >> - Amir >> >> >> On 04/27/2010 01:34 PM, Andrea Gozzelino wrote: >> >>> Hi Amir, >>> I have checked the bug solution with OFED-1.5.2-20100425-0600 >>> version >>> without success. >>> >>> TEST LOG: >>> client machine >>> export LD_PRELOAD=/usr/lib64/libsdp.so >>> netperf -H address -c -C -l 60 -- -m 65536 >>> tail -f /var/log/libsdp.log >>> >>> OUTPUT >>> Tue Apr 27 10:08:38 2010 netperf[19051] libsdp Error connect: failed >>> for >>> SDP >>> fd:6 with error:Cannot allocate memory >>> Tue Apr 27 11:47:55 2010 netperf[19081] libsdp Error connect: failed >>> for >>> SDP >>> fd:6 with error:Cannot allocate memory >>> >>> dmesg >>> >>> OUTPUT >>> client_script[13876]: segfault at 00007fffef8de000 rip >>> 000000387d07ad8a >>> rsp >>> 00007fffef8d5648 error 6 >>> C_copiato[20437]: segfault at 00007ffff0257b3c rip 00000000004009e3 >>> rsp >>> 00007ffff0257b30 error 6 >>> client_20[1539]: segfault at 00007fffa2acd85c rip 0000000000400a80 >>> rsp >>> 00007fffa2acd850 error 6 >>> sdp_init_qp:95 sdp_sock( 2100:2 55190:0): recv sge's. capability: 4 >>> needed: 9 >>> sdp_init_qp:95 sdp_sock( 2100:2 41003:0): recv sge's. capability: 4 >>> needed: 9 >>> sdp_init_qp:95 sdp_sock( 2100:2 55192:0): recv sge's. capability: 4 >>> needed: 9 >>> sdp_init_qp:95 sdp_sock( 2100:2 38542:0): recv sge's. capability: 4 >>> needed: 9 >>> >>> ------------------ >>> >>> server machine >>> export LD_PRELOAD=/usr/lib64/libsdp.so >>> netserver >>> Starting netserver at port 12865 >>> Starting netserver at hostname 0.0.0.0 port 12865 and family >>> AF_UNSPEC >>> >>> ------------------ >>> Amir, I think that bug 2028 is also involved in NE020 SDP working. >>> Steve, is it possible? >>> If yes, I will check the bug 2027 and 2028 solutions in July 2010. >>> Is it correct? >>> >>> I reopen the bug 2027 because I don't see any changes. >>> Please consider that the bug 2027 could be connected with the 2028 >>> one >>> in Bugzilla. >>> >>> Keep in touch. >>> Thank you very much. >>> Regards, >>> Andrea >>> >>> >>> >>> >>> Andrea Gozzelino >>> >>> INFN - Laboratori Nazionali di Legnaro (LNL) >>> Viale dell'Universita' 2 >>> I-35020 - Legnaro (PD)- ITALIA >>> Tel: +39 049 8068346 >>> Fax: +39 049 641925 >>> Mail: andrea.gozzelino-PK20h7lG/Rc1GQ1Ptb7lUw@public.gmane.org >>> >>> >> > Hi Amir, > > I have tested your solution of bug 2027 with OFED-1.5.2-20100426-0600. > Netserver/netperf does not work with Socket Direct Protocol SDP. > > Now the error in the log file is: > libsdp error connect: failed for SDP fd with error No such device > sdp_init_qp:105 SDP not available on device nes0 > > I have a look in sdp_cma.c file: I see the message text near the > previous one about # SGE capability. > > Amir, could you please comment these error? > > Probably I can close the bug 2027 because the solution is OK and I will > open another one about "SDP not available on device nes0". > > Steve, is there a connession with the Fast Memory Region (bug 2028)? > > At the end, #SGE capability is now OK but SDP on NE020 cards - driver > iw_nes does not work yet properly. > I will wait your suggestions. > > Regards, > Andrea > > > Andrea Gozzelino > > INFN - Laboratori Nazionali di Legnaro (LNL) > Viale dell'Universita' 2 > I-35020 - Legnaro (PD)- ITALIA > Tel: +39 049 8068346 > Fax: +39 049 641925 > Mail: andrea.gozzelino-PK20h7lG/Rc1GQ1Ptb7lUw@public.gmane.org > > > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 5+ messages in thread
[parent not found: <3358583.1272465775631.SLOX.WebMail.wwwrun@imap.lnl.infn.it>]
[parent not found: <4BD84C68.5000407@mellanox.co.il>]
[parent not found: <4BD84C68.5000407-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org>]
* about SDP on NE020 card [not found] ` <4BD84C68.5000407-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org> @ 2010-04-29 8:55 ` Andrea Gozzelino 0 siblings, 0 replies; 5+ messages in thread From: Andrea Gozzelino @ 2010-04-29 8:55 UTC (permalink / raw) To: Amir Vadai Cc: swise-7bPotxP6k4+P2YhJcF5u+vpXobYPEAuW, linux-rdma-u79uwXL29TY76Z2rM5mHXA On Apr 28, 2010 04:55 PM, Amir Vadai <amirv-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org> wrote: > comment it out in sdp_cma.c at: > sdp_remove_device() and sdp_add_device() > > If I am right, the problem that you see, is that ib_create_fmr_pool() > is > failing (you should see an error in dmesg), and because of that > ib_set_client_data() is not called. And this is the reason for the > error > that you saw in the logs. > > Because you won't be using ZCopy, no need to call ib_create_fmr_pool() > - > ofcourse, make sure you comment out the calls to ib_destroy_fmr_pool > and > ib_flush_fmr_pool() > > - Amir Hi Amir, I put in off status the ZCopy with command: # echo 0 > /sys/module/ib_sdp/parameters/sdp_zcopy_thresh I comment out some lines in file sdp_main.c, but now the programs don't compile with a lot of errors. I list out the lines that I have commented out: file sdp_main.c line 2625: struct ib_fmr_pool_param fmr_param lines from 2645 to 2661 lines 2665 2666 lines from 2739 to 2745 All these lines contain a word "fmr". I don't find word "fmr" in file sdp_cma.c; sdp_main.c calls objects of sdp_cma.c SDP doesn't work. The output message of dmesg command is: sdp_init_qp:105 sdp_sock SDP not available on device nes0. I would like identify the problem: if we are almost sure that Fast Memory Region FMR is responsible for that, the bug 2028 solution in July will remove the error and I hope that SDP will work fine. On the other side, if FMR is not involved, there is an unknown situation and I am not happy. Could you evaluate the written error? Could you give me your opinion? Amir and Steve, thank you very much for help. Regards, Andrea Andrea Gozzelino INFN - Laboratori Nazionali di Legnaro (LNL) Viale dell'Universita' 2 I-35020 - Legnaro (PD)- ITALIA Tel: +39 049 8068346 Fax: +39 049 641925 Mail: andrea.gozzelino-PK20h7lG/Rc1GQ1Ptb7lUw@public.gmane.org -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2010-04-29 8:55 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-04-27 10:34 failed test on bug 2027 Andrea Gozzelino
[not found] ` <5464727.1272364442704.SLOX.WebMail.wwwrun-XDIR3SKYeFbgKi2NxijLtw@public.gmane.org>
2010-04-27 11:25 ` Amir Vadai
[not found] ` <4BD6C9C3.6070905-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org>
2010-04-28 14:00 ` SDP: test bug 2027 and new error Andrea Gozzelino
[not found] ` <2307255.1272463222662.SLOX.WebMail.wwwrun-XDIR3SKYeFbgKi2NxijLtw@public.gmane.org>
2010-04-28 14:15 ` Amir Vadai
[not found] ` <3358583.1272465775631.SLOX.WebMail.wwwrun@imap.lnl.infn.it>
[not found] ` <4BD84C68.5000407@mellanox.co.il>
[not found] ` <4BD84C68.5000407-VPRAkNaXOzVS1MOuV/RT9w@public.gmane.org>
2010-04-29 8:55 ` about SDP on NE020 card Andrea Gozzelino
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox