* mlx4 problems with 4.2-rc8
@ 2015-08-29 5:27 Doug Ledford
[not found] ` <55E142DC.8060205-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
0 siblings, 1 reply; 11+ messages in thread
From: Doug Ledford @ 2015-08-29 5:27 UTC (permalink / raw)
To: Or Gerlitz, linux-rdma, Amir Vadai
[-- Attachment #1: Type: text/plain, Size: 3850 bytes --]
I'm seeing this with rc8 on a dual port mlx4 adapter set to IB/Eth mode:
[ 77.883513] IPv6: ADDRCONF(NETDEV_UP): mlx4_roce: link is not ready
[ 77.892044] mlx4_en: mlx4_roce: frag:0 - size:1518 prefix:0 stride:1536
[ 77.903129] genirq: Flags mismatch irq 135. 00000000
(mlx4-65@0000:05:00.0) vs. 00000000 (mlx4-65@0000:05:00.0)
[ 77.914965] CPU: 0 PID: 1541 Comm: NetworkManager Not tainted
4.2.0-rc8 #58
[ 77.923292] Hardware name: Dell Inc. PowerEdge R820/04K5X5, BIOS
2.2.3 07/09/2014
[ 77.932205] 0000000000000000 00000000c16e3ce1 ffff8820365ab498
ffffffff8167e6ff
[ 77.941072] 0000000000000000 ffff8820339e9a00 ffff8820365ab4f8
ffffffff810d2b6e
[ 77.949938] 0000000000000246 ffff881032e67aa4 ffff881035e10ba0
00000000c16e3ce1
[ 77.958812] Call Trace:
[ 77.962109] [<ffffffff8167e6ff>] dump_stack+0x45/0x57
[ 77.968412] [<ffffffff810d2b6e>] __setup_irq+0x51e/0x590
[ 77.975018] [<ffffffffc03870a0>] ? mlx4_interrupt+0x80/0x80 [mlx4_core]
[ 77.983072] [<ffffffff810d2d64>] request_threaded_irq+0xf4/0x1a0
[ 77.990468] [<ffffffffc0385d55>] mlx4_assign_eq+0x135/0x360 [mlx4_core]
[ 77.998513] [<ffffffffc0537537>] mlx4_en_activate_cq+0x2a7/0x310
[mlx4_en]
[ 78.006853] [<ffffffff8130a2c8>] ? alloc_cpumask_var_node+0x28/0x40
[ 78.014542] [<ffffffff8131e8b9>] ? find_next_bit+0x19/0x20
[ 78.021334] [<ffffffff8130a284>] ? cpumask_next_and+0x34/0x50
[ 78.028425] [<ffffffffc053ae6b>] mlx4_en_start_port+0x1bb/0xb60
[mlx4_en]
[ 78.036689] [<ffffffffc037fe01>] ? mlx4_free_cmd_mailbox+0x31/0x40
[mlx4_core]
[ 78.045435] [<ffffffffc053bb59>] mlx4_en_open+0x349/0x630 [mlx4_en]
[ 78.053107] [<ffffffff815732f9>] __dev_open+0xc9/0x140
[ 78.059538] [<ffffffff81573621>] __dev_change_flags+0xa1/0x160
[ 78.066718] [<ffffffff81573709>] dev_change_flags+0x29/0x60
[ 78.073602] [<ffffffff81580dbe>] do_setlink+0x5be/0xa70
[ 78.080097] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 [mgag200]
[ 78.087859] [<ffffffffc01b1456>] ? mga_dirty_update+0x1e6/0x2f0
[mgag200]
[ 78.096112] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 [mgag200]
[ 78.103873] [<ffffffff81582470>] rtnl_newlink+0x4f0/0x880
[ 78.110586] [<ffffffff81582073>] ? rtnl_newlink+0xf3/0x880
[ 78.117372] [<ffffffff81294238>] ? security_capable+0x48/0x60
[ 78.124452] [<ffffffff81081b1d>] ? ns_capable+0x2d/0x60
[ 78.130950] [<ffffffff8157f8c4>] rtnetlink_rcv_msg+0xa4/0x250
[ 78.138028] [<ffffffff812987c0>] ? sock_has_perm+0x70/0x90
[ 78.144824] [<ffffffff8157f820>] ? rtnetlink_rcv+0x40/0x40
[ 78.151615] [<ffffffff815a2bdf>] netlink_rcv_skb+0xaf/0xc0
[ 78.158425] [<ffffffff8157f80c>] rtnetlink_rcv+0x2c/0x40
[ 78.164997] [<ffffffff815a22d1>] netlink_unicast+0x101/0x1f0
[ 78.171937] [<ffffffff815a27c1>] netlink_sendmsg+0x401/0x660
[ 78.178867] [<ffffffff81553e78>] sock_sendmsg+0x38/0x50
[ 78.185335] [<ffffffff815547d5>] ___sys_sendmsg+0x275/0x290
[ 78.192176] [<ffffffff81262c56>] ? sysctl_head_finish+0x46/0x50
[ 78.199411] [<ffffffff81262e08>] ? proc_sys_call_handler+0x88/0xe0
[ 78.206946] [<ffffffff8131854c>] ? lockref_put_or_lock+0x4c/0x80
[ 78.214296] [<ffffffff81555197>] __sys_sendmsg+0x57/0xa0
[ 78.220878] [<ffffffff815551f2>] SyS_sendmsg+0x12/0x20
[ 78.227283] [<ffffffff8168536e>] entry_SYSCALL_64_fastpath+0x12/0x71
[ 78.235114] mlx4_en 0000:05:00.0: Failed assigning an EQ to
\xfffffff\xffffffb6Z6
\xffffff88\xffffffff\xffffffff\xffffff84\xffffffa20\xffffff81\xffffffff\xffffffff\xffffffff\xffffffff
[ 78.243732] mlx4_en: mlx4_roce: Failed activating Rx CQ
[ 78.319027] mlx4_en: mlx4_roce: Failed starting port:2
The interface in question is unusable.
--
Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
GPG KeyID: 0E572FDD
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 884 bytes --]
^ permalink raw reply [flat|nested] 11+ messages in thread[parent not found: <55E142DC.8060205-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <55E142DC.8060205-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> @ 2015-08-30 1:13 ` Or Gerlitz [not found] ` <CAJ3xEMj5By11L3qbSKxcEiMarB6CeyeERMnuK_vvH11VLLFypw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 0 siblings, 1 reply; 11+ messages in thread From: Or Gerlitz @ 2015-08-30 1:13 UTC (permalink / raw) To: Doug Ledford Cc: Or Gerlitz, linux-rdma, Amir Vadai, Matan Barak, Jack Morgenstein On Fri, Aug 28, 2015 at 10:27 PM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote: > I'm seeing this with rc8 on a dual port mlx4 adapter set to IB/Eth mode: mmm, both Amir and myself are just finishing vacations... so WB notes are not always lovely as you want them to be, life > > [ 77.883513] IPv6: ADDRCONF(NETDEV_UP): mlx4_roce: link is not ready > [ 77.892044] mlx4_en: mlx4_roce: frag:0 - size:1518 prefix:0 stride:1536 > [ 77.903129] genirq: Flags mismatch irq 135. 00000000 > (mlx4-65@0000:05:00.0) vs. 00000000 (mlx4-65@0000:05:00.0) is this strict regression from some known point in the past on this system/config -- i.e 4.1 or 4.2-rc1?! Can you please send the mlx4 driver output when you load it with debug prints on? also do things work if you set the ports type to be ib/ib or eth/eth? send us your compressed .config Matan, any idea what goes wrong here? Or. > [ 77.914965] CPU: 0 PID: 1541 Comm: NetworkManager Not tainted > 4.2.0-rc8 #58 > [ 77.923292] Hardware name: Dell Inc. PowerEdge R820/04K5X5, BIOS > 2.2.3 07/09/2014 > [ 77.932205] 0000000000000000 00000000c16e3ce1 ffff8820365ab498 > ffffffff8167e6ff > [ 77.941072] 0000000000000000 ffff8820339e9a00 ffff8820365ab4f8 > ffffffff810d2b6e > [ 77.949938] 0000000000000246 ffff881032e67aa4 ffff881035e10ba0 > 00000000c16e3ce1 > [ 77.958812] Call Trace: > [ 77.962109] [<ffffffff8167e6ff>] dump_stack+0x45/0x57 > [ 77.968412] [<ffffffff810d2b6e>] __setup_irq+0x51e/0x590 > [ 77.975018] [<ffffffffc03870a0>] ? mlx4_interrupt+0x80/0x80 [mlx4_core] > [ 77.983072] [<ffffffff810d2d64>] request_threaded_irq+0xf4/0x1a0 > [ 77.990468] [<ffffffffc0385d55>] mlx4_assign_eq+0x135/0x360 [mlx4_core] > [ 77.998513] [<ffffffffc0537537>] mlx4_en_activate_cq+0x2a7/0x310 > [mlx4_en] > [ 78.006853] [<ffffffff8130a2c8>] ? alloc_cpumask_var_node+0x28/0x40 > [ 78.014542] [<ffffffff8131e8b9>] ? find_next_bit+0x19/0x20 > [ 78.021334] [<ffffffff8130a284>] ? cpumask_next_and+0x34/0x50 > [ 78.028425] [<ffffffffc053ae6b>] mlx4_en_start_port+0x1bb/0xb60 > [mlx4_en] > [ 78.036689] [<ffffffffc037fe01>] ? mlx4_free_cmd_mailbox+0x31/0x40 > [mlx4_core] > [ 78.045435] [<ffffffffc053bb59>] mlx4_en_open+0x349/0x630 [mlx4_en] > [ 78.053107] [<ffffffff815732f9>] __dev_open+0xc9/0x140 > [ 78.059538] [<ffffffff81573621>] __dev_change_flags+0xa1/0x160 > [ 78.066718] [<ffffffff81573709>] dev_change_flags+0x29/0x60 > [ 78.073602] [<ffffffff81580dbe>] do_setlink+0x5be/0xa70 > [ 78.080097] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 [mgag200] > [ 78.087859] [<ffffffffc01b1456>] ? mga_dirty_update+0x1e6/0x2f0 > [mgag200] > [ 78.096112] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 [mgag200] > [ 78.103873] [<ffffffff81582470>] rtnl_newlink+0x4f0/0x880 > [ 78.110586] [<ffffffff81582073>] ? rtnl_newlink+0xf3/0x880 > [ 78.117372] [<ffffffff81294238>] ? security_capable+0x48/0x60 > [ 78.124452] [<ffffffff81081b1d>] ? ns_capable+0x2d/0x60 > [ 78.130950] [<ffffffff8157f8c4>] rtnetlink_rcv_msg+0xa4/0x250 > [ 78.138028] [<ffffffff812987c0>] ? sock_has_perm+0x70/0x90 > [ 78.144824] [<ffffffff8157f820>] ? rtnetlink_rcv+0x40/0x40 > [ 78.151615] [<ffffffff815a2bdf>] netlink_rcv_skb+0xaf/0xc0 > [ 78.158425] [<ffffffff8157f80c>] rtnetlink_rcv+0x2c/0x40 > [ 78.164997] [<ffffffff815a22d1>] netlink_unicast+0x101/0x1f0 > [ 78.171937] [<ffffffff815a27c1>] netlink_sendmsg+0x401/0x660 > [ 78.178867] [<ffffffff81553e78>] sock_sendmsg+0x38/0x50 > [ 78.185335] [<ffffffff815547d5>] ___sys_sendmsg+0x275/0x290 > [ 78.192176] [<ffffffff81262c56>] ? sysctl_head_finish+0x46/0x50 > [ 78.199411] [<ffffffff81262e08>] ? proc_sys_call_handler+0x88/0xe0 > [ 78.206946] [<ffffffff8131854c>] ? lockref_put_or_lock+0x4c/0x80 > [ 78.214296] [<ffffffff81555197>] __sys_sendmsg+0x57/0xa0 > [ 78.220878] [<ffffffff815551f2>] SyS_sendmsg+0x12/0x20 > [ 78.227283] [<ffffffff8168536e>] entry_SYSCALL_64_fastpath+0x12/0x71 > [ 78.235114] mlx4_en 0000:05:00.0: Failed assigning an EQ to > \xfffffff\xffffffb6Z6 > \xffffff88\xffffffff\xffffffff\xffffff84\xffffffa20\xffffff81\xffffffff\xffffffff\xffffffff\xffffffff > [ 78.243732] mlx4_en: mlx4_roce: Failed activating Rx CQ > [ 78.319027] mlx4_en: mlx4_roce: Failed starting port:2 > > The interface in question is unusable. > > -- > Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> > GPG KeyID: 0E572FDD > > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 11+ messages in thread
[parent not found: <CAJ3xEMj5By11L3qbSKxcEiMarB6CeyeERMnuK_vvH11VLLFypw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <CAJ3xEMj5By11L3qbSKxcEiMarB6CeyeERMnuK_vvH11VLLFypw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2015-08-30 22:38 ` Doug Ledford [not found] ` <55E385DB.2-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> 0 siblings, 1 reply; 11+ messages in thread From: Doug Ledford @ 2015-08-30 22:38 UTC (permalink / raw) To: Or Gerlitz Cc: Or Gerlitz, linux-rdma, Amir Vadai, Matan Barak, Jack Morgenstein [-- Attachment #1.1: Type: text/plain, Size: 5560 bytes --] On 08/29/2015 09:13 PM, Or Gerlitz wrote: > On Fri, Aug 28, 2015 at 10:27 PM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote: >> I'm seeing this with rc8 on a dual port mlx4 adapter set to IB/Eth mode: > > mmm, both Amir and myself are just finishing vacations... so WB notes > are not always lovely as you want them to be, life >> >> [ 77.883513] IPv6: ADDRCONF(NETDEV_UP): mlx4_roce: link is not ready >> [ 77.892044] mlx4_en: mlx4_roce: frag:0 - size:1518 prefix:0 stride:1536 >> [ 77.903129] genirq: Flags mismatch irq 135. 00000000 >> (mlx4-65@0000:05:00.0) vs. 00000000 (mlx4-65@0000:05:00.0) > > is this strict regression from some known point in the past on this > system/config -- i.e 4.1 or 4.2-rc1?! Yes. When I was submitting the 4.2-rc changes this machine worked. This is one of my IB/Eth SRIOV machines. I tested with SRIOV disabled and it didn't effect things. > Can you please send the mlx4 driver output when you load it with debug > prints on? also do things work if you set the ports type to be ib/ib > or eth/eth? It should work as ib/ib given that in ib/eth mode the ib port works. I doubt eth/eth would work, but I'll try and see. OK, Eth/Eth mode fails too (at least on the second port, I can say on the first port for certain as I can't bring it up, it's still plugged into an IB switch). However, now in Eth/Eth mode, attempts to bring up the interface manually at the command line have hung, which it didn't do in IB/Eth mode. I'll try to ping things down further, but that's what I have so far. And as requested, the config is attached. > > send us your compressed .config > > Matan, any idea what goes wrong here? > > Or. > > > >> [ 77.914965] CPU: 0 PID: 1541 Comm: NetworkManager Not tainted >> 4.2.0-rc8 #58 >> [ 77.923292] Hardware name: Dell Inc. PowerEdge R820/04K5X5, BIOS >> 2.2.3 07/09/2014 >> [ 77.932205] 0000000000000000 00000000c16e3ce1 ffff8820365ab498 >> ffffffff8167e6ff >> [ 77.941072] 0000000000000000 ffff8820339e9a00 ffff8820365ab4f8 >> ffffffff810d2b6e >> [ 77.949938] 0000000000000246 ffff881032e67aa4 ffff881035e10ba0 >> 00000000c16e3ce1 >> [ 77.958812] Call Trace: >> [ 77.962109] [<ffffffff8167e6ff>] dump_stack+0x45/0x57 >> [ 77.968412] [<ffffffff810d2b6e>] __setup_irq+0x51e/0x590 >> [ 77.975018] [<ffffffffc03870a0>] ? mlx4_interrupt+0x80/0x80 [mlx4_core] >> [ 77.983072] [<ffffffff810d2d64>] request_threaded_irq+0xf4/0x1a0 >> [ 77.990468] [<ffffffffc0385d55>] mlx4_assign_eq+0x135/0x360 [mlx4_core] >> [ 77.998513] [<ffffffffc0537537>] mlx4_en_activate_cq+0x2a7/0x310 >> [mlx4_en] >> [ 78.006853] [<ffffffff8130a2c8>] ? alloc_cpumask_var_node+0x28/0x40 >> [ 78.014542] [<ffffffff8131e8b9>] ? find_next_bit+0x19/0x20 >> [ 78.021334] [<ffffffff8130a284>] ? cpumask_next_and+0x34/0x50 >> [ 78.028425] [<ffffffffc053ae6b>] mlx4_en_start_port+0x1bb/0xb60 >> [mlx4_en] >> [ 78.036689] [<ffffffffc037fe01>] ? mlx4_free_cmd_mailbox+0x31/0x40 >> [mlx4_core] >> [ 78.045435] [<ffffffffc053bb59>] mlx4_en_open+0x349/0x630 [mlx4_en] >> [ 78.053107] [<ffffffff815732f9>] __dev_open+0xc9/0x140 >> [ 78.059538] [<ffffffff81573621>] __dev_change_flags+0xa1/0x160 >> [ 78.066718] [<ffffffff81573709>] dev_change_flags+0x29/0x60 >> [ 78.073602] [<ffffffff81580dbe>] do_setlink+0x5be/0xa70 >> [ 78.080097] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 [mgag200] >> [ 78.087859] [<ffffffffc01b1456>] ? mga_dirty_update+0x1e6/0x2f0 >> [mgag200] >> [ 78.096112] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 [mgag200] >> [ 78.103873] [<ffffffff81582470>] rtnl_newlink+0x4f0/0x880 >> [ 78.110586] [<ffffffff81582073>] ? rtnl_newlink+0xf3/0x880 >> [ 78.117372] [<ffffffff81294238>] ? security_capable+0x48/0x60 >> [ 78.124452] [<ffffffff81081b1d>] ? ns_capable+0x2d/0x60 >> [ 78.130950] [<ffffffff8157f8c4>] rtnetlink_rcv_msg+0xa4/0x250 >> [ 78.138028] [<ffffffff812987c0>] ? sock_has_perm+0x70/0x90 >> [ 78.144824] [<ffffffff8157f820>] ? rtnetlink_rcv+0x40/0x40 >> [ 78.151615] [<ffffffff815a2bdf>] netlink_rcv_skb+0xaf/0xc0 >> [ 78.158425] [<ffffffff8157f80c>] rtnetlink_rcv+0x2c/0x40 >> [ 78.164997] [<ffffffff815a22d1>] netlink_unicast+0x101/0x1f0 >> [ 78.171937] [<ffffffff815a27c1>] netlink_sendmsg+0x401/0x660 >> [ 78.178867] [<ffffffff81553e78>] sock_sendmsg+0x38/0x50 >> [ 78.185335] [<ffffffff815547d5>] ___sys_sendmsg+0x275/0x290 >> [ 78.192176] [<ffffffff81262c56>] ? sysctl_head_finish+0x46/0x50 >> [ 78.199411] [<ffffffff81262e08>] ? proc_sys_call_handler+0x88/0xe0 >> [ 78.206946] [<ffffffff8131854c>] ? lockref_put_or_lock+0x4c/0x80 >> [ 78.214296] [<ffffffff81555197>] __sys_sendmsg+0x57/0xa0 >> [ 78.220878] [<ffffffff815551f2>] SyS_sendmsg+0x12/0x20 >> [ 78.227283] [<ffffffff8168536e>] entry_SYSCALL_64_fastpath+0x12/0x71 >> [ 78.235114] mlx4_en 0000:05:00.0: Failed assigning an EQ to >> \xfffffff\xffffffb6Z6 >> \xffffff88\xffffffff\xffffffff\xffffff84\xffffffa20\xffffff81\xffffffff\xffffffff\xffffffff\xffffffff >> [ 78.243732] mlx4_en: mlx4_roce: Failed activating Rx CQ >> [ 78.319027] mlx4_en: mlx4_roce: Failed starting port:2 >> >> The interface in question is unusable. >> >> -- >> Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> >> GPG KeyID: 0E572FDD >> >> -- Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> GPG KeyID: 0E572FDD [-- Attachment #1.2: config.gz --] [-- Type: application/gzip, Size: 35252 bytes --] [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 884 bytes --] ^ permalink raw reply [flat|nested] 11+ messages in thread
[parent not found: <55E385DB.2-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <55E385DB.2-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> @ 2015-08-31 7:09 ` Matan Barak [not found] ` <55E3FDAB.10706-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org> 0 siblings, 1 reply; 11+ messages in thread From: Matan Barak @ 2015-08-31 7:09 UTC (permalink / raw) To: Doug Ledford, Or Gerlitz Cc: Or Gerlitz, linux-rdma, Amir Vadai, Jack Morgenstein On 8/31/2015 1:38 AM, Doug Ledford wrote: > On 08/29/2015 09:13 PM, Or Gerlitz wrote: >> On Fri, Aug 28, 2015 at 10:27 PM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote: >>> I'm seeing this with rc8 on a dual port mlx4 adapter set to IB/Eth mode: >> >> mmm, both Amir and myself are just finishing vacations... so WB notes >> are not always lovely as you want them to be, life >>> >>> [ 77.883513] IPv6: ADDRCONF(NETDEV_UP): mlx4_roce: link is not ready >>> [ 77.892044] mlx4_en: mlx4_roce: frag:0 - size:1518 prefix:0 stride:1536 >>> [ 77.903129] genirq: Flags mismatch irq 135. 00000000 >>> (mlx4-65@0000:05:00.0) vs. 00000000 (mlx4-65@0000:05:00.0) >> >> is this strict regression from some known point in the past on this >> system/config -- i.e 4.1 or 4.2-rc1?! > > Yes. When I was submitting the 4.2-rc changes this machine worked. > This is one of my IB/Eth SRIOV machines. I tested with SRIOV disabled > and it didn't effect things. > >> Can you please send the mlx4 driver output when you load it with debug >> prints on? also do things work if you set the ports type to be ib/ib >> or eth/eth? > > It should work as ib/ib given that in ib/eth mode the ib port works. I > doubt eth/eth would work, but I'll try and see. OK, Eth/Eth mode fails > too (at least on the second port, I can say on the first port for > certain as I can't bring it up, it's still plugged into an IB switch). > However, now in Eth/Eth mode, attempts to bring up the interface > manually at the command line have hung, which it didn't do in IB/Eth mode. > > I'll try to ping things down further, but that's what I have so far. > > And as requested, the config is attached. > >> >> send us your compressed .config >> >> Matan, any idea what goes wrong here? >> >> Or. >> >> >> >>> [ 77.914965] CPU: 0 PID: 1541 Comm: NetworkManager Not tainted >>> 4.2.0-rc8 #58 >>> [ 77.923292] Hardware name: Dell Inc. PowerEdge R820/04K5X5, BIOS >>> 2.2.3 07/09/2014 >>> [ 77.932205] 0000000000000000 00000000c16e3ce1 ffff8820365ab498 >>> ffffffff8167e6ff >>> [ 77.941072] 0000000000000000 ffff8820339e9a00 ffff8820365ab4f8 >>> ffffffff810d2b6e >>> [ 77.949938] 0000000000000246 ffff881032e67aa4 ffff881035e10ba0 >>> 00000000c16e3ce1 >>> [ 77.958812] Call Trace: >>> [ 77.962109] [<ffffffff8167e6ff>] dump_stack+0x45/0x57 >>> [ 77.968412] [<ffffffff810d2b6e>] __setup_irq+0x51e/0x590 >>> [ 77.975018] [<ffffffffc03870a0>] ? mlx4_interrupt+0x80/0x80 [mlx4_core] >>> [ 77.983072] [<ffffffff810d2d64>] request_threaded_irq+0xf4/0x1a0 >>> [ 77.990468] [<ffffffffc0385d55>] mlx4_assign_eq+0x135/0x360 [mlx4_core] >>> [ 77.998513] [<ffffffffc0537537>] mlx4_en_activate_cq+0x2a7/0x310 >>> [mlx4_en] >>> [ 78.006853] [<ffffffff8130a2c8>] ? alloc_cpumask_var_node+0x28/0x40 >>> [ 78.014542] [<ffffffff8131e8b9>] ? find_next_bit+0x19/0x20 >>> [ 78.021334] [<ffffffff8130a284>] ? cpumask_next_and+0x34/0x50 >>> [ 78.028425] [<ffffffffc053ae6b>] mlx4_en_start_port+0x1bb/0xb60 >>> [mlx4_en] >>> [ 78.036689] [<ffffffffc037fe01>] ? mlx4_free_cmd_mailbox+0x31/0x40 >>> [mlx4_core] >>> [ 78.045435] [<ffffffffc053bb59>] mlx4_en_open+0x349/0x630 [mlx4_en] >>> [ 78.053107] [<ffffffff815732f9>] __dev_open+0xc9/0x140 >>> [ 78.059538] [<ffffffff81573621>] __dev_change_flags+0xa1/0x160 >>> [ 78.066718] [<ffffffff81573709>] dev_change_flags+0x29/0x60 >>> [ 78.073602] [<ffffffff81580dbe>] do_setlink+0x5be/0xa70 >>> [ 78.080097] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 [mgag200] >>> [ 78.087859] [<ffffffffc01b1456>] ? mga_dirty_update+0x1e6/0x2f0 >>> [mgag200] >>> [ 78.096112] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 [mgag200] >>> [ 78.103873] [<ffffffff81582470>] rtnl_newlink+0x4f0/0x880 >>> [ 78.110586] [<ffffffff81582073>] ? rtnl_newlink+0xf3/0x880 >>> [ 78.117372] [<ffffffff81294238>] ? security_capable+0x48/0x60 >>> [ 78.124452] [<ffffffff81081b1d>] ? ns_capable+0x2d/0x60 >>> [ 78.130950] [<ffffffff8157f8c4>] rtnetlink_rcv_msg+0xa4/0x250 >>> [ 78.138028] [<ffffffff812987c0>] ? sock_has_perm+0x70/0x90 >>> [ 78.144824] [<ffffffff8157f820>] ? rtnetlink_rcv+0x40/0x40 >>> [ 78.151615] [<ffffffff815a2bdf>] netlink_rcv_skb+0xaf/0xc0 >>> [ 78.158425] [<ffffffff8157f80c>] rtnetlink_rcv+0x2c/0x40 >>> [ 78.164997] [<ffffffff815a22d1>] netlink_unicast+0x101/0x1f0 >>> [ 78.171937] [<ffffffff815a27c1>] netlink_sendmsg+0x401/0x660 >>> [ 78.178867] [<ffffffff81553e78>] sock_sendmsg+0x38/0x50 >>> [ 78.185335] [<ffffffff815547d5>] ___sys_sendmsg+0x275/0x290 >>> [ 78.192176] [<ffffffff81262c56>] ? sysctl_head_finish+0x46/0x50 >>> [ 78.199411] [<ffffffff81262e08>] ? proc_sys_call_handler+0x88/0xe0 >>> [ 78.206946] [<ffffffff8131854c>] ? lockref_put_or_lock+0x4c/0x80 >>> [ 78.214296] [<ffffffff81555197>] __sys_sendmsg+0x57/0xa0 >>> [ 78.220878] [<ffffffff815551f2>] SyS_sendmsg+0x12/0x20 >>> [ 78.227283] [<ffffffff8168536e>] entry_SYSCALL_64_fastpath+0x12/0x71 >>> [ 78.235114] mlx4_en 0000:05:00.0: Failed assigning an EQ to >>> \xfffffff\xffffffb6Z6 >>> \xffffff88\xffffffff\xffffffff\xffffff84\xffffffa20\xffffff81\xffffffff\xffffffff\xffffffff\xffffffff >>> [ 78.243732] mlx4_en: mlx4_roce: Failed activating Rx CQ >>> [ 78.319027] mlx4_en: mlx4_roce: Failed starting port:2 >>> >>> The interface in question is unusable. >>> >>> -- >>> Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> >>> GPG KeyID: 0E572FDD >>> >>> > > Actually, it looks like the dump stack we've got before [1] was fixed. This happens when the mlx4 driver is used in setups where number of cores >= 32. Doug, is that the case? [1] http://www.spinics.net/lists/netdev/msg341171.html -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 11+ messages in thread
[parent not found: <55E3FDAB.10706-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <55E3FDAB.10706-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org> @ 2015-08-31 13:02 ` Doug Ledford [not found] ` <55E45058.1070105-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> 0 siblings, 1 reply; 11+ messages in thread From: Doug Ledford @ 2015-08-31 13:02 UTC (permalink / raw) To: Matan Barak, Or Gerlitz Cc: Or Gerlitz, linux-rdma, Amir Vadai, Jack Morgenstein [-- Attachment #1: Type: text/plain, Size: 6237 bytes --] On 08/31/2015 03:09 AM, Matan Barak wrote: > > > On 8/31/2015 1:38 AM, Doug Ledford wrote: >> On 08/29/2015 09:13 PM, Or Gerlitz wrote: >>> On Fri, Aug 28, 2015 at 10:27 PM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> >>> wrote: >>>> I'm seeing this with rc8 on a dual port mlx4 adapter set to IB/Eth >>>> mode: >>> >>> mmm, both Amir and myself are just finishing vacations... so WB notes >>> are not always lovely as you want them to be, life >>>> >>>> [ 77.883513] IPv6: ADDRCONF(NETDEV_UP): mlx4_roce: link is not ready >>>> [ 77.892044] mlx4_en: mlx4_roce: frag:0 - size:1518 prefix:0 >>>> stride:1536 >>>> [ 77.903129] genirq: Flags mismatch irq 135. 00000000 >>>> (mlx4-65@0000:05:00.0) vs. 00000000 (mlx4-65@0000:05:00.0) >>> >>> is this strict regression from some known point in the past on this >>> system/config -- i.e 4.1 or 4.2-rc1?! >> >> Yes. When I was submitting the 4.2-rc changes this machine worked. >> This is one of my IB/Eth SRIOV machines. I tested with SRIOV disabled >> and it didn't effect things. >> >>> Can you please send the mlx4 driver output when you load it with debug >>> prints on? also do things work if you set the ports type to be ib/ib >>> or eth/eth? >> >> It should work as ib/ib given that in ib/eth mode the ib port works. I >> doubt eth/eth would work, but I'll try and see. OK, Eth/Eth mode fails >> too (at least on the second port, I can say on the first port for >> certain as I can't bring it up, it's still plugged into an IB switch). >> However, now in Eth/Eth mode, attempts to bring up the interface >> manually at the command line have hung, which it didn't do in IB/Eth >> mode. >> >> I'll try to ping things down further, but that's what I have so far. >> >> And as requested, the config is attached. >> >>> >>> send us your compressed .config >>> >>> Matan, any idea what goes wrong here? >>> >>> Or. >>> >>> >>> >>>> [ 77.914965] CPU: 0 PID: 1541 Comm: NetworkManager Not tainted >>>> 4.2.0-rc8 #58 >>>> [ 77.923292] Hardware name: Dell Inc. PowerEdge R820/04K5X5, BIOS >>>> 2.2.3 07/09/2014 >>>> [ 77.932205] 0000000000000000 00000000c16e3ce1 ffff8820365ab498 >>>> ffffffff8167e6ff >>>> [ 77.941072] 0000000000000000 ffff8820339e9a00 ffff8820365ab4f8 >>>> ffffffff810d2b6e >>>> [ 77.949938] 0000000000000246 ffff881032e67aa4 ffff881035e10ba0 >>>> 00000000c16e3ce1 >>>> [ 77.958812] Call Trace: >>>> [ 77.962109] [<ffffffff8167e6ff>] dump_stack+0x45/0x57 >>>> [ 77.968412] [<ffffffff810d2b6e>] __setup_irq+0x51e/0x590 >>>> [ 77.975018] [<ffffffffc03870a0>] ? mlx4_interrupt+0x80/0x80 >>>> [mlx4_core] >>>> [ 77.983072] [<ffffffff810d2d64>] request_threaded_irq+0xf4/0x1a0 >>>> [ 77.990468] [<ffffffffc0385d55>] mlx4_assign_eq+0x135/0x360 >>>> [mlx4_core] >>>> [ 77.998513] [<ffffffffc0537537>] mlx4_en_activate_cq+0x2a7/0x310 >>>> [mlx4_en] >>>> [ 78.006853] [<ffffffff8130a2c8>] ? alloc_cpumask_var_node+0x28/0x40 >>>> [ 78.014542] [<ffffffff8131e8b9>] ? find_next_bit+0x19/0x20 >>>> [ 78.021334] [<ffffffff8130a284>] ? cpumask_next_and+0x34/0x50 >>>> [ 78.028425] [<ffffffffc053ae6b>] mlx4_en_start_port+0x1bb/0xb60 >>>> [mlx4_en] >>>> [ 78.036689] [<ffffffffc037fe01>] ? mlx4_free_cmd_mailbox+0x31/0x40 >>>> [mlx4_core] >>>> [ 78.045435] [<ffffffffc053bb59>] mlx4_en_open+0x349/0x630 [mlx4_en] >>>> [ 78.053107] [<ffffffff815732f9>] __dev_open+0xc9/0x140 >>>> [ 78.059538] [<ffffffff81573621>] __dev_change_flags+0xa1/0x160 >>>> [ 78.066718] [<ffffffff81573709>] dev_change_flags+0x29/0x60 >>>> [ 78.073602] [<ffffffff81580dbe>] do_setlink+0x5be/0xa70 >>>> [ 78.080097] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 >>>> [mgag200] >>>> [ 78.087859] [<ffffffffc01b1456>] ? mga_dirty_update+0x1e6/0x2f0 >>>> [mgag200] >>>> [ 78.096112] [<ffffffffc01b158f>] ? mga_imageblit+0x2f/0x40 >>>> [mgag200] >>>> [ 78.103873] [<ffffffff81582470>] rtnl_newlink+0x4f0/0x880 >>>> [ 78.110586] [<ffffffff81582073>] ? rtnl_newlink+0xf3/0x880 >>>> [ 78.117372] [<ffffffff81294238>] ? security_capable+0x48/0x60 >>>> [ 78.124452] [<ffffffff81081b1d>] ? ns_capable+0x2d/0x60 >>>> [ 78.130950] [<ffffffff8157f8c4>] rtnetlink_rcv_msg+0xa4/0x250 >>>> [ 78.138028] [<ffffffff812987c0>] ? sock_has_perm+0x70/0x90 >>>> [ 78.144824] [<ffffffff8157f820>] ? rtnetlink_rcv+0x40/0x40 >>>> [ 78.151615] [<ffffffff815a2bdf>] netlink_rcv_skb+0xaf/0xc0 >>>> [ 78.158425] [<ffffffff8157f80c>] rtnetlink_rcv+0x2c/0x40 >>>> [ 78.164997] [<ffffffff815a22d1>] netlink_unicast+0x101/0x1f0 >>>> [ 78.171937] [<ffffffff815a27c1>] netlink_sendmsg+0x401/0x660 >>>> [ 78.178867] [<ffffffff81553e78>] sock_sendmsg+0x38/0x50 >>>> [ 78.185335] [<ffffffff815547d5>] ___sys_sendmsg+0x275/0x290 >>>> [ 78.192176] [<ffffffff81262c56>] ? sysctl_head_finish+0x46/0x50 >>>> [ 78.199411] [<ffffffff81262e08>] ? proc_sys_call_handler+0x88/0xe0 >>>> [ 78.206946] [<ffffffff8131854c>] ? lockref_put_or_lock+0x4c/0x80 >>>> [ 78.214296] [<ffffffff81555197>] __sys_sendmsg+0x57/0xa0 >>>> [ 78.220878] [<ffffffff815551f2>] SyS_sendmsg+0x12/0x20 >>>> [ 78.227283] [<ffffffff8168536e>] >>>> entry_SYSCALL_64_fastpath+0x12/0x71 >>>> [ 78.235114] mlx4_en 0000:05:00.0: Failed assigning an EQ to >>>> \xfffffff\xffffffb6Z6 >>>> \xffffff88\xffffffff\xffffffff\xffffff84\xffffffa20\xffffff81\xffffffff\xffffffff\xffffffff\xffffffff >>>> >>>> [ 78.243732] mlx4_en: mlx4_roce: Failed activating Rx CQ >>>> [ 78.319027] mlx4_en: mlx4_roce: Failed starting port:2 >>>> >>>> The interface in question is unusable. >>>> >>>> -- >>>> Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> >>>> GPG KeyID: 0E572FDD >>>> >>>> >> >> > > Actually, it looks like the dump stack we've got before [1] was fixed. > This happens when the mlx4 driver is used in setups where number of > cores >= 32. > Doug, is that the case? Indeed, 48 cores on this machine. > [1] http://www.spinics.net/lists/netdev/msg341171.html -- Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> GPG KeyID: 0E572FDD [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 884 bytes --] ^ permalink raw reply [flat|nested] 11+ messages in thread
[parent not found: <55E45058.1070105-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <55E45058.1070105-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> @ 2015-08-31 20:21 ` Or Gerlitz [not found] ` <CAJ3xEMjp+3Y0y2d-K-zi9SnBSshN_C5x5KLY3oCpD_XriDCsWw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 0 siblings, 1 reply; 11+ messages in thread From: Or Gerlitz @ 2015-08-31 20:21 UTC (permalink / raw) To: Doug Ledford Cc: Matan Barak, Or Gerlitz, linux-rdma, Amir Vadai, Jack Morgenstein On Mon, Aug 31, 2015 at 4:02 PM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote: > On 08/31/2015 03:09 AM, Matan Barak wrote: >> Actually, it looks like the dump stack we've got before [1] was fixed. >> This happens when the mlx4 driver is used in setups where number of >> cores >= 32. >> Doug, is that the case? > Indeed, 48 cores on this machine. so do we have bingo here? the patch is in the net-next tree (and we can't put it in 4.2 only through -stable since 4.2 is released by now), does it solves the problem? Or. >> [1] http://www.spinics.net/lists/netdev/msg341171.html -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 11+ messages in thread
[parent not found: <CAJ3xEMjp+3Y0y2d-K-zi9SnBSshN_C5x5KLY3oCpD_XriDCsWw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <CAJ3xEMjp+3Y0y2d-K-zi9SnBSshN_C5x5KLY3oCpD_XriDCsWw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2015-08-31 22:13 ` Doug Ledford [not found] ` <55E4D1A1.3060608-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> 0 siblings, 1 reply; 11+ messages in thread From: Doug Ledford @ 2015-08-31 22:13 UTC (permalink / raw) To: Or Gerlitz Cc: Matan Barak, Or Gerlitz, linux-rdma, Amir Vadai, Jack Morgenstein [-- Attachment #1: Type: text/plain, Size: 824 bytes --] On 08/31/2015 04:21 PM, Or Gerlitz wrote: > On Mon, Aug 31, 2015 at 4:02 PM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote: >> On 08/31/2015 03:09 AM, Matan Barak wrote: > >>> Actually, it looks like the dump stack we've got before [1] was fixed. >>> This happens when the mlx4 driver is used in setups where number of >>> cores >= 32. > >>> Doug, is that the case? > >> Indeed, 48 cores on this machine. > > so do we have bingo here? the patch is in the net-next tree (and we > can't put it in 4.2 only through -stable since 4.2 is released by > now), does it solves the problem? Yes, it solved the problem. I pulled the patch into my testing branch to confirm. -- Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> GPG KeyID: 0E572FDD [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 884 bytes --] ^ permalink raw reply [flat|nested] 11+ messages in thread
[parent not found: <55E4D1A1.3060608-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <55E4D1A1.3060608-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> @ 2015-09-01 6:40 ` Or Gerlitz [not found] ` <CAJ3xEMh-EXsoWsMhSf2ho_U_tz5tCUz2iWK+YZ3d76j5B7HJxg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 0 siblings, 1 reply; 11+ messages in thread From: Or Gerlitz @ 2015-09-01 6:40 UTC (permalink / raw) To: Doug Ledford, Matan Barak; +Cc: linux-rdma, Amir Vadai, Jack Morgenstein On Tue, Sep 1, 2015 at 1:13 AM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote: > On 08/31/2015 04:21 PM, Or Gerlitz wrote: >> On Mon, Aug 31, 2015 at 4:02 PM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote: >>> On 08/31/2015 03:09 AM, Matan Barak wrote: >> >>>> Actually, it looks like the dump stack we've got before [1] was fixed. >>>> This happens when the mlx4 driver is used in setups where number of >>>> cores >= 32. >> >>>> Doug, is that the case? >> >>> Indeed, 48 cores on this machine. >> >> so do we have bingo here? the patch is in the net-next tree (and we >> can't put it in 4.2 only through -stable since 4.2 is released by >> now), does it solves the problem? > > Yes, it solved the problem. I pulled the patch into my testing branch > to confirm. Good. Something is still strange w.r.t your environment... you said that when you were submitting the 4.2-rc changes this machine worked, however the problematic code in mlx4_enable_msi_x existed there by the time you made that testing, Matan - agree? Or. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 11+ messages in thread
[parent not found: <CAJ3xEMh-EXsoWsMhSf2ho_U_tz5tCUz2iWK+YZ3d76j5B7HJxg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <CAJ3xEMh-EXsoWsMhSf2ho_U_tz5tCUz2iWK+YZ3d76j5B7HJxg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2015-09-01 8:42 ` Matan Barak [not found] ` <CAAKD3BCbZpzG3g+H3xkuBM+Y9G14DZ=tJXeaGe5_ccQpiqWmCQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 0 siblings, 1 reply; 11+ messages in thread From: Matan Barak @ 2015-09-01 8:42 UTC (permalink / raw) To: Or Gerlitz Cc: Doug Ledford, Matan Barak, linux-rdma, Amir Vadai, Jack Morgenstein On Tue, Sep 1, 2015 at 9:40 AM, Or Gerlitz <gerlitz.or-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote: > On Tue, Sep 1, 2015 at 1:13 AM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote: >> On 08/31/2015 04:21 PM, Or Gerlitz wrote: >>> On Mon, Aug 31, 2015 at 4:02 PM, Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote: >>>> On 08/31/2015 03:09 AM, Matan Barak wrote: >>> >>>>> Actually, it looks like the dump stack we've got before [1] was fixed. >>>>> This happens when the mlx4 driver is used in setups where number of >>>>> cores >= 32. >>> >>>>> Doug, is that the case? >>> >>>> Indeed, 48 cores on this machine. >>> >>> so do we have bingo here? the patch is in the net-next tree (and we >>> can't put it in 4.2 only through -stable since 4.2 is released by >>> now), does it solves the problem? >> >> Yes, it solved the problem. I pulled the patch into my testing branch >> to confirm. > > Good. Something is still strange w.r.t your environment... you said that > when you were submitting the 4.2-rc changes this machine worked, however > the problematic code in mlx4_enable_msi_x existed there by the time you > made that testing, Matan - agree? > > Or. If I recall, this code was sent through linux-net mailing list. So it's possible the different branches weren't rebased, isn't it? > -- > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org > More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 11+ messages in thread
[parent not found: <CAAKD3BCbZpzG3g+H3xkuBM+Y9G14DZ=tJXeaGe5_ccQpiqWmCQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <CAAKD3BCbZpzG3g+H3xkuBM+Y9G14DZ=tJXeaGe5_ccQpiqWmCQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2015-09-01 9:50 ` Or Gerlitz [not found] ` <CAJ3xEMg1qoCrkMoymnbe_ww50Cg7yjOf4y3fTSb7ngoqbCZ1Hg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 0 siblings, 1 reply; 11+ messages in thread From: Or Gerlitz @ 2015-09-01 9:50 UTC (permalink / raw) To: Matan Barak, Doug Ledford Cc: Matan Barak, linux-rdma, Amir Vadai, Jack Morgenstein On Tue, Sep 1, 2015 at 11:42 AM, Matan Barak <matanb-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org> wrote: > If I recall, this code was sent through linux-net mailing list. So > it's possible the different branches weren't rebased, isn't it? but the code was merged for 4.2-rc1 -- so Doug, this means that when you did the 4.2-rc work you haven't rebased to 4.2-rc1 -- if this is indeed the case, such practice is problematic, agree? Or. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 11+ messages in thread
[parent not found: <CAJ3xEMg1qoCrkMoymnbe_ww50Cg7yjOf4y3fTSb7ngoqbCZ1Hg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: mlx4 problems with 4.2-rc8 [not found] ` <CAJ3xEMg1qoCrkMoymnbe_ww50Cg7yjOf4y3fTSb7ngoqbCZ1Hg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2015-09-01 13:54 ` Doug Ledford 0 siblings, 0 replies; 11+ messages in thread From: Doug Ledford @ 2015-09-01 13:54 UTC (permalink / raw) To: Or Gerlitz, Matan Barak Cc: Matan Barak, linux-rdma, Amir Vadai, Jack Morgenstein [-- Attachment #1: Type: text/plain, Size: 1197 bytes --] On 09/01/2015 05:50 AM, Or Gerlitz wrote: > On Tue, Sep 1, 2015 at 11:42 AM, Matan Barak <matanb-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org> wrote: > >> If I recall, this code was sent through linux-net mailing list. So >> it's possible the different branches weren't rebased, isn't it? > > but the code was merged for 4.2-rc1 -- so Doug, this means that when > you did the 4.2-rc work you haven't rebased to 4.2-rc1 -- if this is > indeed the case, > such practice is problematic, agree? That depends. Early on in the rc series, I don't rebase for testing. Later on in the rc series, I do. However, that's all moot because it isn't why I didn't see this in the 4.2 rc series. I only have one machine with > 32 CPUs, and it's intended to be a highly reliable/stable machine. I don't always use it for testing new kernels. In this particular case, I needed to move it from a RHEL kernel to an upstream kernel to isolate a different issue, and hence I ran across this problem. My more common test machines would have never seen the issue I reported. -- Doug Ledford <dledford-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> GPG KeyID: 0E572FDD [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 884 bytes --] ^ permalink raw reply [flat|nested] 11+ messages in thread
end of thread, other threads:[~2015-09-01 13:54 UTC | newest]
Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-08-29 5:27 mlx4 problems with 4.2-rc8 Doug Ledford
[not found] ` <55E142DC.8060205-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2015-08-30 1:13 ` Or Gerlitz
[not found] ` <CAJ3xEMj5By11L3qbSKxcEiMarB6CeyeERMnuK_vvH11VLLFypw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-08-30 22:38 ` Doug Ledford
[not found] ` <55E385DB.2-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2015-08-31 7:09 ` Matan Barak
[not found] ` <55E3FDAB.10706-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2015-08-31 13:02 ` Doug Ledford
[not found] ` <55E45058.1070105-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2015-08-31 20:21 ` Or Gerlitz
[not found] ` <CAJ3xEMjp+3Y0y2d-K-zi9SnBSshN_C5x5KLY3oCpD_XriDCsWw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-08-31 22:13 ` Doug Ledford
[not found] ` <55E4D1A1.3060608-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2015-09-01 6:40 ` Or Gerlitz
[not found] ` <CAJ3xEMh-EXsoWsMhSf2ho_U_tz5tCUz2iWK+YZ3d76j5B7HJxg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-09-01 8:42 ` Matan Barak
[not found] ` <CAAKD3BCbZpzG3g+H3xkuBM+Y9G14DZ=tJXeaGe5_ccQpiqWmCQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-09-01 9:50 ` Or Gerlitz
[not found] ` <CAJ3xEMg1qoCrkMoymnbe_ww50Cg7yjOf4y3fTSb7ngoqbCZ1Hg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-09-01 13:54 ` Doug Ledford
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).