public inbox for linux-rdma@vger.kernel.org
 help / color / mirror / Atom feed
* Regression: Connect-X5 doesn't connect with NVME-of
@ 2018-02-01 17:56 Logan Gunthorpe
       [not found] ` <66a5332c-01ee-7a39-8224-189fa52a7298-OTvnGxWRz7hWk0Htik3J/w@public.gmane.org>
  0 siblings, 1 reply; 8+ messages in thread
From: Logan Gunthorpe @ 2018-02-01 17:56 UTC (permalink / raw)
  To: linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
  Cc: Max Gurtovoy, Stephen Bates, Saeed Mahameed

Hello,

We've experienced a regression with using nvme-of and two Connect-X5s. 
With v4.15 and v4.14.16 we see the following dmesgs when trying to 
connect to the target:

> [   43.732539] nvme nvme2: creating 16 I/O queues.
> [   44.072427] nvmet: adding queue 1 to ctrl 1.
> [   44.072553] nvmet: adding queue 2 to ctrl 1.
> [   44.072597] nvme nvme2: Connect command failed, error wo/DNR bit: -16402
> [   44.072609] nvme nvme2: failed to connect queue: 3 ret=-18
> [   44.075421] nvmet_rdma: freeing queue 2
> [   44.075792] nvmet_rdma: freeing queue 1
> [   44.264293] nvmet_rdma: freeing queue 3
> *snip*

(on v4.15 there is additional error panics likely do to some other 
nvme-of error handling bugs)

And nvme connect returns:

> Failed to write to /dev/nvme-fabrics: Invalid cross-device link

The two adapters are the same with the latest available firmware:

> 	transport:			InfiniBand (0)
> 	fw_ver:				16.21.2010
> 	vendor_id:			0x02c9
> 	vendor_part_id:			4119
> 	hw_ver:				0x0
> 	board_id:			MT_0000000010

We bisected to find the commit that broke our setup is:

05e0cc84e00c net/mlx5: Fix get vector affinity helper function

Thanks,

Logan
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2018-02-05 15:59 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2018-02-01 17:56 Regression: Connect-X5 doesn't connect with NVME-of Logan Gunthorpe
     [not found] ` <66a5332c-01ee-7a39-8224-189fa52a7298-OTvnGxWRz7hWk0Htik3J/w@public.gmane.org>
2018-02-03  4:53   ` Saeed Mahameed
     [not found]     ` <e6cdfbe7-762c-c70c-be5f-397bdb08ee80-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2018-02-03 22:46       ` Max Gurtovoy
2018-02-04  9:57       ` Sagi Grimberg
     [not found]         ` <0d629a68-a1fa-7297-e371-5abbc2dd5fe7-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
2018-02-05 11:23           ` Max Gurtovoy
     [not found]             ` <dbda15f0-f678-9fab-dffe-5e8d2ae24ae1-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2018-02-05 14:18               ` Sagi Grimberg
     [not found]                 ` <7941ee6c-bf13-093c-e5c2-9ed93889405d-NQWnxTmZq1alnMjI0IkVqw@public.gmane.org>
2018-02-05 15:44                   ` Laurence Oberman
     [not found]                     ` <1517845472.11655.3.camel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2018-02-05 15:59                       ` Max Gurtovoy

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox