public inbox for linux-rdma@vger.kernel.org
 help / color / mirror / Atom feed
* 4.13 ib_mthca NULL pointer dereference with OpenSM
@ 2017-10-26 17:17 Chris Blake
       [not found] ` <CALpBJjoMLCqzVe5yKp4wCX-X-sH+=zv1QZfMvOWh2Ukh3c2LFg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  0 siblings, 1 reply; 26+ messages in thread
From: Chris Blake @ 2017-10-26 17:17 UTC (permalink / raw)
  To: linux-rdma-u79uwXL29TY76Z2rM5mHXA

Hello linux-rmda,

I recently upgraded one of my boxes to 4.13, and have started
experiencing issues with ib_mthca. To start, my setup is Infiniband
direct between 2 servers using older Mellanox Technologies MT25208
cards for ipoib as well as NFS over RDMA. After upgrading, the
following has been experienced:

1. On my NAS host running OpenSM, as soon as it starts I get a NULL
pointer dereference which makes infiniband unusable. [0] This only
occurs on kernel 4.13 or newer.

2. On my compute host not running OpenSM, connectivity works for a bit
but shortly after dmesg is full of the following message:
infiniband mthca0: ib_post_send_mad error
This occurs when my compute host is on kernel 4.13 or newer.

I went ahead and tested some mainline kernel versions on both of my
nodes, and here are my findings:
4.13.8 = NULL pointer dereference on NAS, IPoIB not working
4.12.14 = Works as expected
4.14.0-rc5 = NULL pointer dereference on NAS, IPoIB not working

I have tried to see if I could find the patch responsible for this,
but sadly I have not had much luck.

As for my systems, the following modules are loaded:
ib_uverbs
ib_umad
rdma_ucm
ib_mthca
ib_ipoib

Let me know if there is anything I can test to help diagnose what is
causing this issue.

Regards,
Chris Blake

[0]: https://gist.github.com/riptidewave93/48595b8bc3bca669251db7d8a8e8a803
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 26+ messages in thread

end of thread, other threads:[~2017-11-13 17:47 UTC | newest]

Thread overview: 26+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-10-26 17:17 4.13 ib_mthca NULL pointer dereference with OpenSM Chris Blake
     [not found] ` <CALpBJjoMLCqzVe5yKp4wCX-X-sH+=zv1QZfMvOWh2Ukh3c2LFg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2017-10-29 19:11   ` Leon Romanovsky
     [not found]     ` <20171029191114.GO16127-U/DQcQFIOTAAJjI8aNfphQ@public.gmane.org>
2017-10-29 19:35       ` Chris Blake
     [not found]         ` <CALpBJjoJpgR1QB-gzRJLOjGtjK4T2H4EbG1-R+ndodvznWkgwA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2017-10-30  7:19           ` Leon Romanovsky
     [not found]             ` <20171030071956.GU16127-U/DQcQFIOTAAJjI8aNfphQ@public.gmane.org>
2017-10-30 18:39               ` Chris Blake
     [not found]                 ` <CALpBJjqWrXu3b12EG2-AWKwKs9v2q5pTf11rXSy04C-5dGta6A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2017-10-30 19:00                   ` Leon Romanovsky
2017-10-30 23:01                   ` Jason Gunthorpe
     [not found]                     ` <20171030230156.GA4081-uk2M96/98Pc@public.gmane.org>
2017-10-31  3:16                       ` Parav Pandit
     [not found]                         ` <VI1PR0502MB300860F7C004ABFB01079F53D15E0-o1MPJYiShExKsLr+rGaxW8DSnupUy6xnnBOFsp37pqbUKgpGm//BTAC/G2K4zDHf@public.gmane.org>
2017-10-31  4:24                           ` Jason Gunthorpe
     [not found]                             ` <20171031042435.GB7961-uk2M96/98Pc@public.gmane.org>
2017-10-31  5:11                               ` Leon Romanovsky
2017-10-31 12:49                               ` Hal Rosenstock
     [not found]                                 ` <e2a2aa84-71ce-36f0-91af-0854b24d4a6a-LDSdmyG8hGV8YrgS2mwiifqBs+8SCbDb@public.gmane.org>
2017-10-31 15:01                                   ` Daniel Jurgens
     [not found]                                     ` <60de3bab-f294-dd11-bcae-d179115f7c31-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2017-10-31 15:09                                       ` Jason Gunthorpe
     [not found]                                         ` <20171031150901.GA9852-uk2M96/98Pc@public.gmane.org>
2017-10-31 15:12                                           ` Daniel Jurgens
2017-10-31 15:15                                           ` Leon Romanovsky
     [not found]                                             ` <20171031151521.GK16127-U/DQcQFIOTAAJjI8aNfphQ@public.gmane.org>
2017-10-31 15:20                                               ` Chris Blake
2017-10-31 15:20                                               ` Daniel Jurgens
     [not found]                                                 ` <5ab2d58d-8af8-c075-86f0-7010b8316cba-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2017-10-31 17:49                                                   ` Chris Blake
     [not found]                                                     ` <CALpBJjorS-AK3Ua9UuUnWv33rYqBoeAXF9vsFALTff5Qv3hNSg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2017-10-31 17:58                                                       ` Parav Pandit
2017-10-31 18:22                                                       ` Daniel Jurgens
     [not found]                                                         ` <fa1abfd0-e9fc-e842-8a21-c214e9dc8e9e-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2017-10-31 18:29                                                           ` Chris Blake
     [not found]                                                             ` <CALpBJjpPzeBMu1QGS8KOmv33A8CTJ2YjFhEhqaPBUpcVy+i3xg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2017-10-31 18:36                                                               ` Chris Blake
2017-10-31 19:23                                                           ` Jason Gunthorpe
     [not found]                                                             ` <20171031192340.GA18578-uk2M96/98Pc@public.gmane.org>
2017-10-31 19:44                                                               ` Daniel Jurgens
     [not found]                                                                 ` <799ce928-2e7c-7f60-951a-3a88544b7510-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
2017-11-12 23:38                                                                   ` Chris Blake
     [not found]                                                                     ` <CALpBJjogf5ONOmMTr9Z8RQ_78aON=_zSSJ+K-_VQyRRXO6vkAQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2017-11-13 17:47                                                                       ` Daniel Jurgens

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox