All of lore.kernel.org
 help / color / mirror / Atom feed
From: Jason Gunthorpe <jgunthorpe-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org>
To: Joshua McBeth <joshua.mcbeth-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Cc: Leon Romanovsky <leonro-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>,
	linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Subject: Re: SR-IOV with mlx4 on ConnectX-2 fails with DMAR errors
Date: Wed, 14 Dec 2016 15:38:23 -0700	[thread overview]
Message-ID: <20161214223823.GA9986@obsidianresearch.com> (raw)
In-Reply-To: <CAN27Ff4P_11FQ-VuiPcL4S67uVh4xSJpqyZ0Q4AbPfd64fmO9g-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>

On Wed, Dec 14, 2016 at 10:06:13AM -0500, Joshua McBeth wrote:

> > Does this make your 4.8 kernel work? If yes, then I suspect mlx4 has
> > broken IB_DEVICE_LOCAL_DMA_LKEY with SRIOV.. Leon? mlx5 has this
> > broken, doesn't it?

> With 4.8.1 and the below applied to the SR-IOV host and guest kernels,
> SR-IOV functions in both the SR-IOV host and guests and there are no
> DMAR errors emitted.

So strange.

Looking at your original report you see these errors:

[  107.137484] DMAR: [DMA Read] Request device [05:06.1] fault addr

But I don't see where 05:06.01 is a PCI device. That seems like a big
problem.

Based on that this looks like a Mellanox bug where
IB_DEVICE_LOCAL_DMA_LKEY is causing the wrong PCI BDF to be provided
as the requestor. Mellanox will have to help you futher, you are
running the latest firmware, right?

> The NFS/RDMA client in the guest does not work on the SR-IOV virtual
> function with the NFS/RDMA server of the host on the SR-IOV physical
> function, but this may be something else I need to troubleshoot
> further, as both IPoIB and synthetic RDMA traffic passes between the
> guest, host, and remote node just fine.  The remote node's NFS/RDMA
> client is additionally able to function with the host's NFS/RDMA
> server on the SR-IOV physical function.

Try removing IB_DEVICE_LOCAL_DMA_LKEY from the mlx4 driver entirely..

> > It would also be very helpful to try and determine what memory the NIC is
> > trying to read.. If it is the ipoib packet or some mlx4 internal
> > thing.

> How can I determine this?

Print out the dma address of the skb when the SEND is submitted in
ipoib and see if it is similar to the DMAR region..

Jason
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

      parent reply	other threads:[~2016-12-14 22:38 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-12-12 16:04 SR-IOV with mlx4 on ConnectX-2 fails with DMAR errors Joshua McBeth
     [not found] ` <CAN27Ff4RYh3y_45PUxxXhGuDvrrrjm8qe38fj5JPq7oV2QmdYA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2016-12-13  3:57   ` Joshua McBeth
     [not found]     ` <CAN27Ff74Ov8bgsZP1QJJOwvWc5_KOW_U=PsP+=Z87AJz+VSAOw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2016-12-13 16:54       ` Jason Gunthorpe
     [not found]         ` <20161213165441.GB11099-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org>
2016-12-13 18:36           ` Joshua McBeth
     [not found]             ` <CAN27Ff7hEr4u5nyELjwRXb3W_t0TNefuuT4AzdjAoWWGRHqTFw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2016-12-13 19:01               ` Jason Gunthorpe
     [not found]                 ` <20161213190102.GA15119-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org>
2016-12-14 15:06                   ` Joshua McBeth
     [not found]                     ` <CAN27Ff4P_11FQ-VuiPcL4S67uVh4xSJpqyZ0Q4AbPfd64fmO9g-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2016-12-14 22:38                       ` Jason Gunthorpe [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20161214223823.GA9986@obsidianresearch.com \
    --to=jgunthorpe-epgobjl8dl3ta4ec/59zmfatqe2ktcn/@public.gmane.org \
    --cc=joshua.mcbeth-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org \
    --cc=leonro-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org \
    --cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.