From: Leon Romanovsky <leon@kernel.org>
To: Chuck Lever <chuck.lever@oracle.com>
Cc: Timo Rothenpieler <timo@rothenpieler.org>,
Linux NFS Mailing List <linux-nfs@vger.kernel.org>,
linux-rdma <linux-rdma@vger.kernel.org>
Subject: Re: NFS over RDMA issues on Linux 5.4
Date: Tue, 4 Aug 2020 12:36:35 +0300 [thread overview]
Message-ID: <20200804093635.GA4432@unreal> (raw)
In-Reply-To: <CE6C02CE-3EEB-4834-B499-376BC6020A17@oracle.com>
On Mon, Aug 03, 2020 at 12:24:21PM -0400, Chuck Lever wrote:
> Hi Timo-
>
> > On Aug 3, 2020, at 11:05 AM, Timo Rothenpieler <timo@rothenpieler.org> wrote:
> >
> > Hello,
> >
> > I have just deployed a new system with Mellanox ConnectX-4 VPI EDR IB cards and wanted to setup NFS over RDMA on it.
> >
> > However, while mounting the FS over RDMA works fine, actually using it results in the following messages absolutely hammering dmesg on both client and server:
> >
> >> https://gist.github.com/BtbN/9582e597b6581f552fa15982b0285b80#file-server-log
> >
> > The spam only stops once I forcibly reboot the client. The filesystem gets nowhere during all this. The retrans counter in nfsstat just keeps going up, nothing actually gets done.
> >
> > This is on Linux 5.4.54, using nfs-utils 2.4.3.
> > The mlx5 driver had enhanced-mode disabled in order to enable IPoIB connected mode with an MTU of 65520.
> >
> > Normal NFS 4.2 over tcp works perfectly fine on this setup, it's only when I mount via rdma that things go wrong.
> >
> > Is this an issue on my end, or did I run into a bug somewhere here?
> > Any pointers, patches and solutions to test are welcome.
>
> I haven't seen that failure mode here, so best I can recommend is
> keep investigating. I've copied linux-rdma in case they have any
> advice.
The mentioning of IPoIB is a slightly confusing in the context of NFS-over-RDMA.
Are you running NFS over IPoIB?
From brief look on CQE error syndrome (local length error), the client sends wrong WQE.
Thanks
>
> --
> Chuck Lever
>
>
>
next prev parent reply other threads:[~2020-08-04 9:36 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <8a1087d3-9add-dfe1-da0c-edab74fcca51@rothenpieler.org>
2020-08-03 16:24 ` NFS over RDMA issues on Linux 5.4 Chuck Lever
2020-08-04 9:36 ` Leon Romanovsky [this message]
2020-08-04 10:52 ` Timo Rothenpieler
2020-08-04 12:25 ` Leon Romanovsky
2020-08-04 12:49 ` Chuck Lever
2020-08-04 13:08 ` Timo Rothenpieler
2020-08-04 13:12 ` Chuck Lever
2020-08-04 13:19 ` Timo Rothenpieler
2020-08-04 13:24 ` Chuck Lever
2020-08-04 13:40 ` Timo Rothenpieler
2020-08-04 13:46 ` Leon Romanovsky
2020-08-04 13:53 ` Chuck Lever
2020-08-04 15:34 ` Chuck Lever
2020-08-04 15:39 ` Timo Rothenpieler
2020-08-04 15:46 ` Chuck Lever
2020-08-04 15:50 ` Timo Rothenpieler
2020-08-04 16:07 ` Timo Rothenpieler
2020-08-04 15:55 ` Leon Romanovsky
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200804093635.GA4432@unreal \
--to=leon@kernel.org \
--cc=chuck.lever@oracle.com \
--cc=linux-nfs@vger.kernel.org \
--cc=linux-rdma@vger.kernel.org \
--cc=timo@rothenpieler.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox