From: Jason Gunthorpe <jgg@ziepe.ca>
To: Kevan Rehm <kevanrehm@gmail.com>
Cc: Mark Zhang <markzhang@nvidia.com>,
Leon Romanovsky <leon@kernel.org>,
"linux-rdma@vger.kernel.org" <linux-rdma@vger.kernel.org>,
Yishai Hadas <yishaih@nvidia.com>,
kevan.rehm@hpe.com, chien.tin.tung@intel.com
Subject: Re: Segfault in mlx5 driver on infiniband after application fork
Date: Mon, 12 Feb 2024 12:45:33 -0400 [thread overview]
Message-ID: <20240212164533.GG765010@ziepe.ca> (raw)
In-Reply-To: <0BC5DF9E-53A2-4224-8EDE-87B4F2407D56@gmail.com>
On Mon, Feb 12, 2024 at 11:37:39AM -0500, Kevan Rehm wrote:
>
>
> > On Feb 12, 2024, at 11:12 AM, Jason Gunthorpe <jgg@ziepe.ca> wrote:
> >
> > On Mon, Feb 12, 2024 at 11:04:36AM -0500, Kevan Rehm wrote:
> >
> >> Those routines call ibv_dontfork_range on the page after it’s been
> >> allocated via posix_memalign(). _add_page() then adds the new page
> >> to the mlx5_context field dbr_available_pages.
> >
> > Oh, if this is your trouble then upgrade your kernel. This part is
> > fixed on kernels that have working fork support.
>
> That’s the bit that confuses me; all this is happening in user
> space, what is different in the kernel that would prevent this
> problem from occurring in user space? Any guess as to how much
> newer a kernel must be?
Newer kernels are detected and disable the DONT_FORK calls in verbs.
rdma-core support is present since:
commit 67b00c3835a3480a035a9e1bcf5695f5c0e8568e
Author: Gal Pressman <galpress@amazon.com>
Date: Sun Apr 4 17:24:54 2021 +0300
verbs: Report when ibv_fork_init() is not needed
Identify kernels which do not require ibv_fork_init() to be called and
report it through the ibv_is_fork_initialized() verb.
The feature detection is done through a new read-only attribute in the
get sys netlink command. If the attribute is not reported, assume old
kernel without COF support. If the attribute is reported, use the
returned value.
This allows ibv_is_fork_initialized() to return the previously unused
IBV_FORK_UNNEEDED value, which takes precedence over the
DISABLED/ENABLED values. Meaning that if the kernel does not require a
call to ibv_fork_init(), IBV_FORK_UNNEEDED will be returned regardless
of whether ibv_fork_init() was called or not.
Signed-off-by: Gal Pressman <galpress@amazon.com>
The kernel support was in v5.13-rc1~78^2~1
And backported in a few cases.
Jason
next prev parent reply other threads:[~2024-02-12 16:45 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-02-11 19:24 Segfault in mlx5 driver on infiniband after application fork Kevan Rehm
2024-02-12 13:33 ` Jason Gunthorpe
2024-02-12 14:37 ` Kevan Rehm
2024-02-12 14:40 ` Jason Gunthorpe
2024-02-12 16:04 ` Kevan Rehm
2024-02-12 16:12 ` Jason Gunthorpe
2024-02-12 16:37 ` Kevan Rehm
2024-02-12 16:45 ` Jason Gunthorpe [this message]
2024-02-16 19:56 ` Kevan Rehm
-- strict thread matches above, loose matches on Subject: below --
2024-02-21 12:51 Kevan Rehm
2024-02-13 16:45 Kevan Rehm
2024-02-07 19:17 Rehm, Kevan
2024-02-08 8:52 ` Leon Romanovsky
2024-02-08 9:05 ` Mark Zhang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20240212164533.GG765010@ziepe.ca \
--to=jgg@ziepe.ca \
--cc=chien.tin.tung@intel.com \
--cc=kevan.rehm@hpe.com \
--cc=kevanrehm@gmail.com \
--cc=leon@kernel.org \
--cc=linux-rdma@vger.kernel.org \
--cc=markzhang@nvidia.com \
--cc=yishaih@nvidia.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox