From: David Hildenbrand <david@redhat.com>
To: Nadav Amit <nadav.amit@gmail.com>
Cc: Li Wang <liwang@redhat.com>,
Andrew Morton <akpm@linux-foundation.org>,
linux-kselftest@vger.kernel.org,
"open list:MEMORY MANAGEMENT" <linux-mm@kvack.org>,
Peter Xu <peterx@redhat.com>,
Aruna Ramakrishna <aruna.ramakrishna@oracle.com>,
Bagas Sanjaya <bagasdotme@gmail.com>,
Catalin Marinas <catalin.marinas@arm.com>,
Dave Hansen <dave.hansen@linux.intel.com>,
Joey Gouly <joey.gouly@arm.com>,
Johannes Weiner <hannes@cmpxchg.org>,
Keith Lucas <keith.lucas@oracle.com>,
Ryan Roberts <ryan.roberts@arm.com>,
Shuah Khan <shuah@kernel.org>
Subject: Re: [PATCH v2] selftests/mm: Fix UFFDIO_API usage with proper two-step feature negotiation
Date: Tue, 24 Jun 2025 13:39:09 +0200 [thread overview]
Message-ID: <239f75e4-1868-4ac9-882f-664a8863b781@redhat.com> (raw)
In-Reply-To: <611F9598-A1A4-47B6-B37E-09BF7B4D17D0@gmail.com>
On 24.06.25 13:29, Nadav Amit wrote:
>
>
>> On 24 Jun 2025, at 11:22, David Hildenbrand <david@redhat.com> wrote:
>>
>> On 24.06.25 10:07, David Hildenbrand wrote:
>>>>
>>> Is that actually required?
>>> The man page explicitly documents:
>>> " EINVAL A previous UFFDIO_API call already enabled one or more
>>> features for this userfaultfd. Calling UFF‐
>>> DIO_API twice, the first time with no features set, is
>>> explicitly allowed as per the two-step feature
>>> detection handshake.
>>> "
>>> So if that doesn't work, something might be broken.
>>
>> CCing Nadav and Peter:
>>
>> Could it be that
>>
>> commit 22e5fe2a2a279d9a6fcbdfb4dffe73821bef1c90
>> Author: Nadav Amit <nadav.amit@gmail.com>
>> Date: Thu Sep 2 14:58:59 2021 -0700
>>
>> userfaultfd: prevent concurrent API initialization
>> userfaultfd assumes that the enabled features are set once and never
>> changed after UFFDIO_API ioctl succeeded.
>> However, currently, UFFDIO_API can be called concurrently from two
>> different threads, succeed on both threads and leave userfaultfd's
>> features in non-deterministic state. Theoretically, other uffd operations
>> (ioctl's and page-faults) can be dispatched while adversely affected by
>> such changes of features.
>> Moreover, the writes to ctx->state and ctx->features are not ordered,
>> which can - theoretically, again - let userfaultfd_ioctl() think that
>> userfaultfd API completed, while the features are still not initialized.
>> To avoid races, it is arguably best to get rid of ctx->state. Since there
>> are only 2 states, record the API initialization in ctx->features as the
>> uppermost bit and remove ctx->state.
>>
>> Accidentally broke the documented two-step handshake in the man page where we
>> can avoid closing + reopening the fd?
>
> I agree the code is not correct (and my patch didn’t address this issue),
> but I don’t see it broke it either.
>
> Unless I’m missing something the code before my patch, when
> uffdio_api.features == 0, also set ctx->state to UFFD_STATE_RUNNING, which
> meant another invocation would see (ctx->state != UFFD_STATE_WAIT_API) and
> fail.
You might be right, I only checked the cmpxchg, assuming it was working
before that.
... but staring at the history of the "ctx->state =
UFFD_STATE_RUNNING;", I am not sure if it ever behaved that way.
Do maybe, the man page is simply wrong (although I wonder why that case
was described that detailed)
>
>>
>> Without testing, the following might fix it if I am right:
>>
>> diff --git a/fs/userfaultfd.c b/fs/userfaultfd.c
>> index 22f4bf956ba1c..f03e7c980e1c5 100644
>> --- a/fs/userfaultfd.c
>> +++ b/fs/userfaultfd.c
>> @@ -1944,9 +1944,9 @@ static int userfaultfd_move(struct userfaultfd_ctx *ctx,
>> static int userfaultfd_api(struct userfaultfd_ctx *ctx,
>> unsigned long arg)
>> {
>> + unsigned int new_features, old_features = 0;
>> struct uffdio_api uffdio_api;
>> void __user *buf = (void __user *)arg;
>> - unsigned int ctx_features;
>> int ret;
>> __u64 features;
>> @@ -1990,9 +1990,12 @@ static int userfaultfd_api(struct userfaultfd_ctx *ctx,
>> goto out;
>> /* only enable the requested features for this uffd context */
>> - ctx_features = uffd_ctx_features(features);
>> + new_features = uffd_ctx_features(features);
>> + /* allow two-step handshake */
>> + if (userfaultfd_is_initialized(ctx))
>> + old_features = UFFD_FEATURE_INITIALIZED;
>> ret = -EINVAL;
>> - if (cmpxchg(&ctx->features, 0, ctx_features) != 0)
>> + if (cmpxchg(&ctx->features, old_features, new_features) != old_features)
>> goto err_out;
>> ret = 0;
>
> I am not sure it is right since you would return EINVAL in this case.
> It also looks a bit overly complicated - are you concerned about a race?
Yes.
> My whole concern about race was that somebody would exploit it to
> overcome non-cooperative UFFD (IIRC).
>
> So perhaps just add a check for the case features if 0 and be done with
> it? Something like adding:
>
> ret = 0;
> if (ctx->features == 0 && features == 0)
> goto err_out; /* no error but copying of uffdio_api required */
Probably would also work. But let's find out first if we even want to
fix this, given that it never seemed to have behaved that way from a
quick glimpse.
--
Cheers,
David / dhildenb
next prev parent reply other threads:[~2025-06-24 11:39 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-06-22 8:10 [PATCH] mm/selftests: improve UFFD-WP feature detection in KSM test Li Wang
2025-06-23 8:33 ` David Hildenbrand
2025-06-24 3:43 ` Li Wang
2025-06-24 4:24 ` [PATCH v2] selftests/mm: Fix UFFDIO_API usage with proper two-step feature negotiation Li Wang
2025-06-24 8:07 ` David Hildenbrand
2025-06-24 8:22 ` David Hildenbrand
2025-06-24 11:29 ` Nadav Amit
2025-06-24 11:39 ` David Hildenbrand [this message]
2025-06-24 11:48 ` David Hildenbrand
2025-06-24 15:03 ` Peter Xu
2025-06-24 15:17 ` David Hildenbrand
2025-06-24 15:17 ` David Hildenbrand
2025-06-25 0:34 ` Li Wang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=239f75e4-1868-4ac9-882f-664a8863b781@redhat.com \
--to=david@redhat.com \
--cc=akpm@linux-foundation.org \
--cc=aruna.ramakrishna@oracle.com \
--cc=bagasdotme@gmail.com \
--cc=catalin.marinas@arm.com \
--cc=dave.hansen@linux.intel.com \
--cc=hannes@cmpxchg.org \
--cc=joey.gouly@arm.com \
--cc=keith.lucas@oracle.com \
--cc=linux-kselftest@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=liwang@redhat.com \
--cc=nadav.amit@gmail.com \
--cc=peterx@redhat.com \
--cc=ryan.roberts@arm.com \
--cc=shuah@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).