From: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
To: Peter Zijlstra <peterz@infradead.org>, Andy Lutomirski <luto@kernel.org>
Cc: x86 <x86@kernel.org>, linux-kernel <linux-kernel@vger.kernel.org>,
Nicholas Piggin <npiggin@gmail.com>,
Arnd Bergmann <arnd@arndb.de>, Anton Blanchard <anton@ozlabs.org>
Subject: Re: [PATCH 3/3] membarrier: Propagate SYNC_CORE and RSEQ actions more carefully
Date: Tue, 1 Dec 2020 09:28:37 -0500 (EST) [thread overview]
Message-ID: <1044280457.69297.1606832917168.JavaMail.zimbra@efficios.com> (raw)
In-Reply-To: <20201201101637.GU2414@hirez.programming.kicks-ass.net>
----- On Dec 1, 2020, at 5:16 AM, Peter Zijlstra peterz@infradead.org wrote:
> On Mon, Nov 30, 2020 at 09:50:35AM -0800, Andy Lutomirski wrote:
>> membarrier() carefully propagates SYNC_CORE and RSEQ actions to all
>> other CPUs, but there are two issues.
>>
>> - membarrier() does not sync_core() or rseq_preempt() the calling
>> CPU. Aside from the logic being mind-bending, this also means
>> that it may not be safe to modify user code through an alias,
>> call membarrier(), and then jump to a different executable alias
>> of the same code.
>
> I always understood this to be on purpose. The calling CPU can fix up
> itself just fine. The pain point is fixing up the other CPUs, and that's
> where membarrier() helps.
Indeed, as documented in the man page:
MEMBARRIER_CMD_PRIVATE_EXPEDITED_SYNC_CORE (since Linux 4.16)
In addition to providing the memory ordering guarantees de‐
scribed in MEMBARRIER_CMD_PRIVATE_EXPEDITED, upon return from
system call the calling thread has a guarantee that all its run‐
ning thread siblings have executed a core serializing instruc‐
tion. This guarantee is provided only for threads in the same
process as the calling thread.
membarrier sync core guarantees a core serializing instruction on the siblings,
not on the caller thread. This has been done on purpose given that the caller
thread can always issue its core serializing instruction from user-space on
its own.
>
> That said, I don't mind including self, these aren't fast calls by any
> means.
I don't mind including self either, but this would require documentation
updates, including man pages, to state that starting from kernel Y this
is the guaranteed behavior. It's then tricky for user-space to query what
the behavior is unless we introduce a new membarrier command for it. So this
could introduce issues if software written for the newer kernels runs on older
kernels.
>
>> - membarrier() does not explicitly sync_core() remote CPUs either;
>> instead, it relies on the assumption that an IPI will result in a
>> core sync. On x86, I think this may be true in practice, but
>> it's not architecturally reliable. In particular, the SDM and
>> APM do not appear to guarantee that interrupt delivery is
>> serializing.
>
> Right, I don't think we rely on that, we do rely on interrupt delivery
> providing order though -- as per the previous email.
>
>> On a preemptible kernel, IPI return can schedule,
>> thereby switching to another task in the same mm that was
>> sleeping in a syscall. The new task could then SYSRET back to
>> usermode without ever executing IRET.
>
> This; I think we all overlooked this scenario.
Indeed, this is an issue which needs to be fixed.
>
>> This patch simplifies the code to treat the calling CPU just like
>> all other CPUs, and explicitly sync_core() on all target CPUs. This
>> eliminates the need for the smp_mb() at the end of the function
>> except in the special case of a targeted remote membarrier(). This
>> patch updates that code and the comments accordingly.
I am not confident that removing the smp_mb at the end of membarrier is
an appropriate change, nor that it simplifies the model.
This changes things from a model where we have a barrier at the beginning
and end of the membarrier system call, which nicely orders things happening
before/after the system call with respect to anything that is observed within
the system call (including the scheduler activity updating the runqueue's
current task), to a model where the memory barrier for the current thread
will be conditionally executed after we have sent the IPIs, and unconditionally
when issuing smp_call_function* on self.
About the documentation of the membarrier scenario, I think it is redundant
with a documentation patch I already have sitting in -tip (scenario A):
https://git.kernel.org/tip/25595eb6aaa9fbb31330f1e0b400642694bc6574
Thanks,
Mathieu
--
Mathieu Desnoyers
EfficiOS Inc.
http://www.efficios.com
next prev parent reply other threads:[~2020-12-01 14:29 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-11-30 17:50 [PATCH 0/3] membarrier fixes Andy Lutomirski
2020-11-30 17:50 ` [PATCH 1/3] x86/membarrier: Get rid of a dubious optimization Andy Lutomirski
2020-12-01 14:39 ` Mathieu Desnoyers
2020-12-01 17:47 ` Andy Lutomirski
2020-11-30 17:50 ` [PATCH 2/3] membarrier: Add an actual barrier before rseq_preempt() Andy Lutomirski
2020-12-01 10:06 ` Peter Zijlstra
2020-12-01 14:31 ` Mathieu Desnoyers
2020-12-01 17:55 ` Andy Lutomirski
2020-11-30 17:50 ` [PATCH 3/3] membarrier: Propagate SYNC_CORE and RSEQ actions more carefully Andy Lutomirski
2020-12-01 10:16 ` Peter Zijlstra
2020-12-01 14:28 ` Mathieu Desnoyers [this message]
2020-12-01 18:12 ` Andy Lutomirski
2020-12-01 18:29 ` Mathieu Desnoyers
2020-12-01 18:48 ` Andy Lutomirski
2020-12-01 20:51 ` Mathieu Desnoyers
2020-12-01 18:09 ` Andy Lutomirski
2020-12-01 18:53 ` Peter Zijlstra
2020-12-01 18:55 ` Peter Zijlstra
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1044280457.69297.1606832917168.JavaMail.zimbra@efficios.com \
--to=mathieu.desnoyers@efficios.com \
--cc=anton@ozlabs.org \
--cc=arnd@arndb.de \
--cc=linux-kernel@vger.kernel.org \
--cc=luto@kernel.org \
--cc=npiggin@gmail.com \
--cc=peterz@infradead.org \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.