From: Jakub Kicinski <kuba@kernel.org>
To: Kirill Tkhai <tkhai@ya.ru>, "Paul E. McKenney" <paulmck@kernel.org>
Cc: Linux Kernel Network Developers <netdev@vger.kernel.org>,
davem@davemloft.net, edumazet@google.com, pabeni@redhat.com,
kuniyu@amazon.com, gorcunov@gmail.com
Subject: Re: [PATCH net-next] unix: Guarantee sk_state relevance in case of it was assigned by a task on other cpu
Date: Wed, 25 Jan 2023 22:10:53 -0800 [thread overview]
Message-ID: <20230125221053.301c0341@kernel.org> (raw)
In-Reply-To: <6953ec3b-6c48-954e-f3db-63450a5ab886@ya.ru>
On Thu, 26 Jan 2023 00:09:08 +0300 Kirill Tkhai wrote:
> 1)There are a many combinations with third task involved:
>
> [CPU0:Task0] [CPU1:Task1] [CPU2:Task2]
> listen(sk)
> kernel:
> sk_diag_fill(sk)
> rep->udiag_state = TCP_LISTEN
> return_from_syscall
> userspace:
> mutex_lock()
> shared_mem_var = rep->udiag_state
> mutex_unlock()
>
> userspace:
> mutex_lock()
> if (shared_mem_var == TCP_LISTEN)
> accept(sk); /* -> fail, since sk_state is not visible */
> mutex_unlock()
>
> In this situation Task2 definitely knows Task0's listen() has succeed, but there is no a possibility
> to guarantee its accept() won't fail. Despite there are appropriate barriers in mutex_lock() and mutex_unlock(),
> there is no a possibility to add a barrier on CPU1 to make Task0's store visible on CPU2.
Me trying to prove that memory ordering is transitive would be 100%
speculation. Let's ask Paul instead - is the above valid? Or the fact
that CPU1 observes state from CPU0 and is strongly ordered with CPU2
implies that CPU2 will also observe CPU0's state?
> 2)My understanding is chronologically later accept() mustn't miss sk_state.
> Otherwise, kernel says that ordering between internal syscalls data
> is userspace duty, which is wrong. Userspace knows nothing about internal
> kernel data.
> 3)Such possible situations in log file also look strange:
Dunno those points are a bit subjective. Squeezing perf out of
distributed systems requires sacrifices 🤷️
next prev parent reply other threads:[~2023-01-26 6:11 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-01-22 22:21 [PATCH net-next] unix: Guarantee sk_state relevance in case of it was assigned by a task on other cpu Kirill Tkhai
2023-01-24 17:57 ` Kuniyuki Iwashima
2023-01-24 21:05 ` Kirill Tkhai
2023-01-24 22:32 ` Cyrill Gorcunov
2023-01-25 1:35 ` Jakub Kicinski
2023-01-25 21:09 ` Kirill Tkhai
2023-01-26 6:10 ` Jakub Kicinski [this message]
2023-01-26 20:25 ` Paul E. McKenney
2023-01-26 21:33 ` Jakub Kicinski
2023-01-26 21:47 ` Paul E. McKenney
2023-02-02 19:42 ` Kirill Tkhai
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20230125221053.301c0341@kernel.org \
--to=kuba@kernel.org \
--cc=davem@davemloft.net \
--cc=edumazet@google.com \
--cc=gorcunov@gmail.com \
--cc=kuniyu@amazon.com \
--cc=netdev@vger.kernel.org \
--cc=pabeni@redhat.com \
--cc=paulmck@kernel.org \
--cc=tkhai@ya.ru \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).