From: "Jiayuan Chen" <jiayuan.chen@linux.dev>
To: "John Fastabend" <john.fastabend@gmail.com>,
"Cong Wang" <xiyou.wangcong@gmail.com>
Cc: bpf@vger.kernel.org, "Jakub Sitnicki" <jakub@cloudflare.com>,
"David S. Miller" <davem@davemloft.net>,
"Eric Dumazet" <edumazet@google.com>,
"Jakub Kicinski" <kuba@kernel.org>,
"Paolo Abeni" <pabeni@redhat.com>,
"Simon Horman" <horms@kernel.org>,
"Cong Wang" <cong.wang@bytedance.com>,
"Alexei Starovoitov" <ast@kernel.org>,
netdev@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH bpf-next v1] bpf, sockmap: Fix concurrency issues between memory charge and uncharge
Date: Tue, 20 May 2025 15:10:34 +0000 [thread overview]
Message-ID: <e568d71aa0c1f7397c755ce6f0a71540931ebc3e@linux.dev> (raw)
In-Reply-To: <20250519200003.46elezpkkfx5grl4@gmail.com>
May 20, 2025 at 04:00, "John Fastabend" <john.fastabend@gmail.com> wrote:
>
> On 2025-05-18 11:48:31, Cong Wang wrote:
>
[...]
> >
> > Solution:
> >
> > 1. Add locking to the kfree_sk_msg() process, which is only called in the
> >
> > user process context.
> >
> > 2. Integrate the charge process into sk_psock_create_ingress_msg() in the
> >
> > backlog process and add locking.
> >
> > 3. Reuse the existing psock->ingress_lock.
> >
> >
> >
> > Reusing the psock->ingress_lock looks weird to me, as it is intended for
> >
> > locking ingress queue, at least at the time it was introduced.
> >
> >
> >
> > And technically speaking, it is the sock lock which is supposed to serialize
> >
> > socket charging.
> >
> >
> >
> > So is there any better solution here?
> >
>
> Agree I would be more apt to add the sock_lock back to the backlog then
>
> to punish fast path this way.
>
> Holding the ref cnt on the psock stops blocks the sk_psock_destroy() in
>
> backlog now so is this still an issue?
>
> Thanks,
>
> John
>
Thanks to Cong and John for their feedback.
For TCP, lock_sock(sk) works as expected. However, since we now support
multiple socket types (UNIX, UDP), the locking mechanism must be adapted
accordingly.
For UNIX sockets, we must use u->iolock instead of lock_sock(sk) in the
backlog path. This is because we already acquire lock(u->iolock) in both:
'''
unix_bpf_recvmsg() (BPF handler)
unix_stream_read_generic() (native handler)
'''
For UDP, the native handler __skb_recv_udp() locks udp_sk(sk)->reader_queue->lock,
but no locking is implemented in udp_bpf_recvmsg(). This implies that ingress_lock
effectively serves the same purpose as udp_sk(sk)->reader_queue->lock to prevent
concurrent user-space access.
Conclusion:
To avoid using ingress_lock, we need to implement a per-socket locking strategy into psock:
Default implementation: lock_sock(sk)
UNIX sockets: Use lock(u->iolock) in backlog path.
UDP sockets: Explicitly use reader_queue->lock both in udp_bpf_recvmsg() and backlog path.
As of now, I don’t have any better ideas.
prev parent reply other threads:[~2025-05-20 15:10 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-05-08 6:24 [PATCH bpf-next v1] bpf, sockmap: Fix concurrency issues between memory charge and uncharge Jiayuan Chen
2025-05-18 18:48 ` Cong Wang
2025-05-19 20:00 ` John Fastabend
2025-05-20 15:10 ` Jiayuan Chen [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=e568d71aa0c1f7397c755ce6f0a71540931ebc3e@linux.dev \
--to=jiayuan.chen@linux.dev \
--cc=ast@kernel.org \
--cc=bpf@vger.kernel.org \
--cc=cong.wang@bytedance.com \
--cc=davem@davemloft.net \
--cc=edumazet@google.com \
--cc=horms@kernel.org \
--cc=jakub@cloudflare.com \
--cc=john.fastabend@gmail.com \
--cc=kuba@kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=pabeni@redhat.com \
--cc=xiyou.wangcong@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.