From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-6.8 required=3.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_PATCH, MAILING_LIST_MULTI,SIGNED_OFF_BY,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id E56B9C352A2 for ; Fri, 7 Feb 2020 10:56:44 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id BCE3D20838 for ; Fri, 7 Feb 2020 10:56:44 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (1024-bit key) header.d=cloudflare.com header.i=@cloudflare.com header.b="RcYhavdZ" Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727068AbgBGK4n (ORCPT ); Fri, 7 Feb 2020 05:56:43 -0500 Received: from mail-wm1-f67.google.com ([209.85.128.67]:36962 "EHLO mail-wm1-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726857AbgBGK4n (ORCPT ); Fri, 7 Feb 2020 05:56:43 -0500 Received: by mail-wm1-f67.google.com with SMTP id f129so2223599wmf.2 for ; Fri, 07 Feb 2020 02:56:40 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cloudflare.com; s=google; h=references:user-agent:from:to:cc:subject:in-reply-to:date :message-id:mime-version; bh=Uixk1JfPRfZiskd4b5V0w8uIZZOxCYEiV7QUtBMbUWw=; b=RcYhavdZsLpqgP20VvYdLOC0mNiSX7lTL2sqhQ59iQIXWipEC6zKbuCeHoP4GY0t0B EZIx3Ipyt0LTHJziQIZ1vSHojZsnKA2EhadixIE+ES0+DXsiu4vG0jm9Bte5yyIaHdZk XYZfwMvMAtFU9q03VwDKV4aSHkIyO25r0r+mo= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:references:user-agent:from:to:cc:subject :in-reply-to:date:message-id:mime-version; bh=Uixk1JfPRfZiskd4b5V0w8uIZZOxCYEiV7QUtBMbUWw=; b=XzL/pbkjJVDCB+zMooH5W+4H+pHWu089RMtaN0Fo9s1MeqcFbCkBOB+gZg03+TVd5L Vnnp+J4fNkEPMKnCnPzZxlIktnQ0EcjrIBjs94hPA1PofaLbxSI31lm6JGII/Z/wI4UP ZBJwMaWbhN57hnHuA5cveM4FDk3RvkhrCQoX7zcYm29Ms6B5yUVuTfLZIPknDeCHQ2SK uXK7Mbgupaiz3fRdwSUCezmdzAGHBlW5SbjQh+B8kFekIKAVdRQBKsbLGvdMAHO0hvbp l9kinUeWa75g7mmrhEwD9ficiW/MHoLqhKO+poYvzobz8OFOJvj/IV32MmNM+OixSXHb qjEw== X-Gm-Message-State: APjAAAVJYH8XUYn8WNZhHGfKOL1yxNyOfaq67i+JdG5Wx0VOB/chP+Gn AwvJnlonTBage3Ps1Zxa8ijK8w== X-Google-Smtp-Source: APXvYqwW8UfEVnYheCnZ/OzASRGSTOcK7iOubLtK+9VH5Wdat0CZTnVAZLLgZZludphRfIz5vL23ow== X-Received: by 2002:a7b:cb91:: with SMTP id m17mr3612169wmi.146.1581072999762; Fri, 07 Feb 2020 02:56:39 -0800 (PST) Received: from cloudflare.com ([176.221.114.230]) by smtp.gmail.com with ESMTPSA id a62sm2953727wmh.33.2020.02.07.02.56.38 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 07 Feb 2020 02:56:39 -0800 (PST) References: <20200207103713.28175-1-lmb@cloudflare.com> User-agent: mu4e 1.1.0; emacs 26.3 From: Jakub Sitnicki To: Lorenz Bauer Cc: John Fastabend , Daniel Borkmann , "David S. Miller" , Jakub Kicinski , Alexei Starovoitov , Martin KaFai Lau , Song Liu , Yonghong Song , Andrii Nakryiko , kernel-team@cloudflare.com, netdev@vger.kernel.org, bpf@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH bpf] bpf: sockmap: check update requirements after locking In-reply-to: <20200207103713.28175-1-lmb@cloudflare.com> Date: Fri, 07 Feb 2020 11:56:38 +0100 Message-ID: <87y2temzrt.fsf@cloudflare.com> MIME-Version: 1.0 Content-Type: text/plain Sender: netdev-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: netdev@vger.kernel.org On Fri, Feb 07, 2020 at 11:37 AM CET, Lorenz Bauer wrote: > It's currently possible to insert sockets in unexpected states into > a sockmap, due to a TOCTTOU when updating the map from a syscall. > sock_map_update_elem checks that sk->sk_state == TCP_ESTABLISHED, > locks the socket and then calls sock_map_update_common. At this > point, the socket may have transitioned into another state, and > the earlier assumptions don't hold anymore. Crucially, it's > conceivable (though very unlikely) that a socket has become unhashed. > This breaks the sockmap's assumption that it will get a callback > via sk->sk_prot->unhash. > > Fix this by checking the (fixed) sk_type and sk_protocol without the > lock, followed by a locked check of sk_state. > > Unfortunately it's not possible to push the check down into > sock_(map|hash)_update_common, since BPF_SOCK_OPS_PASSIVE_ESTABLISHED_CB > run before the socket has transitioned from TCP_SYN_RECV into > TCP_ESTABLISHED. > > Signed-off-by: Lorenz Bauer > Fixes: 604326b41a6f ("bpf, sockmap: convert to generic sk_msg interface") > --- > net/core/sock_map.c | 16 ++++++++++------ > 1 file changed, 10 insertions(+), 6 deletions(-) > > diff --git a/net/core/sock_map.c b/net/core/sock_map.c > index 8998e356f423..36a2433e183f 100644 > --- a/net/core/sock_map.c > +++ b/net/core/sock_map.c > @@ -416,14 +416,16 @@ static int sock_map_update_elem(struct bpf_map *map, void *key, > ret = -EINVAL; > goto out; > } > - if (!sock_map_sk_is_suitable(sk) || > - sk->sk_state != TCP_ESTABLISHED) { > + if (!sock_map_sk_is_suitable(sk)) { > ret = -EOPNOTSUPP; > goto out; > } > > sock_map_sk_acquire(sk); > - ret = sock_map_update_common(map, idx, sk, flags); > + if (sk->sk_state != TCP_ESTABLISHED) > + ret = -EOPNOTSUPP; > + else > + ret = sock_map_update_common(map, idx, sk, flags); > sock_map_sk_release(sk); > out: > fput(sock->file); > @@ -739,14 +741,16 @@ static int sock_hash_update_elem(struct bpf_map *map, void *key, > ret = -EINVAL; > goto out; > } > - if (!sock_map_sk_is_suitable(sk) || > - sk->sk_state != TCP_ESTABLISHED) { > + if (!sock_map_sk_is_suitable(sk)) { > ret = -EOPNOTSUPP; > goto out; > } > > sock_map_sk_acquire(sk); > - ret = sock_hash_update_common(map, key, sk, flags); > + if (sk->sk_state != TCP_ESTABLISHED) > + ret = -EOPNOTSUPP; > + else > + ret = sock_hash_update_common(map, key, sk, flags); > sock_map_sk_release(sk); > out: > fput(sock->file); > -- > 2.20.1 Thanks for fixing this, Lorenz. I'll adapt socket state checks on update in "Extend SOCKMAP to store listening sockets" series accordingly. Reviewed-by: Jakub Sitnicki