From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <bpf-owner@kernel.org>
X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on
	aws-us-west-2-korg-lkml-1.web.codeaurora.org
Received: from vger.kernel.org (vger.kernel.org [23.128.96.18])
	by smtp.lore.kernel.org (Postfix) with ESMTP id E0949C433EF
	for <bpf@archiver.kernel.org>; Tue, 22 Mar 2022 07:34:36 +0000 (UTC)
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S237602AbiCVHf5 (ORCPT <rfc822;bpf@archiver.kernel.org>);
        Tue, 22 Mar 2022 03:35:57 -0400
Received: from lindbergh.monkeyblade.net ([23.128.96.19]:48898 "EHLO
        lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
        with ESMTP id S237605AbiCVHfk (ORCPT <rfc822;bpf@vger.kernel.org>);
        Tue, 22 Mar 2022 03:35:40 -0400
Received: from mail-pj1-x1042.google.com (mail-pj1-x1042.google.com [IPv6:2607:f8b0:4864:20::1042])
        by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C566B27B00
        for <bpf@vger.kernel.org>; Tue, 22 Mar 2022 00:34:04 -0700 (PDT)
Received: by mail-pj1-x1042.google.com with SMTP id bx5so14948011pjb.3
        for <bpf@vger.kernel.org>; Tue, 22 Mar 2022 00:34:04 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=gmail.com; s=20210112;
        h=date:from:to:cc:subject:message-id:references:mime-version
         :content-disposition:in-reply-to;
        bh=aPcF/jB/6DVYUhyCr7j+T3pwNZsN+SKsKI+lV+HI4dE=;
        b=JV3Wuq9yQ6mV6Vd7YiVj97Y5nCeTYV4XRdoKkm38yNz4vd1E9wlintPDGC3EftfOlZ
         UcbqGoHwe8k4n4GR87W/jHEloxCdChTvkzQd+ddUAde4wSLtuVaQO6Atf+ZR6pujNqjK
         j8IYFVLV9hO4jINqwQE7iGWEQGHMsVox8idhX6pV/jfQzekZJ8Or0LHVB7kz9BnSkmz1
         Xog2R+WD1Ra6rg7fecO+BWozVQVJ5zcnLnMwon+PZ4iJLP49SNzKdpSb61oxPo6WgHy4
         hBZZi7aekBWqH+1+9Lm57dUHeaDL6fAWlhESVkaD1T7FP6ipMKFGdcfTqCMHihovN03v
         m3Xg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=1e100.net; s=20210112;
        h=x-gm-message-state:date:from:to:cc:subject:message-id:references
         :mime-version:content-disposition:in-reply-to;
        bh=aPcF/jB/6DVYUhyCr7j+T3pwNZsN+SKsKI+lV+HI4dE=;
        b=lM1LmFg3UgmV01MpC+BLb70SnEMPQjQ7pD8cvKVKLshIVH1YK1G3N1fFZTUkTC4liv
         oWyAy1h/28Qb7kqD3ES7VpQUw1lYHjQzhdlxqDgCSft+s4xlDN/HA2PGizPjG3KrOh8c
         KsEIpRQ7ovSjsF2fgf+fj9Mu+Swbv1cdxsBLNTuwysquhcRlPtXcrafMT9smC/3/8SuT
         8hnls+RT9siRO8A2xrI8Fb2uue8Ye2Mi1ZTWi8rF1JVmHEpgPFccIQJw/yY42iNeocya
         p2fVmys7frR5sc3bGceltJYjvsJnKxV+X/StbTI0ytp27heMDUrsC48N9cjflVt8RZ2o
         tw9w==
X-Gm-Message-State: AOAM532HS4PSWRS1aVajSorIvNTkwvXHuiuq6iM4PkHuXUfEn4apZTlj
        kvtluAdNoCS4YonF5NrN0A0=
X-Google-Smtp-Source: ABdhPJxb3tIsrjHx5MzdwYW6T3mv/juZoWKJT/kDJ2SwAeDUXfB32xo74F7MOUkJjAU3Ga0HLAqU7A==
X-Received: by 2002:a17:902:6944:b0:153:9866:7fea with SMTP id k4-20020a170902694400b0015398667feamr16886090plt.6.1647934444223;
        Tue, 22 Mar 2022 00:34:04 -0700 (PDT)
Received: from localhost ([14.139.187.71])
        by smtp.gmail.com with ESMTPSA id u22-20020a056a00125600b004fa3aec7f89sm17425196pfi.75.2022.03.22.00.34.03
        (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256);
        Tue, 22 Mar 2022 00:34:03 -0700 (PDT)
Date:   Tue, 22 Mar 2022 13:04:01 +0530
From:   Kumar Kartikeya Dwivedi <memxor@gmail.com>
To:     Joanne Koong <joannelkoong@gmail.com>
Cc:     bpf <bpf@vger.kernel.org>, Alexei Starovoitov <ast@kernel.org>,
        Andrii Nakryiko <andrii@kernel.org>,
        Daniel Borkmann <daniel@iogearbox.net>,
        Toke =?utf-8?Q?H=C3=B8iland-J=C3=B8rgensen?= <toke@redhat.com>,
        Jesper Dangaard Brouer <brouer@redhat.com>
Subject: Re: [PATCH bpf-next v3 04/13] bpf: Indicate argument that will be
 released in bpf_func_proto
Message-ID: <20220322073401.4z3k2vvc5facomfs@apollo>
References: <20220320155510.671497-1-memxor@gmail.com>
 <20220320155510.671497-5-memxor@gmail.com>
 <CAJnrk1amRmvAWSqDzeC=sOVL7_iTqbC8z0B7oXXSni9xqsK_Ww@mail.gmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <CAJnrk1amRmvAWSqDzeC=sOVL7_iTqbC8z0B7oXXSni9xqsK_Ww@mail.gmail.com>
Precedence: bulk
List-ID: <bpf.vger.kernel.org>
X-Mailing-List: bpf@vger.kernel.org

On Tue, Mar 22, 2022 at 07:17:40AM IST, Joanne Koong wrote:
> On Sun, Mar 20, 2022 at 6:34 PM Kumar Kartikeya Dwivedi
> <memxor@gmail.com> wrote:
> >
> > Add a few fields for each arg (argN_release) that when set to true,
> > tells verifier that for a release function, that argument's register
> > will be the one for which meta.ref_obj_id will be set, and which will
> > then be released using release_reference. To capture the regno,
> > introduce a release_regno field in bpf_call_arg_meta.
> >
> > This would be required in the next patch, where we may either pass NULL
> > or a refcounted pointer as an argument to the release function
> > bpf_kptr_xchg. Just releasing only when meta.ref_obj_id is set is not
> > enough, as there is a case where the type of argument needed matches,
> > but the ref_obj_id is set to 0. Hence, we must enforce that whenever
> > meta.ref_obj_id is zero, the register that is to be released can only
> > be NULL for a release function.
> >
> > Signed-off-by: Kumar Kartikeya Dwivedi <memxor@gmail.com>
> > ---
> >  include/linux/bpf.h   | 10 ++++++++++
> >  kernel/bpf/ringbuf.c  |  2 ++
> >  kernel/bpf/verifier.c | 39 +++++++++++++++++++++++++++++++++------
> >  net/core/filter.c     |  1 +
> >  4 files changed, 46 insertions(+), 6 deletions(-)
> >
> > diff --git a/include/linux/bpf.h b/include/linux/bpf.h
> > index f35920d279dd..48ddde854d67 100644
> > --- a/include/linux/bpf.h
> > +++ b/include/linux/bpf.h
> > @@ -487,6 +487,16 @@ struct bpf_func_proto {
> >                 };
> >                 u32 *arg_btf_id[5];
> >         };
> > +       union {
> > +               struct {
> > +                       bool arg1_release;
> > +                       bool arg2_release;
> > +                       bool arg3_release;
> > +                       bool arg4_release;
> > +                       bool arg5_release;
> > +               };
> > +               bool arg_release[5];
> > +       };
>
> Instead of having the new fields "argx_release" for each arg, what are
> your thoughts on using PTR_RELEASE as an "enum bpf_type_flag" to the
> existing "argx_type" field? For example, instead of
>
>      .arg1_type      = ARG_PTR_TO_ALLOC_MEM,
>      .arg1_release   = true,
>
> could we do something like
>
>      .arg1_type      = ARG_PTR_TO_ALLOC_MEM | PTR_RELEASE
>
> In the verifier, we could determine whether an argument register
> releases a reference by checking whether this PTR_RELEASE flag is set.
>
> Would this be a little cleaner? Curious to hear your thoughts.
>

I don't dislike it, it's just a little more work to make sure we don't have it
set for arg_type in places where it isn't expected, so it would need some
inspection of existing code. It's certainly a bit better than having five bools.

I guess I'll try it out and see.

>
> >         int *ret_btf_id; /* return value btf_id */
> >         bool (*allowed)(const struct bpf_prog *prog);
> >  };
> > diff --git a/kernel/bpf/ringbuf.c b/kernel/bpf/ringbuf.c
> > index 710ba9de12ce..f40ce718630e 100644
> > --- a/kernel/bpf/ringbuf.c
> > +++ b/kernel/bpf/ringbuf.c
> > @@ -405,6 +405,7 @@ const struct bpf_func_proto bpf_ringbuf_submit_proto = {
> >         .func           = bpf_ringbuf_submit,
> >         .ret_type       = RET_VOID,
> >         .arg1_type      = ARG_PTR_TO_ALLOC_MEM,
> > +       .arg1_release   = true,
> >         .arg2_type      = ARG_ANYTHING,
> >  };
> >
> > @@ -418,6 +419,7 @@ const struct bpf_func_proto bpf_ringbuf_discard_proto = {
> >         .func           = bpf_ringbuf_discard,
> >         .ret_type       = RET_VOID,
> >         .arg1_type      = ARG_PTR_TO_ALLOC_MEM,
> > +       .arg1_release   = true,
> >         .arg2_type      = ARG_ANYTHING,
> >  };
> >
> > diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
> > index 744b7362e52e..b8cd34607215 100644
> > --- a/kernel/bpf/verifier.c
> > +++ b/kernel/bpf/verifier.c
> > @@ -245,6 +245,7 @@ struct bpf_call_arg_meta {
> >         struct bpf_map *map_ptr;
> >         bool raw_mode;
> >         bool pkt_access;
> > +       u8 release_regno;
> >         int regno;
> >         int access_size;
> >         int mem_size;
> > @@ -6101,12 +6102,31 @@ static bool check_btf_id_ok(const struct bpf_func_proto *fn)
> >         return true;
> >  }
> >
> > -static int check_func_proto(const struct bpf_func_proto *fn, int func_id)
> > +static bool check_release_regno(const struct bpf_func_proto *fn, int func_id,
> > +                               struct bpf_call_arg_meta *meta)
> > +{
> > +       int i;
> > +
> > +       for (i = 0; i < ARRAY_SIZE(fn->arg_release); i++) {
> > +               if (fn->arg_release[i]) {
> > +                       if (!is_release_function(func_id))
> > +                               return false;
> > +                       if (meta->release_regno)
> > +                               return false;
> > +                       meta->release_regno = i + 1;
> > +               }
> > +       }
> > +       return !is_release_function(func_id) || meta->release_regno;
> > +}
> > +
> > +static int check_func_proto(const struct bpf_func_proto *fn, int func_id,
> > +                           struct bpf_call_arg_meta *meta)
> >  {
> >         return check_raw_mode_ok(fn) &&
> >                check_arg_pair_ok(fn) &&
> >                check_btf_id_ok(fn) &&
> > -              check_refcount_ok(fn, func_id) ? 0 : -EINVAL;
> > +              check_refcount_ok(fn, func_id) &&
> > +              check_release_regno(fn, func_id, meta) ? 0 : -EINVAL;
> >  }
> >
> >  /* Packet data might have moved, any old PTR_TO_PACKET[_META,_END]
> > @@ -6785,7 +6805,7 @@ static int check_helper_call(struct bpf_verifier_env *env, struct bpf_insn *insn
> >         memset(&meta, 0, sizeof(meta));
> >         meta.pkt_access = fn->pkt_access;
> >
> > -       err = check_func_proto(fn, func_id);
> > +       err = check_func_proto(fn, func_id, &meta);
> >         if (err) {
> >                 verbose(env, "kernel subsystem misconfigured func %s#%d\n",
> >                         func_id_name(func_id), func_id);
> > @@ -6818,8 +6838,17 @@ static int check_helper_call(struct bpf_verifier_env *env, struct bpf_insn *insn
> >                         return err;
> >         }
> >
> > +       regs = cur_regs(env);
> > +
> >         if (is_release_function(func_id)) {
> > -               err = release_reference(env, meta.ref_obj_id);
> > +               err = -EINVAL;
> > +               if (meta.ref_obj_id)
> > +                       err = release_reference(env, meta.ref_obj_id);
> > +               /* meta.ref_obj_id can only be 0 if register that is meant to be
> > +                * released is NULL, which must be > R0.
> > +                */
> > +               else if (meta.release_regno && register_is_null(&regs[meta.release_regno]))
> > +                       err = 0;
>
> If I'm understanding this correctly, in this patch we will call
> check_release_regno on every function to determine if any / which of
> the argument registers release a reference. Given that in the majority
> of cases the function will not be a release function, what are your
> thoughts on moving that check to be within the scope of this if
> function? So if it is a release function, and meta.ref_obj_id is not
> set, then we do the checking for which argument register is a release
> register and whether that register is null. Curious to hear your
> thoughts.
>

The suggestion looks nice, as it saves a lot of work, but my preference was to
error when the bpf_func_proto fields are incorrect (more than one arg has
argN_release == true). In this case we can still detect such a case, but it is
behind 'if (is_release_function(...))', so it wouldn't catch incorrect
bpf_func_proto of non-release functions.

So whether to do it your way would depend on whether it is considered valuable
(or defensive programming) to detect badly set up bpf_func_proto or not (we
already do it for some other cases, so it's nothing new), particularly for this
case.

>
> >                 if (err) {
> >                         verbose(env, "func %s#%d reference has not been acquired before\n",
> >                                 func_id_name(func_id), func_id);
> > @@ -6827,8 +6856,6 @@ static int check_helper_call(struct bpf_verifier_env *env, struct bpf_insn *insn
> >                 }
> >         }
> >
> > -       regs = cur_regs(env);
> > -
> >         switch (func_id) {
> >         case BPF_FUNC_tail_call:
> >                 err = check_reference_leak(env);
> > diff --git a/net/core/filter.c b/net/core/filter.c
> > index 03655f2074ae..17eff4731b06 100644
> > --- a/net/core/filter.c
> > +++ b/net/core/filter.c
> > @@ -6622,6 +6622,7 @@ static const struct bpf_func_proto bpf_sk_release_proto = {
> >         .gpl_only       = false,
> >         .ret_type       = RET_INTEGER,
> >         .arg1_type      = ARG_PTR_TO_BTF_ID_SOCK_COMMON,
> > +       .arg1_release   = true,
> >  };
> >
> >  BPF_CALL_5(bpf_xdp_sk_lookup_udp, struct xdp_buff *, ctx,
> > --
> > 2.35.1
> >

--
Kartikeya