public inbox for bpf@vger.kernel.org
 help / color / mirror / Atom feed
From: Kui-Feng Lee <sinquersw@gmail.com>
To: Andrii Nakryiko <andrii.nakryiko@gmail.com>,
	Jordan Rome <linux@jordanrome.com>
Cc: bpf@vger.kernel.org, Alexei Starovoitov <ast@kernel.org>,
	Daniel Borkmann <daniel@iogearbox.net>,
	Andrii Nakryiko <andrii@kernel.org>,
	Martin KaFai Lau <martin.lau@kernel.org>,
	Kernel Team <kernel-team@fb.com>
Subject: Re: [bpf-next v5 1/2] bpf: Add bpf_copy_from_user_str kfunc
Date: Fri, 16 Aug 2024 00:23:14 -0700	[thread overview]
Message-ID: <6ddc8fda-3fcd-4e5f-8a0c-475323b08de9@gmail.com> (raw)
In-Reply-To: <CAEf4Bzb+W2PyvUuHixc+mTTt73zTCYBBpBwtoYmTtv++rxd4+g@mail.gmail.com>



On 8/15/24 15:38, Andrii Nakryiko wrote:
> On Thu, Aug 15, 2024 at 4:28 AM Jordan Rome <linux@jordanrome.com> wrote:
>>
>> This adds a kfunc wrapper around strncpy_from_user,
>> which can be called from sleepable BPF programs.
>>
>> This matches the non-sleepable 'bpf_probe_read_user_str'
>> helper except it includes an additional 'flags'
>> param, which allows consumers to clear the entire
>> destination buffer on success.
>>
>> Signed-off-by: Jordan Rome <linux@jordanrome.com>
>> ---
>>   include/uapi/linux/bpf.h       |  8 +++++++
>>   kernel/bpf/helpers.c           | 41 ++++++++++++++++++++++++++++++++++
>>   tools/include/uapi/linux/bpf.h |  8 +++++++
>>   3 files changed, 57 insertions(+)
>>
>> diff --git a/include/uapi/linux/bpf.h b/include/uapi/linux/bpf.h
>> index e05b39e39c3f..e207175981be 100644
>> --- a/include/uapi/linux/bpf.h
>> +++ b/include/uapi/linux/bpf.h
>> @@ -7513,4 +7513,12 @@ struct bpf_iter_num {
>>          __u64 __opaque[1];
>>   } __attribute__((aligned(8)));
>>
>> +/*
>> + * Flags to control bpf_copy_from_user_str() behaviour.
>> + *     - BPF_ZERO_BUFFER: Memset 0 the tail of the destination buffer on success
>> + */
>> +enum {
>> +       BPF_ZERO_BUFFER = (1ULL << 0)
> 
> We call all flags BPF_F_<something>, so let's stay consistent.
> 
> And just for a bit of bikeshedding, "zero buffer" isn't immediately
> clear and it would be nice to have a clearer verb in there. I don't
> have a perfect name, but something like BPF_F_PAD_ZEROS or something
> with "pad" maybe?
> 
> Also, should we keep behavior a bit more consistent and say that on
> failure this flag will also ensure that buffer is cleared?
> 
>> +};
>> +
>>   #endif /* _UAPI__LINUX_BPF_H__ */
>> diff --git a/kernel/bpf/helpers.c b/kernel/bpf/helpers.c
>> index d02ae323996b..fe4348679d38 100644
>> --- a/kernel/bpf/helpers.c
>> +++ b/kernel/bpf/helpers.c
>> @@ -2939,6 +2939,46 @@ __bpf_kfunc void bpf_iter_bits_destroy(struct bpf_iter_bits *it)
>>          bpf_mem_free(&bpf_global_ma, kit->bits);
>>   }
>>
>> +/**
>> + * bpf_copy_from_user_str() - Copy a string from an unsafe user address
>> + * @dst:             Destination address, in kernel space.  This buffer must be at
>> + *                   least @dst__szk bytes long.
>> + * @dst__szk:        Maximum number of bytes to copy, including the trailing NUL.
>> + * @unsafe_ptr__ign: Source address, in user space.
>> + * @flags:           The only supported flag is BPF_ZERO_BUFFER
>> + *
>> + * Copies a NUL-terminated string from userspace to BPF space. If user string is
>> + * too long this will still ensure zero termination in the dst buffer unless
>> + * buffer size is 0.
>> + *
>> + * If BPF_ZERO_BUFFER flag is set, memset the tail of @dst to 0 on success.
>> + */
>> +__bpf_kfunc int bpf_copy_from_user_str(void *dst, u32 dst__szk, const void __user *unsafe_ptr__ign, u64 flags)
>> +{
>> +       int ret;
>> +       int count;
>> +
> 
> validate that flags doesn't have any unknown flags
> 
> if (unlikely(flags & ~BPF_F_ZERO_BUFFER))
>      return -EINVAL;
> 
>> +       if (unlikely(!dst__szk))
>> +               return 0;
>> +
>> +       count = dst__szk - 1;
>> +       if (unlikely(!count)) {
>> +               ((char *)dst)[0] = '\0';
>> +               return 1;
>> +       }
> 
> Do we need to special-case this unlikely scenario? Especially that
> it's unlikely, why write code for it and pay a tiny price for an extra
> check?
> 
>> +
>> +       ret = strncpy_from_user(dst, unsafe_ptr__ign, count);
>> +       if (ret >= 0) {
>> +               if (flags & BPF_ZERO_BUFFER)
>> +                       memset((char *)dst + ret, 0, dst__szk - ret);
>> +               else
>> +                       ((char *)dst)[ret] = '\0';
>> +               ret++;
> 
> so if string is truncated, ret == count, no? And dst[ret] will go
> beyond the buffer?

Since count = dst__szk - 1, it is not going beyond the buffer.

> 
> we need more tests to validate all those various conditions
> 
> 
> I'd also rewrite this a bit, so it's more linear:
> 
> 
> ret = strncpy(...);
> if (ret < 0)
>      return ret;
> 
> ((char *)dst)[count - 1] = '\0';
> 
> if (flags & BPF_F_ZERO_BUF)
>        memset(...);
> 
> return ret < count ? ret + 1 : count;
> 
> 
> or something along those lines
> 
> 
> pw-bot: cr
> 
> 
>> +       }
>> +
>> +       return ret;
>> +}
>> +
>>   __bpf_kfunc_end_defs();
>>
>>   BTF_KFUNCS_START(generic_btf_ids)
>> @@ -3024,6 +3064,7 @@ BTF_ID_FLAGS(func, bpf_preempt_enable)
>>   BTF_ID_FLAGS(func, bpf_iter_bits_new, KF_ITER_NEW)
>>   BTF_ID_FLAGS(func, bpf_iter_bits_next, KF_ITER_NEXT | KF_RET_NULL)
>>   BTF_ID_FLAGS(func, bpf_iter_bits_destroy, KF_ITER_DESTROY)
>> +BTF_ID_FLAGS(func, bpf_copy_from_user_str, KF_SLEEPABLE)
>>   BTF_KFUNCS_END(common_btf_ids)
>>
>>   static const struct btf_kfunc_id_set common_kfunc_set = {
>> diff --git a/tools/include/uapi/linux/bpf.h b/tools/include/uapi/linux/bpf.h
>> index e05b39e39c3f..15c2c3431e0f 100644
>> --- a/tools/include/uapi/linux/bpf.h
>> +++ b/tools/include/uapi/linux/bpf.h
>> @@ -7513,4 +7513,12 @@ struct bpf_iter_num {
>>          __u64 __opaque[1];
>>   } __attribute__((aligned(8)));
>>
>> +/*
>> + * Flags to control bpf_copy_from_user_str() behaviour.
>> + *     - BPF_ZERO_BUFFER: Memset 0 the entire destination buffer on success
>> + */
>> +enum {
>> +       BPF_ZERO_BUFFER = (1ULL << 0)
>> +};
>> +
>>   #endif /* _UAPI__LINUX_BPF_H__ */
>> --
>> 2.43.5
>>

  reply	other threads:[~2024-08-16  7:23 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-08-15 11:27 [bpf-next v5 1/2] bpf: Add bpf_copy_from_user_str kfunc Jordan Rome
2024-08-15 11:27 ` [bpf-next v5 2/2] bpf: Add tests for " Jordan Rome
2024-08-15 22:41   ` Andrii Nakryiko
2024-08-15 22:38 ` [bpf-next v5 1/2] bpf: Add " Andrii Nakryiko
2024-08-16  7:23   ` Kui-Feng Lee [this message]
2024-08-19 16:25     ` Andrii Nakryiko

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=6ddc8fda-3fcd-4e5f-8a0c-475323b08de9@gmail.com \
    --to=sinquersw@gmail.com \
    --cc=andrii.nakryiko@gmail.com \
    --cc=andrii@kernel.org \
    --cc=ast@kernel.org \
    --cc=bpf@vger.kernel.org \
    --cc=daniel@iogearbox.net \
    --cc=kernel-team@fb.com \
    --cc=linux@jordanrome.com \
    --cc=martin.lau@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox