From: Kui-Feng Lee <sinquersw@gmail.com>
To: Martin KaFai Lau <martin.lau@linux.dev>, thinker.li@gmail.com
Cc: kuifeng@meta.com, bpf@vger.kernel.org, ast@kernel.org,
song@kernel.org, kernel-team@meta.com, andrii@kernel.org,
drosen@google.com
Subject: Re: [PATCH bpf-next v5 3/9] bpf: hold module for bpf_struct_ops_map.
Date: Fri, 20 Oct 2023 15:28:26 -0700 [thread overview]
Message-ID: <2e07e20b-cb1a-409e-a0fc-4b5ee55e0cbe@gmail.com> (raw)
In-Reply-To: <047bbde0-eb9c-7785-349a-d241c1623fab@linux.dev>
On 10/20/23 14:37, Martin KaFai Lau wrote:
> On 10/19/23 10:07 PM, Kui-Feng Lee wrote:
>>
>>
>> On 10/19/23 09:29, Kui-Feng Lee wrote:
>>>
>>>
>>> On 10/18/23 17:36, Martin KaFai Lau wrote:
>>>> On 10/17/23 9:23 AM, thinker.li@gmail.com wrote:
>>>
>>>>
>>>>> }
>>>>> void bpf_struct_ops_init(struct btf *btf, struct bpf_verifier_log
>>>>> *log)
>>>>> @@ -215,7 +218,7 @@ void bpf_struct_ops_init(struct btf *btf,
>>>>> struct bpf_verifier_log *log)
>>>>> for (i = 0; i < ARRAY_SIZE(bpf_struct_ops); i++) {
>>>>> st_ops = bpf_struct_ops[i];
>>>>> - bpf_struct_ops_init_one(st_ops, btf, log);
>>>>> + bpf_struct_ops_init_one(st_ops, btf, NULL, log);
>>>>> }
>>>>> }
>>>>> @@ -630,6 +633,7 @@ static void __bpf_struct_ops_map_free(struct
>>>>> bpf_map *map)
>>>>> bpf_jit_uncharge_modmem(PAGE_SIZE);
>>>>> }
>>>>> bpf_map_area_free(st_map->uvalue);
>>>>> + module_put(st_map->st_ops->owner);
>>>>> bpf_map_area_free(st_map);
>>>>> }
>>>>> @@ -676,9 +680,18 @@ static struct bpf_map
>>>>> *bpf_struct_ops_map_alloc(union bpf_attr *attr)
>>>>> if (!st_ops)
>>>>> return ERR_PTR(-ENOTSUPP);
>>>>> + /* If st_ops->owner is NULL, it means the struct_ops is
>>>>> + * statically defined in the kernel. We don't need to
>>>>> + * take a refcount on it.
>>>>> + */
>>>>> + if (st_ops->owner && !btf_try_get_module(st_ops->btf))
>
> While replying and looking at it again, I don't think the
> btf_try_get_module(st_ops->btf) is safe. The module's owned st_ops
> itself could have been gone with the module. The same goes with the
> "st_ops->owner" test, so btf_is_module(btf) should be used instead.
I have change it locally. Here, it calls btf_try_get_module() after
calling btf_struct_ops_find_value(). The new code will call
btf_try_get_module() against the btf from attr->value_type_btf_obj_fd
before btf_struct_ops_find_value(). Just like I mentioned earlier to
ensure the callers of btf_struct_ops_find_value() and
btf_struct_ops_find() hold a refcount to the module.
>
> I am risking to act like a broken clock to repeat this question, does it
> really need to store btf back into the st_ops which may accidentally get
> into the above btf_try_get_module(st_ops->btf) usage?
>
>>>>
>>>> This just came to my mind. Is the module refcnt needed during map
>>>> alloc/free or it could be done during the reg/unreg instead?
>>>
>>>
>>> Sure, I can move it to reg/unreg.
>>
>> Just found that we relies type information in st_ops to update element
>> and clean up maps.
>> We can not move get/put modules to reg/unreg except keeping a
>> redundant copy in
>> st_map or somewhere. It make the code much more complicated by
>> introducing get/put module here and there.
>>
>> I prefer to keep as it is now. WDYT?
>
> Yeah, sure. I was asking after seeing a longer wait time for the module
> to go away in patch 11 selftest and requires an explicit waiting for the
> tasks_trace period. Releasing the module refcnt earlier will help.
>
> Regardless of the module refcnt hold/free location, I think storing the
> type* and value* in the module's owned st_ops does not look correct now.
> It was fine and convenient to piggy back them into bpf_struct_ops when
> everything was built-in the kernel and no lifetime concern. It makes
> sense now to separate them out from the module's owned st_ops. Something
> like:
>
> struct btf_struct_ops_desc {
> struct bpf_struct_ops *ops;
> const struct btf_type *type;
> const struct btf_type *value_type;
> u32 type_id;
> u32 value_id;
> };
>
> struct btf_struct_ops_tab {
> u32 cnt;
> u32 capacity;
> struct btf_struct_ops_desc *st_ops_desc[];
> };
>
> wdyt?
So, st_map should hold a pointer to a bpf_struct_ops_desc instead of
st_ops, right? It would work!
next prev parent reply other threads:[~2023-10-20 22:28 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-10-17 16:22 [PATCH bpf-next v5 0/9] Registrating struct_ops types from modules thinker.li
2023-10-17 16:22 ` [PATCH bpf-next v5 1/9] bpf: refactory struct_ops type initialization to a function thinker.li
2023-10-17 16:22 ` [PATCH bpf-next v5 2/9] bpf: add struct_ops_tab to btf thinker.li
2023-10-19 0:00 ` Martin KaFai Lau
2023-10-19 0:33 ` Kui-Feng Lee
2023-10-19 2:28 ` Martin KaFai Lau
2023-10-19 16:15 ` Kui-Feng Lee
2023-10-17 16:23 ` [PATCH bpf-next v5 3/9] bpf: hold module for bpf_struct_ops_map thinker.li
2023-10-19 0:36 ` Martin KaFai Lau
2023-10-19 16:29 ` Kui-Feng Lee
2023-10-20 5:07 ` Kui-Feng Lee
2023-10-20 21:37 ` Martin KaFai Lau
2023-10-20 22:28 ` Kui-Feng Lee [this message]
2023-10-17 16:23 ` [PATCH bpf-next v5 4/9] bpf: validate value_type thinker.li
2023-10-17 16:23 ` [PATCH bpf-next v5 5/9] bpf: pass attached BTF to the bpf_struct_ops subsystem thinker.li
2023-10-17 16:23 ` [PATCH bpf-next v5 6/9] bpf, net: switch to dynamic registration thinker.li
2023-10-19 1:49 ` Martin KaFai Lau
2023-10-20 15:12 ` Kui-Feng Lee
2023-10-20 17:53 ` Kui-Feng Lee
2023-10-17 16:23 ` [PATCH bpf-next v5 7/9] libbpf: Find correct module BTFs for struct_ops maps and progs thinker.li
2023-10-17 21:49 ` Andrii Nakryiko
2023-10-18 2:25 ` Kui-Feng Lee
2023-10-19 2:43 ` Martin KaFai Lau
2023-10-19 16:31 ` Kui-Feng Lee
2023-10-17 16:23 ` [PATCH bpf-next v5 8/9] bpf: export btf_ctx_access to modules thinker.li
2023-10-17 16:23 ` [PATCH bpf-next v5 9/9] selftests/bpf: test case for register_bpf_struct_ops() thinker.li
2023-10-17 18:03 ` kernel test robot
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=2e07e20b-cb1a-409e-a0fc-4b5ee55e0cbe@gmail.com \
--to=sinquersw@gmail.com \
--cc=andrii@kernel.org \
--cc=ast@kernel.org \
--cc=bpf@vger.kernel.org \
--cc=drosen@google.com \
--cc=kernel-team@meta.com \
--cc=kuifeng@meta.com \
--cc=martin.lau@linux.dev \
--cc=song@kernel.org \
--cc=thinker.li@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.