From: Yonghong Song
CC: Alexei Starovoitov, Andrii Nakryiko, Daniel Borkmann, KP Singh,
    Martin KaFai Lau, Tejun Heo
Subject: [PATCH bpf-next v4 0/7] bpf: Implement cgroup local storage available to non-cgroup-attached bpf progs
Date: Sun, 23 Oct 2022 11:05:14 -0700
Message-ID: <20221023180514.2857498-1-yhs@fb.com>
X-Mailing-List: bpf@vger.kernel.org

There already exists a local storage implementation for cgroup-attached
bpf programs. See map type BPF_MAP_TYPE_CGROUP_STORAGE and helper
bpf_get_local_storage(). But there are use cases where non-cgroup-attached
bpf progs want to access cgroup local storage data. For example, a tc
egress prog has access to sk and cgroup. It is possible to use sk local
storage to emulate cgroup local storage by storing data in the socket,
but this is wasteful as there could be lots of sockets belonging to a
particular cgroup. Alternatively, a separate map can be created with the
cgroup id as the key, but this introduces additional overhead to
manipulate the new map. A cgroup local storage, similar to the existing
sk/inode/task storage, should help for this use case.

This patch set implements a new cgroup local storage available to
non-cgroup-attached bpf programs.

In the patch series, Patches 1 and 2 are preparation patches. Patch 3
implemented new cgroup local storage kernel support. Patches 4 and 5
implemented libbpf and bpftool support. Patch 6 added two tests to
validate kernel/libbpf implementations.
Patch 7 added documentation for the new BPF_MAP_TYPE_CGRP_STORAGE map type
and a comparison of the old and new cgroup local storage maps.

Changelogs:
  v3 -> v4:
    . fix a config guarding problem in kernel/cgroup/cgroup.c when
      cgrp_storage is deleted (CONFIG_CGROUP_BPF => CONFIG_BPF_SYSCALL).
    . rename selftest from cgroup_local_storage.c to cgrp_local_storage.c
      so the name can better align with the map name.
    . fix a few misspellings.
  v2 -> v3:
    . fix a config-caused kernel test complaint.
    . better description/comments in uapi bpf.h and bpf_cgrp_storage.c.
    . factor code for better reuse for map_alloc/map_free.
    . improved explanation in map documentation.
  v1 -> v2:
    . change map name from BPF_MAP_TYPE_CGROUP_LOCAL_STORAGE to
      BPF_MAP_TYPE_CGRP_STORAGE.
    . removed support of sleepable programs.
    . changed the place of freeing cgrp local storage from
      put_css_set_locked() to css_free_rwork_fn().
    . added map documentation.

Yonghong Song (7):
  bpf: Make struct cgroup btf id global
  bpf: Refactor inode/task/sk storage map_{alloc,free}() for reuse
  bpf: Implement cgroup storage available to non-cgroup-attached bpf
    progs
  libbpf: Support new cgroup local storage
  bpftool: Support new cgroup local storage
  selftests/bpf: Add selftests for new cgroup local storage
  docs/bpf: Add documentation for new cgroup local storage

 Documentation/bpf/map_cgrp_storage.rst        | 109 +++++++
 include/linux/bpf.h                           |   3 +
 include/linux/bpf_local_storage.h             |  11 +-
 include/linux/bpf_types.h                     |   1 +
 include/linux/btf_ids.h                       |   1 +
 include/linux/cgroup-defs.h                   |   4 +
 include/uapi/linux/bpf.h                      |  50 +++-
 kernel/bpf/Makefile                           |   2 +-
 kernel/bpf/bpf_cgrp_storage.c                 | 268 ++++++++++++++++++
 kernel/bpf/bpf_inode_storage.c                |  15 +-
 kernel/bpf/bpf_local_storage.c                |  39 ++-
 kernel/bpf/bpf_task_storage.c                 |  15 +-
 kernel/bpf/cgroup_iter.c                      |   2 +-
 kernel/bpf/helpers.c                          |   6 +
 kernel/bpf/syscall.c                          |   3 +-
 kernel/bpf/verifier.c                         |  13 +-
 kernel/cgroup/cgroup.c                        |   4 +
 kernel/trace/bpf_trace.c                      |   4 +
 net/core/bpf_sk_storage.c                     |  15 +-
 scripts/bpf_doc.py                            |   2 +
 .../bpf/bpftool/Documentation/bpftool-map.rst |   2 +-
 tools/bpf/bpftool/map.c                       |   2 +-
 tools/include/uapi/linux/bpf.h                |  50 +++-
 tools/lib/bpf/libbpf.c                        |   1 +
 tools/lib/bpf/libbpf_probes.c                 |   1 +
 .../bpf/prog_tests/cgrp_local_storage.c       |  92 ++++++
 .../selftests/bpf/progs/cgrp_local_storage.c  |  88 ++++++
 .../selftests/bpf/progs/cgrp_ls_recursion.c   |  70 +++++
 28 files changed, 813 insertions(+), 60 deletions(-)
 create mode 100644 Documentation/bpf/map_cgrp_storage.rst
 create mode 100644 kernel/bpf/bpf_cgrp_storage.c
 create mode 100644 tools/testing/selftests/bpf/prog_tests/cgrp_local_storage.c
 create mode 100644 tools/testing/selftests/bpf/progs/cgrp_local_storage.c
 create mode 100644 tools/testing/selftests/bpf/progs/cgrp_ls_recursion.c

-- 
2.30.2
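
Illustration of the motivation in the cover letter: the cost of emulating
cgroup local storage with sk local storage can be seen in a small userspace
model. This is only a sketch, not kernel or BPF code. struct sock_model,
slots_per_socket() and slots_per_cgroup() are hypothetical names invented
for this example; they do not exist in the kernel or in libbpf. The model
just counts how many storage slots each approach would need for a given
set of sockets.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical userspace model of a socket; not a kernel type. */
struct sock_model {
	unsigned long cgroup_id;	/* cgroup this socket belongs to */
};

/*
 * Emulating cgroup storage with sk local storage: every socket carries
 * its own copy of the data, so the slot count equals the socket count.
 */
static size_t slots_per_socket(const struct sock_model *socks, size_t n)
{
	assert(socks != NULL || n == 0);
	return n;
}

/*
 * Storage attached to the cgroup itself: one slot per distinct cgroup,
 * regardless of how many sockets belong to it. The quadratic scan keeps
 * the model short; it is not meant to reflect any kernel data structure.
 */
static size_t slots_per_cgroup(const struct sock_model *socks, size_t n)
{
	size_t distinct = 0;

	assert(socks != NULL || n == 0);
	for (size_t i = 0; i < n; i++) {
		size_t j;

		/* Look for an earlier socket in the same cgroup. */
		for (j = 0; j < i; j++)
			if (socks[j].cgroup_id == socks[i].cgroup_id)
				break;
		if (j == i)	/* first socket seen for this cgroup */
			distinct++;
	}
	return distinct;
}
```

With six sockets spread over three cgroups, slots_per_socket() reports six
slots while slots_per_cgroup() reports three. The gap grows with the number
of sockets per cgroup, which is the waste the cover letter describes.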