From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id E18B7C4332F for ; Thu, 20 Oct 2022 22:13:06 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229509AbiJTWNE (ORCPT ); Thu, 20 Oct 2022 18:13:04 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37954 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229484AbiJTWND (ORCPT ); Thu, 20 Oct 2022 18:13:03 -0400 Received: from mx0b-00082601.pphosted.com (mx0b-00082601.pphosted.com [67.231.153.30]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C0D301A5B07 for ; Thu, 20 Oct 2022 15:13:02 -0700 (PDT) Received: from pps.filterd (m0148460.ppops.net [127.0.0.1]) by mx0a-00082601.pphosted.com (8.17.1.5/8.17.1.5) with ESMTP id 29KL6mUP004244 for ; Thu, 20 Oct 2022 15:13:02 -0700 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fb.com; h=from : to : cc : subject : date : message-id : mime-version : content-transfer-encoding : content-type; s=facebook; bh=T1Goh3/P0LTiouSZuidJa3/j4/w2/ZqYmIeTFomb0Rs=; b=GOpscfG9OqVrxJmsnZ27UaMb4vJa3AVkvSriK2D6kvNQ1YSM2mUBfj5ZY61TM+yIsRa7 /d98E1PiChBoMZZbC+jR8u32idUx15QgCLu9zv6OsKEuZMiiBdZXJHvYhvMsIDCvM+cU MqDHp1qa5pd4UM4t6cHzSiymrOFOv1TNvls= Received: from maileast.thefacebook.com ([163.114.130.16]) by mx0a-00082601.pphosted.com (PPS) with ESMTPS id 3kbdy5rm7e-14 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 verify=NOT) for ; Thu, 20 Oct 2022 15:13:01 -0700 Received: from twshared25017.14.frc2.facebook.com (2620:10d:c0a8:1b::d) by mail.thefacebook.com (2620:10d:c0a8:83::7) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.31; Thu, 20 Oct 2022 15:13:00 -0700 Received: by devbig309.ftw3.facebook.com (Postfix, from userid 128203) id EC46E10F4EC1D; Thu, 20 Oct 2022 15:12:55 -0700 (PDT) From: Yonghong Song To: CC: Alexei Starovoitov , Andrii Nakryiko , Daniel Borkmann , , KP Singh , Martin KaFai Lau , Tejun Heo Subject: [PATCH bpf-next v2 0/6] bpf: Implement cgroup local storage available to non-cgroup-attached bpf progs Date: Thu, 20 Oct 2022 15:12:55 -0700 Message-ID: <20221020221255.3553649-1-yhs@fb.com> X-Mailer: git-send-email 2.30.2 MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable X-FB-Internal: Safe Content-Type: text/plain X-Proofpoint-ORIG-GUID: ijxMNZshrht72M3RCcmD2CN903cnHx5i X-Proofpoint-GUID: ijxMNZshrht72M3RCcmD2CN903cnHx5i X-Proofpoint-Virus-Version: vendor=baseguard engine=ICAP:2.0.205,Aquarius:18.0.895,Hydra:6.0.545,FMLib:17.11.122.1 definitions=2022-10-20_11,2022-10-20_01,2022-06-22_01 Precedence: bulk List-ID: X-Mailing-List: bpf@vger.kernel.org There already exists a local storage implementation for cgroup-attached bpf programs. See map type BPF_MAP_TYPE_CGROUP_STORAGE and helper bpf_get_local_storage(). But there are use cases such that non-cgroup attached bpf progs wants to access cgroup local storage data. For example= , tc egress prog has access to sk and cgroup. It is possible to use sk local storage to emulate cgroup local storage by storing data in socke= t. But this is a waste as it could be lots of sockets belonging to a particu= lar cgroup. Alternatively, a separate map can be created with cgroup id as th= e key. But this will introduce additional overhead to manipulate the new map. A cgroup local storage, similar to existing sk/inode/task storage, should help for this use case. This patch implemented new cgroup local storage available to non-cgroup-attached bpf programs. In the patch series, Patch 1 is a preparation patch. Patch 2 implemented new cgroup local storage kernel support. Patches 3 and 4 implemented libbpf and bpftool support. Patch 5 added two tests to validate kernel/libbpf implementations. Patch 6 added documentation for new BPF_MAP_TYPE_CGRP_STORAGE map type and comparison of the old and new cgroup local storage maps. Changelogs: v1 -> v2: . change map name from BPF_MAP_TYPE_CGROUP_LOCAL_STORAGE to BPF_MAP_TYPE_CGRP_STORAGE. . removed support of sleepable programs. . changed the place of freeing cgrp local storage from put_css_set_lo= cked() to css_free_rwork_fn(). . added map documentation. Yonghong Song (6): bpf: Make struct cgroup btf id global bpf: Implement cgroup storage available to non-cgroup-attached bpf progs libbpf: Support new cgroup local storage bpftool: Support new cgroup local storage selftests/bpf: Add selftests for cgroup local storage docs/bpf: Add documentation for map type BPF_MAP_TYPE_CGRP_STROAGE Documentation/bpf/map_cgrp_storage.rst | 104 +++++++ include/linux/bpf.h | 3 + include/linux/bpf_types.h | 1 + include/linux/btf_ids.h | 1 + include/linux/cgroup-defs.h | 4 + include/uapi/linux/bpf.h | 48 ++- kernel/bpf/Makefile | 2 +- kernel/bpf/bpf_cgrp_storage.c | 276 ++++++++++++++++++ kernel/bpf/cgroup_iter.c | 2 +- kernel/bpf/helpers.c | 6 + kernel/bpf/syscall.c | 3 +- kernel/bpf/verifier.c | 13 +- kernel/cgroup/cgroup.c | 4 + kernel/trace/bpf_trace.c | 4 + scripts/bpf_doc.py | 2 + .../bpf/bpftool/Documentation/bpftool-map.rst | 2 +- tools/bpf/bpftool/map.c | 2 +- tools/include/uapi/linux/bpf.h | 48 ++- tools/lib/bpf/libbpf.c | 1 + tools/lib/bpf/libbpf_probes.c | 1 + .../bpf/prog_tests/cgroup_local_storage.c | 92 ++++++ .../bpf/progs/cgroup_local_storage.c | 88 ++++++ .../selftests/bpf/progs/cgroup_ls_recursion.c | 70 +++++ 23 files changed, 769 insertions(+), 8 deletions(-) create mode 100644 Documentation/bpf/map_cgrp_storage.rst create mode 100644 kernel/bpf/bpf_cgrp_storage.c create mode 100644 tools/testing/selftests/bpf/prog_tests/cgroup_local_s= torage.c create mode 100644 tools/testing/selftests/bpf/progs/cgroup_local_storag= e.c create mode 100644 tools/testing/selftests/bpf/progs/cgroup_ls_recursion= .c --=20 2.30.2