From: Tom Hromatka <tom.hromatka@oracle.com>
To: Kees Cook <keescook@chromium.org>
Cc: Network Development <netdev@vger.kernel.org>,
Sargun Dhillon <sargun@sargun.me>, Will Drewry <wad@chromium.org>,
Daniel Borkmann <daniel@iogearbox.net>,
Linux Containers <containers@lists.linux-foundation.org>,
Alexei Starovoitov <ast@kernel.org>,
Andy Lutomirski <luto@amacapital.net>
Subject: Re: [PATCH net-next 0/3] eBPF Seccomp filters
Date: Tue, 13 Feb 2018 13:38:53 -0700 [thread overview]
Message-ID: <b885663b-eaa2-f212-7b9b-2496943a30e1@oracle.com> (raw)
In-Reply-To: <CAGXu5jJZgrgLrhkZO33RNdOds8zwnnOZh+rqwguxJM+zm=EJ7g@mail.gmail.com>
On 02/13/2018 01:35 PM, Kees Cook wrote:
> On Tue, Feb 13, 2018 at 12:33 PM, Tom Hromatka <tom.hromatka@oracle.com> wrote:
>> On Tue, Feb 13, 2018 at 7:42 AM, Sargun Dhillon <sargun@sargun.me> wrote:
>>> This patchset enables seccomp filters to be written in eBPF. Although,
>>> this patchset doesn't introduce much of the functionality enabled by
>>> eBPF, it lays the ground work for it.
>>>
>>> It also introduces the capability to dump eBPF filters via the PTRACE
>>> API in order to make it so that CHECKPOINT_RESTORE will be satisifed.
>>> In the attached samples, there's an example of this. One can then use
>>> BPF_OBJ_GET_INFO_BY_FD in order to get the actual code of the program,
>>> and use that at reload time.
>>>
>>> The primary reason for not adding maps support in this patchset is
>>> to avoid introducing new complexities around PR_SET_NO_NEW_PRIVS.
>>> If we have a map that the BPF program can read, it can potentially
>>> "change" privileges after running. It seems like doing writes only
>>> is safe, because it can be pure, and side effect free, and therefore
>>> not negatively effect PR_SET_NO_NEW_PRIVS. Nonetheless, if we come
>>> to an agreement, this can be in a follow-up patchset.
>>
>>
>> Coincidentally I also sent an RFC for adding eBPF hash maps to the seccomp
>> userspace mailing list just last week:
>> https://groups.google.com/forum/#!topic/libseccomp/pX6QkVF0F74
>>
>> The kernel changes I proposed are in this email:
>> https://groups.google.com/d/msg/libseccomp/pX6QkVF0F74/ZUJlwI5qAwAJ
>>
>> In that email thread, Kees requested that I try out a binary tree in cBPF
>> and evaluate its performance. I just got a rough prototype working, and
>> while not as fast as an eBPF hash map, the cBPF binary tree was a
>> significant
>> improvement over the linear list of ifs that are currently generated. Also,
>> it only required changing a single function within the libseccomp libary
>> itself.
>>
>> https://github.com/drakenclimber/libseccomp/commit/87b36369f17385f5a7a4d95101185577fbf6203b
>>
>> Here are the results I am currently seeing using an in-house customer's
>> seccomp filter and a simplistic test program that runs getppid() thousands
>> of times.
>>
>> Test Case minimum TSC ticks to make syscall
>> ----------------------------------------------------------------
>> seccomp disabled 620
>> getppid() at the front of 306-syscall seccomp filter 722
>> getppid() in middle of 306-syscall seccomp filter 1392
>> getppid() at the end of the 306-syscall filter 2452
>> seccomp using a 306-syscall-sized EBPF hash map 800
>> cBPF filter using a binary tree 922
> I still think that's a crazy filter. :) It should be inverted to just
> check the 26 syscalls and a final "greater than" test. I would expect
> it to be faster still. :)
>
> -Kees
I completely agree it's a crazy filter, but it seems to be a
common "mistake" our users are making. It would be nice to
help them out if we can.
Tom
next prev parent reply other threads:[~2018-02-13 20:39 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-02-13 20:33 [PATCH net-next 0/3] eBPF Seccomp filters Tom Hromatka
2018-02-13 20:35 ` Kees Cook
2018-02-13 20:38 ` Tom Hromatka [this message]
-- strict thread matches above, loose matches on Subject: below --
2018-02-13 15:42 Sargun Dhillon
2018-02-13 15:47 ` Kees Cook
2018-02-13 16:29 ` Sargun Dhillon
2018-02-13 17:02 ` Jessie Frazelle
[not found] ` <CAEk6tEw3ty0kBH+06TYt4=Ywt-4_cHBa9f8p3ajMghtjRkHmMg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2018-02-13 17:07 ` Brian Goff
2018-02-13 17:31 ` Sargun Dhillon
2018-02-13 20:16 ` Kees Cook
[not found] ` <CAGXu5jKv3QFVKLhok1JWiPamE0b4CqLTO-hx8sP0KWED921=6w-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2018-02-13 20:50 ` Tycho Andersen
2018-02-13 21:08 ` Paul Moore
2018-02-14 17:25 ` Andy Lutomirski
2018-02-14 17:32 ` Tycho Andersen
2018-02-15 4:30 ` Alexei Starovoitov
[not found] ` <20180215043027.zssmhvfdn7iz3rlz-+o4/htvd0TCa6kscz5V53/3mLCh9rsb+VpNB7YpNyf8@public.gmane.org>
2018-02-15 8:35 ` Lorenzo Colitti via Containers
2018-02-15 16:05 ` Andy Lutomirski
2018-02-16 18:39 ` Sargun Dhillon
2018-02-14 0:47 ` Mickaël Salaün
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=b885663b-eaa2-f212-7b9b-2496943a30e1@oracle.com \
--to=tom.hromatka@oracle.com \
--cc=ast@kernel.org \
--cc=containers@lists.linux-foundation.org \
--cc=daniel@iogearbox.net \
--cc=keescook@chromium.org \
--cc=luto@amacapital.net \
--cc=netdev@vger.kernel.org \
--cc=sargun@sargun.me \
--cc=wad@chromium.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).