From: "Michal Koutný" <mkoutny-IBi9RG/b67k@public.gmane.org>
To: Christian Brauner <brauner-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
Cc: Tejun Heo <tj-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>,
Roman Gushchin <guro-b10kYP2dOMg@public.gmane.org>,
Shakeel Butt <shakeelb-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>,
Zefan Li <lizefan.x-EC8Uxl6Npydl57MIdRCFDg@public.gmane.org>,
Johannes Weiner <hannes-druUgvl0LCNAfugRpC6u6w@public.gmane.org>,
cgroups-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
Christian Brauner
<christian.brauner-GeWIH/nMZzLQT0dZR+AlfA@public.gmane.org>
Subject: Re: [RFC PATCH] cgroup: add cgroup.signal
Date: Mon, 26 Apr 2021 16:42:45 +0200 [thread overview]
Message-ID: <YIbRZeWIl8i6soSN@blackbook> (raw)
In-Reply-To: <20210423171351.3614430-1-brauner-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
[-- Attachment #1: Type: text/plain, Size: 2954 bytes --]
Hello Christian,
I have some questions to understand the motivation here.
On Fri, Apr 23, 2021 at 07:13:51PM +0200, Christian Brauner <brauner-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org> wrote:
> - Signals are specified by writing the signal number into cgroup.signal.
> An alternative would be to allow writing the signal name but I don't
> think that's worth it. Callers like systemd can easily do a snprintf()
> with the signal's define/enum.
> - Since signaling is a one-time event and we're holding cgroup_mutex()
> as we do for freezer we don't need to worry about tasks joining the
> cgroup while we're signaling the cgroup. Freezer needed to care about
> this because a task could join or leave frozen/non-frozen cgroups.
> Since we only support SIGKILL currently and SIGKILL works for frozen
> tasks there's also not significant interaction with frozen cgroups.
> - Since signaling leads to an event and not a state change the
> cgroup.signal file is write-only.
Have you considered accepting a cgroup fd to pidfd_send_signal and
realize this operation through this syscall? (Just asking as it may
prevent some of these consequences whereas bring other unclarities.)
> - Since we currently only support SIGKILL we don't need to generate a
> separate notification and can rely on the unpopulated notification
> meachnism. If we support more signals we can introduce a separate
> notification in cgroup.events.
What kind of notification do you have in mind here?
> - Freezer doesn't care about tasks in different pid namespaces, i.e. if
> you have two tasks in different pid namespaces the cgroup would still
> be frozen.
> The cgroup.signal mechanism should consequently behave the same way,
> i.e. signal all processes and ignore in which pid namespace they
> exist. This would obviously mean that if you e.g. had a task from an
> ancestor pid namespace join a delegated cgroup of a container in a
> child pid namespace the container can kill that task. But I think this
> is fine and actually the semantics we want since the cgroup has been
> delegated.
What do you mean by a delegated cgroup in this context?
> - We're holding the read-side of tasklist lock while we're signaling
> tasks. That seems fine since kill(-1, SIGKILL) holds the read-side
> of tasklist lock walking all processes and is a way for unprivileged
> users to trigger tasklist lock being held for a long time. In contrast
> it would require a delegated cgroup with lots of processes and a deep
> hierarchy to allow for something similar with this interface.
I'd better not proliferate tasklist_lock users if it's avoidable (such
as freezer does).
> Fwiw, in addition to the system manager and container use-cases I think
> this has the potential to be picked up by the "kill" tool. In the future
> I'd hope we can do: kill -9 --cgroup /sys/fs/cgroup/delegated
(OT: FTR, there's `systemctl kill` already ;-))
Michal
[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 833 bytes --]
next prev parent reply other threads:[~2021-04-26 14:42 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-04-23 17:13 [RFC PATCH] cgroup: add cgroup.signal Christian Brauner
[not found] ` <20210423171351.3614430-1-brauner-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
2021-04-23 19:01 ` Roman Gushchin
[not found] ` <YIMZkjzNFypjZao9-cx5fftMpWqeCjSd+JxjunQ2O0Ztt9esIQQ4Iyu8u01E@public.gmane.org>
2021-04-26 14:42 ` Michal Koutný
2021-04-26 15:15 ` Christian Brauner
2021-04-26 19:02 ` Michal Koutný
2021-04-26 14:42 ` Michal Koutný [this message]
2021-04-26 15:29 ` Christian Brauner
2021-04-26 16:08 ` Shakeel Butt
[not found] ` <CALvZod5=eLQMdVXxuhj9ia=PkoRvT5oBxeqZAVtQpSukZ=tCxA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2021-04-26 16:24 ` Christian Brauner
2021-04-26 19:03 ` Michal Koutný
2021-04-27 9:36 ` Christian Brauner
2021-04-27 14:29 ` Tejun Heo
[not found] ` <YIgfrP5J3aXHfM1i-NiLfg/pYEd1N0TnZuCh8vA@public.gmane.org>
2021-04-28 14:37 ` Christian Brauner
2021-04-28 16:04 ` Tejun Heo
[not found] ` <YImHjGGuIt0ebL0G-NiLfg/pYEd1N0TnZuCh8vA@public.gmane.org>
2021-04-28 18:12 ` Roman Gushchin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YIbRZeWIl8i6soSN@blackbook \
--to=mkoutny-ibi9rg/b67k@public.gmane.org \
--cc=brauner-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org \
--cc=cgroups-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=christian.brauner-GeWIH/nMZzLQT0dZR+AlfA@public.gmane.org \
--cc=guro-b10kYP2dOMg@public.gmane.org \
--cc=hannes-druUgvl0LCNAfugRpC6u6w@public.gmane.org \
--cc=lizefan.x-EC8Uxl6Npydl57MIdRCFDg@public.gmane.org \
--cc=shakeelb-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org \
--cc=tj-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox