From: Sargun Dhillon <sargun@sargun.me>
To: Christian Brauner <brauner@kernel.org>
Cc: Aleksa Sarai <cyphar@cyphar.com>,
linux-fsdevel@vger.kernel.org, linux-api@vger.kernel.org,
Christoph Hellwig <hch@lst.de>
Subject: Re: [PATCH 2/3] fs: Allow user to lock mount attributes with mount_setattr
Date: Fri, 25 Aug 2023 11:02:39 -0600 [thread overview]
Message-ID: <CAMp4zn9FwArMzibK6XY15b9_ouHo03DfUzOC199n2bNHkMqtGA@mail.gmail.com> (raw)
In-Reply-To: <20230816-covern-kronzeuge-6d60381f598c@brauner>
On Wed, Aug 16, 2023 at 4:36 AM Christian Brauner <brauner@kernel.org> wrote:
>
> On Wed, Aug 16, 2023 at 06:56:25PM +1000, Aleksa Sarai wrote:
> > On 2023-08-16, Christian Brauner <brauner@kernel.org> wrote:
> > > On Tue, Aug 15, 2023 at 06:46:33AM -0700, Sargun Dhillon wrote:
> > > > On Tue, Aug 15, 2023 at 2:30 AM Christian Brauner <brauner@kernel.org> wrote:
> > > > >
> > > > > On Thu, Aug 10, 2023 at 02:00:43AM -0700, Sargun Dhillon wrote:
> > > > > > We support locking certain mount attributes in the kernel. This API
> > > > > > isn't directly exposed to users. Right now, users can lock mount
> > > > > > attributes by going through the process of creating a new user
> > > > > > namespaces, and when the mounts are copied to the "lower privilege"
> > > > > > domain, they're locked. The mount can be reopened, and passed around
> > > > > > as a "locked mount".
> > > > >
> > > > > Not sure if that's what you're getting at but you can actually fully
> > > > > create these locked mounts already:
> > > > >
> > > > > P1 P2
> > > > > # init userns + init mountns # init userns + init mountns
> > > > > sudo mount --bind /foo /bar
> > > > > sudo mount --bind -o ro,nosuid,nodev,noexec /bar
> > > > >
> > > > > # unprivileged userns + unprivileged mountns
> > > > > unshare --mount --user --map-root
> > > > >
> > > > > mount --bind -oremount
> > > > >
> > > > > fd = open_tree(/bar, OPEN_TREE_CLONE)
> > > > >
> > > > > send(fd_send, P2);
> > > > >
> > > > > recv(&fd_recv, P1)
> > > > >
> > > > > move_mount(fd_recv, /locked-mnt);
> > > > >
> > > > > and now you have a fully locked mount on the host for P2. Did you mean that?
> > > > >
> > > >
> > > > Yep. Doing this within a program without clone / fork is awkward. Forking and
> > > > unsharing in random C++ programs doesn't always go super well, so in my
> > > > mind it'd be nice to have an API to do this directly.
> > > >
> > > > In addition, having the superblock continue to be owned by the userns that
> > > > its mounted in is nice because then they can toggle the other mount attributes
> > > > (nodev, nosuid, noexec are the ones we care about).
> > > >
> > > > > >
> > > > > > Locked mounts are useful, for example, in container execution without
> > > > > > user namespaces, where you may want to expose some host data as read
> > > > > > only without allowing the container to remount the mount as mutable.
> > > > > >
> > > > > > The API currently requires that the given privilege is taken away
> > > > > > while or before locking the flag in the less privileged position.
> > > > > > This could be relaxed in the future, where the user is allowed to
> > > > > > remount the mount as read only, but once they do, they cannot make
> > > > > > it read only again.
> > > > >
> > > > > s/read only/read write/
> > > > >
> > > > > >
> > > > > > Right now, this allows for all flags that are lockable via the
> > > > > > userns unshare trick to be locked, other than the atime related
> > > > > > ones. This is because the semantics of what the "less privileged"
> > > > > > position is around the atime flags is unclear.
> > > > >
> > > > > I think that atime stuff doesn't really make sense to expose to
> > > > > userspace. That seems a bit pointless imho.
> > > > >
> > > > > >
> > > > > > Signed-off-by: Sargun Dhillon <sargun@sargun.me>
> > > > > > ---
> > > > > > fs/namespace.c | 40 +++++++++++++++++++++++++++++++++++---
> > > > > > include/uapi/linux/mount.h | 2 ++
> > > > > > 2 files changed, 39 insertions(+), 3 deletions(-)
> > > > > >
> > > > > > diff --git a/fs/namespace.c b/fs/namespace.c
> > > > > > index 54847db5b819..5396e544ac84 100644
> > > > > > --- a/fs/namespace.c
> > > > > > +++ b/fs/namespace.c
> > > > > > @@ -78,6 +78,7 @@ static LIST_HEAD(ex_mountpoints); /* protected by namespace_sem */
> > > > > > struct mount_kattr {
> > > > > > unsigned int attr_set;
> > > > > > unsigned int attr_clr;
> > > > > > + unsigned int attr_lock;
> > > > >
> > > > > So when I originally noted down this crazy idea
> > > > > https://github.com/uapi-group/kernel-features
> > > > > I didn't envision a new struct member but rather a flag that could be
> > > > > raised in attr_set like MOUNT_ATTR_LOCK that would indicate for the
> > > > > other flags in attr_set to become locked.
> > > > >
> > > > > So if we could avoid growing the struct pointlessly I'd prefer that. Is
> > > > > there a reason that wouldn't work?
> > > > No reason. The semantics were just a little more awkward, IMHO.
> > > > Specifically:
> > > > * This attr could never be cleared, only set, which didn't seem to follow
> > > > the attr_set / attr_clr semantics
> > > > * If we ever introduced a mount_getattr call, you'd want to expose
> > > > each of the locked bits independently, I'd think, and exposing
> > > > that through one flag wouldn't give you the same fidelity.
> > >
> > > Hm, right. So it's either new flags or a new member. @Aleksa?
> >
> > I like ->attr_lock more tbh, especially since they cannot be cleared.
> > They are implemented as mount flags internally, but conceptually locking
> > flags is a separate thing to setting them.
>
> Ok, it'd be neat if could do the sanity review of this api then as you
> know the eventual users probably best.
Aleksa,
What do you think? The biggest miss / frustration with this API is that
there is no way to introspect which flags are locked from userspace,
but given the absence of a mount_getattr syscall (currently), I think
that we can add that later.
next prev parent reply other threads:[~2023-08-25 17:04 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-08-10 9:00 [PATCH 1/3] selftests/mount_setattr: Add a test to test locking mount attrs Sargun Dhillon
2023-08-10 9:00 ` [PATCH 2/3] fs: Allow user to lock mount attributes with mount_setattr Sargun Dhillon
2023-08-14 4:40 ` Aleksa Sarai
2023-08-14 6:18 ` Aleksa Sarai
2023-08-14 18:15 ` Sargun Dhillon
2023-08-14 23:58 ` Aleksa Sarai
2023-08-15 9:30 ` Christian Brauner
2023-08-15 13:46 ` Sargun Dhillon
2023-08-16 7:32 ` Christian Brauner
2023-08-16 8:56 ` Aleksa Sarai
2023-08-16 10:36 ` Christian Brauner
2023-08-25 17:02 ` Sargun Dhillon [this message]
2023-08-10 9:00 ` [PATCH 3/3] selftests/mount_setattr: Add tests for mount locking API Sargun Dhillon
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAMp4zn9FwArMzibK6XY15b9_ouHo03DfUzOC199n2bNHkMqtGA@mail.gmail.com \
--to=sargun@sargun.me \
--cc=brauner@kernel.org \
--cc=cyphar@cyphar.com \
--cc=hch@lst.de \
--cc=linux-api@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).