From: Vivek Goyal <vgoyal@redhat.com>
To: Luis Henriques <lhenriques@suse.de>
Cc: Miklos Szeredi <miklos@szeredi.hu>,
linux-fsdevel@vger.kernel.org, virtio-fs@redhat.com,
linux-kernel@vger.kernel.org,
"Dr. David Alan Gilbert" <dgilbert@redhat.com>
Subject: Re: [RFC PATCH] fuse: Clear SGID bit when setting mode in setacl
Date: Tue, 2 Mar 2021 11:25:54 -0500
Message-ID: <20210302162554.GE220334@redhat.com>
In-Reply-To: <20210302160033.GD220334@redhat.com>

On Tue, Mar 02, 2021 at 11:00:33AM -0500, Vivek Goyal wrote:
> On Mon, Mar 01, 2021 at 06:20:30PM +0000, Luis Henriques wrote:
> > On Mon, Mar 01, 2021 at 11:33:24AM -0500, Vivek Goyal wrote:
> > > On Fri, Feb 26, 2021 at 06:33:57PM +0000, Luis Henriques wrote:
> > > > Setting file permissions with POSIX ACLs (setxattr) isn't clearing the
> > > > setgid bit. This seems to be CVE-2016-7097, detected by running fstest
> > > > generic/375 in virtiofs. Unfortunately, when the fix for this CVE landed
> > > > in the kernel with commit 073931017b49 ("posix_acl: Clear SGID bit when
> > > > setting file permissions"), FUSE didn't have ACL support yet.
> > >
> > > Hi Luis,
> > >
> > > Interesting. I did not know that "chmod" can lead to clearing of SGID
> > > as well. Recently we implemented the FUSE_HANDLE_KILLPRIV_V2 flag, which
> > > means that the file server is responsible for clearing SUID/SGID/caps
> > > as per the following rules.
> > >
> > > - caps are always cleared on chown/write/truncate
> > > - suid is always cleared on chown, while for truncate/write it is cleared
> > >   only if the caller does not have CAP_FSETID.
> > > - sgid is always cleared on chown, while for truncate/write it is cleared
> > >   only if the caller does not have CAP_FSETID and the file has group
> > >   execute permission.
> > >
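
As a rough sketch only (not taken from any patch in this thread), the
suid/sgid rules listed above could be modelled in plain C as below. The
enum, the function name, and the has_fsetid flag are made up for
illustration, and clearing of caps is not modelled at all.

```c
#include <stdbool.h>
#include <sys/stat.h>
#include <sys/types.h>

/* Hypothetical operation kinds, named here only for illustration. */
enum priv_op { OP_CHOWN, OP_WRITE, OP_TRUNCATE };

/*
 * Return the file mode after applying the suid/sgid rules listed above.
 * has_fsetid says whether the caller holds CAP_FSETID (an assumed input;
 * how the server would learn this is exactly the open question).
 */
static mode_t mode_after_op(mode_t mode, enum priv_op op, bool has_fsetid)
{
    if (op == OP_CHOWN) {
        /* chown: suid and sgid are always cleared. */
        return mode & ~(mode_t)(S_ISUID | S_ISGID);
    }

    /* write/truncate: nothing is cleared if the caller has CAP_FSETID. */
    if (has_fsetid) {
        return mode;
    }

    /* Without CAP_FSETID, suid is cleared unconditionally... */
    mode &= ~(mode_t)S_ISUID;

    /* ...and sgid only when the file has group execute permission. */
    if (mode & S_IXGRP) {
        mode &= ~(mode_t)S_ISGID;
    }
    return mode;
}
```

For example, under these rules a write by a caller without CAP_FSETID to
a 06755 file would leave it at 0755, while a 06644 file would keep its
sgid bit because group execute is not set.
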
> > > And we don't have anything about "chmod" in this list. Well, I will test
> > > this and come back to it a little later.
> > >
> > > I see the following comment in fuse_set_acl().
> > >
> > > /*
> > >  * Fuse userspace is responsible for updating access
> > >  * permissions in the inode, if needed. fuse_setxattr
> > >  * invalidates the inode attributes, which will force
> > >  * them to be refreshed the next time they are used,
> > >  * and it also updates i_ctime.
> > >  */
> > >
> > > So it looks like the original code was written with the intent that
> > > the file server is responsible for updating inode permissions. I am
> > > assuming this will include clearing of S_ISGID if needed.
> > >
> > > But the question is, does the file server have enough information to
> > > handle clearing of S_ISGID properly? IIUC, the file server will need
> > > at least two pieces of information.
> > >
> > > - gid of the caller.
> > > - Whether caller has CAP_FSETID or not.
> > >
> > > I think we have the first piece of information but not the second one.
> > > Maybe we need to send this in fuse_setxattr_in->flags, and the file server
> > > can drop CAP_FSETID while doing setxattr().
> > >
> > > What about the "gid" info? We don't change to the caller's uid/gid while
> > > doing setxattr(), so the host might not clear S_ISGID, or might clear it
> > > when it should not. I am wondering whether we can switch to the caller's
> > > uid/gid in setxattr(), at least while setting ACLs.
> >
> > Thanks for looking into this. To be honest, initially I thought that the
> > fix should be done in the server too, but when I looked into the code I
> > couldn't find an easy way to get that done (without modifying the data
> > being passed from the kernel in setxattr).
> >
> > So, what I've done was to look at what other filesystems were doing in the
> > ACL code, and that's where I found out about this CVE. The CVE fix for
> > the other filesystems looked easy enough to be included in FUSE too.
>
> Hi Luis,
>
> I still feel that it should probably be fixed in virtiofsd, given that the
> fuse client is expecting the file server to take care of any change of mode
> (file permission bits).

Having said that, there is one disadvantage of relying on the server to
do this. The idmapped mount patches have now been merged. If virtiofs
were ever to support idmapped mounts, this will become an issue.
The server does not know about idmapped mounts, and it does not have
the information needed to shift the inode gid to determine whether SGID
should be cleared or not.
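
To illustrate the concern, here is a small userspace sketch of the SGID
decision (clear S_ISGID when the caller is neither in the inode's group
nor holds CAP_FSETID). It is not kernel or virtiofsd code: struct caller,
struct gid_shift and all function names are hypothetical, and the gid
shift is a simple assumed range offset rather than the real idmapping
machinery.

```c
#include <stdbool.h>
#include <stddef.h>
#include <sys/stat.h>
#include <sys/types.h>

/* Hypothetical description of the calling task; not a FUSE/kernel type. */
struct caller {
    const gid_t *groups; /* fsgid plus supplementary groups */
    size_t ngroups;
    bool has_fsetid;     /* whether the caller holds CAP_FSETID */
};

/* Hypothetical idmapping: gids [disk_base, disk_base + count) are seen
 * through the mount as [mnt_base, mnt_base + count). */
struct gid_shift {
    gid_t disk_base;
    gid_t mnt_base;
    gid_t count;
};

static bool caller_in_group(const struct caller *c, gid_t gid)
{
    for (size_t i = 0; i < c->ngroups; i++) {
        if (c->groups[i] == gid) {
            return true;
        }
    }
    return false;
}

/* Shift a gid through the assumed mapping; NULL means "no mapping known". */
static gid_t shift_gid(const struct gid_shift *m, gid_t gid)
{
    if (m && gid >= m->disk_base && gid - m->disk_base < m->count) {
        return m->mnt_base + (gid - m->disk_base);
    }
    return gid;
}

/* Mode to store when an access ACL is set: drop S_ISGID if the caller is
 * not in the (shifted) owning group and lacks CAP_FSETID. */
static mode_t setacl_mode(mode_t mode, gid_t inode_gid,
                          const struct gid_shift *m, const struct caller *c)
{
    gid_t seen = shift_gid(m, inode_gid);

    if (!caller_in_group(c, seen) && !c->has_fsetid) {
        mode &= ~(mode_t)S_ISGID;
    }
    return mode;
}
```

With a caller in group 1000 and a mapping that shifts gid 0 to 1000, the
check keeps S_ISGID when the mapping is applied and clears it when the
mapping is unknown (NULL), which is the position the server would be in.
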

So if we keep possible future support for idmapped mounts in mind,
then solving it in the client makes more sense. (/me is afraid that there
might be other dependencies like this elsewhere.)

Miklos, WDYT?

Thanks
Vivek

>
> I wrote a proof of concept patch and this should fix this. But it
> always drops CAP_FSETID. So I will need to modify the kernel to pass
> this information to the file server, and that should properly fix
> generic/375.
>
> Please have a look. This applies on top of fuse acl support V4 patches
> I had posted. I have pushed all the patches on a temporary git branch
> as well.
>
> https://github.com/rhvgoyal/qemu/commits/acl-sgid
>
> Vivek
>
>
> Subject: virtiofsd: Switch creds, drop FSETID for system.posix_acl_access xattr
>
> When posix access acls are set on a file, it can lead to adjusting file
> permissions (mode) as well. If the caller does not have CAP_FSETID and it
> also is not a member of the owning group, this will lead to clearing
> the SGID bit in mode.
>
> Current fuse code is written in such a way that it expects the file server
> to take care of changing file mode (permission), if there is a need.
> Right now, the host kernel does not clear the SGID bit because virtiofsd is
> running as root and has CAP_FSETID. For the host kernel to clear SGID,
> virtiofsd needs to switch to the gid of the caller in the guest and also
> drop CAP_FSETID (if the caller did not have it to begin with).
>
> This is a proof of concept patch which switches to the caller's uid/gid
> and always drops CAP_FSETID in lo_setxattr(system.posix_acl_access).
> This should fix the xfstest generic/375 test case.
>
> This patch is not complete yet. The kernel should pass information on when
> to drop CAP_FSETID and when not to. I will look into modifying the
> kernel to pass this information to the file server.
>
> Reported-by: Luis Henriques <lhenriques@suse.de>
> Yet-to-be-signed-off-by: Vivek Goyal <vgoyal@redhat.com>
> ---
> tools/virtiofsd/passthrough_ll.c | 28 +++++++++++++++++++++++++++-
> 1 file changed, 27 insertions(+), 1 deletion(-)
>
> Index: rhvgoyal-qemu/tools/virtiofsd/passthrough_ll.c
> ===================================================================
> --- rhvgoyal-qemu.orig/tools/virtiofsd/passthrough_ll.c 2021-03-02 08:06:20.539820330 -0500
> +++ rhvgoyal-qemu/tools/virtiofsd/passthrough_ll.c 2021-03-02 10:46:40.901334665 -0500
> @@ -172,7 +172,7 @@ struct lo_data {
>      int user_killpriv_v2, killpriv_v2;
>      /* If set, virtiofsd is responsible for setting umask during creation */
>      bool change_umask;
> -    int user_posix_acl;
> +    int user_posix_acl, posix_acl;
>  };
>
>  static const struct fuse_opt lo_opts[] = {
> @@ -677,6 +677,7 @@ static void lo_init(void *userdata, stru
>          fuse_log(FUSE_LOG_DEBUG, "lo_init: enabling posix acl\n");
>          conn->want |= FUSE_CAP_POSIX_ACL | FUSE_CAP_DONT_MASK;
>          lo->change_umask = true;
> +        lo->posix_acl = true;
>      } else {
>          /* User either did not specify anything or wants it disabled */
>          fuse_log(FUSE_LOG_DEBUG, "lo_init: disabling posix_acl\n");
> @@ -2981,12 +2982,37 @@ static void lo_setxattr(fuse_req_t req,
>
>      sprintf(procname, "%i", inode->fd);
>      if (S_ISREG(inode->filetype) || S_ISDIR(inode->filetype)) {
> +        bool switched_creds = false;
> +        struct lo_cred old = {};
> +
>          fd = openat(lo->proc_self_fd, procname, O_RDONLY);
>          if (fd < 0) {
>              saverr = errno;
>              goto out;
>          }
> +
> +        if (lo->posix_acl && !strcmp(name, "system.posix_acl_access")) {
> +            ret = lo_change_cred(req, &old, false);
> +            if (ret) {
> +                saverr = ret;
> +                goto out;
> +            }
> +            ret = drop_effective_cap("FSETID", NULL);
> +            if (ret != 0) {
> +                lo_restore_cred(&old, false);
> +                saverr = ret;
> +                goto out;
> +            }
> +            switched_creds = true;
> +        }
> +
>          ret = fsetxattr(fd, name, value, size, flags);
> +
> +        if (switched_creds) {
> +            if (gain_effective_cap("FSETID"))
> +                fuse_log(FUSE_LOG_ERR, "Failed to gain CAP_FSETID\n");
> +            lo_restore_cred(&old, false);
> +        }
>      } else {
>          /* fchdir should not fail here */
>          assert(fchdir(lo->proc_self_fd) == 0);