Linux NFS development
 help / color / mirror / Atom feed
From: Chuck Lever <chuck.lever@oracle.com>
To: Peter Staubach <staubach@redhat.com>
Cc: Jeff Layton <jlayton@redhat.com>,
	steved@redhat.com, libtirpc-devel@lists.sourceforge.net,
	linux-nfs@vger.kernel.org
Subject: Re: [PATCH] libtirpc: handle large numbers of supplemental groups gracefully (try #2)
Date: Tue, 16 Feb 2010 15:10:02 -0500	[thread overview]
Message-ID: <4B7AFB9A.1010208@oracle.com> (raw)
In-Reply-To: <4B7AF6CD.3080808@redhat.com>

On 02/16/2010 02:49 PM, Peter Staubach wrote:
> Jeff Layton wrote:
>> On Tue, 16 Feb 2010 14:24:09 -0500
>> Peter Staubach<staubach@redhat.com>  wrote:
>>
>>> Jeff Layton wrote:
>>>> This is the second attempt at this patch. The main changes are that this
>>>> one doesn't set a floor value for the size of the group list. There are
>>>> also a few minor cleanups and comments added.
>>>>
>>>> If authunix_create_default() is called by a user with more than 16
>>>> supplimental groups, it'll abort(), which causes the program to crash
>>>> and coredump.
>>>>
>>>> Fix it to handle this situation gracefully. Get the number of groups
>>>> that the user has first, and then allocate a big enough buffer to hold
>>>> them. Then, just don't let the lower function use more than the NGRPS
>>>> groups.
>>>>
>>>> Also fix up the error handling in this function so that it just returns
>>>> a NULL pointer on error and logs a message via warnx() instead of
>>>> calling abort().
>>>>
>>>> Reported-by: Peter Engel<peter.engel-y6kNeMnOB+c@public.gmane.org>
>>>> Signed-off-by: Jeff Layton<jlayton@redhat.com>
>>>> ---
>>>>   src/auth_unix.c |   62 +++++++++++++++++++++++++++++++++++++++++++++++++-----
>>>>   1 files changed, 56 insertions(+), 6 deletions(-)
>>>>
>>>> diff --git a/src/auth_unix.c b/src/auth_unix.c
>>>> index 71ca15d..a295e71 100644
>>>> --- a/src/auth_unix.c
>>>> +++ b/src/auth_unix.c
>>>> @@ -49,6 +49,7 @@
>>>>   #include<stdlib.h>
>>>>   #include<unistd.h>
>>>>   #include<string.h>
>>>> +#include<errno.h>
>>>>
>>>>   #include<rpc/types.h>
>>>>   #include<rpc/xdr.h>
>>>> @@ -175,20 +176,69 @@ AUTH *
>>>>   authunix_create_default()
>>>>   {
>>>>   	int len;
>>>> +	size_t bufsize = 0;
>>>>   	char machname[MAXHOSTNAMELEN + 1];
>>>>   	uid_t uid;
>>>>   	gid_t gid;
>>>> -	gid_t gids[NGRPS];
>>>> +	gid_t *gids = NULL;
>>>> +	AUTH *auth;
>>>> +
>>>> +	if (gethostname(machname, sizeof machname) == -1) {
>>>> +		warnx("%s: gethostname() failed: %s", __func__,
>>>> +			strerror(errno));
>>>> +		return NULL;
>>>> +	}
>>>>
>>>> -	if (gethostname(machname, sizeof machname) == -1)
>>>> -		abort();
>>>>   	machname[sizeof(machname) - 1] = 0;
>>>>   	uid = geteuid();
>>>>   	gid = getegid();
>>>> -	if ((len = getgroups(NGRPS, gids))<  0)
>>>> -		abort();
>>>> +
>>>> +retry:
>>>> +	len = getgroups(0, NULL);
>>>> +	if (len<  0) {
>>>> +		warnx("%s: failed to get number of groups: %s", __func__,
>>>> +			strerror(errno));
>>>> +		return NULL;
>>>> +	}
>>>> +
>>>> +	if (len == 0)
>>>> +		goto no_groups;
>>>> +
>>>> +	bufsize = len * sizeof(gid_t);
>>>> +	gids = mem_alloc(bufsize);
>>>> +	if (gids == NULL) {
>>>> +		warnx("%s: memory allocation failed: %s", __func__,
>>>> +			strerror(ENOMEM));
>>>> +		return NULL;
>>>> +	}
>>>> +
>>>> +	len = getgroups(len, gids);
>>>> +	if (len<  0) {
>>>> +		mem_free(gids, bufsize);
>>>> +		/*
>>>> +		 * glibc equivalent routines mention that it's possible for
>>>> +		 * the number of groups to change between two getgroups calls.
>>>> +		 * If that happens, retry the whole thing again.
>>>> +		 */
>>>> +		if (len == -EINVAL) {
>>>> +			gids = NULL;
>>>> +			bufsize = 0;
>>>> +			goto retry;
>>>> +		}
>>>> +		warnx("%s: failed to get group list: %s", __func__,
>>>> +			strerror(errno));
>>>> +		return NULL;
>>>> +	}
>>>> +
>>>> +	/* AUTH_UNIX has a hard limit of NGRPS supplemental groups */
>>>> +	if (len>  NGRPS)
>>>> +		len = NGRPS;
>>>> +
>>>> +no_groups:
>>>>   	/* XXX: interface problem; those should all have been unsigned */
>>>> -	return (authunix_create(machname, uid, gid, len, gids));
>>>> +	auth = authunix_create(machname, uid, gid, len, gids);
>>>> +	mem_free(gids, bufsize);
>>>> +	return auth;
>>>>   }
>>>>
>>>>   /*
>>> This change to restrict the groups used to the first NGRPS
>>> groups is one that we have always avoided.  It can be quite
>>> confusing to the user, to have an operation fail, but then
>>> to look and notice that the correct group is listed.
>>>
>>> Having the library abort seems odd and wrong, but this will
>>> also change semantics.  Is there really a problem here,
>>> after all of these years, that must be addressed?
>>>
>>
>> Yes, we had a bug report against nfs-utils where someone was seeing
>> umount.nfs core dump because root was in>NGRPS groups.
>>
>> https://bugzilla.redhat.com/show_bug.cgi?id=565507
>>
>> ...as Chuck points out, this is basically what glibc has done for
>> years. While it may be confusing for users, I think it's more
>> preferable than having the programs fail outright.
>>
>> What sort of behavior would you propose in its place?
>>
>
> Well, I do agree that a library routine should not call abort().
> That's just pure and simple, wrong.  That bug must be fixed.
>
> However, making a policy decision for the user also seems
> wrong.  Just because glibc did something does not make it right.

I'm not going to argue with that :-)

> It does tend to make it a standard, so the semantic change
> argument becomes weaker, but this change was made to a different
> RPC library, so the application must have/be changed to use it,
> so all bets are off again.

Not precisely true here.  authunix_create_default() is, I believe, also 
part of the legacy RPC interface (TS-RPC).  Thus any application that 
uses only the legacy RPC calls and is relinked with libtirpc can use 
this interface, and may expect it to behave as before.

> My inclination would be to return NULL and see who complains.

Yeah, unfortunately many of authunix_create_default's callers don't 
check the return value.

In the umount.nfs and showmount cases, what would be the appropriate 
recovery action for the application, if authunix_create_default() 
returns NULL?

-- 
chuck[dot]lever[at]oracle[dot]com

  reply	other threads:[~2010-02-16 20:12 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-02-16 19:16 [PATCH] libtirpc: handle large numbers of supplemental groups gracefully (try #2) Jeff Layton
2010-02-16 19:22 ` Chuck Lever
2010-02-16 19:24 ` Peter Staubach
2010-02-16 19:25   ` Chuck Lever
2010-02-16 19:33   ` Jeff Layton
     [not found]     ` <20100216143307.45a7ed42-9yPaYZwiELC+kQycOl6kW4xkIHaj4LzF@public.gmane.org>
2010-02-16 19:49       ` Peter Staubach
2010-02-16 20:10         ` Chuck Lever [this message]
2010-02-16 20:14         ` Jeff Layton

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4B7AFB9A.1010208@oracle.com \
    --to=chuck.lever@oracle.com \
    --cc=jlayton@redhat.com \
    --cc=libtirpc-devel@lists.sourceforge.net \
    --cc=linux-nfs@vger.kernel.org \
    --cc=staubach@redhat.com \
    --cc=steved@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox