From: Wido den Hollander <wido@42on.com>
To: Sage Weil <sage@inktank.com>
Cc: Greg Farnum <greg@inktank.com>, ceph-devel@vger.kernel.org
Subject: Re: CephFS First product release discussion
Date: Tue, 05 Mar 2013 19:36:32 +0100 [thread overview]
Message-ID: <51363B30.7080006@42on.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1303051027180.26446@cobra.newdream.net>
On 03/05/2013 07:28 PM, Sage Weil wrote:
> On Tue, 5 Mar 2013, Greg Farnum wrote:
>> On Tuesday, March 5, 2013 at 10:08 AM, Wido den Hollander wrote:
>>> On 03/05/2013 06:03 PM, Greg Farnum wrote:
>>>> This is a companion discussion to the blog post at http://ceph.com/dev-notes/cephfs-mds-status-discussion/ ? go read that!
>>>>
>>>> The short and slightly alternate version: I spent most of about two weeks working on bugs related to snapshots in the MDS, and we started realizing that we could probably do our first supported release of CephFS and the related infrastructure much sooner if we didn't need to support all of the whizbang features. (This isn't to say that the base feature set is stable now, but it's much closer than when you turn on some of the other things.) I'd like to get feedback from you in the community on what minimum supported feature set would prompt or allow you to start using CephFS in real environments ? not what you'd *like* to see, but what you *need* to see. This will allow us at Inktank to prioritize more effectively and hopefully get out a supported release much more quickly! :)
>>>>
>>>> The current proposed feature set is basically what's left over after we've trimmed off everything we can think to split off, but if any of the proposed included features are also particularly important or don't matter, be sure to mention them (NFS export in particular ? it works right now but isn't in great shape due to NFS filehandle caching).
>>>
>>> Great news! Although RBD and RADOS itself are already great, a lot of
>>> applications would still require a shared filesystem.
>>>
>>> Think about a (Cloud|Open)Stack environment with thousands of instances
>>> running but also need some form of shared filesystem.
>>>
>>> One thing I'm missing though is user-quotas, have they been discussed at
>>> all and what would the work to implement those involve?
>>>
>>> I know it would require a lot more tracking per file so it's not that
>>> easy and would certainly not make it into a first release, but are they
>>> on the roadmap at all?
>>
>> Not at present. I think there are some tickets related to this in the
>> tracker as feature requests, but CephFS needs more groundwork about
>> multi-tenancy in general before we can do reasonable planning around a
>> robust user quota feature. (Near-real-time hacks are possible now based
>> around the rstats infrastructure and I believe somebody has built them,
>> though I've never seen them myself.)
>
> Wido, by 'user quota' do you mean something that is uid-based, or would
> enforcement on subtree/directory quotas be sufficient for your use cases?
> I've been holding out hope that uid-based usage accounting is a thing of
> the past and that subtrees are sufficient for real users... in which case
> adding enfocement to the existing rstats infrastructure is a very
> manageable task.
>
I mean actual uid-based quotas. That still plays nice with shared
environments like Samba or so where you have all homedirectories on a
shared filesystems and you set per user quotas. Samba reads out those
quotas and propagates them to the (Windows) client.
I know this was a problem with ZFS as well. They also said they could do
"per filesystem quotas" so that would be sufficient, but for example NFS
doesn't export filesystems mounted in a export, so if you have a bunch
of homedirectories on the filesystem and you want to account the usage
of each user it's getting kind of hard.
This could be solved if the clients directly mounted CephFS though.
I'm talking about setups where you have 100k users in a LDAP and they
all have their data in a single filesystem and you want to track the
usage of each user, that's not an easy task without uid-based quotas.
Running 'du' on each directory would be much faster with Ceph since it
accounts tracks the subdirectories and shows their total size with an
'ls -al'.
Environments with 100k users also tend to be very dynamic with adding
and removing users all the time, so creating separate filesystems for
them would be very time consuming.
Now, I'm not talking about enforcing soft or hard quotas, I'm just
talking about knowing how much space uid X and Y consume on the filesystem.
Wido
> sage
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
--
Wido den Hollander
42on B.V.
Phone: +31 (0)20 700 9902
Skype: contact42on
next prev parent reply other threads:[~2013-03-05 18:36 UTC|newest]
Thread overview: 31+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <sfid-H20130305-170326-+024.05-1@marduk.tchpc.tcd.ie>
2013-03-05 17:03 ` CephFS First product release discussion Greg Farnum
2013-03-05 18:08 ` Wido den Hollander
2013-03-05 18:17 ` Greg Farnum
2013-03-05 18:28 ` Sage Weil
2013-03-05 18:36 ` Wido den Hollander [this message]
2013-03-05 18:48 ` Jim Schutt
2013-03-05 19:33 ` Sage Weil
2013-03-06 17:24 ` Wido den Hollander
2013-03-06 19:07 ` Jim Schutt
2013-03-06 19:13 ` CephFS Space Accounting and Quotas (was: CephFS First product release discussion) Greg Farnum
2013-03-06 19:58 ` CephFS Space Accounting and Quotas Jim Schutt
2013-03-06 20:21 ` Greg Farnum
2013-03-06 21:28 ` Jim Schutt
2013-03-06 21:39 ` Greg Farnum
2013-03-06 23:14 ` Jim Schutt
2013-03-07 0:18 ` Greg Farnum
2013-03-07 15:15 ` Jim Schutt
2013-03-08 22:45 ` Jim Schutt
2013-03-09 2:05 ` Greg Farnum
2013-03-11 14:47 ` Jim Schutt
2013-03-11 15:48 ` Greg Farnum
2013-03-11 16:48 ` Jim Schutt
2013-03-11 16:57 ` Greg Farnum
2013-03-11 20:40 ` Jim Schutt
2013-03-12 22:34 ` Jim Schutt
[not found] ` <513FAE0F.2010608@sandia.gov>
[not found] ` <BE627BF4B6E74BD49037D07821FC1DB9@inktank.com>
[not found] ` <5143AA84.50409@sandia.gov>
2013-03-15 23:17 ` Greg Farnum
2013-03-18 14:19 ` Jim Schutt
2013-03-06 21:42 ` Sage Weil
2013-03-06 5:01 ` [ceph-users] CephFS First product release discussion Neil Levine
[not found] ` <CANygib-U_MQi1TMmQuT_Q9MVwPfT+PzJwN=+BMcBK69WuRfu3w-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-03-07 13:11 ` Félix Ortega Hortigüela
[not found] ` <E0B1337A572647BA9FCC0CE8CA946F42-4GqslpFJ+cxBDgjK7y7TUQ@public.gmane.org>
2013-03-07 11:54 ` Jimmy Tang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51363B30.7080006@42on.com \
--to=wido@42on.com \
--cc=ceph-devel@vger.kernel.org \
--cc=greg@inktank.com \
--cc=sage@inktank.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.