From: Wido den Hollander <wido@42on.com>
To: Nulik Nol <nuliknol@gmail.com>
Cc: ceph-devel@vger.kernel.org
Subject: Re: app design recommendations
Date: Wed, 21 Aug 2013 10:49:20 +0200 [thread overview]
Message-ID: <52147F10.3030509@42on.com> (raw)
In-Reply-To: <CAHO6xe-FSmZdFTqDq56U9_QQ_hoPMnPqbFLjT1CLGETz-hCs7A@mail.gmail.com>
On 08/20/2013 10:06 PM, Nulik Nol wrote:
> Hi,
> I am creating an email system which will handle whole company's email,
> mostly internal mail. There will be thousands of companies and
> hundreds of users per company. So I am planning to use one pool per
> company to store email messages. Can Ceph manage thousands or maybe
> hundred thousands of pools ? Could there be any slowdown at production
> with such design after some growth?
>
No, Ceph dan not handle thousands or even hundreds of pools. For each
pool a certain (configurable) amount of placement groups is created.
Each PG has to be tracked through the cluster and will consume CPU en
Memory on your OSDs and even Monitors.
With a large amount of pools you will start consuming a huge amount of
memory and CPU power.
To mitigate this RADOS namespaces have developed:
http://wiki.ceph.com/01Planning/02Blueprints/Dumpling/rados_namespaces
That will allow you to separate data in a pool without having to create
multiple pools. Namespaces are cheap, so use them.
> Every email will be stored as an individual ceph object (emails will
> average 512 bytes and rarely have attachments) , is it ok to store
> them as a ceph objects or will it be less efficient than storing
> multiple emails in a ceph object,? What is the optimal ceph object
> size to store individually, so it would be preferable to do this
> instead of writing through omap with leveldb? (kind of "ceph object vs
> omap" benchmark question)
>
You can store billions of objects in a pool, that is not a problem.
Their size doesn't matter either.
Larger objects might work better in recovery situations since you save
yourself the opening and closing of a vast amount of small files on the
OSD's filesystem. I have never benchmarked it though.
> Also I will be putting mini-chat sessions between users in a ceph
> object, each time a user sends a message to another user, I will
> append the text to the ceph object, so my question is, will Ceph
> rewrite the whole object into a new physical location on disk when I
> do an append? Or will it just rewrite the block that was modified?
>
As long as you use rados_append it will append to the object and the
file on the OSD and it won't rewrite the complete file on the OSD's
filesystem.
> And last questions: Which is faster, storing small key/value pairs in
> omap or in xattrs ? Will storing key/value pairs in xattrs result in
> space waste by allocating a block for zero-sized object on the OSD? (I
> won't write any data to the object, just use xattrs)
>
I'm not sure about the answer on this one, but you are aware of the fact
that you can't search for objects? You can't ask RADOS to give you all
objects where attr X has value Y.
Wido
> Will appreciate very much your comments.
>
> Best Regards
> Nulik
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
--
Wido den Hollander
42on B.V.
Phone: +31 (0)20 700 9902
Skype: contact42on
next prev parent reply other threads:[~2013-08-21 8:49 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-08-20 20:06 app design recommendations Nulik Nol
2013-08-21 8:49 ` Wido den Hollander [this message]
2013-08-22 8:45 ` Nulik Nol
[not found] ` <CA+4uBUa9c=n632p7XdTV_XM=3AVF6WCNoX_XCzpJOVT4YpURjQ@mail.gmail.com>
2013-08-21 15:25 ` Fwd: " Samuel Just
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=52147F10.3030509@42on.com \
--to=wido@42on.com \
--cc=ceph-devel@vger.kernel.org \
--cc=nuliknol@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.