From: Jevon Qiao <qiaojianfeng@unitedstack.com>
To: Marek Dohojda <mdohojda@altitudedigital.com>,
Samuel Just <sjust@redhat.com>
Cc: 乔建峰 <scaleqiao@gmail.com>,
"ceph-devel@vger.kernel.org" <ceph-devel@vger.kernel.org>,
ceph-users <ceph-users@ceph.com>,
cbt@ceph.com
Subject: Re: [ceph-users] Is it safe to increase pg number in a production environment
Date: Wed, 5 Aug 2015 09:43:46 +0800 [thread overview]
Message-ID: <55C16A52.4040403@unitedstack.com> (raw)
In-Reply-To: <B655F3AF-5BCE-4285-BD86-79E4DC666261@altitudedigital.com>
Thank you and Samuel for the prompt response.
On 5/8/15 00:52, Marek Dohojda wrote:
> I have done this not that long ago. My original PG estimates were wrong and I had to increase them.
>
> After increasing the PG numbers the Ceph rebalanced, and that took a while. To be honest in my case the slowdown wasn’t really visible, but it took a while.
How many OSDs do you have in your cluster? How much did you adjust the
PG numbers?
> My strong suggestion to you would be to do it in a long IO time, and be prepared that this willl take quite a long time to accomplish. Do it slowly and do not increase multiple pools at once.
Both you and Samuel said to do it slowly, do you mean to adjust the pg
numbers step by step rather than doing it in one step? Also, would you
please explain 'a long IO time' in details.
Thanks,
Jevon
> It isn’t recommended practice but doable.
>
>
>> On Aug 4, 2015, at 10:46 AM, Samuel Just <sjust@redhat.com> wrote:
>>
>> It will cause a large amount of data movement. Each new pg after the
>> split will relocate. It might be ok if you do it slowly. Experiment
>> on a test cluster.
>> -Sam
>>
>> On Mon, Aug 3, 2015 at 12:57 AM, 乔建峰 <scaleqiao@gmail.com> wrote:
>>> Hi Cephers,
>>>
>>> This is a greeting from Jevon. Currently, I'm experiencing an issue which
>>> suffers me a lot, so I'm writing to ask for your comments/help/suggestions.
>>> More details are provided bellow.
>>>
>>> Issue:
>>> I set up a cluster having 24 OSDs and created one pool with 1024 placement
>>> groups on it for a small startup company. The number 1024 was calculated per
>>> the equation 'OSDs * 100'/pool size. The cluster have been running quite
>>> well for a long time. But recently, our monitoring system always complains
>>> that some disks' usage exceed 85%. I log into the system and find out that
>>> some disks' usage are really very high, but some are not(less than 60%).
>>> Each time when the issue happens, I have to manually re-balance the
>>> distribution. This is a short-term solution, I'm not willing to do it all
>>> the time.
>>>
>>> Two long-term solutions come in my mind,
>>> 1) Ask the customers to expand their clusters by adding more OSDs. But I
>>> think they will ask me to explain the reason of the imbalance data
>>> distribution. We've already done some analysis on the environment, we
>>> learned that the most imbalance part in the CRUSH is the mapping between
>>> object and pg. The biggest pg has 613 objects, while the smallest pg only
>>> has 226 objects.
>>>
>>> 2) Increase the number of placement groups. It can be of great help for
>>> statistically uniform data distribution, but it can also incur significant
>>> data movement as PGs are effective being split. I just cannot do it in our
>>> customers' environment before we 100% understand the consequence. So anyone
>>> did this under a production environment? How much does this operation affect
>>> the performance of Clients?
>>>
>>> Any comments/help/suggestions will be highly appreciated.
>>>
>>> --
>>> Best Regards
>>> Jevon
>>>
>>> _______________________________________________
>>> ceph-users mailing list
>>> ceph-users@lists.ceph.com
>>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>>
>> _______________________________________________
>> ceph-users mailing list
>> ceph-users@lists.ceph.com
>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.htmlml
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2015-08-05 1:44 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-08-03 7:57 Is it safe to increase pg number in a production environment 乔建峰
2015-08-04 16:46 ` [ceph-users] " Samuel Just
2015-08-04 16:51 ` Stefan Priebe
2015-08-04 19:16 ` Ketor D
[not found] ` <CAM9_UU8Mxycvk91NSrFSMQ5=jDxaXcajzB7CTGDZ2sJJ0YW7-w-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-08-04 19:48 ` Stefan Priebe
2015-08-11 15:31 ` [ceph-users] " Dan van der Ster
2015-08-11 16:02 ` Jan Schermer
2015-08-05 1:50 ` Jevon Qiao
2015-08-04 16:52 ` Marek Dohojda
2015-08-04 17:23 ` Jan Schermer
2015-08-05 3:45 ` Jevon Qiao
2015-08-05 11:36 ` Jan Schermer
2015-08-07 1:39 ` Jevon Qiao
2015-08-05 1:43 ` Jevon Qiao [this message]
[not found] ` <55C16A52.4040403-OsJI6HhKm/eMe3Hu20U6GA@public.gmane.org>
2015-08-05 16:04 ` Marek Dohojda
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=55C16A52.4040403@unitedstack.com \
--to=qiaojianfeng@unitedstack.com \
--cc=cbt@ceph.com \
--cc=ceph-devel@vger.kernel.org \
--cc=ceph-users@ceph.com \
--cc=mdohojda@altitudedigital.com \
--cc=scaleqiao@gmail.com \
--cc=sjust@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.