All of lore.kernel.org
 help / color / mirror / Atom feed
* Is it safe to increase pg number in a production environment
@ 2015-08-03  7:57 乔建峰
  2015-08-04 16:46 ` [ceph-users] " Samuel Just
  0 siblings, 1 reply; 15+ messages in thread
From: 乔建峰 @ 2015-08-03  7:57 UTC (permalink / raw)
  To: ceph-devel-u79uwXL29TY76Z2rM5mHXA, ceph-users-Qp0mS5GaXlQ,
	cbt-Qp0mS5GaXlQ


[-- Attachment #1.1: Type: text/plain, Size: 1708 bytes --]

Hi Cephers,

This is a greeting from Jevon. Currently, I'm experiencing an issue which
suffers me a lot, so I'm writing to ask for your comments/help/suggestions.
More details are provided bellow.

Issue:
I set up a cluster having 24 OSDs and created one pool with 1024 placement
groups on it for a small startup company. The number 1024 was calculated
per the equation 'OSDs * 100'/pool size. The cluster have been running
quite well for a long time. But recently, our monitoring system always
complains that some disks' usage exceed 85%. I log into the system and find
out that some disks' usage are really very high, but some are not(less than
60%). Each time when the issue happens, I have to manually re-balance the
distribution. This is a short-term solution, I'm not willing to do it all
the time.

Two long-term solutions come in my mind,
1) Ask the customers to expand their clusters by adding more OSDs. But I
think they will ask me to explain the reason of the imbalance data
distribution. We've already done some analysis on the environment, we
learned that the most imbalance part in the CRUSH is the mapping between
object and pg. The biggest pg has 613 objects, while the smallest pg only
has 226 objects.

2) Increase the number of placement groups. It can be of great help for
statistically uniform data distribution, but it can also incur significant
data movement as PGs are effective being split. I just cannot do it in our
customers' environment before we 100% understand the consequence. So anyone
did this under a production environment? How much does this operation
affect the performance of Clients?

Any comments/help/suggestions will be highly appreciated.

-- 
Best Regards
Jevon

[-- Attachment #1.2: Type: text/html, Size: 1999 bytes --]

[-- Attachment #2: Type: text/plain, Size: 178 bytes --]

_______________________________________________
ceph-users mailing list
ceph-users-idqoXFIVOFJgJs9I8MT0rw@public.gmane.org
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

^ permalink raw reply	[flat|nested] 15+ messages in thread

end of thread, other threads:[~2015-08-11 16:02 UTC | newest]

Thread overview: 15+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-08-03  7:57 Is it safe to increase pg number in a production environment 乔建峰
2015-08-04 16:46 ` [ceph-users] " Samuel Just
2015-08-04 16:51   ` Stefan Priebe
2015-08-04 19:16     ` Ketor D
     [not found]       ` <CAM9_UU8Mxycvk91NSrFSMQ5=jDxaXcajzB7CTGDZ2sJJ0YW7-w-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-08-04 19:48         ` Stefan Priebe
2015-08-11 15:31           ` [ceph-users] " Dan van der Ster
2015-08-11 16:02             ` Jan Schermer
2015-08-05  1:50     ` Jevon Qiao
2015-08-04 16:52   ` Marek Dohojda
2015-08-04 17:23     ` Jan Schermer
2015-08-05  3:45       ` Jevon Qiao
2015-08-05 11:36         ` Jan Schermer
2015-08-07  1:39           ` Jevon Qiao
2015-08-05  1:43     ` Jevon Qiao
     [not found]       ` <55C16A52.4040403-OsJI6HhKm/eMe3Hu20U6GA@public.gmane.org>
2015-08-05 16:04         ` Marek Dohojda

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.