All of lore.kernel.org
 help / color / mirror / Atom feed
* use object size of 32k rather than 4M
@ 2015-12-23 12:00 hzwulibin
       [not found] ` <567A8CD6.10707-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  0 siblings, 1 reply; 4+ messages in thread
From: hzwulibin @ 2015-12-23 12:00 UTC (permalink / raw)
  To: ceph-devel, ceph-users; +Cc: Sage Weil, Haomai Wang

Hi, cephers, Sage and Haomai

Recently we stuck of the performance down problem when recoverying. The scene is simple:
1. run fio with rand write(bs=4k)
2. stop one osd; sleep 10; start the osd
3. the IOPS drop from 6K to about 200

We now know the SSD which that osd on is the bottleneck when recovery. After read the code, we find the IO of that 
SSD come from two ways:
1. normal recovery IO
2. user IO but in the missing list, need to recovery the 4M object first.

So our first step is limit the recovery IO to slow down the stress of that SSD. That helps in some scene, but not this one.


We have 36 OSD with 3 replicas, so when one osd down, about 1/12 objects will be in degraded state.
When we run fio with 4k randwrite, about 1/12 io will stuck and need to recovery the 4M object first.
That really enlarge the stress the that SSD.

In order to reduce the enlarge impact, we want to change the default size of the object from 4M to 32k.

We know that will increase the number of the objects of one OSD and make remove process become longer.

Hmm, here i want to ask your guys is there any other potential problems will 32k size have? If no obvious problem, will could dive into
it and do more test on it.

Many thanks!
 				
--------------
hzwulibin
2015-12-23

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2015-12-23 13:57 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-12-23 12:00 use object size of 32k rather than 4M hzwulibin
     [not found] ` <567A8CD6.10707-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2015-12-23 12:57   ` Van Leeuwen, Robert
2015-12-23 13:14     ` [ceph-users] " hzwulibin
2015-12-23 13:57       ` Van Leeuwen, Robert

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.