From: "hzwulibin" <hzwulibin@gmail.com>
To: "Van Leeuwen, Robert" <rovanleeuwen@ebay.com>,
ceph-devel <ceph-devel@vger.kernel.org>,
ceph-users <ceph-users@ceph.com>
Subject: Re: [ceph-users] use object size of 32k rather than 4M
Date: Wed, 23 Dec 2015 21:14:03 +0800 [thread overview]
Message-ID: <567A9E17.1020906@gmail.com> (raw)
In-Reply-To: <23844535-D8F2-438E-9C4B-3BE523D0157A@ebay.com>
Hi, Robert
Thanks for your quick reply. Yeah, the number of file really will be the potential problem. But if just the memory problem, we could use more memory in our OSD
servers.
Also, i tested it on XFS use mdtest, here is the result:
$ sudo ~/wulb/bin/mdtest -I 10000 -z 1 -b 1024 -R -F
--------------------------------------------------------------------------
[[10342,1],0]: A high-performance Open MPI point-to-point messaging module
was unable to find any relevant network interfaces:
Module: OpenFabrics (openib)
Host: 10-180-0-34
Another transport will be used instead, although this may result in
lower performance.
--------------------------------------------------------------------------
-- started at 12/23/2015 18:59:16 --
mdtest-1.8.3 was launched with 1 total task(s) on 1 nodes
Command line used: /home/ceph/wulb/bin/mdtest -I 10000 -z 1 -b 1024 -R -F
Path: /home/ceph
FS: 824.5 GiB Used FS: 4.8% Inodes: 52.4 Mi Used Inodes: 0.6%
random seed: 1450868356
1 tasks, 10250000 files
SUMMARY: (of 1 iterations)
Operation Max Min Mean Std Dev
--------- --- --- ---- -------
File creation : 44660.505 44660.505 44660.505 0.000
File stat : 693747.783 693747.783 693747.783 0.000
File read : 365319.444 365319.444 365319.444 0.000
File removal : 62064.560 62064.560 62064.560 0.000
Tree creation : 69680.729 69680.729 69680.729 0.000
Tree removal : 352.905 352.905 352.905 0.000
From what i tested, the speed of File stat and File read are not slow down much. So, could i say the speed of OP like
lookup a file will not decrease much, just increase the number of the files?
------------------
hzwulibin
2015-12-23
-------------------------------------------------------------
发件人:"Van Leeuwen, Robert" <rovanleeuwen@ebay.com>
发送日期:2015-12-23 20:57
收件人:hzwulibin,ceph-devel,ceph-users
抄送:
主题:Re: [ceph-users] use object size of 32k rather than 4M
>In order to reduce the enlarge impact, we want to change the default size of the object from 4M to 32k.
>
>We know that will increase the number of the objects of one OSD and make remove process become longer.
>
>Hmm, here i want to ask your guys is there any other potential problems will 32k size have? If no obvious problem, will could dive into
>it and do more test on it.
I assume the objects on the OSDs filesystem will become 32k when you do this.
So if you have 1TB of data on one OSD you will have 31 million files == 31 million inodes
This is excluding the directory structure which also might be significant.
If you have 10 OSDs on a server you will easily hit 310 million inodes.
You will need a LOT of memory to make sure the inodes are cached but even then looking up the inode might add significant latency.
My guess is it will be fast in the beginning but it will grind to an hold when the cluster gets fuller due to inodes no longer being in memory.
Also this does not take in any other bottlenecks you might hit in ceph which other users can probably answer better.
Cheers,
Robert van Leeuwen
next prev parent reply other threads:[~2015-12-23 13:14 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-12-23 12:00 use object size of 32k rather than 4M hzwulibin
[not found] ` <567A8CD6.10707-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2015-12-23 12:57 ` Van Leeuwen, Robert
2015-12-23 13:14 ` hzwulibin [this message]
2015-12-23 13:57 ` [ceph-users] " Van Leeuwen, Robert
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=567A9E17.1020906@gmail.com \
--to=hzwulibin@gmail.com \
--cc=ceph-devel@vger.kernel.org \
--cc=ceph-users@ceph.com \
--cc=rovanleeuwen@ebay.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.