All of lore.kernel.org
 help / color / mirror / Atom feed
From: Mark Nelson <mnelson@redhat.com>
To: Sage Weil <sweil@redhat.com>, "Chen, Xiaoxi" <xiaoxi.chen@intel.com>
Cc: ceph-devel@vger.kernel.org
Subject: Re: Newstore get_omap_iterator
Date: Wed, 15 Apr 2015 21:59:54 -0500	[thread overview]
Message-ID: <552F25AA.7040907@redhat.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1504130819170.4469@cobra.newdream.net>

[-- Attachment #1: Type: text/plain, Size: 1371 bytes --]



On 04/13/2015 10:27 AM, Sage Weil wrote:
> [adding ceph-devel]
>
> On Mon, 13 Apr 2015, Chen, Xiaoxi wrote:
>> Hi,
>>
>>        Actually I have done the tuning survey on RocksDB when I was
>> updating the RocksDB to newer version and exposed the tuning in
>> ceph.conf.
>>
>>        What we need to ensure is the WAL never hit the disk. The rocksdb
>
> We'll always have to pay that 1x write to the log; we just want to make
> sure it doesn't turn into 2x.  I take it you're assuming the log is on an
> SSD (not disk)?
>
>> write ahead log is already introduce 1X write, if the data flushed to
>> SST in level 0, that will be 2X, not to mention any further compaction.
>>
>>        The tuning that makes the differences are :
>> 	write_buffer_size
>> 	max_write_buffer_number
>> 	min_write_buffer_number_to_merge
>>
>>        Say if we have
>> 	write_buffer_size =512M
>> 	max_write_buffer_number = 6
>> 	min_write_buffer_number_to_merge =2

Attached are tests for a single PCIE ssd with filestore, newstore + 
fsync + default tunables, newstore+fsync + Xiaoxi's tunables, and also a 
test using xiaoxi's tunables with fdatasync.

Basically Xioaxi's tunables help, and fdatasync helps a little more 
(mostly at small IO sizes), but still not enough to get us to beat 
filestore, though newstore *does* do consistently better than filestore 
with 4MB writes now.

Mark


[-- Attachment #2: newstore_xiaoxi_fdatasync.pdf --]
[-- Type: application/pdf, Size: 42928 bytes --]

      reply	other threads:[~2015-04-16  2:59 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <6F3FA899187F0043BA1827A69DA2F7CC021C90B0@shsmsx102.ccr.corp.intel.com>
     [not found] ` <alpine.DEB.2.00.1504081012130.4469@cobra.newdream.net>
     [not found]   ` <6F3FA899187F0043BA1827A69DA2F7CC021CADF4@shsmsx102.ccr.corp.intel.com>
     [not found]     ` <alpine.DEB.2.00.1504121312490.4469@cobra.newdream.net>
     [not found]       ` <6F3FA899187F0043BA1827A69DA2F7CC021CB9CE@shsmsx102.ccr.corp.intel.com>
2015-04-13 15:27         ` Newstore get_omap_iterator Sage Weil
2015-04-16  2:59           ` Mark Nelson [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=552F25AA.7040907@redhat.com \
    --to=mnelson@redhat.com \
    --cc=ceph-devel@vger.kernel.org \
    --cc=sweil@redhat.com \
    --cc=xiaoxi.chen@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.