All of lore.kernel.org
 help / color / mirror / Atom feed
From: Stefan Priebe <s.priebe@profihost.ag>
To: Gregory Farnum <greg@inktank.com>
Cc: ceph-devel@vger.kernel.org
Subject: Re: Designing a cluster guide
Date: Sat, 19 May 2012 10:37:01 +0200	[thread overview]
Message-ID: <4FB75BAD.3080709@profihost.ag> (raw)
In-Reply-To: <CAPYLRzgTWLcbZHHpVs0j=WZokXfh8CGLcTUF6nkV1CCYiSs1DQ@mail.gmail.com>

Hi Greg,

Am 17.05.2012 23:27, schrieb Gregory Farnum:
>> It mentions for example "Fast CPU" for the mds system. What does fast
>> mean? Just the speed of one core? Or is ceph designed to use multi core?
>> Is multi core or more speed important?
> Right now, it's primarily the speed of a single core. The MDS is
> highly threaded but doing most things requires grabbing a big lock.
> How fast is a qualitative rather than quantitative assessment at this
> point, though.
So would you recommand a fast (more ghz) Core i3 instead of a single 
xeon for this system? (price per ghz is better).

> It depends on what your nodes look like, and what sort of cluster
> you're running. The monitors are pretty lightweight, but they will add
> *some* load. More important is their disk access patterns — they have
> to do a lot of syncs. So if they're sharing a machine with some other
> daemon you want them to have an independent disk and to be running a
> new kernel&glibc so that they can use syncfs rather than sync. (The
> only distribution I know for sure does this is Ubuntu 12.04.)
Which kernel and which glibc version supports this? I have searched 
google but haven't found an exact version. We're using debian lenny 
squeeze with a custom kernel.

>> Regarding the OSDs is it fine to use an SSD Raid 1 for the journal and
>> perhaps 22x SATA Disks in a Raid 10 for the FS or is this quite absurd
>> and you should go for 22x SSD Disks in a Raid 6?
> You'll need to do your own failure calculations on this one, I'm
> afraid. Just take note that you'll presumably be limited to the speed
> of your journaling device here.
Yeah that's why i wanted to use a Raid 1 of SSDs for the journaling. Or 
is this still too slow? Another idea was to use only a ramdisk for the 
journal and backup the files while shutting down to disk and restore 
them after boot.

> Given that Ceph is going to be doing its own replication, though, I
> wouldn't want to add in another whole layer of replication with raid10
> — do you really want to multiply your storage requirements by another
> factor of two?
OK correct bad idea.

>> Is it more useful the use a Raid 6 HW Controller or the btrfs raid?
> I would use the hardware controller over btrfs raid for now; it allows
> more flexibility in eg switching to xfs. :)
OK but overall you would recommand running one osd per disk right? So 
instead of using a Raid 6 with for example 10 disks you would run 6 osds 
on this machine?

>> Use single socket Xeon for the OSDs or Dual Socket?
> Dual socket servers will be overkill given the setup you're
> describing. Our WAG rule of thumb is 1GHz of modern CPU per OSD
> daemon. You might consider it if you decided you wanted to do an OSD
> per disk instead (that's a more common configuration, but it requires
> more CPU and RAM per disk and we don't know yet which is the better
> choice).
Is there also a rule of thumb for the memory?

My biggest problem with ceph right now is the awful slow speed while 
doing random reads and writes.

Sequential read and writes are at 200Mb/s (that's pretty good for bonded 
dual Gbit/s). But random reads and write are only at 0,8 - 1,5 Mb/s 
which is def. too slow.

Stefan
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

  reply	other threads:[~2012-05-19  8:37 UTC|newest]

Thread overview: 53+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-05-10 12:09 slow performance even when using SSDs Stefan Priebe - Profihost AG
2012-05-10 13:09 ` Stefan Priebe - Profihost AG
2012-05-10 18:24   ` Calvin Morrow
2012-05-10 13:23 ` Designing a cluster guide Stefan Priebe - Profihost AG
2012-05-17 21:27   ` Gregory Farnum
2012-05-19  8:37     ` Stefan Priebe [this message]
2012-05-19 16:15       ` Alexandre DERUMIER
2012-05-20  7:56         ` Stefan Priebe
2012-05-20  8:13           ` Alexandre DERUMIER
2012-05-20  8:19           ` Christian Brunner
2012-05-20  8:27             ` Stefan Priebe
2012-05-20  8:31               ` Christian Brunner
2012-05-21  8:22                 ` Stefan Priebe - Profihost AG
2012-05-21 15:03                   ` Christian Brunner
2012-05-20  8:56             ` Tim O'Donovan
2012-05-20  9:24               ` Stefan Priebe
2012-05-20  9:46                 ` Tim O'Donovan
2012-05-20  9:49                   ` Stefan Priebe
2012-05-21 14:59               ` Christian Brunner
2012-05-21 15:05                 ` Stefan Priebe - Profihost AG
2012-05-21 15:12                   ` Tomasz Paszkowski
     [not found]                     ` <CANT588uxL7jrf1BfowUeer_AnDTfGjzkWVFhS4aNMaMSst_jyA@mail.gmail.com>
2012-05-21 15:36                       ` Tomasz Paszkowski
2012-05-21 18:15                         ` Damien Churchill
2012-05-21 20:11                     ` Stefan Priebe
2012-05-21 20:13                       ` Tomasz Paszkowski
2012-05-21 20:14                         ` Stefan Priebe
2012-05-21 20:19                           ` Tomasz Paszkowski
2012-05-21 15:07         ` Tomasz Paszkowski
2012-05-21 21:22           ` Sławomir Skowron
2012-05-21 23:52             ` Quenten Grasso
2012-05-22  0:30               ` Gregory Farnum
2012-05-22  0:42                 ` Quenten Grasso
2012-05-22  0:46                   ` Quenten Grasso
2012-05-22  5:51                     ` Sławomir Skowron
2012-05-29  7:25                       ` Quenten Grasso
2012-05-29 16:50                         ` Tommi Virtanen
2012-05-22  9:04                 ` Jerker Nyberg
2012-05-23  5:31                   ` Gregory Farnum
2012-05-23 19:47                     ` Jerker Nyberg
2012-05-23 21:47                       ` Gregory Farnum
2012-05-24  8:33                         ` Jerker Nyberg
2012-05-22  6:30             ` Stefan Priebe - Profihost AG
2012-05-22  6:59               ` Sławomir Skowron
2012-05-21 18:13       ` Gregory Farnum
2012-05-22  6:20         ` Stefan Priebe - Profihost AG
2012-06-29 18:07     ` Gregory Farnum
2012-06-29 18:42       ` Brian Edmonds
2012-06-29 18:50         ` Gregory Farnum
2012-06-29 20:59           ` Brian Edmonds
2012-06-29 21:11             ` Gregory Farnum
2012-06-29 21:18               ` Brian Edmonds
2012-06-29 21:30                 ` Gregory Farnum
2012-06-29 21:33                 ` Sage Weil

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4FB75BAD.3080709@profihost.ag \
    --to=s.priebe@profihost.ag \
    --cc=ceph-devel@vger.kernel.org \
    --cc=greg@inktank.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.