All of lore.kernel.org
 help / color / mirror / Atom feed
From: Mark Nelson <mark.nelson@inktank.com>
To: Gandalf Corvotempesta <gandalf.corvotempesta@gmail.com>
Cc: Stefan Kleijkers <stefan@kleijkers.nl>,
	Tren Blackburn <tren@eotnetworks.com>,
	ceph-devel@vger.kernel.org
Subject: Re: Ceph journal
Date: Thu, 01 Nov 2012 16:27:26 -0500	[thread overview]
Message-ID: <5092E93E.7060302@inktank.com> (raw)
In-Reply-To: <CAJH6TXhwOxtT8XLWYMYiDQ=uFUtkv6CziR7vAFqSovZ_n1JORg@mail.gmail.com>

On 11/01/2012 04:18 PM, Gandalf Corvotempesta wrote:
> 2012/10/31 Stefan Kleijkers <stefan@kleijkers.nl>:
>> As far as I know, this is correct. You get a ACK (on the write) back after
>> it landed on ALL three journals (or/and osds in case of BTRFS in parallel
>> mode). So If you lose one node, you still have it in two more nodes and they
>> will commit it to disk. After recovering the missing node/osd it will get
>> the data from one of the other nodes. So you won't lose any data.
>
> In this case I can suppose that ceph writing speed is relative to the
> journal's writing speed and never to ODS disks.
>

Eventually you will need to write all of that data out to disk and 
writes to the journal will have to stop to allow the underlying disk to 
catch up.  In cases like that you will often see performance going along 
seemingly speedily and then all of a sudden see long pauses and possibly 
chaotic performance characteristics.

> Let's assume a journal size of 150GB, capable to write at 200MB/s in a
> 2gbit/s network (lacp between two gigabit ports), no replica between
> OSDs and very very slow SATA disk (5400 RPM, for example, much slower
> than jurnal). Just a single osd.
> Ceph will write at 200MB/s, and in background it will flush journals
> to disk, right?

It will do that for a while, based on how you've tweaked the flush 
intervals and various journal settings to determine how much data ceph 
will allow to hang out in the journal while still accepting new requests.

>
> I can assume that journal is a buffer and RBD will write only to it.
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>


  reply	other threads:[~2012-11-01 21:27 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-10-31 21:18 Ceph journal Gandalf Corvotempesta
2012-10-31 21:24 ` Tren Blackburn
2012-10-31 21:32   ` Stefan Kleijkers
2012-10-31 21:54   ` Sage Weil
2012-10-31 21:58   ` Gandalf Corvotempesta
2012-10-31 22:04     ` Stefan Kleijkers
2012-10-31 22:07       ` Gandalf Corvotempesta
2012-10-31 22:55         ` Sébastien Han
2012-11-01 21:18       ` Gandalf Corvotempesta
2012-11-01 21:27         ` Mark Nelson [this message]
2012-11-01 21:33           ` Gandalf Corvotempesta
2012-11-03 17:29             ` Gregory Farnum
2012-11-04 11:48               ` Gandalf Corvotempesta
2012-11-05 13:06               ` Jean-Daniel BUSSY

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5092E93E.7060302@inktank.com \
    --to=mark.nelson@inktank.com \
    --cc=ceph-devel@vger.kernel.org \
    --cc=gandalf.corvotempesta@gmail.com \
    --cc=stefan@kleijkers.nl \
    --cc=tren@eotnetworks.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.