All of lore.kernel.org
 help / color / mirror / Atom feed
From: Josh Durgin <josh.durgin@inktank.com>
To: Tren Blackburn <tren@eotnetworks.com>
Cc: Sage Weil <sage@inktank.com>, ceph-devel@vger.kernel.org
Subject: Re: Why does mkcephfs take approximately 30 seconds per osd on ceph 0.51?
Date: Tue, 18 Sep 2012 16:32:51 -0700	[thread overview]
Message-ID: <505904A3.70804@inktank.com> (raw)
In-Reply-To: <CANLRBowJs_qx7UcAUJLd41nwF95XSc-Z5ibTt-pedT161qxebg@mail.gmail.com>

On 09/18/2012 02:23 PM, Tren Blackburn wrote:
> On Tue, Sep 18, 2012 at 2:11 PM, Tren Blackburn <tren@eotnetworks.com> wrote:
>> On Tue, Sep 18, 2012 at 1:58 PM, Sage Weil <sage@inktank.com> wrote:
>>> On Tue, 18 Sep 2012, Tren Blackburn wrote:
>>>> Hi List;
>>>>
>>>> I've been working with ceph 0.51 lately, and have noticed this for a
>>>> while now, but it hasn't been a big enough issue for me to report.
>>>> However today I'm turning up a 192 OSD cluster, and 30 seconds per OSD
>>>> adds up pretty quick. For some reason it's taking 30 seconds between
>>>> checking the OSD for a pre-existing store:
>>>>
>>>> 2012-09-18 13:53:28.400590 7fe895d25780 -1 filestore(/var/ceph/disk11)
>>>> could not find 23c2fcde/osd_superblock/0//-1 in index: (2) No such
>>>> file or directory
>>>>
>>>> And then actually creating the new store:
>>>>
>>>> 2012-09-18 13:53:57.371396 7fe895d25780 -1 created object store
>>>> /var/ceph/disk11 journal /dev/mapper/vg-journal.disk11 for osd.34 fsid
>>>> bca82801-04d7-402e-917f-8023a4b161a8
>>>> 2012-09-18 13:53:57.371449 7fe895d25780 -1 auth: error reading file:
>>>> /var/ceph/disk11/keyring: can't open /var/ceph/disk11/keyring: (2) No
>>>> such file or directory
>>>> 2012-09-18 13:53:57.371527 7fe895d25780 -1 created new key in keyring
>>>> /var/ceph/disk11/keyring
>>>>
>>>> I can provide many examples as I'm watching it slowly plod through
>>>> currently. Also, the horse power of the server makes no difference.
>>>> The servers in question here are dual E5-2600's with 96GB ram and
>>>> 12x2TB drives.
>>>>
>>>> What information can I provide to help debug this? Or is this an
>>>> already known issue?
>>>
>>> If you add -x to the #!/bin/bash line in mkcephfs we can see exactly which
>>> command(s) are going slow...
>>
>> It looks like it's the ceph-osd command taking the time:
>>
>> + /usr/bin/ceph-osd -c /tmp/mkfs.ceph.9006/conf --monmap
>> /tmp/mkfs.ceph.9006/monmap -i 000 --mkfs --mkkey
>>   HDIO_DRIVE_CMD(identify) failed: Invalid argument
>> 2012-09-18 14:08:11.265921 7fb41bd04780 -1 journal check: ondisk fsid
>> 00000000-0000-0000-0000-000000000000 doesn't match expected
>> aa1f859e-4d3c-4b02-87c5-0190bb51dd48, invalid (someone else's?)
>> journal
>>   HDIO_DRIVE_CMD(identify) failed: Invalid argument
>>   HDIO_DRIVE_CMD(identify) failed: Invalid argument
>>   HDIO_DRIVE_CMD(identify) failed: Invalid argument
>> 2012-09-18 14:08:11.440522 7fb41bd04780 -1 filestore(/var/ceph/disk01)
>> could not find 23c2fcde/osd_superblock/0//-1 in index: (2) No such
>> file or directory
>> 2012-09-18 14:08:40.424981 7fb41bd04780 -1 created object store
>> /var/ceph/disk01 journal /dev/mapper/vg-journal.disk01 for osd.0 fsid
>> ad1bcd15-081a-4d6b-abf1-2fcf79764c9b
>> 2012-09-18 14:08:40.425032 7fb41bd04780 -1 auth: error reading file:
>> /var/ceph/disk01/keyring: can't open /var/ceph/disk01/keyring: (2) No
>> such file or directory
>> 2012-09-18 14:08:40.425109 7fb41bd04780 -1 created new key in keyring
>> /var/ceph/disk01/keyring
>>
> I managed to attach a strace to the ceph-osd process. Please find it at:
>
> http://pastebin.com/Tdm2t3jT
>
> Hopefully it's useful. Please let me know what else I can do to help
> with this, as having to wait 1.5 hours for mkcephfs to complete is a
> little excessive... ;)
>
> t.

Could you do an strace -tt? That'll tell us where the slowness is.

Josh


  reply	other threads:[~2012-09-18 23:32 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-09-18 20:57 Why does mkcephfs take approximately 30 seconds per osd on ceph 0.51? Tren Blackburn
2012-09-18 20:58 ` Sage Weil
2012-09-18 21:11   ` Tren Blackburn
2012-09-18 21:23     ` Tren Blackburn
2012-09-18 23:32       ` Josh Durgin [this message]
2012-09-18 23:47         ` Tren Blackburn
2012-09-19 22:38           ` Josh Durgin
2012-09-20 20:26             ` Tren Blackburn
2012-09-21 17:59             ` Tren Blackburn

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=505904A3.70804@inktank.com \
    --to=josh.durgin@inktank.com \
    --cc=ceph-devel@vger.kernel.org \
    --cc=sage@inktank.com \
    --cc=tren@eotnetworks.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.