From: Wido den Hollander <wido@widodh.nl>
To: Sage Weil <sage@newdream.net>
Cc: "Székelyi Szabolcs" <szekelyi@niif.hu>, ceph-devel@vger.kernel.org
Subject: Re: [WRN] map e### wrongly marked me down or wrong addr
Date: Mon, 27 Feb 2012 20:20:04 +0100
Message-ID: <4F4BD764.6090302@widodh.nl>
In-Reply-To: <Pine.LNX.4.64.1202270901350.12283@cobra.newdream.net>



On 02/27/2012 06:03 PM, Sage Weil wrote:
> On Mon, 27 Feb 2012, Székelyi Szabolcs wrote:
>> Hello,
>>
>> whenever I restart osd.0 I see a pair of messages like
>>
>> 2012-02-27 17:26:00.132666 mon.0<osd_1_ip>:6789/0 106 : [INF] osd.0
>> <osd_0_ip>:6801/29931 failed (by osd.1<osd_1_ip>:6806/20125)
>> 2012-02-27 17:26:21.074926 osd.0<osd_0_ip>:6801/29931 1 : [WRN] map e370
>> wrongly marked me down or wrong addr
>>
>> a couple of times. The situation stabilizes in a normal state after about two
>> minutes.
>>
>> Should I worry about this? Maybe the first message is about the just killed
>> OSD, and the second comes from the new incarnation, and this is completely
>> normal? This is Ceph 0.41.
>
> It's not normal.  Wido was seeing something similar, I think.  I suspect
> the problem is that during startup ceph-osd is just busy, but the heartbeat
> code is such that it's not supposed to miss them.

I haven't seen the "wrongly marked me down" messages myself. What I'm
seeing is 'pairs' of OSDs marking each other down.

Still trying to figure that one out.
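
To check whether a cluster log shows the same pattern, something like the
following could pick out reciprocal failure reports. It is only a rough
sketch: the regular expression assumes each report has been re-joined onto
one line in the "[INF] osd.N <addr> failed (by osd.M <addr>)" form quoted
above, and the name mutual_failure_pairs is made up for illustration; it is
not part of Ceph.

```python
import re
from typing import Iterable, Set, Tuple

# Assumed line format, modelled on the "[INF] osd.0 ... failed (by osd.1 ...)"
# report quoted earlier in this thread (wrapped lines joined back together).
FAILED_RE = re.compile(r"\[INF\] osd\.(\d+)\b.*? failed \(by osd\.(\d+)")


def mutual_failure_pairs(lines: Iterable[str]) -> Set[Tuple[int, int]]:
    """Return OSD id pairs (a, b), a < b, where each reported the other failed."""
    reports = set()  # (reporter, target) tuples
    for line in lines:
        match = FAILED_RE.search(line)
        if match:
            target, reporter = int(match.group(1)), int(match.group(2))
            reports.add((reporter, target))
    # Keep only pairs where the report exists in both directions.
    return {(a, b) for (a, b) in reports if a < b and (b, a) in reports}
```

Feeding it the lines of the monitor's cluster log would return a set such as
{(0, 1)} if osd.0 and osd.1 each reported the other as failed, and an empty
set if the reports only ever went one way.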

>
> Can you reproduce this with 'debug ms = 1'?
>
> sage