All of lore.kernel.org
 help / color / mirror / Atom feed
From: Jevon Qiao <scaleqiao@gmail.com>
To: Sage Weil <sage@newdream.net>, Jaze Lee <jazeltq@gmail.com>
Cc: ceph-devel@vger.kernel.org
Subject: Re: Client still connect failed leader after that mon down
Date: Fri, 18 Dec 2015 11:41:13 +0800	[thread overview]
Message-ID: <56738059.3070505@gmail.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1512170525540.10312@cobra.newdream.net>

On 17/12/15 21:27, Sage Weil wrote:
> On Thu, 17 Dec 2015, Jaze Lee wrote:
>> Hello cephers:
>>      In our test, there are three monitors. We find client run ceph
>> command will slow when the leader mon is down. Even after long time, a
>> client run ceph command will also slow in first time.
>> >From strace, we find that the client first to connect the leader, then
>> after 3s, it connect the second.
>> After some search we find that the quorum is not change, the leader is
>> still the down monitor.
>> Is that normal?  Or is there something i miss?
> It's normal.  Even when the quorum does change, the client doesn't
> know that.  It should be contacting a random mon on startup, though, so I
> would expect the 3s delay 1/3 of the time.
That's because client randomly picks up a mon from Monmap. But what we 
observed is that when a mon is down no change is made to monmap(neither 
the epoch nor the members). Is it the culprit for this phenomenon?

Thanks,
Jevon
> A long-standing low-priority feature request is to have the client contact
> 2 mons in parallel so that it can still connect quickly if one is down.
> It's requires some non-trivial work in mon/MonClient.{cc,h} though and I
> don't think anyone has looked at it seriously.
>
> sage
>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


  reply	other threads:[~2015-12-18  3:41 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-12-17  9:56 Client still connect failed leader after that mon down Jaze Lee
2015-12-17 13:27 ` Sage Weil
2015-12-18  3:41   ` Jevon Qiao [this message]
2015-12-21  2:55     ` Zhi Zhang
     [not found]       ` <CAKZcxge7gaa7A5M=dg1Ri3WQ9UY7CP0RMenZrHUhGOjQVx7H1Q@mail.gmail.com>
     [not found]         ` <CAKZcxgf+Ghr9TVFAJhs3nQfY==uy6XrtJ-FuT6LCjBK3cGkc5w@mail.gmail.com>
2015-12-21  9:49           ` Fwd: " Zhi Zhang
2015-12-21 13:53             ` Sage Weil

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=56738059.3070505@gmail.com \
    --to=scaleqiao@gmail.com \
    --cc=ceph-devel@vger.kernel.org \
    --cc=jazeltq@gmail.com \
    --cc=sage@newdream.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.