Distributed Replicated Block Device (DRBD) development
 help / color / mirror / Atom feed
From: Lars Ellenberg <Lars.Ellenberg@linbit.com>
To: drbd-dev@lists.linbit.com
Subject: Re: [Drbd-dev] DRBD8: failed to complete sync due to receiving bitmap in unexpected state
Date: Tue, 12 Dec 2006 11:19:37 +0100	[thread overview]
Message-ID: <20061212101937.GD7967@soda.linbit> (raw)
In-Reply-To: <BD7042533C2F8943A6A4257A9E31C45439C877@EXNA.corp.stratus.com>

/ 2006-12-11 17:16:50 -0500
\ Montrose, Ernest:
> Hi all,
> Were are seeing a case where a Sync happened, data is marked consistent
> on both sides, target went to Connected
>   state, source DID NOT CHANGE FROM WFBitMapS state. The clock on the
> two systems seem to be not quite synchronized, but it seems that:
> 
> 1. The two nodes connected, realised they needed to resync and worked
> out that one node had the
>   good data.
> 2. Because other syncing was going on, the sync process was paused
> 3. Later on, sync resumed, good side connection went to WFBitmapS, bad
> side WFBitmapT
> 4. Sync happened, data was marked consistent on both sides, target went
> to Connected
>   state, source DID NOT CHANGE FROM WFBitMapS.
> 
> Now, the only oddity I see is on the target side where we see:
> 
> Dec 10 04:52:52 george kernel: drbd1: unexpected cstate (PausedSyncT) in
> receive_bitmap
> 
> This did NOT stop the resync, but I would suspect it meant that a
> critical message was never sent which left the source side in WFBitmapS.
> 
> Presumably there is a window where one side is out of the paused state
> before the other.
>  
> Simon Grham actually did a bit of analysis of this and think that the
> problem might be a race condition in drbd_receive.c:receive_bitmap().
> Any ideas, because I cannot reproduce this at reliably at this time.

Not yet...
is any state change Secondary->Primary involved,
or are the only (re)connecting?

-- 
: Lars Ellenberg                            Tel +43-1-8178292-55 :
: LINBIT Information Technologies GmbH      Fax +43-1-8178292-82 :
: Vivenotgasse 48, A-1120 Vienna/Europe    http://www.linbit.com :

      reply	other threads:[~2006-12-12 10:19 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-12-11 22:16 [Drbd-dev] DRBD8: failed to complete sync due to receiving bitmap in unexpected state Montrose, Ernest
2006-12-12 10:19 ` Lars Ellenberg [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20061212101937.GD7967@soda.linbit \
    --to=lars.ellenberg@linbit.com \
    --cc=drbd-dev@lists.linbit.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox