From: Lars Ellenberg <lars.ellenberg@linbit.com>
To: drbd-dev@lists.linbit.com
Subject: Re: [Drbd-dev] [bug] drbd 9: Receiver error
Date: Thu, 26 Feb 2015 21:12:01 +0100 [thread overview]
Message-ID: <20150226201201.GL19088@soda.linbit> (raw)
In-Reply-To: <54EB60E7.10304@suse.de>
On Mon, Feb 23, 2015 at 11:18:31AM -0600, Goldwyn Rodrigues wrote:
> >>[ 176.429715] block drbd0: tumbleweed1: bitmap overflow (e:8388342) while decoding bm RLE packet
> >>[ 176.429739] drbd r0 tumbleweed1: error receiving P_COMPRESSED_BITMAP, e: -5 l: 7!
> >
> >
> >If you can reproduce this (with RLE enabled),
> >can you please down drbd on both nodes,
> >then "dump-md"?
> >
> >I'm interested in how exactly your bitmaps look like,
> >so I could "unit test" the bitmap compression/decompression for it.
> >
>
> The problem occurs when the devices are of unequal sizes.
Oh.
Well, they should refuse to talk to each other in the first place.
Or have agreed to the minimum of all involved sizes before even trying
to exchange bitmap information.
However, you should not connect different size DRBD, anyways.
If that does not work yet, well, then don't do it ;-)
> I am
> unable to get the dump-md _after_ the error, because of the
> following:
>
> tumbleweed3:~ # drbdadm dump-md r0
> Found meta data is "unclean", please apply-al first
> Command 'drbdmeta 0 v09 /dev/sda internal dump-md' terminated with
> exit code 255
you can add "--force".
or, well, down, then "apply-al", as in "drbdadm apply-al",
resp. drbdmeta 0 v09 /dev/sda internal apply-al
(where "al" is "activity log").
> I am able to recreate the problem everytime. Here is the dump before
> starting the service:
>
> tumbleweed1:~ # drbdadm dump-md r0
> # DRBD meta data dump
> # 2015-02-23 10:31:50 -0600 [1424709110]
> # tumbleweed1> drbdmeta 0 v09 /dev/sda internal dump-md
> #
>
> version "v09";
>
> max-peers 1;
> # md_size_sect 2120
> # md_offset 34359734272
> # al_offset 34359701504
> # bm_offset 34358652928
>
> node-id -1;
> current-uuid 0x0000000000000004;
> flags 0x00000080;
> peer[0] {
> bitmap-index -1;
> bitmap-uuid 0x0000000000000000;
> bitmap-dagtag 0x0000000000000000;
> flags 0x00000000;
> }
> peer[1] {
> bitmap-index -1;
> bitmap-uuid 0x0000000000000000;
> bitmap-dagtag 0x0000000000000000;
> flags 0x00000000;
> }
> history-uuids {
> 0x0000000000000000; 0x0000000000000000; 0x0000000000000000;
> # al-extents 257;
> la-size-sect 0;
> bm-byte-per-bit 4096;
> device-uuid 0xA4C9657C56B6D358;
> la-peer-max-bio-size 0;
> al-stripes 1;
> al-stripe-size-4k 8;
> # bm-bytes 0;
> bitmap[0] {
> }
> # bits-set 0;
> tumbleweed3:~ # drbdadm dump-md r0
> # DRBD meta data dump
> # 2015-02-23 10:32:12 -0600 [1424709132]
> # tumbleweed3> drbdmeta 0 v09 /dev/sda internal dump-md
> #
>
> version "v09";
>
> max-peers 1;
> # md_size_sect 1992
> # md_offset 32212250624
> # al_offset 32212217856
> # bm_offset 32211234816
>
> node-id -1;
> current-uuid 0x0000000000000004;
> flags 0x00000080;
> peer[0] {
> bitmap-index -1;
> bitmap-uuid 0x0000000000000000;
> bitmap-dagtag 0x0000000000000000;
> flags 0x00000000;
> }
> peer[1] {
> bitmap-index -1;
> bitmap-uuid 0x0000000000000000;
> bitmap-dagtag 0x0000000000000000;
> flags 0x00000000;
> }
> history-uuids {
> 0x0000000000000000; 0x0000000000000000; 0x0000000000000000;
> 0x0000000000000000;
> # al-extents 257;
> la-size-sect 0;
> bm-byte-per-bit 4096;
> device-uuid 0xC789EF231ECD0071;
> la-peer-max-bio-size 0;
> al-stripes 1;
> al-stripe-size-4k 8;
> # bm-bytes 0;
> bitmap[0] {
> }
> # bits-set 0;
> This happens when I try to assign tumbleweed1 (bigger device) the
> primary using the command:
>
> # drbdadm -- --overwrite-data-of-peer primary r0
should be enough info for us to reproduce.
For now: just don't do that.
Use devices of the same size everywhere.
Thanks,
Lars
--
: Lars Ellenberg
: http://www.LINBIT.com | Your Way to High Availability
: DRBD, Linux-HA and Pacemaker support and consulting
DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.
prev parent reply other threads:[~2015-02-26 20:12 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-02-20 5:40 [Drbd-dev] [bug] drbd 9: Receiver error Goldwyn Rodrigues
2015-02-20 7:41 ` Philipp Marek
2015-02-20 9:12 ` Goldwyn Rodrigues
2015-02-20 9:26 ` Philipp Marek
2015-02-20 14:18 ` Lars Ellenberg
2015-02-23 17:18 ` Goldwyn Rodrigues
2015-02-26 20:12 ` Lars Ellenberg [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150226201201.GL19088@soda.linbit \
--to=lars.ellenberg@linbit.com \
--cc=drbd-dev@lists.linbit.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.