From: Lars Ellenberg <Lars.Ellenberg@linbit.com>
To: drbd-dev@lists.linbit.com
Subject: Re: [Drbd-dev] DRBD-8: recent regression causing corruption and crashes
Date: Fri, 11 Aug 2006 20:45:59 +0200 [thread overview]
Message-ID: <20060811184559.GG7373@soda.linbit> (raw)
In-Reply-To: <342BAC0A5467384983B586A6B0B3767103624F3C@EXNA.corp.stratus.com>
/ 2006-08-11 12:01:23 -0400
\ Graham, Simon:
> Quick update:
>
How exactly do you "test"?
Kernel and hardware?
(sorry, if you posted that earlier, just point me to it)
I triggered a full sync (drbdadm invalidate),
and while that was running, access the Primary(SyncSource)
(cp -av /somethinghuge/ /mnt/drbd-mount-point/)
> > 1. I get errors during initial synchronization of a volume like this
> > that cause the resync to be aborted:
> >
> > drbd15: tl_verify: failed to find req e51a4da0, sector 0 in list
I don't see those here.
> DRBD, Cmd: WriteAck, BlkId: SYNCER Sector: 0, AckLen: 8000
I don't see these either.
> > 2. I get panics with the following signature:- these look like they
> are
> > happening when a local write
> > on the primary (which this node is) completes.
>
> The panic signature seems to change - for example, I just got one like
> this in the receiver thread:
>
> drbd15: ASSERT( drbd_req_get_sector(i) == sector ) in
> /sandbox/sgraham/sn/trunk/platform/drbd/8.0/drbd/drbd_main.c:313
> drbd15: tl_verify: found req e63d0240 but it has wrong sector (8 versus
> 0)
nor these.
> drbd15: in tl_clear_barrier:374: ap_pending_cnt = -1 < 0 !
this is bad...
What I do see here is: "ap_pending > 0" still too often, when I
disconnect during resync + write activity, effectively blocking the
Primary's io subsystem. seemingly we still got bugs in tl_clear :(
need to look into that further.
> Code: Bad EIP value.
> <0>Fatal exception: panic in 5 seconds
outch.
--
: Lars Ellenberg Tel +43-1-8178292-55 :
: LINBIT Information Technologies GmbH Fax +43-1-8178292-82 :
: Schoenbrunner Str. 244, A-1120 Vienna/Europe http://www.linbit.com :
next prev parent reply other threads:[~2006-08-11 18:45 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-08-11 16:01 [Drbd-dev] DRBD-8: recent regression causing corruption and crashes Graham, Simon
2006-08-11 18:45 ` Lars Ellenberg [this message]
-- strict thread matches above, loose matches on Subject: below --
2006-08-11 2:31 Graham, Simon
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20060811184559.GG7373@soda.linbit \
--to=lars.ellenberg@linbit.com \
--cc=drbd-dev@lists.linbit.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.