Distributed Replicated Block Device (DRBD) development
 help / color / mirror / Atom feed
From: Lars Ellenberg <lars.ellenberg@linbit.com>
To: drbd-dev@lists.linbit.com
Subject: Re: [Drbd-dev] NULL pointer derefernce in 8.4.7-1
Date: Wed, 30 Mar 2016 14:00:50 +0200	[thread overview]
Message-ID: <20160330120050.GO15579@soda.linbit> (raw)
In-Reply-To: <alpine.LRH.2.11.1603300137590.2225@mail.ewheeler.net>

On Wed, Mar 30, 2016 at 02:19:07AM +0000, Eric Wheeler wrote:
> Hello all,
> 
> We are getting kernel crashes in linux 4.1.20 with the drbd-8.4.git tree 
> at commit 3a6a769340ef93b1ba2792c6461250790795db49 .  
> 
> I don't see anything in the newer commits that addresses this issue so 
> I'm posting---but I'll try the latest commit in master, too, just in case.
> 
> Please see the backtrace below.  I also included our global_common.conf 
> further down.  This is protocol A and the link is quite slow.  This NULL 
> ptr dereference appears to show up when the drbd kernel thread is blocked 
> for a long time.  It might happen at reconnect time because the BUG didn't 
> show up until 13 seconds after the P_BARRIER error.
> 
> The problem is pretty reproducable, so I can probably test patches.  
> Please let me know what I can do to help test.

DRBD logs of both peers leading up to the incident may be useful.

check if older kernel versions are ok?
as in 2.6.32, 3.10, ...
if older seems to be ok, figure out which version breaks.

maybe check if older DRBD is still ok (maybe this is a more recent regression?)

try to resolve addresses to source code lines.

> [ 2480.751713]  [<ffffffffa06b0f26>] drbd_send+0xe6/0x200 [drbd]
> [ 2480.753608]  [<ffffffffa06b2b81>] _drbd_no_send_page.isra.40+0x71/0xb0 [drbd]
> [ 2480.755463]  [<ffffffffa06b3178>] drbd_send_dblock+0x3e8/0x7a0 [drbd]
> [ 2480.757263]  [<ffffffffa06a5874>] ? complete_master_bio+0x94/0x170 [drbd]
> [ 2480.759073]  [<ffffffffa06935cf>] w_send_dblock+0xaf/0x1e0 [drbd]
> [ 2480.760844]  [<ffffffffa06949a9>] drbd_worker+0xf9/0x3a0 [drbd]
> [ 2480.762567]  [<ffffffffa06aed00>] ? drbd_destroy_connection+0x190/0x190 [drbd]
> [ 2480.764181]  [<ffffffffa06aed1d>] drbd_thread_setup+0x1d/0x110 [drbd]
> [ 2480.765777]  [<ffffffffa06aed00>] ? drbd_destroy_connection+0x190/0x190 [drbd]
> [ 2480.767337]  [<ffffffff810c0b08>] kthread+0xd8/0xf0
> [ 2480.768873]  [<ffffffff810c0a30>] ? kthread_create_on_node+0x1b0/0x1b0
> [ 2480.770409]  [<ffffffff816e94e2>] ret_from_fork+0x42/0x70
> [ 2480.771868]  [<ffffffff810c0a30>] ? kthread_create_on_node+0x1b0/0x1b0
> 
> 
> ===> /etc/drbd.d/global_common.conf <===
> common {
> 	startup {
> 		wfc-timeout 30;
> 		outdated-wfc-timeout 20;
> 		degr-wfc-timeout 30;
> 	}
> 	options {
> 		on-no-data-accessible suspend-io;
> 	}
> 	syncer {
> 		rate 500M;
> 	}
> 	disk {
> 		al-extents 3389;
> 		c-fill-target 10240;
> 		c-delay-target 100;
> 		c-plan-ahead 70;
> 		c-min-rate 1024;
> 		c-max-rate 400M;
> 		on-io-error pass_on;
> 		read-balancing when-congested-remote;
> 	}
> 	net {
> 		after-sb-0pri discard-zero-changes;
> 		after-sb-1pri call-pri-lost-after-sb;
> 		after-sb-2pri disconnect;
> 		allow-two-primaries no;
> 		protocol A;
> 		cram-hmac-alg sha1;
> 		verify-alg crc32c;
> 		csums-alg crc32c;
> 		max-buffers 8192;
> 		max-epoch-size 8192;
> 		tcp-cork yes;
> 		sndbuf-size 1M;
> 		rcvbuf-size 2M;
> 		unplug-watermark 128; 
> 		ko-count 3;
> 		timeout 90;
> 		
> 		ping-int 10;
> 		ping-timeout 30;
> 	}
> }

-- 
: Lars Ellenberg
: LINBIT | Keeping the Digital World Running
: DRBD -- Heartbeat -- Corosync -- Pacemaker
: R&D, Integration, Ops, Consulting, Support

DRBD® and LINBIT® are registered trademarks of LINBIT

  reply	other threads:[~2016-03-30 12:00 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-03-30  2:19 [Drbd-dev] NULL pointer derefernce in 8.4.7-1 Eric Wheeler
2016-03-30 12:00 ` Lars Ellenberg [this message]
2016-03-30 19:52   ` [Drbd-dev] NULL pointer derefernce in 8.4.7-1 during drbd_destroy_connection() Eric Wheeler

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20160330120050.GO15579@soda.linbit \
    --to=lars.ellenberg@linbit.com \
    --cc=drbd-dev@lists.linbit.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox