linux-kernel.vger.kernel.org archive mirror
* [PATCH] lock_page() doesn't lock if __wait_on_bit_lock returns -EINTR
@ 2015-12-12 16:23 Chris Mason
From: Chris Mason @ 2015-12-12 16:23 UTC
  To: Linus Torvalds, Peter Zijlstra, Dave Jones, LKML,
	Jon Christopherson

We have two reports of frequent crashes in btrfs where asserts in
clear_page_dirty_for_io() were triggering on missing page locks.

The crashes were much easier to trigger when processes were catching
Ctrl-C, and after much debugging it really looked like lock_page() was a
no-op.

This recent commit looks pretty suspect to me, and I confirmed that we
were exiting __wait_on_bit_lock() with -EINTR when it was called with
TASK_UNINTERRUPTIBLE:

commit 68985633bccb6066bf1803e316fbc6c1f5b796d6
Author: Peter Zijlstra <peterz@infradead.org>
Date:   Tue Dec 1 14:04:04 2015 +0100

    sched/wait: Fix signal handling in bit wait helpers

The patch below is mostly untested, and probably not the right solution.
Dave's trinity run doesn't explode immediately anymore, and I wanted to
get this out for discussion.  A quick look on the list doesn't show
anyone else has tracked this down, sorry if it's a dup.

Reported-by: Dave Jones <dsj@fb.com>
Reported-by: Jon Christopherson <jon@jons.org>
Signed-off-by: Chris Mason <clm@fb.com>

diff --git a/kernel/sched/wait.c b/kernel/sched/wait.c
index f10bd87..12f69df 100644
--- a/kernel/sched/wait.c
+++ b/kernel/sched/wait.c
@@ -434,6 +434,8 @@ __wait_on_bit_lock(wait_queue_head_t *wq, struct wait_bit_queue *q,
 		ret = action(&q->key);
 		if (!ret)
 			continue;
+		if (ret == -EINTR && mode == TASK_UNINTERRUPTIBLE)
+			continue;
 		abort_exclusive_wait(wq, &q->wait, mode, &q->key);
 		return ret;
 	} while (test_and_set_bit(q->key.bit_nr, q->key.flags));
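
To make the failure mode in the hunk above easier to follow, here is a
minimal userspace sketch of the wait loop. It is not kernel code: every
name in it (model_bit, model_action, model_wait_on_bit_lock, the MODEL_*
modes) is hypothetical, the "sleep" is modeled as a countdown until the
current holder drops the bit, and the signal is modeled as a counter of
pending -EINTR returns. It only illustrates why a caller that ignores the
return value, as lock_page() appears to, ends up running without the lock
when the loop bails out early.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>

/* Hypothetical stand-ins for the task states passed as "mode". */
enum model_mode { MODEL_UNINTERRUPTIBLE, MODEL_KILLABLE };

struct model_bit {
	bool locked;        /* the lock bit being waited on */
	int  pending_eintr; /* how many times the action will report -EINTR */
	int  holder_ticks;  /* action calls until the current holder unlocks;
			       must be > 0 while locked, or the model spins */
};

/* Models the wait action: one "sleep", then report a pending signal. */
static int model_action(struct model_bit *b)
{
	if (b->holder_ticks > 0 && --b->holder_ticks == 0)
		b->locked = false;
	if (b->pending_eintr > 0) {
		b->pending_eintr--;
		return -EINTR;
	}
	return 0;
}

/* Models test_and_set_bit(): set the bit, return its previous value. */
static bool model_test_and_set(struct model_bit *b)
{
	bool old = b->locked;

	b->locked = true;
	return old;
}

/* Same shape as the loop in the diff; "patched" enables the added check. */
static int model_wait_on_bit_lock(struct model_bit *b, enum model_mode mode,
				  bool patched)
{
	do {
		int ret;

		if (!b->locked)
			continue;	/* bit looks free: try to take it */
		ret = model_action(b);
		if (!ret)
			continue;
		if (patched && ret == -EINTR && mode == MODEL_UNINTERRUPTIBLE)
			continue;	/* uninterruptible: keep waiting */
		return ret;		/* early exit, bit NOT acquired */
	} while (model_test_and_set(b));
	return 0;			/* bit acquired by this caller */
}

/* Walks the three cases discussed in this mail. */
static void model_demo(void)
{
	struct model_bit a = { .locked = true, .pending_eintr = 1, .holder_ticks = 3 };
	struct model_bit b = a;
	struct model_bit c = a;

	/* Without the check: early -EINTR while the other holder still owns the bit. */
	assert(model_wait_on_bit_lock(&a, MODEL_UNINTERRUPTIBLE, false) == -EINTR);
	assert(a.holder_ticks > 0);

	/* With the check: the loop keeps waiting until it really takes the bit. */
	assert(model_wait_on_bit_lock(&b, MODEL_UNINTERRUPTIBLE, true) == 0);
	assert(b.holder_ticks == 0 && b.locked);

	/* Interruptible-style callers still see -EINTR and must handle it. */
	assert(model_wait_on_bit_lock(&c, MODEL_KILLABLE, true) == -EINTR);
}
```

Calling model_demo() from a main() exercises all three cases. In the first
one the function returns while the previous holder still owns the bit, which
matches the symptom of asserts firing on missing page locks.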

Thread overview: 11+ messages
2015-12-12 16:23 [PATCH] lock_page() doesn't lock if __wait_on_bit_lock returns -EINTR Chris Mason
2015-12-12 18:33 ` Linus Torvalds
2015-12-12 19:41   ` Linus Torvalds
2015-12-13  0:07     ` Chris Mason
2015-12-14 18:33       ` Dave Jones
2015-12-14 20:01         ` Chris Mason
2015-12-14 23:59         ` Chris Mason
2015-12-13  9:50     ` Peter Zijlstra
2015-12-13 15:55       ` Chris Mason
2015-12-13 20:51       ` Linus Torvalds
2015-12-13 21:12         ` Peter Zijlstra
