public inbox for linux-bcache@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH] bcache: btree.c: Fix GC thread exit in case of cache device failure and unregister
@ 2018-01-12 15:24 Pavel Vazharov
  2018-01-13  4:06 ` Coly Li
  0 siblings, 1 reply; 3+ messages in thread
From: Pavel Vazharov @ 2018-01-12 15:24 UTC (permalink / raw)
  To: mlyle, kent.overstreet; +Cc: linux-bcache, linux-kernel, Pavel Vazharov

There was a possibility for infinite do-while loop inside the GC thread
function in case of total failure of the caching device. I was able to
reproduce it 3 times simulating disappearing of the caching device via
'echo 1 > /sys/block/<dev>/device/delete'. In that case the btree_root
starts to return non zero and non -EAGAIN result, 'gc failed' message
start to fill the kernel log and the do-while becomes infinite loop
occupying single CPU core at 100%.
There is already a logic which unregisters the cache_set (or panics) in
case of io errors and thus we exit the loop here if the unregistering
procedure has already started.

Signed-off-by: Pavel Vazharov <freakpv@gmail.com>
---
 drivers/md/bcache/btree.c | 8 ++++++--
 1 file changed, 6 insertions(+), 2 deletions(-)

diff --git a/drivers/md/bcache/btree.c b/drivers/md/bcache/btree.c
index 81e8dc3..a672081 100644
--- a/drivers/md/bcache/btree.c
+++ b/drivers/md/bcache/btree.c
@@ -1748,8 +1748,12 @@ static void bch_btree_gc(struct cache_set *c)
 		closure_sync(&writes);
 		cond_resched();
 
-		if (ret && ret != -EAGAIN)
-			pr_warn("gc failed!");
+		if (ret && ret != -EAGAIN) {
+			if (test_bit(CACHE_SET_UNREGISTERING, &c->flags))
+				break;
+			else
+				pr_warn("gc failed!");
+		}
 	} while (ret);
 
 	bch_btree_gc_finish(c);
-- 
2.7.4

^ permalink raw reply related	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2018-01-13  4:43 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2018-01-12 15:24 [PATCH] bcache: btree.c: Fix GC thread exit in case of cache device failure and unregister Pavel Vazharov
2018-01-13  4:06 ` Coly Li
2018-01-13  4:43   ` Pavel Vazharov

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox