* [PATCH] bcache: fix a livelock in btree lock
@ 2014-10-18 12:51 Zheng Liu
0 siblings, 0 replies; 4+ messages in thread
From: Zheng Liu @ 2014-10-18 12:51 UTC (permalink / raw)
To: linux-bcache; +Cc: Zheng Liu
From: Zheng Liu <wenqing.lz@taobao.com>
This commit tries to fix a livelock in bcache. This livelock might
happen when we causes a huge number of cache misses simultaneously.
When we get a cache miss, bcache will execute the following path.
->cached_dev_make_request()
->cached_dev_read()
->cached_lookup()
->bch->btree_map_keys()
->btree_root() <------------------------
->bch_btree_map_keys_recurse() |
->cache_lookup_fn() |
->cached_dev_cache_miss() |
->bch_btree_insert_check_key() -|
[If btree->seq is not equal to seq + 1, we should return
EINTR and traverse btree again.]
In bch_btree_insert_check_key() function we first need to check upgrade
flag (op->lock == -1), and when this flag is true we need to release
read btree->lock and try to take write btree->lock. During taking and
releasing this write lock, btree->seq will be monotone increased in
order to prevent other threads modify this in cache miss (see btree.h:74).
But if there are some cache misses caused by some requested, we could
meet a livelock because btree->seq is always changed by others. Thus no
one can make progress.
This commit will try to take write btree->lock if it encounters a race
when we traverse btree. Although it sacrifice the scalability but we
can ensure that only one can modify the btree.
Signed-off-by: Zheng Liu <wenqing.lz@taobao.com>
---
drivers/md/bcache/btree.c | 4 +++-
1 files changed, 3 insertions(+), 1 deletions(-)
diff --git a/drivers/md/bcache/btree.c b/drivers/md/bcache/btree.c
index 00cde40..d14560a 100644
--- a/drivers/md/bcache/btree.c
+++ b/drivers/md/bcache/btree.c
@@ -2162,8 +2162,10 @@ int bch_btree_insert_check_key(struct btree *b, struct btree_op *op,
rw_lock(true, b, b->level);
if (b->key.ptr[0] != btree_ptr ||
- b->seq != seq + 1)
+ b->seq != seq + 1) {
+ op->lock = b->c->root->level + 1;
goto out;
+ }
}
SET_KEY_PTRS(check_key, 1);
--
1.7.1
^ permalink raw reply related [flat|nested] 4+ messages in thread
* [PATCH] bcache: fix a livelock in btree lock
@ 2015-02-03 11:21 Joshua Schmid
2015-02-04 3:00 ` Zhu Yanhai
0 siblings, 1 reply; 4+ messages in thread
From: Joshua Schmid @ 2015-02-03 11:21 UTC (permalink / raw)
To: linux-bcache; +Cc: Zheng Liu
From: Zheng Liu <wenqing.lz@taobao.com>
This commit tries to fix a livelock in bcache. This livelock might
happen when we causes a huge number of cache misses simultaneously.
When we get a cache miss, bcache will execute the following path.
->cached_dev_make_request()
->cached_dev_read()
->cached_lookup()
->bch->btree_map_keys()
->btree_root() <------------------------
->bch_btree_map_keys_recurse() |
->cache_lookup_fn() |
->cached_dev_cache_miss() |
->bch_btree_insert_check_key() -|
[If btree->seq is not equal to seq + 1, we should return
EINTR and traverse btree again.]
In bch_btree_insert_check_key() function we first need to check upgrade
flag (op->lock == -1), and when this flag is true we need to release
read btree->lock and try to take write btree->lock. During taking and
releasing this write lock, btree->seq will be monotone increased in
order to prevent other threads modify this in cache miss (see btree.h:74).
But if there are some cache misses caused by some requested, we could
meet a livelock because btree->seq is always changed by others. Thus no
one can make progress.
This commit will try to take write btree->lock if it encounters a race
when we traverse btree. Although it sacrifice the scalability but we
can ensure that only one can modify the btree.
Signed-off-by: Zheng Liu <wenqing.lz@taobao.com>
Tested-by: Joshua Schmid <jschmid@suse.com>
---
drivers/md/bcache/btree.c | 4 +++-
1 file changed, 3 insertions(+), 1 deletion(-)
diff --git a/drivers/md/bcache/btree.c b/drivers/md/bcache/btree.c
index 218f21a..f1c224f 100644
--- a/drivers/md/bcache/btree.c
+++ b/drivers/md/bcache/btree.c
@@ -2163,8 +2163,10 @@ int bch_btree_insert_check_key(struct btree *b, struct btree_op *op,
rw_lock(true, b, b->level);
if (b->key.ptr[0] != btree_ptr ||
- b->seq != seq + 1)
+ b->seq != seq + 1) {
+ op->lock = b->c->root->level + 1;
goto out;
+ }
}
SET_KEY_PTRS(check_key, 1);
--
2.1.2
^ permalink raw reply related [flat|nested] 4+ messages in thread
* Re: [PATCH] bcache: fix a livelock in btree lock
2015-02-03 11:21 [PATCH] bcache: fix a livelock in btree lock Joshua Schmid
@ 2015-02-04 3:00 ` Zhu Yanhai
2015-02-25 12:11 ` Zheng Liu
0 siblings, 1 reply; 4+ messages in thread
From: Zhu Yanhai @ 2015-02-04 3:00 UTC (permalink / raw)
To: Joshua Schmid; +Cc: linux-bcache, Zheng Liu
Zheng,
It should be 'op->lock = b->level', not 'op->lock = b->c->root->level
+ 1', otherwise we will stop all concurrency writes unconditionally in
the second round. Isn't it?
-zyh
2015-02-03 19:21 GMT+08:00 Joshua Schmid <jschmid@suse.com>:
> From: Zheng Liu <wenqing.lz@taobao.com>
>
> This commit tries to fix a livelock in bcache. This livelock might
> happen when we causes a huge number of cache misses simultaneously.
>
> When we get a cache miss, bcache will execute the following path.
>
> ->cached_dev_make_request()
> ->cached_dev_read()
> ->cached_lookup()
> ->bch->btree_map_keys()
> ->btree_root() <------------------------
> ->bch_btree_map_keys_recurse() |
> ->cache_lookup_fn() |
> ->cached_dev_cache_miss() |
> ->bch_btree_insert_check_key() -|
> [If btree->seq is not equal to seq + 1, we should return
> EINTR and traverse btree again.]
>
> In bch_btree_insert_check_key() function we first need to check upgrade
> flag (op->lock == -1), and when this flag is true we need to release
> read btree->lock and try to take write btree->lock. During taking and
> releasing this write lock, btree->seq will be monotone increased in
> order to prevent other threads modify this in cache miss (see btree.h:74).
> But if there are some cache misses caused by some requested, we could
> meet a livelock because btree->seq is always changed by others. Thus no
> one can make progress.
>
> This commit will try to take write btree->lock if it encounters a race
> when we traverse btree. Although it sacrifice the scalability but we
> can ensure that only one can modify the btree.
>
> Signed-off-by: Zheng Liu <wenqing.lz@taobao.com>
> Tested-by: Joshua Schmid <jschmid@suse.com>
> ---
> drivers/md/bcache/btree.c | 4 +++-
> 1 file changed, 3 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/md/bcache/btree.c b/drivers/md/bcache/btree.c
> index 218f21a..f1c224f 100644
> --- a/drivers/md/bcache/btree.c
> +++ b/drivers/md/bcache/btree.c
> @@ -2163,8 +2163,10 @@ int bch_btree_insert_check_key(struct btree *b, struct btree_op *op,
> rw_lock(true, b, b->level);
>
> if (b->key.ptr[0] != btree_ptr ||
> - b->seq != seq + 1)
> + b->seq != seq + 1) {
> + op->lock = b->c->root->level + 1;
> goto out;
> + }
> }
>
> SET_KEY_PTRS(check_key, 1);
> --
> 2.1.2
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-bcache" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] bcache: fix a livelock in btree lock
2015-02-04 3:00 ` Zhu Yanhai
@ 2015-02-25 12:11 ` Zheng Liu
0 siblings, 0 replies; 4+ messages in thread
From: Zheng Liu @ 2015-02-25 12:11 UTC (permalink / raw)
To: Zhu Yanhai; +Cc: Joshua Schmid, linux-bcache, Zheng Liu
On Wed, Feb 04, 2015 at 11:00:40AM +0800, Zhu Yanhai wrote:
> Zheng,
>
> It should be 'op->lock = b->level', not 'op->lock = b->c->root->level
> + 1', otherwise we will stop all concurrency writes unconditionally in
> the second round. Isn't it?
You're right. I will fix this problem and re-send the patch that will
be rebased against the latest upstream kernel.
Thanks,
- Zheng
>
> -zyh
>
> 2015-02-03 19:21 GMT+08:00 Joshua Schmid <jschmid@suse.com>:
> > From: Zheng Liu <wenqing.lz@taobao.com>
> >
> > This commit tries to fix a livelock in bcache. This livelock might
> > happen when we causes a huge number of cache misses simultaneously.
> >
> > When we get a cache miss, bcache will execute the following path.
> >
> > ->cached_dev_make_request()
> > ->cached_dev_read()
> > ->cached_lookup()
> > ->bch->btree_map_keys()
> > ->btree_root() <------------------------
> > ->bch_btree_map_keys_recurse() |
> > ->cache_lookup_fn() |
> > ->cached_dev_cache_miss() |
> > ->bch_btree_insert_check_key() -|
> > [If btree->seq is not equal to seq + 1, we should return
> > EINTR and traverse btree again.]
> >
> > In bch_btree_insert_check_key() function we first need to check upgrade
> > flag (op->lock == -1), and when this flag is true we need to release
> > read btree->lock and try to take write btree->lock. During taking and
> > releasing this write lock, btree->seq will be monotone increased in
> > order to prevent other threads modify this in cache miss (see btree.h:74).
> > But if there are some cache misses caused by some requested, we could
> > meet a livelock because btree->seq is always changed by others. Thus no
> > one can make progress.
> >
> > This commit will try to take write btree->lock if it encounters a race
> > when we traverse btree. Although it sacrifice the scalability but we
> > can ensure that only one can modify the btree.
> >
> > Signed-off-by: Zheng Liu <wenqing.lz@taobao.com>
> > Tested-by: Joshua Schmid <jschmid@suse.com>
> > ---
> > drivers/md/bcache/btree.c | 4 +++-
> > 1 file changed, 3 insertions(+), 1 deletion(-)
> >
> > diff --git a/drivers/md/bcache/btree.c b/drivers/md/bcache/btree.c
> > index 218f21a..f1c224f 100644
> > --- a/drivers/md/bcache/btree.c
> > +++ b/drivers/md/bcache/btree.c
> > @@ -2163,8 +2163,10 @@ int bch_btree_insert_check_key(struct btree *b, struct btree_op *op,
> > rw_lock(true, b, b->level);
> >
> > if (b->key.ptr[0] != btree_ptr ||
> > - b->seq != seq + 1)
> > + b->seq != seq + 1) {
> > + op->lock = b->c->root->level + 1;
> > goto out;
> > + }
> > }
> >
> > SET_KEY_PTRS(check_key, 1);
> > --
> > 2.1.2
> >
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-bcache" in
> > the body of a message to majordomo@vger.kernel.org
> > More majordomo info at http://vger.kernel.org/majordomo-info.html
> --
> To unsubscribe from this list: send the line "unsubscribe linux-bcache" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2015-02-25 11:58 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-02-03 11:21 [PATCH] bcache: fix a livelock in btree lock Joshua Schmid
2015-02-04 3:00 ` Zhu Yanhai
2015-02-25 12:11 ` Zheng Liu
-- strict thread matches above, loose matches on Subject: below --
2014-10-18 12:51 Zheng Liu
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox