* [RFC] [PATCH] Potential bug fix for blk_tag_queue
@ 2008-08-26 1:03 Abhijeet Joglekar
2008-08-26 1:03 ` [RFC][PATCH] blk-tag: Use atomic_t type for bqt->busy Abhijeet Joglekar
2008-08-26 2:46 ` [RFC] [PATCH] Potential bug fix for blk_tag_queue Matthew Wilcox
0 siblings, 2 replies; 6+ messages in thread
From: Abhijeet Joglekar @ 2008-08-26 1:03 UTC (permalink / raw)
To: jens.axboe; +Cc: linux-scsi, ajoglekar
The following patch fixes a potential bug in the blk_tag_queue structure.
"busy" is used to keep track of outstanding tags, is declared as int,
and updated inside queue lock. For host-wide shared tag map, this corrupts
the value of busy, which hits BUG_ON during __blk_free_tags.
Recommend converting busy to atomic_t and using atomic_macros to access it.
---
Abhijeet Joglekar (1):
blk-tag: Use atomic_t type for bqt->busy.
block/blk-tag.c | 8 ++++----
include/linux/blkdev.h | 2 +-
2 files changed, 5 insertions(+), 5 deletions(-)
--
Signature
^ permalink raw reply [flat|nested] 6+ messages in thread
* [RFC][PATCH] blk-tag: Use atomic_t type for bqt->busy.
2008-08-26 1:03 [RFC] [PATCH] Potential bug fix for blk_tag_queue Abhijeet Joglekar
@ 2008-08-26 1:03 ` Abhijeet Joglekar
2008-08-26 2:46 ` [RFC] [PATCH] Potential bug fix for blk_tag_queue Matthew Wilcox
1 sibling, 0 replies; 6+ messages in thread
From: Abhijeet Joglekar @ 2008-08-26 1:03 UTC (permalink / raw)
To: jens.axboe; +Cc: linux-scsi, ajoglekar
blk-tag: Use atomic_t type for bqt->busy.
bqt->busy is an integer that keeps count of number of outstanding tags. For
host-wide shared tag map, busy is accessed in a queue lock (not host lock)
which corrupts the busy value. Change busy to atomic_t and use atomic
macros to access it.
Signed-off-by: Abhijeet Joglekar <abjoglek@cisco.com>
---
block/blk-tag.c | 8 ++++----
include/linux/blkdev.h | 2 +-
2 files changed, 5 insertions(+), 5 deletions(-)
diff --git a/block/blk-tag.c b/block/blk-tag.c
index 32667be..8f37d08 100644
--- a/block/blk-tag.c
+++ b/block/blk-tag.c
@@ -38,7 +38,7 @@ static int __blk_free_tags(struct blk_queue_tag *bqt)
retval = atomic_dec_and_test(&bqt->refcnt);
if (retval) {
- BUG_ON(bqt->busy);
+ BUG_ON(atomic_read(&bqt->busy));
kfree(bqt->tag_index);
bqt->tag_index = NULL;
@@ -147,7 +147,7 @@ static struct blk_queue_tag *__blk_queue_init_tags(struct request_queue *q,
if (init_tag_map(q, tags, depth))
goto fail;
- tags->busy = 0;
+ atomic_set(&tags->busy, 0);
atomic_set(&tags->refcnt, 1);
return tags;
fail:
@@ -313,7 +313,7 @@ void blk_queue_end_tag(struct request_queue *q, struct request *rq)
* unlock memory barrier semantics.
*/
clear_bit_unlock(tag, bqt->tag_map);
- bqt->busy--;
+ atomic_dec(&bqt->busy);
}
EXPORT_SYMBOL(blk_queue_end_tag);
@@ -368,7 +368,7 @@ int blk_queue_start_tag(struct request_queue *q, struct request *rq)
bqt->tag_index[tag] = rq;
blkdev_dequeue_request(rq);
list_add(&rq->queuelist, &q->tag_busy_list);
- bqt->busy++;
+ atomic_inc(&bqt->busy);
return 0;
}
EXPORT_SYMBOL(blk_queue_start_tag);
diff --git a/include/linux/blkdev.h b/include/linux/blkdev.h
index 88d6808..106531e 100644
--- a/include/linux/blkdev.h
+++ b/include/linux/blkdev.h
@@ -274,7 +274,7 @@ enum blk_queue_state {
struct blk_queue_tag {
struct request **tag_index; /* map of busy tags */
unsigned long *tag_map; /* bit map of free/busy tags */
- int busy; /* current depth */
+ atomic_t busy; /* current depth */
int max_depth; /* what we will send to device */
int real_max_depth; /* what the array can hold */
atomic_t refcnt; /* map can be shared */
^ permalink raw reply related [flat|nested] 6+ messages in thread
* Re: [RFC] [PATCH] Potential bug fix for blk_tag_queue
2008-08-26 1:03 [RFC] [PATCH] Potential bug fix for blk_tag_queue Abhijeet Joglekar
2008-08-26 1:03 ` [RFC][PATCH] blk-tag: Use atomic_t type for bqt->busy Abhijeet Joglekar
@ 2008-08-26 2:46 ` Matthew Wilcox
2008-08-26 7:00 ` Jens Axboe
1 sibling, 1 reply; 6+ messages in thread
From: Matthew Wilcox @ 2008-08-26 2:46 UTC (permalink / raw)
To: Abhijeet Joglekar; +Cc: jens.axboe, linux-scsi
On Mon, Aug 25, 2008 at 06:03:01PM -0700, Abhijeet Joglekar wrote:
> The following patch fixes a potential bug in the blk_tag_queue structure.
> "busy" is used to keep track of outstanding tags, is declared as int,
> and updated inside queue lock. For host-wide shared tag map, this corrupts
> the value of busy, which hits BUG_ON during __blk_free_tags.
I believe you to be right.
> Recommend converting busy to atomic_t and using atomic_macros to access it.
ugh, more atomic ops. How about just getting rid of it? Patch
untested:
diff --git a/block/blk-tag.c b/block/blk-tag.c
index 32667be..ed5166f 100644
--- a/block/blk-tag.c
+++ b/block/blk-tag.c
@@ -38,7 +38,8 @@ static int __blk_free_tags(struct blk_queue_tag *bqt)
retval = atomic_dec_and_test(&bqt->refcnt);
if (retval) {
- BUG_ON(bqt->busy);
+ BUG_ON(find_first_bit(bqt->tag_map, bqt->max_depth) <
+ bqt->max_depth);
kfree(bqt->tag_index);
bqt->tag_index = NULL;
@@ -147,7 +148,6 @@ static struct blk_queue_tag *__blk_queue_init_tags(struct request_queue *q,
if (init_tag_map(q, tags, depth))
goto fail;
- tags->busy = 0;
atomic_set(&tags->refcnt, 1);
return tags;
fail:
@@ -313,7 +313,6 @@ void blk_queue_end_tag(struct request_queue *q, struct request *rq)
* unlock memory barrier semantics.
*/
clear_bit_unlock(tag, bqt->tag_map);
- bqt->busy--;
}
EXPORT_SYMBOL(blk_queue_end_tag);
@@ -368,7 +367,6 @@ int blk_queue_start_tag(struct request_queue *q, struct request *rq)
bqt->tag_index[tag] = rq;
blkdev_dequeue_request(rq);
list_add(&rq->queuelist, &q->tag_busy_list);
- bqt->busy++;
return 0;
}
EXPORT_SYMBOL(blk_queue_start_tag);
diff --git a/include/linux/blkdev.h b/include/linux/blkdev.h
index e61f22b..c66c664 100644
--- a/include/linux/blkdev.h
+++ b/include/linux/blkdev.h
@@ -274,7 +274,6 @@ enum blk_queue_state {
struct blk_queue_tag {
struct request **tag_index; /* map of busy tags */
unsigned long *tag_map; /* bit map of free/busy tags */
- int busy; /* current depth */
int max_depth; /* what we will send to device */
int real_max_depth; /* what the array can hold */
atomic_t refcnt; /* map can be shared */
--
Matthew Wilcox Intel Open Source Technology Centre
"Bill, look, we understand that you're interested in selling us this
operating system, but compare it to ours. We can't possibly take such
a retrograde step."
^ permalink raw reply related [flat|nested] 6+ messages in thread
* Re: [RFC] [PATCH] Potential bug fix for blk_tag_queue
2008-08-26 2:46 ` [RFC] [PATCH] Potential bug fix for blk_tag_queue Matthew Wilcox
@ 2008-08-26 7:00 ` Jens Axboe
2008-08-26 7:49 ` Abhijeet Joglekar
0 siblings, 1 reply; 6+ messages in thread
From: Jens Axboe @ 2008-08-26 7:00 UTC (permalink / raw)
To: Matthew Wilcox; +Cc: Abhijeet Joglekar, linux-scsi
On Mon, Aug 25 2008, Matthew Wilcox wrote:
> On Mon, Aug 25, 2008 at 06:03:01PM -0700, Abhijeet Joglekar wrote:
> > The following patch fixes a potential bug in the blk_tag_queue structure.
> > "busy" is used to keep track of outstanding tags, is declared as int,
> > and updated inside queue lock. For host-wide shared tag map, this corrupts
> > the value of busy, which hits BUG_ON during __blk_free_tags.
>
> I believe you to be right.
>
> > Recommend converting busy to atomic_t and using atomic_macros to access it.
>
> ugh, more atomic ops. How about just getting rid of it? Patch
> untested:
>
> diff --git a/block/blk-tag.c b/block/blk-tag.c
> index 32667be..ed5166f 100644
> --- a/block/blk-tag.c
> +++ b/block/blk-tag.c
> @@ -38,7 +38,8 @@ static int __blk_free_tags(struct blk_queue_tag *bqt)
>
> retval = atomic_dec_and_test(&bqt->refcnt);
> if (retval) {
> - BUG_ON(bqt->busy);
> + BUG_ON(find_first_bit(bqt->tag_map, bqt->max_depth) <
> + bqt->max_depth);
>
> kfree(bqt->tag_index);
> bqt->tag_index = NULL;
> @@ -147,7 +148,6 @@ static struct blk_queue_tag *__blk_queue_init_tags(struct request_queue *q,
> if (init_tag_map(q, tags, depth))
> goto fail;
>
> - tags->busy = 0;
> atomic_set(&tags->refcnt, 1);
> return tags;
> fail:
> @@ -313,7 +313,6 @@ void blk_queue_end_tag(struct request_queue *q, struct request *rq)
> * unlock memory barrier semantics.
> */
> clear_bit_unlock(tag, bqt->tag_map);
> - bqt->busy--;
> }
> EXPORT_SYMBOL(blk_queue_end_tag);
>
> @@ -368,7 +367,6 @@ int blk_queue_start_tag(struct request_queue *q, struct request *rq)
> bqt->tag_index[tag] = rq;
> blkdev_dequeue_request(rq);
> list_add(&rq->queuelist, &q->tag_busy_list);
> - bqt->busy++;
> return 0;
> }
> EXPORT_SYMBOL(blk_queue_start_tag);
> diff --git a/include/linux/blkdev.h b/include/linux/blkdev.h
> index e61f22b..c66c664 100644
> --- a/include/linux/blkdev.h
> +++ b/include/linux/blkdev.h
> @@ -274,7 +274,6 @@ enum blk_queue_state {
> struct blk_queue_tag {
> struct request **tag_index; /* map of busy tags */
> unsigned long *tag_map; /* bit map of free/busy tags */
> - int busy; /* current depth */
> int max_depth; /* what we will send to device */
> int real_max_depth; /* what the array can hold */
> atomic_t refcnt; /* map can be shared */
Thanks, we can easily kill it. We should remove blk_queue_tag_depth()
and blk_queue_tag_queue() as well, they were the 'exported' way of
getting this information. I added this for the IDE TCQ bits back in
olden days, but that stuff is no longer there. So it's probably been
unused since then.
I've applied your patch.
--
Jens Axboe
^ permalink raw reply [flat|nested] 6+ messages in thread
* RE: [RFC] [PATCH] Potential bug fix for blk_tag_queue
2008-08-26 7:00 ` Jens Axboe
@ 2008-08-26 7:49 ` Abhijeet Joglekar
2008-08-26 8:58 ` Jens Axboe
0 siblings, 1 reply; 6+ messages in thread
From: Abhijeet Joglekar @ 2008-08-26 7:49 UTC (permalink / raw)
To: Jens Axboe, Matthew Wilcox; +Cc: linux-scsi
Ok, removing that works too.
Shouldn't the BUG_ON be checking upto bqt->real_max_depth instead of
bqt->depth?
BUG_ON(find_first_bit(bqt->tag_map, bqt->real_max_depth) <
bqt->real_max_depth);
In cases where the tag map gets resized to a value less than the
originally allocated tag map, blk_queue_resize_tags seems to be setting
bqt->max_depth to the new_depth.
There could still be outstanding tags max_depth <= t <= real_max_depth
which might not get freed, which the BUG_ON will not capture if it
checks against max_depth.
abhijeet
-----Original Message-----
From: Jens Axboe [mailto:jens.axboe@oracle.com]
Sent: Tuesday, August 26, 2008 12:01 AM
To: Matthew Wilcox
Cc: Abhijeet Joglekar; linux-scsi@vger.kernel.org
Subject: Re: [RFC] [PATCH] Potential bug fix for blk_tag_queue
On Mon, Aug 25 2008, Matthew Wilcox wrote:
> On Mon, Aug 25, 2008 at 06:03:01PM -0700, Abhijeet Joglekar wrote:
> > The following patch fixes a potential bug in the blk_tag_queue
structure.
> > "busy" is used to keep track of outstanding tags, is declared as
int,
> > and updated inside queue lock. For host-wide shared tag map, this
corrupts
> > the value of busy, which hits BUG_ON during __blk_free_tags.
>
> I believe you to be right.
>
> > Recommend converting busy to atomic_t and using atomic_macros to
access it.
>
> ugh, more atomic ops. How about just getting rid of it? Patch
> untested:
>
> diff --git a/block/blk-tag.c b/block/blk-tag.c
> index 32667be..ed5166f 100644
> --- a/block/blk-tag.c
> +++ b/block/blk-tag.c
> @@ -38,7 +38,8 @@ static int __blk_free_tags(struct blk_queue_tag
*bqt)
>
> retval = atomic_dec_and_test(&bqt->refcnt);
> if (retval) {
> - BUG_ON(bqt->busy);
> + BUG_ON(find_first_bit(bqt->tag_map, bqt->max_depth) <
> + bqt->max_depth);
>
> kfree(bqt->tag_index);
> bqt->tag_index = NULL;
> @@ -147,7 +148,6 @@ static struct blk_queue_tag
*__blk_queue_init_tags(struct request_queue *q,
> if (init_tag_map(q, tags, depth))
> goto fail;
>
> - tags->busy = 0;
> atomic_set(&tags->refcnt, 1);
> return tags;
> fail:
> @@ -313,7 +313,6 @@ void blk_queue_end_tag(struct request_queue *q,
struct request *rq)
> * unlock memory barrier semantics.
> */
> clear_bit_unlock(tag, bqt->tag_map);
> - bqt->busy--;
> }
> EXPORT_SYMBOL(blk_queue_end_tag);
>
> @@ -368,7 +367,6 @@ int blk_queue_start_tag(struct request_queue *q,
struct request *rq)
> bqt->tag_index[tag] = rq;
> blkdev_dequeue_request(rq);
> list_add(&rq->queuelist, &q->tag_busy_list);
> - bqt->busy++;
> return 0;
> }
> EXPORT_SYMBOL(blk_queue_start_tag);
> diff --git a/include/linux/blkdev.h b/include/linux/blkdev.h
> index e61f22b..c66c664 100644
> --- a/include/linux/blkdev.h
> +++ b/include/linux/blkdev.h
> @@ -274,7 +274,6 @@ enum blk_queue_state {
> struct blk_queue_tag {
> struct request **tag_index; /* map of busy tags */
> unsigned long *tag_map; /* bit map of free/busy tags */
> - int busy; /* current depth */
> int max_depth; /* what we will send to device
*/
> int real_max_depth; /* what the array can hold */
> atomic_t refcnt; /* map can be shared */
Thanks, we can easily kill it. We should remove blk_queue_tag_depth()
and blk_queue_tag_queue() as well, they were the 'exported' way of
getting this information. I added this for the IDE TCQ bits back in
olden days, but that stuff is no longer there. So it's probably been
unused since then.
I've applied your patch.
--
Jens Axboe
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [RFC] [PATCH] Potential bug fix for blk_tag_queue
2008-08-26 7:49 ` Abhijeet Joglekar
@ 2008-08-26 8:58 ` Jens Axboe
0 siblings, 0 replies; 6+ messages in thread
From: Jens Axboe @ 2008-08-26 8:58 UTC (permalink / raw)
To: Abhijeet Joglekar; +Cc: Matthew Wilcox, linux-scsi
(please don't top post!)
On Tue, Aug 26 2008, Abhijeet Joglekar wrote:
> Ok, removing that works too.
>
> Shouldn't the BUG_ON be checking upto bqt->real_max_depth instead of
> bqt->depth?
>
> BUG_ON(find_first_bit(bqt->tag_map, bqt->real_max_depth) <
> bqt->real_max_depth);
>
>
> In cases where the tag map gets resized to a value less than the
> originally allocated tag map, blk_queue_resize_tags seems to be setting
> bqt->max_depth to the new_depth.
>
> There could still be outstanding tags max_depth <= t <= real_max_depth
> which might not get freed, which the BUG_ON will not capture if it
> checks against max_depth.
max_depth is fine, it's the safer choice. For the real_max_depth to
potentially catch any extra offenders, you would have to do a resize to
a larger depth, get bunch of IO issued, then resize down and shut down
the queue. Using real_max_depth should work as well, but then you have
to audit the ending of tags > max/real_max_depth. It looks ok, though.
--
Jens Axboe
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2008-08-26 8:58 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-08-26 1:03 [RFC] [PATCH] Potential bug fix for blk_tag_queue Abhijeet Joglekar
2008-08-26 1:03 ` [RFC][PATCH] blk-tag: Use atomic_t type for bqt->busy Abhijeet Joglekar
2008-08-26 2:46 ` [RFC] [PATCH] Potential bug fix for blk_tag_queue Matthew Wilcox
2008-08-26 7:00 ` Jens Axboe
2008-08-26 7:49 ` Abhijeet Joglekar
2008-08-26 8:58 ` Jens Axboe
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox