linux-fsdevel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v2] block: fix mis-synchronisation in blkdev_issue_zeroout()
@ 2011-03-11  8:06 Lukas Czerner
  2011-03-11  9:29 ` Jens Axboe
  2011-03-11 14:23 ` Jeff Moyer
  0 siblings, 2 replies; 5+ messages in thread
From: Lukas Czerner @ 2011-03-11  8:06 UTC (permalink / raw)
  To: linux-fsdevel
  Cc: linux-kernel, stable, Lukas Czerner, Jens Axboe, Dmitry Monakhov,
	Jeff Moyer

BZ29402
https://bugzilla.kernel.org/show_bug.cgi?id=29402

We can hit serious mis-synchronization in bio completion path of
blkdev_issue_zeroout() leading to a panic.

The problem is that when we are going to wait_for_completion() in
blkdev_issue_zeroout() we check if the bb.done equals issued (number of
submitted bios). If it does, we can skip the wait_for_completition()
and just out of the function since there is nothing to wait for.
However, there is a ordering problem because bio_batch_end_io() is
calling atomic_inc(&bb->done) before complete(), hence it might seem to
blkdev_issue_zeroout() that all bios has been completed and exit. At
this point when bio_batch_end_io() is going to call complete(bb->wait),
bb and wait does not longer exist since it was allocated on stack in
blkdev_issue_zeroout() ==> panic!

(thread 1)                      (thread 2)
bio_batch_end_io()              blkdev_issue_zeroout()
  if(bb) {                      ...
    if (bb->end_io)             ...
      bb->end_io(bio, err);     ...
    atomic_inc(&bb->done);      ...
    ...                         while (issued != atomic_read(&bb.done))
    ...                         (let issued == bb.done)
    ...                         (do the rest of the function)
    ...                         return ret;
    complete(bb->wait);
    ^^^^^^^^
    panic

We can fix this easily by simplifying bio_batch and completion counting.

Also remove bio_end_io_t *end_io since it is not used.

Signed-off-by: Lukas Czerner <lczerner@redhat.com>
Reported-by: Eric Whitney <eric.whitney@hp.com>
Tested-by: Eric Whitney <eric.whitney@hp.com>
CC: Jens Axboe <axboe@kernel.dk>
CC: Dmitry Monakhov <dmonakhov@openvz.org>
CC: Jeff Moyer <jmoyer@redhat.com>
---
 block/blk-lib.c |   19 +++++++------------
 1 files changed, 7 insertions(+), 12 deletions(-)

diff --git a/block/blk-lib.c b/block/blk-lib.c
index eec78be..bd3e8df 100644
--- a/block/blk-lib.c
+++ b/block/blk-lib.c
@@ -109,7 +109,6 @@ struct bio_batch
 	atomic_t 		done;
 	unsigned long 		flags;
 	struct completion 	*wait;
-	bio_end_io_t		*end_io;
 };
 
 static void bio_batch_end_io(struct bio *bio, int err)
@@ -122,12 +121,9 @@ static void bio_batch_end_io(struct bio *bio, int err)
 		else
 			clear_bit(BIO_UPTODATE, &bb->flags);
 	}
-	if (bb) {
-		if (bb->end_io)
-			bb->end_io(bio, err);
-		atomic_inc(&bb->done);
-		complete(bb->wait);
-	}
+	if (bb)
+		if (atomic_dec_and_test(&bb->done))
+			complete(bb->wait);
 	bio_put(bio);
 }
 
@@ -150,13 +146,12 @@ int blkdev_issue_zeroout(struct block_device *bdev, sector_t sector,
 	int ret;
 	struct bio *bio;
 	struct bio_batch bb;
-	unsigned int sz, issued = 0;
+	unsigned int sz;
 	DECLARE_COMPLETION_ONSTACK(wait);
 
-	atomic_set(&bb.done, 0);
+	atomic_set(&bb.done, 1);
 	bb.flags = 1 << BIO_UPTODATE;
 	bb.wait = &wait;
-	bb.end_io = NULL;
 
 submit:
 	ret = 0;
@@ -185,12 +180,12 @@ submit:
 				break;
 		}
 		ret = 0;
-		issued++;
+		atomic_inc(&bb.done);
 		submit_bio(WRITE, bio);
 	}
 
 	/* Wait for bios in-flight */
-	while (issued != atomic_read(&bb.done))
+	if (!atomic_dec_and_test(&bb.done))
 		wait_for_completion(&wait);
 
 	if (!test_bit(BIO_UPTODATE, &bb.flags))
-- 
1.7.4

^ permalink raw reply related	[flat|nested] 5+ messages in thread

* Re: [PATCH v2] block: fix mis-synchronisation in blkdev_issue_zeroout()
  2011-03-11  8:06 [PATCH v2] block: fix mis-synchronisation in blkdev_issue_zeroout() Lukas Czerner
@ 2011-03-11  9:29 ` Jens Axboe
  2011-03-11 14:23 ` Jeff Moyer
  1 sibling, 0 replies; 5+ messages in thread
From: Jens Axboe @ 2011-03-11  9:29 UTC (permalink / raw)
  To: Lukas Czerner
  Cc: linux-fsdevel, linux-kernel, stable, Dmitry Monakhov, Jeff Moyer

On 2011-03-11 09:06, Lukas Czerner wrote:
> BZ29402
> https://bugzilla.kernel.org/show_bug.cgi?id=29402
> 
> We can hit serious mis-synchronization in bio completion path of
> blkdev_issue_zeroout() leading to a panic.
> 
> The problem is that when we are going to wait_for_completion() in
> blkdev_issue_zeroout() we check if the bb.done equals issued (number of
> submitted bios). If it does, we can skip the wait_for_completition()
> and just out of the function since there is nothing to wait for.
> However, there is a ordering problem because bio_batch_end_io() is
> calling atomic_inc(&bb->done) before complete(), hence it might seem to
> blkdev_issue_zeroout() that all bios has been completed and exit. At
> this point when bio_batch_end_io() is going to call complete(bb->wait),
> bb and wait does not longer exist since it was allocated on stack in
> blkdev_issue_zeroout() ==> panic!
> 
> (thread 1)                      (thread 2)
> bio_batch_end_io()              blkdev_issue_zeroout()
>   if(bb) {                      ...
>     if (bb->end_io)             ...
>       bb->end_io(bio, err);     ...
>     atomic_inc(&bb->done);      ...
>     ...                         while (issued != atomic_read(&bb.done))
>     ...                         (let issued == bb.done)
>     ...                         (do the rest of the function)
>     ...                         return ret;
>     complete(bb->wait);
>     ^^^^^^^^
>     panic
> 
> We can fix this easily by simplifying bio_batch and completion counting.
> 
> Also remove bio_end_io_t *end_io since it is not used.

Thanks Lukas, I'll expedite this for 2.6.38!

-- 
Jens Axboe

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH v2] block: fix mis-synchronisation in blkdev_issue_zeroout()
  2011-03-11  8:06 [PATCH v2] block: fix mis-synchronisation in blkdev_issue_zeroout() Lukas Czerner
  2011-03-11  9:29 ` Jens Axboe
@ 2011-03-11 14:23 ` Jeff Moyer
  2011-03-11 14:31   ` Lukas Czerner
  1 sibling, 1 reply; 5+ messages in thread
From: Jeff Moyer @ 2011-03-11 14:23 UTC (permalink / raw)
  To: Lukas Czerner
  Cc: linux-fsdevel, linux-kernel, stable, Jens Axboe, Dmitry Monakhov

Lukas Czerner <lczerner@redhat.com> writes:

> BZ29402
> https://bugzilla.kernel.org/show_bug.cgi?id=29402
>
> We can hit serious mis-synchronization in bio completion path of
> blkdev_issue_zeroout() leading to a panic.
>
> The problem is that when we are going to wait_for_completion() in
> blkdev_issue_zeroout() we check if the bb.done equals issued (number of
> submitted bios). If it does, we can skip the wait_for_completition()
> and just out of the function since there is nothing to wait for.
> However, there is a ordering problem because bio_batch_end_io() is
> calling atomic_inc(&bb->done) before complete(), hence it might seem to
> blkdev_issue_zeroout() that all bios has been completed and exit. At
> this point when bio_batch_end_io() is going to call complete(bb->wait),
> bb and wait does not longer exist since it was allocated on stack in
> blkdev_issue_zeroout() ==> panic!
>
> (thread 1)                      (thread 2)
> bio_batch_end_io()              blkdev_issue_zeroout()
>   if(bb) {                      ...
>     if (bb->end_io)             ...
>       bb->end_io(bio, err);     ...
>     atomic_inc(&bb->done);      ...
>     ...                         while (issued != atomic_read(&bb.done))
>     ...                         (let issued == bb.done)
>     ...                         (do the rest of the function)
>     ...                         return ret;
>     complete(bb->wait);
>     ^^^^^^^^
>     panic
>
> We can fix this easily by simplifying bio_batch and completion counting.
>
> Also remove bio_end_io_t *end_io since it is not used.
>
> Signed-off-by: Lukas Czerner <lczerner@redhat.com>
> Reported-by: Eric Whitney <eric.whitney@hp.com>
> Tested-by: Eric Whitney <eric.whitney@hp.com>
> CC: Jens Axboe <axboe@kernel.dk>
> CC: Dmitry Monakhov <dmonakhov@openvz.org>
> CC: Jeff Moyer <jmoyer@redhat.com>
> ---
>  block/blk-lib.c |   19 +++++++------------
>  1 files changed, 7 insertions(+), 12 deletions(-)
>
> diff --git a/block/blk-lib.c b/block/blk-lib.c
> index eec78be..bd3e8df 100644
> --- a/block/blk-lib.c
> +++ b/block/blk-lib.c
> @@ -109,7 +109,6 @@ struct bio_batch
>  	atomic_t 		done;
>  	unsigned long 		flags;
>  	struct completion 	*wait;
> -	bio_end_io_t		*end_io;
>  };
>  
>  static void bio_batch_end_io(struct bio *bio, int err)
> @@ -122,12 +121,9 @@ static void bio_batch_end_io(struct bio *bio, int err)
>  		else
>  			clear_bit(BIO_UPTODATE, &bb->flags);
>  	}
> -	if (bb) {
> -		if (bb->end_io)
> -			bb->end_io(bio, err);
> -		atomic_inc(&bb->done);
> -		complete(bb->wait);
> -	}
> +	if (bb)
> +		if (atomic_dec_and_test(&bb->done))
> +			complete(bb->wait);

I think bb will always be set here, no real need to check.

Anyway, I though I already added my:

Reviewed-by: Jeff Moyer <jmoyer@redhat.com>

to this.  No?

Cheers,
Jeff

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH v2] block: fix mis-synchronisation in blkdev_issue_zeroout()
  2011-03-11 14:23 ` Jeff Moyer
@ 2011-03-11 14:31   ` Lukas Czerner
  2011-03-11 14:36     ` Jens Axboe
  0 siblings, 1 reply; 5+ messages in thread
From: Lukas Czerner @ 2011-03-11 14:31 UTC (permalink / raw)
  To: Jeff Moyer
  Cc: Lukas Czerner, linux-fsdevel, linux-kernel, stable, Jens Axboe,
	Dmitry Monakhov

On Fri, 11 Mar 2011, Jeff Moyer wrote:

> Lukas Czerner <lczerner@redhat.com> writes:
> 
> > BZ29402
> > https://bugzilla.kernel.org/show_bug.cgi?id=29402
> >
> > We can hit serious mis-synchronization in bio completion path of
> > blkdev_issue_zeroout() leading to a panic.
> >
> > The problem is that when we are going to wait_for_completion() in
> > blkdev_issue_zeroout() we check if the bb.done equals issued (number of
> > submitted bios). If it does, we can skip the wait_for_completition()
> > and just out of the function since there is nothing to wait for.
> > However, there is a ordering problem because bio_batch_end_io() is
> > calling atomic_inc(&bb->done) before complete(), hence it might seem to
> > blkdev_issue_zeroout() that all bios has been completed and exit. At
> > this point when bio_batch_end_io() is going to call complete(bb->wait),
> > bb and wait does not longer exist since it was allocated on stack in
> > blkdev_issue_zeroout() ==> panic!
> >
> > (thread 1)                      (thread 2)
> > bio_batch_end_io()              blkdev_issue_zeroout()
> >   if(bb) {                      ...
> >     if (bb->end_io)             ...
> >       bb->end_io(bio, err);     ...
> >     atomic_inc(&bb->done);      ...
> >     ...                         while (issued != atomic_read(&bb.done))
> >     ...                         (let issued == bb.done)
> >     ...                         (do the rest of the function)
> >     ...                         return ret;
> >     complete(bb->wait);
> >     ^^^^^^^^
> >     panic
> >
> > We can fix this easily by simplifying bio_batch and completion counting.
> >
> > Also remove bio_end_io_t *end_io since it is not used.
> >
> > Signed-off-by: Lukas Czerner <lczerner@redhat.com>
> > Reported-by: Eric Whitney <eric.whitney@hp.com>
> > Tested-by: Eric Whitney <eric.whitney@hp.com>
> > CC: Jens Axboe <axboe@kernel.dk>
> > CC: Dmitry Monakhov <dmonakhov@openvz.org>
> > CC: Jeff Moyer <jmoyer@redhat.com>
> > ---
> >  block/blk-lib.c |   19 +++++++------------
> >  1 files changed, 7 insertions(+), 12 deletions(-)
> >
> > diff --git a/block/blk-lib.c b/block/blk-lib.c
> > index eec78be..bd3e8df 100644
> > --- a/block/blk-lib.c
> > +++ b/block/blk-lib.c
> > @@ -109,7 +109,6 @@ struct bio_batch
> >  	atomic_t 		done;
> >  	unsigned long 		flags;
> >  	struct completion 	*wait;
> > -	bio_end_io_t		*end_io;
> >  };
> >  
> >  static void bio_batch_end_io(struct bio *bio, int err)
> > @@ -122,12 +121,9 @@ static void bio_batch_end_io(struct bio *bio, int err)
> >  		else
> >  			clear_bit(BIO_UPTODATE, &bb->flags);
> >  	}
> > -	if (bb) {
> > -		if (bb->end_io)
> > -			bb->end_io(bio, err);
> > -		atomic_inc(&bb->done);
> > -		complete(bb->wait);
> > -	}
> > +	if (bb)
> > +		if (atomic_dec_and_test(&bb->done))
> > +			complete(bb->wait);
> 
> I think bb will always be set here, no real need to check.
> 
> Anyway, I though I already added my:
> 
> Reviewed-by: Jeff Moyer <jmoyer@redhat.com>
> 
> to this.  No?
> 
> Cheers,
> Jeff

Yes, you did and I forgot to add it into the patch. Sorry about that.

Thanks!
-Lukas

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH v2] block: fix mis-synchronisation in  blkdev_issue_zeroout()
  2011-03-11 14:31   ` Lukas Czerner
@ 2011-03-11 14:36     ` Jens Axboe
  0 siblings, 0 replies; 5+ messages in thread
From: Jens Axboe @ 2011-03-11 14:36 UTC (permalink / raw)
  To: Lukas Czerner
  Cc: Jeff Moyer, linux-fsdevel, linux-kernel, stable, Dmitry Monakhov

On 2011-03-11 15:31, Lukas Czerner wrote:
> On Fri, 11 Mar 2011, Jeff Moyer wrote:
> 
>> Lukas Czerner <lczerner@redhat.com> writes:
>>
>>> BZ29402
>>> https://bugzilla.kernel.org/show_bug.cgi?id=29402
>>>
>>> We can hit serious mis-synchronization in bio completion path of
>>> blkdev_issue_zeroout() leading to a panic.
>>>
>>> The problem is that when we are going to wait_for_completion() in
>>> blkdev_issue_zeroout() we check if the bb.done equals issued (number of
>>> submitted bios). If it does, we can skip the wait_for_completition()
>>> and just out of the function since there is nothing to wait for.
>>> However, there is a ordering problem because bio_batch_end_io() is
>>> calling atomic_inc(&bb->done) before complete(), hence it might seem to
>>> blkdev_issue_zeroout() that all bios has been completed and exit. At
>>> this point when bio_batch_end_io() is going to call complete(bb->wait),
>>> bb and wait does not longer exist since it was allocated on stack in
>>> blkdev_issue_zeroout() ==> panic!
>>>
>>> (thread 1)                      (thread 2)
>>> bio_batch_end_io()              blkdev_issue_zeroout()
>>>   if(bb) {                      ...
>>>     if (bb->end_io)             ...
>>>       bb->end_io(bio, err);     ...
>>>     atomic_inc(&bb->done);      ...
>>>     ...                         while (issued != atomic_read(&bb.done))
>>>     ...                         (let issued == bb.done)
>>>     ...                         (do the rest of the function)
>>>     ...                         return ret;
>>>     complete(bb->wait);
>>>     ^^^^^^^^
>>>     panic
>>>
>>> We can fix this easily by simplifying bio_batch and completion counting.
>>>
>>> Also remove bio_end_io_t *end_io since it is not used.
>>>
>>> Signed-off-by: Lukas Czerner <lczerner@redhat.com>
>>> Reported-by: Eric Whitney <eric.whitney@hp.com>
>>> Tested-by: Eric Whitney <eric.whitney@hp.com>
>>> CC: Jens Axboe <axboe@kernel.dk>
>>> CC: Dmitry Monakhov <dmonakhov@openvz.org>
>>> CC: Jeff Moyer <jmoyer@redhat.com>
>>> ---
>>>  block/blk-lib.c |   19 +++++++------------
>>>  1 files changed, 7 insertions(+), 12 deletions(-)
>>>
>>> diff --git a/block/blk-lib.c b/block/blk-lib.c
>>> index eec78be..bd3e8df 100644
>>> --- a/block/blk-lib.c
>>> +++ b/block/blk-lib.c
>>> @@ -109,7 +109,6 @@ struct bio_batch
>>>  	atomic_t 		done;
>>>  	unsigned long 		flags;
>>>  	struct completion 	*wait;
>>> -	bio_end_io_t		*end_io;
>>>  };
>>>  
>>>  static void bio_batch_end_io(struct bio *bio, int err)
>>> @@ -122,12 +121,9 @@ static void bio_batch_end_io(struct bio *bio, int err)
>>>  		else
>>>  			clear_bit(BIO_UPTODATE, &bb->flags);
>>>  	}
>>> -	if (bb) {
>>> -		if (bb->end_io)
>>> -			bb->end_io(bio, err);
>>> -		atomic_inc(&bb->done);
>>> -		complete(bb->wait);
>>> -	}
>>> +	if (bb)
>>> +		if (atomic_dec_and_test(&bb->done))
>>> +			complete(bb->wait);
>>
>> I think bb will always be set here, no real need to check.
>>
>> Anyway, I though I already added my:
>>
>> Reviewed-by: Jeff Moyer <jmoyer@redhat.com>
>>
>> to this.  No?
>>
>> Cheers,
>> Jeff
> 
> Yes, you did and I forgot to add it into the patch. Sorry about that.

No worries, I added it now.

-- 
Jens Axboe

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2011-03-11 14:36 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-03-11  8:06 [PATCH v2] block: fix mis-synchronisation in blkdev_issue_zeroout() Lukas Czerner
2011-03-11  9:29 ` Jens Axboe
2011-03-11 14:23 ` Jeff Moyer
2011-03-11 14:31   ` Lukas Czerner
2011-03-11 14:36     ` Jens Axboe

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).