linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
@ 2011-05-05 12:10 Jan Kara
  2011-05-05 13:49 ` Eric Sandeen
  2011-05-08 23:14 ` Ted Ts'o
  0 siblings, 2 replies; 5+ messages in thread
From: Jan Kara @ 2011-05-05 12:10 UTC (permalink / raw)
  To: tytso; +Cc: Tao Ma, linux-ext4, Jan Kara

In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
from shadow state. The waking code in journal_commit_transaction() has
a bug because it does not issue a memory barrier after the buffer is moved
from the shadow state and before wake_up_bit() is called. Thus a waitqueue
check can happen before the buffer is actually moved from the shadow state
and waiting process may never be woken. Fix the problem by issuing proper
barrier.

Reported-by: Tao Ma <boyu.mt@taobao.com>
Signed-off-by: Jan Kara <jack@suse.cz>
---
 fs/jbd2/commit.c |    9 +++++++--
 1 files changed, 7 insertions(+), 2 deletions(-)

 Analogous JBD fix has been queued in my tree...

diff --git a/fs/jbd2/commit.c b/fs/jbd2/commit.c
index 2e5d370..3a958c7 100644
--- a/fs/jbd2/commit.c
+++ b/fs/jbd2/commit.c
@@ -768,8 +768,13 @@ wait_for_iobuf:
                    required. */
 		JBUFFER_TRACE(jh, "file as BJ_Forget");
 		jbd2_journal_file_buffer(jh, commit_transaction, BJ_Forget);
-		/* Wake up any transactions which were waiting for this
-		   IO to complete */
+		/*
+		 * Wake up any transactions which were waiting for this IO to
+		 * complete. The barrier must be here so that changes by
+		 * jbd2_journal_file_buffer() take effect before wake_up_bit()
+		 * does the waitqueue check.
+		 */
+		smp_mb();
 		wake_up_bit(&bh->b_state, BH_Unshadow);
 		JBUFFER_TRACE(jh, "brelse shadowed buffer");
 		__brelse(bh);
-- 
1.7.1


^ permalink raw reply related	[flat|nested] 5+ messages in thread

* Re: [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
  2011-05-05 12:10 [PATCH] jbd2: Fix forever sleeping process in do_get_write_access() Jan Kara
@ 2011-05-05 13:49 ` Eric Sandeen
  2011-05-05 14:11   ` Jan Kara
  2011-05-08 23:14 ` Ted Ts'o
  1 sibling, 1 reply; 5+ messages in thread
From: Eric Sandeen @ 2011-05-05 13:49 UTC (permalink / raw)
  To: Jan Kara; +Cc: tytso, Tao Ma, linux-ext4

On 5/5/11 7:10 AM, Jan Kara wrote:
> In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
> from shadow state. The waking code in journal_commit_transaction() has
> a bug because it does not issue a memory barrier after the buffer is moved
> from the shadow state and before wake_up_bit() is called. Thus a waitqueue
> check can happen before the buffer is actually moved from the shadow state
> and waiting process may never be woken. Fix the problem by issuing proper
> barrier.

needed for jbd/commit.c as well, I guess?

-Eric

> Reported-by: Tao Ma <boyu.mt@taobao.com>
> Signed-off-by: Jan Kara <jack@suse.cz>
> ---
>  fs/jbd2/commit.c |    9 +++++++--
>  1 files changed, 7 insertions(+), 2 deletions(-)
> 
>  Analogous JBD fix has been queued in my tree...
> 
> diff --git a/fs/jbd2/commit.c b/fs/jbd2/commit.c
> index 2e5d370..3a958c7 100644
> --- a/fs/jbd2/commit.c
> +++ b/fs/jbd2/commit.c
> @@ -768,8 +768,13 @@ wait_for_iobuf:
>                     required. */
>  		JBUFFER_TRACE(jh, "file as BJ_Forget");
>  		jbd2_journal_file_buffer(jh, commit_transaction, BJ_Forget);
> -		/* Wake up any transactions which were waiting for this
> -		   IO to complete */
> +		/*
> +		 * Wake up any transactions which were waiting for this IO to
> +		 * complete. The barrier must be here so that changes by
> +		 * jbd2_journal_file_buffer() take effect before wake_up_bit()
> +		 * does the waitqueue check.
> +		 */
> +		smp_mb();
>  		wake_up_bit(&bh->b_state, BH_Unshadow);
>  		JBUFFER_TRACE(jh, "brelse shadowed buffer");
>  		__brelse(bh);


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
  2011-05-05 13:49 ` Eric Sandeen
@ 2011-05-05 14:11   ` Jan Kara
  2011-05-05 14:28     ` Eric Sandeen
  0 siblings, 1 reply; 5+ messages in thread
From: Jan Kara @ 2011-05-05 14:11 UTC (permalink / raw)
  To: Eric Sandeen; +Cc: Jan Kara, tytso, Tao Ma, linux-ext4

On Thu 05-05-11 08:49:14, Eric Sandeen wrote:
> On 5/5/11 7:10 AM, Jan Kara wrote:
> > In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
> > from shadow state. The waking code in journal_commit_transaction() has
> > a bug because it does not issue a memory barrier after the buffer is moved
> > from the shadow state and before wake_up_bit() is called. Thus a waitqueue
> > check can happen before the buffer is actually moved from the shadow state
> > and waiting process may never be woken. Fix the problem by issuing proper
> > barrier.
> 
> needed for jbd/commit.c as well, I guess?
  Yes, I was already queued in my tree. I just sent it to the list as well.

								Honza
> > diff --git a/fs/jbd2/commit.c b/fs/jbd2/commit.c
> > index 2e5d370..3a958c7 100644
> > --- a/fs/jbd2/commit.c
> > +++ b/fs/jbd2/commit.c
> > @@ -768,8 +768,13 @@ wait_for_iobuf:
> >                     required. */
> >  		JBUFFER_TRACE(jh, "file as BJ_Forget");
> >  		jbd2_journal_file_buffer(jh, commit_transaction, BJ_Forget);
> > -		/* Wake up any transactions which were waiting for this
> > -		   IO to complete */
> > +		/*
> > +		 * Wake up any transactions which were waiting for this IO to
> > +		 * complete. The barrier must be here so that changes by
> > +		 * jbd2_journal_file_buffer() take effect before wake_up_bit()
> > +		 * does the waitqueue check.
> > +		 */
> > +		smp_mb();
> >  		wake_up_bit(&bh->b_state, BH_Unshadow);
> >  		JBUFFER_TRACE(jh, "brelse shadowed buffer");
> >  		__brelse(bh);
> 
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
  2011-05-05 14:11   ` Jan Kara
@ 2011-05-05 14:28     ` Eric Sandeen
  0 siblings, 0 replies; 5+ messages in thread
From: Eric Sandeen @ 2011-05-05 14:28 UTC (permalink / raw)
  To: Jan Kara; +Cc: tytso, Tao Ma, linux-ext4

On 5/5/11 9:11 AM, Jan Kara wrote:
> On Thu 05-05-11 08:49:14, Eric Sandeen wrote:
>> On 5/5/11 7:10 AM, Jan Kara wrote:
>>> In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
>>> from shadow state. The waking code in journal_commit_transaction() has
>>> a bug because it does not issue a memory barrier after the buffer is moved
>>> from the shadow state and before wake_up_bit() is called. Thus a waitqueue
>>> check can happen before the buffer is actually moved from the shadow state
>>> and waiting process may never be woken. Fix the problem by issuing proper
>>> barrier.
>>
>> needed for jbd/commit.c as well, I guess?
>   Yes, I was already queued in my tree. I just sent it to the list as well.

sorry, sometimes "we" forget :)

Thanks!
-Eric

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
  2011-05-05 12:10 [PATCH] jbd2: Fix forever sleeping process in do_get_write_access() Jan Kara
  2011-05-05 13:49 ` Eric Sandeen
@ 2011-05-08 23:14 ` Ted Ts'o
  1 sibling, 0 replies; 5+ messages in thread
From: Ted Ts'o @ 2011-05-08 23:14 UTC (permalink / raw)
  To: Jan Kara; +Cc: Tao Ma, linux-ext4

On Thu, May 05, 2011 at 02:10:39PM +0200, Jan Kara wrote:
> In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
> from shadow state. The waking code in journal_commit_transaction() has
> a bug because it does not issue a memory barrier after the buffer is moved
> from the shadow state and before wake_up_bit() is called. Thus a waitqueue
> check can happen before the buffer is actually moved from the shadow state
> and waiting process may never be woken. Fix the problem by issuing proper
> barrier.
> 
> Reported-by: Tao Ma <boyu.mt@taobao.com>
> Signed-off-by: Jan Kara <jack@suse.cz>

Thanks, I've added this to the ext4 tree.  (Currently in the dev
branch, pending testing.)

					- Ted

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2011-05-08 23:14 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-05-05 12:10 [PATCH] jbd2: Fix forever sleeping process in do_get_write_access() Jan Kara
2011-05-05 13:49 ` Eric Sandeen
2011-05-05 14:11   ` Jan Kara
2011-05-05 14:28     ` Eric Sandeen
2011-05-08 23:14 ` Ted Ts'o

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).