* [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
@ 2011-05-05 12:10 Jan Kara
2011-05-05 13:49 ` Eric Sandeen
2011-05-08 23:14 ` Ted Ts'o
0 siblings, 2 replies; 5+ messages in thread
From: Jan Kara @ 2011-05-05 12:10 UTC (permalink / raw)
To: tytso; +Cc: Tao Ma, linux-ext4, Jan Kara
In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
from shadow state. The waking code in journal_commit_transaction() has
a bug because it does not issue a memory barrier after the buffer is moved
from the shadow state and before wake_up_bit() is called. Thus a waitqueue
check can happen before the buffer is actually moved from the shadow state
and waiting process may never be woken. Fix the problem by issuing proper
barrier.
Reported-by: Tao Ma <boyu.mt@taobao.com>
Signed-off-by: Jan Kara <jack@suse.cz>
---
fs/jbd2/commit.c | 9 +++++++--
1 files changed, 7 insertions(+), 2 deletions(-)
Analogous JBD fix has been queued in my tree...
diff --git a/fs/jbd2/commit.c b/fs/jbd2/commit.c
index 2e5d370..3a958c7 100644
--- a/fs/jbd2/commit.c
+++ b/fs/jbd2/commit.c
@@ -768,8 +768,13 @@ wait_for_iobuf:
required. */
JBUFFER_TRACE(jh, "file as BJ_Forget");
jbd2_journal_file_buffer(jh, commit_transaction, BJ_Forget);
- /* Wake up any transactions which were waiting for this
- IO to complete */
+ /*
+ * Wake up any transactions which were waiting for this IO to
+ * complete. The barrier must be here so that changes by
+ * jbd2_journal_file_buffer() take effect before wake_up_bit()
+ * does the waitqueue check.
+ */
+ smp_mb();
wake_up_bit(&bh->b_state, BH_Unshadow);
JBUFFER_TRACE(jh, "brelse shadowed buffer");
__brelse(bh);
--
1.7.1
^ permalink raw reply related [flat|nested] 5+ messages in thread* Re: [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
2011-05-05 12:10 [PATCH] jbd2: Fix forever sleeping process in do_get_write_access() Jan Kara
@ 2011-05-05 13:49 ` Eric Sandeen
2011-05-05 14:11 ` Jan Kara
2011-05-08 23:14 ` Ted Ts'o
1 sibling, 1 reply; 5+ messages in thread
From: Eric Sandeen @ 2011-05-05 13:49 UTC (permalink / raw)
To: Jan Kara; +Cc: tytso, Tao Ma, linux-ext4
On 5/5/11 7:10 AM, Jan Kara wrote:
> In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
> from shadow state. The waking code in journal_commit_transaction() has
> a bug because it does not issue a memory barrier after the buffer is moved
> from the shadow state and before wake_up_bit() is called. Thus a waitqueue
> check can happen before the buffer is actually moved from the shadow state
> and waiting process may never be woken. Fix the problem by issuing proper
> barrier.
needed for jbd/commit.c as well, I guess?
-Eric
> Reported-by: Tao Ma <boyu.mt@taobao.com>
> Signed-off-by: Jan Kara <jack@suse.cz>
> ---
> fs/jbd2/commit.c | 9 +++++++--
> 1 files changed, 7 insertions(+), 2 deletions(-)
>
> Analogous JBD fix has been queued in my tree...
>
> diff --git a/fs/jbd2/commit.c b/fs/jbd2/commit.c
> index 2e5d370..3a958c7 100644
> --- a/fs/jbd2/commit.c
> +++ b/fs/jbd2/commit.c
> @@ -768,8 +768,13 @@ wait_for_iobuf:
> required. */
> JBUFFER_TRACE(jh, "file as BJ_Forget");
> jbd2_journal_file_buffer(jh, commit_transaction, BJ_Forget);
> - /* Wake up any transactions which were waiting for this
> - IO to complete */
> + /*
> + * Wake up any transactions which were waiting for this IO to
> + * complete. The barrier must be here so that changes by
> + * jbd2_journal_file_buffer() take effect before wake_up_bit()
> + * does the waitqueue check.
> + */
> + smp_mb();
> wake_up_bit(&bh->b_state, BH_Unshadow);
> JBUFFER_TRACE(jh, "brelse shadowed buffer");
> __brelse(bh);
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
2011-05-05 13:49 ` Eric Sandeen
@ 2011-05-05 14:11 ` Jan Kara
2011-05-05 14:28 ` Eric Sandeen
0 siblings, 1 reply; 5+ messages in thread
From: Jan Kara @ 2011-05-05 14:11 UTC (permalink / raw)
To: Eric Sandeen; +Cc: Jan Kara, tytso, Tao Ma, linux-ext4
On Thu 05-05-11 08:49:14, Eric Sandeen wrote:
> On 5/5/11 7:10 AM, Jan Kara wrote:
> > In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
> > from shadow state. The waking code in journal_commit_transaction() has
> > a bug because it does not issue a memory barrier after the buffer is moved
> > from the shadow state and before wake_up_bit() is called. Thus a waitqueue
> > check can happen before the buffer is actually moved from the shadow state
> > and waiting process may never be woken. Fix the problem by issuing proper
> > barrier.
>
> needed for jbd/commit.c as well, I guess?
Yes, I was already queued in my tree. I just sent it to the list as well.
Honza
> > diff --git a/fs/jbd2/commit.c b/fs/jbd2/commit.c
> > index 2e5d370..3a958c7 100644
> > --- a/fs/jbd2/commit.c
> > +++ b/fs/jbd2/commit.c
> > @@ -768,8 +768,13 @@ wait_for_iobuf:
> > required. */
> > JBUFFER_TRACE(jh, "file as BJ_Forget");
> > jbd2_journal_file_buffer(jh, commit_transaction, BJ_Forget);
> > - /* Wake up any transactions which were waiting for this
> > - IO to complete */
> > + /*
> > + * Wake up any transactions which were waiting for this IO to
> > + * complete. The barrier must be here so that changes by
> > + * jbd2_journal_file_buffer() take effect before wake_up_bit()
> > + * does the waitqueue check.
> > + */
> > + smp_mb();
> > wake_up_bit(&bh->b_state, BH_Unshadow);
> > JBUFFER_TRACE(jh, "brelse shadowed buffer");
> > __brelse(bh);
>
--
Jan Kara <jack@suse.cz>
SUSE Labs, CR
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
2011-05-05 14:11 ` Jan Kara
@ 2011-05-05 14:28 ` Eric Sandeen
0 siblings, 0 replies; 5+ messages in thread
From: Eric Sandeen @ 2011-05-05 14:28 UTC (permalink / raw)
To: Jan Kara; +Cc: tytso, Tao Ma, linux-ext4
On 5/5/11 9:11 AM, Jan Kara wrote:
> On Thu 05-05-11 08:49:14, Eric Sandeen wrote:
>> On 5/5/11 7:10 AM, Jan Kara wrote:
>>> In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
>>> from shadow state. The waking code in journal_commit_transaction() has
>>> a bug because it does not issue a memory barrier after the buffer is moved
>>> from the shadow state and before wake_up_bit() is called. Thus a waitqueue
>>> check can happen before the buffer is actually moved from the shadow state
>>> and waiting process may never be woken. Fix the problem by issuing proper
>>> barrier.
>>
>> needed for jbd/commit.c as well, I guess?
> Yes, I was already queued in my tree. I just sent it to the list as well.
sorry, sometimes "we" forget :)
Thanks!
-Eric
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] jbd2: Fix forever sleeping process in do_get_write_access()
2011-05-05 12:10 [PATCH] jbd2: Fix forever sleeping process in do_get_write_access() Jan Kara
2011-05-05 13:49 ` Eric Sandeen
@ 2011-05-08 23:14 ` Ted Ts'o
1 sibling, 0 replies; 5+ messages in thread
From: Ted Ts'o @ 2011-05-08 23:14 UTC (permalink / raw)
To: Jan Kara; +Cc: Tao Ma, linux-ext4
On Thu, May 05, 2011 at 02:10:39PM +0200, Jan Kara wrote:
> In do_get_write_access() we wait on BH_Unshadow bit for buffer to get
> from shadow state. The waking code in journal_commit_transaction() has
> a bug because it does not issue a memory barrier after the buffer is moved
> from the shadow state and before wake_up_bit() is called. Thus a waitqueue
> check can happen before the buffer is actually moved from the shadow state
> and waiting process may never be woken. Fix the problem by issuing proper
> barrier.
>
> Reported-by: Tao Ma <boyu.mt@taobao.com>
> Signed-off-by: Jan Kara <jack@suse.cz>
Thanks, I've added this to the ext4 tree. (Currently in the dev
branch, pending testing.)
- Ted
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2011-05-08 23:14 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-05-05 12:10 [PATCH] jbd2: Fix forever sleeping process in do_get_write_access() Jan Kara
2011-05-05 13:49 ` Eric Sandeen
2011-05-05 14:11 ` Jan Kara
2011-05-05 14:28 ` Eric Sandeen
2011-05-08 23:14 ` Ted Ts'o
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).