[PATCH] Fix reiserfs latencies caused by data=ordered

linux-fsdevel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* [PATCH] Fix reiserfs latencies caused by data=ordered
@ 2006-08-04 14:07 Chris Mason
  2006-08-04 23:01 ` Andrew Morton
  0 siblings, 1 reply; 3+ messages in thread
From: Chris Mason @ 2006-08-04 14:07 UTC (permalink / raw)
  To: akpm, reiserfs-dev, linux-fsdevel

From: Chris Maosn <mason@suse.com>
Subject: Fix reiserfs latencies caused by data=ordered

ReiserFS does periodic cleanup of old transactions in order to limit the
length of time a journal replay may take after a crash.  Sometimes, writing
metadata from an old (already committed) transaction may require committing
a newer transaction, which also requires writing all data=ordered buffers.
This can cause very long stalls on journal_begin.

This patch makes sure new transactions will not need to be committed before
trying a periodic reclaim of an old transaction.  It is low risk because
if a bad decision is made, it just means a slightly longer journal
replay after a crash.

Signed-off-by: Chris Mason <mason@suse.com>

--- a/fs/reiserfs/journal.c	Fri Apr 14 13:10:47 2006 -0400
+++ b/fs/reiserfs/journal.c	Sun May 07 22:07:38 2006 -0400
@@ -1189,6 +1189,21 @@ static struct reiserfs_journal_list *fin
 	return NULL;
 }
 
+static int newer_jl_done(struct reiserfs_journal_cnode *cn)
+{
+	struct super_block *sb = cn->sb;
+	b_blocknr_t blocknr = cn->blocknr;
+
+	cn = cn->hprev;
+	while (cn) {
+		if (cn->sb == sb && cn->blocknr == blocknr && cn->jlist &&
+		    atomic_read(&cn->jlist->j_commit_left) != 0)
+				    return 0;
+		cn = cn->hprev;
+	}
+	return 1;
+}
+
 static void remove_journal_hash(struct super_block *,
 				struct reiserfs_journal_cnode **,
 				struct reiserfs_journal_list *, unsigned long,
@@ -1605,6 +1620,31 @@ static int flush_journal_list(struct sup
 		up(&journal->j_flush_sem);
 	put_fs_excl();
 	return err;
+}
+
+static int test_transaction(struct super_block *s,
+                            struct reiserfs_journal_list *jl)
+{
+	struct reiserfs_journal_cnode *cn;
+
+	if (jl->j_len == 0 || atomic_read(&jl->j_nonzerolen) == 0)
+		return 1;
+
+	cn = jl->j_realblock;
+	while (cn) {
+		/* if the blocknr == 0, this has been cleared from the hash,
+		 ** skip it
+		 */
+		if (cn->blocknr == 0) {
+			goto next;
+		}
+		if (cn->bh && !newer_jl_done(cn))
+			return 0;
+	      next:
+		cn = cn->next;
+		cond_resched();
+	}
+	return 0;
 }
 
 static int write_one_transaction(struct super_block *s,
@@ -3436,16 +3476,6 @@ static void flush_async_commits(void *p)
 		flush_commit_list(p_s_sb, jl, 1);
 	}
 	unlock_kernel();
-	/*
-	 * this is a little racey, but there's no harm in missing
-	 * the filemap_fdata_write
-	 */
-	if (!atomic_read(&journal->j_async_throttle)
-	    && !reiserfs_is_journal_aborted(journal)) {
-		atomic_inc(&journal->j_async_throttle);
-		filemap_fdatawrite(p_s_sb->s_bdev->bd_inode->i_mapping);
-		atomic_dec(&journal->j_async_throttle);
-	}
 }
 
 /*
@@ -3847,7 +3877,9 @@ static void flush_old_journal_lists(stru
 		entry = journal->j_journal_list.next;
 		jl = JOURNAL_LIST_ENTRY(entry);
 		/* this check should always be run, to send old lists to disk */
-		if (jl->j_timestamp < (now - (JOURNAL_MAX_TRANS_AGE * 4))) {
+		if (jl->j_timestamp < (now - (JOURNAL_MAX_TRANS_AGE * 4)) &&
+		    atomic_read(&jl->j_commit_left) == 0 &&
+		    test_transaction(s, jl)) {
 			flush_used_journal_lists(s, jl);
 		} else {
 			break;
-- 

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] Fix reiserfs latencies caused by data=ordered
  2006-08-04 14:07 [PATCH] Fix reiserfs latencies caused by data=ordered Chris Mason
@ 2006-08-04 23:01 ` Andrew Morton
  2006-08-07 12:20   ` Chris Mason
  0 siblings, 1 reply; 3+ messages in thread
From: Andrew Morton @ 2006-08-04 23:01 UTC (permalink / raw)
  To: Chris Mason; +Cc: reiserfs-dev, linux-fsdevel

On Fri, 4 Aug 2006 10:07:08 -0400
Chris Mason <mason@suse.com> wrote:

> ReiserFS does periodic cleanup of old transactions in order to limit the
> length of time a journal replay may take after a crash.  Sometimes, writing
> metadata from an old (already committed) transaction may require committing
> a newer transaction, which also requires writing all data=ordered buffers.
> This can cause very long stalls on journal_begin.
> 
> This patch makes sure new transactions will not need to be committed before
> trying a periodic reclaim of an old transaction.  It is low risk because
> if a bad decision is made, it just means a slightly longer journal
> replay after a crash.

So I'm thinking that these:

i_mutex-does-not-need-to-be-locked-in-reiserfs_delete_inode.patch
fix-reiserfs-lock-inversion-of-bkl-vs-inode-semaphore.patch (akpm modified)
reiserfs_write_full_page-should-not-get_block-past-eof.patch

are 2.6.18 material.  What are your thoughts on that?

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] Fix reiserfs latencies caused by data=ordered
  2006-08-04 23:01 ` Andrew Morton
@ 2006-08-07 12:20   ` Chris Mason
  0 siblings, 0 replies; 3+ messages in thread
From: Chris Mason @ 2006-08-07 12:20 UTC (permalink / raw)
  To: Andrew Morton; +Cc: reiserfs-dev, linux-fsdevel

On Fri, Aug 04, 2006 at 04:01:22PM -0700, Andrew Morton wrote:
> On Fri, 4 Aug 2006 10:07:08 -0400
> Chris Mason <mason@suse.com> wrote:
> 
> > ReiserFS does periodic cleanup of old transactions in order to limit the
> > length of time a journal replay may take after a crash.  Sometimes, writing
> > metadata from an old (already committed) transaction may require committing
> > a newer transaction, which also requires writing all data=ordered buffers.
> > This can cause very long stalls on journal_begin.
> > 
> > This patch makes sure new transactions will not need to be committed before
> > trying a periodic reclaim of an old transaction.  It is low risk because
> > if a bad decision is made, it just means a slightly longer journal
> > replay after a crash.
> 
> So I'm thinking that these:
> 
> i_mutex-does-not-need-to-be-locked-in-reiserfs_delete_inode.patch
> fix-reiserfs-lock-inversion-of-bkl-vs-inode-semaphore.patch (akpm modified)
> reiserfs_write_full_page-should-not-get_block-past-eof.patch
> 
> are 2.6.18 material.  What are your thoughts on that?

Ack from me.

-chris


^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2006-08-07 12:20 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-08-04 14:07 [PATCH] Fix reiserfs latencies caused by data=ordered Chris Mason
2006-08-04 23:01 ` Andrew Morton
2006-08-07 12:20   ` Chris Mason

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).