linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: Andrew Morton <akpm@linux-foundation.org>
To: Theodore Tso <tytso@mit.edu>
Cc: linux-mm@kvack.org, linux-ext4@vger.kernel.org,
	Arjan van de Ven <arjan@infradead.org>,
	Jens Axboe <jens.axboe@oracle.com>
Subject: Re: [PATCH, RFC] Use WRITE_SYNC in __block_write_full_page() if WBC_SYNC_ALL
Date: Sun, 4 Jan 2009 15:19:27 -0800	[thread overview]
Message-ID: <20090104151927.1f1603c6.akpm@linux-foundation.org> (raw)
In-Reply-To: <20090104224351.GF22958@mit.edu>

On Sun, 4 Jan 2009 17:43:51 -0500 Theodore Tso <tytso@mit.edu> wrote:

> On Sun, Jan 04, 2009 at 02:23:03PM -0800, Andrew Morton wrote:
> > > Following up with an e-mail thread started by Arjan two months ago,
> > > (subject: [PATCH] Give kjournald a IOPRIO_CLASS_RT io priority), I have
> > > a patch, just sent to linux-ext4@vger.kernel.org, which fixes the jbd2
> > > layer to submit journal writes via submit_bh() with WRITE_SYNC.
> > > Hopefully this might be enough of a priority boost so we don't have to
> > > force a higher I/O priority level via a buffer_head flag.  However,
> > > while looking through the code paths, in ordered data mode, we end up
> > > flushing data pages via the page writeback paths on a per-inode basis,
> > > and I noticed that even though we are passing in
> > > wbc.sync_mode=WBC_SYNC_ALL, __block_write_full_page() is using
> > > submit_bh(WRITE, bh) instead of submit_bh(WRITE_SYNC).
> > 
> > But this is all the wrong way to fix the problem, isn't it?
> > 
> > The problem is that at one particular point, the current transaction
> > blocks callers behind the committing transaction's IO completion.
> > 
> > Did anyone look at fixing that?  ISTR concluding that a data copy and
> > shadow-bh arrangement might be needed.
> 
> I haven't had time to really drill down into the jbd code yet, and
> yes, eventually we probably want to do this.

We do.

>  Still, if we are
> submitting I/O which we are going to end up waiting on, we really
> should submit it with WRITE_SYNC, and this patch should optimize
> writes in other situations; for example, if we fsync() a file, we will
> also end up calling block_write_full_page(), and so supplying the
> WRITE_SYNC hint to the block layer would be a Good Thing.

Is it?  WRITE_SYNC means "unplug the queue after this bh/BIO".  By setting
it against every bh, don't we risk the generation of more BIOs and
the loss of merging opportunities?

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2009-01-04 23:20 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <E1LJatq-00061O-0e@closure.thunk.org>
     [not found] ` <20090104142303.98762f81.akpm@linux-foundation.org>
2009-01-04 22:43   ` [PATCH, RFC] Use WRITE_SYNC in __block_write_full_page() if WBC_SYNC_ALL Theodore Tso
2009-01-04 23:19     ` Andrew Morton [this message]
2009-01-05  0:21       ` Theodore Tso
2009-01-05  8:02       ` Jens Axboe

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090104151927.1f1603c6.akpm@linux-foundation.org \
    --to=akpm@linux-foundation.org \
    --cc=arjan@infradead.org \
    --cc=jens.axboe@oracle.com \
    --cc=linux-ext4@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=tytso@mit.edu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).