From: Jan Kara <jack@suse.cz>
To: OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
Cc: Jan Kara <jack@suse.cz>, Fengguang Wu <fengguang.wu@intel.com>,
	viro@zeniv.linux.org.uk, hch@lst.de,
	linux-kernel@vger.kernel.org
Subject: Re: [PATCH] Fix queueing work if !bdi_cap_writeback_dirty()
Date: Mon, 17 Sep 2012 10:48:53 +0200
Message-ID: <20120917084853.GA9150@quack.suse.cz>
In-Reply-To: <87wqzt7drb.fsf@devron.myhome.or.jp>

On Mon 17-09-12 08:24:08, OGAWA Hirofumi wrote:
> Jan Kara <jack@suse.cz> writes:
> 
> >> I'm not sure what you meant though. How is that different from ignoring
> >> WB_SYNC_NONE?
> >   When you completely ignore WB_SYNC_NONE writeback, you'll soon drive the
> > machine close to dirty limits and processes dirtying pages will get
> > throttled. Because flusher threads won't be able to write pages - they
> > do WB_SYNC_NONE writeback when we have too many dirty pages - processes
> > will be throttled until somebody calls sync(1) or someone writes the data
> > for some other reason... So I suspect things won't really work as you
> > expect.
> 
> I think you know how to solve it though. You can add a periodic flush
> in your own task, and you can check bdi->dirty_exceeded in any handler.
  Sure, you can have your private thread. That is possible, but you will
have to duplicate the flusher logic and you will still get odd behavior,
e.g. when your filesystem is on one partition and another filesystem is on
a different partition of the same disk.

> Well, ok. The alternative plan, though a bigger change, is to add a
> handler to the writeback task path. This would be the better way, and the
> core would still be able to request a flush in the usual way (I guess this
> is what you are concerned about). And I believe some FSes could implement
> a simpler and more efficient writeback path.
> 
> But this would look like what reiserfs4 submitted in the past (before
> bdi was introduced), which unfortunately was never accepted.
> 
> Since the situation has changed, would it be accepted now?
> 
> OK, why does my FS require it? Because the basic strategy tries to keep
> the user's view consistent, not only the internal metadata. I.e. it works
> like flushing a snapshot of the user's view.
> 
> So, flushing metadata/data in arbitrary order, as the current writeback
> task does, is unacceptable (except, of course, when the user requests it).
> And the writeback task will never know the FS's correct order.
  OK, thanks for the explanation. Now I understand what you are trying to do.
Would it be enough if you could track dirty inodes inside your filesystem
and provide some callback for the flusher so that you can queue these inodes
in the IO queue?
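
  To illustrate the shape of that idea, here is a rough userspace sketch in
C. It is only a toy model under stated assumptions: every name in it
(toy_inode, toy_fs, toy_mark_dirty, toy_queue_dirty, toy_flusher_run, the
'order' field) is hypothetical and none of it is an existing kernel
interface. The filesystem keeps its own dirty list, and the flusher asks a
callback which inodes to queue, so the filesystem decides the order.

```c
#include <assert.h>
#include <stddef.h>

/* Toy userspace model; all names are hypothetical, not kernel APIs. */
#define MAX_INODES 8

struct toy_inode {
	unsigned long ino;
	unsigned long order;	/* position in the FS-defined flush order */
	int dirty;
};

struct toy_fs {
	struct toy_inode *dirty[MAX_INODES];	/* FS-private dirty list */
	size_t nr_dirty;
};

/* Callback the flusher calls instead of picking inodes on its own. */
typedef size_t (*queue_dirty_fn)(struct toy_fs *fs,
				 struct toy_inode **queue, size_t max);

/* The FS tracks a dirtied inode itself. Returns 0 on success, -1 if full. */
static int toy_mark_dirty(struct toy_fs *fs, struct toy_inode *inode)
{
	if (inode->dirty)
		return 0;
	if (fs->nr_dirty == MAX_INODES)
		return -1;
	inode->dirty = 1;
	fs->dirty[fs->nr_dirty++] = inode;
	return 0;
}

/* FS callback: hand out dirty inodes, lowest 'order' first. */
static size_t toy_queue_dirty(struct toy_fs *fs,
			      struct toy_inode **queue, size_t max)
{
	size_t queued = 0;

	while (queued < max && fs->nr_dirty > 0) {
		size_t best = 0;

		for (size_t i = 1; i < fs->nr_dirty; i++)
			if (fs->dirty[i]->order < fs->dirty[best]->order)
				best = i;
		queue[queued++] = fs->dirty[best];
		/* Remove the chosen entry by moving the last one into its slot. */
		fs->nr_dirty--;
		fs->dirty[best] = fs->dirty[fs->nr_dirty];
	}
	return queued;
}

/* Flusher: asks the FS what to queue, then "writes" in exactly that order. */
static size_t toy_flusher_run(struct toy_fs *fs, queue_dirty_fn cb,
			      unsigned long *written, size_t max)
{
	struct toy_inode *queue[MAX_INODES];
	size_t n;

	assert(cb != NULL);
	if (max > MAX_INODES)
		max = MAX_INODES;
	n = cb(fs, queue, max);
	for (size_t i = 0; i < n; i++) {
		queue[i]->dirty = 0;	/* stand-in for submitting the IO */
		written[i] = queue[i]->ino;
	}
	return n;
}
```

The point of the split is that the flusher still decides when to write and
how much, while the callback alone decides which inodes go into the queue
and in what order.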

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR
