From: Jens Axboe <jens.axboe@oracle.com>
To: Artem Bityutskiy <dedekind1@gmail.com>
Cc: linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org,
chris.mason@oracle.com, david@fromorbit.com, hch@infradead.org,
akpm@linux-foundation.org, jack@suse.cz,
yanmin_zhang@linux.intel.com, richard@rsk.demon.co.uk,
damien.wyart@free.fr, fweisbec@gmail.com, Alan.Brunelle@hp.com
Subject: Re: [PATCH 02/10] writeback: switch to per-bdi threads for flushing data
Date: Mon, 6 Jul 2009 15:13:13 +0200 [thread overview]
Message-ID: <20090706131313.GR23611@kernel.dk> (raw)
In-Reply-To: <4A51F443.8070402@gmail.com>
On Mon, Jul 06 2009, Artem Bityutskiy wrote:
> Jens Axboe wrote:
>> +/*
>> + * kupdated() used to do this. We cannot do it from the bdi_forker_task()
>> + * or we risk deadlocking on ->s_umount. The longer term solution would be
>> + * to implement sync_supers_bdi() or similar and simply do it from the
>> + * bdi writeback tasks individually.
>> + */
>> +static int bdi_sync_supers(void *unused)
>> +{
>> + set_user_nice(current, 0);
>> +
>> + while (!kthread_should_stop()) {
>> + set_current_state(TASK_INTERRUPTIBLE);
>> + schedule();
>> +
>> + /*
>> + * Do this periodically, like kupdated() did before.
>> + */
>> + sync_supers();
>> + }
>> +
>> + return 0;
>
> ATM we have one timer for both data and super-block synchronization.
> With per-bdi write-back we have:
>
> 1. one timer for super blocks
> 2. many per-bdi timers for data (schedule_timeout() is essentially
> using timers).
That is correct. Note that these exit when they have been idle for a
while, for embedded and such you could make it more aggressive by
exiting quicker. The sync_supers should be directly fixable by your
sb_dirty() stuff.
So I don't think it's a huge change from what we currently have.
> This is not nice, because each timer is an additional source of
> power-savings killers. I mean, it is more power management (PM)
> friendly to have less timers and disturb CPU less, make CPU wake
> up from retention less frequently.
>
> I do not challange the per-bdi idea at all, but is it possible to
> think about a more PM-friendly desing and have one source of
> periodic write-back, not many. I mean, could there be one timer
> which periodically syncs supers and wakes up the BDI write-back
> tasks?
You could replace the schedule_timeout() by a schedule(), and instead
have a single timer running that would scan the bdi_list and issue the
kupdated() timed writeback that is the reason it uses schedule_timeout()
now. Explicitly issued work will manually wake up the per-bdi thread(s).
That single timer could easily handle waking up bdi_sync_supers() as
well.
--
Jens Axboe
next prev parent reply other threads:[~2009-07-06 13:13 UTC|newest]
Thread overview: 28+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-06-25 10:41 [PATCH 0/10] Per-bdi writeback flusher threads v12 Jens Axboe
2009-06-25 10:41 ` [PATCH 01/10] writeback: move dirty inodes from super_block to backing_dev_info Jens Axboe
2009-06-25 10:41 ` [PATCH 02/10] writeback: switch to per-bdi threads for flushing data Jens Axboe
2009-07-06 12:55 ` Artem Bityutskiy
2009-07-06 13:13 ` Jens Axboe [this message]
2009-07-06 13:26 ` Artem Bityutskiy
2009-07-07 15:27 ` Artem Bityutskiy
2009-07-08 0:57 ` Zhang, Yanmin
2009-07-08 4:47 ` Artem Bityutskiy
2009-06-25 10:41 ` [PATCH 03/10] writeback: get rid of pdflush completely Jens Axboe
2009-06-25 10:41 ` [PATCH 04/10] writeback: separate the flushing state/task from the bdi Jens Axboe
2009-06-25 10:41 ` [PATCH 05/10] writeback: support > 1 flusher thread per bdi Jens Axboe
2009-07-06 12:18 ` Artem Bityutskiy
2009-07-06 12:22 ` Jens Axboe
2009-07-06 13:37 ` Artem Bityutskiy
2009-07-06 13:49 ` Jamie Lokier
2009-07-06 14:11 ` Artem Bityutskiy
2009-07-06 15:43 ` Jens Axboe
2009-06-25 10:41 ` [PATCH 06/10] writeback: allow sleepy exit of default writeback task Jens Axboe
2009-06-25 10:42 ` [PATCH 07/10] writeback: add some debug inode list counters to bdi stats Jens Axboe
2009-06-25 10:42 ` [PATCH 08/10] writeback: add name to backing_dev_info Jens Axboe
2009-06-25 10:42 ` [PATCH 09/10] writeback: check for registered bdi in flusher add and inode dirty Jens Axboe
2009-06-25 10:42 ` [PATCH 10/10] writeback: use spin_trylock() in bdi_writeback_all() for WB_SYNC_NONE Jens Axboe
2009-06-29 8:43 ` [PATCH 0/10] Per-bdi writeback flusher threads v12 Zhang, Yanmin
-- strict thread matches above, loose matches on Subject: below --
2009-08-31 12:14 [PATCH 0/10] Per-bdi writeback flusher threads v14 Jens Axboe
2009-08-31 12:14 ` [PATCH 02/10] writeback: switch to per-bdi threads for flushing data Jens Axboe
2009-08-31 12:58 ` Christoph Hellwig
2009-08-31 17:29 ` Jens Axboe
2009-06-17 11:07 [PATCH 0/10] Per-bdi writeback flusher threads v11 Jens Axboe
2009-06-17 11:07 ` [PATCH 02/10] writeback: switch to per-bdi threads for flushing data Jens Axboe
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090706131313.GR23611@kernel.dk \
--to=jens.axboe@oracle.com \
--cc=Alan.Brunelle@hp.com \
--cc=akpm@linux-foundation.org \
--cc=chris.mason@oracle.com \
--cc=damien.wyart@free.fr \
--cc=david@fromorbit.com \
--cc=dedekind1@gmail.com \
--cc=fweisbec@gmail.com \
--cc=hch@infradead.org \
--cc=jack@suse.cz \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=richard@rsk.demon.co.uk \
--cc=yanmin_zhang@linux.intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).