From mboxrd@z Thu Jan  1 00:00:00 1970
From: Jens Axboe <axboe@kernel.dk>
Subject: Re: [PATCH, RFC] simplify writeback thread creation
Date: Thu, 08 Jul 2010 16:59:33 +0200
Message-ID: <4C35E7D5.8020809@kernel.dk>
References: <20100707225242.GA28802@lst.de>  <4C35C292.9020507@kernel.dk> <1278598877.20321.34.camel@localhost>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 7bit
Cc: Christoph Hellwig <hch@lst.de>, linux-fsdevel@vger.kernel.org
To: dedekind1@gmail.com
Return-path: <linux-fsdevel-owner@vger.kernel.org>
Received: from 0122700014.0.fullrate.dk ([95.166.99.235]:46688 "EHLO kernel.dk"
	rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
	id S1754031Ab0GHO7f (ORCPT <rfc822;linux-fsdevel@vger.kernel.org>);
	Thu, 8 Jul 2010 10:59:35 -0400
In-Reply-To: <1278598877.20321.34.camel@localhost>
Sender: linux-fsdevel-owner@vger.kernel.org
List-ID: <linux-fsdevel.vger.kernel.org>

On 2010-07-08 16:21, Artem Bityutskiy wrote:
> On Thu, 2010-07-08 at 14:20 +0200, Jens Axboe wrote:
>> On 2010-07-08 00:52, Christoph Hellwig wrote:
>>> Currently the per-bdi writeback thread is only created when there is
>>> dirty any dirty data on the BDI, and it lazy exists when it's been
>>> unused for some time.
>>>
>>> This leads to some very complex code, and the need to keep a forker
>>> thread around.
>>>
>>> This patch removes all this code and simply creates the thread as part
>>> of the bdi registration.  The downside is that we use up ressoures
>>> for possible unused devices, although that overhead is rather low,
>>> with 8k kernel stack size on x86 and few other, even smaller ressources.
>>>
>>> If the overhead is still considered too much I can look into starting
>>> the thread explicitly instead of as part of the bdi registration, but
>>> that will require a bit of code complexity, too.
>>
>> I'm pretty sure this will come back to bite us in the ass... If we are
>> going to change the lazy create/exit setup, I would greatly prefer
>> doing it at fs mount time (or something to that effect).
> 
> How about not starting any thread at all at the bdi registration time,
> and start a bdi thread only when something for this bdi becomes dirty
> (__mark_inode_dirty()) or a bdi work is queued (bdi_queue_work())? If we
> do this, then the tasks can also die by the 5min timeout, and will be
> forked again when dirt/bdi works arrives?
> 
> I guess it is a bit challenging to start a task in __mark_inode_dirty(),
> whis is supposed to be fast and non-sleeping, but we can just submit a
> work which will start the task.

That work would have to reside on the stack, and __mark_inode_dirty()
block on the thread startup. We can't always do that.

-- 
Jens Axboe