From mboxrd@z Thu Jan 1 00:00:00 1970
From: Peter Zijlstra
Subject: Re: [PATCH 8/8] vm: Add an tuning knob for vm.max_writeback_mb
Date: Thu, 24 Sep 2009 17:38:16 +0200
Message-ID: <1253806696.18939.40.camel@laptop>
References: <1252425983.7746.120.camel@twins>
 <20090908162936.GA2975@think>
 <1252428983.7746.140.camel@twins>
 <20090908172842.GC2975@think>
 <1252431974.7746.151.camel@twins>
 <1252432501.7746.156.camel@twins>
 <1252434746.7035.7.camel@laptop>
 <20090909142315.GA7949@duck.suse.cz>
 <1252597750.7205.82.camel@laptop>
 <20090914111721.GA24075@duck.suse.cz>
 <20090924083342.GA15918@localhost>
Mime-Version: 1.0
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
Cc: Jan Kara, Chris Mason, Artem Bityutskiy, Jens Axboe,
 "linux-kernel@vger.kernel.org", "linux-fsdevel@vger.kernel.org",
 "david@fromorbit.com", "hch@infradead.org",
 "akpm@linux-foundation.org", Theodore Ts'o
To: Wu Fengguang
Return-path:
In-Reply-To: <20090924083342.GA15918@localhost>
Sender: linux-kernel-owner@vger.kernel.org
List-Id: linux-fsdevel.vger.kernel.org

On Thu, 2009-09-24 at 16:33 +0800, Wu Fengguang wrote:
> Yeah, FIFO queuing should be good enough.
>
> I'd like to propose one more data structure for evaluation :)
>
> - bdi->throttle_lock
> - bdi->throttle_list    pages to sync for each waiting task, taken
>                         from sync_writeback_pages()
> - bdi->throttle_pages   (counted down) pages to sync for the head
>                         task, shall be atomic_t
>
> In balance_dirty_pages(), it would do
>
>         nr_to_sync = sync_writeback_pages()
>         if (list_empty(bdi->throttle_list))  # I'm the only task
>                 bdi->throttle_pages = nr_to_sync
>         append nr_to_sync to bdi->throttle_list
>         kick off background writeback
>         wait
>         remove itself from bdi->throttle_list and wait list
>         set bdi->throttle_pages for new head task (or LONG_MAX)
>
> In __bdi_writeout_inc(), it would do
>
>         if (--bdi->throttle_pages <= 0)
>                 check and wake up head task
>
> In wb_writeback(), it would do
>
>         if (args->for_background && exiting)
>                 wake up all throttled tasks
>
> To prevent wake up too many tasks at the same time, it can relax the
> background threshold a bit, so that __bdi_writeout_inc() become the
> only wake up point in normal cases.
>
>         if (args->for_background && !list_empty(bdi->throttle_list) &&
>                 over background_thresh - background_thresh / 32)
>                 keep write pages;

Right, something like that ought to work well, or at least sounds like
worth a try ;-)
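
For illustration only, the quoted scheme could be modelled in user space
roughly as below. This is a single-threaded sketch, not kernel code:
struct bdi_model, MAX_WAITERS, nr_dirty and all function names are made
up for the example, bdi->throttle_lock and the real wait queue are left
out, and "wake up the head task" is collapsed into removing the head
entry and returning 1.

```c
#include <assert.h>
#include <limits.h>
#include <stddef.h>

#define MAX_WAITERS 16  /* hypothetical bound, only for this model */

/* Model of the proposed per-bdi throttle state (names are assumptions). */
struct bdi_model {
	long throttle_list[MAX_WAITERS]; /* FIFO ring: pages to sync per waiter */
	size_t head, count;
	long throttle_pages;             /* countdown for the head waiter */
};

static void bdi_init(struct bdi_model *bdi)
{
	bdi->head = 0;
	bdi->count = 0;
	bdi->throttle_pages = LONG_MAX;  /* nobody waiting */
}

/* balance_dirty_pages() side: queue a waiter; returns -1 if the ring is full. */
static int throttle_enqueue(struct bdi_model *bdi, long nr_to_sync)
{
	assert(nr_to_sync > 0);
	if (bdi->count == MAX_WAITERS)
		return -1;
	if (bdi->count == 0)             /* "I'm the only task" */
		bdi->throttle_pages = nr_to_sync;
	bdi->throttle_list[(bdi->head + bdi->count) % MAX_WAITERS] = nr_to_sync;
	bdi->count++;
	return 0;
}

/* Woken waiter leaves the list; arm the countdown for the new head. */
static void throttle_dequeue_head(struct bdi_model *bdi)
{
	if (bdi->count == 0)
		return;
	bdi->head = (bdi->head + 1) % MAX_WAITERS;
	bdi->count--;
	bdi->throttle_pages = bdi->count ? bdi->throttle_list[bdi->head]
					 : LONG_MAX;
}

/* __bdi_writeout_inc() side: one page completed; returns 1 on a wakeup. */
static int writeout_inc(struct bdi_model *bdi)
{
	if (--bdi->throttle_pages > 0)
		return 0;
	if (bdi->count == 0) {           /* the "check" part: no waiter */
		bdi->throttle_pages = LONG_MAX;
		return 0;
	}
	throttle_dequeue_head(bdi);
	return 1;
}

/* wb_writeback() side: relaxed background threshold while tasks wait. */
static int keep_writing(const struct bdi_model *bdi, long nr_dirty,
			long background_thresh)
{
	return bdi->count > 0 &&
	       nr_dirty > background_thresh - background_thresh / 32;
}
```

With two waiters queued for 2 and 3 pages, writeout_inc() in this model
reports a wakeup on the 2nd call and again on the 5th, after which
throttle_pages is back at LONG_MAX.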