From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail191.messagelabs.com (mail191.messagelabs.com [216.82.242.19]) by kanga.kvack.org (Postfix) with ESMTP id D9A456B0085 for ; Tue, 23 Nov 2010 02:47:50 -0500 (EST) Date: Mon, 22 Nov 2010 23:42:57 -0800 From: Andrew Morton Subject: Re: [RFC 1/2] deactive invalidated pages Message-Id: <20101122234257.f14bad44.akpm@linux-foundation.org> In-Reply-To: References: <20101122143817.E242.A69D9226@jp.fujitsu.com> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: owner-linux-mm@kvack.org To: Minchan Kim Cc: KOSAKI Motohiro , linux-mm , LKML , Peter Zijlstra , Rik van Riel , Johannes Weiner , Nick Piggin List-ID: On Tue, 23 Nov 2010 16:40:03 +0900 Minchan Kim wrote: > Hi KOSAKI, > > 2010/11/23 KOSAKI Motohiro : > >> By Other approach, app developer uses POSIX_FADV_DONTNEED. > >> But it has a problem. If kernel meets page is writing > >> during invalidate_mapping_pages, it can't work. > >> It is very hard for application programmer to use it. > >> Because they always have to sync data before calling > >> fadivse(..POSIX_FADV_DONTNEED) to make sure the pages could > >> be discardable. At last, they can't use deferred write of kernel > >> so that they could see performance loss. > >> (http://insights.oetiker.ch/linux/fadvise.html) > > > > If rsync use the above url patch, we don't need your patch. > > fdatasync() + POSIX_FADV_DONTNEED should work fine. > > It works well. But it needs always fdatasync before calling fadvise. > For small file, it hurt performance since we can't use the deferred write. fdatasync() is (much) better than nothing, but a userspace application which is carefully managing its IO scheduling should use sync_file_range(SYNC_FILE_RANGE_WRITE) to push data at the disk and should then run fadvise(DONTNEED) against the same data a few seconds later, after the IO has completed. That way, the application won't block against the write I/O at all, unless of course someone else is thrashing the disk as well, etc. If the app is doing a lot of file I/O (eg, rsync) then this shouldn't be too hard to arrange. Although the payback will be pretty small unless the IO-intensive process is also compute-intensive at times. And such applications are a) fairly rare and b) poorly designed: shouldn't be doing heavy IO and heavy compute in the same thread! -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Fight unfair telecom policy in Canada: sign http://dissolvethecrtc.ca/ Don't email: email@kvack.org