Re: Control page reclaim granularity

All of lore.kernel.org
 help / color / mirror / Atom feed

From: Minchan Kim <minchan@kernel.org>
To: Minchan Kim <minchan@kernel.org>, linux-mm <linux-mm@kvack.org>,
	linux-kernel <linux-kernel@vger.kernel.org>,
	Konstantin Khlebnikov <khlebnikov@openvz.org>,
	riel@redhat.com, kosaki.motohiro@jp.fujitsu.com
Subject: Re: Control page reclaim granularity
Date: Mon, 12 Mar 2012 09:28:06 +0900	[thread overview]
Message-ID: <20120312002806.GA2436@barrios> (raw)
In-Reply-To: <20120308165403.GA10005@gmail.com>

On Fri, Mar 09, 2012 at 12:54:03AM +0800, Zheng Liu wrote:
> Hi Minchan,
> 
> Sorry, I forgot to say that I don't subscribe linux-mm and linux-kernel
> mailing list.  So please Cc me.
> 
> IMHO, maybe we should re-think about how does user use mmap(2).  I
> describe the cases I known in our product system.  They can be
> categorized into two cases.  One is mmaped all data files into memory
> and sometime it uses write(2) to append some data, and another uses
> mmap(2)/munmap(2) and read(2)/write(2) to manipulate the files.  In the
> second case,  the application wants to keep mmaped page into memory and
> let file pages to be reclaimed firstly.  So, IMO, when application uses
> mmap(2) to manipulate files, it is possible to imply that it wants keep
> these mmaped pages into memory and do not be reclaimed.  At least these
> pages do not be reclaimed early than file pages.  I think that maybe we
> can recover that routine and provide a sysctl parameter to let the user
> to set this ratio between mmaped pages and file pages.

I am not convinced why we should handle mapped page specially.
Sometimem, someone may use mmap by reducing buffer copy compared to read system call.
So I think we can't make sure mmaped pages are always win.

My suggestion is that it would be better to declare by user explicitly.
I think we can implement it by madvise and fadvise's WILLNEED option.
Current implementation is just readahead if there isn't a page in memory but I think
we can promote from inactive to active if there is already a page in
memory.

It's more clear and it couldn't be affected by kernel page reclaim algorithm change
like this.

> 
> Regards,
> Zheng

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

WARNING: multiple messages have this Message-ID (diff)

From: Minchan Kim <minchan@kernel.org>
To: Minchan Kim <minchan@kernel.org>, linux-mm <linux-mm@kvack.org>,
	linux-kernel <linux-kernel@vger.kernel.org>,
	Konstantin Khlebnikov <khlebnikov@openvz.org>,
	riel@redhat.com, kosaki.motohiro@jp.fujitsu.com
Subject: Re: Control page reclaim granularity
Date: Mon, 12 Mar 2012 09:28:06 +0900	[thread overview]
Message-ID: <20120312002806.GA2436@barrios> (raw)
In-Reply-To: <20120308165403.GA10005@gmail.com>

On Fri, Mar 09, 2012 at 12:54:03AM +0800, Zheng Liu wrote:
> Hi Minchan,
> 
> Sorry, I forgot to say that I don't subscribe linux-mm and linux-kernel
> mailing list.  So please Cc me.
> 
> IMHO, maybe we should re-think about how does user use mmap(2).  I
> describe the cases I known in our product system.  They can be
> categorized into two cases.  One is mmaped all data files into memory
> and sometime it uses write(2) to append some data, and another uses
> mmap(2)/munmap(2) and read(2)/write(2) to manipulate the files.  In the
> second case,  the application wants to keep mmaped page into memory and
> let file pages to be reclaimed firstly.  So, IMO, when application uses
> mmap(2) to manipulate files, it is possible to imply that it wants keep
> these mmaped pages into memory and do not be reclaimed.  At least these
> pages do not be reclaimed early than file pages.  I think that maybe we
> can recover that routine and provide a sysctl parameter to let the user
> to set this ratio between mmaped pages and file pages.

I am not convinced why we should handle mapped page specially.
Sometimem, someone may use mmap by reducing buffer copy compared to read system call.
So I think we can't make sure mmaped pages are always win.

My suggestion is that it would be better to declare by user explicitly.
I think we can implement it by madvise and fadvise's WILLNEED option.
Current implementation is just readahead if there isn't a page in memory but I think
we can promote from inactive to active if there is already a page in
memory.

It's more clear and it couldn't be affected by kernel page reclaim algorithm change
like this.

> 
> Regards,
> Zheng

next prev parent reply	other threads:[~2012-03-12  0:28 UTC|newest]

Thread overview: 64+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-03-08  7:34 Control page reclaim granularity Zheng Liu
2012-03-08  7:34 ` Zheng Liu
2012-03-08  8:39 ` Greg Thelen
2012-03-08  8:39   ` Greg Thelen
2012-03-08 16:13   ` Zheng Liu
2012-03-08 16:13     ` Zheng Liu
2012-03-08 16:32     ` Zhu Yanhai
2012-03-14  7:19     ` Greg Thelen
2012-03-14  7:19       ` Greg Thelen
2012-03-08  9:35 ` Minchan Kim
2012-03-08  9:35   ` Minchan Kim
2012-03-08 16:54   ` Zheng Liu
2012-03-08 16:54     ` Zheng Liu
2012-03-12  0:28     ` Minchan Kim [this message]
2012-03-12  0:28       ` Minchan Kim
2012-03-12  2:06       ` Fwd: " Zheng Liu
2012-03-12  2:06         ` Zheng Liu
2012-03-12  5:19         ` Minchan Kim
2012-03-12  5:19           ` Minchan Kim
2012-03-12  6:20           ` Konstantin Khlebnikov
2012-03-12  6:20             ` Konstantin Khlebnikov
2012-03-12  8:14             ` Zheng Liu
2012-03-12  8:14               ` Zheng Liu
2012-03-12 13:42               ` Minchan Kim
2012-03-12 13:42                 ` Minchan Kim
2012-03-12 14:18                 ` Konstantin Khlebnikov
2012-03-13  2:48                   ` Minchan Kim
2012-03-13  2:48                     ` Minchan Kim
2012-03-13  4:37                     ` Konstantin Khlebnikov
2012-03-13  4:37                       ` Konstantin Khlebnikov
2012-03-13  5:00                       ` Konstantin Khlebnikov
2012-03-13  5:00                         ` Konstantin Khlebnikov
2012-03-13  6:30                     ` Zheng Liu
2012-03-13  6:30                       ` Zheng Liu
2012-03-13  6:48                       ` Zheng Liu
2012-03-13  6:48                         ` Zheng Liu
2012-03-13  7:21                         ` Konstantin Khlebnikov
2012-03-13  7:21                           ` Konstantin Khlebnikov
2012-03-13  7:43                           ` Kautuk Consul
2012-03-13  7:43                             ` Kautuk Consul
2012-03-13  7:47                             ` Kautuk Consul
2012-03-13  7:47                               ` Kautuk Consul
2012-03-13  8:05                               ` Zheng Liu
2012-03-13  8:05                                 ` Zheng Liu
2012-03-13  8:04                                 ` Kautuk Consul
2012-03-13  8:04                                   ` Kautuk Consul
2012-03-13  8:08                                   ` Kautuk Consul
2012-03-13  8:08                                     ` Kautuk Consul
2012-03-13  8:28                                     ` Zheng Liu
2012-03-13  8:28                                       ` Zheng Liu
2012-03-13  8:36                                       ` Kautuk Consul
2012-03-13  8:36                                         ` Kautuk Consul
2012-03-13  9:03                                         ` Kautuk Consul
2012-03-13  9:03                                           ` Kautuk Consul
2012-03-12 15:15                 ` Zheng Liu
2012-03-12 15:15                   ` Zheng Liu
2012-03-13  2:51                   ` Minchan Kim
2012-03-13  2:51                     ` Minchan Kim
2012-03-12 14:55   ` Rik van Riel
2012-03-12 14:55     ` Rik van Riel
2012-03-13  2:57     ` Minchan Kim
2012-03-13  2:57       ` Minchan Kim
2012-03-13 14:57       ` Rik van Riel
2012-03-13 14:57         ` Rik van Riel

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20120312002806.GA2436@barrios \
    --to=minchan@kernel.org \
    --cc=khlebnikov@openvz.org \
    --cc=kosaki.motohiro@jp.fujitsu.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=riel@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.