All of lore.kernel.org
 help / color / mirror / Atom feed
From: Wanpeng Li <liwanp@linux.vnet.ibm.com>
To: Lin Feng <linfeng@cn.fujitsu.com>
Cc: akpm@linux-foundation.org, mgorman@suse.de, bcrl@kvack.org,
	viro@zeniv.linux.org.uk, khlebnikov@openvz.org,
	walken@google.com, kamezawa.hiroyu@jp.fujitsu.com,
	minchan@kernel.org, riel@redhat.com, rientjes@google.com,
	isimatu.yasuaki@jp.fujitsu.com, wency@cn.fujitsu.com,
	laijs@cn.fujitsu.com, jiang.liu@huawei.com, zab@redhat.com,
	jmoyer@redhat.com, linux-mm@kvack.org, linux-aio@kvack.org,
	linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH V2 1/2] mm: hotplug: implement non-movable version of get_user_pages() called get_user_pages_non_movable()
Date: Wed, 20 Feb 2013 19:37:57 +0800	[thread overview]
Message-ID: <20130220113757.GA10124@hacker.(null)> (raw)
In-Reply-To: <1360056113-14294-2-git-send-email-linfeng@cn.fujitsu.com>

On Tue, Feb 05, 2013 at 05:21:52PM +0800, Lin Feng wrote:
>get_user_pages() always tries to allocate pages from movable zone, which is not
> reliable to memory hotremove framework in some case.
>
>This patch introduces a new library function called get_user_pages_non_movable()
> to pin pages only from zone non-movable in memory.
>It's a wrapper of get_user_pages() but it makes sure that all pages come from
>non-movable zone via additional page migration.
>
>Cc: Andrew Morton <akpm@linux-foundation.org>
>Cc: Mel Gorman <mgorman@suse.de>
>Cc: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
>Cc: Yasuaki Ishimatsu <isimatu.yasuaki@jp.fujitsu.com>
>Cc: Jeff Moyer <jmoyer@redhat.com>
>Cc: Minchan Kim <minchan@kernel.org>
>Cc: Zach Brown <zab@redhat.com>
>Reviewed-by: Tang Chen <tangchen@cn.fujitsu.com>
>Reviewed-by: Gu Zheng <guz.fnst@cn.fujitsu.com>
>Signed-off-by: Lin Feng <linfeng@cn.fujitsu.com>
>---
> include/linux/mm.h     |    3 ++
> include/linux/mmzone.h |    4 ++
> mm/memory.c            |   83 ++++++++++++++++++++++++++++++++++++++++++++++++
> mm/page_isolation.c    |    5 +++
> 4 files changed, 95 insertions(+), 0 deletions(-)
>
>diff --git a/include/linux/mm.h b/include/linux/mm.h
>index 12f5a09..3ff9eba 100644
>--- a/include/linux/mm.h
>+++ b/include/linux/mm.h
>@@ -1049,6 +1049,9 @@ int get_user_pages(struct task_struct *tsk, struct mm_struct *mm,
> 			struct page **pages, struct vm_area_struct **vmas);
> int get_user_pages_fast(unsigned long start, int nr_pages, int write,
> 			struct page **pages);
>+int get_user_pages_non_movable(struct task_struct *tsk, struct mm_struct *mm,
>+		unsigned long start, int nr_pages, int write, int force,
>+		struct page **pages, struct vm_area_struct **vmas);
> struct kvec;
> int get_kernel_pages(const struct kvec *iov, int nr_pages, int write,
> 			struct page **pages);
>diff --git a/include/linux/mmzone.h b/include/linux/mmzone.h
>index e25ab6f..1506351 100644
>--- a/include/linux/mmzone.h
>+++ b/include/linux/mmzone.h
>@@ -841,6 +841,10 @@ static inline int is_normal_idx(enum zone_type idx)
> 	return (idx == ZONE_NORMAL);
> }
>
>+static inline int zone_is_movable(struct zone *zone)
>+{
>+	return zone_idx(zone) == ZONE_MOVABLE;
>+}
> /**
>  * is_highmem - helper function to quickly check if a struct zone is a 
>  *              highmem zone or not.  This is an attempt to keep references
>diff --git a/mm/memory.c b/mm/memory.c
>index bb1369f..ede53cc 100644
>--- a/mm/memory.c
>+++ b/mm/memory.c
>@@ -58,6 +58,8 @@
> #include <linux/elf.h>
> #include <linux/gfp.h>
> #include <linux/migrate.h>
>+#include <linux/page-isolation.h>
>+#include <linux/mm_inline.h>
> #include <linux/string.h>
>
> #include <asm/io.h>
>@@ -1995,6 +1997,87 @@ int get_user_pages(struct task_struct *tsk, struct mm_struct *mm,
> }
> EXPORT_SYMBOL(get_user_pages);
>
>+#ifdef CONFIG_MEMORY_HOTREMOVE
>+/**
>+ * It's a wrapper of get_user_pages() but it makes sure that all pages come from
>+ * non-movable zone via additional page migration. It's designed for memory
>+ * hotremove framework.
>+ *
>+ * Currently get_user_pages() always tries to allocate pages from movable zone,
>+ * in some case users of get_user_pages() is easy to pin user pages for a long
>+ *  time(for now we found that pages pinned as aio ring pages is such case),
>+ * which is fatal for memory hotremove framework.
>+ *
>+ * This function first calls get_user_pages() to get the candidate pages, and
>+ * then check to ensure all pages are from non movable zone. Otherwise migrate

How about "Otherwise migrate candidate pages which have already been 
isolated to non movable zone."?

>+ * them to non movable zone, then retry. It will at most retry once.
>+ */
>+int get_user_pages_non_movable(struct task_struct *tsk, struct mm_struct *mm,
>+		unsigned long start, int nr_pages, int write, int force,
>+		struct page **pages, struct vm_area_struct **vmas)
>+{
>+	int ret, i, isolate_err, migrate_pre_flag;
>+	LIST_HEAD(pagelist);
>+
>+retry:
>+	ret = get_user_pages(tsk, mm, start, nr_pages, write, force, pages,
>+				vmas);
>+	if (ret <= 0)
>+		return ret;
>+
>+	isolate_err = 0;
>+	migrate_pre_flag = 0;
>+
>+	for (i = 0; i < ret; i++) {
>+		if (zone_is_movable(page_zone(pages[i]))) {
>+			if (!migrate_pre_flag) {
>+				if (migrate_prep())
>+					goto release_page;
>+				migrate_pre_flag = 1;
>+			}
>+
>+			if (!isolate_lru_page(pages[i])) {
>+				inc_zone_page_state(pages[i], NR_ISOLATED_ANON +
>+						 page_is_file_cache(pages[i]));
>+				list_add_tail(&pages[i]->lru, &pagelist);
>+			} else {
>+				isolate_err = 1;
>+				goto release_page;
>+			}
>+		}
>+	}
>+
>+	/* All pages are non movable, we are done :) */
>+	if (i == ret && list_empty(&pagelist))
>+		return ret;
>+
>+release_page:
>+	/* Undo the effects of former get_user_pages(), we won't pin anything */
>+	release_pages(pages, ret, 1);
>+
>+	if (migrate_pre_flag && !isolate_err) {
>+		ret = migrate_pages(&pagelist, alloc_migrate_target, 1,
>+					false, MIGRATE_SYNC, MR_SYSCALL);
>+		/* Steal pages from non-movable zone successfully? */
>+		if (!ret)
>+			goto retry;
>+	}
>+
>+	putback_lru_pages(&pagelist);
>+	/* Migration failed, we pin 0 page, tell caller the truth */
>+	return 0;
>+}
>+#else
>+inline int get_user_pages_non_movable(struct task_struct *tsk, struct mm_struct *mm,
>+		unsigned long start, int nr_pages, int write, int force,
>+		struct page **pages, struct vm_area_struct **vmas)
>+{
>+	return get_user_pages(tsk, mm, start, nr_pages, write, force, pages,
>+				vmas);
>+}
>+#endif
>+EXPORT_SYMBOL(get_user_pages_non_movable);
>+
> /**
>  * get_dump_page() - pin user page in memory while writing it to core dump
>  * @addr: user address
>diff --git a/mm/page_isolation.c b/mm/page_isolation.c
>index 383bdbb..1b7bd17 100644
>--- a/mm/page_isolation.c
>+++ b/mm/page_isolation.c
>@@ -247,6 +247,9 @@ int test_pages_isolated(unsigned long start_pfn, unsigned long end_pfn,
> 	return ret ? 0 : -EBUSY;
> }
>
>+/**
>+ * @private: 0 means page can be alloced from movable zone, otherwise forbidden
>+ */
> struct page *alloc_migrate_target(struct page *page, unsigned long private,
> 				  int **resultp)
> {
>@@ -254,6 +257,8 @@ struct page *alloc_migrate_target(struct page *page, unsigned long private,
>
> 	if (PageHighMem(page))
> 		gfp_mask |= __GFP_HIGHMEM;
>+	if (unlikely(private != 0))
>+		gfp_mask &= ~__GFP_MOVABLE;
>
> 	return alloc_page(gfp_mask);
> }
>-- 
>1.7.1
>
>--
>To unsubscribe, send a message with 'unsubscribe linux-mm' in
>the body to majordomo@kvack.org.  For more info on Linux MM,
>see: http://www.linux-mm.org/ .
>Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  parent reply	other threads:[~2013-02-20 11:38 UTC|newest]

Thread overview: 104+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-02-05  9:21 [PATCH V2 0/2] mm: hotplug: implement non-movable version of get_user_pages() to kill long-time pin pages Lin Feng
2013-02-05  9:21 ` Lin Feng
2013-02-05  9:21 ` Lin Feng
2013-02-05  9:21 ` [PATCH V2 1/2] mm: hotplug: implement non-movable version of get_user_pages() called get_user_pages_non_movable() Lin Feng
2013-02-05  9:21   ` Lin Feng
2013-02-05 12:01   ` Mel Gorman
2013-02-05 12:01     ` Mel Gorman
2013-02-05 12:01     ` Mel Gorman
2013-02-06  0:42     ` Minchan Kim
2013-02-06  0:42       ` Minchan Kim
2013-02-06  0:52       ` Benjamin LaHaise
2013-02-06  0:52         ` Benjamin LaHaise
2013-02-06  0:52         ` Benjamin LaHaise
2013-02-06  9:56       ` Mel Gorman
2013-02-06  9:56         ` Mel Gorman
2013-02-08  2:32         ` Minchan Kim
2013-02-08  2:32           ` Minchan Kim
2013-02-08  2:32           ` Minchan Kim
2013-05-13  9:11         ` Tang Chen
2013-05-13  9:11           ` Tang Chen
2013-05-13  9:11           ` Tang Chen
2013-05-13  9:19           ` Mel Gorman
2013-05-13  9:19             ` Mel Gorman
2013-05-13  9:19             ` Mel Gorman
2013-05-13 14:37             ` Benjamin LaHaise
2013-05-13 14:37               ` Benjamin LaHaise
2013-05-13 14:37               ` Benjamin LaHaise
2013-05-13 14:54               ` Jeff Moyer
2013-05-13 14:54                 ` Jeff Moyer
2013-05-13 14:54                 ` Jeff Moyer
2013-05-13 15:01                 ` Benjamin LaHaise
2013-05-13 15:01                   ` Benjamin LaHaise
2013-05-13 15:01                   ` Benjamin LaHaise
2013-05-14  1:24                   ` Tang Chen
2013-05-14  1:24                     ` Tang Chen
2013-05-14 13:58                     ` Benjamin LaHaise
2013-05-14 13:58                       ` Benjamin LaHaise
2013-05-14 13:58                       ` Benjamin LaHaise
2013-05-14 15:16                       ` chen tang
2013-05-15  2:09                       ` Tang Chen
2013-05-15  2:09                         ` Tang Chen
2013-05-15  7:21                         ` Tang Chen
2013-05-15  7:21                           ` Tang Chen
2013-05-15  7:21                           ` Tang Chen
2013-05-14  3:55             ` Tang Chen
2013-05-14  3:55               ` Tang Chen
2013-05-14  3:55               ` Tang Chen
2013-05-15 13:24               ` Mel Gorman
2013-05-15 13:24                 ` Mel Gorman
2013-05-16  5:54                 ` Tang Chen
2013-05-16  5:54                   ` Tang Chen
2013-05-17  0:23                   ` [WiP]: aio support for migrating pages (Re: [PATCH V2 1/2] mm: hotplug: implement non-movable version of get_user_pages() called get_user_pages_non_movable()) Benjamin LaHaise
2013-05-17  0:23                     ` Benjamin LaHaise
2013-05-17  0:23                     ` Benjamin LaHaise
2013-05-17  3:28                     ` Tang Chen
2013-05-17  3:28                       ` Tang Chen
2013-05-17  3:28                       ` Tang Chen
2013-05-17 14:37                       ` Benjamin LaHaise
2013-05-17 14:37                         ` Benjamin LaHaise
2013-05-17 14:37                         ` Benjamin LaHaise
2013-05-21  2:07                         ` Tang Chen
2013-05-21  2:07                           ` Tang Chen
2013-05-21  2:27                           ` Benjamin LaHaise
2013-05-21  2:27                             ` Benjamin LaHaise
2013-05-21  2:27                             ` Benjamin LaHaise
2013-06-11  9:42                             ` Tang Chen
2013-06-11  9:42                               ` Tang Chen
2013-06-11  9:42                               ` Tang Chen
2013-06-11 14:45                               ` Benjamin LaHaise
2013-06-11 14:45                                 ` Benjamin LaHaise
2013-06-28  9:24                                 ` Gu Zheng
2013-06-28  9:24                                   ` Gu Zheng
2013-07-01  7:23                                 ` Gu Zheng
2013-07-01  7:23                                   ` Gu Zheng
2013-07-02 18:00                                   ` Benjamin LaHaise
2013-07-02 18:00                                     ` Benjamin LaHaise
2013-07-02 18:00                                     ` Benjamin LaHaise
2013-07-03  1:53                                     ` Gu Zheng
2013-07-03  1:53                                       ` Gu Zheng
2013-07-03  1:53                                       ` Gu Zheng
2013-07-04  6:51                                     ` Gu Zheng
2013-07-04  6:51                                       ` Gu Zheng
2013-07-04  6:51                                       ` Gu Zheng
2013-07-04 11:41                                       ` Benjamin LaHaise
2013-07-04 11:41                                         ` Benjamin LaHaise
2013-07-04 11:41                                         ` Benjamin LaHaise
2013-07-05  3:21                                         ` Gu Zheng
2013-07-05  3:21                                           ` Gu Zheng
2013-07-05  3:21                                           ` Gu Zheng
2013-05-17 18:17                     ` Zach Brown
2013-05-17 18:17                       ` Zach Brown
2013-05-17 18:30                       ` Benjamin LaHaise
2013-05-17 18:30                         ` Benjamin LaHaise
2013-05-17 18:30                         ` Benjamin LaHaise
2013-02-20 11:37   ` Wanpeng Li [this message]
2013-02-20 12:39     ` [PATCH V2 1/2] mm: hotplug: implement non-movable version of get_user_pages() called get_user_pages_non_movable() Lin Feng
2013-02-20 12:39       ` Lin Feng
2013-02-20 12:39       ` Lin Feng
2013-02-20 11:37   ` Wanpeng Li
2013-02-20 11:37     ` Wanpeng Li
2013-02-20 11:37   ` Wanpeng Li
2013-02-20 11:37     ` Wanpeng Li
2013-02-05  9:21 ` [PATCH V2 2/2] fs/aio.c: use get_user_pages_non_movable() to pin ring pages when support memory hotremove Lin Feng
2013-02-05  9:21   ` Lin Feng

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='20130220113757.GA10124@hacker.(null)' \
    --to=liwanp@linux.vnet.ibm.com \
    --cc=akpm@linux-foundation.org \
    --cc=bcrl@kvack.org \
    --cc=isimatu.yasuaki@jp.fujitsu.com \
    --cc=jiang.liu@huawei.com \
    --cc=jmoyer@redhat.com \
    --cc=kamezawa.hiroyu@jp.fujitsu.com \
    --cc=khlebnikov@openvz.org \
    --cc=laijs@cn.fujitsu.com \
    --cc=linfeng@cn.fujitsu.com \
    --cc=linux-aio@kvack.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=mgorman@suse.de \
    --cc=minchan@kernel.org \
    --cc=riel@redhat.com \
    --cc=rientjes@google.com \
    --cc=viro@zeniv.linux.org.uk \
    --cc=walken@google.com \
    --cc=wency@cn.fujitsu.com \
    --cc=zab@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.