From: Konstantin Khlebnikov <khlebnikov@openvz.org>
To: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: Hugh Dickins <hughd@google.com>,
	"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
	"linux-mm@kvack.org" <linux-mm@kvack.org>,
	Johannes Weiner <hannes@cmpxchg.org>,
	Andrew Morton <akpm@linux-foundation.org>,
	Andi Kleen <andi@firstfloor.org>
Subject: Re: [PATCH v3 00/21] mm: lru_lock splitting
Date: Sat, 25 Feb 2012 09:31:01 +0400
Message-ID: <4F487215.7000307@openvz.org>
In-Reply-To: <20120225111515.1275e04c.kamezawa.hiroyu@jp.fujitsu.com>

KAMEZAWA Hiroyuki wrote:
> On Thu, 23 Feb 2012 17:51:36 +0400
> Konstantin Khlebnikov<khlebnikov@openvz.org>  wrote:
>
>> v3 changes:
>> * inactive-ratio reworked again, now it is always calculated from scratch
>> * hierarchical pte reference bits filter in memory-cgroup reclaimer
>> * fixed two bugs in locking, found by Hugh Dickins
>> * locking functions slightly simplified
>> * new patch for isolated pages accounting
>> * new patch with lru interleaving
>>
>> This patchset is based on next-20120210
>>
>> git: https://github.com/koct9i/linux/commits/lruvec-v3
>>
>
> I wonder.... I just wonder...if we can split a lruvec in a zone into small
> pieces of lruvec and have splitted LRU-lock per them, do we need per-memcg-lrulock ?

What per-memcg lru_lock? This series does not have one.
The last patch splits the lruvecs in each memcg by the same factor.
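
As a rough illustration of what interleaved splitting could look like, here is a
minimal userspace sketch. It is not taken from the patch: the shift value and
the toy_pfn_to_lruvec() helper are hypothetical, and the only assumption is that
contiguous chunks of page frames are spread round-robin over the lruvecs.

```c
#include <assert.h>

/* Hypothetical knob, not from the patchset: 2^9 pages per contiguous chunk. */
#define TOY_INTERLEAVE_SHIFT 9u

/*
 * Map a page frame number to one of nr_lruvecs lru vectors.
 * Neighbouring chunks of 2^TOY_INTERLEAVE_SHIFT pages land in different
 * lruvecs, so each lruvec (and its lock) covers an interleaved slice.
 */
static inline unsigned int toy_pfn_to_lruvec(unsigned long pfn,
                                             unsigned int nr_lruvecs)
{
    assert(nr_lruvecs > 0);
    return (unsigned int)((pfn >> TOY_INTERLEAVE_SHIFT) % nr_lruvecs);
}
```

With the same factor applied to a zone and to a memcg, a page would select the
same index in both, which is the property this sketch is meant to show.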

>
> It seems per-memcg-lrulock can be much bigger lock than small-lruvec-lock.
> (depends on configuration) and much more complicated..and have to take care
> of many things.. If unit of splitting can be specified by boot option,
> it seems admins can split a big memcg's per-memcg-lru lock into more small pieces.

The lruvec count per memcg can be arbitrary, and it can be changed while the cgroup is empty.
This is not in this patchset, but it would be really easy to add.

>
> BTW, how to think of default size of splitting ? I wonder splitting lru into
> the number of cpus per a node can be a choice. Each cpu may have a chance to
> set preferred-pfn-range at page allocation with additional patches.

If we rework the page-to-memcg linking and add a direct lruvec id to page->flags,
we will be able to change the lruvec before inserting a page into the lru.
Thus each cpu will always insert pages into its own lruvec in the zone.
I have not thought about races yet, but this would be a perfect solution.
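
To make the idea concrete, here is a minimal userspace sketch of keeping a
lruvec id in spare bits of a flags word. Everything in it is hypothetical: the
toy_page struct, the 4-bit field at the top of a 64-bit word, and the cpu
modulo mapping are assumptions for illustration, not the layout of the real
page->flags, and it ignores locking and races entirely.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical layout: the top LRUVEC_ID_BITS of the flags word hold the id. */
#define LRUVEC_ID_BITS  4u
#define LRUVEC_ID_SHIFT (64u - LRUVEC_ID_BITS)
#define LRUVEC_ID_MASK  ((UINT64_C(1) << LRUVEC_ID_BITS) - 1)

/* Stand-in for struct page; only the flags word matters here. */
struct toy_page {
    uint64_t flags;
};

/* Read the lruvec id stored in the page flags. */
static inline unsigned int toy_page_lruvec_id(const struct toy_page *page)
{
    return (unsigned int)((page->flags >> LRUVEC_ID_SHIFT) & LRUVEC_ID_MASK);
}

/* Store a new lruvec id without touching the other flag bits. */
static inline void toy_set_page_lruvec_id(struct toy_page *page, unsigned int id)
{
    assert(id <= LRUVEC_ID_MASK);
    page->flags &= ~(LRUVEC_ID_MASK << LRUVEC_ID_SHIFT);
    page->flags |= (uint64_t)id << LRUVEC_ID_SHIFT;
}

/* Pick the lruvec for the inserting cpu, wrapping if there are fewer lruvecs. */
static inline unsigned int toy_cpu_to_lruvec_id(unsigned int cpu,
                                                unsigned int nr_lruvecs)
{
    assert(nr_lruvecs > 0 && nr_lruvecs <= LRUVEC_ID_MASK + 1);
    return cpu % nr_lruvecs;
}
```

The inserting cpu would call toy_set_page_lruvec_id() with its own id before
the page goes onto the lru, so later lookups only need the flags word.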

>
> Thanks,
> -Kame
>
>
>> ---
>>
>> Konstantin Khlebnikov (21):
>>        memcg: unify inactive_ratio calculation
>>        memcg: make mm_match_cgroup() hirarchical
>>        memcg: fix page_referencies cgroup filter on global reclaim
>>        memcg: use vm_swappiness from target memory cgroup
>>        mm: rename lruvec->lists into lruvec->pages_lru
>>        mm: lruvec linking functions
>>        mm: add lruvec->pages_count
>>        mm: unify inactive_list_is_low()
>>        mm: add lruvec->reclaim_stat
>>        mm: kill struct mem_cgroup_zone
>>        mm: move page-to-lruvec translation upper
>>        mm: push lruvec into update_page_reclaim_stat()
>>        mm: push lruvecs from pagevec_lru_move_fn() to iterator
>>        mm: introduce lruvec locking primitives
>>        mm: handle lruvec relocks on lumpy reclaim
>>        mm: handle lruvec relocks in compaction
>>        mm: handle lruvec relock in memory controller
>>        mm: add to lruvec isolated pages counters
>>        memcg: check lru vectors emptiness in pre-destroy
>>        mm: split zone->lru_lock
>>        mm: zone lru vectors interleaving
>>
>>
>>   include/linux/huge_mm.h    |    3
>>   include/linux/memcontrol.h |   75 ------
>>   include/linux/mm.h         |   66 +++++
>>   include/linux/mm_inline.h  |   19 +-
>>   include/linux/mmzone.h     |   39 ++-
>>   include/linux/swap.h       |    6
>>   mm/Kconfig                 |   16 +
>>   mm/compaction.c            |   31 +--
>>   mm/huge_memory.c           |   14 +
>>   mm/internal.h              |  204 +++++++++++++++++
>>   mm/ksm.c                   |    2
>>   mm/memcontrol.c            |  343 +++++++++++-----------------
>>   mm/migrate.c               |    2
>>   mm/page_alloc.c            |   70 +-----
>>   mm/rmap.c                  |    2
>>   mm/swap.c                  |  217 ++++++++++--------
>>   mm/vmscan.c                |  534 ++++++++++++++++++++++++--------------------
>>   mm/vmstat.c                |    6
>>   18 files changed, 932 insertions(+), 717 deletions(-)
>>
>> --
>> Signature
>>
>
