From: Michal Hocko <mhocko@suse.cz>
To: Rik van Riel <riel@redhat.com>
Cc: Ying Han <yinghan@google.com>,
Johannes Weiner <hannes@cmpxchg.org>, Mel Gorman <mel@csn.ul.ie>,
KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>,
Hillf Danton <dhillf@gmail.com>, Hugh Dickins <hughd@google.com>,
KOSAKI Motohiro <kosaki.motohiro@gmail.com>,
Andrew Morton <akpm@linux-foundation.org>,
linux-mm@kvack.org
Subject: Re: [PATCH V7 2/2] mm: memcg detect no memcgs above softlimit under zone reclaim
Date: Mon, 6 Aug 2012 17:11:16 +0200 [thread overview]
Message-ID: <20120806151115.GA4850@dhcp22.suse.cz> (raw)
In-Reply-To: <501FD44D.40205@redhat.com>
On Mon 06-08-12 10:27:25, Rik van Riel wrote:
> On 08/06/2012 10:03 AM, Michal Hocko wrote:
> >On Wed 01-08-12 16:10:32, Rik van Riel wrote:
> >>On 08/01/2012 03:04 PM, Ying Han wrote:
> >>
> >>>That is true. Hmm, then two things i can do:
> >>>
> >>>1. for kswapd case, make sure not counting the root cgroup
> >>>2. or check nr_scanned. I like the nr_scanned which is telling us
> >>>whether or not the reclaim ever make any attempt ?
> >>
> >>I am looking at a more advanced case of (3) right
> >>now. Once I have the basics working, I will send
> >>you a prototype (that applies on top of your patches)
> >>to play with.
> >>
> >>Basically, for every LRU in the system, we can keep
> >>track of 4 things:
> >>- reclaim_stat->recent_scanned
> >>- reclaim_stat->recent_rotated
> >>- reclaim_stat->recent_pressure
> >>- LRU size
> >>
> >>The first two represent the fraction of pages on the
> >>list that are actively used. The larger the fraction
> >>of recently used pages, the more valuable the cache
> >>is. The inverse of that can be used to show us how
> >>hard to reclaim this cache, compared to other caches
> >>(everything else being equal).
> >>
> >>The recent pressure can be used to keep track of how
> >>many pages we have scanned on each LRU list recently.
> >>Pressure is scaled with LRU size.
> >>
> >>This would be the basic formula to decide which LRU
> >>to reclaim from:
> >>
> >> recent_scanned LRU size
> >>score = -------------- * ----------------
> >> recent_rotated recent_pressure
> >>
> >>
> >>In other words, the less the objects on an LRU are
> >>used, the more we should reclaim from that LRU. The
> >>larger an LRU is, the more we should reclaim from
> >>that LRU.
> >
> >The formula makes sense but I am afraid that it will be hard to tune it
> >into something that wouldn't regress. For example I have seen workloads
> >which had many small groups which are used to wrap up backup jobs and
> >those are scanned a lot, you would see also many rotations because of
> >the writeback but those are definitely good to scan rather than a large
> >group which needs to keep its data resident.
>
> Writeback rotations are not counted in
> lruvec->reclaim_stat->recent_rotated - only the rotations
> that were done because we really want to keep the page are
> counted.
OK. I missed that.
> >Anyway, I am not saying the score approach is a bad idea but I am afraid
> >it will be hard to validate and make it right.
>
> One thing about the recent_scanned / recent_rotated metric is
> that we have been using it since 2.6.28, to balance between
> scanning the file and anonymous LRUs.
>
> I believe it would help us balance between multiple sets of
> LRUs, too.
>
> >>The more we have already scanned an LRU, the lower
> >>its score becomes. At some point, another LRU will
> >>have the top score, and that will be the target to
> >>scan.
> >
> >So you think we shouldn't do the full round over memcgs in shrink_zone a
> >and rather do it oom way to pick up a victim and hammer it?
>
> Not hammer it too far. Only until its score ends up well
> below (25% lower?) than that of the second highest scoring
> list.
>
> That way all the lists get hammered a little bit, in turn.
How do we provide the soft limit guarantee then?
[...]
--
Michal Hocko
SUSE Labs
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2012-08-06 15:11 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-07-30 22:32 [PATCH V7 2/2] mm: memcg detect no memcgs above softlimit under zone reclaim Ying Han
2012-07-31 15:59 ` Michal Hocko
2012-07-31 16:07 ` Rik van Riel
2012-07-31 17:52 ` Ying Han
2012-07-31 17:54 ` Ying Han
2012-07-31 20:02 ` Michal Hocko
2012-07-31 20:59 ` Ying Han
2012-08-01 8:45 ` Michal Hocko
2012-08-01 19:04 ` Ying Han
2012-08-01 20:10 ` Rik van Riel
2012-08-02 0:09 ` Ying Han
2012-08-02 0:43 ` Rik van Riel
2012-08-06 14:03 ` Michal Hocko
2012-08-06 14:27 ` Rik van Riel
2012-08-06 15:11 ` Michal Hocko [this message]
2012-08-06 18:51 ` Rik van Riel
2012-08-06 21:18 ` Ying Han
2012-08-06 22:54 ` Rik van Riel
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20120806151115.GA4850@dhcp22.suse.cz \
--to=mhocko@suse.cz \
--cc=akpm@linux-foundation.org \
--cc=dhillf@gmail.com \
--cc=hannes@cmpxchg.org \
--cc=hughd@google.com \
--cc=kamezawa.hiroyu@jp.fujitsu.com \
--cc=kosaki.motohiro@gmail.com \
--cc=linux-mm@kvack.org \
--cc=mel@csn.ul.ie \
--cc=riel@redhat.com \
--cc=yinghan@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).