From: Johannes Weiner <jweiner@redhat.com>
To: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
Daisuke Nishimura <nishimura@mxp.nes.nec.co.jp>,
Balbir Singh <bsingharora@gmail.com>,
Andrew Brestic <abrestic@google.com>,
Ying Han <yinghan@google.com>, Michal Hocko <mhocko@suse.cz>,
linux-mm@kvack.org, linux-kernel@vger.kernel.org
Subject: Re: [patch] Revert "memcg: add memory.vmscan_stat"
Date: Tue, 30 Aug 2011 09:04:24 +0200 [thread overview]
Message-ID: <20110830070424.GA13061@redhat.com> (raw)
In-Reply-To: <20110830101233.ae416284.kamezawa.hiroyu@jp.fujitsu.com>
On Tue, Aug 30, 2011 at 10:12:33AM +0900, KAMEZAWA Hiroyuki wrote:
> On Mon, 29 Aug 2011 17:51:13 +0200
> Johannes Weiner <jweiner@redhat.com> wrote:
>
> > On Tue, Aug 09, 2011 at 08:33:45AM +0900, KAMEZAWA Hiroyuki wrote:
> > > On Mon, 8 Aug 2011 14:43:33 +0200
> > > Johannes Weiner <jweiner@redhat.com> wrote:
> > >
> > > > On Fri, Jul 22, 2011 at 05:15:40PM +0900, KAMEZAWA Hiroyuki wrote:
> > > > > +When under_hierarchy is added in the tail, the number indicates the
> > > > > +total memcg scan of its children and itself.
> > > >
> > > > In your implementation, statistics are only accounted to the memcg
> > > > triggering the limit and the respectively scanned memcgs.
> > > >
> > > > Consider the following setup:
> > > >
> > > > A
> > > > / \
> > > > B C
> > > > /
> > > > D
> > > >
> > > > If D tries to charge but hits the limit of A, then B's hierarchy
> > > > counters do not reflect the reclaim activity resulting in D.
> > > >
> > > yes, as I expected.
> >
> > Andrew,
> >
> > with a flawed design, the author unwilling to fix it, and two NAKs,
> > can we please revert this before the release?
>
> How about this ?
> @@ -1710,11 +1711,18 @@ static void mem_cgroup_record_scanstat(s
> spin_lock(&memcg->scanstat.lock);
> __mem_cgroup_record_scanstat(memcg->scanstat.stats[context], rec);
> spin_unlock(&memcg->scanstat.lock);
> -
> - memcg = rec->root;
> - spin_lock(&memcg->scanstat.lock);
> - __mem_cgroup_record_scanstat(memcg->scanstat.rootstats[context], rec);
> - spin_unlock(&memcg->scanstat.lock);
> + cgroup = memcg->css.cgroup;
> + do {
> + spin_lock(&memcg->scanstat.lock);
> + __mem_cgroup_record_scanstat(
> + memcg->scanstat.hierarchy_stats[context], rec);
> + spin_unlock(&memcg->scanstat.lock);
> + if (!cgroup->parent)
> + break;
> + cgroup = cgroup->parent;
> + memcg = mem_cgroup_from_cont(cgroup);
> + } while (memcg->use_hierarchy && memcg != rec->root);
Okay, so this looks correct, but it sums up all parents after each
memcg scanned, which could have a performance impact. Usually,
hierarchy statistics are only summed up when a user reads them.
I don't get why this has to be done completely different from the way
we usually do things, without any justification, whatsoever.
Why do you want to pass a recording structure down the reclaim stack?
Why not make it per-cpu counters that are only summed up, together
with the hierarchy values, when someone is actually interested in
them? With an interface like mem_cgroup_count_vm_event(), or maybe
even an extension of that function?
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2011-08-30 7:04 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-07-22 8:15 [PATCH v3] memcg: add memory.vmscan_stat KAMEZAWA Hiroyuki
2011-08-08 12:43 ` Johannes Weiner
2011-08-08 23:33 ` KAMEZAWA Hiroyuki
2011-08-09 8:01 ` Johannes Weiner
2011-08-09 8:01 ` KAMEZAWA Hiroyuki
2011-08-13 1:04 ` Ying Han
2011-08-29 15:51 ` [patch] Revert "memcg: add memory.vmscan_stat" Johannes Weiner
2011-08-30 1:12 ` KAMEZAWA Hiroyuki
2011-08-30 7:04 ` Johannes Weiner [this message]
2011-08-30 7:20 ` KAMEZAWA Hiroyuki
2011-08-30 7:35 ` KAMEZAWA Hiroyuki
2011-08-30 8:42 ` Johannes Weiner
2011-08-30 8:56 ` KAMEZAWA Hiroyuki
2011-08-30 10:17 ` Johannes Weiner
2011-08-30 10:34 ` KAMEZAWA Hiroyuki
2011-08-30 11:03 ` Johannes Weiner
2011-08-30 23:38 ` KAMEZAWA Hiroyuki
2011-08-30 10:38 ` KAMEZAWA Hiroyuki
2011-08-30 11:32 ` Johannes Weiner
2011-08-30 23:29 ` KAMEZAWA Hiroyuki
2011-08-31 6:23 ` Johannes Weiner
2011-08-31 6:30 ` KAMEZAWA Hiroyuki
2011-08-31 8:33 ` Johannes Weiner
2011-09-01 6:05 ` Ying Han
2011-09-01 6:40 ` Johannes Weiner
2011-09-01 7:04 ` Ying Han
2011-09-01 8:27 ` Johannes Weiner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110830070424.GA13061@redhat.com \
--to=jweiner@redhat.com \
--cc=abrestic@google.com \
--cc=akpm@linux-foundation.org \
--cc=bsingharora@gmail.com \
--cc=kamezawa.hiroyu@jp.fujitsu.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mhocko@suse.cz \
--cc=nishimura@mxp.nes.nec.co.jp \
--cc=yinghan@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).