From: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
To: Glauber Costa <glommer@parallels.com>
Cc: linux-mm@kvack.org, cgroups@vger.kernel.org,
Tejun Heo <tj@kernel.org>,
Andrew Morton <akpm@linux-foundation.org>,
Michal Hocko <mhocko@suse.cz>,
handai.szj@gmail.com, anton.vorontsov@linaro.org,
Johannes Weiner <hannes@cmpxchg.org>,
Mel Gorman <mgorman@suse.de>
Subject: Re: [PATCH v2 2/5] memcg: provide root figures from system totals
Date: Wed, 06 Mar 2013 09:27:44 +0900 [thread overview]
Message-ID: <51368D80.20701@jp.fujitsu.com> (raw)
In-Reply-To: <1362489058-3455-3-git-send-email-glommer@parallels.com>
(2013/03/05 22:10), Glauber Costa wrote:
> For the root memcg, there is no need to rely on the res_counters if hierarchy
> is enabled The sum of all mem cgroups plus the tasks in root itself, is
> necessarily the amount of memory used for the whole system. Since those figures
> are already kept somewhere anyway, we can just return them here, without too
> much hassle.
>
> Limit and soft limit can't be set for the root cgroup, so they are left at
> RESOURCE_MAX. Failcnt is left at 0, because its actual meaning is how many
> times we failed allocations due to the limit being hit. We will fail
> allocations in the root cgroup, but the limit will never the reason.
>
> Signed-off-by: Glauber Costa <glommer@parallels.com>
> CC: Michal Hocko <mhocko@suse.cz>
> CC: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
> CC: Johannes Weiner <hannes@cmpxchg.org>
> CC: Mel Gorman <mgorman@suse.de>
> CC: Andrew Morton <akpm@linux-foundation.org>
I think this patch's calculation is wrong.
> ---
> mm/memcontrol.c | 64 +++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 1 file changed, 64 insertions(+)
>
> diff --git a/mm/memcontrol.c b/mm/memcontrol.c
> index b8b363f..bfbf1c2 100644
> --- a/mm/memcontrol.c
> +++ b/mm/memcontrol.c
> @@ -4996,6 +4996,56 @@ static inline u64 mem_cgroup_usage(struct mem_cgroup *memcg, bool swap)
> return val << PAGE_SHIFT;
> }
>
> +static u64 memcg_read_root_rss(void)
> +{
> + struct task_struct *p;
> +
> + u64 rss = 0;
> + read_lock(&tasklist_lock);
> + for_each_process(p) {
> + if (!p->mm)
> + continue;
> + task_lock(p);
> + rss += get_mm_rss(p->mm);
> + task_unlock(p);
> + }
> + read_unlock(&tasklist_lock);
> + return rss;
> +}
I think you can use rcu_read_lock() instead of tasklist_lock.
Isn't it enough to use NR_ANON_LRU rather than this ?
> +
> +static u64 mem_cgroup_read_root(enum res_type type, int name)
> +{
> + if (name == RES_LIMIT)
> + return RESOURCE_MAX;
> + if (name == RES_SOFT_LIMIT)
> + return RESOURCE_MAX;
> + if (name == RES_FAILCNT)
> + return 0;
> + if (name == RES_MAX_USAGE)
> + return 0;
> +
> + if (WARN_ON_ONCE(name != RES_USAGE))
> + return 0;
> +
> + switch (type) {
> + case _MEM:
> + return (memcg_read_root_rss() +
> + atomic_long_read(&vm_stat[NR_FILE_PAGES])) << PAGE_SHIFT;
> + case _MEMSWAP: {
> + struct sysinfo i;
> + si_swapinfo(&i);
> +
> + return ((memcg_read_root_rss() +
> + atomic_long_read(&vm_stat[NR_FILE_PAGES])) << PAGE_SHIFT) +
> + i.totalswap - i.freeswap;
How swapcache is handled ? ...and How kmem works with this calc ?
Thanks,
-Kame
> + }
> + case _KMEM:
> + return 0;
> + default:
> + BUG();
> + };
> +}
> +
> static ssize_t mem_cgroup_read(struct cgroup *cont, struct cftype *cft,
> struct file *file, char __user *buf,
> size_t nbytes, loff_t *ppos)
> @@ -5012,6 +5062,19 @@ static ssize_t mem_cgroup_read(struct cgroup *cont, struct cftype *cft,
> if (!do_swap_account && type == _MEMSWAP)
> return -EOPNOTSUPP;
>
> + /*
> + * If we have root-level hierarchy, we can be certain that the charges
> + * in root are always global. We can then bypass the root cgroup
> + * entirely in this case, hopefuly leading to less contention in the
> + * root res_counters. The charges presented after reading it will
> + * always be the global charges.
> + */
> + if (mem_cgroup_disabled() ||
> + (mem_cgroup_is_root(memcg) && memcg->use_hierarchy)) {
> + val = mem_cgroup_read_root(type, name);
> + goto root_bypass;
> + }
> +
> switch (type) {
> case _MEM:
> if (name == RES_USAGE)
> @@ -5032,6 +5095,7 @@ static ssize_t mem_cgroup_read(struct cgroup *cont, struct cftype *cft,
> BUG();
> }
>
> +root_bypass:
> len = scnprintf(str, sizeof(str), "%llu\n", (unsigned long long)val);
> return simple_read_from_buffer(buf, nbytes, ppos, str, len);
> }
>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2013-03-06 0:27 UTC|newest]
Thread overview: 72+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-03-05 13:10 [PATCH v2 0/5] bypass root memcg charges if no memcgs are possible Glauber Costa
2013-03-05 13:10 ` Glauber Costa
[not found] ` <1362489058-3455-1-git-send-email-glommer-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-05 13:10 ` [PATCH v2 1/5] memcg: make nocpu_base available for non hotplug Glauber Costa
2013-03-05 13:10 ` Glauber Costa
[not found] ` <1362489058-3455-2-git-send-email-glommer-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-06 0:04 ` Kamezawa Hiroyuki
2013-03-06 0:04 ` Kamezawa Hiroyuki
2013-03-19 11:07 ` Michal Hocko
2013-03-05 13:10 ` [PATCH v2 2/5] memcg: provide root figures from system totals Glauber Costa
2013-03-05 13:10 ` Glauber Costa
2013-03-06 0:27 ` Kamezawa Hiroyuki [this message]
[not found] ` <51368D80.20701-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2013-03-06 8:30 ` Glauber Costa
2013-03-06 8:30 ` Glauber Costa
[not found] ` <5136FEC2.2050004-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-06 10:45 ` Kamezawa Hiroyuki
2013-03-06 10:45 ` Kamezawa Hiroyuki
[not found] ` <51371E4A.7090807-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2013-03-06 10:52 ` Glauber Costa
2013-03-06 10:52 ` Glauber Costa
[not found] ` <51371FEF.3020507-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-06 10:59 ` Kamezawa Hiroyuki
2013-03-06 10:59 ` Kamezawa Hiroyuki
[not found] ` <513721A5.6080401-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2013-03-13 6:58 ` Sha Zhengju
2013-03-13 6:58 ` Sha Zhengju
[not found] ` <CAFj3OHWm_GjLFwNEE=D69DR-YSF25AZvKTLHpyHq7aYDi12b0g-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-03-13 9:15 ` Kamezawa Hiroyuki
2013-03-13 9:15 ` Kamezawa Hiroyuki
2013-03-13 9:59 ` Sha Zhengju
2013-03-14 0:03 ` Kamezawa Hiroyuki
2013-03-14 0:03 ` Kamezawa Hiroyuki
2013-03-06 10:50 ` Kamezawa Hiroyuki
2013-03-06 10:50 ` Kamezawa Hiroyuki
[not found] ` <1362489058-3455-3-git-send-email-glommer-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-19 12:46 ` Michal Hocko
2013-03-19 12:46 ` Michal Hocko
[not found] ` <20130319124650.GE7869-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2013-03-19 12:55 ` Michal Hocko
2013-03-19 12:55 ` Michal Hocko
2013-03-20 7:03 ` Glauber Costa
[not found] ` <51495F35.9040302-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-20 8:03 ` Michal Hocko
2013-03-20 8:03 ` Michal Hocko
2013-03-20 8:08 ` Glauber Costa
[not found] ` <51496E71.5010707-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-20 8:18 ` Michal Hocko
2013-03-20 8:18 ` Michal Hocko
2013-03-20 8:34 ` Glauber Costa
2013-03-20 8:58 ` Michal Hocko
2013-03-20 9:30 ` Glauber Costa
[not found] ` <514981C3.8070304-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-21 6:08 ` Kamezawa Hiroyuki
2013-03-21 6:08 ` Kamezawa Hiroyuki
2013-03-20 16:40 ` Anton Vorontsov
2013-03-20 16:40 ` Anton Vorontsov
2013-03-20 7:04 ` Glauber Costa
2013-03-05 13:10 ` [PATCH v2 3/5] memcg: make it suck faster Glauber Costa
2013-03-05 13:10 ` Glauber Costa
[not found] ` <1362489058-3455-4-git-send-email-glommer-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-06 0:46 ` Kamezawa Hiroyuki
2013-03-06 0:46 ` Kamezawa Hiroyuki
2013-03-06 8:38 ` Glauber Costa
[not found] ` <5137007E.7030004-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-06 10:54 ` Kamezawa Hiroyuki
2013-03-06 10:54 ` Kamezawa Hiroyuki
2013-03-13 8:08 ` Sha Zhengju
2013-03-13 8:08 ` Sha Zhengju
[not found] ` <CAFj3OHU6f3o5GmbFyUsqtSWqHruSS4Yyodx=s=Vh8mO7GfTE8w-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-03-20 7:13 ` Glauber Costa
2013-03-20 7:13 ` Glauber Costa
2013-03-19 13:58 ` Michal Hocko
2013-03-19 13:58 ` Michal Hocko
2013-03-20 7:00 ` Glauber Costa
[not found] ` <51495E73.8090409-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-20 8:13 ` Michal Hocko
2013-03-20 8:13 ` Michal Hocko
2013-03-05 13:10 ` [PATCH v2 4/5] memcg: do not call page_cgroup_init at system_boot Glauber Costa
2013-03-05 13:10 ` Glauber Costa
[not found] ` <1362489058-3455-5-git-send-email-glommer-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-06 1:07 ` Kamezawa Hiroyuki
2013-03-06 1:07 ` Kamezawa Hiroyuki
[not found] ` <513696C1.3090301-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2013-03-06 8:22 ` Glauber Costa
2013-03-06 8:22 ` Glauber Costa
2013-03-19 14:06 ` Michal Hocko
2013-03-05 13:10 ` [PATCH v2 5/5] memcg: do not walk all the way to the root for memcg Glauber Costa
2013-03-05 13:10 ` Glauber Costa
[not found] ` <1362489058-3455-6-git-send-email-glommer-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
2013-03-06 1:08 ` Kamezawa Hiroyuki
2013-03-06 1:08 ` Kamezawa Hiroyuki
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51368D80.20701@jp.fujitsu.com \
--to=kamezawa.hiroyu@jp.fujitsu.com \
--cc=akpm@linux-foundation.org \
--cc=anton.vorontsov@linaro.org \
--cc=cgroups@vger.kernel.org \
--cc=glommer@parallels.com \
--cc=handai.szj@gmail.com \
--cc=hannes@cmpxchg.org \
--cc=linux-mm@kvack.org \
--cc=mgorman@suse.de \
--cc=mhocko@suse.cz \
--cc=tj@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.