From: Jeff Liu <jeff.liu@oracle.com>
To: Michal Hocko <mhocko@suse.cz>
Cc: linux-mm@kvack.org, Glauber Costa <glommer@parallels.com>,
cgroups@vger.kernel.org
Subject: Re: [PATCH v2 5/6] memcg: introduce swap_cgroup_init()/swap_cgroup_free()
Date: Tue, 29 Jan 2013 23:51:42 +0800 [thread overview]
Message-ID: <5107F00E.7070302@oracle.com> (raw)
In-Reply-To: <20130129145639.GG29574@dhcp22.suse.cz>
On 01/29/2013 10:56 PM, Michal Hocko wrote:
> On Mon 28-01-13 18:54:47, Jeff Liu wrote:
>> Introduce swap_cgroup_init()/swap_cgroup_free() to allocate buffers when creating the first
>> non-root memcg and deallocate buffers on the last non-root memcg is gone.
>
> I think this deserves more words ;) At least it would be good to
> describe contexts from which init and free might be called. What are the
> locking rules.
> Also swap_cgroup_destroy sounds more in pair with swap_cgroup_init.
Will improve the comments log as well as fix the naming. Btw, I named
it as swap_cgroup_free() because we have mem_cgroup_free() corresponding
to mem_cgroup_init(). :)
>
> Please add the users of those function here as well. It is much easier
> to review.
Sure.
>
>> Signed-off-by: Jie Liu <jeff.liu@oracle.com>
>> CC: Glauber Costa <glommer@parallels.com>
>> CC: Michal Hocko <mhocko@suse.cz>
>> CC: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
>> CC: Johannes Weiner <hannes@cmpxchg.org>
>> CC: Mel Gorman <mgorman@suse.de>
>> CC: Andrew Morton <akpm@linux-foundation.org>
>> CC: Sha Zhengju <handai.szj@taobao.com>
>>
>> ---
>> include/linux/page_cgroup.h | 12 +++++
>> mm/page_cgroup.c | 108 +++++++++++++++++++++++++++++++++++++++----
>> 2 files changed, 110 insertions(+), 10 deletions(-)
>>
>> diff --git a/include/linux/page_cgroup.h b/include/linux/page_cgroup.h
>> index 777a524..1255cc9 100644
>> --- a/include/linux/page_cgroup.h
>> +++ b/include/linux/page_cgroup.h
>> @@ -113,6 +113,8 @@ extern unsigned short swap_cgroup_record(swp_entry_t ent, unsigned short id);
>> extern unsigned short lookup_swap_cgroup_id(swp_entry_t ent);
>> extern int swap_cgroup_swapon(int type, unsigned long max_pages);
>> extern void swap_cgroup_swapoff(int type);
>> +extern int swap_cgroup_init(void);
>> +extern void swap_cgroup_free(void);
>> #else
>>
>> static inline
>> @@ -138,6 +140,16 @@ static inline void swap_cgroup_swapoff(int type)
>> return;
>> }
>>
>> +static inline int swap_cgroup_init(void)
>> +{
>> + return 0;
>> +}
>> +
>> +static inline void swap_cgroup_free(void)
>> +{
>> + return;
>> +}
>> +
>> #endif /* CONFIG_MEMCG_SWAP */
>>
>> #endif /* !__GENERATING_BOUNDS_H */
>> diff --git a/mm/page_cgroup.c b/mm/page_cgroup.c
>> index 189fbf5..0ebd127 100644
>> --- a/mm/page_cgroup.c
>> +++ b/mm/page_cgroup.c
>> @@ -362,14 +362,28 @@ static int swap_cgroup_prepare(int type)
>> unsigned long idx, max;
>>
>> ctrl = &swap_cgroup_ctrl[type];
>> + if (!ctrl->length) {
>> + /*
>> + * Bypass the buffer allocation if the corresponding swap
>> + * partition/file was turned off.
>> + */
>> + pr_debug("couldn't allocate swap_cgroup on a disabled swap "
>> + "partition or file, index: %d\n", type);
>
> Do we really need to log this? I guess your scenario is:
> swapon part1
> swapon part2
> swapon part3
> swapoff part2
> create first non-root cgroup
Yesss!!
>
> which is perfectly ok and I do not see any reason to log it.
okay, I'll remove this redundant debug info.
>
>> + return 0;
>> + }
>> +
>> ctrl->map = vzalloc(ctrl->length * sizeof(void *));
>> - if (!ctrl->map)
>> + if (!ctrl->map) {
>> + ctrl->length = 0;
>> goto nomem;
>> + }
>>
>> for (idx = 0; idx < ctrl->length; idx++) {
>> page = alloc_page(GFP_KERNEL | __GFP_ZERO);
>> - if (!page)
>> + if (!page) {
>> + ctrl->length = 0;
>> goto not_enough_page;
>> + }
>> ctrl->map[idx] = page;
>> }
>> return 0;
>> @@ -383,6 +397,32 @@ nomem:
>
> ctrl->length = 0 under this label would be probably nicer than keeping
> it at two places.
Will take care of it.
>
>> return -ENOMEM;
>> }
>>
>> +/*
>> + * free buffer for swap_cgroup.
>> + */
>> +static void swap_cgroup_teardown(int type)
>> +{
>> + struct page **map;
>> + unsigned long length;
>> + struct swap_cgroup_ctrl *ctrl;
>> +
>> + ctrl = &swap_cgroup_ctrl[type];
>> + map = ctrl->map;
>> + length = ctrl->length;
>> + ctrl->map = NULL;
>> + ctrl->length = 0;
>> +
>> + if (map) {
>
> allocation path checks for ctrl->length so it would be good to unify
> both. They are handling the same case (gone swap).
yes, will fix it accordingly.
>
>> + unsigned long i;
>> + for (i = 0; i < length; i++) {
>> + struct page *page = map[i];
>> + if (page)
>> + __free_page(page);
>> + }
>> + vfree(map);
>> + }
>> +}
>> +
>> static struct swap_cgroup *lookup_swap_cgroup(swp_entry_t ent,
>> struct swap_cgroup_ctrl **ctrlp)
>> {
>> @@ -474,6 +514,56 @@ unsigned short lookup_swap_cgroup_id(swp_entry_t ent)
>> return sc ? sc->id : 0;
>> }
>>
>> +/*
>> + * Allocate swap cgroup accounting structures when the first non-root
>> + * memcg is created.
>> + */
>> +int swap_cgroup_init(void)
>> +{
>> + unsigned int type;
>> +
>> + if (!do_swap_account)
>> + return 0;
>> +
>> + if (atomic_add_return(1, &memsw_accounting_users) != 1)
>> + return 0;
>> +
>> + mutex_lock(&swap_cgroup_mutex);
>> + for (type = 0; type < nr_swapfiles; type++) {
>> + if (swap_cgroup_prepare(type) < 0) {
>> + mutex_unlock(&swap_cgroup_mutex);
>> + goto nomem;
>> + }
>> + }
>
> You should clean up those types that were successful...
Oops, those types should be deallocated...
Thanks,
-Jeff
>
>> + mutex_unlock(&swap_cgroup_mutex);
>> + return 0;
>> +
>> +nomem:
>> + pr_info("couldn't allocate enough memory for swap_cgroup "
>> + "while creating non-root memcg.\n");
>> + return -ENOMEM;
>> +}
>> +
>> +/*
>> + * Deallocate swap cgroup accounting structures on the last non-root
>> + * memcg removal.
>> + */
>> +void swap_cgroup_free(void)
>> +{
>> + unsigned int type;
>> +
>> + if (!do_swap_account)
>> + return;
>> +
>> + if (atomic_sub_return(1, &memsw_accounting_users))
>> + return;
>> +
>> + mutex_lock(&swap_cgroup_mutex);
>> + for (type = 0; type < nr_swapfiles; type++)
>> + swap_cgroup_teardown(type);
>> + mutex_unlock(&swap_cgroup_mutex);
>> +}
>> +
>> int swap_cgroup_swapon(int type, unsigned long max_pages)
>> {
>> unsigned long length;
>> @@ -482,20 +572,18 @@ int swap_cgroup_swapon(int type, unsigned long max_pages)
>> if (!do_swap_account)
>> return 0;
>>
>> - if (!atomic_read(&memsw_accounting_users))
>> - return 0;
>> -
>> length = DIV_ROUND_UP(max_pages, SC_PER_PAGE);
>>
>> ctrl = &swap_cgroup_ctrl[type];
>> mutex_lock(&swap_cgroup_mutex);
>> ctrl->length = length;
>> spin_lock_init(&ctrl->lock);
>> - if (swap_cgroup_prepare(type)) {
>> - /* memory shortage */
>> - ctrl->length = 0;
>> - mutex_unlock(&swap_cgroup_mutex);
>> - goto nomem;
>> + if (atomic_read(&memsw_accounting_users)) {
>> + if (swap_cgroup_prepare(type)) {
>> + /* memory shortage */
>> + mutex_unlock(&swap_cgroup_mutex);
>> + goto nomem;
>> + }
>> }
>> mutex_unlock(&swap_cgroup_mutex);
>>
>> --
>> 1.7.9.5
>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2013-01-29 15:51 UTC|newest]
Thread overview: 33+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-01-28 10:54 [PATCH v2 0/6] memcg: disable swap cgroup allocation at swapon Jeff Liu
2013-01-28 10:54 ` [PATCH v2 1/6] memcg: refactor swap_cgroup_swapon() Jeff Liu
2013-01-29 9:15 ` Lord Glauber Costa of Sealand
2013-01-29 13:41 ` Michal Hocko
2013-01-28 10:54 ` [PATCH v2 2/6] memcg: bypass swap accounting for the root memcg Jeff Liu
2013-01-29 10:18 ` Lord Glauber Costa of Sealand
2013-01-31 6:18 ` Jeff Liu
2013-01-29 14:13 ` Michal Hocko
2013-01-30 16:01 ` Jeff Liu
2013-01-30 16:29 ` Michal Hocko
2013-01-31 4:00 ` Jeff Liu
2013-01-28 10:54 ` [PATCH v2 3/6] memcg: introduce memsw_accounting_users Jeff Liu
2013-01-29 9:46 ` Lord Glauber Costa of Sealand
2013-01-29 10:52 ` Jeff Liu
2013-01-29 14:26 ` Michal Hocko
2013-01-29 14:24 ` Michal Hocko
2013-01-29 15:16 ` Jeff Liu
2013-01-28 10:54 ` [PATCH v2 4/6] memcg: export nr_swap_files Jeff Liu
2013-01-29 9:47 ` Lord Glauber Costa of Sealand
2013-01-29 14:31 ` Michal Hocko
2013-01-29 15:17 ` Jeff Liu
2013-01-28 10:54 ` [PATCH v2 5/6] memcg: introduce swap_cgroup_init()/swap_cgroup_free() Jeff Liu
2013-01-29 9:57 ` Lord Glauber Costa of Sealand
2013-01-29 10:21 ` Jeff Liu
2013-01-29 14:56 ` Michal Hocko
2013-01-29 15:51 ` Jeff Liu [this message]
2013-01-29 16:09 ` Michal Hocko
2013-01-28 10:54 ` [PATCH v2 6/6] memcg: init/free swap cgroup strucutres upon create/free child memcg Jeff Liu
2013-01-29 9:59 ` Lord Glauber Costa of Sealand
2013-01-29 10:27 ` Jeff Liu
2013-01-29 15:11 ` Michal Hocko
2013-01-29 15:15 ` [PATCH v2 0/6] memcg: disable swap cgroup allocation at swapon Michal Hocko
2013-01-29 16:50 ` Jeff Liu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5107F00E.7070302@oracle.com \
--to=jeff.liu@oracle.com \
--cc=cgroups@vger.kernel.org \
--cc=glommer@parallels.com \
--cc=linux-mm@kvack.org \
--cc=mhocko@suse.cz \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).