From: Jeff Liu <jeff.liu@oracle.com>
To: Michal Hocko <mhocko@suse.cz>
Cc: linux-mm@kvack.org, Glauber Costa <glommer@parallels.com>,
cgroups@vger.kernel.org
Subject: Re: [PATCH v2 5/6] memcg: introduce swap_cgroup_init()/swap_cgroup_free()
Date: Tue, 29 Jan 2013 23:51:42 +0800 [thread overview]
Message-ID: <5107F00E.7070302@oracle.com> (raw)
In-Reply-To: <20130129145639.GG29574@dhcp22.suse.cz>
On 01/29/2013 10:56 PM, Michal Hocko wrote:
> On Mon 28-01-13 18:54:47, Jeff Liu wrote:
>> Introduce swap_cgroup_init()/swap_cgroup_free() to allocate buffers when creating the first
>> non-root memcg and deallocate buffers on the last non-root memcg is gone.
>
> I think this deserves more words ;) At least it would be good to
> describe contexts from which init and free might be called. What are the
> locking rules.
> Also swap_cgroup_destroy sounds more in pair with swap_cgroup_init.
Will improve the comments log as well as fix the naming. Btw, I named
it as swap_cgroup_free() because we have mem_cgroup_free() corresponding
to mem_cgroup_init(). :)
>
> Please add the users of those function here as well. It is much easier
> to review.
Sure.
>
>> Signed-off-by: Jie Liu <jeff.liu@oracle.com>
>> CC: Glauber Costa <glommer@parallels.com>
>> CC: Michal Hocko <mhocko@suse.cz>
>> CC: Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
>> CC: Johannes Weiner <hannes@cmpxchg.org>
>> CC: Mel Gorman <mgorman@suse.de>
>> CC: Andrew Morton <akpm@linux-foundation.org>
>> CC: Sha Zhengju <handai.szj@taobao.com>
>>
>> ---
>> include/linux/page_cgroup.h | 12 +++++
>> mm/page_cgroup.c | 108 +++++++++++++++++++++++++++++++++++++++----
>> 2 files changed, 110 insertions(+), 10 deletions(-)
>>
>> diff --git a/include/linux/page_cgroup.h b/include/linux/page_cgroup.h
>> index 777a524..1255cc9 100644
>> --- a/include/linux/page_cgroup.h
>> +++ b/include/linux/page_cgroup.h
>> @@ -113,6 +113,8 @@ extern unsigned short swap_cgroup_record(swp_entry_t ent, unsigned short id);
>> extern unsigned short lookup_swap_cgroup_id(swp_entry_t ent);
>> extern int swap_cgroup_swapon(int type, unsigned long max_pages);
>> extern void swap_cgroup_swapoff(int type);
>> +extern int swap_cgroup_init(void);
>> +extern void swap_cgroup_free(void);
>> #else
>>
>> static inline
>> @@ -138,6 +140,16 @@ static inline void swap_cgroup_swapoff(int type)
>> return;
>> }
>>
>> +static inline int swap_cgroup_init(void)
>> +{
>> + return 0;
>> +}
>> +
>> +static inline void swap_cgroup_free(void)
>> +{
>> + return;
>> +}
>> +
>> #endif /* CONFIG_MEMCG_SWAP */
>>
>> #endif /* !__GENERATING_BOUNDS_H */
>> diff --git a/mm/page_cgroup.c b/mm/page_cgroup.c
>> index 189fbf5..0ebd127 100644
>> --- a/mm/page_cgroup.c
>> +++ b/mm/page_cgroup.c
>> @@ -362,14 +362,28 @@ static int swap_cgroup_prepare(int type)
>> unsigned long idx, max;
>>
>> ctrl = &swap_cgroup_ctrl[type];
>> + if (!ctrl->length) {
>> + /*
>> + * Bypass the buffer allocation if the corresponding swap
>> + * partition/file was turned off.
>> + */
>> + pr_debug("couldn't allocate swap_cgroup on a disabled swap "
>> + "partition or file, index: %d\n", type);
>
> Do we really need to log this? I guess your scenario is:
> swapon part1
> swapon part2
> swapon part3
> swapoff part2
> create first non-root cgroup
Yesss!!
>
> which is perfectly ok and I do not see any reason to log it.
okay, I'll remove this redundant debug info.
>
>> + return 0;
>> + }
>> +
>> ctrl->map = vzalloc(ctrl->length * sizeof(void *));
>> - if (!ctrl->map)
>> + if (!ctrl->map) {
>> + ctrl->length = 0;
>> goto nomem;
>> + }
>>
>> for (idx = 0; idx < ctrl->length; idx++) {
>> page = alloc_page(GFP_KERNEL | __GFP_ZERO);
>> - if (!page)
>> + if (!page) {
>> + ctrl->length = 0;
>> goto not_enough_page;
>> + }
>> ctrl->map[idx] = page;
>> }
>> return 0;
>> @@ -383,6 +397,32 @@ nomem:
>
> ctrl->length = 0 under this label would be probably nicer than keeping
> it at two places.
Will take care of it.
>
>> return -ENOMEM;
>> }
>>
>> +/*
>> + * free buffer for swap_cgroup.
>> + */
>> +static void swap_cgroup_teardown(int type)
>> +{
>> + struct page **map;
>> + unsigned long length;
>> + struct swap_cgroup_ctrl *ctrl;
>> +
>> + ctrl = &swap_cgroup_ctrl[type];
>> + map = ctrl->map;
>> + length = ctrl->length;
>> + ctrl->map = NULL;
>> + ctrl->length = 0;
>> +
>> + if (map) {
>
> allocation path checks for ctrl->length so it would be good to unify
> both. They are handling the same case (gone swap).
yes, will fix it accordingly.
>
>> + unsigned long i;
>> + for (i = 0; i < length; i++) {
>> + struct page *page = map[i];
>> + if (page)
>> + __free_page(page);
>> + }
>> + vfree(map);
>> + }
>> +}
>> +
>> static struct swap_cgroup *lookup_swap_cgroup(swp_entry_t ent,
>> struct swap_cgroup_ctrl **ctrlp)
>> {
>> @@ -474,6 +514,56 @@ unsigned short lookup_swap_cgroup_id(swp_entry_t ent)
>> return sc ? sc->id : 0;
>> }
>>
>> +/*
>> + * Allocate swap cgroup accounting structures when the first non-root
>> + * memcg is created.
>> + */
>> +int swap_cgroup_init(void)
>> +{
>> + unsigned int type;
>> +
>> + if (!do_swap_account)
>> + return 0;
>> +
>> + if (atomic_add_return(1, &memsw_accounting_users) != 1)
>> + return 0;
>> +
>> + mutex_lock(&swap_cgroup_mutex);
>> + for (type = 0; type < nr_swapfiles; type++) {
>> + if (swap_cgroup_prepare(type) < 0) {
>> + mutex_unlock(&swap_cgroup_mutex);
>> + goto nomem;
>> + }
>> + }
>
> You should clean up those types that were successful...
Oops, those types should be deallocated...
Thanks,
-Jeff
>
>> + mutex_unlock(&swap_cgroup_mutex);
>> + return 0;
>> +
>> +nomem:
>> + pr_info("couldn't allocate enough memory for swap_cgroup "
>> + "while creating non-root memcg.\n");
>> + return -ENOMEM;
>> +}
>> +
>> +/*
>> + * Deallocate swap cgroup accounting structures on the last non-root
>> + * memcg removal.
>> + */
>> +void swap_cgroup_free(void)
>> +{
>> + unsigned int type;
>> +
>> + if (!do_swap_account)
>> + return;
>> +
>> + if (atomic_sub_return(1, &memsw_accounting_users))
>> + return;
>> +
>> + mutex_lock(&swap_cgroup_mutex);
>> + for (type = 0; type < nr_swapfiles; type++)
>> + swap_cgroup_teardown(type);
>> + mutex_unlock(&swap_cgroup_mutex);
>> +}
>> +
>> int swap_cgroup_swapon(int type, unsigned long max_pages)
>> {
>> unsigned long length;
>> @@ -482,20 +572,18 @@ int swap_cgroup_swapon(int type, unsigned long max_pages)
>> if (!do_swap_account)
>> return 0;
>>
>> - if (!atomic_read(&memsw_accounting_users))
>> - return 0;
>> -
>> length = DIV_ROUND_UP(max_pages, SC_PER_PAGE);
>>
>> ctrl = &swap_cgroup_ctrl[type];
>> mutex_lock(&swap_cgroup_mutex);
>> ctrl->length = length;
>> spin_lock_init(&ctrl->lock);
>> - if (swap_cgroup_prepare(type)) {
>> - /* memory shortage */
>> - ctrl->length = 0;
>> - mutex_unlock(&swap_cgroup_mutex);
>> - goto nomem;
>> + if (atomic_read(&memsw_accounting_users)) {
>> + if (swap_cgroup_prepare(type)) {
>> + /* memory shortage */
>> + mutex_unlock(&swap_cgroup_mutex);
>> + goto nomem;
>> + }
>> }
>> mutex_unlock(&swap_cgroup_mutex);
>>
>> --
>> 1.7.9.5
>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2013-01-29 15:51 UTC|newest]
Thread overview: 44+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-01-28 10:54 [PATCH v2 0/6] memcg: disable swap cgroup allocation at swapon Jeff Liu
2013-01-28 10:54 ` [PATCH v2 1/6] memcg: refactor swap_cgroup_swapon() Jeff Liu
[not found] ` <510658E6.9030108-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2013-01-29 9:15 ` Lord Glauber Costa of Sealand
2013-01-29 9:15 ` Lord Glauber Costa of Sealand
2013-01-29 13:41 ` Michal Hocko
2013-01-29 13:41 ` Michal Hocko
2013-01-28 10:54 ` [PATCH v2 2/6] memcg: bypass swap accounting for the root memcg Jeff Liu
2013-01-29 10:18 ` Lord Glauber Costa of Sealand
2013-01-31 6:18 ` Jeff Liu
2013-01-29 14:13 ` Michal Hocko
2013-01-30 16:01 ` Jeff Liu
2013-01-30 16:29 ` Michal Hocko
2013-01-31 4:00 ` Jeff Liu
2013-01-28 10:54 ` [PATCH v2 3/6] memcg: introduce memsw_accounting_users Jeff Liu
[not found] ` <510658F0.9050802-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2013-01-29 9:46 ` Lord Glauber Costa of Sealand
2013-01-29 9:46 ` Lord Glauber Costa of Sealand
2013-01-29 10:52 ` Jeff Liu
2013-01-29 14:26 ` Michal Hocko
2013-01-29 14:24 ` Michal Hocko
2013-01-29 14:24 ` Michal Hocko
2013-01-29 15:16 ` Jeff Liu
2013-01-28 10:54 ` [PATCH v2 4/6] memcg: export nr_swap_files Jeff Liu
[not found] ` <510658F6.4010504-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2013-01-29 9:47 ` Lord Glauber Costa of Sealand
2013-01-29 9:47 ` Lord Glauber Costa of Sealand
2013-01-29 14:31 ` Michal Hocko
2013-01-29 14:31 ` Michal Hocko
2013-01-29 15:17 ` Jeff Liu
2013-01-28 10:54 ` [PATCH v2 5/6] memcg: introduce swap_cgroup_init()/swap_cgroup_free() Jeff Liu
2013-01-29 9:57 ` Lord Glauber Costa of Sealand
2013-01-29 10:21 ` Jeff Liu
[not found] ` <510658F7.6050806-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2013-01-29 14:56 ` Michal Hocko
2013-01-29 14:56 ` Michal Hocko
2013-01-29 15:51 ` Jeff Liu [this message]
[not found] ` <5107F00E.7070302-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2013-01-29 16:09 ` Michal Hocko
2013-01-29 16:09 ` Michal Hocko
2013-01-28 10:54 ` [PATCH v2 6/6] memcg: init/free swap cgroup strucutres upon create/free child memcg Jeff Liu
[not found] ` <510658FC.50009-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2013-01-29 9:59 ` Lord Glauber Costa of Sealand
2013-01-29 9:59 ` Lord Glauber Costa of Sealand
2013-01-29 10:27 ` Jeff Liu
2013-01-29 15:11 ` Michal Hocko
2013-01-29 15:11 ` Michal Hocko
[not found] ` <510658E3.1020306-QHcLZuEGTsvQT0dZR+AlfA@public.gmane.org>
2013-01-29 15:15 ` [PATCH v2 0/6] memcg: disable swap cgroup allocation at swapon Michal Hocko
2013-01-29 15:15 ` Michal Hocko
2013-01-29 16:50 ` Jeff Liu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5107F00E.7070302@oracle.com \
--to=jeff.liu@oracle.com \
--cc=cgroups@vger.kernel.org \
--cc=glommer@parallels.com \
--cc=linux-mm@kvack.org \
--cc=mhocko@suse.cz \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.