From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from d28relay04.in.ibm.com (d28relay04.in.ibm.com [9.184.220.61]) by e28smtp06.in.ibm.com (8.13.1/8.13.1) with ESMTP id m2HEfAu2002641 for ; Mon, 17 Mar 2008 20:11:10 +0530 Received: from d28av04.in.ibm.com (d28av04.in.ibm.com [9.184.220.66]) by d28relay04.in.ibm.com (8.13.8/8.13.8/NCO v8.7) with ESMTP id m2HEf9Kj1347634 for ; Mon, 17 Mar 2008 20:11:09 +0530 Received: from d28av04.in.ibm.com (loopback [127.0.0.1]) by d28av04.in.ibm.com (8.13.1/8.13.3) with ESMTP id m2HEf9pB032292 for ; Mon, 17 Mar 2008 14:41:09 GMT Message-ID: <47DE82A6.3050604@linux.vnet.ibm.com> Date: Mon, 17 Mar 2008 20:09:34 +0530 From: Balbir Singh Reply-To: balbir@linux.vnet.ibm.com MIME-Version: 1.0 Subject: Re: [RFC][2/3] Account and control virtual address space allocations References: <20080316172942.8812.56051.sendpatchset@localhost.localdomain> <20080316173005.8812.88290.sendpatchset@localhost.localdomain> <47DE57C2.5060206@openvz.org> <47DE640F.3070601@linux.vnet.ibm.com> <47DE66BE.30904@openvz.org> <47DE695D.3080605@linux.vnet.ibm.com> <47DE6B8D.5090302@openvz.org> In-Reply-To: <47DE6B8D.5090302@openvz.org> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: owner-linux-mm@kvack.org Return-Path: To: Pavel Emelyanov Cc: linux-mm@kvack.org, Hugh Dickins , Sudhir Kumar , YAMAMOTO Takashi , Paul Menage , lizf@cn.fujitsu.com, linux-kernel@vger.kernel.org, taka@valinux.co.jp, David Rientjes , Andrew Morton , KAMEZAWA Hiroyuki List-ID: Pavel Emelyanov wrote: > Balbir Singh wrote: >> Pavel Emelyanov wrote: >>> Balbir Singh wrote: >>>> Pavel Emelyanov wrote: >>>>> [snip] >>>>> >>>>>> +int mem_cgroup_update_as(struct mm_struct *mm, long nr_pages) >>>>>> +{ >>>>>> + int ret = 0; >>>>>> + struct mem_cgroup *mem; >>>>>> + if (mem_cgroup_subsys.disabled) >>>>>> + return ret; >>>>>> + >>>>>> + rcu_read_lock(); >>>>>> + mem = rcu_dereference(mm->mem_cgroup); >>>>>> + css_get(&mem->css); >>>>>> + rcu_read_unlock(); >>>>>> + >>>>>> + if (nr_pages > 0) { >>>>>> + if (res_counter_charge(&mem->as_res, (nr_pages * PAGE_SIZE))) >>>>>> + ret = 1; >>>>>> + } else >>>>>> + res_counter_uncharge(&mem->as_res, (-nr_pages * PAGE_SIZE)); >>>>> No, please, no. Let's make two calls - mem_cgroup_charge_as and mem_cgroup_uncharge_as. >>>>> >>>>> [snip] >>>>> >>>> Yes, sure :) >>> Thanks :) >>> >>>>>> @@ -1117,6 +1117,9 @@ munmap_back: >>>>>> } >>>>>> } >>>>>> >>>>>> + if (mem_cgroup_update_as(mm, len >> PAGE_SHIFT)) >>>>>> + return -ENOMEM; >>>>>> + >>>>> Why not use existintg cap_vm_enough_memory and co? >>>>> >>>> I thought about it and almost used may_expand_vm(), but there is a slight catch >>>> there. With cap_vm_enough_memory() or security_vm_enough_memory(), they are >>>> called after total_vm has been calculated. In our case we need to keep the >>>> cgroups equivalent of total_vm up to date, and we do this in mem_cgorup_update_as. >>> So? What prevents us from using these hooks? :) >> 1. We need to account total_vm usage of the task anyway. So why have two places, >> one for accounting and second for control? > > We still have two of them even placing hooks in each place manually. > > Besides, putting the mem_cgroup_(un)charge_as() in these vm hooks will > 1. save the number of places to patch > 2. help keeping memcgroup consistent in case someone adds more places > that expand tasks vm (arches, drivers) - in case we have our hooks > celled from inside vm ones, we won't have to patch more. > I am not sure I understand your proposal. Without manually placing these hooks how do we track 1. When the vm size has increased/decreased 2. In case due to some reason, the call following these hooks fail, how do we undo it, without placing hooks? >> 2. These hooks are activated for conditionally invoked for vma's with VM_ACCOUNT >> set. > > This is a good point against. But, wrt my previous comment, can we handle > this somehow? Not sure I understand -- Warm Regards, Balbir Singh Linux Technology Center IBM, ISTL -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org