cgroups.vger.kernel.org archive mirror
From: Glauber Costa <glommer-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>
To: Michal Hocko <mhocko-AlSwsSmVLrQ@public.gmane.org>
Cc: Kamezawa Hiroyuki
	<kamezawa.hiroyu-+CUm20s59erQFUHtdCDX3A@public.gmane.org>,
	Tejun Heo <tj-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>,
	lizefan-hv44wF8Li93QT0dZR+AlfA@public.gmane.org,
	paul-inf54ven1CmVyaH7bEyXVA@public.gmane.org,
	containers-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org,
	cgroups-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	peterz-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org,
	bsingharora-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org,
	hannes-druUgvl0LCNAfugRpC6u6w@public.gmane.org,
	linux-mm-Bw31MaZKKs3YtjvyW6yDsg@public.gmane.org,
	linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Subject: Re: [PATCHSET cgroup/for-3.8] cpuset: decouple cpuset locking from cgroup core
Date: Fri, 30 Nov 2012 13:42:28 +0400
Message-ID: <50B87F84.7040206@parallels.com>
In-Reply-To: <20121130092435.GD29317-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>

On 11/30/2012 01:24 PM, Michal Hocko wrote:
> On Fri 30-11-12 13:00:36, Glauber Costa wrote:
>> On 11/30/2012 07:21 AM, Kamezawa Hiroyuki wrote:
>>> (2012/11/29 6:34), Tejun Heo wrote:
>>>> Hello, guys.
>>>>
>>>> Depending on cgroup core locking - cgroup_mutex - is messy and makes
>>>> cgroup prone to locking dependency problems.  The current code already
>>>> has a lock dependency loop - memcg nests get_online_cpus() inside
>>>> cgroup_mutex, while cpuset nests them the other way around.
>>>>
>>>> Regardless of the locking details, whatever is protecting cgroup has
>>>> inherently to be something outer to most other locking constructs.
>>>> cgroup calls into a lot of major subsystems which in turn have to
>>>> perform subsystem-specific locking.  Trying to nest cgroup
>>>> synchronization inside other locks isn't something which can work
>>>> well.
>>>>
>>>> cgroup now has enough API to allow subsystems to implement their own
>>>> locking and cgroup_mutex is scheduled to be made private to cgroup
>>>> core.  This patchset makes cpuset implement its own locking instead of
>>>> relying on cgroup_mutex.
>>>>
>>>> cpuset is rather nasty in this respect.  Some of it seems to have come
>>>> from the implementation history - cgroup core grew out of cpuset - but
>>>> a big part stems from cpuset's need to migrate tasks to an ancestor
>>>> cgroup when a hotunplug event makes a cpuset empty (w/o any cpu or
>>>> memory).
>>>>
>>>> This patchset decouples cpuset locking from cgroup_mutex.  After the
>>>> patchset, cpuset uses cpuset-specific cpuset_mutex instead of
>>>> cgroup_mutex.  This also removes the lockdep warning triggered during
>>>> cpu offlining (see 0009).
>>>>
>>>> Note that this leaves memcg as the only external user of cgroup_mutex.
>>>> Michal, Kame, can you guys please convert memcg to use its own locking
>>>> too?
>>>>
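The dependency loop described above (one path takes the hotplug lock inside cgroup_mutex, another takes them in the opposite order) can be illustrated with a minimal user-space sketch. Everything here is hypothetical: `enum lock_class`, `struct lock_order` and `lock_order_record()` are invented for illustration and are not kernel or lockdep APIs. It only catches a direct two-lock inversion, not longer cycles.

```c
#include <stdbool.h>
#include <string.h>

/* Hypothetical lock classes, named after the two locks discussed above. */
enum lock_class { CGROUP_MUTEX, CPU_HOTPLUG, NR_LOCK_CLASSES };

struct lock_order {
	/* after[a][b] is true once b has been taken while a was held */
	bool after[NR_LOCK_CLASSES][NR_LOCK_CLASSES];
};

static void lock_order_init(struct lock_order *lo)
{
	memset(lo, 0, sizeof(*lo));
}

/*
 * Record that "inner" is acquired while "outer" is held.
 * Returns false if the opposite nesting was recorded earlier, i.e. the
 * two code paths together form an ABBA dependency loop.
 */
static bool lock_order_record(struct lock_order *lo,
			      enum lock_class outer, enum lock_class inner)
{
	if (lo->after[inner][outer])
		return false;
	lo->after[outer][inner] = true;
	return true;
}
```

Recording the memcg-style nesting (CPU_HOTPLUG inside CGROUP_MUTEX) succeeds; recording the cpuset-style nesting afterwards returns false, which is the kind of inversion a lock validator would report.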
>>>
>>> Hmm, let me see... at a quick glance cgroup_lock() is used at
>>>   hierarchy policy change
>>>   kmem_limit
>>>   migration policy change
>>>   swappiness change
>>>   oom control
>>>
>>> Because all of the above care about changes in the hierarchy,
>>> having a new memcg mutex in ->create() may be a way.
>>>
>>> Ah, hm, Costa is mentioning task-attach. Is the task-attach problem in memcg?
>>>
>>
>> We disallow the kmem limit to be set if a task already exists in the
>> cgroup. So we can't allow a new task to attach if we are setting the limit.
> 
> This is racy without additional locking, isn't it?
> 
Speaking of which: Tejun's tree still lacks the kmem bits. How hard would
it be for you to merge his branch into a temporary branch of your tree?
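
The attach-versus-limit race raised above can be sketched in user space, assuming a single per-group mutex that both paths take. This is only a model under that assumption: `struct group`, `group_set_kmem_limit()` and `group_attach_task()` are hypothetical names, not memcg code, and pthreads stands in for kernel locking.

```c
#include <errno.h>
#include <pthread.h>

/* Hypothetical stand-in for a memcg; not the kernel's struct mem_cgroup. */
struct group {
	pthread_mutex_t lock;	/* serializes limit setting against attach */
	int nr_tasks;
	long kmem_limit;	/* 0 means no limit has been set */
};

static void group_init(struct group *g)
{
	pthread_mutex_init(&g->lock, NULL);
	g->nr_tasks = 0;
	g->kmem_limit = 0;
}

/* Refuse to set the limit once a task exists in the group. */
static int group_set_kmem_limit(struct group *g, long limit)
{
	int ret = 0;

	pthread_mutex_lock(&g->lock);
	if (g->nr_tasks > 0)
		ret = -EBUSY;
	else
		g->kmem_limit = limit;
	pthread_mutex_unlock(&g->lock);
	return ret;
}

/* Attach takes the same lock, so it cannot slip in between check and set. */
static void group_attach_task(struct group *g)
{
	pthread_mutex_lock(&g->lock);
	g->nr_tasks++;
	pthread_mutex_unlock(&g->lock);
}
```

Without the shared lock, an attach could land between the nr_tasks check and the limit store, which is the window being discussed.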


