Linux Container Development
 help / color / mirror / Atom feed
  • [parent not found: <1357248967-24959-11-git-send-email-tj@kernel.org>]
  • [parent not found: <50E93554.3070102@huawei.com>]
  • * [PATCHSET] cpuset: decouple cpuset locking from cgroup core, take#2
    @ 2013-01-03 21:35 Tejun Heo
      0 siblings, 0 replies; 24+ messages in thread
    From: Tejun Heo @ 2013-01-03 21:35 UTC (permalink / raw)
      To: lizefan-hv44wF8Li93QT0dZR+AlfA, paul-inf54ven1CmVyaH7bEyXVA,
    	glommer-bzQdu9zFT3WakBO8gow8eQ
      Cc: containers-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA,
    	linux-kernel-u79uwXL29TY76Z2rM5mHXA, mhocko-AlSwsSmVLrQ,
    	linux-mm-Bw31MaZKKs3YtjvyW6yDsg, hannes-druUgvl0LCNAfugRpC6u6w,
    	cgroups-u79uwXL29TY76Z2rM5mHXA
    
    Hello, guys.
    
    This is the second attempt at decoupling cpuset locking from cgroup
    core.  Changes from the last take[L] are
    
    * cpuset-drop-async_rebuild_sched_domains.patch moved from 0007 to
      0009.  This reordering makes cpu hotplug handling async first and
      removes the temporary cyclic locking dependency.
    
    * 0006-cpuset-cleanup-cpuset-_can-_attach.patch no longer converts
      cpumask_var_t to cpumask_t as per Rusty Russell.
    
    * 0008-cpuset-don-t-nest-cgroup_mutex-inside-get_online_cpu.patch now
      synchronously rebuilds sched domains from cpu hotplug callback.
      This fixes various issues caused by confused scheduler puttings
      tasks into a dead cpu including the RCU stall problem reported by Li
      Zefan.
    
    Original patchset description follows.
    
    Depending on cgroup core locking - cgroup_mutex - is messy and makes
    cgroup prone to locking dependency problems.  The current code already
    has lock dependency loop - memcg nests get_online_cpus() inside
    cgroup_mutex.  cpuset the other way around.
    
    Regardless of the locking details, whatever is protecting cgroup has
    inherently to be something outer to most other locking constructs.
    cgroup calls into a lot of major subsystems which in turn have to
    perform subsystem-specific locking.  Trying to nest cgroup
    synchronization inside other locks isn't something which can work
    well.
    
    cgroup now has enough API to allow subsystems to implement their own
    locking and cgroup_mutex is scheduled to be made private to cgroup
    core.  This patchset makes cpuset implement its own locking instead of
    relying on cgroup_mutex.
    
    cpuset is rather nasty in this respect.  Some of it seems to have come
    from the implementation history - cgroup core grew out of cpuset - but
    big part stems from cpuset's need to migrate tasks to an ancestor
    cgroup when an hotunplug event makes a cpuset empty (w/o any cpu or
    memory).
    
    This patchset decouples cpuset locking from cgroup_mutex.  After the
    patchset, cpuset uses cpuset-specific cpuset_mutex instead of
    cgroup_mutex.  This also removes the lockdep warning triggered during
    cpu offlining (see 0009).
    
    Note that this leaves memcg as the only external user of cgroup_mutex.
    Michal, Kame, can you guys please convert memcg to use its own locking
    too?
    
    This patchset contains the following thirteen patches.
    
     0001-cpuset-remove-unused-cpuset_unlock.patch
     0002-cpuset-remove-fast-exit-path-from-remove_tasks_in_em.patch
     0003-cpuset-introduce-css_on-offline.patch
     0004-cpuset-introduce-CS_ONLINE.patch
     0005-cpuset-introduce-cpuset_for_each_child.patch
     0006-cpuset-cleanup-cpuset-_can-_attach.patch
     0007-cpuset-reorganize-CPU-memory-hotplug-handling.patch
     0008-cpuset-don-t-nest-cgroup_mutex-inside-get_online_cpu.patch
     0009-cpuset-drop-async_rebuild_sched_domains.patch
     0010-cpuset-make-CPU-memory-hotplug-propagation-asynchron.patch
     0011-cpuset-pin-down-cpus-and-mems-while-a-task-is-being-.patch
     0012-cpuset-schedule-hotplug-propagation-from-cpuset_atta.patch
     0013-cpuset-replace-cgroup_mutex-locking-with-cpuset-inte.patch
    
    0001-0006 are prep patches.
    
    0007-0009 make cpuset nest get_online_cpus() inside cgroup_mutex, not
    the other way around.
    
    0010-0012 plug holes which would be exposed by switching to
    cpuset-specific locking.
    
    0013 replaces cgroup_mutex with cpuset_mutex.
    
    This patchset is on top of v3.8-rc2 (d1c3ed669a) and also available in
    the following git branch.
    
     git://git.kernel.org/pub/scm/linux/kernel/git/tj/cgroup.git review-cpuset-locking
    
    diffstat follows.
    
     kernel/cpuset.c |  760 ++++++++++++++++++++++++++++++++------------------------
     1 file changed, 438 insertions(+), 322 deletions(-)
    
    Thanks.
    
    --
    tejun
    
    [L] http://thread.gmane.org/gmane.linux.kernel.cgroups/5251
    
    ^ permalink raw reply	[flat|nested] 24+ messages in thread

    end of thread, other threads:[~2013-01-11  9:05 UTC | newest]
    
    Thread overview: 24+ messages (download: mbox.gz follow: Atom feed
    -- links below jump to the message on this page --
         [not found] <1357248967-24959-1-git-send-email-tj@kernel.org>
         [not found] ` <1357248967-24959-1-git-send-email-tj-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
    2013-01-03 21:35   ` [PATCH 01/13] cpuset: remove unused cpuset_unlock() Tejun Heo
    2013-01-03 21:35   ` [PATCH 02/13] cpuset: remove fast exit path from remove_tasks_in_empty_cpuset() Tejun Heo
    2013-01-03 21:35   ` [PATCH 03/13] cpuset: introduce ->css_on/offline() Tejun Heo
    2013-01-03 21:35   ` [PATCH 04/13] cpuset: introduce CS_ONLINE Tejun Heo
    2013-01-03 21:35   ` [PATCH 05/13] cpuset: introduce cpuset_for_each_child() Tejun Heo
    2013-01-03 21:36   ` [PATCH 06/13] cpuset: cleanup cpuset[_can]_attach() Tejun Heo
    2013-01-03 21:36   ` [PATCH 07/13] cpuset: reorganize CPU / memory hotplug handling Tejun Heo
    2013-01-03 21:36   ` [PATCH 08/13] cpuset: don't nest cgroup_mutex inside get_online_cpus() Tejun Heo
    2013-01-03 21:36   ` [PATCH 09/13] cpuset: drop async_rebuild_sched_domains() Tejun Heo
    2013-01-03 21:36   ` [PATCH 10/13] cpuset: make CPU / memory hotplug propagation asynchronous Tejun Heo
    2013-01-03 21:36   ` [PATCH 11/13] cpuset: pin down cpus and mems while a task is being attached Tejun Heo
    2013-01-03 21:36   ` [PATCH 12/13] cpuset: schedule hotplug propagation from cpuset_attach() if the cpuset is empty Tejun Heo
    2013-01-03 21:36   ` [PATCH 13/13] cpuset: replace cgroup_mutex locking with cpuset internal locking Tejun Heo
    2013-01-06  8:27   ` [PATCHSET] cpuset: decouple cpuset locking from cgroup core, take#2 Li Zefan
    2013-01-07  8:12   ` Kamezawa Hiroyuki
    2013-01-09  9:46   ` Glauber Costa
         [not found] ` <1357248967-24959-11-git-send-email-tj@kernel.org>
         [not found]   ` <1357248967-24959-11-git-send-email-tj-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
    2013-01-06  8:29     ` [PATCH 10/13] cpuset: make CPU / memory hotplug propagation asynchronous Li Zefan
         [not found]       ` <50E935D5.4040402-hv44wF8Li93QT0dZR+AlfA@public.gmane.org>
    2013-01-07 16:42         ` Tejun Heo
         [not found] ` <50E93554.3070102@huawei.com>
         [not found]   ` <50E93554.3070102-hv44wF8Li93QT0dZR+AlfA@public.gmane.org>
    2013-01-07 16:44     ` [PATCHSET] cpuset: decouple cpuset locking from cgroup core, take#2 Tejun Heo
         [not found]       ` <20130107164453.GH3926-Gd/HAXX7CRxy/B6EtB590w@public.gmane.org>
    2013-01-08  1:31         ` Li Zefan
         [not found]           ` <50EB76DF.5070508-hv44wF8Li93QT0dZR+AlfA@public.gmane.org>
    2013-01-09 18:57             ` Tejun Heo
         [not found]               ` <20130109185724.GP3926-Gd/HAXX7CRxy/B6EtB590w@public.gmane.org>
    2013-01-11  9:05                 ` Li Zefan
    2013-01-09 19:32             ` Paul Menage
    2013-01-03 21:35 Tejun Heo
    

    This is a public inbox, see mirroring instructions
    for how to clone and mirror all data and code used for this inbox