From: Johannes Weiner <hannes@cmpxchg.org>
To: Tejun Heo <tj@kernel.org>
Cc: Vladimir Davydov <vdavydov@parallels.com>,
"Suzuki K. Poulose" <Suzuki.Poulose@arm.com>,
linux-mm@kvack.org,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
Will Deacon <Will.Deacon@arm.com>
Subject: Re: [PATCH cgroup/for-3.19-fixes] cgroup: implement cgroup_subsys->unbind() callback
Date: Sun, 11 Jan 2015 15:55:43 -0500 [thread overview]
Message-ID: <20150111205543.GA5480@phnom.home.cmpxchg.org> (raw)
In-Reply-To: <20150110214316.GF25319@htj.dyndns.org>
On Sat, Jan 10, 2015 at 04:43:16PM -0500, Tejun Heo wrote:
> Currently, if a hierarchy doesn't have any live children when it's
> unmounted, the hierarchy starts dying by killing its refcnt. The
> expectation is that even if there are lingering dead children which
> are lingering due to remaining references, they'll be put in a finite
> amount of time. When the children are finally released, the hierarchy
> is destroyed and all controllers bound to it also are released.
>
> However, for memcg, the premise that the lingering refs will be put in
> a finite amount time is not true. In the absense of memory pressure,
> dead memcg's may hang around indefinitely pinned by its pages. This
> unfortunately may lead to indefinite hang on the next mount attempt
> involving memcg as the mount logic waits for it to get released.
>
> While we can change hierarchy destruction logic such that a hierarchy
> is only destroyed when it's not mounted anywhere and all its children,
> live or dead, are gone, this makes whether the hierarchy gets
> destroyed or not to be determined by factors opaque to userland.
> Userland may or may not get a new hierarchy on the next mount attempt.
> Worse, if it explicitly wants to create a new hierarchy with different
> options or controller compositions involving memcg, it will fail in an
> essentially arbitrary manner.
>
> We want to guarantee that a hierarchy is destroyed once the
> conditions, unmounted and no visible children, are met. To aid it,
> this patch introduces a new callback cgroup_subsys->unbind() which is
> invoked right before the hierarchy a subsystem is bound to starts
> dying. memcg can implement this callback and initiate draining of
> remaining refs so that the hierarchy can eventually be released in a
> finite amount of time.
>
> Signed-off-by: Tejun Heo <tj@kernel.org>
> Cc: Li Zefan <lizefan@huawei.com>
> Cc: Johannes Weiner <hannes@cmpxchg.org>
> Cc: Michal Hocko <mhocko@suse.cz>
> Cc: Vladimir Davydov <vdavydov@parallels.com>
> ---
> Hello,
>
> > May be, we should kill the ref counter to the memory controller root in
> > cgroup_kill_sb only if there is no children at all, neither online nor
> > offline.
>
> Ah, thanks for the analysis, but I really wanna avoid making hierarchy
> destruction conditions opaque to userland. This is userland visible
> behavior. It shouldn't be determined by kernel internals invisible
> outside. This patch adds ss->unbind() which memcg can hook into to
> kick off draining of residual refs. If this would work, I'll add this
> patch to cgroup/for-3.19-fixes, possibly with stable cc'd.
How about this ->unbind() for memcg?
next prev parent reply other threads:[~2015-01-11 20:55 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-01-09 17:43 [Regression] 3.19-rc3 : memcg: Hang in mount memcg Suzuki K. Poulose
2015-01-09 21:46 ` Tejun Heo
2015-01-12 17:02 ` Suzuki K. Poulose
2015-01-10 8:55 ` Vladimir Davydov
2015-01-10 21:43 ` [PATCH cgroup/for-3.19-fixes] cgroup: implement cgroup_subsys->unbind() callback Tejun Heo
2015-01-11 20:55 ` Johannes Weiner [this message]
2015-01-12 8:01 ` Vladimir Davydov
2015-01-12 11:28 ` Tejun Heo
2015-01-12 12:59 ` Vladimir Davydov
2015-01-12 13:05 ` Tejun Heo
2015-01-14 11:16 ` Suzuki K. Poulose
2015-01-15 17:56 ` Michal Hocko
2015-01-15 17:26 ` Michal Hocko
2015-01-19 12:51 ` [Regression] 3.19-rc3 : memcg: Hang in mount memcg Suzuki K. Poulose
2015-01-21 16:39 ` Will Deacon
2015-01-22 13:45 ` Johannes Weiner
2015-01-22 14:34 ` Tejun Heo
2015-01-22 15:19 ` Johannes Weiner
2015-01-22 15:28 ` Tejun Heo
2015-01-23 15:00 ` Suzuki K. Poulose
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150111205543.GA5480@phnom.home.cmpxchg.org \
--to=hannes@cmpxchg.org \
--cc=Suzuki.Poulose@arm.com \
--cc=Will.Deacon@arm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=tj@kernel.org \
--cc=vdavydov@parallels.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).