public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed
From: Dave Chinner <david@fromorbit.com>
To: Theodore Ts'o <tytso@mit.edu>
Cc: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>,
	Johannes Weiner <hannes@cmpxchg.org>,
	oleg@redhat.com, xfs@oss.sgi.com, mhocko@suse.cz,
	linux-mm@kvack.org, mgorman@suse.de, dchinner@redhat.com,
	rientjes@google.com, Andrew Morton <akpm@linux-foundation.org>,
	torvalds@linux-foundation.org
Subject: Re: How to handle TIF_MEMDIE stalls?
Date: Thu, 5 Mar 2015 10:17:40 +1100	[thread overview]
Message-ID: <20150304231740.GA18360@dastard> (raw)
In-Reply-To: <20150304173841.GB15669@thunk.org>

On Wed, Mar 04, 2015 at 12:38:41PM -0500, Theodore Ts'o wrote:
> On Wed, Mar 04, 2015 at 10:04:36AM -0500, Johannes Weiner wrote:
> > Yes, we can make this work if you can tell us which allocations have
> > limited/controllable lifetime.
> 
> It may be helpful to be a bit precise about definitions here.  There
> are a number of different object lifetimes:
> 
> a) will be released before the kernel thread returns control to
> userspace
> 
> b) will be released once the current I/O operation finishes.  (In the
> case of nbd where the remote server has unexpectedy gone away might be
> quite a while, but I'm not sure how much we care about that scenario)
> 
> c) can be trivially released if the mm subsystem asks via calling a
> shrinker
> 
> d) can be released only after doing some amount of bounded work (i.e.,
> cleaning a dirty page)
> 
> e) impossible to predict when it can be released (e.g., dcache, inodes
> attached to an open file descriptors, buffer heads that won't be freed
> until the file system is umounted, etc.)
> 
> 
> I'm guessing that what you mean is (b), but what about cases such as
> (c)?

The thing is, in the XFS transaction case we are hitting e) for
every allocation, and only after IO and/or some processing do we
know whether it will fall into c), d) or whether it will be
permanently consumed.

> Would the mm subsystem find it helpful if it had more information
> about object lifetime?  For example, the CMA folks seem to really care
> about know whether memory allocations falls in category (e) or not.

The problem is that most filesystem allocations fall into category
(e). Worse is that the state of an object can change without
allocations having taken place e.g. an object on a reclaimable LRU
can be found via a cache lookup, then joined to and modified in a
transaction. Hence objects can change state from "reclaimable" to
"permanently consumed" without actually going through memory reclaim
and allocation.

IOWs, what is really required is the ability to say "this amount of
allocation reserve is now consumed" /some time after/ we've done the
allocation. i.e. when we join the object to the transaction and
modify it, that's when we need to be able to reduce the reservation
limit as that memory is now permanently consumed by the transaction
context. Objects that fall into c) and d) don't need to have anyting
special done, because reclaim will eventually free the memory they
hold once the allocating context releases them.

Indeed, this model works even when we find those c) and d) objects
in cache rather than allocating them. They would get correctly
accounted as "consumed reserve" because we no longer need to
allocate that memory in transaction context and so that reserve can
be released back to the free pool....

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

  reply	other threads:[~2015-03-04 23:22 UTC|newest]

Thread overview: 83+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20141230112158.GA15546@dhcp22.suse.cz>
     [not found] ` <201502092044.JDG39081.LVFOOtFHQFOMSJ@I-love.SAKURA.ne.jp>
     [not found]   ` <201502102258.IFE09888.OVQFJOMSFtOLFH@I-love.SAKURA.ne.jp>
     [not found]     ` <20150210151934.GA11212@phnom.home.cmpxchg.org>
     [not found]       ` <201502111123.ICD65197.FMLOHSQJFVOtFO@I-love.SAKURA.ne.jp>
     [not found]         ` <201502172123.JIE35470.QOLMVOFJSHOFFt@I-love.SAKURA.ne.jp>
     [not found]           ` <20150217125315.GA14287@phnom.home.cmpxchg.org>
2015-02-17 22:54             ` How to handle TIF_MEMDIE stalls? Dave Chinner
2015-02-17 23:32               ` Dave Chinner
2015-02-18  8:25               ` Michal Hocko
2015-02-18 10:48                 ` Dave Chinner
2015-02-18 12:16                   ` Michal Hocko
2015-02-18 21:31                     ` Dave Chinner
2015-02-19  9:40                       ` Michal Hocko
2015-02-19 22:03                         ` Dave Chinner
2015-02-20  9:27                           ` Michal Hocko
2015-02-19 11:01                     ` Johannes Weiner
2015-02-19 12:29                       ` Michal Hocko
2015-02-19 12:58                         ` Michal Hocko
2015-02-19 15:29                           ` Tetsuo Handa
2015-02-19 21:53                             ` Tetsuo Handa
2015-02-20  9:13                             ` Michal Hocko
2015-02-20 13:37                               ` Stefan Ring
2015-02-19 13:29                         ` Tetsuo Handa
2015-02-20  9:10                           ` Michal Hocko
2015-02-20 12:20                             ` Tetsuo Handa
2015-02-20 12:38                               ` Michal Hocko
2015-02-19 21:43                         ` Dave Chinner
2015-02-20 12:48                           ` Michal Hocko
2015-02-20 23:09                             ` Dave Chinner
2015-02-19 10:24               ` Johannes Weiner
2015-02-19 22:52                 ` Dave Chinner
2015-02-20 10:36                   ` Tetsuo Handa
2015-02-20 23:15                     ` Dave Chinner
2015-02-21  3:20                       ` Theodore Ts'o
2015-02-21  9:19                         ` Andrew Morton
2015-02-21 13:48                           ` Tetsuo Handa
2015-02-21 21:38                           ` Dave Chinner
2015-02-22  0:20                           ` Johannes Weiner
2015-02-23 10:48                             ` Michal Hocko
2015-02-23 11:23                               ` Tetsuo Handa
2015-02-23 21:33                             ` David Rientjes
2015-02-21 12:00                         ` Tetsuo Handa
2015-02-23 10:26                         ` Michal Hocko
2015-02-21 11:12                       ` Tetsuo Handa
2015-02-21 21:48                         ` Dave Chinner
2015-02-21 23:52                   ` Johannes Weiner
2015-02-23  0:45                     ` Dave Chinner
2015-02-23  1:29                       ` Andrew Morton
2015-02-23  7:32                         ` Dave Chinner
2015-02-27 18:24                           ` Vlastimil Babka
2015-02-28  0:03                             ` Dave Chinner
2015-02-28 15:17                               ` Theodore Ts'o
2015-03-02  9:39                           ` Vlastimil Babka
2015-03-02 22:31                             ` Dave Chinner
2015-03-03  9:13                               ` Vlastimil Babka
2015-03-04  1:33                                 ` Dave Chinner
2015-03-04  8:50                                   ` Vlastimil Babka
2015-03-04 11:03                                     ` Dave Chinner
2015-03-07  0:20                               ` Johannes Weiner
2015-03-07  3:43                                 ` Dave Chinner
2015-03-07 15:08                                   ` Johannes Weiner
2015-03-02 20:22                           ` Johannes Weiner
2015-03-02 23:12                             ` Dave Chinner
2015-03-03  2:50                               ` Johannes Weiner
2015-03-04  6:52                                 ` Dave Chinner
2015-03-04 15:04                                   ` Johannes Weiner
2015-03-04 17:38                                     ` Theodore Ts'o
2015-03-04 23:17                                       ` Dave Chinner [this message]
2015-02-28 16:29                       ` Johannes Weiner
2015-02-28 16:41                         ` Theodore Ts'o
2015-02-28 22:15                           ` Johannes Weiner
2015-03-01 11:17                             ` Tetsuo Handa
2015-03-06 11:53                               ` Tetsuo Handa
2015-03-01 13:43                             ` Theodore Ts'o
2015-03-01 16:15                               ` Johannes Weiner
2015-03-01 19:36                                 ` Theodore Ts'o
2015-03-01 20:44                                   ` Johannes Weiner
2015-03-01 20:17                               ` Johannes Weiner
2015-03-01 21:48                             ` Dave Chinner
2015-03-02  0:17                               ` Dave Chinner
2015-03-02 12:46                                 ` Brian Foster
2015-02-28 18:36                       ` Vlastimil Babka
2015-03-02 15:18                       ` Michal Hocko
2015-03-02 16:05                         ` Johannes Weiner
2015-03-02 17:10                           ` Michal Hocko
2015-03-02 17:27                             ` Johannes Weiner
2015-03-02 16:39                         ` Theodore Ts'o
2015-03-02 16:58                           ` Michal Hocko
2015-03-04 12:52                             ` Dave Chinner

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20150304231740.GA18360@dastard \
    --to=david@fromorbit.com \
    --cc=akpm@linux-foundation.org \
    --cc=dchinner@redhat.com \
    --cc=hannes@cmpxchg.org \
    --cc=linux-mm@kvack.org \
    --cc=mgorman@suse.de \
    --cc=mhocko@suse.cz \
    --cc=oleg@redhat.com \
    --cc=penguin-kernel@I-love.SAKURA.ne.jp \
    --cc=rientjes@google.com \
    --cc=torvalds@linux-foundation.org \
    --cc=tytso@mit.edu \
    --cc=xfs@oss.sgi.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox