linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Jan Kara <jack@suse.cz>
To: Joonsoo Kim <iamjoonsoo.kim@lge.com>
Cc: Jan Kara <jack@suse.cz>, Gioh Kim <gioh.kim@lge.com>,
	Peter Zijlstra <peterz@infradead.org>,
	Alexander Viro <viro@zeniv.linux.org.uk>,
	Andrew Morton <akpm@linux-foundation.org>,
	"Paul E. McKenney" <paulmck@linux.vnet.ibm.com>,
	linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
	Theodore Ts'o <tytso@mit.edu>,
	Andreas Dilger <adilger.kernel@dilger.ca>,
	linux-ext4@vger.kernel.org, linux-mm@kvack.org,
	Minchan Kim <minchan@kernel.org>
Subject: Re: [PATCH 0/2] new API to allocate buffer-cache for superblock in non-movable area
Date: Fri, 1 Aug 2014 11:15:39 +0200	[thread overview]
Message-ID: <20140801091539.GA27281@quack.suse.cz> (raw)
In-Reply-To: <20140801083446.GA2613@js1304-P5Q-DELUXE>

On Fri 01-08-14 17:34:46, Joonsoo Kim wrote:
> On Thu, Jul 31, 2014 at 02:21:14PM +0200, Jan Kara wrote:
> > On Thu 31-07-14 09:37:15, Gioh Kim wrote:
> > > 
> > > 
> > > 2014-07-31 오전 9:03, Jan Kara 쓴 글:
> > > >On Thu 31-07-14 08:54:40, Gioh Kim wrote:
> > > >>2014-07-30 오후 7:11, Jan Kara 쓴 글:
> > > >>>On Wed 30-07-14 16:44:24, Gioh Kim wrote:
> > > >>>>2014-07-22 오후 6:38, Jan Kara 쓴 글:
> > > >>>>>On Tue 22-07-14 09:30:05, Peter Zijlstra wrote:
> > > >>>>>>On Tue, Jul 22, 2014 at 02:18:47PM +0900, Gioh Kim wrote:
> > > >>>>>>>Hello,
> > > >>>>>>>
> > > >>>>>>>This patch try to solve problem that a long-lasting page cache of
> > > >>>>>>>ext4 superblock disturbs page migration.
> > > >>>>>>>
> > > >>>>>>>I've been testing CMA feature on my ARM-based platform
> > > >>>>>>>and found some pages for page caches cannot be migrated.
> > > >>>>>>>Some of them are page caches of superblock of ext4 filesystem.
> > > >>>>>>>
> > > >>>>>>>Current ext4 reads superblock with sb_bread(). sb_bread() allocates page
> > > >>>>>>>from movable area. But the problem is that ext4 hold the page until
> > > >>>>>>>it is unmounted. If root filesystem is ext4 the page cannot be migrated forever.
> > > >>>>>>>
> > > >>>>>>>I introduce a new API for allocating page from non-movable area.
> > > >>>>>>>It is useful for ext4 and others that want to hold page cache for a long time.
> > > >>>>>>
> > > >>>>>>There's no word on why you can't teach ext4 to still migrate that page.
> > > >>>>>>For all I know it might be impossible, but at least mention why.
> > > >>>>
> > > >>>>I am very sorry for lacking of details.
> > > >>>>
> > > >>>>In ext4_fill_super() the buffer-head of superblock is stored in sbi->s_sbh.
> > > >>>>The page belongs to the buffer-head is allocated from movable area.
> > > >>>>To migrate the page the buffer-head should be released via brelse().
> > > >>>>But brelse() is not called until unmount.
> > > >>>   Hum, I don't see where in the code do we check buffer_head use count. Can
> > > >>>you please point me? Thanks.
> > > >>
> > > >>Filesystem code does not check buffer_head use count.  sb_bread() returns
> > > >>the buffer_head that is included in bh_lru and has non-zero use count.
> > > >>You can see the bh_lru code in buffer.c: __find_get_clock() and
> > > >>lookup_bh_lru().  bh_lru_install() inserts the buffer_head into the
> > > >>bh_lru().  It first calls get_bh() to increase the use count and insert
> > > >>bh into the lru array.
> > > >>
> > > >>The buffer_head use count is non-zero until brelse() is called.
> > > >   So I probably didn't phrase the question precisely enough. What I was
> > > >asking about is where exactly *migration* code checks buffer use count?
> > > >Because as I'm looking at buffer_migrate_page() we lock the buffers on a
> > > >migrated page but we don't look at buffer use counts... So it seems to me
> > > >that migration of a page with buffers should succeed even if buffer head
> > > >has an elevated use count. Now I think that it *should* check the buffer
> > > >use counts (it is dangerous to migrate buffers someone holds reference to)
> > > >but I just cannot find that place. Or does CMA use some other migration
> > > >function for buffer pages than buffer_migrate_page()?
> > > 
> > > CMA allocation function is cma_alloc().
> > > Function flow is alloc_contig_range() -> __alloc_contig_migrate_range() -> migrate_pages -> unmap_and_move
> > > -> __unmap_and_move -> try_to_free_buffers -> drop_buffers -> buffer_busy.
> > > 
> > > The buffer_busy() is checking b_count.
> > > If buffer is busy buffer-cache cannot be removed.
> > > So the page that includes buffer_head and the page that is refered by
> > > buffer_head are not movable.
> > > 
> > > Is this what you need?
> >   Yes, this is what I was asking about. Thanks! But as I'm looking into
> > __unmap_and_move() it calls try_to_free_buffers() only if page->mapping ==
> > NULL. As the comment before that test states, this can happen only for swap
> > cache (not our case) or for pagecache pages that were truncated and not yet
> > fully cleaned up. But superblock page cannot really be truncated. So I
> > somewhat doubt you can hit the above path for a page holding superblock...
> 
> Hello,
> 
> Although page->mapping != NULL, mapping->a_ops->migratepage could be
> NULL. This is the case of block_device. See def_blk_aops in
> fs/block_dev.c. In this case, fallback_migrate_page() is called and
> then try_to_release_page() and try_to_free_buffers() would be called.
  Aaah, right! Finally I understand what happens and why I couldn't see
buffer_migrate_page() being called for blkdev buffers. I didn't realize
blkdev mappings end up with NULL ->migratepage callback. Thanks a lot for
clearing this up.

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

      reply	other threads:[~2014-08-01  9:15 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-07-22  5:18 [PATCH 0/2] new API to allocate buffer-cache for superblock in non-movable area Gioh Kim
2014-07-22  7:30 ` Peter Zijlstra
2014-07-22  8:14   ` Theodore Ts'o
2014-07-27  1:01     ` Theodore Ts'o
2014-07-30  7:56       ` Gioh Kim
2014-07-22  9:38   ` Jan Kara
2014-07-30  7:44     ` Gioh Kim
2014-07-30  7:57       ` Kyungmin Park
2014-07-30 10:11       ` Jan Kara
2014-07-30 10:19         ` Peter Zijlstra
2014-07-30 23:45           ` Gioh Kim
2014-07-30 23:54         ` Gioh Kim
2014-07-31  0:03           ` Jan Kara
2014-07-31  0:37             ` Gioh Kim
2014-07-31 12:21               ` Jan Kara
2014-08-01  0:07                 ` Gioh Kim
2014-08-01  1:06                   ` Gioh Kim
2014-08-01  9:57                     ` Jan Kara
2014-08-01 13:36                       ` Peter Zijlstra
2014-08-01 15:24                         ` Jan Kara
2014-08-01 16:04                           ` Peter Zijlstra
2014-08-06  6:15                             ` Gioh Kim
2014-08-01  8:34                 ` Joonsoo Kim
2014-08-01  9:15                   ` Jan Kara [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140801091539.GA27281@quack.suse.cz \
    --to=jack@suse.cz \
    --cc=adilger.kernel@dilger.ca \
    --cc=akpm@linux-foundation.org \
    --cc=gioh.kim@lge.com \
    --cc=iamjoonsoo.kim@lge.com \
    --cc=linux-ext4@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=minchan@kernel.org \
    --cc=paulmck@linux.vnet.ibm.com \
    --cc=peterz@infradead.org \
    --cc=tytso@mit.edu \
    --cc=viro@zeniv.linux.org.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).