From: "Darrick J. Wong" <darrick.wong@oracle.com>
To: Dave Chinner <david@fromorbit.com>
Cc: Brian Foster <bfoster@redhat.com>, linux-xfs@vger.kernel.org
Subject: Re: [PATCH 24/63] xfs: when replaying bmap operations, don't let unlinked inodes get reaped
Date: Tue, 4 Oct 2016 14:44:13 -0700 [thread overview]
Message-ID: <20161004214413.GC8642@birch.djwong.org> (raw)
In-Reply-To: <20161004190727.GP27872@dastard>
On Wed, Oct 05, 2016 at 06:07:27AM +1100, Dave Chinner wrote:
> On Tue, Oct 04, 2016 at 08:44:01AM -0400, Brian Foster wrote:
> > On Mon, Oct 03, 2016 at 05:29:25PM -0700, Darrick J. Wong wrote:
> > > On Mon, Oct 03, 2016 at 03:04:10PM -0400, Brian Foster wrote:
> > > > On Thu, Sep 29, 2016 at 08:08:17PM -0700, Darrick J. Wong wrote:
> > > > > Log recovery will iget an inode to replay BUI items and iput the inode
> > > > > when it's done. Unfortunately, the iput will see that i_nlink == 0
> > > > > and decide to truncate & free the inode, which prevents us from
> > > > > replaying subsequent BUIs. We can't skip the BUIs because we have to
> > > > > replay all the redo items to ensure that atomic operations complete.
> > > > >
> > ...
> > > >
> > > > > Since unlinked inode recovery will reap the inode anyway, we can
> > > > > safely introduce a new inode flag to indicate that an inode is in this
> > > > > 'unlinked recovery' state and should not be auto-reaped in the
> > > > > drop_inode path.
> > > > >
> > > > > Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
> > > > > ---
> > > > > fs/xfs/xfs_bmap_item.c | 1 +
> > > > > fs/xfs/xfs_inode.c | 8 ++++++++
> > > > > fs/xfs/xfs_inode.h | 6 ++++++
> > > > > fs/xfs/xfs_log_recover.c | 1 +
> > > > > 4 files changed, 16 insertions(+)
> > > > >
> > > > >
> > ...
> > > > > diff --git a/fs/xfs/xfs_inode.c b/fs/xfs/xfs_inode.c
> > > > > index e08eaea..0c25a76 100644
> > > > > --- a/fs/xfs/xfs_inode.c
> > > > > +++ b/fs/xfs/xfs_inode.c
> > > > > @@ -1855,6 +1855,14 @@ xfs_inactive(
> > > > > if (mp->m_flags & XFS_MOUNT_RDONLY)
> > > > > return;
> > > > >
> > > > > + /*
> > > > > + * If this unlinked inode is in the middle of recovery, don't
> > > > > + * truncate and free the inode just yet; log recovery will take
> > > > > + * care of that. See the comment for this inode flag.
> > > > > + */
> > > > > + if (xfs_iflags_test(ip, XFS_IRECOVER_UNLINKED))
> > > > > + return;
> > > > > +
> > > >
> > > > Also, it might be better to push this one block of code down since the
> > > > following block still deals with i_nlink > 0 properly (not that it will
> > > > likely affect the code as it is now, since we only handle eofblocks
> > > > trimming atm).
> > >
> > > I put the jump-out case there so that we touch the inode's bmap as little
> > > as possible while we're recovering the inode. Since the inode is still
> > > around in memory, so we'll end up back there at a later point anyway.
> > >
> >
> > I'm not quite following... it looks like we set the reclaim tag on the
> > inode unconditionally after we get through xfs_inactive(). That implies
> > the in-memory inode can go away at any point thereafter, unless somebody
> > else comes along and happens to look for it. Hmm?
>
> Yup - the iunlink recover check needs to go into xfs_fs_drop_inode()
> to determine whether the inode should be dropped from the cache or
> not by iput_final(). That way it will never get near xfs_inactive()
> because the VFS won't try to evict it until the
> XFS_IRECOVER_UNLINKED flag is cleared.
(Ick, the iput code...)
So.... paging some of my notes back into memory, iput_final() will
still evict() an i_count == 0 inode even if op->drop_inode says
not to drop the inode if MS_ACTIVE is not set on the superblock:
iput_final() {
if (op->drop_inode)
drop = op->drop_inode(inode);
else
drop = generic_drop_inode(inode);
if (!drop && (sb->s_flags & MS_ACTIVE)) {
inode->i_state |= I_REFERENCED;
inode_add_lru(inode);
spin_unlock(&inode->i_lock);
return;
}
/* do stuff */
evict(inode);
}
MS_ACTIVE isn't set on the superblock during recovery because the
VFS doesn't set it until fill_super succeeds, and fill_super doesn't
return until we're done with log recovery. Therefore, we can end up
in xfs_inactive during recovery even if _drop_inode just told the VFS
not to evict the inode.
IIRC that's why the IRECOVERY check ended up in xfs_inactive. I
don't mind adding a second IRECOVERY check to xfs_fs_drop_inode,
but removing the one in xfs_inactive breaks recovery (xfs/329).
(Or, per a suggestion of Dave, I could just set MS_ACTIVE prior to
second stage log recovery.)
--D
>
> Cheers,
>
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
next prev parent reply other threads:[~2016-10-04 21:44 UTC|newest]
Thread overview: 186+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-09-30 3:05 [PATCH v10 00/63] xfs: add reflink and dedupe support Darrick J. Wong
2016-09-30 3:05 ` [PATCH 01/63] vfs: support FS_XFLAG_COWEXTSIZE and get/set of CoW extent size hint Darrick J. Wong
2016-09-30 3:05 ` [PATCH 02/63] vfs: add a FALLOC_FL_UNSHARE mode to fallocate to unshare a range of blocks Darrick J. Wong
2016-09-30 7:08 ` Christoph Hellwig
2016-09-30 3:05 ` [PATCH 03/63] xfs: return an error when an inline directory is too small Darrick J. Wong
2016-09-30 3:06 ` [PATCH 04/63] xfs: define tracepoints for refcount btree activities Darrick J. Wong
2016-09-30 3:06 ` [PATCH 05/63] xfs: introduce refcount btree definitions Darrick J. Wong
2016-09-30 3:06 ` [PATCH 06/63] xfs: refcount btree add more reserved blocks Darrick J. Wong
2016-09-30 3:06 ` [PATCH 07/63] xfs: define the on-disk refcount btree format Darrick J. Wong
2016-09-30 3:06 ` [PATCH 08/63] xfs: add refcount btree support to growfs Darrick J. Wong
2016-09-30 3:06 ` [PATCH 09/63] xfs: account for the refcount btree in the alloc/free log reservation Darrick J. Wong
2016-09-30 3:06 ` [PATCH 10/63] xfs: add refcount btree operations Darrick J. Wong
2016-09-30 3:06 ` [PATCH 11/63] xfs: create refcount update intent log items Darrick J. Wong
2016-09-30 3:06 ` [PATCH 12/63] xfs: log refcount intent items Darrick J. Wong
2016-09-30 3:06 ` [PATCH 13/63] xfs: adjust refcount of an extent of blocks in refcount btree Darrick J. Wong
2016-09-30 7:11 ` Christoph Hellwig
2016-09-30 17:53 ` Darrick J. Wong
2016-09-30 3:07 ` [PATCH 14/63] xfs: connect refcount adjust functions to upper layers Darrick J. Wong
2016-09-30 7:13 ` Christoph Hellwig
2016-09-30 16:21 ` Brian Foster
2016-09-30 19:40 ` Darrick J. Wong
2016-09-30 20:11 ` Brian Foster
2016-09-30 3:07 ` [PATCH 15/63] xfs: adjust refcount when unmapping file blocks Darrick J. Wong
2016-09-30 7:14 ` Christoph Hellwig
2016-09-30 3:07 ` [PATCH 16/63] xfs: add refcount btree block detection to log recovery Darrick J. Wong
2016-09-30 7:15 ` Christoph Hellwig
2016-09-30 3:07 ` [PATCH 17/63] xfs: refcount btree requires more reserved space Darrick J. Wong
2016-09-30 7:15 ` Christoph Hellwig
2016-09-30 16:46 ` Brian Foster
2016-09-30 18:41 ` Darrick J. Wong
2016-09-30 3:07 ` [PATCH 18/63] xfs: introduce reflink utility functions Darrick J. Wong
2016-09-30 7:16 ` Christoph Hellwig
2016-09-30 19:22 ` Brian Foster
2016-09-30 19:50 ` Darrick J. Wong
2016-09-30 3:07 ` [PATCH 19/63] xfs: create bmbt update intent log items Darrick J. Wong
2016-09-30 7:24 ` Christoph Hellwig
2016-09-30 17:24 ` Darrick J. Wong
2016-09-30 3:07 ` [PATCH 20/63] xfs: log bmap intent items Darrick J. Wong
2016-09-30 7:26 ` Christoph Hellwig
2016-09-30 17:26 ` Darrick J. Wong
2016-09-30 19:22 ` Brian Foster
2016-09-30 19:52 ` Darrick J. Wong
2016-09-30 3:07 ` [PATCH 21/63] xfs: map an inode's offset to an exact physical block Darrick J. Wong
2016-09-30 7:31 ` Christoph Hellwig
2016-09-30 17:30 ` Darrick J. Wong
2016-10-03 19:03 ` Brian Foster
2016-10-04 0:11 ` Darrick J. Wong
2016-10-04 12:43 ` Brian Foster
2016-10-04 17:28 ` Darrick J. Wong
2016-09-30 3:08 ` [PATCH 22/63] xfs: pass bmapi flags through to bmap_del_extent Darrick J. Wong
2016-09-30 7:16 ` Christoph Hellwig
2016-09-30 3:08 ` [PATCH 23/63] xfs: implement deferred bmbt map/unmap operations Darrick J. Wong
2016-09-30 7:34 ` Christoph Hellwig
2016-09-30 17:38 ` Darrick J. Wong
2016-09-30 20:34 ` Roger Willcocks
2016-09-30 21:08 ` Darrick J. Wong
2016-09-30 3:08 ` [PATCH 24/63] xfs: when replaying bmap operations, don't let unlinked inodes get reaped Darrick J. Wong
2016-09-30 7:35 ` Christoph Hellwig
2016-10-03 19:04 ` Brian Foster
2016-10-04 0:29 ` Darrick J. Wong
2016-10-04 12:44 ` Brian Foster
2016-10-04 19:07 ` Dave Chinner
2016-10-04 21:44 ` Darrick J. Wong [this message]
2016-09-30 3:08 ` [PATCH 25/63] xfs: return work remaining at the end of a bunmapi operation Darrick J. Wong
2016-09-30 7:19 ` Christoph Hellwig
2016-10-03 19:04 ` Brian Foster
2016-10-04 0:30 ` Darrick J. Wong
2016-10-04 12:44 ` Brian Foster
2016-09-30 3:08 ` [PATCH 26/63] xfs: define tracepoints for reflink activities Darrick J. Wong
2016-09-30 7:20 ` Christoph Hellwig
2016-09-30 3:08 ` [PATCH 27/63] xfs: add reflink feature flag to geometry Darrick J. Wong
2016-09-30 7:20 ` Christoph Hellwig
2016-09-30 3:08 ` [PATCH 28/63] xfs: don't allow reflinked dir/dev/fifo/socket/pipe files Darrick J. Wong
2016-09-30 7:20 ` Christoph Hellwig
2016-09-30 3:08 ` [PATCH 29/63] xfs: introduce the CoW fork Darrick J. Wong
2016-09-30 7:39 ` Christoph Hellwig
2016-09-30 17:48 ` Darrick J. Wong
2016-09-30 3:08 ` [PATCH 30/63] xfs: support bmapping delalloc extents in " Darrick J. Wong
2016-09-30 7:42 ` Christoph Hellwig
2016-09-30 3:09 ` [PATCH 31/63] xfs: create delalloc extents in " Darrick J. Wong
2016-10-04 16:38 ` Brian Foster
2016-10-04 17:39 ` Darrick J. Wong
2016-10-04 18:38 ` Brian Foster
2016-09-30 3:09 ` [PATCH 32/63] xfs: support allocating delayed " Darrick J. Wong
2016-09-30 7:42 ` Christoph Hellwig
2016-10-04 16:38 ` Brian Foster
2016-09-30 3:09 ` [PATCH 33/63] xfs: allocate " Darrick J. Wong
2016-10-04 16:38 ` Brian Foster
2016-10-04 18:26 ` Darrick J. Wong
2016-10-04 18:39 ` Brian Foster
2016-09-30 3:09 ` [PATCH 34/63] xfs: support removing extents from " Darrick J. Wong
2016-09-30 7:46 ` Christoph Hellwig
2016-09-30 18:00 ` Darrick J. Wong
2016-10-05 18:26 ` Brian Foster
2016-09-30 3:09 ` [PATCH 35/63] xfs: move mappings from cow fork to data fork after copy-write Darrick J. Wong
2016-10-05 18:26 ` Brian Foster
2016-10-05 21:22 ` Darrick J. Wong
2016-09-30 3:09 ` [PATCH 36/63] xfs: report shared extent mappings to userspace correctly Darrick J. Wong
2016-09-30 3:09 ` [PATCH 37/63] xfs: implement CoW for directio writes Darrick J. Wong
2016-10-05 18:27 ` Brian Foster
2016-10-05 20:55 ` Darrick J. Wong
2016-10-06 12:20 ` Brian Foster
2016-10-07 1:02 ` Darrick J. Wong
2016-10-07 6:17 ` Christoph Hellwig
2016-10-07 12:16 ` Brian Foster
2016-10-07 12:15 ` Brian Foster
2016-10-13 18:14 ` Darrick J. Wong
2016-10-13 19:01 ` Brian Foster
2016-09-30 3:09 ` [PATCH 38/63] xfs: cancel CoW reservations and clear inode reflink flag when freeing blocks Darrick J. Wong
2016-09-30 7:47 ` Christoph Hellwig
2016-10-06 16:44 ` Brian Foster
2016-10-07 0:40 ` Darrick J. Wong
2016-09-30 3:09 ` [PATCH 39/63] xfs: cancel pending CoW reservations when destroying inodes Darrick J. Wong
2016-09-30 7:47 ` Christoph Hellwig
2016-10-06 16:44 ` Brian Foster
2016-10-07 0:42 ` Darrick J. Wong
2016-09-30 3:09 ` [PATCH 40/63] xfs: store in-progress CoW allocations in the refcount btree Darrick J. Wong
2016-09-30 7:49 ` Christoph Hellwig
2016-10-07 18:04 ` Brian Foster
2016-10-07 19:18 ` Darrick J. Wong
2016-09-30 3:10 ` [PATCH 41/63] xfs: reflink extents from one file to another Darrick J. Wong
2016-09-30 7:50 ` Christoph Hellwig
2016-10-07 18:04 ` Brian Foster
2016-10-07 19:44 ` Darrick J. Wong
2016-10-07 20:48 ` Brian Foster
2016-10-07 21:41 ` Darrick J. Wong
2016-10-10 13:17 ` Brian Foster
2016-09-30 3:10 ` [PATCH 42/63] xfs: add clone file and clone range vfs functions Darrick J. Wong
2016-09-30 7:51 ` Christoph Hellwig
2016-09-30 18:04 ` Darrick J. Wong
2016-10-07 18:04 ` Brian Foster
2016-10-07 20:31 ` Darrick J. Wong
2016-09-30 3:10 ` [PATCH 43/63] xfs: add dedupe range vfs function Darrick J. Wong
2016-09-30 7:53 ` Christoph Hellwig
2016-09-30 3:10 ` [PATCH 44/63] xfs: teach get_bmapx about shared extents and the CoW fork Darrick J. Wong
2016-09-30 7:53 ` Christoph Hellwig
2016-09-30 3:10 ` [PATCH 45/63] xfs: swap inode reflink flags when swapping inode extents Darrick J. Wong
2016-09-30 7:54 ` Christoph Hellwig
2016-09-30 3:10 ` [PATCH 46/63] xfs: unshare a range of blocks via fallocate Darrick J. Wong
2016-09-30 7:54 ` Christoph Hellwig
2016-10-07 18:05 ` Brian Foster
2016-10-07 20:26 ` Darrick J. Wong
2016-10-07 20:58 ` Brian Foster
2016-10-07 21:15 ` Darrick J. Wong
2016-10-07 22:25 ` Dave Chinner
2016-10-10 17:05 ` Darrick J. Wong
2016-09-30 3:10 ` [PATCH 47/63] xfs: create a separate cow extent size hint for the allocator Darrick J. Wong
2016-09-30 7:55 ` Christoph Hellwig
2016-09-30 3:10 ` [PATCH 48/63] xfs: preallocate blocks for worst-case btree expansion Darrick J. Wong
2016-09-30 8:19 ` Christoph Hellwig
2016-10-12 18:44 ` Brian Foster
2016-10-12 20:52 ` Darrick J. Wong
2016-10-12 22:42 ` Brian Foster
2016-12-06 19:32 ` Darrick J. Wong
2016-12-07 11:53 ` Brian Foster
2016-12-08 6:14 ` Darrick J. Wong
2016-09-30 3:10 ` [PATCH 49/63] xfs: don't allow reflink when the AG is low on space Darrick J. Wong
2016-09-30 8:19 ` Christoph Hellwig
2016-09-30 3:11 ` [PATCH 50/63] xfs: try other AGs to allocate a BMBT block Darrick J. Wong
2016-09-30 8:20 ` Christoph Hellwig
2016-09-30 3:11 ` [PATCH 51/63] xfs: garbage collect old cowextsz reservations Darrick J. Wong
2016-09-30 8:23 ` Christoph Hellwig
2016-09-30 3:11 ` [PATCH 52/63] xfs: increase log reservations for reflink Darrick J. Wong
2016-09-30 8:23 ` Christoph Hellwig
2016-09-30 3:11 ` [PATCH 53/63] xfs: add shared rmap map/unmap/convert log item types Darrick J. Wong
2016-09-30 8:24 ` Christoph Hellwig
2016-09-30 3:11 ` [PATCH 54/63] xfs: use interval query for rmap alloc operations on shared files Darrick J. Wong
2016-09-30 8:24 ` Christoph Hellwig
2016-09-30 3:11 ` [PATCH 55/63] xfs: convert unwritten status of reverse mappings for " Darrick J. Wong
2016-09-30 8:25 ` Christoph Hellwig
2016-09-30 3:11 ` [PATCH 56/63] xfs: set a default CoW extent size of 32 blocks Darrick J. Wong
2016-09-30 8:25 ` Christoph Hellwig
2016-09-30 3:11 ` [PATCH 57/63] xfs: check for invalid inode reflink flags Darrick J. Wong
2016-09-30 8:26 ` Christoph Hellwig
2016-09-30 3:11 ` [PATCH 58/63] xfs: don't mix reflink and DAX mode for now Darrick J. Wong
2016-09-30 8:26 ` Christoph Hellwig
2016-09-30 3:12 ` [PATCH 59/63] xfs: simulate per-AG reservations being critically low Darrick J. Wong
2016-09-30 8:27 ` Christoph Hellwig
2016-09-30 3:12 ` [PATCH 60/63] xfs: recognize the reflink feature bit Darrick J. Wong
2016-09-30 8:27 ` Christoph Hellwig
2016-09-30 3:12 ` [PATCH 61/63] xfs: various swapext cleanups Darrick J. Wong
2016-09-30 8:28 ` Christoph Hellwig
2016-09-30 3:12 ` [PATCH 62/63] xfs: refactor swapext code Darrick J. Wong
2016-09-30 8:28 ` Christoph Hellwig
2016-09-30 3:12 ` [PATCH 63/63] xfs: implement swapext for rmap filesystems Darrick J. Wong
2016-09-30 9:00 ` [PATCH v10 00/63] xfs: add reflink and dedupe support Christoph Hellwig
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20161004214413.GC8642@birch.djwong.org \
--to=darrick.wong@oracle.com \
--cc=bfoster@redhat.com \
--cc=david@fromorbit.com \
--cc=linux-xfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).