From: "Darrick J. Wong" <darrick.wong@oracle.com>
To: Brian Foster <bfoster@redhat.com>
Cc: linux-xfs@vger.kernel.org
Subject: Re: [PATCH 7/9] xfs: move bmbt owner change to last step of extent swap
Date: Fri, 25 Aug 2017 08:57:49 -0700
Message-ID: <20170825155749.GP4796@magnolia>
In-Reply-To: <20170825150557.43010-8-bfoster@redhat.com>

On Fri, Aug 25, 2017 at 11:05:55AM -0400, Brian Foster wrote:
> The extent swap operation currently resets bmbt block owners before
> the inode forks are swapped. The bmbt buffers are marked as ordered
> so they do not have to be physically logged in the transaction.
>
> This use of ordered buffers is not safe as bmbt buffers may have
> been previously physically logged. The bmbt owner change algorithm
> needs to be updated to physically log buffers that are already dirty
> when/if they are encountered. This means that an extent swap will
> eventually require multiple rolling transactions to handle large
> btrees. In addition, all inode related changes must be logged before
> the bmbt owner change scan begins and can roll the transaction for
> the first time to preserve fs consistency via log recovery.
>
> In preparation for such fixes to the bmbt owner change algorithm,
> refactor the bmbt scan out of the extent fork swap code to the last
> operation before the transaction is committed. Update
> xfs_swap_extent_forks() to only set the inode log flags when an
> owner change scan is necessary. Update xfs_swap_extents() to trigger
> the owner change based on the inode log flags. Note that since the
> owner change now occurs after the extent fork swap, the inode btrees
> must be fixed up with the inode number of the current inode (similar
> to log recovery).
>
> Signed-off-by: Brian Foster <bfoster@redhat.com>
Looks ok I think,
Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com>
> ---
> fs/xfs/xfs_bmap_util.c | 44 ++++++++++++++++++++++++++------------------
> 1 file changed, 26 insertions(+), 18 deletions(-)
>
> diff --git a/fs/xfs/xfs_bmap_util.c b/fs/xfs/xfs_bmap_util.c
> index 93e9552..ee8fb9a 100644
> --- a/fs/xfs/xfs_bmap_util.c
> +++ b/fs/xfs/xfs_bmap_util.c
> @@ -1840,29 +1840,18 @@ xfs_swap_extent_forks(
> }
>
> /*
> - * Before we've swapped the forks, lets set the owners of the forks
> - * appropriately. We have to do this as we are demand paging the btree
> - * buffers, and so the validation done on read will expect the owner
> - * field to be correctly set. Once we change the owners, we can swap the
> - * inode forks.
> + * Btree format (v3) inodes have the inode number stamped in the bmbt
> + * block headers. We can't start changing the bmbt blocks until the
> + * inode owner change is logged so recovery does the right thing in the
> + * event of a crash. Set the owner change log flags now and leave the
> + * bmbt scan as the last step.
> */
> if (ip->i_d.di_version == 3 &&
> - ip->i_d.di_format == XFS_DINODE_FMT_BTREE) {
> + ip->i_d.di_format == XFS_DINODE_FMT_BTREE)
> (*target_log_flags) |= XFS_ILOG_DOWNER;
> - error = xfs_bmbt_change_owner(tp, ip, XFS_DATA_FORK,
> - tip->i_ino, NULL);
> - if (error)
> - return error;
> - }
> -
> if (tip->i_d.di_version == 3 &&
> - tip->i_d.di_format == XFS_DINODE_FMT_BTREE) {
> + tip->i_d.di_format == XFS_DINODE_FMT_BTREE)
> (*src_log_flags) |= XFS_ILOG_DOWNER;
> - error = xfs_bmbt_change_owner(tp, tip, XFS_DATA_FORK,
> - ip->i_ino, NULL);
> - if (error)
> - return error;
> - }
>
> /*
> * Swap the data forks of the inodes
> @@ -2092,6 +2081,25 @@ xfs_swap_extents(
> xfs_trans_log_inode(tp, tip, target_log_flags);
>
> /*
> + * The extent forks have been swapped, but crc=1,rmapbt=0 filesystems
> + * have inode number owner values in the bmbt blocks that still refer to
> + * the old inode. Scan each bmbt to fix up the owner values with the
> + * inode number of the current inode.
> + */
> + if (src_log_flags & XFS_ILOG_DOWNER) {
> + error = xfs_bmbt_change_owner(tp, ip, XFS_DATA_FORK,
> + ip->i_ino, NULL);
> + if (error)
> + goto out_trans_cancel;
> + }
> + if (target_log_flags & XFS_ILOG_DOWNER) {
> + error = xfs_bmbt_change_owner(tp, tip, XFS_DATA_FORK,
> + tip->i_ino, NULL);
> + if (error)
> + goto out_trans_cancel;
> + }
> +
> + /*
> * If this is a synchronous mount, make sure that the
> * transaction goes to disk before returning to the user.
> */
> --
> 2.9.5
>
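
The ordering described in the commit message can be modelled with a small
userspace sketch. This is not kernel code: the toy_* types and functions and
LOG_DOWNER are hypothetical stand-ins for the xfs inode, xfs_swap_extent_forks(),
xfs_bmbt_change_owner() and XFS_ILOG_DOWNER, and transactions and logging are
left out entirely. It only shows that the fork swap records which inodes need
an owner fix-up, and that the fix-up runs afterwards using each inode's own
number.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define TOY_NBLOCKS 3
#define LOG_DOWNER  0x1u	/* hypothetical stand-in for XFS_ILOG_DOWNER */

/* A data fork reduced to the owner stamps of its btree blocks. */
struct toy_fork {
	uint64_t block_owner[TOY_NBLOCKS];
	bool     is_btree;
};

struct toy_inode {
	uint64_t        ino;
	struct toy_fork data;
};

/* Swap the forks and record who needs an owner fix-up; touch no blocks. */
static void toy_swap_forks(struct toy_inode *ip, struct toy_inode *tip,
			   unsigned int *src_flags, unsigned int *target_flags)
{
	/* ip's btree fork ends up in tip, so the flag goes to the target. */
	if (ip->data.is_btree)
		*target_flags |= LOG_DOWNER;
	if (tip->data.is_btree)
		*src_flags |= LOG_DOWNER;

	struct toy_fork tmp = ip->data;
	ip->data = tip->data;
	tip->data = tmp;
}

/* Stamp every block of the fork with the number of the inode holding it. */
static void toy_change_owner(struct toy_inode *ip)
{
	assert(ip != NULL);
	for (size_t i = 0; i < TOY_NBLOCKS; i++)
		ip->data.block_owner[i] = ip->ino;
}

/* Owner fix-up is the last step, driven by the flags set during the swap. */
static void toy_swap_extents(struct toy_inode *ip, struct toy_inode *tip)
{
	unsigned int src_flags = 0, target_flags = 0;

	toy_swap_forks(ip, tip, &src_flags, &target_flags);
	/* The real code logs both inodes at this point. */
	if (src_flags & LOG_DOWNER)
		toy_change_owner(ip);
	if (target_flags & LOG_DOWNER)
		toy_change_owner(tip);
}

/* True when every block in the fork carries the holding inode's number. */
static bool toy_owners_match(const struct toy_inode *ip)
{
	for (size_t i = 0; i < TOY_NBLOCKS; i++)
		if (ip->data.block_owner[i] != ip->ino)
			return false;
	return true;
}
```

Calling toy_swap_extents() on two btree-format toy inodes leaves each holding
the other's former blocks, restamped with its own inode number, which is what
toy_owners_match() checks.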