linux-xfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Dave Chinner <david@fromorbit.com>
To: Brian Foster <bfoster@redhat.com>
Cc: xfs@oss.sgi.com
Subject: Re: [PATCH v3 14/20] xfsprogs/repair: phase 2 finobt scan
Date: Thu, 24 Apr 2014 09:06:43 +1000	[thread overview]
Message-ID: <20140423230643.GU15995@dastard> (raw)
In-Reply-To: <20140423200151.GD5225@bfoster.bfoster>

On Wed, Apr 23, 2014 at 04:01:52PM -0400, Brian Foster wrote:
> On Wed, Apr 23, 2014 at 04:19:45PM +1000, Dave Chinner wrote:
> > On Thu, Apr 10, 2014 at 12:11:04PM -0400, Brian Foster wrote:
> > > If one exists, scan the free inode btree in phase 2 of xfs_repair.
> > > We use the same general infrastructure as for the inobt scan, but
> > > trigger finobt chunk scan logic in in scan_inobt() via the magic
> > > value.
> > > 
> > > The new scan_single_finobt_chunk() function is similar to the inobt
> > > equivalent with some finobt specific logic. We can expect that
> > > underlying inode chunk blocks are already marked used due to the
> > > previous inobt scan. We can also expect to find every record
> > > tracked by the finobt already accounted for in the in-core tree
> > > with equivalent (and internally consistent) inobt record data.
> > > 
> > > Spit out a warning on any divergences from the above and add the
> > > inodes referenced by the current finobt record to the appropriate
> > > in-core tree.
> > > 
> > > Signed-off-by: Brian Foster <bfoster@redhat.com>
> > ....
> > > +
> > > +	if (!suspect) {
> > > +		/*
> > > +		 * inodes previously inserted into the uncertain tree should be
> > > +		 * superceded by these when the uncertain tree is processed
> > > +		 */
> > > +		nfree = 0;
> > > +		if (XFS_INOBT_IS_FREE_DISK(rp, 0)) {
> > > +			nfree++;
> > > +			ino_rec = set_inode_free_alloc(mp, agno, ino);
> > > +		} else  {
> > > +			ino_rec = set_inode_used_alloc(mp, agno, ino);
> > > +		}
> > > +		for (j = 1; j < XFS_INODES_PER_CHUNK; j++) {
> > > +			if (XFS_INOBT_IS_FREE_DISK(rp, j)) {
> > > +				nfree++;
> > > +				set_inode_free(ino_rec, j);
> > > +			} else  {
> > > +				set_inode_used(ino_rec, j);
> > > +			}
> > > +		}
> > > +	} else {
> > > +		/*
> > > +		 * this should handle the case where the inobt scan may have
> > > +		 * already added uncertain inodes
> > > +		 */
> > > +		nfree = 0;
> > > +		for (j = 0; j < XFS_INODES_PER_CHUNK; j++) {
> > > +			if (XFS_INOBT_IS_FREE_DISK(rp, j)) {
> > > +				add_aginode_uncertain(mp, agno, ino + j, 1);
> > > +				nfree++;
> > > +			} else {
> > > +				add_aginode_uncertain(mp, agno, ino + j, 0);
> > > +			}
> > > +		}
> > > +	}
> > > +
> > > +check_freecount:
> > > +
> > > +	if (nfree != be32_to_cpu(rp->ir_freecount)) {
> > > +		do_warn(
> > > +_("finobt ir_freecount/free mismatch, inode chunk %d/%u, freecount %d nfree %d\n"),
> > > +			agno, ino, be32_to_cpu(rp->ir_freecount), nfree);
> > > +	}
> > > +
> > > +	if (!nfree) {
> > > +		do_warn(
> > > +_("finobt record with no free inodes, inode chunk %d/%u\n"), agno, ino);
> > > +	}
> > 
> > Shouldn't both of these increment suspect?
> > 
> 
> So at this point of the scan, we're processing through finobt leaf
> blocks. Setting suspect has two side effects that I can see:
> 
> 1.) Suppressing the additional, more granular checks for the
> records/inodes that make up the remainder of the block.
> 
> 2.) Setting bad_ino_btree, which causes us to skip phases 6 and 7.
> 
> I think my thought process was that side effect #2 was generally too
> heavy handed for a scenario where the freecount doesn't happen to match
> or we've failed to remove a fully allocated record from the finobt.

Yes, that's true.

> Furthermore, we already increment suspect in this function if either the
> allocation state of the inode chunk blocks or the allocation state of
> any individual inode is unexpected based on the inobt state. The
> additional value of the freecount checks is to catch any kind of
> intra-record corruption or inter-freecount corruption across the trees,
> which IIUC would be fixed naturally provided nothing else is wrong
> (e.g., suspect remains 0) when the trees are regenerated.
> 
> In other words, if inode allocation state does differ between the trees,
> we already do set suspect. If these warnings fire without suspect being
> set, it means we just need to fix up the freecount and there is no
> broader tree corruption. Thoughts?

Seems reasonable. Perhaps a comment explaining this would be a good
idea - we dont have nearly enough comments in repair explaining how
it is cross-validating different pieces of metadata...

> NOTE:
> 
> When looking through this again, I think there is a hole in the
> allocation state logic in that we don't catch a case where the finobt
> inode could be allocated and the inobt inode marked as free:
> 
> 	if (first_rec) {
> 		...
> 
> 		nfree = 0;
> 		for (j = 0; j < XFS_INODES_PER_CHUNK; j++) {
> 			if (XFS_INOBT_IS_FREE_DISK(rp, j)) {
> 				nfree++;
> 				if (!suspect && !is_inode_free(first_rec, j))
> 					suspect++;
> 			}
> 			/*
> 			 * XXX: else if the finobt inode is allocated, confirm
> 			 * here that the inobt inode is allocated as well!
> 			 */
> 		}
> 		goto check_freecount;
> 	}
> 
> I think something like this might work:
> 
> 	if (first_rec) {
> 		...
> 
> 		nfree = 0;
> 		for (j = 0; j < XFS_INODES_PER_CHUNK; j++) {
> 			int isfree = XFS_INOBT_IS_FREE_DISK(rp, j);
> 
> 			if (isfree) 
> 				nfree++;
> 
> 			if (!suspect &&
> 			    isfree != is_inode_free(first_rec, j))
> 				suspect++;
> 		}
> 		goto check_freecount;
> 	}

Yes, that makes sense - we need to validate in both directions.
Again, a comment would be a useful reminder here. ;)

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

  reply	other threads:[~2014-04-23 23:07 UTC|newest]

Thread overview: 36+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-04-10 16:10 [PATCH v3 00/20] xfsprogs: introduce the free inode btree Brian Foster
2014-04-10 16:10 ` [PATCH v3 01/20] xfs: refactor xfs_ialloc_btree.c to support multiple inobt numbers Brian Foster
2014-04-10 16:10 ` [PATCH v3 02/20] xfs: reserve v5 superblock read-only compat. feature bit for finobt Brian Foster
2014-04-10 16:10 ` [PATCH v3 03/20] xfs: support the XFS_BTNUM_FINOBT free inode btree type Brian Foster
2014-04-10 16:10 ` [PATCH v3 04/20] xfs: update inode allocation/free transaction reservations for finobt Brian Foster
2014-04-10 16:10 ` [PATCH v3 05/20] xfs: insert newly allocated inode chunks into the finobt Brian Foster
2014-04-10 16:10 ` [PATCH v3 06/20] xfs: use and update the finobt on inode allocation Brian Foster
2014-04-10 16:10 ` [PATCH v3 07/20] xfs: refactor xfs_difree() inobt bits into xfs_difree_inobt() helper Brian Foster
2014-04-10 16:10 ` [PATCH v3 08/20] xfs: update the finobt on inode free Brian Foster
2014-04-10 16:10 ` [PATCH v3 09/20] xfs: report finobt status in fs geometry Brian Foster
2014-04-10 16:11 ` [PATCH v3 10/20] xfs: enable the finobt feature on v5 superblocks Brian Foster
2014-04-10 16:11 ` [PATCH v3 11/20] xfsprogs/mkfs: finobt mkfs support Brian Foster
2014-04-23  6:03   ` Dave Chinner
2014-04-23 20:01     ` Brian Foster
2014-04-10 16:11 ` [PATCH v3 12/20] xfsprogs/db: finobt support Brian Foster
2014-04-10 16:11 ` [PATCH v3 13/20] xfsprogs/repair: account for finobt in ag 0 geometry pre-calculation Brian Foster
2014-04-23  6:12   ` Dave Chinner
2014-04-23 20:02     ` Brian Foster
2014-04-23 22:53       ` Dave Chinner
2014-04-10 16:11 ` [PATCH v3 14/20] xfsprogs/repair: phase 2 finobt scan Brian Foster
2014-04-23  6:10   ` Dave Chinner
2014-04-23  6:19   ` Dave Chinner
2014-04-23 20:01     ` Brian Foster
2014-04-23 23:06       ` Dave Chinner [this message]
2014-04-24  0:39         ` Brian Foster
2014-04-10 16:11 ` [PATCH v3 15/20] xfsprogs/repair: pass btree block magic as param to build_ino_tree() Brian Foster
2014-04-10 16:11 ` [PATCH v3 16/20] xfsprogs/repair: pull the build_agi() call up out of the inode tree build Brian Foster
2014-04-10 16:11 ` [PATCH v3 17/20] xfsprogs/repair: helpers for finding in-core inode records w/ free inodes Brian Foster
2014-04-23  6:24   ` Dave Chinner
2014-04-23 20:02     ` Brian Foster
2014-04-23 22:58       ` Dave Chinner
2014-04-10 16:11 ` [PATCH v3 18/20] xfsprogs/repair: reconstruct the finobt in phase 5 Brian Foster
2014-04-10 16:11 ` [PATCH v3 19/20] xfsprogs/growfs: report finobt status in fs geometry (xfs_info) Brian Foster
2014-04-10 16:11 ` [PATCH v3 20/20] xfsprogs/db: add finobt support to metadump Brian Foster
2014-04-23  6:35 ` [PATCH v3 00/20] xfsprogs: introduce the free inode btree Dave Chinner
2014-04-23 20:02   ` Brian Foster

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140423230643.GU15995@dastard \
    --to=david@fromorbit.com \
    --cc=bfoster@redhat.com \
    --cc=xfs@oss.sgi.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).