From: Brian Foster <bfoster@redhat.com>
To: Dave Chinner <david@fromorbit.com>
Cc: xfs@oss.sgi.com
Subject: Re: [PATCH v3 14/20] xfsprogs/repair: phase 2 finobt scan
Date: Wed, 23 Apr 2014 16:01:52 -0400 [thread overview]
Message-ID: <20140423200151.GD5225@bfoster.bfoster> (raw)
In-Reply-To: <20140423061945.GO15995@dastard>
On Wed, Apr 23, 2014 at 04:19:45PM +1000, Dave Chinner wrote:
> On Thu, Apr 10, 2014 at 12:11:04PM -0400, Brian Foster wrote:
> > If one exists, scan the free inode btree in phase 2 of xfs_repair.
> > We use the same general infrastructure as for the inobt scan, but
> > trigger finobt chunk scan logic in in scan_inobt() via the magic
> > value.
> >
> > The new scan_single_finobt_chunk() function is similar to the inobt
> > equivalent with some finobt specific logic. We can expect that
> > underlying inode chunk blocks are already marked used due to the
> > previous inobt scan. We can also expect to find every record
> > tracked by the finobt already accounted for in the in-core tree
> > with equivalent (and internally consistent) inobt record data.
> >
> > Spit out a warning on any divergences from the above and add the
> > inodes referenced by the current finobt record to the appropriate
> > in-core tree.
> >
> > Signed-off-by: Brian Foster <bfoster@redhat.com>
> ....
> > + /*
> > + * on multi-block block chunks, all chunks start
> > + * at the beginning of the block. with multi-chunk
> > + * blocks, all chunks must start on 64-inode boundaries
> > + * since each block can hold N complete chunks. if
> > + * fs has aligned inodes, all chunks must start
> > + * at a fs_ino_alignment*N'th agbno. skip recs
> > + * with badly aligned starting inodes.
> > + */
>
> Use all 80 columns for the comment ;)
>
Yep, I'll fix this and the others...
...
> > +
> > + if (!suspect) {
> > + /*
> > + * inodes previously inserted into the uncertain tree should be
> > + * superceded by these when the uncertain tree is processed
> > + */
> > + nfree = 0;
> > + if (XFS_INOBT_IS_FREE_DISK(rp, 0)) {
> > + nfree++;
> > + ino_rec = set_inode_free_alloc(mp, agno, ino);
> > + } else {
> > + ino_rec = set_inode_used_alloc(mp, agno, ino);
> > + }
> > + for (j = 1; j < XFS_INODES_PER_CHUNK; j++) {
> > + if (XFS_INOBT_IS_FREE_DISK(rp, j)) {
> > + nfree++;
> > + set_inode_free(ino_rec, j);
> > + } else {
> > + set_inode_used(ino_rec, j);
> > + }
> > + }
> > + } else {
> > + /*
> > + * this should handle the case where the inobt scan may have
> > + * already added uncertain inodes
> > + */
> > + nfree = 0;
> > + for (j = 0; j < XFS_INODES_PER_CHUNK; j++) {
> > + if (XFS_INOBT_IS_FREE_DISK(rp, j)) {
> > + add_aginode_uncertain(mp, agno, ino + j, 1);
> > + nfree++;
> > + } else {
> > + add_aginode_uncertain(mp, agno, ino + j, 0);
> > + }
> > + }
> > + }
> > +
> > +check_freecount:
> > +
> > + if (nfree != be32_to_cpu(rp->ir_freecount)) {
> > + do_warn(
> > +_("finobt ir_freecount/free mismatch, inode chunk %d/%u, freecount %d nfree %d\n"),
> > + agno, ino, be32_to_cpu(rp->ir_freecount), nfree);
> > + }
> > +
> > + if (!nfree) {
> > + do_warn(
> > +_("finobt record with no free inodes, inode chunk %d/%u\n"), agno, ino);
> > + }
>
> Shouldn't both of these increment suspect?
>
So at this point of the scan, we're processing through finobt leaf
blocks. Setting suspect has two side effects that I can see:
1.) Suppressing the additional, more granular checks for the
records/inodes that make up the remainder of the block.
2.) Setting bad_ino_btree, which causes us to skip phases 6 and 7.
I think my thought process was that side effect #2 was generally too
heavy handed for a scenario where the freecount doesn't happen to match
or we've failed to remove a fully allocated record from the finobt.
Furthermore, we already increment suspect in this function if either the
allocation state of the inode chunk blocks or the allocation state of
any individual inode is unexpected based on the inobt state. The
additional value of the freecount checks is to catch any kind of
intra-record corruption or inter-freecount corruption across the trees,
which IIUC would be fixed naturally provided nothing else is wrong
(e.g., suspect remains 0) when the trees are regenerated.
In other words, if inode allocation state does differ between the trees,
we already do set suspect. If these warnings fire without suspect being
set, it means we just need to fix up the freecount and there is no
broader tree corruption. Thoughts?
NOTE:
When looking through this again, I think there is a hole in the
allocation state logic in that we don't catch a case where the finobt
inode could be allocated and the inobt inode marked as free:
if (first_rec) {
...
nfree = 0;
for (j = 0; j < XFS_INODES_PER_CHUNK; j++) {
if (XFS_INOBT_IS_FREE_DISK(rp, j)) {
nfree++;
if (!suspect && !is_inode_free(first_rec, j))
suspect++;
}
/*
* XXX: else if the finobt inode is allocated, confirm
* here that the inobt inode is allocated as well!
*/
}
goto check_freecount;
}
I think something like this might work:
if (first_rec) {
...
nfree = 0;
for (j = 0; j < XFS_INODES_PER_CHUNK; j++) {
int isfree = XFS_INOBT_IS_FREE_DISK(rp, j);
if (isfree)
nfree++;
if (!suspect &&
isfree != is_inode_free(first_rec, j))
suspect++;
}
goto check_freecount;
}
Brian
> Cheers,
>
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
next prev parent reply other threads:[~2014-04-23 20:02 UTC|newest]
Thread overview: 36+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-04-10 16:10 [PATCH v3 00/20] xfsprogs: introduce the free inode btree Brian Foster
2014-04-10 16:10 ` [PATCH v3 01/20] xfs: refactor xfs_ialloc_btree.c to support multiple inobt numbers Brian Foster
2014-04-10 16:10 ` [PATCH v3 02/20] xfs: reserve v5 superblock read-only compat. feature bit for finobt Brian Foster
2014-04-10 16:10 ` [PATCH v3 03/20] xfs: support the XFS_BTNUM_FINOBT free inode btree type Brian Foster
2014-04-10 16:10 ` [PATCH v3 04/20] xfs: update inode allocation/free transaction reservations for finobt Brian Foster
2014-04-10 16:10 ` [PATCH v3 05/20] xfs: insert newly allocated inode chunks into the finobt Brian Foster
2014-04-10 16:10 ` [PATCH v3 06/20] xfs: use and update the finobt on inode allocation Brian Foster
2014-04-10 16:10 ` [PATCH v3 07/20] xfs: refactor xfs_difree() inobt bits into xfs_difree_inobt() helper Brian Foster
2014-04-10 16:10 ` [PATCH v3 08/20] xfs: update the finobt on inode free Brian Foster
2014-04-10 16:10 ` [PATCH v3 09/20] xfs: report finobt status in fs geometry Brian Foster
2014-04-10 16:11 ` [PATCH v3 10/20] xfs: enable the finobt feature on v5 superblocks Brian Foster
2014-04-10 16:11 ` [PATCH v3 11/20] xfsprogs/mkfs: finobt mkfs support Brian Foster
2014-04-23 6:03 ` Dave Chinner
2014-04-23 20:01 ` Brian Foster
2014-04-10 16:11 ` [PATCH v3 12/20] xfsprogs/db: finobt support Brian Foster
2014-04-10 16:11 ` [PATCH v3 13/20] xfsprogs/repair: account for finobt in ag 0 geometry pre-calculation Brian Foster
2014-04-23 6:12 ` Dave Chinner
2014-04-23 20:02 ` Brian Foster
2014-04-23 22:53 ` Dave Chinner
2014-04-10 16:11 ` [PATCH v3 14/20] xfsprogs/repair: phase 2 finobt scan Brian Foster
2014-04-23 6:10 ` Dave Chinner
2014-04-23 6:19 ` Dave Chinner
2014-04-23 20:01 ` Brian Foster [this message]
2014-04-23 23:06 ` Dave Chinner
2014-04-24 0:39 ` Brian Foster
2014-04-10 16:11 ` [PATCH v3 15/20] xfsprogs/repair: pass btree block magic as param to build_ino_tree() Brian Foster
2014-04-10 16:11 ` [PATCH v3 16/20] xfsprogs/repair: pull the build_agi() call up out of the inode tree build Brian Foster
2014-04-10 16:11 ` [PATCH v3 17/20] xfsprogs/repair: helpers for finding in-core inode records w/ free inodes Brian Foster
2014-04-23 6:24 ` Dave Chinner
2014-04-23 20:02 ` Brian Foster
2014-04-23 22:58 ` Dave Chinner
2014-04-10 16:11 ` [PATCH v3 18/20] xfsprogs/repair: reconstruct the finobt in phase 5 Brian Foster
2014-04-10 16:11 ` [PATCH v3 19/20] xfsprogs/growfs: report finobt status in fs geometry (xfs_info) Brian Foster
2014-04-10 16:11 ` [PATCH v3 20/20] xfsprogs/db: add finobt support to metadump Brian Foster
2014-04-23 6:35 ` [PATCH v3 00/20] xfsprogs: introduce the free inode btree Dave Chinner
2014-04-23 20:02 ` Brian Foster
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20140423200151.GD5225@bfoster.bfoster \
--to=bfoster@redhat.com \
--cc=david@fromorbit.com \
--cc=xfs@oss.sgi.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).