From: Dave Chinner <david@fromorbit.com>
To: "Darrick J. Wong" <darrick.wong@oracle.com>
Cc: linux-xfs@vger.kernel.org
Subject: Re: [PATCH 13/21] xfs: repair inode records
Date: Tue, 3 Jul 2018 16:17:18 +1000 [thread overview]
Message-ID: <20180703061718.GJ2234@dastard> (raw)
In-Reply-To: <152986829144.3155.13577483407162701849.stgit@magnolia>
On Sun, Jun 24, 2018 at 12:24:51PM -0700, Darrick J. Wong wrote:
> From: Darrick J. Wong <darrick.wong@oracle.com>
>
> Try to reinitialize corrupt inodes, or clear the reflink flag
> if it's not needed.
>
> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
A comment somewhere that this is only attmepting to repair inodes
that have failed verifier checks on read would be good.
......
> +/* Make sure this buffer can pass the inode buffer verifier. */
> +STATIC void
> +xfs_repair_inode_buf(
> + struct xfs_scrub_context *sc,
> + struct xfs_buf *bp)
> +{
> + struct xfs_mount *mp = sc->mp;
> + struct xfs_trans *tp = sc->tp;
> + struct xfs_dinode *dip;
> + xfs_agnumber_t agno;
> + xfs_agino_t agino;
> + int ioff;
> + int i;
> + int ni;
> + int di_ok;
> + bool unlinked_ok;
> +
> + ni = XFS_BB_TO_FSB(mp, bp->b_length) * mp->m_sb.sb_inopblock;
> + agno = xfs_daddr_to_agno(mp, XFS_BUF_ADDR(bp));
> + for (i = 0; i < ni; i++) {
> + ioff = i << mp->m_sb.sb_inodelog;
> + dip = xfs_buf_offset(bp, ioff);
> + agino = be32_to_cpu(dip->di_next_unlinked);
> + unlinked_ok = (agino == NULLAGINO ||
> + xfs_verify_agino(sc->mp, agno, agino));
> + di_ok = dip->di_magic == cpu_to_be16(XFS_DINODE_MAGIC) &&
> + xfs_dinode_good_version(mp, dip->di_version);
> + if (di_ok && unlinked_ok)
> + continue;
Readability woul dbe better with:
unlinked_ok = false;
if (agino == NULLAGINO || xfs_verify_agino(sc->mp, agno, agino))
unlinked_ok = true;
di_ok = false;
if (dip->di_magic == cpu_to_be16(XFS_DINODE_MAGIC) &&
xfs_dinode_good_version(mp, dip->di_version))
di_ok = true;
if (di_ok && unlinked_ok)
continue;
Also, is there a need to check the inode CRC here?
> + dip->di_magic = cpu_to_be16(XFS_DINODE_MAGIC);
> + dip->di_version = 3;
> + if (!unlinked_ok)
> + dip->di_next_unlinked = cpu_to_be32(NULLAGINO);
> + xfs_dinode_calc_crc(mp, dip);
> + xfs_trans_buf_set_type(tp, bp, XFS_BLFT_DINO_BUF);
> + xfs_trans_log_buf(tp, bp, ioff, ioff + sizeof(*dip) - 1);
Hmmmm. how does this interact with other transactions in repair that
might have logged changes to the same in-core inode? If it was just
changing the unlinked pointer, then that would be ok, but
magic/version are overwritten by the inode item recovery...
> +/* Reinitialize things that never change in an inode. */
> +STATIC void
> +xfs_repair_inode_header(
> + struct xfs_scrub_context *sc,
> + struct xfs_dinode *dip)
> +{
> + dip->di_magic = cpu_to_be16(XFS_DINODE_MAGIC);
> + if (!xfs_dinode_good_version(sc->mp, dip->di_version))
> + dip->di_version = 3;
> + dip->di_ino = cpu_to_be64(sc->sm->sm_ino);
> + uuid_copy(&dip->di_uuid, &sc->mp->m_sb.sb_meta_uuid);
> + dip->di_gen = cpu_to_be32(sc->sm->sm_gen);
> +}
> +
> +/*
> + * Turn di_mode into /something/ recognizable.
> + *
> + * XXX: Ideally we'd try to read data block 0 to see if it's a directory.
> + */
> +STATIC void
> +xfs_repair_inode_mode(
> + struct xfs_dinode *dip)
> +{
> + uint16_t mode;
> +
> + mode = be16_to_cpu(dip->di_mode);
> + if (mode == 0 || xfs_mode_to_ftype(mode) != XFS_DIR3_FT_UNKNOWN)
> + return;
> +
> + /* bad mode, so we set it to a file that only root can read */
> + mode = S_IFREG;
> + dip->di_mode = cpu_to_be16(mode);
> + dip->di_uid = 0;
> + dip->di_gid = 0;
Not sure that's a good idea - if the mode is bad I don't think we
should expose it to anyone. Perhaps we need an orphan type
> +}
> +
> +/* Fix any conflicting flags that the verifiers complain about. */
> +STATIC void
> +xfs_repair_inode_flags(
> + struct xfs_scrub_context *sc,
> + struct xfs_dinode *dip)
> +{
> + struct xfs_mount *mp = sc->mp;
> + uint64_t flags2;
> + uint16_t mode;
> + uint16_t flags;
> +
> + mode = be16_to_cpu(dip->di_mode);
> + flags = be16_to_cpu(dip->di_flags);
> + flags2 = be64_to_cpu(dip->di_flags2);
> +
> + if (xfs_sb_version_hasreflink(&mp->m_sb) && S_ISREG(mode))
> + flags2 |= XFS_DIFLAG2_REFLINK;
> + else
> + flags2 &= ~(XFS_DIFLAG2_REFLINK | XFS_DIFLAG2_COWEXTSIZE);
> + if (flags & XFS_DIFLAG_REALTIME)
> + flags2 &= ~XFS_DIFLAG2_REFLINK;
> + if (flags2 & XFS_DIFLAG2_REFLINK)
> + flags2 &= ~XFS_DIFLAG2_DAX;
> + dip->di_flags = cpu_to_be16(flags);
> + dip->di_flags2 = cpu_to_be64(flags2);
> +}
> +
> +/* Make sure we don't have a garbage file size. */
> +STATIC void
> +xfs_repair_inode_size(
> + struct xfs_dinode *dip)
> +{
> + uint64_t size;
> + uint16_t mode;
> +
> + mode = be16_to_cpu(dip->di_mode);
> + size = be64_to_cpu(dip->di_size);
> + switch (mode & S_IFMT) {
> + case S_IFIFO:
> + case S_IFCHR:
> + case S_IFBLK:
> + case S_IFSOCK:
> + /* di_size can't be nonzero for special files */
> + dip->di_size = 0;
> + break;
> + case S_IFREG:
> + /* Regular files can't be larger than 2^63-1 bytes. */
> + dip->di_size = cpu_to_be64(size & ~(1ULL << 63));
> + break;
> + case S_IFLNK:
> + /* Catch over- or under-sized symlinks. */
> + if (size > XFS_SYMLINK_MAXLEN)
> + dip->di_size = cpu_to_be64(XFS_SYMLINK_MAXLEN);
> + else if (size == 0)
> + dip->di_size = cpu_to_be64(1);
Not sure this is valid - if the inode is in extent format then a
size of 1 is invalid and means the symlink will point to the
first byte in the data fork, and that could be anything....
> + break;
> + case S_IFDIR:
> + /* Directories can't have a size larger than 32G. */
> + if (size > XFS_DIR2_SPACE_SIZE)
> + dip->di_size = cpu_to_be64(XFS_DIR2_SPACE_SIZE);
> + else if (size == 0)
> + dip->di_size = cpu_to_be64(1);
Similar. A size of 1 is not valid for a directory.
> + break;
> + }
> +}
.....
> +
> +/* Inode didn't pass verifiers, so fix the raw buffer and retry iget. */
> +STATIC int
> +xfs_repair_inode_core(
> + struct xfs_scrub_context *sc)
> +{
> + struct xfs_imap imap;
> + struct xfs_buf *bp;
> + struct xfs_dinode *dip;
> + xfs_ino_t ino;
> + int error;
> +
> + /* Map & read inode. */
> + ino = sc->sm->sm_ino;
> + error = xfs_imap(sc->mp, sc->tp, ino, &imap, XFS_IGET_UNTRUSTED);
> + if (error)
> + return error;
> +
> + error = xfs_trans_read_buf(sc->mp, sc->tp, sc->mp->m_ddev_targp,
> + imap.im_blkno, imap.im_len, XBF_UNMAPPED, &bp, NULL);
> + if (error)
> + return error;
I'd like to see this check the inode isn't in-core after we've read
and locked the inode buffer, just to ensure we haven't raced with
another access.
> +
> + /* Make sure we can pass the inode buffer verifier. */
> + xfs_repair_inode_buf(sc, bp);
> + bp->b_ops = &xfs_inode_buf_ops;
> +
> + /* Fix everything the verifier will complain about. */
> + dip = xfs_buf_offset(bp, imap.im_boffset);
> + xfs_repair_inode_header(sc, dip);
> + xfs_repair_inode_mode(dip);
> + xfs_repair_inode_flags(sc, dip);
> + xfs_repair_inode_size(dip);
> + xfs_repair_inode_extsize_hints(sc, dip);
what if the inode failed the fork verifiers rather than the dinode
verifier?
> + * Fix problems that the verifiers don't care about. In general these are
> + * errors that don't cause problems elsewhere in the kernel that we can easily
> + * detect, so we don't check them all that rigorously.
> + */
> +
> +/* Make sure block and extent counts are ok. */
> +STATIC int
> +xfs_repair_inode_unchecked_blockcounts(
> + struct xfs_scrub_context *sc)
> +{
> + xfs_filblks_t count;
> + xfs_filblks_t acount;
> + xfs_extnum_t nextents;
> + int error;
> +
> + /* di_nblocks/di_nextents/di_anextents don't match up? */
> + error = xfs_bmap_count_blocks(sc->tp, sc->ip, XFS_DATA_FORK,
> + &nextents, &count);
> + if (error)
> + return error;
> + sc->ip->i_d.di_nextents = nextents;
> +
> + error = xfs_bmap_count_blocks(sc->tp, sc->ip, XFS_ATTR_FORK,
> + &nextents, &acount);
> + if (error)
> + return error;
> + sc->ip->i_d.di_anextents = nextents;
Should the returned extent/block counts be validity checked?
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com
next prev parent reply other threads:[~2018-07-03 6:17 UTC|newest]
Thread overview: 77+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-06-24 19:23 [PATCH v16 00/21] xfs-4.19: online repair support Darrick J. Wong
2018-06-24 19:23 ` [PATCH 01/21] xfs: don't assume a left rmap when allocating a new rmap Darrick J. Wong
2018-06-27 0:54 ` Dave Chinner
2018-06-28 21:11 ` Allison Henderson
2018-06-29 14:39 ` Darrick J. Wong
2018-06-24 19:23 ` [PATCH 02/21] xfs: add helper to decide if an inode has allocated cow blocks Darrick J. Wong
2018-06-27 1:02 ` Dave Chinner
2018-06-28 21:12 ` Allison Henderson
2018-06-24 19:23 ` [PATCH 03/21] xfs: refactor part of xfs_free_eofblocks Darrick J. Wong
2018-06-28 21:13 ` Allison Henderson
2018-06-24 19:23 ` [PATCH 04/21] xfs: repair the AGF and AGFL Darrick J. Wong
2018-06-27 2:19 ` Dave Chinner
2018-06-27 16:44 ` Allison Henderson
2018-06-27 23:37 ` Dave Chinner
2018-06-29 15:14 ` Darrick J. Wong
2018-06-28 17:25 ` Allison Henderson
2018-06-29 15:08 ` Darrick J. Wong
2018-06-28 21:14 ` Allison Henderson
2018-06-28 23:21 ` Dave Chinner
2018-06-29 1:35 ` Allison Henderson
2018-06-29 14:55 ` Darrick J. Wong
2018-06-24 19:24 ` [PATCH 05/21] xfs: repair the AGI Darrick J. Wong
2018-06-27 2:22 ` Dave Chinner
2018-06-28 21:15 ` Allison Henderson
2018-06-24 19:24 ` [PATCH 06/21] xfs: repair free space btrees Darrick J. Wong
2018-06-27 3:21 ` Dave Chinner
2018-07-04 2:15 ` Darrick J. Wong
2018-07-04 2:25 ` Dave Chinner
2018-06-30 17:36 ` Allison Henderson
2018-06-24 19:24 ` [PATCH 07/21] xfs: repair inode btrees Darrick J. Wong
2018-06-28 0:55 ` Dave Chinner
2018-07-04 2:22 ` Darrick J. Wong
2018-06-30 17:36 ` Allison Henderson
2018-06-30 18:30 ` Darrick J. Wong
2018-07-01 0:45 ` Allison Henderson
2018-06-24 19:24 ` [PATCH 08/21] xfs: defer iput on certain inodes while scrub / repair are running Darrick J. Wong
2018-06-28 23:37 ` Dave Chinner
2018-06-29 14:49 ` Darrick J. Wong
2018-06-24 19:24 ` [PATCH 09/21] xfs: finish our set of inode get/put tracepoints for scrub Darrick J. Wong
2018-06-24 19:24 ` [PATCH 10/21] xfs: introduce online scrub freeze Darrick J. Wong
2018-06-24 19:24 ` [PATCH 11/21] xfs: repair the rmapbt Darrick J. Wong
2018-07-03 5:32 ` Dave Chinner
2018-07-03 23:59 ` Darrick J. Wong
2018-07-04 8:44 ` Carlos Maiolino
2018-07-04 18:40 ` Darrick J. Wong
2018-07-04 23:21 ` Dave Chinner
2018-07-05 3:48 ` Darrick J. Wong
2018-07-05 7:03 ` Dave Chinner
2018-07-06 0:47 ` Darrick J. Wong
2018-07-06 1:08 ` Dave Chinner
2018-06-24 19:24 ` [PATCH 12/21] xfs: repair refcount btrees Darrick J. Wong
2018-07-03 5:50 ` Dave Chinner
2018-07-04 2:23 ` Darrick J. Wong
2018-06-24 19:24 ` [PATCH 13/21] xfs: repair inode records Darrick J. Wong
2018-07-03 6:17 ` Dave Chinner [this message]
2018-07-04 0:16 ` Darrick J. Wong
2018-07-04 1:03 ` Dave Chinner
2018-07-04 1:30 ` Darrick J. Wong
2018-06-24 19:24 ` [PATCH 14/21] xfs: zap broken inode forks Darrick J. Wong
2018-07-04 2:07 ` Dave Chinner
2018-07-04 3:26 ` Darrick J. Wong
2018-06-24 19:25 ` [PATCH 15/21] xfs: repair inode block maps Darrick J. Wong
2018-07-04 3:00 ` Dave Chinner
2018-07-04 3:41 ` Darrick J. Wong
2018-06-24 19:25 ` [PATCH 16/21] xfs: repair damaged symlinks Darrick J. Wong
2018-07-04 5:45 ` Dave Chinner
2018-07-04 18:45 ` Darrick J. Wong
2018-06-24 19:25 ` [PATCH 17/21] xfs: repair extended attributes Darrick J. Wong
2018-07-06 1:03 ` Dave Chinner
2018-07-06 3:10 ` Darrick J. Wong
2018-06-24 19:25 ` [PATCH 18/21] xfs: scrub should set preen if attr leaf has holes Darrick J. Wong
2018-06-29 2:52 ` Dave Chinner
2018-06-24 19:25 ` [PATCH 19/21] xfs: repair quotas Darrick J. Wong
2018-07-06 1:50 ` Dave Chinner
2018-07-06 3:16 ` Darrick J. Wong
2018-06-24 19:25 ` [PATCH 20/21] xfs: implement live quotacheck as part of quota repair Darrick J. Wong
2018-06-24 19:25 ` [PATCH 21/21] xfs: add online scrub/repair for superblock counters Darrick J. Wong
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20180703061718.GJ2234@dastard \
--to=david@fromorbit.com \
--cc=darrick.wong@oracle.com \
--cc=linux-xfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox