linux-xfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: "Darrick J. Wong" <darrick.wong@oracle.com>
To: Eric Sandeen <sandeen@sandeen.net>
Cc: sandeen@redhat.com, linux-xfs@vger.kernel.org
Subject: Re: [PATCH 04/10] xfs_scrub: schedule and manage optimizations/repairs to the filesystem
Date: Wed, 25 Jul 2018 18:17:38 -0700	[thread overview]
Message-ID: <20180726011738.GB30972@magnolia> (raw)
In-Reply-To: <1e05bca4-1d65-af25-e6b2-c6e721cf5a42@sandeen.net>

On Wed, Jul 25, 2018 at 06:06:00PM -0700, Eric Sandeen wrote:
> On 6/26/18 7:48 PM, Darrick J. Wong wrote:
> > From: Darrick J. Wong <darrick.wong@oracle.com>
> > 
> > Teach xfs_scrub to remember scrub requests that failed (or indicated
> > that optimization is a possibility) as action items.  Depending on the
> > circumstances, certain items are acted upon immediately (e.g. metadata
> > that needs to be healthy in order to finish the scan, or files that are
> > already open) or deferred until later.  Expand the repair phase to
> > deal with the deferred actions.
> > 
> > Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
> > ---
> >  man/man8/xfs_scrub.8 |   38 ++++++-
> >  scrub/Makefile       |    2 
> >  scrub/phase1.c       |    7 +
> >  scrub/phase2.c       |   59 ++++++++++
> >  scrub/phase3.c       |   42 +++++--
> >  scrub/phase4.c       |   76 +++++++++++++
> >  scrub/repair.c       |  286 ++++++++++++++++++++++++++++++++++++++++++++++++++
> >  scrub/repair.h       |   42 +++++++
> >  scrub/scrub.c        |  130 ++++++++++++++++-------
> >  scrub/scrub.h        |   42 +++++--
> >  scrub/xfs_scrub.c    |   44 +++++++-
> >  scrub/xfs_scrub.h    |    1 
> >  12 files changed, 690 insertions(+), 79 deletions(-)
> >  create mode 100644 scrub/repair.c
> >  create mode 100644 scrub/repair.h
> > 
> > 
> > diff --git a/man/man8/xfs_scrub.8 b/man/man8/xfs_scrub.8
> > index 680ef72b..18948a4e 100644
> > --- a/man/man8/xfs_scrub.8
> > +++ b/man/man8/xfs_scrub.8
> > @@ -1,6 +1,6 @@
> >  .TH xfs_scrub 8
> >  .SH NAME
> > -xfs_scrub \- check the contents of a mounted XFS filesystem
> > +xfs_scrub \- check and repair the contents of a mounted XFS filesystem
> >  .SH SYNOPSIS
> >  .B xfs_scrub
> >  [
> > @@ -108,10 +108,40 @@ Optimizations supported by this program include, but are not limited to:
> >  Instructing the underlying storage to discard unused extents via the
> >  .B TRIM
> >  ioctl.
> > +.IP \[bu]
> > +Updating secondary superblocks to match the primary superblock.
> > +.IP \[bu]
> > +Turning off shared block write checks for files that no longer share blocks.
> >  .SH REPAIRS
> > -This program currently does not support making any repairs.
> > -Corruptions can only be fixed by unmounting the filesystem and running
> > -.BR xfs_repair (8).
> > +Repairs are performed by calling into the kernel.
> > +This limits the scope of repair activities to rebuilding primary data
> > +structures from secondary data structures, or secondary structures from
> > +primary structures.
> > +The existence of secondary data structures may require features that can
> > +only be turned on from
> > +.BR mkfs.xfs (8).
> > +If errors cannot be repaired, the filesystem must be
> > +unmounted and
> > +.BR xfs_repair (8)
> > +run.
> > +Repairs supported by the kernel include, but are not limited to:
> > +.IP \[bu] 2
> > +Reconstructing extent allocation data.
> > +.IP \[bu]
> > +Rebuilding free space information.
> > +.IP \[bu]
> > +Rebuilding inode indexes.
> > +.IP \[bu]
> > +Fixing minor corruptions of inode records.
> > +.IP \[bu]
> > +Recalculating reference count information.
> > +.IP \[bu]
> > +Reconstructing reverse mapping data from primary extent allocation data.
> > +.IP \[bu]
> > +Scheduling a quotacheck for the next mount.
> > +.PP
> > +If corrupt metadata is successfully repaired, this program will log that
> > +a repair has succeeded instead of a corruption report.
> 
> for 4.18 I think we should only list what's possible, right?
> But, um, what is currently possible in 4.18?

Secondary superblocks only.

> Maybe I'll just add a "may include ..." and a "depending on the capabilities
> of the running kernel" to give us some wiggle room.

/me shrugs, happy to resubmit that list w/ just the secondary supers and
then to send more patches with each release as/if I get things
reviewed.

> As for the rest, um, seems ok AFAICT (which isn't that far).  Minor nit:
> 
> ...
> 
> >  static void
> >  report_outcome(
> >  	struct scrub_ctx	*ctx)
> > @@ -525,9 +555,16 @@ report_outcome(
> >  	 * setting up the scrub and we actually saw corruptions.  Warnings
> >  	 * are not corruptions.
> >  	 */
> > -	if (ctx->scrub_setup_succeeded && total_errors > 0)
> > -		fprintf(stderr, _("%s: Unmount and run xfs_repair.\n"),
> > -				ctx->mntpoint);
> > +	if (ctx->scrub_setup_succeeded && total_errors > 0) {
> > +		char		*msg;
> > +
> > +		if (ctx->mode == SCRUB_MODE_DRY_RUN)
> > +			msg = _("%s: Re-run xfs_scrub without -n.\n");
> 
> "... to repair?"

Yes.

> Is it possible that we found something that can't actually be repaired
> online, and so they'll get:
> 
> # xfs_scrub -n
> "Re-run xfs_scrub without -n"
> # xfs_scrub
> "Unmount and run xfs_repair."

Right.  First scrub found something and told you how to fix it, but then
you tried that and it couldn't fix it so it told you about ye olde
repair tool.

> 
> Maybe that's ok ... there's no way to keep track of which found problems
> are repairable by the running kernel, is there?

Not really, I'm afraid.  Either the kernel has a function to repair
things or it doesn't, and there's no good way to track at a finer
grained level than that.

--D

> -Eric
> --
> To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

  reply	other threads:[~2018-07-26  2:32 UTC|newest]

Thread overview: 29+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2018-06-27  2:47 [PATCH 00/10] xfsprogs-4.18: mostly scrub/repair stuff Darrick J. Wong
2018-06-27  2:47 ` [PATCH 01/10] libxfs: remove crc32 functions Darrick J. Wong
2018-06-27  2:47 ` [PATCH 02/10] libfrog: move crc32c code out of libxfs Darrick J. Wong
2018-06-27  2:48 ` [PATCH 03/10] xfs_scrub: destroy workqueues when erroring out Darrick J. Wong
2018-07-26  0:29   ` Eric Sandeen
2018-06-27  2:48 ` [PATCH 04/10] xfs_scrub: schedule and manage optimizations/repairs to the filesystem Darrick J. Wong
2018-07-26  1:06   ` Eric Sandeen
2018-07-26  1:17     ` Darrick J. Wong [this message]
2018-06-27  2:48 ` [PATCH 05/10] xfs_scrub: allow developers to force repairs Darrick J. Wong
2018-07-26  1:07   ` Eric Sandeen
2018-06-27  2:48 ` [PATCH 06/10] xfs_scrub: don't error out if an optimize-only repair isn't supported Darrick J. Wong
2018-07-26  1:09   ` Eric Sandeen
2018-06-27  2:48 ` [PATCH 07/10] xfs_scrub: rename NOFIX_COMPLAIN to be less confusing Darrick J. Wong
2018-07-26  1:10   ` Eric Sandeen
2018-06-27  2:48 ` [PATCH 08/10] xfs_scrub: only retry non-permanent repair failures Darrick J. Wong
2018-07-26  1:16   ` Eric Sandeen
2018-07-26  1:19     ` Darrick J. Wong
2018-06-27  2:48 ` [PATCH 09/10] xfs_io: wire up repair ioctl stuff Darrick J. Wong
2018-07-26  1:23   ` Eric Sandeen
2018-06-27  2:48 ` [PATCH 10/10] xfs_repair: use libxfs extsize/cowextsize validation helpers Darrick J. Wong
2018-06-27  3:25   ` Eric Sandeen
2018-06-28 17:28     ` Darrick J. Wong
2018-06-28 17:29 ` [PATCH 11/10] xfs_repair: clear extent size hints when clearing inode core Darrick J. Wong
2018-06-28 19:24   ` Eric Sandeen
2018-06-28 17:30 ` [PATCH 12/10] xfs_repair: use libxfs extsize/cowextsize validation helpers Darrick J. Wong
2018-07-26  1:44   ` Eric Sandeen
2018-07-26  1:48     ` Eric Sandeen
2018-07-26 20:38   ` [PATCH v2 " Darrick J. Wong
2018-07-27 22:44     ` Eric Sandeen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20180726011738.GB30972@magnolia \
    --to=darrick.wong@oracle.com \
    --cc=linux-xfs@vger.kernel.org \
    --cc=sandeen@redhat.com \
    --cc=sandeen@sandeen.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).