From: Eric Sandeen <sandeen@sandeen.net>
To: Eric Sandeen <sandeen@redhat.com>
Cc: "'linux-xfs@oss.sgi.com'" <linux-xfs@oss.sgi.com>
Subject: Re: [PATCH, RFC] xfs: don't break from growfs ag update loop on error
Date: Mon, 09 Sep 2013 15:36:18 -0500 [thread overview]
Message-ID: <522E3142.7090501@sandeen.net> (raw)
In-Reply-To: <520D1AAC.8090701@redhat.com>
On 8/15/13 1:15 PM, Eric Sandeen wrote:
> When xfs_growfs_data_private() is updating backup superblocks,
> it bails out on the first error encountered, whether reading or
> writing:
Any thoughts on this one? W/ the verifiers, we have a higher
chance of encountering an error, and leaving the rest of the
supers un-updated. Repair will then possibly revert the fs to
it's pre-growfs state, and data loss will ensue...
Thanks,
-Eric
> * If we get an error writing out the alternate superblocks,
> * just issue a warning and continue. The real work is
> * already done and committed.
>
> This can cause a problem later during repair, because repair
> looks at all superblocks, and picks the most prevalent one
> as correct. If we bail out early in the backup superblock
> loop, we can end up with more "bad" matching superblocks than
> good, and a post-growfs repair may revert the filesystem to
> the old geometry.
>
> With the combination of superblock verifiers and old bugs,
> we're more likely to encounter read errors due to verification.
>
> And perhaps even worse, we don't even properly write any of the
> newly-added superblocks in the new AGs.
>
> Even with this change, growfs will still say:
>
> xfs_growfs: XFS_IOC_FSGROWFSDATA xfsctl failed: Structure needs cleaning
> data blocks changed from 319815680 to 335216640
>
> which might be confusing to the user, but it at least communicates
> that something has gone wrong, and dmesg will probably highlight
> the need for an xfs_repair.
>
> And this is still best-effort; if verifiers fail on more than
> half the backup supers, they may still "win" - but that's probably
> best left to repair to more gracefully handle by doing its own
> strict verification as part of the backup super "voting."
>
> Signed-off-by: Eric Sandeen <sandeen@redhat.com>
> ---
>
> diff --git a/fs/xfs/xfs_fsops.c b/fs/xfs/xfs_fsops.c
> index 614eb0c..70714bb 100644
> --- a/fs/xfs/xfs_fsops.c
> +++ b/fs/xfs/xfs_fsops.c
> @@ -153,7 +153,7 @@ xfs_growfs_data_private(
> xfs_buf_t *bp;
> int bucket;
> int dpct;
> - int error;
> + int error, saved_error = 0;
> xfs_agnumber_t nagcount;
> xfs_agnumber_t nagimax = 0;
> xfs_rfsblock_t nb, nb_mod;
> @@ -495,29 +495,33 @@ xfs_growfs_data_private(
> error = ENOMEM;
> }
>
> + /*
> + * If we get an error reading or writing alternate superblocks,
> + * continue. xfs_repair chooses the "best" superblock based
> + * on most matches; if we break early, we'll leave more
> + * superblocks un-updated than updated, and xfs_repair may
> + * pick them over the properly-updated primary.
> + */
> if (error) {
> xfs_warn(mp,
> "error %d reading secondary superblock for ag %d",
> error, agno);
> - break;
> + saved_error = error;
> + continue;
> }
> xfs_sb_to_disk(XFS_BUF_TO_SBP(bp), &mp->m_sb, XFS_SB_ALL_BITS);
>
> - /*
> - * If we get an error writing out the alternate superblocks,
> - * just issue a warning and continue. The real work is
> - * already done and committed.
> - */
> error = xfs_bwrite(bp);
> xfs_buf_relse(bp);
> if (error) {
> xfs_warn(mp,
> "write error %d updating secondary superblock for ag %d",
> error, agno);
> - break; /* no point in continuing */
> + saved_error = error;
> + continue;
> }
> }
> - return error;
> + return saved_error ? saved_error : error;
>
> error0:
> xfs_trans_cancel(tp, XFS_TRANS_ABORT);
>
>
> _______________________________________________
> xfs mailing list
> xfs@oss.sgi.com
> http://oss.sgi.com/mailman/listinfo/xfs
>
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
next prev parent reply other threads:[~2013-09-09 20:36 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-08-15 18:15 [PATCH, RFC] xfs: don't break from growfs ag update loop on error Eric Sandeen
2013-09-09 20:36 ` Eric Sandeen [this message]
2013-09-09 22:08 ` Dave Chinner
2013-09-10 15:21 ` Mark Tinguely
2013-09-10 15:23 ` Eric Sandeen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=522E3142.7090501@sandeen.net \
--to=sandeen@sandeen.net \
--cc=linux-xfs@oss.sgi.com \
--cc=sandeen@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox