public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed
From: "Darrick J. Wong" <djwong@kernel.org>
To: Dave Chinner <david@fromorbit.com>
Cc: linux-xfs@vger.kernel.org, allison.henderson@oracle.com
Subject: Re: [PATCH 1/3] xfs: empty xattr leaf header blocks are not corruption
Date: Sun, 26 Jun 2022 20:59:10 -0700	[thread overview]
Message-ID: <YrkrDtr5qQlxDoE/@magnolia> (raw)
In-Reply-To: <20220627011654.GZ227878@dread.disaster.area>

On Mon, Jun 27, 2022 at 11:16:54AM +1000, Dave Chinner wrote:
> On Sun, Jun 26, 2022 at 03:03:53PM -0700, Darrick J. Wong wrote:
> > From: Darrick J. Wong <djwong@kernel.org>
> > 
> > Every now and then we get a corruption report from the kernel or
> > xfs_repair about empty leaf blocks in the extended attribute structure.
> > We've long thought that these shouldn't be possible, but prior to 5.18
> > one would shake loose in the recoveryloop fstests about once a month.
> > 
> > A new addition to the xattr leaf block verifier in 5.19-rc1 makes this
> > happen every 7 minutes on my testing cloud.
> 
> Ok, so this is all just a long way of saying:
> 
> Revert commit 51e6104fdb95 ("xfs: detect empty attr leaf blocks in
> xfs_attr3_leaf_verify") because it was wrong.
> 
> Yes?

Yep.

> > Original-bug: 517c22207b04 ("xfs: add CRCs to attr leaf blocks")
> > Still-not-fixed: 2e1d23370e75 ("xfs: ignore leaf attr ichdr.count in verifier during log replay")
> > Removed-in: f28cef9e4dac ("xfs: don't fail verifier on empty attr3 leaf block")
> > Fixes: 51e6104fdb95 ("xfs: detect empty attr leaf blocks in xfs_attr3_leaf_verify")
> > Signed-off-by: Darrick J. Wong <djwong@kernel.org>
> > ---
> >  fs/xfs/libxfs/xfs_attr_leaf.c |   48 ++++++++++++++++++++++++++++++++++++-----
> >  1 file changed, 42 insertions(+), 6 deletions(-)
> > 
> > 
> > diff --git a/fs/xfs/libxfs/xfs_attr_leaf.c b/fs/xfs/libxfs/xfs_attr_leaf.c
> > index 37e7c33f6283..be7c216ec8f2 100644
> > --- a/fs/xfs/libxfs/xfs_attr_leaf.c
> > +++ b/fs/xfs/libxfs/xfs_attr_leaf.c
> > @@ -311,13 +311,49 @@ xfs_attr3_leaf_verify(
> >  		return fa;
> >  
> >  	/*
> > -	 * Empty leaf blocks should never occur;  they imply the existence of a
> > -	 * software bug that needs fixing. xfs_repair also flags them as a
> > -	 * corruption that needs fixing, so we should never let these go to
> > -	 * disk.
> > +	 * Empty leaf blocks can occur under the following circumstances:
> > +	 *
> > +	 * 1. setxattr adds a new extended attribute to a file;
> > +	 * 2. The file has zero existing attributes;
> > +	 * 3. The attribute is too large to fit in the attribute fork;
> > +	 * 4. The attribute is small enough to fit in a leaf block;
> > +	 * 5. A log flush occurs after committing the transaction that creates
> > +	 *    the (empty) leaf block; and
> > +	 * 6. The filesystem goes down after the log flush but before the new
> > +	 *    attribute can be committed to the leaf block.
> > +	 *
> > +	 * xfs_repair used to flag these empty leaf blocks as corruption, but
> > +	 * aside from wasting space, they are benign.  The rest of the xattr
> > +	 * code will happily add attributes to empty leaf blocks.  Hence this
> > +	 * comment serves as a tombstone to incorrect verifier code.
> > +	 *
> > +	 * Unfortunately, this check has been added and removed multiple times
> > +	 * throughout history.  It first appeared in[1] kernel 3.10 as part of
> > +	 * the early V5 format patches.  The check was later discovered to
> > +	 * break log recovery and hence disabled[2] during log recovery in
> > +	 * kernel 4.10.  Simultaneously, the check was added[3] to xfs_repair
> > +	 * 4.9.0 to try to weed out the empty leaf blocks.  This was still not
> > +	 * correct because log recovery would recover an empty attr leaf block
> > +	 * successfully only for regular xattr operations to trip over the
> > +	 * empty block during of the block during regular operation.
> > +	 * Therefore, the check was removed entirely[4] in kernel 5.7 but
> > +	 * removal of the xfs_repair check was forgotten.  The continued
> > +	 * complaints from xfs_repair lead to us mistakenly re-adding[5] the
> > +	 * verifier check for kernel 5.19, and has been removed once again.
> > +	 *
> > +	 * [1] 517c22207b04 ("xfs: add CRCs to attr leaf blocks")
> > +	 * [2] 2e1d23370e75 ("xfs: ignore leaf attr ichdr.count in verifier
> > +	 *                    during log replay")
> > +	 * [3] f7140161 ("xfs_repair: junk leaf attribute if count == 0")
> > +	 * [4] f28cef9e4dac ("xfs: don't fail verifier on empty attr3 leaf
> > +	 *                    block")
> > +	 * [5] 51e6104fdb95 ("xfs: detect empty attr leaf blocks in
> > +	 *                    xfs_attr3_leaf_verify")
> > +	 *
> > +	 * Normally this would go in the commit message, but as we've a history
> > +	 * of getting this wrong, this now goes in the code base as a gigantic
> > +	 * comment.
> >  	 */
> 
> I still think it should be in the commit history, not here in the
> code. The reason I missed this is that the existing comment about
> empty leaf attrs is above a section that is verifying entries
> *after* various fields in the header had been validated. Hence I
> thought it was a case that the header field had not been validated
> and so I added it. Simple mistake.

<nod> ...and I wanted redundant breadcrumbs in the commit history and
the code itself because you and I keep making the same mistakes. :/

> This needs to be a comment at the head of the function, not
> associated with validity checking the entries. i.e.
> 
> /*
>  * Validate the attribute leaf.
>  *
>  * A leaf block can be empty as a result of transient state whilst
>  * creating a new leaf form attribute:
>  *
>  * 1. setxattr adds a new extended attribute to a file;
>  * 2. The file has zero existing attributes;
>  * 3. The attribute is too large to fit in the attribute fork;
>  * 4. The attribute is small enough to fit in a leaf block;
>  * 5. A log flush occurs after committing the transaction that creates
>  *    the (empty) leaf block; and
>  * 6. The filesystem goes down after the log flush but before the new
>  *    attribute can be committed to the leaf block.
>  *
>  * Hence we need to ensure that we don't fail the validation purely
>  * because the leaf is empty.
>  */

Ok done.

--D

> Cheers,
> 
> Dave.
> -- 
> Dave Chinner
> david@fromorbit.com

  reply	other threads:[~2022-06-27  3:59 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-06-26 22:03 [PATCHSET 0/3] xfs: random fixes for 5.19-rc5 Darrick J. Wong
2022-06-26 22:03 ` [PATCH 1/3] xfs: empty xattr leaf header blocks are not corruption Darrick J. Wong
2022-06-27  1:16   ` Dave Chinner
2022-06-27  3:59     ` Darrick J. Wong [this message]
2022-06-26 22:03 ` [PATCH 2/3] xfs: don't hold xattr leaf buffers across transaction rolls Darrick J. Wong
2022-06-27  1:23   ` Dave Chinner
2022-06-27  3:46     ` Darrick J. Wong
2022-06-27  5:10       ` Dave Chinner
2022-06-26 22:04 ` [PATCH 3/3] xfs: dont treat rt extents beyond EOF as eofblocks to be cleared Darrick J. Wong
2022-06-27  1:37   ` Dave Chinner
2022-06-27  3:57     ` Darrick J. Wong
2022-06-27  5:16       ` Dave Chinner
2022-06-27 20:59         ` Darrick J. Wong
2022-06-27 22:27           ` Dave Chinner
  -- strict thread matches above, loose matches on Subject: below --
2022-06-27 21:35 [PATCHSET v2 0/3] xfs: random fixes for 5.19-rc5 Darrick J. Wong
2022-06-27 21:35 ` [PATCH 1/3] xfs: empty xattr leaf header blocks are not corruption Darrick J. Wong
2022-06-28  0:27   ` Dave Chinner

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=YrkrDtr5qQlxDoE/@magnolia \
    --to=djwong@kernel.org \
    --cc=allison.henderson@oracle.com \
    --cc=david@fromorbit.com \
    --cc=linux-xfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox