From: "Libor Klepáč" <libor.klepac@bcom.cz>
To: Alex Lyakas <alex@zadarastorage.com>
Cc: Dave Chinner <david@fromorbit.com>,
linux-xfs@vger.kernel.org,
Shyam Kaushik <shyam@zadarastorage.com>,
bfoster@redhat.com, dchinner@redhat.com
Subject: Re: Metadata corruption at xfs_attr3_leaf_write_verify()
Date: Mon, 07 Aug 2017 15:55:30 +0200 [thread overview]
Message-ID: <4655939.aPg9oOrUOo@libor-nb> (raw)
In-Reply-To: <7AF4FEF16E034B868577B3ED535D5C41@alyakaslap>
Hello,
can this be related to our problems on 4.9.x kernel, we have started to see
after starting to use ACL?
I have several crashes in this thread, it bites us usually once per month:
https://www.spinics.net/lists/linux-xfs/msg07058.html
Metadata buffer dump seems to be the same
Thanks,
Libor
On středa 2. srpna 2017 11:38:36 CEST Alex Lyakas wrote:
> Hello Dave,
>
> Thank you for your analysis. It sounds like this issue exists in recent
> kernels as well.
>
> We are reviewing some of the paths that operate xfs_buf's, but still we
> don't have enough understanding on how to properly lock out the xfs_buf from
> AIL grabbing it. Can you please point us at similar flows, where such
> locking is done?
>
> Or otherwise, should you propose a patch to fix this, we can test it. If
> possible, making the patch applicable to kernel 3.18.19 would be
> appreciated. I realize that this is an EOL kernel, but still it used to be a
> long-term kernel.
>
> Thanks,
> Alex.
>
>
>
> -----Original Message-----
> From: Dave Chinner
> Sent: Wednesday, August 02, 2017 2:18 AM
> To: Alex Lyakas
> Cc: linux-xfs@vger.kernel.org ; Shyam Kaushik ; bfoster@redhat.com ;
> dchinner@redhat.com
> Subject: Re: Metadata corruption at xfs_attr3_leaf_write_verify()
>
> On Tue, Aug 01, 2017 at 08:30:31PM +0300, Alex Lyakas wrote:
> > Greetings XFS developers, David, Brian,
> >
> > We did additional debugging on this issue. The problematic flow
> > happens to be the following:
> >
> > - New inode (regular file) is being created.
> > - As part of creation, due to parent directory having a default ACL,
> > initial ACL is applied to the inode.
> > - This ACL is applied as an extended attribute with name
> > "SGI_ACL_FILE" and value length of 100 bytes.
> > - XFS tries to add this attribute into the inline inode attribute
> > fork area (AKA shortform).
> > - But 100 bytes is too large for the shortform, so XFS creates an
> > empty shortform and then calls xfs_attr_shortform_to_leaf()
> > - This calls xfs_attr3_leaf_create() and creates a leaf with zero
> > attributes.
> > - Before XFS is able to add the attribute to the leaf, the xfsaild
> > thread wants to write this leaf to disk, and trips over the assert
> > in xfs_attr3_leaf_verify, that ichdr.count should not be 0
>
> Ok, this makes it pretty obvious as to what's going on here. The new
> attribute leaf buffer is not held locked across the transaction roll
> between the shortform->leaf modification and the addition of the new
> entry. As a result the attribute buffer modification being made is
> not atomic from an operational perspective. Hence the AIL push can
> grab it in the transient state of "just created" after the initial
> transaction is rolled because the buffer has been released.
>
> Cheers,
>
> Dave.
>
--------
[1] mailto:libor.klepac@bcom.cz
[2] tel:+420377457676
[3] http://www.bcom.cz
next prev parent reply other threads:[~2017-08-07 14:04 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <d161b07cad6536b0baf285328cf99500@mail.gmail.com>
2017-08-01 17:30 ` Metadata corruption at xfs_attr3_leaf_write_verify() Alex Lyakas
2017-08-01 18:22 ` Eric Sandeen
2017-08-01 18:53 ` AW: " Markus Stockhausen
2017-08-01 18:57 ` Alex Lyakas
2017-08-01 19:02 ` Eric Sandeen
2017-08-01 23:18 ` Dave Chinner
2017-08-02 8:38 ` Alex Lyakas
2017-08-02 11:50 ` Dave Chinner
2017-08-07 14:31 ` Alex Lyakas
2017-08-07 13:55 ` Libor Klepáč [this message]
2017-08-07 14:32 ` Alex Lyakas
[not found] <CAPh1sj5oU6QRyH_cnzrkGJb6ed3XO4fGABJ4yJLPnb-ppqVJeg@mail.gmail.com>
2017-07-26 5:22 ` Shyam Kaushik
2017-07-26 12:15 ` Brian Foster
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4655939.aPg9oOrUOo@libor-nb \
--to=libor.klepac@bcom.cz \
--cc=alex@zadarastorage.com \
--cc=bfoster@redhat.com \
--cc=david@fromorbit.com \
--cc=dchinner@redhat.com \
--cc=linux-xfs@vger.kernel.org \
--cc=shyam@zadarastorage.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox