From: "Darrick J. Wong" <djwong@kernel.org>
To: Dave Chinner <david@fromorbit.com>
Cc: linux-xfs@vger.kernel.org
Subject: Re: [PATCH 3/4] xfs: set the buffer type after holding the AG[IF] across trans_roll
Date: Sun, 23 Oct 2022 21:16:01 -0700 [thread overview]
Message-ID: <Y1YRgQGaxOFg7Vk3@magnolia> (raw)
In-Reply-To: <20221014012819.GI3600936@dread.disaster.area>
On Fri, Oct 14, 2022 at 12:28:19PM +1100, Dave Chinner wrote:
> On Thu, Oct 13, 2022 at 04:19:15PM -0700, Darrick J. Wong wrote:
> > On Fri, Oct 14, 2022 at 09:25:53AM +1100, Dave Chinner wrote:
> > > On Sun, Oct 02, 2022 at 11:19:48AM -0700, Darrick J. Wong wrote:
> > > > From: Darrick J. Wong <djwong@kernel.org>
> > > >
> > > > Currently, the only way to lock an allocation group is to hold the AGI
> > > > and AGF buffers. If repair needs to roll the transaction while
> > > > repairing some AG metadata, it maintains that lock by holding the two
> > > > buffers across the transaction roll and joins them afterwards.
> > > >
> > > > However, repair is not the same as the other parts of XFS that employ
> > > > this bhold/bjoin sequence, because it's possible that the AGI or AGF
> > > > buffers are not actually dirty before the roll. In this case, the
> > > > buffer log item can detach from the buffer, which means that we have to
> > >
> > > Doesn't this imply we have a reference counting problem with
> > > XFS_BLI_HOLD buffers? i.e. the bli can only get detached when the
> > > reference count on it goes to zero. If the buffer is clean and
> > > joined to a transaction, then that means the only reference to the
> > > BLI is the current transaction. Hence the only way it can get
> > > detached is for the transaction commit to release the current
> > > transaction's reference to the BLI.
> > >
> > > Ah, XFS_BLI_HOLD does not take a reference to the BLI - it just
> > > prevents ->iop_release from releasing the -buffer- after it drops
> > > the transaction reference to the BLI. That's the problem right there
> > > - xfs_buf_item_release() drops the current trans ref to the clean
> > > item via xfs_buf_item_release() regardless of whether BLI_HOLD is
> > > set or not, hence freeing the BLI on clean buffers.
> > >
> > > IOWs, it looks to me like XFS_BLI_HOLD should actually hold a
> > > reference to the BLI as well as the buffer so that we don't free the
> > > BLI for a held clean buffer in xfs_buf_item_release(). The reference
> > > we leave behind will then be picked up by the subsequent call to
> > > xfs_trans_bjoin() which finds the clean BLI already attached to the
> > > buffer...
> >
> > <nod> I think you're saying that _xfs_trans_bjoin should:
> >
> > if (!(bip->bli_flags & XFS_BLI_HOLD))
> > atomic_inc(&bip->bli_refcount);
> >
> > and xfs_buf_item_release should do:
> >
> > if (hold)
> > return;
> > released = xfs_buf_item_put(bip);
> > if (stale && !released)
> > return;
>
> Not exactly. What I was thinking was something like this:
>
> xfs_trans_bhold() should do:
>
> bip->bli_flags |= XFS_BLI_HOLD;
> atomic_inc(&bip->bli_refcount);
>
> xfs_trans_bhold_release() should do:
>
> bip->bli_flags &= ~XFS_BLI_HOLD;
> atomic_dec(&bip->bli_refcount);
>
> xfs_trans_brelse() shoudl do:
>
> if (bip->bli_flags & XFS_BLI_HOLD) {
> bip->bli_flags &= ~XFS_BLI_HOLD;
> atomic_dec(&bip->bli_refcount);
> }
>
> and xfs_buf_item_release() should do:
>
> if (hold) {
> /* Release the hold ref but not the rolling transaction ref */
> bip->bli_flags &= ~XFS_BLI_HOLD;
> atomic_dec(&bip->bli_refcount);
> return;
> }
> released = xfs_buf_item_put(bip);
> if (stale && !released)
> return;
>
> Then _xfs_trans_bjoin() remains unchanged, as we don't remove the
> BLI from the held clean buffer as there is still a reference to it.
> The new transaction we rejoin the buffer to will take that
> reference.
Hnmmmm. I tried this, but it lead to massive memory corruption
problems, slab leak warnings, and a hang. If I get around to it I'll
try to see if KASAN will help me figure out what went wrong.
It occurred to me that it might be easier to log the first byte of the
AGI and AGF buffers before the transaction roll, which prevents the bli
from falling off the buffer during the transaction rolling.
Yes that isn't fixing the problem that the unusual use of bhold on a
clean buffer doesn't work right, but <shrug> I don't want to increase
the risks of this patchset by reworking buffer/bli lifetimes when I can
do something simpler. :)
--D
> Cheers,
>
> Dave.
> --
> Dave Chinner
> david@fromorbit.com
next prev parent reply other threads:[~2022-10-24 4:16 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-10-02 18:19 [PATCHSET v23.1 0/4] xfs: fix handling of AG[IF] header buffers during scrub Darrick J. Wong
2022-10-02 18:19 ` [PATCH 1/4] xfs: fully initialize xfs_da_args in xchk_directory_blocks Darrick J. Wong
2022-10-13 22:33 ` Dave Chinner
2022-10-02 18:19 ` [PATCH 3/4] xfs: set the buffer type after holding the AG[IF] across trans_roll Darrick J. Wong
2022-10-13 22:25 ` Dave Chinner
2022-10-13 23:19 ` Darrick J. Wong
2022-10-14 1:28 ` Dave Chinner
2022-10-24 4:16 ` Darrick J. Wong [this message]
2022-10-31 18:08 ` [PATCH v23.2 3/4] xfs: log the AGI/AGF buffers when rolling transactions during an AG repair Darrick J. Wong
2022-10-31 21:17 ` Dave Chinner
2022-10-02 18:19 ` [PATCH 2/4] xfs: don't track the AGFL buffer in the scrub AG context Darrick J. Wong
2022-10-13 22:32 ` Dave Chinner
2022-10-02 18:19 ` [PATCH 4/4] xfs: make AGFL repair function avoid crosslinked blocks Darrick J. Wong
2022-10-13 22:35 ` Dave Chinner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Y1YRgQGaxOFg7Vk3@magnolia \
--to=djwong@kernel.org \
--cc=david@fromorbit.com \
--cc=linux-xfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox