public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed
From: "Darrick J. Wong" <djwong@kernel.org>
To: Dave Chinner <david@fromorbit.com>
Cc: linux-xfs@vger.kernel.org, allison.henderson@oracle.com
Subject: Re: [PATCH 3/3] xfs: dont treat rt extents beyond EOF as eofblocks to be cleared
Date: Mon, 27 Jun 2022 13:59:32 -0700	[thread overview]
Message-ID: <YroaNHm9nQz39dyR@magnolia> (raw)
In-Reply-To: <20220627051649.GD227878@dread.disaster.area>

On Mon, Jun 27, 2022 at 03:16:49PM +1000, Dave Chinner wrote:
> On Sun, Jun 26, 2022 at 08:57:25PM -0700, Darrick J. Wong wrote:
> > On Mon, Jun 27, 2022 at 11:37:31AM +1000, Dave Chinner wrote:
> > > On Sun, Jun 26, 2022 at 03:04:04PM -0700, Darrick J. Wong wrote:
> > > > From: Darrick J. Wong <djwong@kernel.org>
> > > > 
> > > > On a system with a realtime volume and a 28k realtime extent,
> > > > generic/491 fails because the test opens a file on a frozen filesystem
> > > > and closing it causes xfs_release -> xfs_can_free_eofblocks to
> > > > mistakenly think that the the blocks of the realtime extent beyond EOF
> > > > are posteof blocks to be freed.  Realtime extents cannot be partially
> > > > unmapped, so this is pointless.  Worse yet, this triggers posteof
> > > > cleanup, which stalls on a transaction allocation, which is why the test
> > > > fails.
> > > > 
> > > > Teach the predicate to account for realtime extents properly.
> > > > 
> > > > Signed-off-by: Darrick J. Wong <djwong@kernel.org>
> > > > ---
> > > >  fs/xfs/xfs_bmap_util.c |    2 ++
> > > >  1 file changed, 2 insertions(+)
> > > > 
> > > > 
> > > > diff --git a/fs/xfs/xfs_bmap_util.c b/fs/xfs/xfs_bmap_util.c
> > > > index 52be58372c63..85e1a26c92e8 100644
> > > > --- a/fs/xfs/xfs_bmap_util.c
> > > > +++ b/fs/xfs/xfs_bmap_util.c
> > > > @@ -686,6 +686,8 @@ xfs_can_free_eofblocks(
> > > >  	 * forever.
> > > >  	 */
> > > >  	end_fsb = XFS_B_TO_FSB(mp, (xfs_ufsize_t)XFS_ISIZE(ip));
> > > > +	if (XFS_IS_REALTIME_INODE(ip) && mp->m_sb.sb_rextsize > 1)
> > > > +		end_fsb = roundup_64(end_fsb, mp->m_sb.sb_rextsize);
> > > >  	last_fsb = XFS_B_TO_FSB(mp, mp->m_super->s_maxbytes);
> > > >  	if (last_fsb <= end_fsb)
> > > >  		return false;
> > > 
> > > Ok, that works.
> 
> Reviewed-by: Dave Chinner <dchinner@redhat.com>
> 
> > > 
> > > However, I was looking at xfs_can_free_eofblocks() w.r.t. freeze a
> > > couple of days ago and wondering why there isn't a freeze/RO state
> > > check in xfs_can_free_eofblocks(). Shouldn't we have one here so
> > > that we never try to run xfs_free_eofblocks() on RO/frozen
> > > filesystems regardless of unexpected state/alignment issues?
> > 
> > I asked myself that question too.  I found three callers of this
> > predicate:
> > 
> > 1. fallocate, which should have obtained freeze protection
> 
> *nod*
> 
> > 2. inodegc, which should never be running when we get to the innermost
> > freeze protection level
> 
> So inodegc could still do IO here on a read-only fs?

Correct.

> > 3. xfs_release, which doesn't take freeze protection at all.  Either it
> > needs to take freeze protection so that xfs_free_eofblocks can't get
> > stuck in xfs_trans_alloc, or we'd need to modify xfs_trans_alloc to
> > sb_start_intwrite_trylock
> 
> That looks to me like it is simply a case of replacing the
> !xfs_is_readonly() check in xfs_release() with a
> !xfs_fs_writeable(mp, SB_FREEZE_WRITE) check and we shouldn't have
> to touch anythign else, right?

I think there would still be a race if we did that -- I don't see
anything in __fput that prohibits another thread from initiating a
freeze after the release process calls _can_free_eofblocks but before
the actual call to _free_eofblocks.

Hm.  How often would we have a readonly fd pointing to a file that has
posteof blocks?  I suppose this could happen if the system was extending
a file, crashed, and then someone remounted, opened a ro fd, and then
closed and froze the fs at the same time...?

--D

> Cheers,
> 
> Dave.
> -- 
> Dave Chinner
> david@fromorbit.com

  reply	other threads:[~2022-06-27 20:59 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-06-26 22:03 [PATCHSET 0/3] xfs: random fixes for 5.19-rc5 Darrick J. Wong
2022-06-26 22:03 ` [PATCH 1/3] xfs: empty xattr leaf header blocks are not corruption Darrick J. Wong
2022-06-27  1:16   ` Dave Chinner
2022-06-27  3:59     ` Darrick J. Wong
2022-06-26 22:03 ` [PATCH 2/3] xfs: don't hold xattr leaf buffers across transaction rolls Darrick J. Wong
2022-06-27  1:23   ` Dave Chinner
2022-06-27  3:46     ` Darrick J. Wong
2022-06-27  5:10       ` Dave Chinner
2022-06-26 22:04 ` [PATCH 3/3] xfs: dont treat rt extents beyond EOF as eofblocks to be cleared Darrick J. Wong
2022-06-27  1:37   ` Dave Chinner
2022-06-27  3:57     ` Darrick J. Wong
2022-06-27  5:16       ` Dave Chinner
2022-06-27 20:59         ` Darrick J. Wong [this message]
2022-06-27 22:27           ` Dave Chinner
  -- strict thread matches above, loose matches on Subject: below --
2022-06-27 21:35 [PATCHSET v2 0/3] xfs: random fixes for 5.19-rc5 Darrick J. Wong
2022-06-27 21:35 ` [PATCH 3/3] xfs: dont treat rt extents beyond EOF as eofblocks to be cleared Darrick J. Wong
2022-06-29  7:26   ` Christoph Hellwig

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=YroaNHm9nQz39dyR@magnolia \
    --to=djwong@kernel.org \
    --cc=allison.henderson@oracle.com \
    --cc=david@fromorbit.com \
    --cc=linux-xfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox