From: "Darrick J. Wong" <djwong@kernel.org>
To: Dave Chinner <david@fromorbit.com>
Cc: linux-xfs@vger.kernel.org
Subject: Re: [PATCH 7/8 V2] xfs: AGF length has never been bounds checked
Date: Thu, 29 Jun 2023 09:35:35 -0700
Message-ID: <20230629163535.GG11441@frogsfrogsfrogs>
In-Reply-To: <ZJzn1QMNdCAXx4Il@dread.disaster.area>
On Thu, Jun 29, 2023 at 12:09:25PM +1000, Dave Chinner wrote:
> From: Dave Chinner <dchinner@redhat.com>
>
> The AGF verifier does not check that the AGF length field is within
> known good bounds. This has never been checked by runtime kernel
> code (i.e. the lack of verification goes back to 1993) yet we assume
> in many places that it is correct and verify other metadata against
> it.
>
> Add length verification to the AGF verifier. The length of the AGF
> must be equal to the size of the AG specified in the superblock,
> unless it is the last AG in the filesystem. In that case, it must be
> less than or equal to sb->sb_agblocks and greater than or equal to
> XFS_MIN_AG_BLOCKS, which is the smallest AG a growfs operation will
> allow to exist.
>
> This requires a bit of rework of the verifier function. We want to
> verify metadata before we use it to verify other metadata. Hence
> we need to verify the AGF sequence numbers before using them to
> verify the length of the AGF. Then we can verify the AGF length
> before we verify AGFL fields. Then we can verify other fields that
> are bounds limited by the AGF length.
>
> And, finally, by calculating agf_length only once into a local
> variable, we can collapse repeated "if (xfs_has_foo() &&"
> conditional checks into single checks. This makes the code much
> easier to follow as all the checks for a given feature are obviously
> in the same place.
>
> Signed-off-by: Dave Chinner <dchinner@redhat.com>
> Reviewed-by: Christoph Hellwig <hch@lst.de>
> Reviewed-by: Darrick J. Wong <djwong@kernel.org>

Still looks good to me. New question: Do we need to validate agi_length
in the AGI verifier too?

--D

> ---
>
> Version 2:
> - growfs will write the new AGFs before the superblock has been
> updated, so we have to skip the new runt AGF seqno check otherwise
> it will fail.
>
> fs/xfs/libxfs/xfs_alloc.c | 92 +++++++++++++++++++++++++++++------------------
> 1 file changed, 57 insertions(+), 35 deletions(-)
>
> diff --git a/fs/xfs/libxfs/xfs_alloc.c b/fs/xfs/libxfs/xfs_alloc.c
> index 1e72b91daff6..fe7d5ea47b90 100644
> --- a/fs/xfs/libxfs/xfs_alloc.c
> +++ b/fs/xfs/libxfs/xfs_alloc.c
> @@ -2974,6 +2974,7 @@ xfs_agf_verify(
>  {
>  	struct xfs_mount *mp = bp->b_mount;
>  	struct xfs_agf *agf = bp->b_addr;
> +	uint32_t agf_length = be32_to_cpu(agf->agf_length);
>
>  	if (xfs_has_crc(mp)) {
>  		if (!uuid_equal(&agf->agf_uuid, &mp->m_sb.sb_meta_uuid))
> @@ -2985,18 +2986,49 @@ xfs_agf_verify(
>  	if (!xfs_verify_magic(bp, agf->agf_magicnum))
>  		return __this_address;
>
> -	if (!(XFS_AGF_GOOD_VERSION(be32_to_cpu(agf->agf_versionnum)) &&
> -	      be32_to_cpu(agf->agf_freeblks) <= be32_to_cpu(agf->agf_length) &&
> -	      be32_to_cpu(agf->agf_flfirst) < xfs_agfl_size(mp) &&
> -	      be32_to_cpu(agf->agf_fllast) < xfs_agfl_size(mp) &&
> -	      be32_to_cpu(agf->agf_flcount) <= xfs_agfl_size(mp)))
> +	if (!XFS_AGF_GOOD_VERSION(be32_to_cpu(agf->agf_versionnum)))
>  		return __this_address;
>
> -	if (be32_to_cpu(agf->agf_length) > mp->m_sb.sb_dblocks)
> +	/*
> +	 * Both agf_seqno and agf_length need to be validated before anything
> +	 * else block number related in the AGF or AGFL can be checked.
> +	 *
> +	 * During growfs operations, the perag is not fully initialised,
> +	 * so we can't use it for any useful checking. growfs ensures we can't
> +	 * use it by using uncached buffers that don't have the perag attached
> +	 * so we can detect and avoid this problem.
> +	 */
> +	if (bp->b_pag && be32_to_cpu(agf->agf_seqno) != bp->b_pag->pag_agno)
> +		return __this_address;
> +
> +	/*
> +	 * Only the last AGF in the filesystem is allowed to be shorter
> +	 * than the AG size recorded in the superblock.
> +	 */
> +	if (agf_length != mp->m_sb.sb_agblocks) {
> +		/*
> +		 * During growfs, the new last AGF can get here before we
> +		 * have updated the superblock. Give it a pass on the seqno
> +		 * check.
> +		 */
> +		if (bp->b_pag &&
> +		    be32_to_cpu(agf->agf_seqno) != mp->m_sb.sb_agcount - 1)
> +			return __this_address;
> +		if (agf_length < XFS_MIN_AG_BLOCKS)
> +			return __this_address;
> +		if (agf_length > mp->m_sb.sb_agblocks)
> +			return __this_address;
> +	}
> +
> +	if (be32_to_cpu(agf->agf_flfirst) >= xfs_agfl_size(mp))
> +		return __this_address;
> +	if (be32_to_cpu(agf->agf_fllast) >= xfs_agfl_size(mp))
> +		return __this_address;
> +	if (be32_to_cpu(agf->agf_flcount) > xfs_agfl_size(mp))
>  		return __this_address;
>
>  	if (be32_to_cpu(agf->agf_freeblks) < be32_to_cpu(agf->agf_longest) ||
> -	    be32_to_cpu(agf->agf_freeblks) > be32_to_cpu(agf->agf_length))
> +	    be32_to_cpu(agf->agf_freeblks) > agf_length)
>  		return __this_address;
>
>  	if (be32_to_cpu(agf->agf_levels[XFS_BTNUM_BNO]) < 1 ||
> @@ -3007,38 +3039,28 @@ xfs_agf_verify(
>  						mp->m_alloc_maxlevels)
>  		return __this_address;
>
> -	if (xfs_has_rmapbt(mp) &&
> -	    (be32_to_cpu(agf->agf_levels[XFS_BTNUM_RMAP]) < 1 ||
> -	     be32_to_cpu(agf->agf_levels[XFS_BTNUM_RMAP]) >
> -						mp->m_rmap_maxlevels))
> -		return __this_address;
> -
> -	if (xfs_has_rmapbt(mp) &&
> -	    be32_to_cpu(agf->agf_rmap_blocks) > be32_to_cpu(agf->agf_length))
> -		return __this_address;
> -
> -	/*
> -	 * during growfs operations, the perag is not fully initialised,
> -	 * so we can't use it for any useful checking. growfs ensures we can't
> -	 * use it by using uncached buffers that don't have the perag attached
> -	 * so we can detect and avoid this problem.
> -	 */
> -	if (bp->b_pag && be32_to_cpu(agf->agf_seqno) != bp->b_pag->pag_agno)
> -		return __this_address;
> -
>  	if (xfs_has_lazysbcount(mp) &&
> -	    be32_to_cpu(agf->agf_btreeblks) > be32_to_cpu(agf->agf_length))
> +	    be32_to_cpu(agf->agf_btreeblks) > agf_length)
>  		return __this_address;
>
> -	if (xfs_has_reflink(mp) &&
> -	    be32_to_cpu(agf->agf_refcount_blocks) >
> -	    be32_to_cpu(agf->agf_length))
> -		return __this_address;
> +	if (xfs_has_rmapbt(mp)) {
> +		if (be32_to_cpu(agf->agf_rmap_blocks) > agf_length)
> +			return __this_address;
>
> -	if (xfs_has_reflink(mp) &&
> -	    (be32_to_cpu(agf->agf_refcount_level) < 1 ||
> -	     be32_to_cpu(agf->agf_refcount_level) > mp->m_refc_maxlevels))
> -		return __this_address;
> +		if (be32_to_cpu(agf->agf_levels[XFS_BTNUM_RMAP]) < 1 ||
> +		    be32_to_cpu(agf->agf_levels[XFS_BTNUM_RMAP]) >
> +						mp->m_rmap_maxlevels)
> +			return __this_address;
> +	}
> +
> +	if (xfs_has_reflink(mp)) {
> +		if (be32_to_cpu(agf->agf_refcount_blocks) > agf_length)
> +			return __this_address;
> +
> +		if (be32_to_cpu(agf->agf_refcount_level) < 1 ||
> +		    be32_to_cpu(agf->agf_refcount_level) > mp->m_refc_maxlevels)
> +			return __this_address;
> +	}
>
>  	return NULL;
>  }
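
For readers following the thread, the length rule described in the commit
message can be sketched as a standalone userspace C function. This is only an
illustration under stated assumptions, not the kernel code: struct sketch_sb,
sketch_agf_length_ok() and SKETCH_MIN_AG_BLOCKS are hypothetical names, the
value 64 is an assumed stand-in for XFS_MIN_AG_BLOCKS, and the have_perag
flag stands in for the bp->b_pag test in the patch.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumed stand-in for XFS_MIN_AG_BLOCKS; the real value lives in the kernel headers. */
#define SKETCH_MIN_AG_BLOCKS 64u

/* Hypothetical subset of the superblock geometry needed for the check. */
struct sketch_sb {
	uint32_t agblocks;	/* models sb_agblocks: nominal AG size in blocks */
	uint32_t agcount;	/* models sb_agcount: number of AGs */
};

/*
 * Return true if agf_length is plausible for the AG numbered seqno.
 * have_perag models "bp->b_pag != NULL"; when false (the growfs case in
 * the patch) the "must be the last AG" test is skipped.
 */
static bool sketch_agf_length_ok(const struct sketch_sb *sb, uint32_t seqno,
				 uint32_t agf_length, bool have_perag)
{
	/* A full-sized AG is always acceptable. */
	if (agf_length == sb->agblocks)
		return true;

	/* Only the last AG may differ from the nominal size. */
	if (have_perag && seqno != sb->agcount - 1)
		return false;

	/* A runt AG must still be at least the minimum size... */
	if (agf_length < SKETCH_MIN_AG_BLOCKS)
		return false;

	/* ...and can never exceed the nominal AG size. */
	if (agf_length > sb->agblocks)
		return false;

	return true;
}
```

With agblocks = 1000 and agcount = 4, a 900-block AGF is accepted only for
seqno 3 while have_perag is true. With have_perag false, the seqno test is
skipped and only the two bounds apply.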
Thread overview (30+ messages):
2023-06-27 22:44 [PATCH 0/8 v3] xfs: various fixes for 6.5 Dave Chinner
2023-06-27 22:44 ` [PATCH 1/8] xfs: don't reverse order of items in bulk AIL insertion Dave Chinner
2023-06-28 6:03 ` Christoph Hellwig
2023-06-28 9:55 ` Chandan Babu R
2023-06-28 17:46 ` Darrick J. Wong
2023-06-27 22:44 ` [PATCH 2/8] xfs: use deferred frees for btree block freeing Dave Chinner
2023-06-28 17:46 ` Darrick J. Wong
2023-06-28 22:55 ` Dave Chinner
2023-06-29 7:52 ` Chandan Babu R
2023-06-27 22:44 ` [PATCH 3/8] xfs: pass alloc flags through to xfs_extent_busy_flush() Dave Chinner
2023-06-29 9:44 ` Chandan Babu R
2023-06-27 22:44 ` [PATCH 4/8] xfs: allow extent free intents to be retried Dave Chinner
2023-06-28 17:48 ` Darrick J. Wong
2023-06-28 22:57 ` Dave Chinner
2023-06-29 9:50 ` Chandan Babu R
2023-06-27 22:44 ` [PATCH 5/8] xfs: don't block in busy flushing when freeing extents Dave Chinner
2023-06-27 22:44 ` [PATCH 6/8] xfs: journal geometry is not properly bounds checked Dave Chinner
2023-06-28 6:08 ` Christoph Hellwig
2023-06-28 6:38 ` Dave Chinner
2023-06-28 17:50 ` Darrick J. Wong
2023-06-27 22:44 ` [PATCH 7/8] xfs: AGF length has never been " Dave Chinner
2023-06-28 17:52 ` Darrick J. Wong
2023-06-29 2:09 ` [PATCH 7/8 V2] " Dave Chinner
2023-06-29 16:35 ` Darrick J. Wong [this message]
2023-06-29 22:33 ` Dave Chinner
2023-06-27 22:44 ` [PATCH 8/8] xfs: fix bounds check in xfs_defer_agfl_block() Dave Chinner
2023-06-28 6:09 ` Christoph Hellwig
2023-06-28 17:52 ` Darrick J. Wong
2023-06-29 19:42 ` [RFC PATCH 9/8] xfs: AGI length should be bounds checked Darrick J. Wong
2023-06-29 22:35 ` Dave Chinner