From: Dave Chinner <david@fromorbit.com>
To: "Darrick J. Wong" <darrick.wong@oracle.com>
Cc: xfs <linux-xfs@vger.kernel.org>
Subject: Re: [PATCH] xfs: scrub extended attribute leaf space
Date: Wed, 1 Nov 2017 10:53:25 +1100 [thread overview]
Message-ID: <20171031235325.GJ5858@dastard> (raw)
In-Reply-To: <20171031225532.GD4911@magnolia>
On Tue, Oct 31, 2017 at 03:55:32PM -0700, Darrick J. Wong wrote:
> From: Darrick J. Wong <darrick.wong@oracle.com>
>
> As we walk the attribute btree, explicitly check the structure of the
> attribute leaves to make sure the pointers make sense and the freemap is
> sensible.
>
> Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
> ---
> fs/xfs/scrub/attr.c | 233 ++++++++++++++++++++++++++++++++++++++++++++----
> fs/xfs/scrub/dabtree.c | 4 +
> fs/xfs/scrub/dabtree.h | 3 -
> fs/xfs/scrub/dir.c | 2
> 4 files changed, 218 insertions(+), 24 deletions(-)
>
> diff --git a/fs/xfs/scrub/attr.c b/fs/xfs/scrub/attr.c
> index a70cd9b..5d624ca 100644
> --- a/fs/xfs/scrub/attr.c
> +++ b/fs/xfs/scrub/attr.c
> @@ -50,8 +50,17 @@ xfs_scrub_setup_xattr(
> struct xfs_scrub_context *sc,
> struct xfs_inode *ip)
> {
> - /* Allocate the buffer without the inode lock held. */
> - sc->buf = kmem_zalloc_large(XATTR_SIZE_MAX, KM_SLEEP);
> + size_t sz;
> +
> + /*
> + * Allocate the buffer without the inode lock held. We need enough
> + * space to read every xattr value in the file and enough space to
^^^
nit: or
> + * hold three copies of the xattr free space bitmap. (Not both at
> + * the same time.)
> + */
> + sz = max_t(size_t, XATTR_SIZE_MAX, 3 * sizeof(long) *
> + BITS_TO_LONGS(sc->mp->m_attr_geo->blksize));
> + sc->buf = kmem_zalloc_large(sz, KM_SLEEP);
> if (!sc->buf)
> return -ENOMEM;
>
> @@ -122,6 +131,197 @@ xfs_scrub_xattr_listent(
> return;
> }
>
> +/*
> + * Mark a range [start, start+len) in this map. Returns true if the
> + * region was free, and false if there's a conflict or a problem.
> + *
> + * Within a char, the lowest bit of the char represents the byte with
> + * the smallest address
> + */
> +STATIC bool
> +xfs_scrub_xattr_set_map(
> + struct xfs_scrub_context *sc,
> + unsigned long *map,
> + unsigned int start,
> + unsigned int len)
> +{
> + unsigned int mapsize = sc->mp->m_attr_geo->blksize;
> + bool ret = true;
> +
> + if (start >= mapsize)
> + return false;
> + if (start + len > mapsize) {
> + len = mapsize - start;
> + ret = false;
> + }
> +
> + ret &= find_next_bit(map, mapsize, start) >= (start + len);
That's a bit convoluted. It's a conditional boolean bit masking
operation. AFAICT it clears ret if the next bit is < (start
+ len), but if ret is already false it can never be set. I think.
Much simpler to write:
if (find_next_bit(map, mapsize, start) < start + len)
ret = false;
> +/* Scrub an attribute leaf. */
> +STATIC int
> +xfs_scrub_xattr_block(
> + struct xfs_scrub_da_btree *ds,
> + int level)
> +{
> + struct xfs_attr3_icleaf_hdr leafhdr;
> + struct xfs_mount *mp = ds->state->mp;
> + struct xfs_da_state_blk *blk = &ds->state->path.blk[level];
> + struct xfs_buf *bp = blk->bp;
> + xfs_dablk_t *last_checked = ds->private;
> + struct xfs_attr_leafblock *leaf = bp->b_addr;
> + struct xfs_attr_leaf_entry *ent;
> + struct xfs_attr_leaf_entry *entries;
> + struct xfs_attr_leaf_name_local *lentry;
> + struct xfs_attr_leaf_name_remote *rentry;
> + unsigned long *usedmap = ds->sc->buf;
> + char *name_end;
> + char *buf_end;
> + __u32 last_hashval = 0;
> + unsigned int usedbytes = 0;
> + unsigned int nameidx;
> + unsigned int hdrsize;
> + unsigned int namesize;
> + int i;
> +
> + if (*last_checked == blk->blkno)
> + return 0;
> + *last_checked = blk->blkno;
> + bitmap_zero(usedmap, mp->m_attr_geo->blksize);
> +
> + /* Check all the padding. */
> + if (xfs_sb_version_hascrc(&ds->sc->mp->m_sb)) {
> + struct xfs_attr3_leafblock *leaf = bp->b_addr;
> +
> + if (leaf->hdr.pad1 != 0 ||
> + leaf->hdr.pad2 != cpu_to_be32(0) ||
cpu_to_be32(0) = 0.
So there's no need for endian conversion of 0.
> + leaf->hdr.info.hdr.pad != cpu_to_be16(0))
> + xfs_scrub_da_set_corrupt(ds, level);
> + } else {
> + if (leaf->hdr.pad1 != 0 ||
> + leaf->hdr.info.pad != cpu_to_be16(0))
> + xfs_scrub_da_set_corrupt(ds, level);
> + }
> +
> + /* Check the leaf header */
> + xfs_attr3_leaf_hdr_from_disk(mp->m_attr_geo, &leafhdr, leaf);
> +
> + if (leafhdr.usedbytes > mp->m_attr_geo->blksize)
> + xfs_scrub_da_set_corrupt(ds, level);
> + if (leafhdr.firstused > mp->m_attr_geo->blksize)
> + xfs_scrub_da_set_corrupt(ds, level);
> + hdrsize = xfs_attr3_leaf_hdr_size(leaf);
> + if (leafhdr.firstused < hdrsize)
> + xfs_scrub_da_set_corrupt(ds, level);
> +
> + if (!xfs_scrub_xattr_set_map(ds->sc, usedmap, 0, hdrsize)) {
> + xfs_scrub_da_set_corrupt(ds, level);
> + goto out;
> + }
> +
> + entries = xfs_attr3_leaf_entryp(leaf);
> + if ((char *)&entries[leafhdr.count] > (char *)leaf + leafhdr.firstused)
> + xfs_scrub_da_set_corrupt(ds, level);
All the casts are a bit ugly, but I guess we've got to check
offsets somehow.
> + /* Check the entries' relations to each other */
> + buf_end = (char *)bp->b_addr + mp->m_attr_geo->blksize;
> + for (i = 0, ent = entries; i < leafhdr.count; ent++, i++) {
> + if (!xfs_scrub_xattr_set_map(ds->sc, usedmap,
> + (char *)ent - (char *)leaf,
> + sizeof(xfs_attr_leaf_entry_t))) {
> + xfs_scrub_da_set_corrupt(ds, level);
> + goto out;
> + }
The amount of indentation and broken lines inside this loop makes it
reall hard to read. Can you factor it at all? ...
> +
> + if (ent->pad2 != cpu_to_be32(0))
> + xfs_scrub_da_set_corrupt(ds, level);
compare against 0
> +
> + if (be32_to_cpu(ent->hashval) < last_hashval)
> + xfs_scrub_da_set_corrupt(ds, level);
> + last_hashval = be32_to_cpu(ent->hashval);
> +
> + nameidx = be16_to_cpu(ent->nameidx);
> + if (nameidx < hdrsize ||
> + nameidx < leafhdr.firstused ||
> + nameidx >= mp->m_attr_geo->blksize) {
> + xfs_scrub_da_set_corrupt(ds, level);
> + goto out;
> + }
> +
> + if (ent->flags & XFS_ATTR_LOCAL) {
> + lentry = xfs_attr3_leaf_name_local(leaf, i);
> + if (lentry->namelen <= 0)
> + xfs_scrub_da_set_corrupt(ds, level);
> + name_end = (char *)lentry->nameval + lentry->namelen +
> + be16_to_cpu(lentry->valuelen);
Isn't there padding that has to be taken into account here?
> + if (name_end > buf_end)
> + xfs_scrub_da_set_corrupt(ds, level);
> + namesize = xfs_attr_leaf_entsize_local(
> + lentry->namelen,
> + be16_to_cpu(lentry->valuelen));
Because this pads the size of the entry...
> + } else {
> + rentry = xfs_attr3_leaf_name_remote(leaf, i);
> + if (rentry->namelen <= 0)
> + xfs_scrub_da_set_corrupt(ds, level);
> + name_end = (char *)rentry->name + rentry->namelen;
Same here.
> + if (name_end > buf_end)
> + xfs_scrub_da_set_corrupt(ds, level);
> + namesize = xfs_attr_leaf_entsize_remote(
> + rentry->namelen);
> + if (rentry->valueblk == cpu_to_be32(0))
> + xfs_scrub_da_set_corrupt(ds, level);
> + }
Maybe just factoring the local/remote attr name checks will clean
this all up - this is the part that's really hard to read.
> + usedbytes += namesize;
> + if (!xfs_scrub_xattr_set_map(ds->sc, usedmap, nameidx,
> + namesize)) {
> + xfs_scrub_da_set_corrupt(ds, level);
> + goto out;
> + }
> + }
> +
> + if (!xfs_scrub_xattr_check_freemap(ds->sc, usedmap, &leafhdr))
> + xfs_scrub_da_set_corrupt(ds, level);
> +
> + if (leafhdr.usedbytes != usedbytes)
> + xfs_scrub_da_set_corrupt(ds, level);
And so this validates all the padded entries....
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com
next prev parent reply other threads:[~2017-10-31 23:53 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-10-31 22:55 [PATCH] xfs: scrub extended attribute leaf space Darrick J. Wong
2017-10-31 23:53 ` Dave Chinner [this message]
2017-11-01 0:54 ` Darrick J. Wong
2017-11-01 6:22 ` [PATCH v2] " Darrick J. Wong
2017-11-01 21:11 ` Dave Chinner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20171031235325.GJ5858@dastard \
--to=david@fromorbit.com \
--cc=darrick.wong@oracle.com \
--cc=linux-xfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.