Re: [PATCH] xfs: scrub extended attribute leaf space

linux-xfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

From: "Darrick J. Wong" <darrick.wong@oracle.com>
To: Dave Chinner <david@fromorbit.com>
Cc: xfs <linux-xfs@vger.kernel.org>
Subject: Re: [PATCH] xfs: scrub extended attribute leaf space
Date: Tue, 31 Oct 2017 17:54:29 -0700	[thread overview]
Message-ID: <20171101005429.GG4911@magnolia> (raw)
In-Reply-To: <20171031235325.GJ5858@dastard>

On Wed, Nov 01, 2017 at 10:53:25AM +1100, Dave Chinner wrote:
> On Tue, Oct 31, 2017 at 03:55:32PM -0700, Darrick J. Wong wrote:
> > From: Darrick J. Wong <darrick.wong@oracle.com>
> > 
> > As we walk the attribute btree, explicitly check the structure of the
> > attribute leaves to make sure the pointers make sense and the freemap is
> > sensible.
> > 
> > Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
> > ---
> >  fs/xfs/scrub/attr.c    |  233 ++++++++++++++++++++++++++++++++++++++++++++----
> >  fs/xfs/scrub/dabtree.c |    4 +
> >  fs/xfs/scrub/dabtree.h |    3 -
> >  fs/xfs/scrub/dir.c     |    2 
> >  4 files changed, 218 insertions(+), 24 deletions(-)
> > 
> > diff --git a/fs/xfs/scrub/attr.c b/fs/xfs/scrub/attr.c
> > index a70cd9b..5d624ca 100644
> > --- a/fs/xfs/scrub/attr.c
> > +++ b/fs/xfs/scrub/attr.c
> > @@ -50,8 +50,17 @@ xfs_scrub_setup_xattr(
> >  	struct xfs_scrub_context	*sc,
> >  	struct xfs_inode		*ip)
> >  {
> > -	/* Allocate the buffer without the inode lock held. */
> > -	sc->buf = kmem_zalloc_large(XATTR_SIZE_MAX, KM_SLEEP);
> > +	size_t				sz;
> > +
> > +	/*
> > +	 * Allocate the buffer without the inode lock held.  We need enough
> > +	 * space to read every xattr value in the file and enough space to
> 						       ^^^
> nit:							or

Fixed.

> > +	 * hold three copies of the xattr free space bitmap.  (Not both at
> > +	 * the same time.)
> > +	 */
> > +	sz = max_t(size_t, XATTR_SIZE_MAX, 3 * sizeof(long) *
> > +			BITS_TO_LONGS(sc->mp->m_attr_geo->blksize));
> > +	sc->buf = kmem_zalloc_large(sz, KM_SLEEP);
> >  	if (!sc->buf)
> >  		return -ENOMEM;
> >  
> > @@ -122,6 +131,197 @@ xfs_scrub_xattr_listent(
> >  	return;
> >  }
> >  
> > +/*
> > + * Mark a range [start, start+len) in this map.  Returns true if the
> > + * region was free, and false if there's a conflict or a problem.
> > + *
> > + * Within a char, the lowest bit of the char represents the byte with
> > + * the smallest address
> > + */
> > +STATIC bool
> > +xfs_scrub_xattr_set_map(
> > +	struct xfs_scrub_context	*sc,
> > +	unsigned long			*map,
> > +	unsigned int			start,
> > +	unsigned int			len)
> > +{
> > +	unsigned int			mapsize = sc->mp->m_attr_geo->blksize;
> > +	bool				ret = true;
> > +
> > +	if (start >= mapsize)
> > +		return false;
> > +	if (start + len > mapsize) {
> > +		len = mapsize - start;
> > +		ret = false;
> > +	}
> > +
> > +	ret &= find_next_bit(map, mapsize, start) >= (start + len);
> 
> That's a bit convoluted. It's a conditional boolean bit masking
> operation. AFAICT it clears ret if the next bit is < (start
> + len), but if ret is already false it can never be set. I think.
> 
> Much simpler to write:
> 
> 	if (find_next_bit(map, mapsize, start) < start + len)
> 		ret = false;

Fixed.

> > +/* Scrub an attribute leaf. */
> > +STATIC int
> > +xfs_scrub_xattr_block(
> > +	struct xfs_scrub_da_btree	*ds,
> > +	int				level)
> > +{
> > +	struct xfs_attr3_icleaf_hdr	leafhdr;
> > +	struct xfs_mount		*mp = ds->state->mp;
> > +	struct xfs_da_state_blk		*blk = &ds->state->path.blk[level];
> > +	struct xfs_buf			*bp = blk->bp;
> > +	xfs_dablk_t			*last_checked = ds->private;
> > +	struct xfs_attr_leafblock	*leaf = bp->b_addr;
> > +	struct xfs_attr_leaf_entry	*ent;
> > +	struct xfs_attr_leaf_entry	*entries;
> > +	struct xfs_attr_leaf_name_local	*lentry;
> > +	struct xfs_attr_leaf_name_remote	*rentry;
> > +	unsigned long			*usedmap = ds->sc->buf;
> > +	char				*name_end;
> > +	char				*buf_end;
> > +	__u32				last_hashval = 0;
> > +	unsigned int			usedbytes = 0;
> > +	unsigned int			nameidx;
> > +	unsigned int			hdrsize;
> > +	unsigned int			namesize;
> > +	int				i;
> > +
> > +	if (*last_checked == blk->blkno)
> > +		return 0;
> > +	*last_checked = blk->blkno;
> > +	bitmap_zero(usedmap, mp->m_attr_geo->blksize);
> > +
> > +	/* Check all the padding. */
> > +	if (xfs_sb_version_hascrc(&ds->sc->mp->m_sb)) {
> > +		struct xfs_attr3_leafblock	*leaf = bp->b_addr;
> > +
> > +		if (leaf->hdr.pad1 != 0 ||
> > +		    leaf->hdr.pad2 != cpu_to_be32(0) ||
> 
> cpu_to_be32(0) = 0.
> 
> So there's no need for endian conversion of 0.

Fixed.

> > +		    leaf->hdr.info.hdr.pad != cpu_to_be16(0))
> > +			xfs_scrub_da_set_corrupt(ds, level);
> > +	} else {
> > +		if (leaf->hdr.pad1 != 0 ||
> > +		    leaf->hdr.info.pad != cpu_to_be16(0))
> > +			xfs_scrub_da_set_corrupt(ds, level);
> > +	}
> > +
> > +	/* Check the leaf header */
> > +	xfs_attr3_leaf_hdr_from_disk(mp->m_attr_geo, &leafhdr, leaf);
> > +
> > +	if (leafhdr.usedbytes > mp->m_attr_geo->blksize)
> > +		xfs_scrub_da_set_corrupt(ds, level);
> > +	if (leafhdr.firstused > mp->m_attr_geo->blksize)
> > +		xfs_scrub_da_set_corrupt(ds, level);
> > +	hdrsize = xfs_attr3_leaf_hdr_size(leaf);
> > +	if (leafhdr.firstused < hdrsize)
> > +		xfs_scrub_da_set_corrupt(ds, level);
> > +
> > +	if (!xfs_scrub_xattr_set_map(ds->sc, usedmap, 0, hdrsize)) {
> > +		xfs_scrub_da_set_corrupt(ds, level);
> > +		goto out;
> > +	}
> > +
> > +	entries = xfs_attr3_leaf_entryp(leaf);
> > +	if ((char *)&entries[leafhdr.count] > (char *)leaf + leafhdr.firstused)
> > +		xfs_scrub_da_set_corrupt(ds, level);
> 
> All the casts are a bit ugly, but I guess we've got to check
> offsets somehow. 
> 
> > +	/* Check the entries' relations to each other */
> > +	buf_end = (char *)bp->b_addr + mp->m_attr_geo->blksize;
> > +	for (i = 0, ent = entries; i < leafhdr.count; ent++, i++) {
> > +		if (!xfs_scrub_xattr_set_map(ds->sc, usedmap,
> > +				(char *)ent - (char *)leaf,
> > +				sizeof(xfs_attr_leaf_entry_t))) {
> > +			xfs_scrub_da_set_corrupt(ds, level);
> > +			goto out;
> > +		}
> 
> The amount of indentation and broken lines inside this loop makes it
> reall hard to read. Can you factor it at all? ...

Done.

> > +
> > +		if (ent->pad2 != cpu_to_be32(0))
> > +			xfs_scrub_da_set_corrupt(ds, level);
> 
> compare against 0

Done.

> > +
> > +		if (be32_to_cpu(ent->hashval) < last_hashval)
> > +			xfs_scrub_da_set_corrupt(ds, level);
> > +		last_hashval = be32_to_cpu(ent->hashval);
> > +
> > +		nameidx = be16_to_cpu(ent->nameidx);
> > +		if (nameidx < hdrsize ||
> > +		    nameidx < leafhdr.firstused ||
> > +		    nameidx >= mp->m_attr_geo->blksize) {
> > +			xfs_scrub_da_set_corrupt(ds, level);
> > +			goto out;
> > +		}
> > +
> > +		if (ent->flags & XFS_ATTR_LOCAL) {
> > +			lentry = xfs_attr3_leaf_name_local(leaf, i);
> > +			if (lentry->namelen <= 0)
> > +				xfs_scrub_da_set_corrupt(ds, level);
> > +			name_end = (char *)lentry->nameval + lentry->namelen +
> > +					be16_to_cpu(lentry->valuelen);
> 
> Isn't there padding that has to be taken into account here?

Yeah, I think there is.  All this crap here can be simplified to:

	lentry = xfs_attr3_leaf_name_local(leaf, idx);
	namesize = xfs_attr_leaf_entsize_local(lentry->namelen,
			be16_to_cpu(lentry->valuelen));
	name_end = (char *)lentry + namesize;
	if (lentry->namelen == 0)
		xfs_scrub_da_set_corrupt(ds, level);
	if (name_end > buf_end)
		xfs_scrub_da_set_corrupt(ds, level);

> 
> > +			if (name_end > buf_end)
> > +				xfs_scrub_da_set_corrupt(ds, level);
> > +			namesize = xfs_attr_leaf_entsize_local(
> > +					lentry->namelen,
> > +					be16_to_cpu(lentry->valuelen));
> 
> Because this pads the size of the entry...
> 
> > +		} else {
> > +			rentry = xfs_attr3_leaf_name_remote(leaf, i);
> > +			if (rentry->namelen <= 0)
> > +				xfs_scrub_da_set_corrupt(ds, level);
> > +			name_end = (char *)rentry->name + rentry->namelen;
> 
> Same here.
> 
> > +			if (name_end > buf_end)
> > +				xfs_scrub_da_set_corrupt(ds, level);
> > +			namesize = xfs_attr_leaf_entsize_remote(
> > +						rentry->namelen);
> > +			if (rentry->valueblk == cpu_to_be32(0))
> > +				xfs_scrub_da_set_corrupt(ds, level);
> > +		}
> 
> Maybe just factoring the local/remote attr name checks will clean
> this all up - this is the part that's really hard to read.

yeah.

> 
> > +		usedbytes += namesize;
> > +		if (!xfs_scrub_xattr_set_map(ds->sc, usedmap, nameidx,
> > +				namesize)) {
> > +			xfs_scrub_da_set_corrupt(ds, level);
> > +			goto out;
> > +		}
> > +	}
> > +
> > +	if (!xfs_scrub_xattr_check_freemap(ds->sc, usedmap, &leafhdr))
> > +		xfs_scrub_da_set_corrupt(ds, level);
> > +
> > +	if (leafhdr.usedbytes != usedbytes)
> > +		xfs_scrub_da_set_corrupt(ds, level);
> 
> And so this validates all the padded entries....

<nod>

--D

> Cheers,
> 
> Dave.
> -- 
> Dave Chinner
> david@fromorbit.com
> --
> To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

next prev parent reply	other threads:[~2017-11-01  0:54 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-10-31 22:55 [PATCH] xfs: scrub extended attribute leaf space Darrick J. Wong
2017-10-31 23:53 ` Dave Chinner
2017-11-01  0:54   ` Darrick J. Wong [this message]
2017-11-01  6:22 ` [PATCH v2] " Darrick J. Wong
2017-11-01 21:11   ` Dave Chinner

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20171101005429.GG4911@magnolia \
    --to=darrick.wong@oracle.com \
    --cc=david@fromorbit.com \
    --cc=linux-xfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).