From: Dave Chinner <david@fromorbit.com>
To: Eric Sandeen <sandeen@sandeen.net>
Cc: Eric Sandeen <sandeen@redhat.com>,
linux-xfs <linux-xfs@vger.kernel.org>,
"Xu, Wen" <wen.xu@gatech.edu>
Subject: Re: [PATCH 2/2 V3] xfs: verify size-vs-format for symlinks & dirs
Date: Sun, 30 Sep 2018 16:05:20 +1000 [thread overview]
Message-ID: <20180930060520.GK31060@dastard> (raw)
In-Reply-To: <71034ef7-836b-a462-a4f9-53d9004339bc@sandeen.net>
On Sun, Sep 30, 2018 at 12:06:44AM -0500, Eric Sandeen wrote:
> On 9/29/18 10:25 PM, Dave Chinner wrote:
> > On Mon, Sep 24, 2018 at 10:00:39PM -0500, Eric Sandeen wrote:
> >> Today, xfs_ifork_verify_data() will simply skip verification if the inode
> >> claims to be in non-local format. However, nothing catches the case where
> >> the size for the format is too small to be non-local. xfs_repair tests
> >> for this mismatch in process_check_inode_sizes(), so do the same in this
> >> verifier.
> >>
> >> Reported-by: Xu, Wen <wen.xu@gatech.edu>
> >> Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=200925
> >> Signed-off-by: Eric Sandeen <sandeen@redhat.com>
> >> Reviewed-by: Brian Foster <bfoster@redhat.com>
> >> ---
> >>
> >> V2: restructure code & tests per Dave's suggestion on the V1 patch.
> >> V3: rewrite dave's comments per brian's suggestions
> >>
> >> diff --git a/fs/xfs/libxfs/xfs_inode_fork.c b/fs/xfs/libxfs/xfs_inode_fork.c
> >> index f9acf1d436f6..d1a58e7a872f 100644
> >> --- a/fs/xfs/libxfs/xfs_inode_fork.c
> >> +++ b/fs/xfs/libxfs/xfs_inode_fork.c
> >> @@ -704,12 +704,33 @@ xfs_ifork_verify_data(
> >> struct xfs_inode *ip,
> >> struct xfs_ifork_ops *ops)
> >> {
> >> - /* Non-local data fork, we're done. */
> >> - if (ip->i_d.di_format != XFS_DINODE_FMT_LOCAL)
> >> + struct xfs_mount *mp = ip->i_mount;
> >> + int mode = VFS_I(ip)->i_mode;
> >> +
> >> + /*
> >> + * Verify non-local format forks have a valid size. Symlinks must have
> >> + * outgrown the data fork size. The same goes for non-local dirs, but
> >> + * dirs grow at dirblock granularity. Perform a slightly stronger check
> >> + * and require the dir is at least one dirblock in size.
> >> + */
> >> + if (ip->i_d.di_format != XFS_DINODE_FMT_LOCAL) {
> >> + switch (mode & S_IFMT) {
> >> + case S_IFDIR:
> >> + if (ip->i_d.di_size < mp->m_dir_geo->blksize)
> >> + return __this_address;
> >> + break;
> >> + case S_IFLNK:
> >> + if (ip->i_d.di_size <= XFS_IFORK_DSIZE(ip))
> >> + return __this_address;
> >
> > Just had this fire in inode writeback from generic/390. I'm going to
> > drop it for the moment, because I'm not sure what the correct fix is
> > yet. Consider this:
> >
> > create symlink XFS_LITINO bytes in length
> > fits in inode, so put inline. size <= IFORK_DSIZE
> > [....]
> > add attr to symlink
> > creates attr fork
> > inline data fork too large, size > new IFORK_DSIZE
> > xfs_symlink_local_to_remote()
> > data fork goes to extent format, size remains unchanged
> > [....]
> > remove last attrs from inode
> > remove attr fork
> > IFORK_DSIZE grows again, now size = IFORK_DSIZE again
> > data fork remains in extent format
> > [....]
> > inode writeback
> > size = IFORK_DSIZE, extent format
> > xfs_ifork_verify_data verifier fails.
> >
> >
> > With this process, I think a symlink can be out of line even if it
> > is less than the size of the data fork. I think this can happen even
> > for symlinks much smaller than XFS_LITINO, because the attribute
> > fork can grow into free space in the literal area and push local
> > data larger than XFS_BMDR_SPACE_CALC(MINDBTPTRS) bytes to extent
> > format.
> >
> > #define MINDBTPTRS 3
> >
> > #define XFS_BMDR_SPACE_CALC(nrecs) \
> > (int)(sizeof(xfs_bmdr_block_t) + \
> > ((nrecs) * (sizeof(xfs_bmbt_key_t) + sizeof(xfs_bmbt_ptr_t))))
> >
> > = 4 + 3 * (8 + 8)
> > = 52 bytes
> > = 56 bytes when rounded up to 8 byte offset
> >
> > So, yeah, I think that this check needs to be different because I
> > think we could have symlinks as short at 56 bytes in extent format,
> > even when the inode has no attribute fork...
>
> Hrmph. And yet, xfs_repair:
>
> static int
> process_symlink_extlist(xfs_mount_t *mp, xfs_ino_t lino, xfs_dinode_t *dino)
> {
> xfs_fileoff_t expected_offset;
> xfs_bmbt_rec_t *rp;
> xfs_bmbt_irec_t irec;
> int numrecs;
> int i;
> int max_blocks;
>
> if (be64_to_cpu(dino->di_size) <= XFS_DFORK_DSIZE(dino, mp)) {
> if (dino->di_format == XFS_DINODE_FMT_LOCAL)
> return 0;
> do_warn(
> _("mismatch between format (%d) and size (%" PRId64 ") in symlink ino %" PRIu64 "\n"),
> dino->di_format,
> (int64_t)be64_to_cpu(dino->di_size), lino);
> return 1;
> }
>
> seems to clearly call "non-local symlink with size < XFS_DFORK_DSIZE" corruption.
> What gives?
Seems to me like another cases that the verifiers have uncovered
another situation that even repair doesn't handle correctly. (Which
is why I like to look at these things from first principles, rather
than just from the "what does reapir do" POV?). It's like to be rare
because who removes all the attributes on a file apart from when
unlinking the inode?
xfs_attr_fork_remove() has set the di_forkoff back to zero
when the last attribute is removed from the inode for a long time
(I stopped looking when I got to ~2008...), so this isn't a new
situation. i.e. trying to define what are valid values has
demonstrated that there are more cases we need to take into account
than anyone has realised.
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com
next prev parent reply other threads:[~2018-09-30 12:37 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-09-25 2:56 [PATCH 0/2 V3] xfs: validate size vs format Eric Sandeen
2018-09-25 2:58 ` [PATCH 1/2 V3] xfs: validate inode di_forkoff Eric Sandeen
2018-09-25 3:00 ` [PATCH 2/2 V3] xfs: verify size-vs-format for symlinks & dirs Eric Sandeen
2018-09-30 3:25 ` Dave Chinner
2018-09-30 5:06 ` Eric Sandeen
2018-09-30 6:05 ` Dave Chinner [this message]
2018-09-30 17:54 ` Eric Sandeen
2018-09-26 0:13 ` [PATCH 0/2 V3] xfs: validate size vs format Darrick J. Wong
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20180930060520.GK31060@dastard \
--to=david@fromorbit.com \
--cc=linux-xfs@vger.kernel.org \
--cc=sandeen@redhat.com \
--cc=sandeen@sandeen.net \
--cc=wen.xu@gatech.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).