From: Dave Chinner <david@fromorbit.com>
To: Brian Foster <bfoster@redhat.com>
Cc: xfs@oss.sgi.com
Subject: Re: [PATCH 14/13] xfs: swap leaf buffer into path struct atomically during path shift
Date: Tue, 18 Aug 2015 07:34:13 +1000 [thread overview]
Message-ID: <20150817213413.GC714@dastard> (raw)
In-Reply-To: <1439830072-61117-1-git-send-email-bfoster@redhat.com>
On Mon, Aug 17, 2015 at 12:47:52PM -0400, Brian Foster wrote:
> The node directory lookup code uses a state structure that tracks the
> path of buffers used to search for the hash of a filename through the
> leaf blocks. When the lookup encounters a block that ends with the
> requested hash, but the entry has not yet been found, it must shift over
> to the next block and continue looking for the entry (i.e., duplicate
> hashes could continue over into the next block). This shift mechanism
> involves walking back up and down the state structure, replacing buffers
> at the appropriate btree levels as necessary.
>
> When a buffer is replaced, the old buffer is released and the new buffer
> read into the active slot in the path structure. Because the buffer is
> read directly into the path slot, a buffer read failure can result in
> setting a NULL buffer pointer in an active slot. This throws off the
> state cleanup code in xfs_dir2_node_lookup(), which expects to release a
> buffer from each active slot. Instead, a BUG occurs due to a NULL
> pointer dereference:
>
> BUG: unable to handle kernel NULL pointer dereference at 00000000000001e8
> IP: [<ffffffffa0585063>] xfs_trans_brelse+0x2a3/0x3c0 [xfs]
> ...
> RIP: 0010:[<ffffffffa0585063>] [<ffffffffa0585063>] xfs_trans_brelse+0x2a3/0x3c0 [xfs]
> ...
> Call Trace:
> [<ffffffffa05250c6>] xfs_dir2_node_lookup+0xa6/0x2c0 [xfs]
> [<ffffffffa0519f7c>] xfs_dir_lookup+0x1ac/0x1c0 [xfs]
> [<ffffffffa055d0e1>] xfs_lookup+0x91/0x290 [xfs]
> [<ffffffffa05580b3>] xfs_vn_lookup+0x73/0xb0 [xfs]
> [<ffffffff8122de8d>] lookup_real+0x1d/0x50
> [<ffffffff8123330e>] path_openat+0x91e/0x1490
> [<ffffffff81235079>] do_filp_open+0x89/0x100
> ...
>
> This has been reproduced via a parallel fsstress and filesystem shutdown
> workload in a loop. The shutdown triggers the read error in the
> aforementioned codepath and causes the BUG in xfs_dir2_node_lookup().
>
> Update xfs_da3_path_shift() to update the active path slot atomically
> with respect to the caller when a buffer is replaced. This ensures that
> the caller always sees the old or new buffer in the slot and prevents
> the NULL pointer dereference.
>
> Signed-off-by: Brian Foster <bfoster@redhat.com>
> ---
>
> This is just another shutdown/error handling issue I've run into with
> the same testing associated with all of the other fixes. I'm tacking it
> on to the end of this series...
>
> Brian
>
> fs/xfs/libxfs/xfs_da_btree.c | 25 ++++++++++++++++---------
> 1 file changed, 16 insertions(+), 9 deletions(-)
>
> diff --git a/fs/xfs/libxfs/xfs_da_btree.c b/fs/xfs/libxfs/xfs_da_btree.c
> index 3264d81..04a3765 100644
> --- a/fs/xfs/libxfs/xfs_da_btree.c
> +++ b/fs/xfs/libxfs/xfs_da_btree.c
> @@ -1822,6 +1822,7 @@ xfs_da3_path_shift(
> struct xfs_da_args *args;
> struct xfs_da_node_entry *btree;
> struct xfs_da3_icnode_hdr nodehdr;
> + struct xfs_buf *bp;
> xfs_dablk_t blkno = 0;
> int level;
> int error;
> @@ -1865,21 +1866,27 @@ xfs_da3_path_shift(
> * same depth we were at originally.
> */
> for (blk++, level++; level < path->active; blk++, level++) {
> + struct xfs_buf **bpp = &blk->bp;
> +
What do we need this for? The new code is:
> /*
> + * Read the next child block into a local buffer.
> */
> + error = xfs_da3_node_read(args->trans, dp, blkno, -1, &bp,
> + args->whichfork);
> + if (error)
> + return error;
>
> /*
> + * Release the old block (if it's dirty, the trans doesn't
> + * actually let go) and swap the local buffer into the path
> + * structure. This ensures failure of the above read doesn't set
> + * a NULL buffer in an active slot in the path.
> */
> + if (release)
> + xfs_trans_brelse(args->trans, blk->bp);
> blk->blkno = blkno;
> + *bpp = bp;
And this can simply be:
blk->bp = bp;
so I don't think *bpp is necessary at all.
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
next prev parent reply other threads:[~2015-08-17 21:34 UTC|newest]
Thread overview: 25+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-08-10 19:01 [PATCH v3 00/13] xfs: miscellaneous logging, recovery, umount fixes Brian Foster
2015-08-10 19:01 ` [PATCH v3 01/13] xfs: disentagle EFI release from the extent count Brian Foster
2015-08-10 19:01 ` [PATCH v3 02/13] xfs: return committed status from xfs_trans_roll() Brian Foster
2015-08-10 19:01 ` [PATCH v3 03/13] xfs: fix efi/efd error handling to avoid fs shutdown hangs Brian Foster
2015-08-10 19:01 ` [PATCH v3 04/13] xfs: ensure EFD trans aborts on log recovery extent free failure Brian Foster
2015-08-18 23:23 ` Dave Chinner
2015-08-19 11:02 ` Brian Foster
2015-08-10 19:01 ` [PATCH v3 05/13] xfs: use EFI refcount consistently in log recovery Brian Foster
2015-08-10 19:01 ` [PATCH v3 06/13] xfs: don't leave EFIs on AIL on mount failure Brian Foster
2015-08-10 19:01 ` [PATCH v3 07/13] xfs: icreate log item recovery and cancellation tracepoints Brian Foster
2015-08-10 19:01 ` [PATCH v3 08/13] xfs: fix broken icreate log item cancellation Brian Foster
2015-08-10 19:01 ` [PATCH v3 09/13] xfs: checksum log record ext headers based on record size Brian Foster
2015-08-10 19:01 ` [PATCH v3 10/13] xfs: clean up root inode properly on mount failure Brian Foster
2015-08-10 19:01 ` [PATCH v3 11/13] xfs: fix btree cursor error cleanups Brian Foster
2015-08-10 19:01 ` [PATCH v3 12/13] xfs: add helper to conditionally remove items from the AIL Brian Foster
2015-08-12 7:17 ` Christoph Hellwig
2015-08-12 12:48 ` Brian Foster
2015-08-12 13:00 ` [PATCH v4 " Brian Foster
2015-08-10 19:01 ` [PATCH v3 13/13] xfs: add missing bmap cancel calls in error paths Brian Foster
2015-08-12 7:21 ` Christoph Hellwig
2015-08-12 12:48 ` Brian Foster
2015-08-17 16:47 ` [PATCH 14/13] xfs: swap leaf buffer into path struct atomically during path shift Brian Foster
2015-08-17 21:34 ` Dave Chinner [this message]
2015-08-17 22:19 ` Brian Foster
2015-08-17 22:25 ` [PATCH v2 " Brian Foster
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150817213413.GC714@dastard \
--to=david@fromorbit.com \
--cc=bfoster@redhat.com \
--cc=xfs@oss.sgi.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox