From: Brian Foster <bfoster@redhat.com>
To: Dave Chinner <david@fromorbit.com>
Cc: linux-xfs@vger.kernel.org, jeffm@suse.com
Subject: Re: [PATCH] xfs: avoid lockdep false positives in xfs_trans_alloc
Date: Fri, 7 Sep 2018 10:07:55 -0400 [thread overview]
Message-ID: <20180907140755.GC37751@bfoster> (raw)
In-Reply-To: <20180907015155.32559-1-david@fromorbit.com>
On Fri, Sep 07, 2018 at 11:51:55AM +1000, Dave Chinner wrote:
> From: Dave Chinner <dchinner@redhat.com>
>
> We've had a few reports of lockdep tripping over memory reclaim
> context vs filesystem freeze "deadlocks". They all have looked
> to be false positives on analysis, but it seems that they are
> being tripped because we take freeze references before we run
> a GFP_KERNEL allocation for the struct xfs_trans.
>
> We can avoid this false positive vector just by re-ordering the
> operations in xfs_trans_alloc(). That is. we need allocate the
> structure before we take the freeze reference and enter the GFP_NOFS
> allocation context that follows the xfs_trans around. This prevents
> lockdep from seeing the GFP_KERNEL allocation inside the transaction
> context, and that prevents it from triggering the freeze level vs
> alloc context vs reclaim warnings.
>
> Signed-off-by: Dave Chinner <dchinner@redhat.com>
> ---
We have an old RHEL bug for this problem and I recall dwelling on this
exact change for a bit. I don't recall whether I just lost track of it,
there was some kind of problem or I just didn't think it worth the
change to quiet a false positive. I think the latter because I don't
remember actually testing it and don't otherwise have any notes to
suggest it's problematic in any way.
The only difference I can think of is that on a frozen fs, we'd allow
some number of transaction allocations to occur and either immediately
block on the freeze/write lock or drop into reclaim. Those instances
should be limited, however, because most user-driven operations should
block at the write level rather than the internal/transaction level.
I'll run some tests to see if anything explodes, but in the meantime
this seems reasonable to me:
Reviewed-by: Brian Foster <bfoster@redhat.com>
> fs/xfs/xfs_trans.c | 10 ++++++++--
> 1 file changed, 8 insertions(+), 2 deletions(-)
>
> diff --git a/fs/xfs/xfs_trans.c b/fs/xfs/xfs_trans.c
> index bedc5a5133a5..912b42f5fe4a 100644
> --- a/fs/xfs/xfs_trans.c
> +++ b/fs/xfs/xfs_trans.c
> @@ -259,6 +259,14 @@ xfs_trans_alloc(
> struct xfs_trans *tp;
> int error;
>
> + /*
> + * Allocate the handle before we do our freeze accounting and setting up
> + * GFP_NOFS allocation context so that we avoid lockdep false positives
> + * by doing GFP_KERNEL allocations inside sb_start_intwrite().
> + */
> + tp = kmem_zone_zalloc(xfs_trans_zone,
> + (flags & XFS_TRANS_NOFS) ? KM_NOFS : KM_SLEEP);
> +
> if (!(flags & XFS_TRANS_NO_WRITECOUNT))
> sb_start_intwrite(mp->m_super);
>
> @@ -270,8 +278,6 @@ xfs_trans_alloc(
> mp->m_super->s_writers.frozen == SB_FREEZE_COMPLETE);
> atomic_inc(&mp->m_active_trans);
>
> - tp = kmem_zone_zalloc(xfs_trans_zone,
> - (flags & XFS_TRANS_NOFS) ? KM_NOFS : KM_SLEEP);
> tp->t_magic = XFS_TRANS_HEADER_MAGIC;
> tp->t_flags = flags;
> tp->t_mountp = mp;
> --
> 2.17.0
>
next prev parent reply other threads:[~2018-09-07 18:49 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-09-07 1:51 [PATCH] xfs: avoid lockdep false positives in xfs_trans_alloc Dave Chinner
2018-09-07 14:07 ` Brian Foster [this message]
2018-09-10 6:47 ` Christoph Hellwig
2018-09-30 7:56 ` Amir Goldstein
2018-10-01 1:09 ` Dave Chinner
2018-10-01 7:56 ` Amir Goldstein
2018-10-01 22:32 ` Dave Chinner
2018-10-02 4:02 ` Amir Goldstein
2018-10-02 6:39 ` Dave Chinner
2018-10-02 7:33 ` Miklos Szeredi
2018-10-02 23:14 ` Dave Chinner
2018-10-03 3:45 ` Amir Goldstein
2018-10-03 22:59 ` Dave Chinner
2018-10-03 23:14 ` Miklos Szeredi
2018-10-04 5:38 ` Dave Chinner
2018-10-04 7:33 ` Miklos Szeredi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20180907140755.GC37751@bfoster \
--to=bfoster@redhat.com \
--cc=david@fromorbit.com \
--cc=jeffm@suse.com \
--cc=linux-xfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.