From: Filipe Manana <fdmanana@kernel.org>
To: Sidong Yang <realwakka@gmail.com>
Cc: linux-btrfs@vger.kernel.org, fstests@vger.kernel.org,
Shinichiro Kawasaki <shinichiro.kawasaki@wdc.com>,
"dsterba@suse.cz" <dsterba@suse.cz>
Subject: Re: [PATCH] btrfs: add test for enable/disable quota and create/destroy qgroup repeatedly
Date: Wed, 2 Mar 2022 15:12:15 +0000 [thread overview]
Message-ID: <Yh+JT+jGeDJEL5Gd@debian9.Home> (raw)
In-Reply-To: <20220302134316.GA2682@realwakka>
On Wed, Mar 02, 2022 at 01:43:16PM +0000, Sidong Yang wrote:
> On Wed, Mar 02, 2022 at 12:10:08PM +0000, Filipe Manana wrote:
>
> Hi, Filipe!
> Thanks for review.
> > On Tue, Mar 01, 2022 at 03:19:30PM +0000, Sidong Yang wrote:
> > > Test enabling/disable quota and creating/destroying qgroup repeatedly
> > > in asynchronous and confirm it does not cause kernel hang. This is a
> >
> > in asynchronous -> in parallel
>
> Sure, Thanks!
> >
> > > regression test for the problem reported to linux-btrfs list [1].
> >
> > It's worth mentioning the deadlock only happens starting with kernel 5.17-rc3.
>
> It only happens in 5.17-rc3 version? I didn't know about it. I'll add
> mention about this.
Well, in the kernel patch we have:
Fixes: e804861bd4e6 ("btrfs: fix deadlock between quota disable and qgroup rescan worker")
And that commit was introduced in 5.17-rc3. Maybe it deadlocked in a different
way before that commit, perhaps in the way that e804861bd4e6 describes. However
I haven't checked how it behaves on a kernel without that commit. But at least we
know that currently it deadlocks at 5.17-rc3+.
> >
> > >
> > > The hang was recreated using the test case and fixed by kernel patch
> > > titled
> > >
> > > btrfs: qgroup: fix deadlock between rescan worker and remove qgroup
> > >
> > > [1] https://lore.kernel.org/linux-btrfs/20220228014340.21309-1-realwakka@gmail.com/
> > >
> > > Signed-off-by: Sidong Yang <realwakka@gmail.com>
> >
> > In addition to Shinichiro's comments...
> >
> > > ---
> > > tests/btrfs/262 | 54 +++++++++++++++++++++++++++++++++++++++++++++
> > > tests/btrfs/262.out | 2 ++
> > > 2 files changed, 56 insertions(+)
> > > create mode 100755 tests/btrfs/262
> > > create mode 100644 tests/btrfs/262.out
> > >
> > > diff --git a/tests/btrfs/262 b/tests/btrfs/262
> > > new file mode 100755
> > > index 00000000..9be380f9
> > > --- /dev/null
> > > +++ b/tests/btrfs/262
> > > @@ -0,0 +1,54 @@
> > > +#! /bin/bash
> > > +# SPDX-License-Identifier: GPL-2.0
> > > +# Copyright (c) 2022 YOUR NAME HERE. All Rights Reserved.
> > > +#
> > > +# FS QA Test 262
> > > +#
> > > +# Test the deadlock between qgroup and quota commands
> >
> > The test description should be a lot more clear.
> >
> > "the deadlock" is vague a gives the wrong idea we only ever had a single
> > deadlock related to qgroups. "qgroup and quota commands" is confusing,
> > and "qgroup" and "quota" are pretty much synonyms, and it should mention
> > which commands.
> >
> > Plus what we want to test is that we can run some qgroup operations in
> > parallel without triggering a deadlock, crash, etc.
> >
> > Perhaps something like:
> >
> > """
> > Test that running qgroup enable, create, destroy and disable commands in
> > parallel does not result in a deadlock, a crash or any filesystem
> > inconsistency.
> > """
> >
> Yeah, It was not clear. I found that this test is not only for checking
> deadlock. But it checks that test runs without any problem.
>
> >
> > > +#
> > > +. ./common/preamble
> > > +_begin_fstest auto qgroup
> >
> > Can also be added to the "quick" group. It takes 1 second in my slowest vm.
>
> Okay, Thanks!
> >
> > > +
> > > +# Import common functions.
> > > +. ./common/filter
> > > +
> > > +# real QA test starts here
> > > +
> > > +# Modify as appropriate.
> > > +_supported_fs btrfs
> > > +
> > > +_require_scratch
> > > +
> > > +# Run command that enable/disable quota and create/destroy qgroup asynchronously
> >
> > With the more clear test description above, this can go away.
>
> Sure!
> >
> > > +qgroup_deadlock_test()
> > > +{
> > > + _scratch_mkfs > /dev/null 2>&1
> > > + _scratch_mount
> > > + echo "=== qgroup deadlock test ===" >> $seqres.full
> >
> > There's no point in echoing this message to the .full file, it provides no
> > value at all, as testing that is all that this testcase does.
>
> I agree. This is pointless because it's simple test.
> >
> > > +
> > > + pids=()
> > > + for ((i = 0; i < 200; i++)); do
> > > + $BTRFS_UTIL_PROG quota enable $SCRATCH_MNT 2>> $seqres.full &
> > > + pids+=($!)
> > > + $BTRFS_UTIL_PROG qgroup create 1/0 $SCRATCH_MNT 2>> $seqres.full &
> > > + pids+=($!)
> > > + $BTRFS_UTIL_PROG qgroup destroy 1/0 $SCRATCH_MNT 2>> $seqres.full &
> > > + pids+=($!)
> > > + $BTRFS_UTIL_PROG quota disable $SCRATCH_MNT 2>> $seqres.full &
> > > + pids+=($!)
> > > + done
> > > +
> > > + for pid in "${pids[@]}"; do
> > > + wait $pid
> > > + done
> >
> > As pointed before by Shinichiro, a simple 'wait' here is enough, no need to
> > keep track of the PIDs.
>
> Yeah, I don't have to go hard way.
> >
> > > +
> > > + _scratch_unmount
> > > + _check_scratch_fs
> >
> > Not needed, the fstests framework automatically runs 'btrfs check' when a test
> > finishes. Doing this explicitly is only necessary when we need to do several
> > mount/unmount operations and want to check the fs is fine after each unmount
> > and before the next mount.
> >
>
> I didn't know about that. I don't need to check manually.
> > > +}
> > > +
> > > +qgroup_deadlock_test
> >
> > There's no point in putting all the test code in a function, as the function
> > is only called once.
>
> Of course!
> >
> > Otherwise it looks good, and the test works as advertised, it triggers a
> > deadlock on 5.17-rc3+ kernel and passes on a patched kernel.
> >
> > Thanks for converting the reproducer into a test case.
> >
>
> Thanks for detailed review. I'll back soon with v2.
>
> Thanks,
> Sidong
> > > +
> > > +# success, all done
> > > +echo "Silence is golden"
> > > +status=0
> > > +exit
> > > diff --git a/tests/btrfs/262.out b/tests/btrfs/262.out
> > > new file mode 100644
> > > index 00000000..404badc3
> > > --- /dev/null
> > > +++ b/tests/btrfs/262.out
> > > @@ -0,0 +1,2 @@
> > > +QA output created by 262
> > > +Silence is golden
> > > --
> > > 2.25.1
> > >
prev parent reply other threads:[~2022-03-02 15:12 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-03-01 15:19 [PATCH] btrfs: add test for enable/disable quota and create/destroy qgroup repeatedly Sidong Yang
2022-03-02 4:43 ` Shinichiro Kawasaki
2022-03-02 6:26 ` Sidong Yang
2022-03-02 8:24 ` Shinichiro Kawasaki
2022-03-02 13:47 ` Sidong Yang
2022-03-02 12:10 ` Filipe Manana
2022-03-02 13:43 ` Sidong Yang
2022-03-02 15:12 ` Filipe Manana [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Yh+JT+jGeDJEL5Gd@debian9.Home \
--to=fdmanana@kernel.org \
--cc=dsterba@suse.cz \
--cc=fstests@vger.kernel.org \
--cc=linux-btrfs@vger.kernel.org \
--cc=realwakka@gmail.com \
--cc=shinichiro.kawasaki@wdc.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox