From: Mikulas Patocka <mpatocka@redhat.com>
To: Yu Kuai <yukuai1@huaweicloud.com>
Cc: Song Liu <song@kernel.org>, David Jeffery <djeffery@redhat.com>,
Li Nan <linan122@huawei.com>,
dm-devel@lists.linux.dev, linux-raid@vger.kernel.org,
Mike Snitzer <msnitzer@redhat.com>,
Heinz Mauelshagen <heinzm@redhat.com>,
Benjamin Marzinski <bmarzins@redhat.com>,
"yukuai (C)" <yukuai3@huawei.com>
Subject: Re: [PATCH 2/7] md: fix a race condition when stopping the sync thread
Date: Thu, 18 Jan 2024 14:28:05 +0100 (CET) [thread overview]
Message-ID: <21bafddb-63f6-82f0-40bf-b91fcb6260fc@redhat.com> (raw)
In-Reply-To: <4af9fe2b-7f5a-59d6-0b5e-762ecae1b007@huaweicloud.com>
[-- Attachment #1: Type: text/plain, Size: 1781 bytes --]
On Thu, 18 Jan 2024, Yu Kuai wrote:
> Hi,
>
> 在 2024/01/18 21:07, Mikulas Patocka 写道:
> >
> >
> > On Thu, 18 Jan 2024, Yu Kuai wrote:
> >
> >> Hi,
> >>
> >> 在 2024/01/18 2:18, Mikulas Patocka 写道:
> >>> Note that md_wakeup_thread_directly is racy - it will do nothing if the
> >>> thread is already running or it may cause spurious wake-up if the thread
> >>> is blocked in another subsystem.
> >>
> >> No, as the comment said, md_wakeup_thread_directly() is just to prevent
> >> that md_wakeup_thread() can't wake up md_do_sync() if it's waiting for
> >> metadata update.
> >
> > Yes - but what happens if you wake up the thread just a few instructions
> > before it is going to sleep for metadata update? wake_up_process does
> > nothing on a running process and the thread proceeds with waiting. This is
> > what I thought could happen when I was making the patch.
>
> Please notice that in the orginal code md_wakeup_thread_directly() is
> used for sync_thread, and md_wakeup_thread() should be used for
> *mddev->thread* (mddev_unlock always do that) to clear
> MD_RECOVERY_RUNNING.
>
> By the way, the root cause that MD_RECOVERY_RUNNING is not cleared is
> that mddev_suspend() never stop sync_thread at all, while
> md_check_recovery() won't do anything when mddev is suspended.
>
> Before:
> 1. suspend
> 2. call md_reap_sync_thread() directly to unregister sync_thread
> -> notice that this is not safe.
> 3. resume
>
> Now:
> 1. suspend
> 2. call stop_sync_thread() to unregister sync_thread interrupt
> md_do_sync() and wait for md_check_recovery() to clear
> MD_RECOVERY_RUNNING.
> -> which will never happen now;
> 3. resume
>
> I fixed this locally and the test integrity-caching.sh passed in my VM.
>
> Thanks,
> Kuai
OK, Thanks.
Mikulas
next prev parent reply other threads:[~2024-01-18 13:28 UTC|newest]
Thread overview: 34+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-01-17 18:16 [PATCH 0/7] MD fixes for the LVM2 testsuite Mikulas Patocka
2024-01-17 18:17 ` [PATCH 1/7] md: Revert fa2bbff7b0b4 ("md: synchronize flush io with array reconfiguration") Mikulas Patocka
2024-01-18 1:27 ` Yu Kuai
2024-01-17 18:18 ` [PATCH 2/7] md: fix a race condition when stopping the sync thread Mikulas Patocka
2024-01-18 1:32 ` Yu Kuai
2024-01-18 13:07 ` Mikulas Patocka
2024-01-18 13:20 ` Yu Kuai
2024-01-18 13:28 ` Mikulas Patocka [this message]
2024-01-17 18:19 ` [PATCH 3/7] md: test for MD_RECOVERY_DONE in stop_sync_thread Mikulas Patocka
2024-01-18 0:19 ` Song Liu
2024-01-18 13:23 ` Mikulas Patocka
2024-01-18 21:10 ` Song Liu
2024-01-22 16:34 ` Mikulas Patocka
2024-01-23 2:31 ` Benjamin Marzinski
2024-01-26 9:17 ` Yu Kuai
2024-01-26 9:37 ` Yu Kuai
2024-01-26 10:29 ` Zdenek Kabelac
2024-01-27 1:13 ` Yu Kuai
2024-01-27 1:19 ` Yu Kuai
2024-01-18 1:35 ` Yu Kuai
2024-01-17 18:20 ` [PATCH 4/7] md: call md_reap_sync_thread from __md_stop_writes Mikulas Patocka
2024-01-18 1:38 ` Yu Kuai
2024-01-17 18:21 ` [PATCH 5/7] md: fix deadlock in shell/lvconvert-raid-reshape-linear_to_raid6-single-type.sh Mikulas Patocka
2024-01-18 1:12 ` Song Liu
2024-01-18 1:51 ` Yu Kuai
2024-01-17 18:22 ` [PATCH 6/7] md: partially revert "md/raid6: use valid sector values to determine if an I/O should wait on the reshape" Mikulas Patocka
2024-01-17 23:56 ` Song Liu
2024-01-17 18:22 ` [PATCH 7/7] md: fix a suspicious RCU usage warning Mikulas Patocka
2024-01-17 23:59 ` Song Liu
2024-01-18 1:56 ` Yu Kuai
2024-01-25 17:31 ` Song Liu
2024-01-17 19:27 ` [PATCH 0/7] MD fixes for the LVM2 testsuite Song Liu
2024-01-18 2:03 ` Yu Kuai
2024-01-27 7:57 ` Yu Kuai
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=21bafddb-63f6-82f0-40bf-b91fcb6260fc@redhat.com \
--to=mpatocka@redhat.com \
--cc=bmarzins@redhat.com \
--cc=djeffery@redhat.com \
--cc=dm-devel@lists.linux.dev \
--cc=heinzm@redhat.com \
--cc=linan122@huawei.com \
--cc=linux-raid@vger.kernel.org \
--cc=msnitzer@redhat.com \
--cc=song@kernel.org \
--cc=yukuai1@huaweicloud.com \
--cc=yukuai3@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox