public inbox for linux-raid@vger.kernel.org
 help / color / mirror / Atom feed
From: Mikulas Patocka <mpatocka@redhat.com>
To: Yu Kuai <yukuai1@huaweicloud.com>
Cc: Song Liu <song@kernel.org>, David Jeffery <djeffery@redhat.com>,
	 Li Nan <linan122@huawei.com>,
	dm-devel@lists.linux.dev,  linux-raid@vger.kernel.org,
	Mike Snitzer <msnitzer@redhat.com>,
	 Heinz Mauelshagen <heinzm@redhat.com>,
	 Benjamin Marzinski <bmarzins@redhat.com>,
	 "yukuai (C)" <yukuai3@huawei.com>
Subject: Re: [PATCH 2/7] md: fix a race condition when stopping the sync thread
Date: Thu, 18 Jan 2024 14:28:05 +0100 (CET)	[thread overview]
Message-ID: <21bafddb-63f6-82f0-40bf-b91fcb6260fc@redhat.com> (raw)
In-Reply-To: <4af9fe2b-7f5a-59d6-0b5e-762ecae1b007@huaweicloud.com>

[-- Attachment #1: Type: text/plain, Size: 1781 bytes --]



On Thu, 18 Jan 2024, Yu Kuai wrote:

> Hi,
> 
> 在 2024/01/18 21:07, Mikulas Patocka 写道:
> > 
> > 
> > On Thu, 18 Jan 2024, Yu Kuai wrote:
> > 
> >> Hi,
> >>
> >> 在 2024/01/18 2:18, Mikulas Patocka 写道:
> >>> Note that md_wakeup_thread_directly is racy - it will do nothing if the
> >>> thread is already running or it may cause spurious wake-up if the thread
> >>> is blocked in another subsystem.
> >>
> >> No, as the comment said, md_wakeup_thread_directly() is just to prevent
> >> that md_wakeup_thread() can't wake up md_do_sync() if it's waiting for
> >> metadata update.
> > 
> > Yes - but what happens if you wake up the thread just a few instructions
> > before it is going to sleep for metadata update? wake_up_process does
> > nothing on a running process and the thread proceeds with waiting. This is
> > what I thought could happen when I was making the patch.
> 
> Please notice that in the orginal code md_wakeup_thread_directly() is
> used for sync_thread, and md_wakeup_thread() should be used for
> *mddev->thread* (mddev_unlock always do that) to clear
> MD_RECOVERY_RUNNING.
> 
> By the way, the root cause that MD_RECOVERY_RUNNING is not cleared is
> that mddev_suspend() never stop sync_thread at all, while
> md_check_recovery() won't do anything when mddev is suspended.
> 
> Before:
> 1. suspend
> 2. call md_reap_sync_thread() directly to unregister sync_thread
>     -> notice that this is not safe.
> 3. resume
> 
> Now:
> 1. suspend
> 2. call stop_sync_thread() to unregister sync_thread interrupt
> md_do_sync() and wait for md_check_recovery() to clear
> MD_RECOVERY_RUNNING.
>    -> which will never happen now;
> 3. resume
> 
> I fixed this locally and the test integrity-caching.sh passed in my VM.
> 
> Thanks,
> Kuai

OK, Thanks.

Mikulas

  reply	other threads:[~2024-01-18 13:28 UTC|newest]

Thread overview: 34+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-01-17 18:16 [PATCH 0/7] MD fixes for the LVM2 testsuite Mikulas Patocka
2024-01-17 18:17 ` [PATCH 1/7] md: Revert fa2bbff7b0b4 ("md: synchronize flush io with array reconfiguration") Mikulas Patocka
2024-01-18  1:27   ` Yu Kuai
2024-01-17 18:18 ` [PATCH 2/7] md: fix a race condition when stopping the sync thread Mikulas Patocka
2024-01-18  1:32   ` Yu Kuai
2024-01-18 13:07     ` Mikulas Patocka
2024-01-18 13:20       ` Yu Kuai
2024-01-18 13:28         ` Mikulas Patocka [this message]
2024-01-17 18:19 ` [PATCH 3/7] md: test for MD_RECOVERY_DONE in stop_sync_thread Mikulas Patocka
2024-01-18  0:19   ` Song Liu
2024-01-18 13:23     ` Mikulas Patocka
2024-01-18 21:10       ` Song Liu
2024-01-22 16:34         ` Mikulas Patocka
2024-01-23  2:31           ` Benjamin Marzinski
2024-01-26  9:17             ` Yu Kuai
2024-01-26  9:37               ` Yu Kuai
2024-01-26 10:29                 ` Zdenek Kabelac
2024-01-27  1:13                   ` Yu Kuai
2024-01-27  1:19                     ` Yu Kuai
2024-01-18  1:35   ` Yu Kuai
2024-01-17 18:20 ` [PATCH 4/7] md: call md_reap_sync_thread from __md_stop_writes Mikulas Patocka
2024-01-18  1:38   ` Yu Kuai
2024-01-17 18:21 ` [PATCH 5/7] md: fix deadlock in shell/lvconvert-raid-reshape-linear_to_raid6-single-type.sh Mikulas Patocka
2024-01-18  1:12   ` Song Liu
2024-01-18  1:51   ` Yu Kuai
2024-01-17 18:22 ` [PATCH 6/7] md: partially revert "md/raid6: use valid sector values to determine if an I/O should wait on the reshape" Mikulas Patocka
2024-01-17 23:56   ` Song Liu
2024-01-17 18:22 ` [PATCH 7/7] md: fix a suspicious RCU usage warning Mikulas Patocka
2024-01-17 23:59   ` Song Liu
2024-01-18  1:56   ` Yu Kuai
2024-01-25 17:31     ` Song Liu
2024-01-17 19:27 ` [PATCH 0/7] MD fixes for the LVM2 testsuite Song Liu
2024-01-18  2:03   ` Yu Kuai
2024-01-27  7:57 ` Yu Kuai

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=21bafddb-63f6-82f0-40bf-b91fcb6260fc@redhat.com \
    --to=mpatocka@redhat.com \
    --cc=bmarzins@redhat.com \
    --cc=djeffery@redhat.com \
    --cc=dm-devel@lists.linux.dev \
    --cc=heinzm@redhat.com \
    --cc=linan122@huawei.com \
    --cc=linux-raid@vger.kernel.org \
    --cc=msnitzer@redhat.com \
    --cc=song@kernel.org \
    --cc=yukuai1@huaweicloud.com \
    --cc=yukuai3@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox