* [PATCH v2] dm-raid: Fix WARN_ON_ONCE check for sync_thread in raid_resume
@ 2024-06-12 18:12 Benjamin Marzinski
From: Benjamin Marzinski @ 2024-06-12 18:12 UTC
To: Mike Snitzer, Mikulas Patocka, Yu Kuai, Song Liu
Cc: Heinz Mauelshagen, Xiao Ni, dm-devel, linux-raid, Yu Kuai
dm-raid devices will occasionally trigger the following warning when
being resumed after a table load because MD_RECOVERY_RUNNING is set:
WARNING: CPU: 7 PID: 5660 at drivers/md/dm-raid.c:4105 raid_resume+0xee/0x100 [dm_raid]
The failing check is:
WARN_ON_ONCE(test_bit(MD_RECOVERY_RUNNING, &mddev->recovery));
This check is designed to make sure that the sync thread isn't
registered, but md_check_recovery can set MD_RECOVERY_RUNNING without
the sync_thread ever getting registered. Instead of checking if
MD_RECOVERY_RUNNING is set, check if sync_thread is non-NULL.
Fixes: 16c4770c75b1 ("dm-raid: really frozen sync_thread during suspend")
Suggested-by: Yu Kuai <yukuai1@huaweicloud.com>
Signed-off-by: Benjamin Marzinski <bmarzins@redhat.com>
---
Changes in v2:
- Move mddev_lock_nointr() earlier to protect dereference and use
rcu_dereference_protected() to access sync_thread
drivers/md/dm-raid.c | 5 +++--
1 file changed, 3 insertions(+), 2 deletions(-)
diff --git a/drivers/md/dm-raid.c b/drivers/md/dm-raid.c
index abe88d1e6735..b149ac46a990 100644
--- a/drivers/md/dm-raid.c
+++ b/drivers/md/dm-raid.c
@@ -4101,10 +4101,11 @@ static void raid_resume(struct dm_target *ti)
 		if (mddev->delta_disks < 0)
 			rs_set_capacity(rs);
 
+		mddev_lock_nointr(mddev);
 		WARN_ON_ONCE(!test_bit(MD_RECOVERY_FROZEN, &mddev->recovery));
-		WARN_ON_ONCE(test_bit(MD_RECOVERY_RUNNING, &mddev->recovery));
+		WARN_ON_ONCE(rcu_dereference_protected(mddev->sync_thread,
+						       lockdep_is_held(&mddev->reconfig_mutex)));
 		clear_bit(RT_FLAG_RS_FROZEN, &rs->runtime_flags);
-		mddev_lock_nointr(mddev);
 		mddev->ro = 0;
 		mddev->in_sync = 0;
 		md_unfrozen_sync_thread(mddev);
--
2.43.0
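
The distinction the commit message draws (a "running" flag can be set while no sync thread is registered) can be illustrated with a small userspace model. This is only a sketch under stated assumptions: struct model_mddev, the MODEL_* flags, and every helper below are hypothetical stand-ins invented for illustration, not kernel code, and the boolean reconfig_locked merely imitates what lockdep_is_held(&mddev->reconfig_mutex) asserts in the patch.

```c
/*
 * Userspace model only. Names echo the patch, but the struct, flags and
 * helpers below are hypothetical stand-ins, not kernel code.
 */
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

enum {
	MODEL_RECOVERY_RUNNING = 1 << 0, /* stands in for MD_RECOVERY_RUNNING */
	MODEL_RECOVERY_FROZEN  = 1 << 1, /* stands in for MD_RECOVERY_FROZEN */
};

struct model_thread {
	int id;
};

struct model_mddev {
	unsigned int recovery;            /* stands in for mddev->recovery */
	struct model_thread *sync_thread; /* stands in for mddev->sync_thread */
	bool reconfig_locked;             /* stands in for holding reconfig_mutex */
};

/*
 * Assumption: models a path that marks recovery as running without
 * registering a sync thread, as the commit message describes for
 * md_check_recovery.
 */
void model_mark_running(struct model_mddev *m)
{
	m->recovery |= MODEL_RECOVERY_RUNNING;
}

/* Old check: would warn whenever the running flag is set. */
bool old_check_warns(const struct model_mddev *m)
{
	return (m->recovery & MODEL_RECOVERY_RUNNING) != 0;
}

/*
 * New check: would warn only if a sync thread is actually registered.
 * The pointer is read with the (modeled) lock held, which is why the
 * patch moves the lock acquisition ahead of the check.
 */
bool new_check_warns(const struct model_mddev *m)
{
	assert(m->reconfig_locked);
	return m->sync_thread != NULL;
}
```

In this model, a device whose running flag was set but whose sync_thread is still NULL trips the old check and passes the new one, which is the false positive the patch is meant to remove.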
* Re: [PATCH v2] dm-raid: Fix WARN_ON_ONCE check for sync_thread in raid_resume
From: Yu Kuai @ 2024-06-13 3:50 UTC
To: Benjamin Marzinski, Mike Snitzer, Mikulas Patocka, Song Liu
Cc: Heinz Mauelshagen, Xiao Ni, dm-devel, linux-raid, Yu Kuai,
yukuai (C)
Hi,
On 2024/06/13 2:12, Benjamin Marzinski wrote:
> rm-raid devices will occasionally trigger the following warning when
dm-raid
> being resumed after a table load because DM_RECOVERY_RUNNING is set:
>
> WARNING: CPU: 7 PID: 5660 at drivers/md/dm-raid.c:4105 raid_resume+0xee/0x100 [dm_raid]
>
> The failing check is:
> WARN_ON_ONCE(test_bit(MD_RECOVERY_RUNNING, &mddev->recovery));
>
> This check is designed to make sure that the sync thread isn't
> registered, but md_check_recovery can set MD_RECOVERY_RUNNING without
> the sync_thread ever getting registered. Instead of checking if
> MD_RECOVERY_RUNNING is set, check if sync_thread is non-NULL.
>
> Fixes: 16c4770c75b1 ("dm-raid: really frozen sync_thread during suspend")
> Suggested-by: Yu Kuai <yukuai1@huaweicloud.com>
Please use the address yukuai3@huawei.com
> Signed-off-by: Benjamin Marzinski <bmarzins@redhat.com>
> ---
> Changes in v2:
> - Move mddev_lock_nointr() earlier to protect dereference and use
> rcu_dereference_protected() to access sync_thread
>
> drivers/md/dm-raid.c | 5 +++--
> 1 file changed, 3 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/md/dm-raid.c b/drivers/md/dm-raid.c
> index abe88d1e6735..b149ac46a990 100644
> --- a/drivers/md/dm-raid.c
> +++ b/drivers/md/dm-raid.c
> @@ -4101,10 +4101,11 @@ static void raid_resume(struct dm_target *ti)
>  		if (mddev->delta_disks < 0)
>  			rs_set_capacity(rs);
>
> +		mddev_lock_nointr(mddev);
>  		WARN_ON_ONCE(!test_bit(MD_RECOVERY_FROZEN, &mddev->recovery));
> -		WARN_ON_ONCE(test_bit(MD_RECOVERY_RUNNING, &mddev->recovery));
> +		WARN_ON_ONCE(rcu_dereference_protected(mddev->sync_thread,
> +						       lockdep_is_held(&mddev->reconfig_mutex)));
>  		clear_bit(RT_FLAG_RS_FROZEN, &rs->runtime_flags);
> -		mddev_lock_nointr(mddev);
Other than the typo, LGTM
Suggested-and-reviewed-by: Yu Kuai <yukuai3@huawei.com>
>  		mddev->ro = 0;
>  		mddev->in_sync = 0;
>  		md_unfrozen_sync_thread(mddev);
>