* [PATCH] xfs: check kthread_should_stop() after the setting of task state
@ 2017-09-07 8:04 Hou Tao
2017-09-08 7:32 ` Christoph Hellwig
2017-09-08 17:42 ` Brian Foster
0 siblings, 2 replies; 5+ messages in thread
From: Hou Tao @ 2017-09-07 8:04 UTC (permalink / raw)
To: linux-xfs; +Cc: david, bfoster
A umount hang is possible when a race occurs between the umount
process and the xfsaild kthread. The following sequences outline
the race:
xfsaild: kthread_should_stop()
=> return false, so xfsaild continue
umount: set_bit(KTHREAD_SHOULD_STOP, &kthread->flags)
=> by kthread_stop()
umount: wake_up_process()
=> because xfsaild is still running, so 0 is returned
xfsaild: __set_current_state(TASK_INTERRUPTIBLE)
xfsaild: schedule()
=> now, xfsaild will wait indefinitely
umount: wait_for_completion()
=> and umount will hang
To fix that, we need to check kthread_should_stop() after we set
the task state, so the xfsaild will either see the stop bit and
exit or the task state is reset to runnable by wake_up_process()
such that it isn't blocked indefinitely and detects the stop bit
at the next iteration.
Signed-off-by: Hou Tao <houtao1@huawei.com>
---
fs/xfs/xfs_trans_ail.c | 17 ++++++++++++++---
1 file changed, 14 insertions(+), 3 deletions(-)
diff --git a/fs/xfs/xfs_trans_ail.c b/fs/xfs/xfs_trans_ail.c
index 9056c0f..cd6e185 100644
--- a/fs/xfs/xfs_trans_ail.c
+++ b/fs/xfs/xfs_trans_ail.c
@@ -499,11 +499,22 @@ xfsaild(
current->flags |= PF_MEMALLOC;
set_freezable();
- while (!kthread_should_stop()) {
+ while (1) {
if (tout && tout <= 20)
- __set_current_state(TASK_KILLABLE);
+ set_current_state(TASK_KILLABLE);
else
- __set_current_state(TASK_INTERRUPTIBLE);
+ set_current_state(TASK_INTERRUPTIBLE);
+
+ /*
+ * Check kthread_should_stop() after we set the task state
+ * to guarantee that we either see the stop bit and exit or the
+ * task state is reset to runnable such that it's not blocked
+ * indefinitely and detects the stop bit at the next iteration.
+ */
+ if (kthread_should_stop()) {
+ __set_current_state(TASK_RUNNING);
+ break;
+ }
spin_lock(&ailp->xa_lock);
--
2.5.0
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH] xfs: check kthread_should_stop() after the setting of task state
2017-09-07 8:04 [PATCH] xfs: check kthread_should_stop() after the setting of task state Hou Tao
@ 2017-09-08 7:32 ` Christoph Hellwig
2017-09-08 17:42 ` Brian Foster
1 sibling, 0 replies; 5+ messages in thread
From: Christoph Hellwig @ 2017-09-08 7:32 UTC (permalink / raw)
To: Hou Tao; +Cc: linux-xfs, david, bfoster
Looks good,
Reviewed-by: Christoph Hellwig <hch@lst.de>
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] xfs: check kthread_should_stop() after the setting of task state
2017-09-07 8:04 [PATCH] xfs: check kthread_should_stop() after the setting of task state Hou Tao
2017-09-08 7:32 ` Christoph Hellwig
@ 2017-09-08 17:42 ` Brian Foster
2017-09-12 8:38 ` Hou Tao
1 sibling, 1 reply; 5+ messages in thread
From: Brian Foster @ 2017-09-08 17:42 UTC (permalink / raw)
To: Hou Tao; +Cc: linux-xfs, david
On Thu, Sep 07, 2017 at 04:04:57PM +0800, Hou Tao wrote:
> A umount hang is possible when a race occurs between the umount
> process and the xfsaild kthread. The following sequences outline
> the race:
>
> xfsaild: kthread_should_stop()
> => return false, so xfsaild continue
>
> umount: set_bit(KTHREAD_SHOULD_STOP, &kthread->flags)
> => by kthread_stop()
> umount: wake_up_process()
> => because xfsaild is still running, so 0 is returned
>
> xfsaild: __set_current_state(TASK_INTERRUPTIBLE)
> xfsaild: schedule()
> => now, xfsaild will wait indefinitely
>
> umount: wait_for_completion()
> => and umount will hang
>
> To fix that, we need to check kthread_should_stop() after we set
> the task state, so the xfsaild will either see the stop bit and
> exit or the task state is reset to runnable by wake_up_process()
> such that it isn't blocked indefinitely and detects the stop bit
> at the next iteration.
>
> Signed-off-by: Hou Tao <houtao1@huawei.com>
> ---
I assume you've verified this against your local reproducer? Otherwise
just a nit on the comment...
> fs/xfs/xfs_trans_ail.c | 17 ++++++++++++++---
> 1 file changed, 14 insertions(+), 3 deletions(-)
>
> diff --git a/fs/xfs/xfs_trans_ail.c b/fs/xfs/xfs_trans_ail.c
> index 9056c0f..cd6e185 100644
> --- a/fs/xfs/xfs_trans_ail.c
> +++ b/fs/xfs/xfs_trans_ail.c
> @@ -499,11 +499,22 @@ xfsaild(
> current->flags |= PF_MEMALLOC;
> set_freezable();
>
> - while (!kthread_should_stop()) {
> + while (1) {
> if (tout && tout <= 20)
> - __set_current_state(TASK_KILLABLE);
> + set_current_state(TASK_KILLABLE);
> else
> - __set_current_state(TASK_INTERRUPTIBLE);
> + set_current_state(TASK_INTERRUPTIBLE);
> +
> + /*
> + * Check kthread_should_stop() after we set the task state
> + * to guarantee that we either see the stop bit and exit or the
> + * task state is reset to runnable such that it's not blocked
> + * indefinitely and detects the stop bit at the next iteration.
> + */
I'd change the "it's not blocked indefinitely" wording to something like
"it's not scheduled out indefinitely." Also, a mention that the task
state sets above include a memory barrier to serialize against
kthread_stop() couldn't hurt. Otherwise this looks fine to me:
Reviewed-by: Brian Foster <bfoster@redhat.com>
> + if (kthread_should_stop()) {
> + __set_current_state(TASK_RUNNING);
> + break;
> + }
>
> spin_lock(&ailp->xa_lock);
>
> --
> 2.5.0
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] xfs: check kthread_should_stop() after the setting of task state
2017-09-08 17:42 ` Brian Foster
@ 2017-09-12 8:38 ` Hou Tao
2017-09-18 18:43 ` Darrick J. Wong
0 siblings, 1 reply; 5+ messages in thread
From: Hou Tao @ 2017-09-12 8:38 UTC (permalink / raw)
To: Brian Foster; +Cc: linux-xfs, david
Hi,
On 2017/9/9 1:42, Brian Foster wrote:
> On Thu, Sep 07, 2017 at 04:04:57PM +0800, Hou Tao wrote:
>> A umount hang is possible when a race occurs between the umount
>> process and the xfsaild kthread. The following sequences outline
>> the race:
>>
>> xfsaild: kthread_should_stop()
>> => return false, so xfsaild continue
>>
>> umount: set_bit(KTHREAD_SHOULD_STOP, &kthread->flags)
>> => by kthread_stop()
>> umount: wake_up_process()
>> => because xfsaild is still running, so 0 is returned
>>
>> xfsaild: __set_current_state(TASK_INTERRUPTIBLE)
>> xfsaild: schedule()
>> => now, xfsaild will wait indefinitely
>>
>> umount: wait_for_completion()
>> => and umount will hang
>>
>> To fix that, we need to check kthread_should_stop() after we set
>> the task state, so the xfsaild will either see the stop bit and
>> exit or the task state is reset to runnable by wake_up_process()
>> such that it isn't blocked indefinitely and detects the stop bit
>> at the next iteration.
>>
>> Signed-off-by: Hou Tao <houtao1@huawei.com>
>> ---
>
> I assume you've verified this against your local reproducer? Otherwise
> just a nit on the comment...
Yes, the patch fixes the test case with the delay hacks. After applying the
patch, I also have tried to move the artificial delay in xfsaild down to the
next of kthread_should_stop() and there is no hang of umount.
>> fs/xfs/xfs_trans_ail.c | 17 ++++++++++++++---
>> 1 file changed, 14 insertions(+), 3 deletions(-)
>>
>> diff --git a/fs/xfs/xfs_trans_ail.c b/fs/xfs/xfs_trans_ail.c
>> index 9056c0f..cd6e185 100644
>> --- a/fs/xfs/xfs_trans_ail.c
>> +++ b/fs/xfs/xfs_trans_ail.c
>> @@ -499,11 +499,22 @@ xfsaild(
>> current->flags |= PF_MEMALLOC;
>> set_freezable();
>>
>> - while (!kthread_should_stop()) {
>> + while (1) {
>> if (tout && tout <= 20)
>> - __set_current_state(TASK_KILLABLE);
>> + set_current_state(TASK_KILLABLE);
>> else
>> - __set_current_state(TASK_INTERRUPTIBLE);
>> + set_current_state(TASK_INTERRUPTIBLE);
>> +
>> + /*
>> + * Check kthread_should_stop() after we set the task state
>> + * to guarantee that we either see the stop bit and exit or the
>> + * task state is reset to runnable such that it's not blocked
>> + * indefinitely and detects the stop bit at the next iteration.
>> + */
>
> I'd change the "it's not blocked indefinitely" wording to something like
> "it's not scheduled out indefinitely." Also, a mention that the task
> state sets above include a memory barrier to serialize against
> kthread_stop() couldn't hurt. Otherwise this looks fine to me:
>
> Reviewed-by: Brian Foster <bfoster@redhat.com>
Thanks for your comments. I will send v2 soon.
Regards,
Tao
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] xfs: check kthread_should_stop() after the setting of task state
2017-09-12 8:38 ` Hou Tao
@ 2017-09-18 18:43 ` Darrick J. Wong
0 siblings, 0 replies; 5+ messages in thread
From: Darrick J. Wong @ 2017-09-18 18:43 UTC (permalink / raw)
To: Hou Tao; +Cc: Brian Foster, linux-xfs, david
On Tue, Sep 12, 2017 at 04:38:03PM +0800, Hou Tao wrote:
> Hi,
>
> On 2017/9/9 1:42, Brian Foster wrote:
> > On Thu, Sep 07, 2017 at 04:04:57PM +0800, Hou Tao wrote:
> >> A umount hang is possible when a race occurs between the umount
> >> process and the xfsaild kthread. The following sequences outline
> >> the race:
> >>
> >> xfsaild: kthread_should_stop()
> >> => return false, so xfsaild continue
> >>
> >> umount: set_bit(KTHREAD_SHOULD_STOP, &kthread->flags)
> >> => by kthread_stop()
> >> umount: wake_up_process()
> >> => because xfsaild is still running, so 0 is returned
> >>
> >> xfsaild: __set_current_state(TASK_INTERRUPTIBLE)
> >> xfsaild: schedule()
> >> => now, xfsaild will wait indefinitely
> >>
> >> umount: wait_for_completion()
> >> => and umount will hang
> >>
> >> To fix that, we need to check kthread_should_stop() after we set
> >> the task state, so the xfsaild will either see the stop bit and
> >> exit or the task state is reset to runnable by wake_up_process()
> >> such that it isn't blocked indefinitely and detects the stop bit
> >> at the next iteration.
> >>
> >> Signed-off-by: Hou Tao <houtao1@huawei.com>
> >> ---
> >
> > I assume you've verified this against your local reproducer? Otherwise
> > just a nit on the comment...
> Yes, the patch fixes the test case with the delay hacks. After applying the
> patch, I also have tried to move the artificial delay in xfsaild down to the
> next of kthread_should_stop() and there is no hang of umount.
Can this be turned into an xfstest, please?
--D
>
> >> fs/xfs/xfs_trans_ail.c | 17 ++++++++++++++---
> >> 1 file changed, 14 insertions(+), 3 deletions(-)
> >>
> >> diff --git a/fs/xfs/xfs_trans_ail.c b/fs/xfs/xfs_trans_ail.c
> >> index 9056c0f..cd6e185 100644
> >> --- a/fs/xfs/xfs_trans_ail.c
> >> +++ b/fs/xfs/xfs_trans_ail.c
> >> @@ -499,11 +499,22 @@ xfsaild(
> >> current->flags |= PF_MEMALLOC;
> >> set_freezable();
> >>
> >> - while (!kthread_should_stop()) {
> >> + while (1) {
> >> if (tout && tout <= 20)
> >> - __set_current_state(TASK_KILLABLE);
> >> + set_current_state(TASK_KILLABLE);
> >> else
> >> - __set_current_state(TASK_INTERRUPTIBLE);
> >> + set_current_state(TASK_INTERRUPTIBLE);
> >> +
> >> + /*
> >> + * Check kthread_should_stop() after we set the task state
> >> + * to guarantee that we either see the stop bit and exit or the
> >> + * task state is reset to runnable such that it's not blocked
> >> + * indefinitely and detects the stop bit at the next iteration.
> >> + */
> >
> > I'd change the "it's not blocked indefinitely" wording to something like
> > "it's not scheduled out indefinitely." Also, a mention that the task
> > state sets above include a memory barrier to serialize against
> > kthread_stop() couldn't hurt. Otherwise this looks fine to me:
> >
> > Reviewed-by: Brian Foster <bfoster@redhat.com>
> Thanks for your comments. I will send v2 soon.
>
> Regards,
>
> Tao
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2017-09-18 18:43 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-09-07 8:04 [PATCH] xfs: check kthread_should_stop() after the setting of task state Hou Tao
2017-09-08 7:32 ` Christoph Hellwig
2017-09-08 17:42 ` Brian Foster
2017-09-12 8:38 ` Hou Tao
2017-09-18 18:43 ` Darrick J. Wong
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).