* [PATCH] xfs: check kthread_should_stop() after the setting of task state
@ 2017-09-07 8:04 Hou Tao
2017-09-08 7:32 ` Christoph Hellwig
2017-09-08 17:42 ` Brian Foster
0 siblings, 2 replies; 5+ messages in thread
From: Hou Tao @ 2017-09-07 8:04 UTC (permalink / raw)
To: linux-xfs; +Cc: david, bfoster
A umount hang is possible when a race occurs between the umount
process and the xfsaild kthread. The following sequences outline
the race:
xfsaild: kthread_should_stop()
=> return false, so xfsaild continue
umount: set_bit(KTHREAD_SHOULD_STOP, &kthread->flags)
=> by kthread_stop()
umount: wake_up_process()
=> because xfsaild is still running, so 0 is returned
xfsaild: __set_current_state(TASK_INTERRUPTIBLE)
xfsaild: schedule()
=> now, xfsaild will wait indefinitely
umount: wait_for_completion()
=> and umount will hang
To fix that, we need to check kthread_should_stop() after we set
the task state, so the xfsaild will either see the stop bit and
exit or the task state is reset to runnable by wake_up_process()
such that it isn't blocked indefinitely and detects the stop bit
at the next iteration.
Signed-off-by: Hou Tao <houtao1@huawei.com>
---
fs/xfs/xfs_trans_ail.c | 17 ++++++++++++++---
1 file changed, 14 insertions(+), 3 deletions(-)
diff --git a/fs/xfs/xfs_trans_ail.c b/fs/xfs/xfs_trans_ail.c
index 9056c0f..cd6e185 100644
--- a/fs/xfs/xfs_trans_ail.c
+++ b/fs/xfs/xfs_trans_ail.c
@@ -499,11 +499,22 @@ xfsaild(
current->flags |= PF_MEMALLOC;
set_freezable();
- while (!kthread_should_stop()) {
+ while (1) {
if (tout && tout <= 20)
- __set_current_state(TASK_KILLABLE);
+ set_current_state(TASK_KILLABLE);
else
- __set_current_state(TASK_INTERRUPTIBLE);
+ set_current_state(TASK_INTERRUPTIBLE);
+
+ /*
+ * Check kthread_should_stop() after we set the task state
+ * to guarantee that we either see the stop bit and exit or the
+ * task state is reset to runnable such that it's not blocked
+ * indefinitely and detects the stop bit at the next iteration.
+ */
+ if (kthread_should_stop()) {
+ __set_current_state(TASK_RUNNING);
+ break;
+ }
spin_lock(&ailp->xa_lock);
--
2.5.0
^ permalink raw reply related [flat|nested] 5+ messages in thread* Re: [PATCH] xfs: check kthread_should_stop() after the setting of task state 2017-09-07 8:04 [PATCH] xfs: check kthread_should_stop() after the setting of task state Hou Tao @ 2017-09-08 7:32 ` Christoph Hellwig 2017-09-08 17:42 ` Brian Foster 1 sibling, 0 replies; 5+ messages in thread From: Christoph Hellwig @ 2017-09-08 7:32 UTC (permalink / raw) To: Hou Tao; +Cc: linux-xfs, david, bfoster Looks good, Reviewed-by: Christoph Hellwig <hch@lst.de> ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] xfs: check kthread_should_stop() after the setting of task state 2017-09-07 8:04 [PATCH] xfs: check kthread_should_stop() after the setting of task state Hou Tao 2017-09-08 7:32 ` Christoph Hellwig @ 2017-09-08 17:42 ` Brian Foster 2017-09-12 8:38 ` Hou Tao 1 sibling, 1 reply; 5+ messages in thread From: Brian Foster @ 2017-09-08 17:42 UTC (permalink / raw) To: Hou Tao; +Cc: linux-xfs, david On Thu, Sep 07, 2017 at 04:04:57PM +0800, Hou Tao wrote: > A umount hang is possible when a race occurs between the umount > process and the xfsaild kthread. The following sequences outline > the race: > > xfsaild: kthread_should_stop() > => return false, so xfsaild continue > > umount: set_bit(KTHREAD_SHOULD_STOP, &kthread->flags) > => by kthread_stop() > umount: wake_up_process() > => because xfsaild is still running, so 0 is returned > > xfsaild: __set_current_state(TASK_INTERRUPTIBLE) > xfsaild: schedule() > => now, xfsaild will wait indefinitely > > umount: wait_for_completion() > => and umount will hang > > To fix that, we need to check kthread_should_stop() after we set > the task state, so the xfsaild will either see the stop bit and > exit or the task state is reset to runnable by wake_up_process() > such that it isn't blocked indefinitely and detects the stop bit > at the next iteration. > > Signed-off-by: Hou Tao <houtao1@huawei.com> > --- I assume you've verified this against your local reproducer? Otherwise just a nit on the comment... > fs/xfs/xfs_trans_ail.c | 17 ++++++++++++++--- > 1 file changed, 14 insertions(+), 3 deletions(-) > > diff --git a/fs/xfs/xfs_trans_ail.c b/fs/xfs/xfs_trans_ail.c > index 9056c0f..cd6e185 100644 > --- a/fs/xfs/xfs_trans_ail.c > +++ b/fs/xfs/xfs_trans_ail.c > @@ -499,11 +499,22 @@ xfsaild( > current->flags |= PF_MEMALLOC; > set_freezable(); > > - while (!kthread_should_stop()) { > + while (1) { > if (tout && tout <= 20) > - __set_current_state(TASK_KILLABLE); > + set_current_state(TASK_KILLABLE); > else > - __set_current_state(TASK_INTERRUPTIBLE); > + set_current_state(TASK_INTERRUPTIBLE); > + > + /* > + * Check kthread_should_stop() after we set the task state > + * to guarantee that we either see the stop bit and exit or the > + * task state is reset to runnable such that it's not blocked > + * indefinitely and detects the stop bit at the next iteration. > + */ I'd change the "it's not blocked indefinitely" wording to something like "it's not scheduled out indefinitely." Also, a mention that the task state sets above include a memory barrier to serialize against kthread_stop() couldn't hurt. Otherwise this looks fine to me: Reviewed-by: Brian Foster <bfoster@redhat.com> > + if (kthread_should_stop()) { > + __set_current_state(TASK_RUNNING); > + break; > + } > > spin_lock(&ailp->xa_lock); > > -- > 2.5.0 > > -- > To unsubscribe from this list: send the line "unsubscribe linux-xfs" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] xfs: check kthread_should_stop() after the setting of task state 2017-09-08 17:42 ` Brian Foster @ 2017-09-12 8:38 ` Hou Tao 2017-09-18 18:43 ` Darrick J. Wong 0 siblings, 1 reply; 5+ messages in thread From: Hou Tao @ 2017-09-12 8:38 UTC (permalink / raw) To: Brian Foster; +Cc: linux-xfs, david Hi, On 2017/9/9 1:42, Brian Foster wrote: > On Thu, Sep 07, 2017 at 04:04:57PM +0800, Hou Tao wrote: >> A umount hang is possible when a race occurs between the umount >> process and the xfsaild kthread. The following sequences outline >> the race: >> >> xfsaild: kthread_should_stop() >> => return false, so xfsaild continue >> >> umount: set_bit(KTHREAD_SHOULD_STOP, &kthread->flags) >> => by kthread_stop() >> umount: wake_up_process() >> => because xfsaild is still running, so 0 is returned >> >> xfsaild: __set_current_state(TASK_INTERRUPTIBLE) >> xfsaild: schedule() >> => now, xfsaild will wait indefinitely >> >> umount: wait_for_completion() >> => and umount will hang >> >> To fix that, we need to check kthread_should_stop() after we set >> the task state, so the xfsaild will either see the stop bit and >> exit or the task state is reset to runnable by wake_up_process() >> such that it isn't blocked indefinitely and detects the stop bit >> at the next iteration. >> >> Signed-off-by: Hou Tao <houtao1@huawei.com> >> --- > > I assume you've verified this against your local reproducer? Otherwise > just a nit on the comment... Yes, the patch fixes the test case with the delay hacks. After applying the patch, I also have tried to move the artificial delay in xfsaild down to the next of kthread_should_stop() and there is no hang of umount. >> fs/xfs/xfs_trans_ail.c | 17 ++++++++++++++--- >> 1 file changed, 14 insertions(+), 3 deletions(-) >> >> diff --git a/fs/xfs/xfs_trans_ail.c b/fs/xfs/xfs_trans_ail.c >> index 9056c0f..cd6e185 100644 >> --- a/fs/xfs/xfs_trans_ail.c >> +++ b/fs/xfs/xfs_trans_ail.c >> @@ -499,11 +499,22 @@ xfsaild( >> current->flags |= PF_MEMALLOC; >> set_freezable(); >> >> - while (!kthread_should_stop()) { >> + while (1) { >> if (tout && tout <= 20) >> - __set_current_state(TASK_KILLABLE); >> + set_current_state(TASK_KILLABLE); >> else >> - __set_current_state(TASK_INTERRUPTIBLE); >> + set_current_state(TASK_INTERRUPTIBLE); >> + >> + /* >> + * Check kthread_should_stop() after we set the task state >> + * to guarantee that we either see the stop bit and exit or the >> + * task state is reset to runnable such that it's not blocked >> + * indefinitely and detects the stop bit at the next iteration. >> + */ > > I'd change the "it's not blocked indefinitely" wording to something like > "it's not scheduled out indefinitely." Also, a mention that the task > state sets above include a memory barrier to serialize against > kthread_stop() couldn't hurt. Otherwise this looks fine to me: > > Reviewed-by: Brian Foster <bfoster@redhat.com> Thanks for your comments. I will send v2 soon. Regards, Tao ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] xfs: check kthread_should_stop() after the setting of task state 2017-09-12 8:38 ` Hou Tao @ 2017-09-18 18:43 ` Darrick J. Wong 0 siblings, 0 replies; 5+ messages in thread From: Darrick J. Wong @ 2017-09-18 18:43 UTC (permalink / raw) To: Hou Tao; +Cc: Brian Foster, linux-xfs, david On Tue, Sep 12, 2017 at 04:38:03PM +0800, Hou Tao wrote: > Hi, > > On 2017/9/9 1:42, Brian Foster wrote: > > On Thu, Sep 07, 2017 at 04:04:57PM +0800, Hou Tao wrote: > >> A umount hang is possible when a race occurs between the umount > >> process and the xfsaild kthread. The following sequences outline > >> the race: > >> > >> xfsaild: kthread_should_stop() > >> => return false, so xfsaild continue > >> > >> umount: set_bit(KTHREAD_SHOULD_STOP, &kthread->flags) > >> => by kthread_stop() > >> umount: wake_up_process() > >> => because xfsaild is still running, so 0 is returned > >> > >> xfsaild: __set_current_state(TASK_INTERRUPTIBLE) > >> xfsaild: schedule() > >> => now, xfsaild will wait indefinitely > >> > >> umount: wait_for_completion() > >> => and umount will hang > >> > >> To fix that, we need to check kthread_should_stop() after we set > >> the task state, so the xfsaild will either see the stop bit and > >> exit or the task state is reset to runnable by wake_up_process() > >> such that it isn't blocked indefinitely and detects the stop bit > >> at the next iteration. > >> > >> Signed-off-by: Hou Tao <houtao1@huawei.com> > >> --- > > > > I assume you've verified this against your local reproducer? Otherwise > > just a nit on the comment... > Yes, the patch fixes the test case with the delay hacks. After applying the > patch, I also have tried to move the artificial delay in xfsaild down to the > next of kthread_should_stop() and there is no hang of umount. Can this be turned into an xfstest, please? --D > > >> fs/xfs/xfs_trans_ail.c | 17 ++++++++++++++--- > >> 1 file changed, 14 insertions(+), 3 deletions(-) > >> > >> diff --git a/fs/xfs/xfs_trans_ail.c b/fs/xfs/xfs_trans_ail.c > >> index 9056c0f..cd6e185 100644 > >> --- a/fs/xfs/xfs_trans_ail.c > >> +++ b/fs/xfs/xfs_trans_ail.c > >> @@ -499,11 +499,22 @@ xfsaild( > >> current->flags |= PF_MEMALLOC; > >> set_freezable(); > >> > >> - while (!kthread_should_stop()) { > >> + while (1) { > >> if (tout && tout <= 20) > >> - __set_current_state(TASK_KILLABLE); > >> + set_current_state(TASK_KILLABLE); > >> else > >> - __set_current_state(TASK_INTERRUPTIBLE); > >> + set_current_state(TASK_INTERRUPTIBLE); > >> + > >> + /* > >> + * Check kthread_should_stop() after we set the task state > >> + * to guarantee that we either see the stop bit and exit or the > >> + * task state is reset to runnable such that it's not blocked > >> + * indefinitely and detects the stop bit at the next iteration. > >> + */ > > > > I'd change the "it's not blocked indefinitely" wording to something like > > "it's not scheduled out indefinitely." Also, a mention that the task > > state sets above include a memory barrier to serialize against > > kthread_stop() couldn't hurt. Otherwise this looks fine to me: > > > > Reviewed-by: Brian Foster <bfoster@redhat.com> > Thanks for your comments. I will send v2 soon. > > Regards, > > Tao > > -- > To unsubscribe from this list: send the line "unsubscribe linux-xfs" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2017-09-18 18:43 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2017-09-07 8:04 [PATCH] xfs: check kthread_should_stop() after the setting of task state Hou Tao 2017-09-08 7:32 ` Christoph Hellwig 2017-09-08 17:42 ` Brian Foster 2017-09-12 8:38 ` Hou Tao 2017-09-18 18:43 ` Darrick J. Wong
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).