[PATCH] f2fs: avoid hungtask problem caused by losing wake

linux-f2fs-devel.lists.sourceforge.net archive mirror
 help / color / mirror / Atom feed

* [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
@ 2016-02-23  4:07 Yunlei He
  2016-02-23  5:44 ` Chao Yu
  0 siblings, 1 reply; 16+ messages in thread
From: Yunlei He @ 2016-02-23  4:07 UTC (permalink / raw)
  To: chao2.yu, jaegeuk, linux-f2fs-devel; +Cc: Biao He

The D state of wait_on_all_pages_writeback should be waken by
function f2fs_write_end_io when all writeback pages have been
succesfully written to device. It's possible that wake_up comes
between get_pages and io_schedule. Maybe in this case it will
lost wake_up and still in D state even if all pages have been
write back to device, and finally, the whole system will be into
the hungtask state.

                if (!get_pages(sbi, F2FS_WRITEBACK))
                         break;
					<---------  wake_up
                io_schedule();

Signed-off-by: Yunlei He <heyunlei@huawei.com>
Signed-off-by: Biao He <hebiao6@huawei.com>
---
 fs/f2fs/checkpoint.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
index 2bac8a1..f55355d 100644
--- a/fs/f2fs/checkpoint.c
+++ b/fs/f2fs/checkpoint.c
@@ -920,7 +920,7 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
 		if (!get_pages(sbi, F2FS_WRITEBACK))
 			break;
 
-		io_schedule();
+		io_schedule_timeout(5*HZ);
 	}
 	finish_wait(&sbi->cp_wait, &wait);
 }
-- 
1.9.1


------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply related	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-23  4:07 [PATCH] f2fs: avoid hungtask problem caused by losing wake_up Yunlei He
@ 2016-02-23  5:44 ` Chao Yu
  2016-02-23  7:02   ` He YunLei
  0 siblings, 1 reply; 16+ messages in thread
From: Chao Yu @ 2016-02-23  5:44 UTC (permalink / raw)
  To: 'Yunlei He', jaegeuk, linux-f2fs-devel; +Cc: 'Biao He'

Hi Yunlei,

> -----Original Message-----
> From: Yunlei He [mailto:heyunlei@huawei.com]
> Sent: Tuesday, February 23, 2016 12:08 PM
> To: chao2.yu@samsung.com; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> 
> The D state of wait_on_all_pages_writeback should be waken by
> function f2fs_write_end_io when all writeback pages have been
> succesfully written to device. It's possible that wake_up comes
> between get_pages and io_schedule. Maybe in this case it will
> lost wake_up and still in D state even if all pages have been
> write back to device, and finally, the whole system will be into
> the hungtask state.

I haven't encountered such issue so far, do you suffer this in real
world?

> 
>                 if (!get_pages(sbi, F2FS_WRITEBACK))
>                          break;
> 					<---------  wake_up

wake_up will put all tasks linked in sbi->cp_wait on run-queue, so
here it should be save to call io_schedule, after being rescheduled,
it will get the chance to check above condition to break out.

Thanks,

>                 io_schedule();
> 
> Signed-off-by: Yunlei He <heyunlei@huawei.com>
> Signed-off-by: Biao He <hebiao6@huawei.com>
> ---
>  fs/f2fs/checkpoint.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
> index 2bac8a1..f55355d 100644
> --- a/fs/f2fs/checkpoint.c
> +++ b/fs/f2fs/checkpoint.c
> @@ -920,7 +920,7 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
>  		if (!get_pages(sbi, F2FS_WRITEBACK))
>  			break;
> 
> -		io_schedule();
> +		io_schedule_timeout(5*HZ);
>  	}
>  	finish_wait(&sbi->cp_wait, &wait);
>  }
> --
> 1.9.1



------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-23  5:44 ` Chao Yu
@ 2016-02-23  7:02   ` He YunLei
  2016-02-23  9:15     ` Chao Yu
  2016-02-23  9:32     ` Shawn Lin
  0 siblings, 2 replies; 16+ messages in thread
From: He YunLei @ 2016-02-23  7:02 UTC (permalink / raw)
  To: Chao Yu, jaegeuk, linux-f2fs-devel; +Cc: 'Biao He'

On 2016/2/23 13:44, Chao Yu wrote:
> Hi Yunlei,
Hi Chao,
>
>> -----Original Message-----
>> From: Yunlei He [mailto:heyunlei@huawei.com]
>> Sent: Tuesday, February 23, 2016 12:08 PM
>> To: chao2.yu@samsung.com; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
>> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
>> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
>>
>> The D state of wait_on_all_pages_writeback should be waken by
>> function f2fs_write_end_io when all writeback pages have been
>> succesfully written to device. It's possible that wake_up comes
>> between get_pages and io_schedule. Maybe in this case it will
>> lost wake_up and still in D state even if all pages have been
>> write back to device, and finally, the whole system will be into
>> the hungtask state.
>
> I haven't encountered such issue so far, do you suffer this in real
> world?
>
yes, I have encounter it, the whole file system is blocked at function
wait_on_all_pages_writeback beyond 120s when write cp, and no error reported
by storage device driver.
>>
>>                  if (!get_pages(sbi, F2FS_WRITEBACK))
>>                           break;
>> 					<---------  wake_up
>
> wake_up will put all tasks linked in sbi->cp_wait on run-queue, so
> here it should be save to call io_schedule, after being rescheduled,
> it will get the chance to check above condition to break out.
>
> Thanks,

Here, we just doubt something weird may cause wait_on_all_pages_writeback
could not be waken. Wake_up trigger only one time by last bio's end_io
function, if the thread happen to miss it, the thread will be in D state
forever. So we change the code to make wait_on_all_pages_writeback awaken
periodically, then check the condition.

>
>>                  io_schedule();
>>
>> Signed-off-by: Yunlei He <heyunlei@huawei.com>
>> Signed-off-by: Biao He <hebiao6@huawei.com>
>> ---
>>   fs/f2fs/checkpoint.c | 2 +-
>>   1 file changed, 1 insertion(+), 1 deletion(-)
>>
>> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
>> index 2bac8a1..f55355d 100644
>> --- a/fs/f2fs/checkpoint.c
>> +++ b/fs/f2fs/checkpoint.c
>> @@ -920,7 +920,7 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
>>   		if (!get_pages(sbi, F2FS_WRITEBACK))
>>   			break;
>>
>> -		io_schedule();
>> +		io_schedule_timeout(5*HZ);
>>   	}
>>   	finish_wait(&sbi->cp_wait, &wait);
>>   }
>> --
>> 1.9.1
>
>
>
> .
>


------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-23  7:02   ` He YunLei
@ 2016-02-23  9:15     ` Chao Yu
  2016-02-23 11:36       ` He YunLei
  2016-02-23  9:32     ` Shawn Lin
  1 sibling, 1 reply; 16+ messages in thread
From: Chao Yu @ 2016-02-23  9:15 UTC (permalink / raw)
  To: 'He YunLei', jaegeuk, linux-f2fs-devel; +Cc: 'Biao He'

Hi Yunlei,

> -----Original Message-----
> From: He YunLei [mailto:heyunlei@huawei.com]
> Sent: Tuesday, February 23, 2016 3:03 PM
> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> Cc: bintian.wang@huawei.com; 'Biao He'
> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> 
> On 2016/2/23 13:44, Chao Yu wrote:
> > Hi Yunlei,
> Hi Chao,
> >
> >> -----Original Message-----
> >> From: Yunlei He [mailto:heyunlei@huawei.com]
> >> Sent: Tuesday, February 23, 2016 12:08 PM
> >> To: chao2.yu@samsung.com; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> >> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
> >> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> >>
> >> The D state of wait_on_all_pages_writeback should be waken by
> >> function f2fs_write_end_io when all writeback pages have been
> >> succesfully written to device. It's possible that wake_up comes
> >> between get_pages and io_schedule. Maybe in this case it will
> >> lost wake_up and still in D state even if all pages have been
> >> write back to device, and finally, the whole system will be into
> >> the hungtask state.
> >
> > I haven't encountered such issue so far, do you suffer this in real
> > world?
> >
> yes, I have encounter it, the whole file system is blocked at function
> wait_on_all_pages_writeback beyond 120s when write cp, and no error reported
> by storage device driver.

Could this reproducible? If it could, could you please share the details.
And did this occur in a huge size f2fs image?

> >>
> >>                  if (!get_pages(sbi, F2FS_WRITEBACK))
> >>                           break;
> >> 					<---------  wake_up
> >
> > wake_up will put all tasks linked in sbi->cp_wait on run-queue, so
> > here it should be save to call io_schedule, after being rescheduled,
> > it will get the chance to check above condition to break out.
> >
> > Thanks,
> 
> Here, we just doubt something weird may cause wait_on_all_pages_writeback
> could not be waken. Wake_up trigger only one time by last bio's end_io
> function, if the thread happen to miss it, the thread will be in D state
> forever. So we change the code to make wait_on_all_pages_writeback awaken
> periodically, then check the condition.

Got it.

The patch can fix issue that checkpointer will wait forever in case of
write_end_io was failed to call wake_up for some reason.

But I doubt more that the reason we are stuck is there are remained pages
cached in bio buffer without being submitted. To make sure, maybe in
wait_on_all_pages_writeback we could add print info to see whether
sbi->write_io[].bio is valid or not.

Thanks,

> 
> >
> >>                  io_schedule();
> >>
> >> Signed-off-by: Yunlei He <heyunlei@huawei.com>
> >> Signed-off-by: Biao He <hebiao6@huawei.com>
> >> ---
> >>   fs/f2fs/checkpoint.c | 2 +-
> >>   1 file changed, 1 insertion(+), 1 deletion(-)
> >>
> >> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
> >> index 2bac8a1..f55355d 100644
> >> --- a/fs/f2fs/checkpoint.c
> >> +++ b/fs/f2fs/checkpoint.c
> >> @@ -920,7 +920,7 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
> >>   		if (!get_pages(sbi, F2FS_WRITEBACK))
> >>   			break;
> >>
> >> -		io_schedule();
> >> +		io_schedule_timeout(5*HZ);
> >>   	}
> >>   	finish_wait(&sbi->cp_wait, &wait);
> >>   }
> >> --
> >> 1.9.1
> >
> >
> >
> > .
> >



------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-23  9:15     ` Chao Yu
@ 2016-02-23 11:36       ` He YunLei
  2016-02-24  3:46         ` Chao Yu
  0 siblings, 1 reply; 16+ messages in thread
From: He YunLei @ 2016-02-23 11:36 UTC (permalink / raw)
  To: Chao Yu, jaegeuk, linux-f2fs-devel; +Cc: 'Biao He'

On 2016/2/23 17:15, Chao Yu wrote:
Hi Chao,

> Hi Yunlei,
>
>> -----Original Message-----
>> From: He YunLei [mailto:heyunlei@huawei.com]
>> Sent: Tuesday, February 23, 2016 3:03 PM
>> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
>> Cc: bintian.wang@huawei.com; 'Biao He'
>> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
>>
>> On 2016/2/23 13:44, Chao Yu wrote:
>>> Hi Yunlei,
>> Hi Chao,
>>>
>>>> -----Original Message-----
>>>> From: Yunlei He [mailto:heyunlei@huawei.com]
>>>> Sent: Tuesday, February 23, 2016 12:08 PM
>>>> To: chao2.yu@samsung.com; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
>>>> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
>>>> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
>>>>
>>>> The D state of wait_on_all_pages_writeback should be waken by
>>>> function f2fs_write_end_io when all writeback pages have been
>>>> succesfully written to device. It's possible that wake_up comes
>>>> between get_pages and io_schedule. Maybe in this case it will
>>>> lost wake_up and still in D state even if all pages have been
>>>> write back to device, and finally, the whole system will be into
>>>> the hungtask state.
>>>
>>> I haven't encountered such issue so far, do you suffer this in real
>>> world?
>>>
>> yes, I have encounter it, the whole file system is blocked at function
>> wait_on_all_pages_writeback beyond 120s when write cp, and no error reported
>> by storage device driver.
>
> Could this reproducible? If it could, could you please share the details.
> And did this occur in a huge size f2fs image?
>
>>>>
>>>>                   if (!get_pages(sbi, F2FS_WRITEBACK))
>>>>                            break;
>>>> 					<---------  wake_up
>>>
>>> wake_up will put all tasks linked in sbi->cp_wait on run-queue, so
>>> here it should be save to call io_schedule, after being rescheduled,
>>> it will get the chance to check above condition to break out.
>>>
>>> Thanks,
>>
>> Here, we just doubt something weird may cause wait_on_all_pages_writeback
>> could not be waken. Wake_up trigger only one time by last bio's end_io
>> function, if the thread happen to miss it, the thread will be in D state
>> forever. So we change the code to make wait_on_all_pages_writeback awaken
>> periodically, then check the condition.
>
> Got it.
>
> The patch can fix issue that checkpointer will wait forever in case of
> write_end_io was failed to call wake_up for some reason.
>
> But I doubt more that the reason we are stuck is there are remained pages
> cached in bio buffer without being submitted. To make sure, maybe in
> wait_on_all_pages_writeback we could add print info to see whether
> sbi->write_io[].bio is valid or not.
>
We use tool dump f2fs_sb_info information and find that:

	write_io[DATA].bio = 0;
	write_io[NODE].bio = 0;
	write_io[META].bio = 0;

	nr_pages[F2FS_WRITEBACK] = 0;
	nr_pages[F2FS_DIRTY_DENTS] = 0;
	nr_pages[F2FS_DIRTY_NODES] = 13;
	nr_pages[F2FS_DIRTY_META] = 0;
	nr_pages[F2FS_INMEM_PAGES] = 0;

So we believe that the block device is ok!

Thanks,

> Thanks,
>
>>
>>>
>>>>                   io_schedule();
>>>>
>>>> Signed-off-by: Yunlei He <heyunlei@huawei.com>
>>>> Signed-off-by: Biao He <hebiao6@huawei.com>
>>>> ---
>>>>    fs/f2fs/checkpoint.c | 2 +-
>>>>    1 file changed, 1 insertion(+), 1 deletion(-)
>>>>
>>>> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
>>>> index 2bac8a1..f55355d 100644
>>>> --- a/fs/f2fs/checkpoint.c
>>>> +++ b/fs/f2fs/checkpoint.c
>>>> @@ -920,7 +920,7 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
>>>>    		if (!get_pages(sbi, F2FS_WRITEBACK))
>>>>    			break;
>>>>
>>>> -		io_schedule();
>>>> +		io_schedule_timeout(5*HZ);
>>>>    	}
>>>>    	finish_wait(&sbi->cp_wait, &wait);
>>>>    }
>>>> --
>>>> 1.9.1
>>>
>>>
>>>
>>> .
>>>
>
>
>
> .
>


------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-23 11:36       ` He YunLei
@ 2016-02-24  3:46         ` Chao Yu
  2016-02-24  7:32           ` He YunLei
  2016-02-25  7:36           ` He YunLei
  0 siblings, 2 replies; 16+ messages in thread
From: Chao Yu @ 2016-02-24  3:46 UTC (permalink / raw)
  To: 'He YunLei', jaegeuk, linux-f2fs-devel; +Cc: 'Biao He'

Hi Yunlei,

> -----Original Message-----
> From: He YunLei [mailto:heyunlei@huawei.com]
> Sent: Tuesday, February 23, 2016 7:36 PM
> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> Cc: bintian.wang@huawei.com; 'Biao He'
> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> 
> On 2016/2/23 17:15, Chao Yu wrote:
> Hi Chao,
> 
> > Hi Yunlei,
> >
> >> -----Original Message-----
> >> From: He YunLei [mailto:heyunlei@huawei.com]
> >> Sent: Tuesday, February 23, 2016 3:03 PM
> >> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> >> Cc: bintian.wang@huawei.com; 'Biao He'
> >> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> >>
> >> On 2016/2/23 13:44, Chao Yu wrote:
> >>> Hi Yunlei,
> >> Hi Chao,
> >>>
> >>>> -----Original Message-----
> >>>> From: Yunlei He [mailto:heyunlei@huawei.com]
> >>>> Sent: Tuesday, February 23, 2016 12:08 PM
> >>>> To: chao2.yu@samsung.com; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> >>>> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
> >>>> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> >>>>
> >>>> The D state of wait_on_all_pages_writeback should be waken by
> >>>> function f2fs_write_end_io when all writeback pages have been
> >>>> succesfully written to device. It's possible that wake_up comes
> >>>> between get_pages and io_schedule. Maybe in this case it will
> >>>> lost wake_up and still in D state even if all pages have been
> >>>> write back to device, and finally, the whole system will be into
> >>>> the hungtask state.
> >>>
> >>> I haven't encountered such issue so far, do you suffer this in real
> >>> world?
> >>>
> >> yes, I have encounter it, the whole file system is blocked at function
> >> wait_on_all_pages_writeback beyond 120s when write cp, and no error reported
> >> by storage device driver.
> >
> > Could this reproducible? If it could, could you please share the details.
> > And did this occur in a huge size f2fs image?
> >
> >>>>
> >>>>                   if (!get_pages(sbi, F2FS_WRITEBACK))
> >>>>                            break;
> >>>> 					<---------  wake_up
> >>>
> >>> wake_up will put all tasks linked in sbi->cp_wait on run-queue, so
> >>> here it should be save to call io_schedule, after being rescheduled,
> >>> it will get the chance to check above condition to break out.
> >>>
> >>> Thanks,
> >>
> >> Here, we just doubt something weird may cause wait_on_all_pages_writeback
> >> could not be waken. Wake_up trigger only one time by last bio's end_io
> >> function, if the thread happen to miss it, the thread will be in D state
> >> forever. So we change the code to make wait_on_all_pages_writeback awaken
> >> periodically, then check the condition.
> >
> > Got it.
> >
> > The patch can fix issue that checkpointer will wait forever in case of
> > write_end_io was failed to call wake_up for some reason.

I found one possible case:

CPU0:					CPU1:
 - write_checkpoint
  - do_checkpoint
   - wait_on_all_pages_writeback
					 - f2fs_write_end_io
					  - wake_up
					this is last writebacked page, but
					no sleeper in sbi->cp_wait wait
					queue, wake_up is not been called.
    - prepare_to_wait(TASK_UNINTERRUPTIBLE)
    Here, current task is been preempted,
    but there will be no waker to wake up
    this task since last write_end_io
    has been called before. So current
    task will sleep forever.
    - io_schedule

How do you think of it?

And if this is right, following patch can fix this issue.

---
 fs/f2fs/checkpoint.c | 14 +++++++++-----
 fs/f2fs/data.c       |  9 +++++++--
 fs/f2fs/f2fs.h       |  3 ++-
 fs/f2fs/super.c      |  1 +
 4 files changed, 19 insertions(+), 8 deletions(-)

diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
index 9d277f8..9446c3d 100644
--- a/fs/f2fs/checkpoint.c
+++ b/fs/f2fs/checkpoint.c
@@ -914,15 +914,19 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
 {
 	DEFINE_WAIT(wait);
 
-	for (;;) {
-		prepare_to_wait(&sbi->cp_wait, &wait, TASK_UNINTERRUPTIBLE);
+	spin_lock(&sbi->cp_wb_lock);
 
-		if (!get_pages(sbi, F2FS_WRITEBACK))
-			break;
+	while (get_pages(sbi, F2FS_WRITEBACK)) {
+		prepare_to_wait(&sbi->cp_wait, &wait, TASK_UNINTERRUPTIBLE);
 
+		spin_unlock(&sbi->cp_wb_lock);
 		io_schedule();
+		spin_lock(&sbi->cp_wb_lock);
+
+		finish_wait(&sbi->cp_wait, &wait);
 	}
-	finish_wait(&sbi->cp_wait, &wait);
+
+	spin_unlock(&sbi->cp_wb_lock);
 }
 
 static int do_checkpoint(struct f2fs_sb_info *sbi, struct cp_control *cpc)
diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c
index e5c762b..e31deb97 100644
--- a/fs/f2fs/data.c
+++ b/fs/f2fs/data.c
@@ -59,6 +59,7 @@ static void f2fs_write_end_io(struct bio *bio)
 {
 	struct f2fs_sb_info *sbi = bio->bi_private;
 	struct bio_vec *bvec;
+	unsigned long flags;
 	int i;
 
 	bio_for_each_segment_all(bvec, bio, i) {
@@ -74,8 +75,12 @@ static void f2fs_write_end_io(struct bio *bio)
 		dec_page_count(sbi, F2FS_WRITEBACK);
 	}
 
-	if (!get_pages(sbi, F2FS_WRITEBACK) && wq_has_sleeper(&sbi->cp_wait))
-		wake_up(&sbi->cp_wait);
+	if (!get_pages(sbi, F2FS_WRITEBACK)) {
+		spin_lock_irqsave(&sbi->cp_wb_lock, flags);
+		if (wq_has_sleeper(&sbi->cp_wait))
+			wake_up(&sbi->cp_wait);
+		spin_unlock_irqrestore(&sbi->cp_wb_lock, flags);
+	}
 
 	bio_put(bio);
 }
diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h
index 0d25430..fd47984 100644
--- a/fs/f2fs/f2fs.h
+++ b/fs/f2fs/f2fs.h
@@ -727,7 +727,8 @@ struct f2fs_sb_info {
 	struct rw_semaphore cp_rwsem;		/* blocking FS operations */
 	struct rw_semaphore node_write;		/* locking node writes */
 	struct mutex writepages;		/* mutex for writepages() */
-	wait_queue_head_t cp_wait;
+	wait_queue_head_t cp_wait;		/* for wait pages writeback */
+	spinlock_t cp_wb_lock;			/* for protect cp_wait */
 	unsigned long last_time[MAX_TIME];	/* to store time in jiffies */
 	long interval_time[MAX_TIME];		/* to store thresholds */
 
diff --git a/fs/f2fs/super.c b/fs/f2fs/super.c
index 7b62016..5316c7a 100644
--- a/fs/f2fs/super.c
+++ b/fs/f2fs/super.c
@@ -1374,6 +1374,7 @@ try_onemore:
 
 	init_rwsem(&sbi->cp_rwsem);
 	init_waitqueue_head(&sbi->cp_wait);
+	spin_lock_init(&sbi->cp_wb_lock);
 	init_sb_info(sbi);
 
 	/* get an inode for meta space */
-- 
2.7.0

> >
> > But I doubt more that the reason we are stuck is there are remained pages
> > cached in bio buffer without being submitted. To make sure, maybe in
> > wait_on_all_pages_writeback we could add print info to see whether
> > sbi->write_io[].bio is valid or not.
> >
> We use tool dump f2fs_sb_info information and find that:
> 
> 	write_io[DATA].bio = 0;
> 	write_io[NODE].bio = 0;
> 	write_io[META].bio = 0;
> 
> 	nr_pages[F2FS_WRITEBACK] = 0;
> 	nr_pages[F2FS_DIRTY_DENTS] = 0;
> 	nr_pages[F2FS_DIRTY_NODES] = 13;

Weird, dirty nodes count should be 0.

Thanks

> 	nr_pages[F2FS_DIRTY_META] = 0;
> 	nr_pages[F2FS_INMEM_PAGES] = 0;
> 
> So we believe that the block device is ok!
> 
> Thanks,
> 
> > Thanks,
> >
> >>
> >>>
> >>>>                   io_schedule();
> >>>>
> >>>> Signed-off-by: Yunlei He <heyunlei@huawei.com>
> >>>> Signed-off-by: Biao He <hebiao6@huawei.com>
> >>>> ---
> >>>>    fs/f2fs/checkpoint.c | 2 +-
> >>>>    1 file changed, 1 insertion(+), 1 deletion(-)
> >>>>
> >>>> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
> >>>> index 2bac8a1..f55355d 100644
> >>>> --- a/fs/f2fs/checkpoint.c
> >>>> +++ b/fs/f2fs/checkpoint.c
> >>>> @@ -920,7 +920,7 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
> >>>>    		if (!get_pages(sbi, F2FS_WRITEBACK))
> >>>>    			break;
> >>>>
> >>>> -		io_schedule();
> >>>> +		io_schedule_timeout(5*HZ);
> >>>>    	}
> >>>>    	finish_wait(&sbi->cp_wait, &wait);
> >>>>    }
> >>>> --
> >>>> 1.9.1
> >>>
> >>>
> >>>
> >>> .
> >>>
> >
> >
> >
> > .
> >



------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply related	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-24  3:46         ` Chao Yu
@ 2016-02-24  7:32           ` He YunLei
  2016-02-24  8:05             ` Chao Yu
  2016-02-25  7:36           ` He YunLei
  1 sibling, 1 reply; 16+ messages in thread
From: He YunLei @ 2016-02-24  7:32 UTC (permalink / raw)
  To: Chao Yu, jaegeuk, linux-f2fs-devel; +Cc: 'Biao He'

On 2016/2/24 11:46, Chao Yu wrote:
> Hi Yunlei,
>
>> -----Original Message-----
>> From: He YunLei [mailto:heyunlei@huawei.com]
>> Sent: Tuesday, February 23, 2016 7:36 PM
>> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
>> Cc: bintian.wang@huawei.com; 'Biao He'
>> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
>>
>> On 2016/2/23 17:15, Chao Yu wrote:
>> Hi Chao,
>>
>>> Hi Yunlei,
>>>
>>>> -----Original Message-----
>>>> From: He YunLei [mailto:heyunlei@huawei.com]
>>>> Sent: Tuesday, February 23, 2016 3:03 PM
>>>> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
>>>> Cc: bintian.wang@huawei.com; 'Biao He'
>>>> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
>>>>
>>>> On 2016/2/23 13:44, Chao Yu wrote:
>>>>> Hi Yunlei,
>>>> Hi Chao,
>>>>>
>>>>>> -----Original Message-----
>>>>>> From: Yunlei He [mailto:heyunlei@huawei.com]
>>>>>> Sent: Tuesday, February 23, 2016 12:08 PM
>>>>>> To: chao2.yu@samsung.com; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
>>>>>> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
>>>>>> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
>>>>>>
>>>>>> The D state of wait_on_all_pages_writeback should be waken by
>>>>>> function f2fs_write_end_io when all writeback pages have been
>>>>>> succesfully written to device. It's possible that wake_up comes
>>>>>> between get_pages and io_schedule. Maybe in this case it will
>>>>>> lost wake_up and still in D state even if all pages have been
>>>>>> write back to device, and finally, the whole system will be into
>>>>>> the hungtask state.
>>>>>
>>>>> I haven't encountered such issue so far, do you suffer this in real
>>>>> world?
>>>>>
>>>> yes, I have encounter it, the whole file system is blocked at function
>>>> wait_on_all_pages_writeback beyond 120s when write cp, and no error reported
>>>> by storage device driver.
>>>
>>> Could this reproducible? If it could, could you please share the details.
>>> And did this occur in a huge size f2fs image?
>>>
>>>>>>
>>>>>>                    if (!get_pages(sbi, F2FS_WRITEBACK))
>>>>>>                             break;
>>>>>> 					<---------  wake_up
>>>>>
>>>>> wake_up will put all tasks linked in sbi->cp_wait on run-queue, so
>>>>> here it should be save to call io_schedule, after being rescheduled,
>>>>> it will get the chance to check above condition to break out.
>>>>>
>>>>> Thanks,
>>>>
>>>> Here, we just doubt something weird may cause wait_on_all_pages_writeback
>>>> could not be waken. Wake_up trigger only one time by last bio's end_io
>>>> function, if the thread happen to miss it, the thread will be in D state
>>>> forever. So we change the code to make wait_on_all_pages_writeback awaken
>>>> periodically, then check the condition.
>>>
>>> Got it.
>>>
>>> The patch can fix issue that checkpointer will wait forever in case of
>>> write_end_io was failed to call wake_up for some reason.
>
> I found one possible case:
>
> CPU0:					CPU1:
>   - write_checkpoint
>    - do_checkpoint
>     - wait_on_all_pages_writeback
> 					 - f2fs_write_end_io
> 					  - wake_up
> 					this is last writebacked page, but
> 					no sleeper in sbi->cp_wait wait
> 					queue, wake_up is not been called.
>      - prepare_to_wait(TASK_UNINTERRUPTIBLE)
>      Here, current task is been preempted,
>      but there will be no waker to wake up
>      this task since last write_end_io
>      has been called before. So current
>      task will sleep forever.
>      - io_schedule
>
> How do you think of it?
Hi Chao,

Here, current task add itself into wait queue at first, and then check the
condition whether write back page is zero. So, in the above situation,
current task is been preempted in - prepare_to_wait(TASK_UNINTERRUPTIBLE),
current task will not sleep for the write back page is zero.

Thanks,
>
> And if this is right, following patch can fix this issue.
>
> ---
>   fs/f2fs/checkpoint.c | 14 +++++++++-----
>   fs/f2fs/data.c       |  9 +++++++--
>   fs/f2fs/f2fs.h       |  3 ++-
>   fs/f2fs/super.c      |  1 +
>   4 files changed, 19 insertions(+), 8 deletions(-)
>
> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
> index 9d277f8..9446c3d 100644
> --- a/fs/f2fs/checkpoint.c
> +++ b/fs/f2fs/checkpoint.c
> @@ -914,15 +914,19 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
>   {
>   	DEFINE_WAIT(wait);
>
> -	for (;;) {
> -		prepare_to_wait(&sbi->cp_wait, &wait, TASK_UNINTERRUPTIBLE);
> +	spin_lock(&sbi->cp_wb_lock);
>
> -		if (!get_pages(sbi, F2FS_WRITEBACK))
> -			break;
> +	while (get_pages(sbi, F2FS_WRITEBACK)) {
> +		prepare_to_wait(&sbi->cp_wait, &wait, TASK_UNINTERRUPTIBLE);
>
> +		spin_unlock(&sbi->cp_wb_lock);
>   		io_schedule();
> +		spin_lock(&sbi->cp_wb_lock);
> +
> +		finish_wait(&sbi->cp_wait, &wait);
>   	}
> -	finish_wait(&sbi->cp_wait, &wait);
> +
> +	spin_unlock(&sbi->cp_wb_lock);
>   }
>
>   static int do_checkpoint(struct f2fs_sb_info *sbi, struct cp_control *cpc)
> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c
> index e5c762b..e31deb97 100644
> --- a/fs/f2fs/data.c
> +++ b/fs/f2fs/data.c
> @@ -59,6 +59,7 @@ static void f2fs_write_end_io(struct bio *bio)
>   {
>   	struct f2fs_sb_info *sbi = bio->bi_private;
>   	struct bio_vec *bvec;
> +	unsigned long flags;
>   	int i;
>
>   	bio_for_each_segment_all(bvec, bio, i) {
> @@ -74,8 +75,12 @@ static void f2fs_write_end_io(struct bio *bio)
>   		dec_page_count(sbi, F2FS_WRITEBACK);
>   	}
>
> -	if (!get_pages(sbi, F2FS_WRITEBACK) && wq_has_sleeper(&sbi->cp_wait))
> -		wake_up(&sbi->cp_wait);
> +	if (!get_pages(sbi, F2FS_WRITEBACK)) {
> +		spin_lock_irqsave(&sbi->cp_wb_lock, flags);
> +		if (wq_has_sleeper(&sbi->cp_wait))
> +			wake_up(&sbi->cp_wait);
> +		spin_unlock_irqrestore(&sbi->cp_wb_lock, flags);
> +	}
>
>   	bio_put(bio);
>   }
> diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h
> index 0d25430..fd47984 100644
> --- a/fs/f2fs/f2fs.h
> +++ b/fs/f2fs/f2fs.h
> @@ -727,7 +727,8 @@ struct f2fs_sb_info {
>   	struct rw_semaphore cp_rwsem;		/* blocking FS operations */
>   	struct rw_semaphore node_write;		/* locking node writes */
>   	struct mutex writepages;		/* mutex for writepages() */
> -	wait_queue_head_t cp_wait;
> +	wait_queue_head_t cp_wait;		/* for wait pages writeback */
> +	spinlock_t cp_wb_lock;			/* for protect cp_wait */
>   	unsigned long last_time[MAX_TIME];	/* to store time in jiffies */
>   	long interval_time[MAX_TIME];		/* to store thresholds */
>
> diff --git a/fs/f2fs/super.c b/fs/f2fs/super.c
> index 7b62016..5316c7a 100644
> --- a/fs/f2fs/super.c
> +++ b/fs/f2fs/super.c
> @@ -1374,6 +1374,7 @@ try_onemore:
>
>   	init_rwsem(&sbi->cp_rwsem);
>   	init_waitqueue_head(&sbi->cp_wait);
> +	spin_lock_init(&sbi->cp_wb_lock);
>   	init_sb_info(sbi);
>
>   	/* get an inode for meta space */
>


------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-24  7:32           ` He YunLei
@ 2016-02-24  8:05             ` Chao Yu
  2016-02-24  9:45               ` hebiao (G)
  0 siblings, 1 reply; 16+ messages in thread
From: Chao Yu @ 2016-02-24  8:05 UTC (permalink / raw)
  To: 'He YunLei', jaegeuk, linux-f2fs-devel; +Cc: 'Biao He'

Hi Yunlei,

> -----Original Message-----
> From: He YunLei [mailto:heyunlei@huawei.com]
> Sent: Wednesday, February 24, 2016 3:32 PM
> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> Cc: bintian.wang@huawei.com; 'Biao He'
> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> 
> On 2016/2/24 11:46, Chao Yu wrote:
> > Hi Yunlei,
> >
> >> -----Original Message-----
> >> From: He YunLei [mailto:heyunlei@huawei.com]
> >> Sent: Tuesday, February 23, 2016 7:36 PM
> >> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> >> Cc: bintian.wang@huawei.com; 'Biao He'
> >> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> >>
> >> On 2016/2/23 17:15, Chao Yu wrote:
> >> Hi Chao,
> >>
> >>> Hi Yunlei,
> >>>
> >>>> -----Original Message-----
> >>>> From: He YunLei [mailto:heyunlei@huawei.com]
> >>>> Sent: Tuesday, February 23, 2016 3:03 PM
> >>>> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> >>>> Cc: bintian.wang@huawei.com; 'Biao He'
> >>>> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> >>>>
> >>>> On 2016/2/23 13:44, Chao Yu wrote:
> >>>>> Hi Yunlei,
> >>>> Hi Chao,
> >>>>>
> >>>>>> -----Original Message-----
> >>>>>> From: Yunlei He [mailto:heyunlei@huawei.com]
> >>>>>> Sent: Tuesday, February 23, 2016 12:08 PM
> >>>>>> To: chao2.yu@samsung.com; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> >>>>>> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
> >>>>>> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> >>>>>>
> >>>>>> The D state of wait_on_all_pages_writeback should be waken by
> >>>>>> function f2fs_write_end_io when all writeback pages have been
> >>>>>> succesfully written to device. It's possible that wake_up comes
> >>>>>> between get_pages and io_schedule. Maybe in this case it will
> >>>>>> lost wake_up and still in D state even if all pages have been
> >>>>>> write back to device, and finally, the whole system will be into
> >>>>>> the hungtask state.
> >>>>>
> >>>>> I haven't encountered such issue so far, do you suffer this in real
> >>>>> world?
> >>>>>
> >>>> yes, I have encounter it, the whole file system is blocked at function
> >>>> wait_on_all_pages_writeback beyond 120s when write cp, and no error reported
> >>>> by storage device driver.
> >>>
> >>> Could this reproducible? If it could, could you please share the details.
> >>> And did this occur in a huge size f2fs image?
> >>>
> >>>>>>
> >>>>>>                    if (!get_pages(sbi, F2FS_WRITEBACK))
> >>>>>>                             break;
> >>>>>> 					<---------  wake_up
> >>>>>
> >>>>> wake_up will put all tasks linked in sbi->cp_wait on run-queue, so
> >>>>> here it should be save to call io_schedule, after being rescheduled,
> >>>>> it will get the chance to check above condition to break out.
> >>>>>
> >>>>> Thanks,
> >>>>
> >>>> Here, we just doubt something weird may cause wait_on_all_pages_writeback
> >>>> could not be waken. Wake_up trigger only one time by last bio's end_io
> >>>> function, if the thread happen to miss it, the thread will be in D state
> >>>> forever. So we change the code to make wait_on_all_pages_writeback awaken
> >>>> periodically, then check the condition.
> >>>
> >>> Got it.
> >>>
> >>> The patch can fix issue that checkpointer will wait forever in case of
> >>> write_end_io was failed to call wake_up for some reason.
> >
> > I found one possible case:
> >
> > CPU0:					CPU1:
> >   - write_checkpoint
> >    - do_checkpoint
> >     - wait_on_all_pages_writeback
> > 					 - f2fs_write_end_io
> > 					  - wake_up
> > 					this is last writebacked page, but
> > 					no sleeper in sbi->cp_wait wait
> > 					queue, wake_up is not been called.
> >      - prepare_to_wait(TASK_UNINTERRUPTIBLE)
> >      Here, current task is been preempted,
> >      but there will be no waker to wake up
> >      this task since last write_end_io
> >      has been called before. So current
> >      task will sleep forever.
> >      - io_schedule
> >
> > How do you think of it?
> Hi Chao,
> 
> Here, current task add itself into wait queue at first, and then check the
> condition whether write back page is zero. So, in the above situation,
> current task is been preempted in - prepare_to_wait(TASK_UNINTERRUPTIBLE),
> current task will not sleep for the write back page is zero.

Oh, I meant:

 - prepare_to_wait(TASK_UNINTERRUPTIBLE)
 - Preempt
 - if (!get_pages(sbi, F2FS_WRITEBACK)) break;    or maybe preempt here
 - io_schedule
 - finish_wait

Even there is no more writeback state page, also preemption can happen before
finish_wait, after that, once the task was been switched out, as it was set as
TASK_UNINTERRUPTIBLE, there is no chance to schedule it again.

Thanks,

> 
> Thanks,
> >
> > And if this is right, following patch can fix this issue.
> >
> > ---
> >   fs/f2fs/checkpoint.c | 14 +++++++++-----
> >   fs/f2fs/data.c       |  9 +++++++--
> >   fs/f2fs/f2fs.h       |  3 ++-
> >   fs/f2fs/super.c      |  1 +
> >   4 files changed, 19 insertions(+), 8 deletions(-)
> >
> > diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
> > index 9d277f8..9446c3d 100644
> > --- a/fs/f2fs/checkpoint.c
> > +++ b/fs/f2fs/checkpoint.c
> > @@ -914,15 +914,19 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
> >   {
> >   	DEFINE_WAIT(wait);
> >
> > -	for (;;) {
> > -		prepare_to_wait(&sbi->cp_wait, &wait, TASK_UNINTERRUPTIBLE);
> > +	spin_lock(&sbi->cp_wb_lock);
> >
> > -		if (!get_pages(sbi, F2FS_WRITEBACK))
> > -			break;
> > +	while (get_pages(sbi, F2FS_WRITEBACK)) {
> > +		prepare_to_wait(&sbi->cp_wait, &wait, TASK_UNINTERRUPTIBLE);
> >
> > +		spin_unlock(&sbi->cp_wb_lock);
> >   		io_schedule();
> > +		spin_lock(&sbi->cp_wb_lock);
> > +
> > +		finish_wait(&sbi->cp_wait, &wait);
> >   	}
> > -	finish_wait(&sbi->cp_wait, &wait);
> > +
> > +	spin_unlock(&sbi->cp_wb_lock);
> >   }
> >
> >   static int do_checkpoint(struct f2fs_sb_info *sbi, struct cp_control *cpc)
> > diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c
> > index e5c762b..e31deb97 100644
> > --- a/fs/f2fs/data.c
> > +++ b/fs/f2fs/data.c
> > @@ -59,6 +59,7 @@ static void f2fs_write_end_io(struct bio *bio)
> >   {
> >   	struct f2fs_sb_info *sbi = bio->bi_private;
> >   	struct bio_vec *bvec;
> > +	unsigned long flags;
> >   	int i;
> >
> >   	bio_for_each_segment_all(bvec, bio, i) {
> > @@ -74,8 +75,12 @@ static void f2fs_write_end_io(struct bio *bio)
> >   		dec_page_count(sbi, F2FS_WRITEBACK);
> >   	}
> >
> > -	if (!get_pages(sbi, F2FS_WRITEBACK) && wq_has_sleeper(&sbi->cp_wait))
> > -		wake_up(&sbi->cp_wait);
> > +	if (!get_pages(sbi, F2FS_WRITEBACK)) {
> > +		spin_lock_irqsave(&sbi->cp_wb_lock, flags);
> > +		if (wq_has_sleeper(&sbi->cp_wait))
> > +			wake_up(&sbi->cp_wait);
> > +		spin_unlock_irqrestore(&sbi->cp_wb_lock, flags);
> > +	}
> >
> >   	bio_put(bio);
> >   }
> > diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h
> > index 0d25430..fd47984 100644
> > --- a/fs/f2fs/f2fs.h
> > +++ b/fs/f2fs/f2fs.h
> > @@ -727,7 +727,8 @@ struct f2fs_sb_info {
> >   	struct rw_semaphore cp_rwsem;		/* blocking FS operations */
> >   	struct rw_semaphore node_write;		/* locking node writes */
> >   	struct mutex writepages;		/* mutex for writepages() */
> > -	wait_queue_head_t cp_wait;
> > +	wait_queue_head_t cp_wait;		/* for wait pages writeback */
> > +	spinlock_t cp_wb_lock;			/* for protect cp_wait */
> >   	unsigned long last_time[MAX_TIME];	/* to store time in jiffies */
> >   	long interval_time[MAX_TIME];		/* to store thresholds */
> >
> > diff --git a/fs/f2fs/super.c b/fs/f2fs/super.c
> > index 7b62016..5316c7a 100644
> > --- a/fs/f2fs/super.c
> > +++ b/fs/f2fs/super.c
> > @@ -1374,6 +1374,7 @@ try_onemore:
> >
> >   	init_rwsem(&sbi->cp_rwsem);
> >   	init_waitqueue_head(&sbi->cp_wait);
> > +	spin_lock_init(&sbi->cp_wb_lock);
> >   	init_sb_info(sbi);
> >
> >   	/* get an inode for meta space */
> >



------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-24  8:05             ` Chao Yu
@ 2016-02-24  9:45               ` hebiao (G)
  2016-02-25  9:32                 ` Chao Yu
  0 siblings, 1 reply; 16+ messages in thread
From: hebiao (G) @ 2016-02-24  9:45 UTC (permalink / raw)
  To: Chao Yu, heyunlei, jaegeuk@kernel.org,
	linux-f2fs-devel@lists.sourceforge.net

Hi, Chao,

> -----Original Message-----
> From: Chao Yu [mailto:chao2.yu@samsung.com]
> Sent: Wednesday, February 24, 2016 4:05 PM
> To: heyunlei; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> Cc: Wangbintian; hebiao (G)
> Subject: RE: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing
> wake_up
> 
> Hi Yunlei,
> 
> > -----Original Message-----
> > From: He YunLei [mailto:heyunlei@huawei.com]
> > Sent: Wednesday, February 24, 2016 3:32 PM
> > To: Chao Yu; jaegeuk@kernel.org;
> > linux-f2fs-devel@lists.sourceforge.net
> > Cc: bintian.wang@huawei.com; 'Biao He'
> > Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by
> > losing wake_up
> >
> > On 2016/2/24 11:46, Chao Yu wrote:
> > > Hi Yunlei,
> > >
> > >> -----Original Message-----
> > >> From: He YunLei [mailto:heyunlei@huawei.com]
> > >> Sent: Tuesday, February 23, 2016 7:36 PM
> > >> To: Chao Yu; jaegeuk@kernel.org;
> > >> linux-f2fs-devel@lists.sourceforge.net
> > >> Cc: bintian.wang@huawei.com; 'Biao He'
> > >> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused
> > >> by losing wake_up
> > >>
> > >> On 2016/2/23 17:15, Chao Yu wrote:
> > >> Hi Chao,
> > >>
> > >>> Hi Yunlei,
> > >>>
> > >>>> -----Original Message-----
> > >>>> From: He YunLei [mailto:heyunlei@huawei.com]
> > >>>> Sent: Tuesday, February 23, 2016 3:03 PM
> > >>>> To: Chao Yu; jaegeuk@kernel.org;
> > >>>> linux-f2fs-devel@lists.sourceforge.net
> > >>>> Cc: bintian.wang@huawei.com; 'Biao He'
> > >>>> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem
> > >>>> caused by losing wake_up
> > >>>>
> > >>>> On 2016/2/23 13:44, Chao Yu wrote:
> > >>>>> Hi Yunlei,
> > >>>> Hi Chao,
> > >>>>>
> > >>>>>> -----Original Message-----
> > >>>>>> From: Yunlei He [mailto:heyunlei@huawei.com]
> > >>>>>> Sent: Tuesday, February 23, 2016 12:08 PM
> > >>>>>> To: chao2.yu@samsung.com; jaegeuk@kernel.org;
> > >>>>>> linux-f2fs-devel@lists.sourceforge.net
> > >>>>>> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
> > >>>>>> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused
> > >>>>>> by losing wake_up
> > >>>>>>
> > >>>>>> The D state of wait_on_all_pages_writeback should be waken by
> > >>>>>> function f2fs_write_end_io when all writeback pages have been
> > >>>>>> succesfully written to device. It's possible that wake_up comes
> > >>>>>> between get_pages and io_schedule. Maybe in this case it will
> > >>>>>> lost wake_up and still in D state even if all pages have been
> > >>>>>> write back to device, and finally, the whole system will be
> > >>>>>> into the hungtask state.
> > >>>>>
> > >>>>> I haven't encountered such issue so far, do you suffer this in
> > >>>>> real world?
> > >>>>>
> > >>>> yes, I have encounter it, the whole file system is blocked at
> > >>>> function wait_on_all_pages_writeback beyond 120s when write cp,
> > >>>> and no error reported by storage device driver.
> > >>>
> > >>> Could this reproducible? If it could, could you please share the details.
> > >>> And did this occur in a huge size f2fs image?
> > >>>
> > >>>>>>
> > >>>>>>                    if (!get_pages(sbi, F2FS_WRITEBACK))
> > >>>>>>                             break;
> > >>>>>> 					<---------  wake_up
> > >>>>>
> > >>>>> wake_up will put all tasks linked in sbi->cp_wait on run-queue,
> > >>>>> so here it should be save to call io_schedule, after being
> > >>>>> rescheduled, it will get the chance to check above condition to break
> out.
> > >>>>>
> > >>>>> Thanks,
> > >>>>
> > >>>> Here, we just doubt something weird may cause
> > >>>> wait_on_all_pages_writeback could not be waken. Wake_up trigger
> > >>>> only one time by last bio's end_io function, if the thread happen
> > >>>> to miss it, the thread will be in D state forever. So we change
> > >>>> the code to make wait_on_all_pages_writeback awaken periodically,
> then check the condition.
> > >>>
> > >>> Got it.
> > >>>
> > >>> The patch can fix issue that checkpointer will wait forever in
> > >>> case of write_end_io was failed to call wake_up for some reason.
> > >
> > > I found one possible case:
> > >
> > > CPU0:					CPU1:
> > >   - write_checkpoint
> > >    - do_checkpoint
> > >     - wait_on_all_pages_writeback
> > > 					 - f2fs_write_end_io
> > > 					  - wake_up
> > > 					this is last writebacked page, but
> > > 					no sleeper in sbi->cp_wait wait
> > > 					queue, wake_up is not been called.
> > >      - prepare_to_wait(TASK_UNINTERRUPTIBLE)
> > >      Here, current task is been preempted,
> > >      but there will be no waker to wake up
> > >      this task since last write_end_io
> > >      has been called before. So current
> > >      task will sleep forever.
> > >      - io_schedule
> > >
> > > How do you think of it?
> > Hi Chao,
> >
> > Here, current task add itself into wait queue at first, and then check
> > the condition whether write back page is zero. So, in the above
> > situation, current task is been preempted in -
> > prepare_to_wait(TASK_UNINTERRUPTIBLE),
> > current task will not sleep for the write back page is zero.
> 
> Oh, I meant:
> 
>  - prepare_to_wait(TASK_UNINTERRUPTIBLE)
>  - Preempt
>  - if (!get_pages(sbi, F2FS_WRITEBACK)) break;    or maybe preempt here
>  - io_schedule
>  - finish_wait
> 
> Even there is no more writeback state page, also preemption can happen
> before finish_wait, after that, once the task was been switched out, as it was
> set as TASK_UNINTERRUPTIBLE, there is no chance to schedule it again.
> 
> Thanks,
> 
First, according to our hungtask stack, we can confirm we are in io_schedule.

Secondly, from my point of view, it is safe to preempt regardless of the task's state. It will 
eventually reschedule. You can see kernel\sched\core.c :: __schedule. If PREEMPT_ACTIVE 
is set, the current task will be put back to the run queue.

> >
> > Thanks,
> > >
> > > And if this is right, following patch can fix this issue.
> > >
> > > ---
> > >   fs/f2fs/checkpoint.c | 14 +++++++++-----
> > >   fs/f2fs/data.c       |  9 +++++++--
> > >   fs/f2fs/f2fs.h       |  3 ++-
> > >   fs/f2fs/super.c      |  1 +
> > >   4 files changed, 19 insertions(+), 8 deletions(-)
> > >
> > > diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c index
> > > 9d277f8..9446c3d 100644
> > > --- a/fs/f2fs/checkpoint.c
> > > +++ b/fs/f2fs/checkpoint.c
> > > @@ -914,15 +914,19 @@ static void wait_on_all_pages_writeback(struct
> f2fs_sb_info *sbi)
> > >   {
> > >   	DEFINE_WAIT(wait);
> > >
> > > -	for (;;) {
> > > -		prepare_to_wait(&sbi->cp_wait, &wait, TASK_UNINTERRUPTIBLE);
> > > +	spin_lock(&sbi->cp_wb_lock);
> > >
> > > -		if (!get_pages(sbi, F2FS_WRITEBACK))
> > > -			break;
> > > +	while (get_pages(sbi, F2FS_WRITEBACK)) {
> > > +		prepare_to_wait(&sbi->cp_wait, &wait,
> TASK_UNINTERRUPTIBLE);
> > >
> > > +		spin_unlock(&sbi->cp_wb_lock);
> > >   		io_schedule();
> > > +		spin_lock(&sbi->cp_wb_lock);
> > > +
> > > +		finish_wait(&sbi->cp_wait, &wait);
> > >   	}
> > > -	finish_wait(&sbi->cp_wait, &wait);
> > > +
> > > +	spin_unlock(&sbi->cp_wb_lock);
> > >   }
> > >
> > >   static int do_checkpoint(struct f2fs_sb_info *sbi, struct
> > > cp_control *cpc) diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c index
> > > e5c762b..e31deb97 100644
> > > --- a/fs/f2fs/data.c
> > > +++ b/fs/f2fs/data.c
> > > @@ -59,6 +59,7 @@ static void f2fs_write_end_io(struct bio *bio)
> > >   {
> > >   	struct f2fs_sb_info *sbi = bio->bi_private;
> > >   	struct bio_vec *bvec;
> > > +	unsigned long flags;
> > >   	int i;
> > >
> > >   	bio_for_each_segment_all(bvec, bio, i) { @@ -74,8 +75,12 @@
> > > static void f2fs_write_end_io(struct bio *bio)
> > >   		dec_page_count(sbi, F2FS_WRITEBACK);
> > >   	}
> > >
> > > -	if (!get_pages(sbi, F2FS_WRITEBACK) && wq_has_sleeper(&sbi->cp_wait))
> > > -		wake_up(&sbi->cp_wait);
> > > +	if (!get_pages(sbi, F2FS_WRITEBACK)) {
> > > +		spin_lock_irqsave(&sbi->cp_wb_lock, flags);
> > > +		if (wq_has_sleeper(&sbi->cp_wait))
> > > +			wake_up(&sbi->cp_wait);
> > > +		spin_unlock_irqrestore(&sbi->cp_wb_lock, flags);
> > > +	}
> > >
> > >   	bio_put(bio);
> > >   }
> > > diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h index 0d25430..fd47984
> > > 100644
> > > --- a/fs/f2fs/f2fs.h
> > > +++ b/fs/f2fs/f2fs.h
> > > @@ -727,7 +727,8 @@ struct f2fs_sb_info {
> > >   	struct rw_semaphore cp_rwsem;		/* blocking FS operations */
> > >   	struct rw_semaphore node_write;		/* locking node writes */
> > >   	struct mutex writepages;		/* mutex for writepages() */
> > > -	wait_queue_head_t cp_wait;
> > > +	wait_queue_head_t cp_wait;		/* for wait pages writeback */
> > > +	spinlock_t cp_wb_lock;			/* for protect cp_wait */
> > >   	unsigned long last_time[MAX_TIME];	/* to store time in jiffies */
> > >   	long interval_time[MAX_TIME];		/* to store thresholds */
> > >
> > > diff --git a/fs/f2fs/super.c b/fs/f2fs/super.c index
> > > 7b62016..5316c7a 100644
> > > --- a/fs/f2fs/super.c
> > > +++ b/fs/f2fs/super.c
> > > @@ -1374,6 +1374,7 @@ try_onemore:
> > >
> > >   	init_rwsem(&sbi->cp_rwsem);
> > >   	init_waitqueue_head(&sbi->cp_wait);
> > > +	spin_lock_init(&sbi->cp_wb_lock);
> > >   	init_sb_info(sbi);
> > >
> > >   	/* get an inode for meta space */
> > >
> 


------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-24  9:45               ` hebiao (G)
@ 2016-02-25  9:32                 ` Chao Yu
  0 siblings, 0 replies; 16+ messages in thread
From: Chao Yu @ 2016-02-25  9:32 UTC (permalink / raw)
  To: 'hebiao (G)', 'heyunlei', jaegeuk,
	linux-f2fs-devel

Hi Hebiao,

> -----Original Message-----
> From: hebiao (G) [mailto:hebiao6@huawei.com]
> Sent: Wednesday, February 24, 2016 5:46 PM
> To: Chao Yu; heyunlei; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> Cc: Wangbintian
> Subject: RE: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> 
> Hi, Chao,
> 
> > -----Original Message-----
> > From: Chao Yu [mailto:chao2.yu@samsung.com]
> > Sent: Wednesday, February 24, 2016 4:05 PM
> > To: heyunlei; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> > Cc: Wangbintian; hebiao (G)
> > Subject: RE: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing
> > wake_up
> >
> > Hi Yunlei,
> >
> > > -----Original Message-----
> > > From: He YunLei [mailto:heyunlei@huawei.com]
> > > Sent: Wednesday, February 24, 2016 3:32 PM
> > > To: Chao Yu; jaegeuk@kernel.org;
> > > linux-f2fs-devel@lists.sourceforge.net
> > > Cc: bintian.wang@huawei.com; 'Biao He'
> > > Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by
> > > losing wake_up
> > >
> > > On 2016/2/24 11:46, Chao Yu wrote:
> > > > Hi Yunlei,
> > > >
> > > >> -----Original Message-----
> > > >> From: He YunLei [mailto:heyunlei@huawei.com]
> > > >> Sent: Tuesday, February 23, 2016 7:36 PM
> > > >> To: Chao Yu; jaegeuk@kernel.org;
> > > >> linux-f2fs-devel@lists.sourceforge.net
> > > >> Cc: bintian.wang@huawei.com; 'Biao He'
> > > >> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused
> > > >> by losing wake_up
> > > >>
> > > >> On 2016/2/23 17:15, Chao Yu wrote:
> > > >> Hi Chao,
> > > >>
> > > >>> Hi Yunlei,
> > > >>>
> > > >>>> -----Original Message-----
> > > >>>> From: He YunLei [mailto:heyunlei@huawei.com]
> > > >>>> Sent: Tuesday, February 23, 2016 3:03 PM
> > > >>>> To: Chao Yu; jaegeuk@kernel.org;
> > > >>>> linux-f2fs-devel@lists.sourceforge.net
> > > >>>> Cc: bintian.wang@huawei.com; 'Biao He'
> > > >>>> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem
> > > >>>> caused by losing wake_up
> > > >>>>
> > > >>>> On 2016/2/23 13:44, Chao Yu wrote:
> > > >>>>> Hi Yunlei,
> > > >>>> Hi Chao,
> > > >>>>>
> > > >>>>>> -----Original Message-----
> > > >>>>>> From: Yunlei He [mailto:heyunlei@huawei.com]
> > > >>>>>> Sent: Tuesday, February 23, 2016 12:08 PM
> > > >>>>>> To: chao2.yu@samsung.com; jaegeuk@kernel.org;
> > > >>>>>> linux-f2fs-devel@lists.sourceforge.net
> > > >>>>>> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
> > > >>>>>> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused
> > > >>>>>> by losing wake_up
> > > >>>>>>
> > > >>>>>> The D state of wait_on_all_pages_writeback should be waken by
> > > >>>>>> function f2fs_write_end_io when all writeback pages have been
> > > >>>>>> succesfully written to device. It's possible that wake_up comes
> > > >>>>>> between get_pages and io_schedule. Maybe in this case it will
> > > >>>>>> lost wake_up and still in D state even if all pages have been
> > > >>>>>> write back to device, and finally, the whole system will be
> > > >>>>>> into the hungtask state.
> > > >>>>>
> > > >>>>> I haven't encountered such issue so far, do you suffer this in
> > > >>>>> real world?
> > > >>>>>
> > > >>>> yes, I have encounter it, the whole file system is blocked at
> > > >>>> function wait_on_all_pages_writeback beyond 120s when write cp,
> > > >>>> and no error reported by storage device driver.
> > > >>>
> > > >>> Could this reproducible? If it could, could you please share the details.
> > > >>> And did this occur in a huge size f2fs image?
> > > >>>
> > > >>>>>>
> > > >>>>>>                    if (!get_pages(sbi, F2FS_WRITEBACK))
> > > >>>>>>                             break;
> > > >>>>>> 					<---------  wake_up
> > > >>>>>
> > > >>>>> wake_up will put all tasks linked in sbi->cp_wait on run-queue,
> > > >>>>> so here it should be save to call io_schedule, after being
> > > >>>>> rescheduled, it will get the chance to check above condition to break
> > out.
> > > >>>>>
> > > >>>>> Thanks,
> > > >>>>
> > > >>>> Here, we just doubt something weird may cause
> > > >>>> wait_on_all_pages_writeback could not be waken. Wake_up trigger
> > > >>>> only one time by last bio's end_io function, if the thread happen
> > > >>>> to miss it, the thread will be in D state forever. So we change
> > > >>>> the code to make wait_on_all_pages_writeback awaken periodically,
> > then check the condition.
> > > >>>
> > > >>> Got it.
> > > >>>
> > > >>> The patch can fix issue that checkpointer will wait forever in
> > > >>> case of write_end_io was failed to call wake_up for some reason.
> > > >
> > > > I found one possible case:
> > > >
> > > > CPU0:					CPU1:
> > > >   - write_checkpoint
> > > >    - do_checkpoint
> > > >     - wait_on_all_pages_writeback
> > > > 					 - f2fs_write_end_io
> > > > 					  - wake_up
> > > > 					this is last writebacked page, but
> > > > 					no sleeper in sbi->cp_wait wait
> > > > 					queue, wake_up is not been called.
> > > >      - prepare_to_wait(TASK_UNINTERRUPTIBLE)
> > > >      Here, current task is been preempted,
> > > >      but there will be no waker to wake up
> > > >      this task since last write_end_io
> > > >      has been called before. So current
> > > >      task will sleep forever.
> > > >      - io_schedule
> > > >
> > > > How do you think of it?
> > > Hi Chao,
> > >
> > > Here, current task add itself into wait queue at first, and then check
> > > the condition whether write back page is zero. So, in the above
> > > situation, current task is been preempted in -
> > > prepare_to_wait(TASK_UNINTERRUPTIBLE),
> > > current task will not sleep for the write back page is zero.
> >
> > Oh, I meant:
> >
> >  - prepare_to_wait(TASK_UNINTERRUPTIBLE)
> >  - Preempt
> >  - if (!get_pages(sbi, F2FS_WRITEBACK)) break;    or maybe preempt here
> >  - io_schedule
> >  - finish_wait
> >
> > Even there is no more writeback state page, also preemption can happen
> > before finish_wait, after that, once the task was been switched out, as it was
> > set as TASK_UNINTERRUPTIBLE, there is no chance to schedule it again.
> >
> > Thanks,
> >
> First, according to our hungtask stack, we can confirm we are in io_schedule.

Oh, IMO, it can prove that we actually arrive there, not being preempted, thanks
for your information.

> 
> Secondly, from my point of view, it is safe to preempt regardless of the task's state. It will
> eventually reschedule. You can see kernel\sched\core.c :: __schedule. If PREEMPT_ACTIVE
> is set, the current task will be put back to the run queue.

OK, I will have a look at it.

Thanks,

> 
> > >
> > > Thanks,
> > > >
> > > > And if this is right, following patch can fix this issue.
> > > >
> > > > ---
> > > >   fs/f2fs/checkpoint.c | 14 +++++++++-----
> > > >   fs/f2fs/data.c       |  9 +++++++--
> > > >   fs/f2fs/f2fs.h       |  3 ++-
> > > >   fs/f2fs/super.c      |  1 +
> > > >   4 files changed, 19 insertions(+), 8 deletions(-)
> > > >
> > > > diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c index
> > > > 9d277f8..9446c3d 100644
> > > > --- a/fs/f2fs/checkpoint.c
> > > > +++ b/fs/f2fs/checkpoint.c
> > > > @@ -914,15 +914,19 @@ static void wait_on_all_pages_writeback(struct
> > f2fs_sb_info *sbi)
> > > >   {
> > > >   	DEFINE_WAIT(wait);
> > > >
> > > > -	for (;;) {
> > > > -		prepare_to_wait(&sbi->cp_wait, &wait, TASK_UNINTERRUPTIBLE);
> > > > +	spin_lock(&sbi->cp_wb_lock);
> > > >
> > > > -		if (!get_pages(sbi, F2FS_WRITEBACK))
> > > > -			break;
> > > > +	while (get_pages(sbi, F2FS_WRITEBACK)) {
> > > > +		prepare_to_wait(&sbi->cp_wait, &wait,
> > TASK_UNINTERRUPTIBLE);
> > > >
> > > > +		spin_unlock(&sbi->cp_wb_lock);
> > > >   		io_schedule();
> > > > +		spin_lock(&sbi->cp_wb_lock);
> > > > +
> > > > +		finish_wait(&sbi->cp_wait, &wait);
> > > >   	}
> > > > -	finish_wait(&sbi->cp_wait, &wait);
> > > > +
> > > > +	spin_unlock(&sbi->cp_wb_lock);
> > > >   }
> > > >
> > > >   static int do_checkpoint(struct f2fs_sb_info *sbi, struct
> > > > cp_control *cpc) diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c index
> > > > e5c762b..e31deb97 100644
> > > > --- a/fs/f2fs/data.c
> > > > +++ b/fs/f2fs/data.c
> > > > @@ -59,6 +59,7 @@ static void f2fs_write_end_io(struct bio *bio)
> > > >   {
> > > >   	struct f2fs_sb_info *sbi = bio->bi_private;
> > > >   	struct bio_vec *bvec;
> > > > +	unsigned long flags;
> > > >   	int i;
> > > >
> > > >   	bio_for_each_segment_all(bvec, bio, i) { @@ -74,8 +75,12 @@
> > > > static void f2fs_write_end_io(struct bio *bio)
> > > >   		dec_page_count(sbi, F2FS_WRITEBACK);
> > > >   	}
> > > >
> > > > -	if (!get_pages(sbi, F2FS_WRITEBACK) && wq_has_sleeper(&sbi->cp_wait))
> > > > -		wake_up(&sbi->cp_wait);
> > > > +	if (!get_pages(sbi, F2FS_WRITEBACK)) {
> > > > +		spin_lock_irqsave(&sbi->cp_wb_lock, flags);
> > > > +		if (wq_has_sleeper(&sbi->cp_wait))
> > > > +			wake_up(&sbi->cp_wait);
> > > > +		spin_unlock_irqrestore(&sbi->cp_wb_lock, flags);
> > > > +	}
> > > >
> > > >   	bio_put(bio);
> > > >   }
> > > > diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h index 0d25430..fd47984
> > > > 100644
> > > > --- a/fs/f2fs/f2fs.h
> > > > +++ b/fs/f2fs/f2fs.h
> > > > @@ -727,7 +727,8 @@ struct f2fs_sb_info {
> > > >   	struct rw_semaphore cp_rwsem;		/* blocking FS operations */
> > > >   	struct rw_semaphore node_write;		/* locking node writes */
> > > >   	struct mutex writepages;		/* mutex for writepages() */
> > > > -	wait_queue_head_t cp_wait;
> > > > +	wait_queue_head_t cp_wait;		/* for wait pages writeback */
> > > > +	spinlock_t cp_wb_lock;			/* for protect cp_wait */
> > > >   	unsigned long last_time[MAX_TIME];	/* to store time in jiffies */
> > > >   	long interval_time[MAX_TIME];		/* to store thresholds */
> > > >
> > > > diff --git a/fs/f2fs/super.c b/fs/f2fs/super.c index
> > > > 7b62016..5316c7a 100644
> > > > --- a/fs/f2fs/super.c
> > > > +++ b/fs/f2fs/super.c
> > > > @@ -1374,6 +1374,7 @@ try_onemore:
> > > >
> > > >   	init_rwsem(&sbi->cp_rwsem);
> > > >   	init_waitqueue_head(&sbi->cp_wait);
> > > > +	spin_lock_init(&sbi->cp_wb_lock);
> > > >   	init_sb_info(sbi);
> > > >
> > > >   	/* get an inode for meta space */
> > > >
> >



------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-24  3:46         ` Chao Yu
  2016-02-24  7:32           ` He YunLei
@ 2016-02-25  7:36           ` He YunLei
  2016-02-25  9:41             ` Chao Yu
  1 sibling, 1 reply; 16+ messages in thread
From: He YunLei @ 2016-02-25  7:36 UTC (permalink / raw)
  To: Chao Yu, jaegeuk, linux-f2fs-devel; +Cc: 'Biao He'

On 2016/2/24 11:46, Chao Yu wrote:
>>> > >
>>> > >But I doubt more that the reason we are stuck is there are remained pages
>>> > >cached in bio buffer without being submitted. To make sure, maybe in
>>> > >wait_on_all_pages_writeback we could add print info to see whether
>>> > >sbi->write_io[].bio is valid or not.
>>> > >
>> >We use tool dump f2fs_sb_info information and find that:
>> >
>> >	write_io[DATA].bio = 0;
>> >	write_io[NODE].bio = 0;
>> >	write_io[META].bio = 0;
>> >
>> >	nr_pages[F2FS_WRITEBACK] = 0;
>> >	nr_pages[F2FS_DIRTY_DENTS] = 0;
>> >	nr_pages[F2FS_DIRTY_NODES] = 13;
> Weird, dirty nodes count should be 0.
>
> Thanks
>
Hi Chao,

In our code,

1524 static int f2fs_write_end(struct file *file,
1525                         struct address_space *mapping,
1526                         loff_t pos, unsigned len, unsigned copied,
1527                         struct page *page, void *fsdata)
1528 {
1529         struct inode *inode = page->mapping->host;
1530
1531         trace_f2fs_write_end(inode, pos, len, copied);
1532
1533         set_page_dirty(page);
1534
1535         if (pos + copied > i_size_read(inode)) {
1536                 i_size_write(inode, pos + copied);
1537                 mark_inode_dirty(inode);
1538                 update_inode_page(inode);
1539         }
1540
1541         f2fs_put_page(page, 1);
1542         return copied;
1543 }

Here update_inode_page(inode) has been removed by Kim, maybe here could
result dirty node page is not zero.

Thanks,

>> >	nr_pages[F2FS_DIRTY_META] = 0;
>> >	nr_pages[F2FS_INMEM_PAGES] = 0;
>> >
>> >So we believe that the block device is ok!
>> >
>> >Thanks,


------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-25  7:36           ` He YunLei
@ 2016-02-25  9:41             ` Chao Yu
  2016-02-25 19:03               ` Jaegeuk Kim
  0 siblings, 1 reply; 16+ messages in thread
From: Chao Yu @ 2016-02-25  9:41 UTC (permalink / raw)
  To: 'He YunLei', jaegeuk, linux-f2fs-devel; +Cc: 'Biao He'

Hi Yunlei,

> -----Original Message-----
> From: He YunLei [mailto:heyunlei@huawei.com]
> Sent: Thursday, February 25, 2016 3:36 PM
> To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> Cc: bintian.wang@huawei.com; 'Biao He'
> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> 
> On 2016/2/24 11:46, Chao Yu wrote:
> >>> > >
> >>> > >But I doubt more that the reason we are stuck is there are remained pages
> >>> > >cached in bio buffer without being submitted. To make sure, maybe in
> >>> > >wait_on_all_pages_writeback we could add print info to see whether
> >>> > >sbi->write_io[].bio is valid or not.
> >>> > >
> >> >We use tool dump f2fs_sb_info information and find that:
> >> >
> >> >	write_io[DATA].bio = 0;
> >> >	write_io[NODE].bio = 0;
> >> >	write_io[META].bio = 0;
> >> >
> >> >	nr_pages[F2FS_WRITEBACK] = 0;
> >> >	nr_pages[F2FS_DIRTY_DENTS] = 0;
> >> >	nr_pages[F2FS_DIRTY_NODES] = 13;
> > Weird, dirty nodes count should be 0.
> >
> > Thanks
> >
> Hi Chao,
> 
> In our code,
> 
> 1524 static int f2fs_write_end(struct file *file,
> 1525                         struct address_space *mapping,
> 1526                         loff_t pos, unsigned len, unsigned copied,
> 1527                         struct page *page, void *fsdata)
> 1528 {
> 1529         struct inode *inode = page->mapping->host;
> 1530
> 1531         trace_f2fs_write_end(inode, pos, len, copied);
> 1532
> 1533         set_page_dirty(page);
> 1534
> 1535         if (pos + copied > i_size_read(inode)) {
> 1536                 i_size_write(inode, pos + copied);
> 1537                 mark_inode_dirty(inode);
> 1538                 update_inode_page(inode);
> 1539         }
> 1540
> 1541         f2fs_put_page(page, 1);
> 1542         return copied;
> 1543 }
> 
> Here update_inode_page(inode) has been removed by Kim, maybe here could
> result dirty node page is not zero.

As your hint, I found an issue in f2fs_write_inode which may be related to
our problem. I wrote a patch for fixing.

Thanks,

> 
> Thanks,
> 
> >> >	nr_pages[F2FS_DIRTY_META] = 0;
> >> >	nr_pages[F2FS_INMEM_PAGES] = 0;
> >> >
> >> >So we believe that the block device is ok!
> >> >
> >> >Thanks,



------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-25  9:41             ` Chao Yu
@ 2016-02-25 19:03               ` Jaegeuk Kim
  2016-02-26  1:15                 ` Chao Yu
  0 siblings, 1 reply; 16+ messages in thread
From: Jaegeuk Kim @ 2016-02-25 19:03 UTC (permalink / raw)
  To: Chao Yu; +Cc: 'Biao He', linux-f2fs-devel

Hi all,

On Thu, Feb 25, 2016 at 05:41:26PM +0800, Chao Yu wrote:
> Hi Yunlei,
> 
> > -----Original Message-----
> > From: He YunLei [mailto:heyunlei@huawei.com]
> > Sent: Thursday, February 25, 2016 3:36 PM
> > To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> > Cc: bintian.wang@huawei.com; 'Biao He'
> > Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> > 
> > On 2016/2/24 11:46, Chao Yu wrote:
> > >>> > >
> > >>> > >But I doubt more that the reason we are stuck is there are remained pages
> > >>> > >cached in bio buffer without being submitted. To make sure, maybe in
> > >>> > >wait_on_all_pages_writeback we could add print info to see whether
> > >>> > >sbi->write_io[].bio is valid or not.
> > >>> > >
> > >> >We use tool dump f2fs_sb_info information and find that:
> > >> >
> > >> >	write_io[DATA].bio = 0;
> > >> >	write_io[NODE].bio = 0;
> > >> >	write_io[META].bio = 0;
> > >> >
> > >> >	nr_pages[F2FS_WRITEBACK] = 0;
> > >> >	nr_pages[F2FS_DIRTY_DENTS] = 0;
> > >> >	nr_pages[F2FS_DIRTY_NODES] = 13;
> > > Weird, dirty nodes count should be 0.
> > >
> > > Thanks
> > >
> > Hi Chao,
> > 
> > In our code,
> > 
> > 1524 static int f2fs_write_end(struct file *file,
> > 1525                         struct address_space *mapping,
> > 1526                         loff_t pos, unsigned len, unsigned copied,
> > 1527                         struct page *page, void *fsdata)
> > 1528 {
> > 1529         struct inode *inode = page->mapping->host;
> > 1530
> > 1531         trace_f2fs_write_end(inode, pos, len, copied);
> > 1532
> > 1533         set_page_dirty(page);
> > 1534
> > 1535         if (pos + copied > i_size_read(inode)) {
> > 1536                 i_size_write(inode, pos + copied);
> > 1537                 mark_inode_dirty(inode);
> > 1538                 update_inode_page(inode);
> > 1539         }
> > 1540
> > 1541         f2fs_put_page(page, 1);
> > 1542         return copied;
> > 1543 }
> > 
> > Here update_inode_page(inode) has been removed by Kim, maybe here could
> > result dirty node page is not zero.
> 
> As your hint, I found an issue in f2fs_write_inode which may be related to
> our problem. I wrote a patch for fixing.

I just remember out that this has no problem, since we only need to disallow
writing node pages during checkpoint.

IOW, write_inode or any other functions can update node pages during checkpoint,
and then f2fs will flush them after checkpoint.

Thanks,

> 
> Thanks,
> 
> > 
> > Thanks,
> > 
> > >> >	nr_pages[F2FS_DIRTY_META] = 0;
> > >> >	nr_pages[F2FS_INMEM_PAGES] = 0;
> > >> >
> > >> >So we believe that the block device is ok!
> > >> >
> > >> >Thanks,
> 

------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-25 19:03               ` Jaegeuk Kim
@ 2016-02-26  1:15                 ` Chao Yu
  0 siblings, 0 replies; 16+ messages in thread
From: Chao Yu @ 2016-02-26  1:15 UTC (permalink / raw)
  To: 'Jaegeuk Kim'; +Cc: 'Biao He', linux-f2fs-devel

Hi Jaegeuk,

> -----Original Message-----
> From: Jaegeuk Kim [mailto:jaegeuk@kernel.org]
> Sent: Friday, February 26, 2016 3:03 AM
> To: Chao Yu
> Cc: 'He YunLei'; linux-f2fs-devel@lists.sourceforge.net; bintian.wang@huawei.com; 'Biao He'
> Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> 
> Hi all,
> 
> On Thu, Feb 25, 2016 at 05:41:26PM +0800, Chao Yu wrote:
> > Hi Yunlei,
> >
> > > -----Original Message-----
> > > From: He YunLei [mailto:heyunlei@huawei.com]
> > > Sent: Thursday, February 25, 2016 3:36 PM
> > > To: Chao Yu; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
> > > Cc: bintian.wang@huawei.com; 'Biao He'
> > > Subject: Re: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
> > >
> > > On 2016/2/24 11:46, Chao Yu wrote:
> > > >>> > >
> > > >>> > >But I doubt more that the reason we are stuck is there are remained pages
> > > >>> > >cached in bio buffer without being submitted. To make sure, maybe in
> > > >>> > >wait_on_all_pages_writeback we could add print info to see whether
> > > >>> > >sbi->write_io[].bio is valid or not.
> > > >>> > >
> > > >> >We use tool dump f2fs_sb_info information and find that:
> > > >> >
> > > >> >	write_io[DATA].bio = 0;
> > > >> >	write_io[NODE].bio = 0;
> > > >> >	write_io[META].bio = 0;
> > > >> >
> > > >> >	nr_pages[F2FS_WRITEBACK] = 0;
> > > >> >	nr_pages[F2FS_DIRTY_DENTS] = 0;
> > > >> >	nr_pages[F2FS_DIRTY_NODES] = 13;
> > > > Weird, dirty nodes count should be 0.
> > > >
> > > > Thanks
> > > >
> > > Hi Chao,
> > >
> > > In our code,
> > >
> > > 1524 static int f2fs_write_end(struct file *file,
> > > 1525                         struct address_space *mapping,
> > > 1526                         loff_t pos, unsigned len, unsigned copied,
> > > 1527                         struct page *page, void *fsdata)
> > > 1528 {
> > > 1529         struct inode *inode = page->mapping->host;
> > > 1530
> > > 1531         trace_f2fs_write_end(inode, pos, len, copied);
> > > 1532
> > > 1533         set_page_dirty(page);
> > > 1534
> > > 1535         if (pos + copied > i_size_read(inode)) {
> > > 1536                 i_size_write(inode, pos + copied);
> > > 1537                 mark_inode_dirty(inode);
> > > 1538                 update_inode_page(inode);
> > > 1539         }
> > > 1540
> > > 1541         f2fs_put_page(page, 1);
> > > 1542         return copied;
> > > 1543 }
> > >
> > > Here update_inode_page(inode) has been removed by Kim, maybe here could
> > > result dirty node page is not zero.
> >
> > As your hint, I found an issue in f2fs_write_inode which may be related to
> > our problem. I wrote a patch for fixing.
> 
> I just remember out that this has no problem, since we only need to disallow
> writing node pages during checkpoint.
> 
> IOW, write_inode or any other functions can update node pages during checkpoint,
> and then f2fs will flush them after checkpoint.

Sounds reasonable to me. :)

Thanks,

> 
> Thanks,
> 
> >
> > Thanks,
> >
> > >
> > > Thanks,
> > >
> > > >> >	nr_pages[F2FS_DIRTY_META] = 0;
> > > >> >	nr_pages[F2FS_INMEM_PAGES] = 0;
> > > >> >
> > > >> >So we believe that the block device is ok!
> > > >> >
> > > >> >Thanks,
> >


------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-23  7:02   ` He YunLei
  2016-02-23  9:15     ` Chao Yu
@ 2016-02-23  9:32     ` Shawn Lin
  2016-02-23 11:45       ` He YunLei
  1 sibling, 1 reply; 16+ messages in thread
From: Shawn Lin @ 2016-02-23  9:32 UTC (permalink / raw)
  To: He YunLei, Chao Yu, jaegeuk, linux-f2fs-devel
  Cc: 'Biao He', shawn.lin

On 2016/2/23 15:02, He YunLei wrote:
> On 2016/2/23 13:44, Chao Yu wrote:
>> Hi Yunlei,
> Hi Chao,
>>
>>> -----Original Message-----
>>> From: Yunlei He [mailto:heyunlei@huawei.com]
>>> Sent: Tuesday, February 23, 2016 12:08 PM
>>> To: chao2.yu@samsung.com; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
>>> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
>>> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
>>>
>>> The D state of wait_on_all_pages_writeback should be waken by
>>> function f2fs_write_end_io when all writeback pages have been
>>> succesfully written to device. It's possible that wake_up comes
>>> between get_pages and io_schedule. Maybe in this case it will
>>> lost wake_up and still in D state even if all pages have been
>>> write back to device, and finally, the whole system will be into
>>> the hungtask state.
>>
>> I haven't encountered such issue so far, do you suffer this in real
>> world?
>>
> yes, I have encounter it, the whole file system is blocked at function
> wait_on_all_pages_writeback beyond 120s when write cp, and no error reported
> by storage device driver.

Hi YunLei

Interesting...
How to prove the page isn't been locked by storage block layer which
doesn't finish the related bio?

I suffer that case before, but then I solve it by seting a timer to
break my mmc driver while missing some important interrupts(chip
problem) for data-transfer to finish the io queue by mmcqd.

Another thing,
why making 5HZ for timeout? why not 10HZ ,15HZ...
io_schedule_timeout(5*HZ);

>>>
>>>                   if (!get_pages(sbi, F2FS_WRITEBACK))
>>>                            break;
>>> 					<---------  wake_up
>>
>> wake_up will put all tasks linked in sbi->cp_wait on run-queue, so
>> here it should be save to call io_schedule, after being rescheduled,
>> it will get the chance to check above condition to break out.
>>
>> Thanks,
>
> Here, we just doubt something weird may cause wait_on_all_pages_writeback
> could not be waken. Wake_up trigger only one time by last bio's end_io
> function, if the thread happen to miss it, the thread will be in D state
> forever. So we change the code to make wait_on_all_pages_writeback awaken
> periodically, then check the condition.
>
>>
>>>                   io_schedule();
>>>
>>> Signed-off-by: Yunlei He <heyunlei@huawei.com>
>>> Signed-off-by: Biao He <hebiao6@huawei.com>
>>> ---
>>>    fs/f2fs/checkpoint.c | 2 +-
>>>    1 file changed, 1 insertion(+), 1 deletion(-)
>>>
>>> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
>>> index 2bac8a1..f55355d 100644
>>> --- a/fs/f2fs/checkpoint.c
>>> +++ b/fs/f2fs/checkpoint.c
>>> @@ -920,7 +920,7 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
>>>    		if (!get_pages(sbi, F2FS_WRITEBACK))
>>>    			break;
>>>
>>> -		io_schedule();
>>> +		io_schedule_timeout(5*HZ);
>>>    	}
>>>    	finish_wait(&sbi->cp_wait, &wait);
>>>    }
>>> --
>>> 1.9.1
>>
>>
>>
>> .
>>
>
>
> ------------------------------------------------------------------------------
> Site24x7 APM Insight: Get Deep Visibility into Application Performance
> APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
> Monitor end-to-end web transactions and take corrective actions now
> Troubleshoot faster and improve end-user experience. Signup Now!
> http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140
> _______________________________________________
> Linux-f2fs-devel mailing list
> Linux-f2fs-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel
>


------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
  2016-02-23  9:32     ` Shawn Lin
@ 2016-02-23 11:45       ` He YunLei
  0 siblings, 0 replies; 16+ messages in thread
From: He YunLei @ 2016-02-23 11:45 UTC (permalink / raw)
  To: Shawn Lin, Chao Yu, jaegeuk, linux-f2fs-devel
  Cc: 'Biao He', shawn.lin

On 2016/2/23 17:32, Shawn Lin wrote:
> On 2016/2/23 15:02, He YunLei wrote:
>> On 2016/2/23 13:44, Chao Yu wrote:
>>> Hi Yunlei,
>> Hi Chao,
>>>
>>>> -----Original Message-----
>>>> From: Yunlei He [mailto:heyunlei@huawei.com]
>>>> Sent: Tuesday, February 23, 2016 12:08 PM
>>>> To: chao2.yu@samsung.com; jaegeuk@kernel.org; linux-f2fs-devel@lists.sourceforge.net
>>>> Cc: bintian.wang@huawei.com; Yunlei He; Biao He
>>>> Subject: [f2fs-dev] [PATCH] f2fs: avoid hungtask problem caused by losing wake_up
>>>>
>>>> The D state of wait_on_all_pages_writeback should be waken by
>>>> function f2fs_write_end_io when all writeback pages have been
>>>> succesfully written to device. It's possible that wake_up comes
>>>> between get_pages and io_schedule. Maybe in this case it will
>>>> lost wake_up and still in D state even if all pages have been
>>>> write back to device, and finally, the whole system will be into
>>>> the hungtask state.
>>>
>>> I haven't encountered such issue so far, do you suffer this in real
>>> world?
>>>
>> yes, I have encounter it, the whole file system is blocked at function
>> wait_on_all_pages_writeback beyond 120s when write cp, and no error reported
>> by storage device driver.
>
Hi Shawn,
> Hi YunLei
>
> Interesting...
> How to prove the page isn't been locked by storage block layer which
> doesn't finish the related bio?
>
> I suffer that case before, but then I solve it by seting a timer to
> break my mmc driver while missing some important interrupts(chip
> problem) for data-transfer to finish the io queue by mmcqd.
>
Thanks for your review, I has replied Chao about the info of f2fs_sb_info,
and from the info we can make sure the block layer and device driver are
both ok.

> Another thing,
> why making 5HZ for timeout? why not 10HZ ,15HZ...
> io_schedule_timeout(5*HZ);
>

We choose 5HZ in our test, maybe other value is more reasonable.

>>>>
>>>>                   if (!get_pages(sbi, F2FS_WRITEBACK))
>>>>                            break;
>>>>                     <---------  wake_up
>>>
>>> wake_up will put all tasks linked in sbi->cp_wait on run-queue, so
>>> here it should be save to call io_schedule, after being rescheduled,
>>> it will get the chance to check above condition to break out.
>>>
>>> Thanks,
>>
>> Here, we just doubt something weird may cause wait_on_all_pages_writeback
>> could not be waken. Wake_up trigger only one time by last bio's end_io
>> function, if the thread happen to miss it, the thread will be in D state
>> forever. So we change the code to make wait_on_all_pages_writeback awaken
>> periodically, then check the condition.
>>
>>>
>>>>                   io_schedule();
>>>>
>>>> Signed-off-by: Yunlei He <heyunlei@huawei.com>
>>>> Signed-off-by: Biao He <hebiao6@huawei.com>
>>>> ---
>>>>    fs/f2fs/checkpoint.c | 2 +-
>>>>    1 file changed, 1 insertion(+), 1 deletion(-)
>>>>
>>>> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
>>>> index 2bac8a1..f55355d 100644
>>>> --- a/fs/f2fs/checkpoint.c
>>>> +++ b/fs/f2fs/checkpoint.c
>>>> @@ -920,7 +920,7 @@ static void wait_on_all_pages_writeback(struct f2fs_sb_info *sbi)
>>>>            if (!get_pages(sbi, F2FS_WRITEBACK))
>>>>                break;
>>>>
>>>> -        io_schedule();
>>>> +        io_schedule_timeout(5*HZ);
>>>>        }
>>>>        finish_wait(&sbi->cp_wait, &wait);
>>>>    }
>>>> --
>>>> 1.9.1
>>>
>>>
>>>
>>> .
>>>
>>
>>
>> ------------------------------------------------------------------------------
>> Site24x7 APM Insight: Get Deep Visibility into Application Performance
>> APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
>> Monitor end-to-end web transactions and take corrective actions now
>> Troubleshoot faster and improve end-user experience. Signup Now!
>> http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140
>> _______________________________________________
>> Linux-f2fs-devel mailing list
>> Linux-f2fs-devel@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel
>>
>
>
> .
>


------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140

^ permalink raw reply	[flat|nested] 16+ messages in thread

end of thread, other threads:[~2016-02-26  1:17 UTC | newest]

Thread overview: 16+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-02-23  4:07 [PATCH] f2fs: avoid hungtask problem caused by losing wake_up Yunlei He
2016-02-23  5:44 ` Chao Yu
2016-02-23  7:02   ` He YunLei
2016-02-23  9:15     ` Chao Yu
2016-02-23 11:36       ` He YunLei
2016-02-24  3:46         ` Chao Yu
2016-02-24  7:32           ` He YunLei
2016-02-24  8:05             ` Chao Yu
2016-02-24  9:45               ` hebiao (G)
2016-02-25  9:32                 ` Chao Yu
2016-02-25  7:36           ` He YunLei
2016-02-25  9:41             ` Chao Yu
2016-02-25 19:03               ` Jaegeuk Kim
2016-02-26  1:15                 ` Chao Yu
2016-02-23  9:32     ` Shawn Lin
2016-02-23 11:45       ` He YunLei

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).