* Re: Revert "aio: block exit_aio() until all context requests are completed"
From: Christian Borntraeger @ 2015-05-15 15:26 UTC (permalink / raw)
To: Jeff Moyer; +Cc: Gu Zheng, Benjamin LaHaise, linux-aio, linux-fsdevel, stable
On 15.05.2015 at 15:42, Jeff Moyer wrote:
> Christian Borntraeger <borntraeger@de.ibm.com> writes:
>
>> I see a significant latency (can be minutes with 2000 disks and HZ=100)
>> when exiting a QEMU process that has lots of disk devices via aio. The
>> process sits idle, doing nothing, as a zombie in exit_aio waiting for the
>> completion.
>>
>> Turns out that
>> commit 6098b45b32 ("aio: block exit_aio() until all context requests are
>> completed") caused the delay.
>>
>> Patch description was:
>>
>> It seems that exit_aio() also needs to wait for all iocbs to complete (like
>> io_destroy), but we missed the wait step in current implementation, so fix
>> it in the same way as we did in io_destroy.
>>
>> Now, io_destroy is required to block until everything is cleaned up, per its
>> interface description in the manpage:
>> DESCRIPTION
>> The io_destroy() system call will attempt to cancel all outstanding
>> asynchronous I/O operations against ctx_id, will block on the completion
>> of all operations that could not be canceled, and will destroy the ctx_id.
>>
>> Does process exit require the same full blocking? We might be able to
>> clean up the process and let the aio data structures be freed lazily.
>> Opinions or better ideas?
>
> This has already been fixed:
>
> commit dc48e56d761610da4ea1088d1bea0a030b8e3e43
> Author: Jens Axboe <axboe@fb.com>
> Date: Wed Apr 15 11:17:23 2015 -0600
>
> aio: fix serial draining in exit_aio()
>
> Cheers,
> Jeff
>
Cool, thanks. Since the original patch was Cc'ed to stable, shouldn't the fix be backported as well?
Christian
* Re: Revert "aio: block exit_aio() until all context requests are completed"
From: Jens Axboe @ 2015-05-16 15:16 UTC (permalink / raw)
To: Christian Borntraeger, Jeff Moyer
Cc: Gu Zheng, Benjamin LaHaise, linux-aio, linux-fsdevel, stable
On 05/15/2015 09:26 AM, Christian Borntraeger wrote:
> On 15.05.2015 at 15:42, Jeff Moyer wrote:
>> Christian Borntraeger <borntraeger@de.ibm.com> writes:
>>
>>> I see a significant latency (can be minutes with 2000 disks and HZ=100)
>>> when exiting a QEMU process that has lots of disk devices via aio. The
>>> process sits idle, doing nothing, as a zombie in exit_aio waiting for the
>>> completion.
>>>
>>> Turns out that
>>> commit 6098b45b32 ("aio: block exit_aio() until all context requests are
>>> completed") caused the delay.
>>>
>>> Patch description was:
>>>
>>> It seems that exit_aio() also needs to wait for all iocbs to complete (like
>>> io_destroy), but we missed the wait step in current implementation, so fix
>>> it in the same way as we did in io_destroy.
>>>
>>> Now, io_destroy is required to block until everything is cleaned up, per its
>>> interface description in the manpage:
>>> DESCRIPTION
>>> The io_destroy() system call will attempt to cancel all outstanding
>>> asynchronous I/O operations against ctx_id, will block on the completion
>>> of all operations that could not be canceled, and will destroy the ctx_id.
>>>
>>> Does process exit require the same full blocking? We might be able to
>>> clean up the process and let the aio data structures be freed lazily.
>>> Opinions or better ideas?
>>
>> This has already been fixed:
>>
>> commit dc48e56d761610da4ea1088d1bea0a030b8e3e43
>> Author: Jens Axboe <axboe@fb.com>
>> Date: Wed Apr 15 11:17:23 2015 -0600
>>
>> aio: fix serial draining in exit_aio()
>>
>> Cheers,
>> Jeff
>>
> Cool, thanks. Since the original patch was Cc'ed to stable, shouldn't the fix be backported as well?
I'll email stable.
--
Jens Axboe