From: Anthony Liguori <anthony@codemonkey.ws>
To: Stefan Hajnoczi <stefanha@gmail.com>
Cc: Kevin Wolf <kwolf@redhat.com>,
Jan Kiszka <jan.kiszka@siemens.com>, Avi Kivity <avi@redhat.com>,
kvm@vger.kernel.org, qemu-devel@nongnu.org
Subject: Re: [Qemu-devel] [PATCH v2] posix-aio-compat: fix latency issues
Date: Tue, 23 Aug 2011 07:40:09 -0500 [thread overview]
Message-ID: <4E539FA9.3010507@codemonkey.ws> (raw)
In-Reply-To: <CAJSP0QW0rphazTVd2cyho=D0MVsVCcHu3wzwXJsJqGnM_4owzg@mail.gmail.com>
On 08/23/2011 06:01 AM, Stefan Hajnoczi wrote:
> On Mon, Aug 22, 2011 at 6:29 PM, Jan Kiszka<jan.kiszka@siemens.com> wrote:
>> On 2011-08-14 06:04, Avi Kivity wrote:
>>> In certain circumstances, posix-aio-compat can incur a lot of latency:
>>> - threads are created by vcpu threads, so if vcpu affinity is set,
>>> aio threads inherit vcpu affinity. This can cause many aio threads
>>> to compete for one cpu.
>>> - we can create up to max_threads (64) aio threads in one go; since a
>>> pthread_create can take around 30μs, we have up to 2ms of cpu time
>>> under a global lock.
>>>
>>> Fix by:
>>> - moving thread creation to the main thread, so we inherit the main
>>> thread's affinity instead of the vcpu thread's affinity.
>>> - if a thread is currently being created, and we need to create yet
>>> another thread, let thread being born create the new thread, reducing
>>> the amount of time we spend under the main thread.
>>> - drop the local lock while creating a thread (we may still hold the
>>> global mutex, though)
>>>
>>> Note this doesn't eliminate latency completely; scheduler artifacts or
>>> lack of host cpu resources can still cause it. We may want pre-allocated
>>> threads when this cannot be tolerated.
>>>
>>> Thanks to Uli Obergfell of Red Hat for his excellent analysis and suggestions.
>>
>> At this chance: What is the state of getting rid of the remaining delta
>> between upstream's version and qemu-kvm?
>
> That would be nice. qemu-kvm.git uses a signalfd to handle I/O
> completion whereas qemu.git uses a signal, writes to a pipe from the
> signal handler, and uses qemu_notify_event() to break the vcpu. Once
> the force iothread patch is merged we should be able to move to
> qemu-kvm.git's signalfd approach.
No need to use a signal at all actually. The use of a signal is
historic and was required to work around the TCG race that I referred to
in another thread.
You should be able to just use an eventfd or pipe.
Better yet, we should look at using GThreadPool to replace posix-aio-compat.
Regards,
Anthony Liguori
>
> Stefan
>
next prev parent reply other threads:[~2011-08-23 12:40 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-08-14 4:04 [Qemu-devel] [PATCH v2] posix-aio-compat: fix latency issues Avi Kivity
2011-08-22 17:22 ` Kevin Wolf
2011-08-22 17:29 ` Jan Kiszka
2011-08-23 11:01 ` Stefan Hajnoczi
2011-08-23 12:40 ` Anthony Liguori [this message]
2011-08-23 13:02 ` Jan Kiszka
2011-08-23 14:02 ` Anthony Liguori
2011-08-23 14:10 ` Jan Kiszka
2011-08-28 8:09 ` Avi Kivity
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4E539FA9.3010507@codemonkey.ws \
--to=anthony@codemonkey.ws \
--cc=avi@redhat.com \
--cc=jan.kiszka@siemens.com \
--cc=kvm@vger.kernel.org \
--cc=kwolf@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=stefanha@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).