All of lore.kernel.org
 help / color / mirror / Atom feed
From: Anthony Liguori <anthony@codemonkey.ws>
To: Andrea Arcangeli <aarcange@redhat.com>
Cc: Gerd Hoffmann <kraxel@redhat.com>,
	qemu-devel@nongnu.org, kvm-devel <kvm@vger.kernel.org>
Subject: Re: [Qemu-devel] [RFC] Replace posix-aio with custom thread pool
Date: Fri, 12 Dec 2008 10:49:45 -0600	[thread overview]
Message-ID: <49429629.20309@codemonkey.ws> (raw)
In-Reply-To: <20081212154418.GM6809@random.random>

Andrea Arcangeli wrote:
> On Fri, Dec 12, 2008 at 08:35:57AM -0600, Anthony Liguori wrote:
>   
>> I've been thinking about this, the problems I see are:
>>
>> 1) It's impossible to accept a file descriptor for a block device (possibly 
>> not a problem)
>>     
>
> What do you mean with accept? You mean to accept a tcp connection? How
> would a block device fd be related to accept(2)?
>   

I meant, if you wanted to pass a file descriptor as a raw device.  So:

qemu -hda raw:fd=4

Or something like that.  We don't support this today.

> Now that linux-aio is out of the picture for quite a long time for us,
> I guess it worth to wait preadv/pwritev and stick with that and
> reconsider linux-aio after they fix it... Waiting Gerd to post a full
> patch.
>
> But it's your call... I'm fine either ways. Clearly the os missing
> preadv/pwritev would need to be limited to 1 thread per fd (but 1
> thread per fd kind of breaks with the current _global_ list so I guess
> they'll be limited to just 1 thread otherwise it may be actually
> simpler to just open the file multiple times than to have a per-fd
> queue ;), not the end of the world for them.
>   

I think bouncing the iov and just using pread/pwrite may be our best 
bet.  It means memory allocation but we can cap it.  Since we're using 
threads, we just can force a thread to sleep until memory becomes 
available so it's actually pretty straight forward.

We can use libaio on older Linux's to simulate preadv/pwritev.  Use the 
proper syscalls on newer kernels, on BSDs, and bounce everything else.

Regards,

Anthony Liguori


WARNING: multiple messages have this Message-ID (diff)
From: Anthony Liguori <anthony@codemonkey.ws>
To: Andrea Arcangeli <aarcange@redhat.com>
Cc: Gerd Hoffmann <kraxel@redhat.com>,
	kvm-devel <kvm@vger.kernel.org>,
	qemu-devel@nongnu.org
Subject: Re: [Qemu-devel] [RFC] Replace posix-aio with custom thread pool
Date: Fri, 12 Dec 2008 10:49:45 -0600	[thread overview]
Message-ID: <49429629.20309@codemonkey.ws> (raw)
In-Reply-To: <20081212154418.GM6809@random.random>

Andrea Arcangeli wrote:
> On Fri, Dec 12, 2008 at 08:35:57AM -0600, Anthony Liguori wrote:
>   
>> I've been thinking about this, the problems I see are:
>>
>> 1) It's impossible to accept a file descriptor for a block device (possibly 
>> not a problem)
>>     
>
> What do you mean with accept? You mean to accept a tcp connection? How
> would a block device fd be related to accept(2)?
>   

I meant, if you wanted to pass a file descriptor as a raw device.  So:

qemu -hda raw:fd=4

Or something like that.  We don't support this today.

> Now that linux-aio is out of the picture for quite a long time for us,
> I guess it worth to wait preadv/pwritev and stick with that and
> reconsider linux-aio after they fix it... Waiting Gerd to post a full
> patch.
>
> But it's your call... I'm fine either ways. Clearly the os missing
> preadv/pwritev would need to be limited to 1 thread per fd (but 1
> thread per fd kind of breaks with the current _global_ list so I guess
> they'll be limited to just 1 thread otherwise it may be actually
> simpler to just open the file multiple times than to have a per-fd
> queue ;), not the end of the world for them.
>   

I think bouncing the iov and just using pread/pwrite may be our best 
bet.  It means memory allocation but we can cap it.  Since we're using 
threads, we just can force a thread to sleep until memory becomes 
available so it's actually pretty straight forward.

We can use libaio on older Linux's to simulate preadv/pwritev.  Use the 
proper syscalls on newer kernels, on BSDs, and bounce everything else.

Regards,

Anthony Liguori

  reply	other threads:[~2008-12-12 16:49 UTC|newest]

Thread overview: 75+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-12-05 21:21 [RFC] Replace posix-aio with custom thread pool Anthony Liguori
2008-12-05 21:21 ` [Qemu-devel] " Anthony Liguori
2008-12-06  9:03 ` Blue Swirl
2008-12-06 18:26   ` Jamie Lokier
2008-12-08 18:23   ` Anthony Liguori
2008-12-08 18:23     ` Anthony Liguori
2008-12-09 15:51 ` Gerd Hoffmann
2008-12-09 16:01   ` Anthony Liguori
2008-12-10 16:44     ` Andrea Arcangeli
2008-12-10 17:21       ` Anthony Liguori
2008-12-10 17:21         ` Anthony Liguori
2008-12-10 17:29         ` Gerd Hoffmann
2008-12-10 18:50           ` Anthony Liguori
2008-12-10 19:08             ` Andrea Arcangeli
2008-12-10 19:08               ` Andrea Arcangeli
2008-12-11 13:12               ` Andrea Arcangeli
2008-12-11 15:24                 ` Gerd Hoffmann
2008-12-11 15:24                   ` Gerd Hoffmann
2008-12-11 15:53                   ` Andrea Arcangeli
2008-12-11 15:53                     ` Andrea Arcangeli
2008-12-11 16:11                     ` Gerd Hoffmann
2008-12-11 16:11                       ` Gerd Hoffmann
2008-12-11 16:49                       ` Andrea Arcangeli
2008-12-11 16:49                         ` Andrea Arcangeli
2008-12-11 17:20                         ` Gerd Hoffmann
2008-12-11 17:20                           ` Gerd Hoffmann
2008-12-11 18:11                           ` Andrea Arcangeli
2008-12-11 18:11                             ` Andrea Arcangeli
2008-12-11 20:38                             ` Gerd Hoffmann
2008-12-11 20:38                               ` Gerd Hoffmann
2008-12-11 20:40                             ` Anthony Liguori
2008-12-12  8:23                             ` Jens Axboe
2008-12-12  8:23                               ` Jens Axboe
2008-12-12 11:51                               ` Andrea Arcangeli
2008-12-12 11:51                                 ` Andrea Arcangeli
2008-12-12 11:54                                 ` Jens Axboe
2008-12-12 11:54                                   ` Jens Axboe
2008-12-12 14:13                                   ` Andrea Arcangeli
2008-12-12 14:13                                     ` Andrea Arcangeli
2008-12-12 14:24                                     ` Anthony Liguori
2008-12-12 14:24                                       ` Anthony Liguori
2008-12-12 16:33                                       ` Chris Wright
2008-12-12 16:33                                         ` Chris Wright
2008-12-12 16:51                                         ` Anthony Liguori
2008-12-12 16:51                                           ` Anthony Liguori
2008-12-12 16:52                                           ` Chris Wright
2008-12-12 16:52                                             ` Chris Wright
2008-12-11 21:32                         ` Christoph Hellwig
2008-12-12  0:27                           ` Andrea Arcangeli
2008-12-12  0:27                             ` Andrea Arcangeli
2008-12-11 21:30                     ` Christoph Hellwig
2008-12-11 16:41                   ` Anthony Liguori
2008-12-11 16:41                     ` Anthony Liguori
2008-12-12 14:24               ` Andrea Arcangeli
2008-12-12 14:24                 ` Andrea Arcangeli
2008-12-12 14:35                 ` Anthony Liguori
2008-12-12 14:35                   ` Anthony Liguori
2008-12-12 15:44                   ` Andrea Arcangeli
2008-12-12 15:44                     ` Andrea Arcangeli
2008-12-12 16:49                     ` Anthony Liguori [this message]
2008-12-12 16:49                       ` Anthony Liguori
2008-12-12 17:09                       ` Andrea Arcangeli
2008-12-12 17:09                         ` Andrea Arcangeli
2008-12-12 17:25                         ` Anthony Liguori
2008-12-12 17:25                           ` Anthony Liguori
2008-12-12 17:52                           ` Andrea Arcangeli
2008-12-12 17:52                             ` Andrea Arcangeli
2008-12-12 18:17                             ` Anthony Liguori
2008-12-12 18:17                               ` Anthony Liguori
2008-12-12 18:26                               ` Andrea Arcangeli
2008-12-12 20:12                                 ` Gerd Hoffmann
2008-12-12 20:17                                   ` Anthony Liguori
2008-12-12 20:35                                     ` Gerd Hoffmann
2008-12-09 17:16   ` Avi Kivity
2008-12-17 14:44 ` Ian Jackson

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=49429629.20309@codemonkey.ws \
    --to=anthony@codemonkey.ws \
    --cc=aarcange@redhat.com \
    --cc=kraxel@redhat.com \
    --cc=kvm@vger.kernel.org \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.