From: bvanassche@acm.org (Bart Van Assche)
Subject: RFC: Allow block drivers to poll for I/O instead of sleeping
Date: Tue, 25 Jun 2013 09:07:29 +0200 [thread overview]
Message-ID: <51C941B1.6000305@acm.org> (raw)
In-Reply-To: <20130625031809.GB8211@linux.intel.com>
On 06/25/13 05:18, Matthew Wilcox wrote:
> On Mon, Jun 24, 2013@10:07:51AM +0200, Ingo Molnar wrote:
>> I'm wondering, how will this scheme work if the IO completion latency is a
>> lot more than the 5 usecs in the testcase? What if it takes 20 usecs or
>> 100 usecs or more?
>
> There's clearly a threshold at which it stops making sense, and our
> current NAND-based SSDs are almost certainly on the wrong side of that
> threshold! I can't wait for one of the "post-NAND" technologies to make
> it to market in some form that makes it economical to use in an SSD.
>
> The problem is that some of the people who are looking at those
> technologies are crazy. They want to "bypass the kernel" and "do user
> space I/O" because "the kernel is too slow". This patch is part of an
> effort to show them how crazy they are. And even if it doesn't convince
> them, at least users who refuse to rewrite their applications to take
> advantage of magical userspace I/O libraries will see real performance
> benefits.
Recently I attended an interesting talk about this subject in which it
was proposed not only to bypass the kernel for access to high-IOPS
devices but also to allow byte-addressability for block devices. The
slides that accompanied that talk can be found here (includes a
performance comparison with the traditional block driver API):
Bernard Metzler, On Suitability of High-Performance Networking API for
Storage, OFA Int'l Developer Workshop, April 24, 2013
(http://www.openfabrics.org/ofa-documents/presentations/doc_download/559-on-suitability-of-high-performance-networking-api-for-storage.html).
This approach leaves the choice of whether to use polling or an
interrupt-based completion notification to the user of the new API,
something the Linux InfiniBand RDMA verbs API already allows today.
Bart.
WARNING: multiple messages have this Message-ID (diff)
From: Bart Van Assche <bvanassche@acm.org>
To: Matthew Wilcox <willy@linux.intel.com>
Cc: Ingo Molnar <mingo@kernel.org>,
Linus Torvalds <torvalds@linux-foundation.org>,
Jens Axboe <axboe@kernel.dk>, Al Viro <viro@zeniv.linux.org.uk>,
Ingo Molnar <mingo@redhat.com>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
linux-nvme@lists.infradead.org,
Linux SCSI List <linux-scsi@vger.kernel.org>,
Andrew Morton <akpm@linux-foundation.org>,
Peter Zijlstra <a.p.zijlstra@chello.nl>,
Thomas Gleixner <tglx@linutronix.de>
Subject: Re: RFC: Allow block drivers to poll for I/O instead of sleeping
Date: Tue, 25 Jun 2013 09:07:29 +0200 [thread overview]
Message-ID: <51C941B1.6000305@acm.org> (raw)
In-Reply-To: <20130625031809.GB8211@linux.intel.com>
On 06/25/13 05:18, Matthew Wilcox wrote:
> On Mon, Jun 24, 2013 at 10:07:51AM +0200, Ingo Molnar wrote:
>> I'm wondering, how will this scheme work if the IO completion latency is a
>> lot more than the 5 usecs in the testcase? What if it takes 20 usecs or
>> 100 usecs or more?
>
> There's clearly a threshold at which it stops making sense, and our
> current NAND-based SSDs are almost certainly on the wrong side of that
> threshold! I can't wait for one of the "post-NAND" technologies to make
> it to market in some form that makes it economical to use in an SSD.
>
> The problem is that some of the people who are looking at those
> technologies are crazy. They want to "bypass the kernel" and "do user
> space I/O" because "the kernel is too slow". This patch is part of an
> effort to show them how crazy they are. And even if it doesn't convince
> them, at least users who refuse to rewrite their applications to take
> advantage of magical userspace I/O libraries will see real performance
> benefits.
Recently I attended an interesting talk about this subject in which it
was proposed not only to bypass the kernel for access to high-IOPS
devices but also to allow byte-addressability for block devices. The
slides that accompanied that talk can be found here (includes a
performance comparison with the traditional block driver API):
Bernard Metzler, On Suitability of High-Performance Networking API for
Storage, OFA Int'l Developer Workshop, April 24, 2013
(http://www.openfabrics.org/ofa-documents/presentations/doc_download/559-on-suitability-of-high-performance-networking-api-for-storage.html).
This approach leaves the choice of whether to use polling or an
interrupt-based completion notification to the user of the new API,
something the Linux InfiniBand RDMA verbs API already allows today.
Bart.
next prev parent reply other threads:[~2013-06-25 7:07 UTC|newest]
Thread overview: 42+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-06-20 20:17 RFC: Allow block drivers to poll for I/O instead of sleeping Matthew Wilcox
2013-06-20 20:17 ` Matthew Wilcox
2013-06-23 10:09 ` Ingo Molnar
2013-06-23 10:09 ` Ingo Molnar
2013-06-23 18:29 ` Linus Torvalds
2013-06-23 18:29 ` Linus Torvalds
2013-06-24 7:17 ` Jens Axboe
2013-06-24 7:17 ` Jens Axboe
2013-06-25 0:11 ` Steven Rostedt
2013-06-25 0:11 ` Steven Rostedt
2013-06-25 3:07 ` Matthew Wilcox
2013-06-25 3:07 ` Matthew Wilcox
2013-06-25 13:57 ` Steven Rostedt
2013-06-25 13:57 ` Steven Rostedt
2013-06-25 14:57 ` Jens Axboe
2013-06-25 14:57 ` Jens Axboe
2013-06-24 8:07 ` Ingo Molnar
2013-06-24 8:07 ` Ingo Molnar
2013-06-25 3:18 ` Matthew Wilcox
2013-06-25 3:18 ` Matthew Wilcox
2013-06-25 7:07 ` Bart Van Assche [this message]
2013-06-25 7:07 ` Bart Van Assche
2013-06-25 15:00 ` Jens Axboe
2013-06-25 15:00 ` Jens Axboe
2013-06-27 18:10 ` Rik van Riel
2013-06-27 18:10 ` Rik van Riel
2013-06-23 22:14 ` David Ahern
2013-06-23 22:14 ` David Ahern
2013-06-24 8:21 ` Ingo Molnar
2013-06-24 8:21 ` Ingo Molnar
2013-06-24 7:15 ` Jens Axboe
2013-06-24 7:15 ` Jens Axboe
2013-06-24 8:18 ` Ingo Molnar
2013-06-24 8:18 ` Ingo Molnar
2013-06-25 3:01 ` Matthew Wilcox
2013-06-25 3:01 ` Matthew Wilcox
2013-06-25 14:55 ` Jens Axboe
2013-06-25 14:55 ` Jens Axboe
2013-06-27 18:42 ` Rik van Riel
2013-06-27 18:42 ` Rik van Riel
2013-07-04 1:13 ` Shaohua Li
2013-07-04 1:13 ` Shaohua Li
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51C941B1.6000305@acm.org \
--to=bvanassche@acm.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.