From: jim owens <owens6336@gmail.com>
To: Andi Kleen <andi@firstfloor.org>
Cc: linux-btrfs <linux-btrfs@vger.kernel.org>
Subject: Re: [PATCH V3 17/18] Btrfs: Full direct I/O and AIO read implementation.
Date: Tue, 23 Mar 2010 17:44:37 -0400 [thread overview]
Message-ID: <4BA93645.3060901@gmail.com> (raw)
In-Reply-To: <87sk7r60zw.fsf@basil.nowhere.org>
Andi Kleen wrote:
> One thing that stroke me while reading is that, except for the out of line no data
> checksum case, this isn't real classical zero copy direct IO because
> you always have to copy through some buffer.
Uh no, unless I really messed up or don't understand what you mean.
Uncompressed data with no checksums only buffers on an error or EOF.
With checksums enabled, uncompressed reads aligned on the 4k block
are classic direct IO to user memory except at EOF.
With checksums, unaligned reads still go direct to user memory, I
just have to read the extra head and tail to kernel buffers to
make the start and end 4k aligned. This is efficient for large
reads but maybe not so efficient for small ones.
The special no-checksum EOF buffering is only for consistency, we
could choose to read the whole disk block like classic direct IO.
With checksums, unaligned reads < 4K always have some buffered part
for the (4k - user_length) so that may be what you mean.
> It's more like "uncached IO"
>
> I was wondering that at least for those cases wouldn't it be simpler
> to use the normal page cache IO path and use new hints that disable
> prefetch/write-behind/caching in the page cache after the IO operation?
Maybe.
> Is there any particular reason this wasn't done? Was it because
> of aio?
>
> I know the page cache currently doesn't support that today, but
> presumably it wouldn't be too hard to add.
The only reason I did not do something like that is:
1) I did not want to disturb the page cache with throw-away pages.
2) "uncached IO" makes it even less like classic direct IO.
3) Writing that page cache code might not be simpler.
As further argument against "uncached IO", Chris sent a very simple
patch up to read into page cache then purge it for btrfs direct IO
reads and it was NACKed.
> I guess the code would be much simpler if it only did the no checksum
> case.
yes, yes, yes :)
jim
prev parent reply other threads:[~2010-03-23 21:44 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-03-22 3:34 [PATCH V3 17/18] Btrfs: Full direct I/O and AIO read implementation jim owens
2010-03-23 5:49 ` Andi Kleen
2010-03-23 21:40 ` jim owens
2010-03-24 0:57 ` Chris Mason
2010-03-24 2:37 ` Andi Kleen
2010-03-25 3:08 ` jim owens
2010-03-25 9:41 ` Andi Kleen
2010-03-25 22:38 ` jim owens
2010-03-23 21:44 ` jim owens [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4BA93645.3060901@gmail.com \
--to=owens6336@gmail.com \
--cc=andi@firstfloor.org \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).