public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Stephen Lord <lord@sgi.com>
To: Jeff Garzik <garzik@havoc.gtf.org>
Cc: Chris Mason <mason@suse.com>, Andrea Arcangeli <andrea@suse.de>,
	Chris Wedgwood <cw@f00f.org>, Andrew Morton <akpm@zip.com.au>,
	Ricardo Galli <gallir@uib.es>,
	Linux Kernel <linux-kernel@vger.kernel.org>
Subject: Re: O_DIRECT fails in some kernel and FS
Date: Sun, 03 Feb 2002 07:40:57 -0600	[thread overview]
Message-ID: <3C5D3DE9.4080503@sgi.com> (raw)
In-Reply-To: <E16WkQj-0005By-00@antoli.uib.es> <3C5AFE2D.95A3C02E@zip.com.au> <1012597538.26363.443.camel@jen.americas.sgi.com> <20020202093554.GA7207@tapu.f00f.org> <234710000.1012674008@tiny> <20020202205438.D3807@athlon.random> <242700000.1012680610@tiny> <3C5C4929.5080403@sgi.com> <20020202155028.B26147@havoc.gtf.org>

Jeff Garzik wrote:

>On Sat, Feb 02, 2002 at 02:16:41PM -0600, Stephen Lord wrote:
>
>>Can't you fall back to buffered I/O for the tail? OK it complicates the
>>code, probably a lot, but it keeps things sane from the user's point of
>>view.
>>
>
>For O_DIRECT, IMHO you should fail not fallback.  You're simply lying
>to the underlying program otherwise.
>

By fallback I mean't just for the tail, not the whole file.

I have been there before. I had to implement the mixed mode buffered/direct
I/O on Unicos because a change in underlying disk subsystems stopped
customer applications from working - the allowed boundaries for
O_DIRECT stopped working when the sales people sold them some new
disks. This also meant you could get most of the speed benefits of
O_DIRECT without having to align your I/O, it also meant really
large I/Os could be made to automatically bypass cache to avoid
cache thrashing.

What we had were two flags, one which indicated use direct I/O, and another
which indicated return an error to user space rather than go through 
buffers.
So lie to me and make it work, or don't lie to me options I suppose.

>
>
>In the ibu fs I am hacking on, the idea for O_DIRECT is to fail a read
>if the file is small enough to fit in the inode.  If the O_DIRECT
>action is a write, then I will invalidate the data in the inode,
>then follow the standard path (which eventually calls get_block()).
>
>For file tails (a different case from small-file-in-inode), I
>imagine it would be prudent to support O_DIRECT for all actions
>except reading the file tail.  If you want to be complicated, you
>could provide userspace with a way to say "this is a dense file"
>and/or simply not create a tail at all...
>
I suspect the reason XFS never did small files in the inode was because of
the problems with implementing mmap and O_DIRECT.

>
>	Jeff
>
>
Steve



  reply	other threads:[~2002-02-03 13:42 UTC|newest]

Thread overview: 32+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2002-02-01 20:37 O_DIRECT fails in some kernel and FS Ricardo Galli
2002-02-01 20:44 ` Andrew Morton
2002-02-01 20:49   ` Ricardo Galli
2002-02-01 20:57     ` Andrew Morton
2002-02-01 21:05   ` Steve Lord
2002-02-02  9:35     ` Chris Wedgwood
2002-02-02 10:25       ` Hans Reiser
2002-02-02 15:24       ` Chris Mason
2002-02-02 18:20       ` Chris Mason
2002-02-02 19:54         ` Andrea Arcangeli
2002-02-02 20:10           ` Chris Mason
2002-02-02 20:16             ` Stephen Lord
2002-02-02 20:50               ` Jeff Garzik
2002-02-03 13:40                 ` Stephen Lord [this message]
2002-02-03 14:09                   ` Chris Wedgwood
2002-02-03 15:05                     ` Stephen Lord
2002-02-03 22:44                       ` Chris Wedgwood
2002-02-04 15:04                         ` Jeff Garzik
2002-02-04 15:21                           ` Chris Mason
2002-02-04 15:15                         ` Steve Lord
2002-02-04 15:46                           ` Alan Cox
2002-02-04 16:02                             ` Steve Lord
2002-02-04 18:22                               ` Daniel Phillips
2002-02-04 19:11                                 ` Steve Lord
2002-02-04 18:29                             ` Joel Becker
2002-02-04 18:49                               ` Jeff Garzik
2002-02-04 18:55                                 ` Joel Becker
2002-02-04 19:16                                   ` Jeff Garzik
2002-02-02 17:14   ` Christoph Hellwig
     [not found] <E16WkQj-0005By-00@antoli.uib.es.suse.lists.linux.kernel>
     [not found] ` <3C5AFE2D.95A3C02E@zip.com.au.suse.lists.linux.kernel>
     [not found]   ` <1012597538.26363.443.camel@jen.americas.sgi.com.suse.lists.linux.kernel>
     [not found]     ` <20020202093554.GA7207@tapu.f00f.org.suse.lists.linux.kernel>
     [not found]       ` <234710000.1012674008@tiny.suse.lists.linux.kernel>
     [not found]         ` <20020202205438.D3807@athlon.random.suse.lists.linux.kernel>
     [not found]           ` <242700000.1012680610@tiny.suse.lists.linux.kernel>
     [not found]             ` <3C5C4929.5080403@sgi.com.suse.lists.linux.kernel>
     [not found]               ` <20020202155028.B26147@havoc.gtf.org.suse.lists.linux.kernel>
2002-02-03  7:26                 ` Andi Kleen
2002-02-04 15:13                   ` Jeff Garzik
2002-02-04 15:31                     ` Chris Mason

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3C5D3DE9.4080503@sgi.com \
    --to=lord@sgi.com \
    --cc=akpm@zip.com.au \
    --cc=andrea@suse.de \
    --cc=cw@f00f.org \
    --cc=gallir@uib.es \
    --cc=garzik@havoc.gtf.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mason@suse.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox