All of lore.kernel.org
 help / color / mirror / Atom feed
From: Stephen Lord <lord@sgi.com>
To: Jeff Garzik <garzik@havoc.gtf.org>
Cc: Chris Mason <mason@suse.com>, Andrea Arcangeli <andrea@suse.de>,
	Chris Wedgwood <cw@f00f.org>, Andrew Morton <akpm@zip.com.au>,
	Ricardo Galli <gallir@uib.es>,
	Linux Kernel <linux-kernel@vger.kernel.org>
Subject: Re: O_DIRECT fails in some kernel and FS
Date: Sun, 03 Feb 2002 07:40:57 -0600	[thread overview]
Message-ID: <3C5D3DE9.4080503@sgi.com> (raw)
In-Reply-To: <E16WkQj-0005By-00@antoli.uib.es> <3C5AFE2D.95A3C02E@zip.com.au> <1012597538.26363.443.camel@jen.americas.sgi.com> <20020202093554.GA7207@tapu.f00f.org> <234710000.1012674008@tiny> <20020202205438.D3807@athlon.random> <242700000.1012680610@tiny> <3C5C4929.5080403@sgi.com> <20020202155028.B26147@havoc.gtf.org>

Jeff Garzik wrote:

>On Sat, Feb 02, 2002 at 02:16:41PM -0600, Stephen Lord wrote:
>
>>Can't you fall back to buffered I/O for the tail? OK it complicates the
>>code, probably a lot, but it keeps things sane from the user's point of
>>view.
>>
>
>For O_DIRECT, IMHO you should fail not fallback.  You're simply lying
>to the underlying program otherwise.
>

By fallback I mean't just for the tail, not the whole file.

I have been there before. I had to implement the mixed mode buffered/direct
I/O on Unicos because a change in underlying disk subsystems stopped
customer applications from working - the allowed boundaries for
O_DIRECT stopped working when the sales people sold them some new
disks. This also meant you could get most of the speed benefits of
O_DIRECT without having to align your I/O, it also meant really
large I/Os could be made to automatically bypass cache to avoid
cache thrashing.

What we had were two flags, one which indicated use direct I/O, and another
which indicated return an error to user space rather than go through 
buffers.
So lie to me and make it work, or don't lie to me options I suppose.

>
>
>In the ibu fs I am hacking on, the idea for O_DIRECT is to fail a read
>if the file is small enough to fit in the inode.  If the O_DIRECT
>action is a write, then I will invalidate the data in the inode,
>then follow the standard path (which eventually calls get_block()).
>
>For file tails (a different case from small-file-in-inode), I
>imagine it would be prudent to support O_DIRECT for all actions
>except reading the file tail.  If you want to be complicated, you
>could provide userspace with a way to say "this is a dense file"
>and/or simply not create a tail at all...
>
I suspect the reason XFS never did small files in the inode was because of
the problems with implementing mmap and O_DIRECT.

>
>	Jeff
>
>
Steve



  reply	other threads:[~2002-02-03 13:42 UTC|newest]

Thread overview: 32+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2002-02-01 20:37 O_DIRECT fails in some kernel and FS Ricardo Galli
2002-02-01 20:44 ` Andrew Morton
2002-02-01 20:49   ` Ricardo Galli
2002-02-01 20:57     ` Andrew Morton
2002-02-01 21:05   ` Steve Lord
2002-02-02  9:35     ` Chris Wedgwood
2002-02-02 10:25       ` Hans Reiser
2002-02-02 15:24       ` Chris Mason
2002-02-02 18:20       ` Chris Mason
2002-02-02 19:54         ` Andrea Arcangeli
2002-02-02 20:10           ` Chris Mason
2002-02-02 20:16             ` Stephen Lord
2002-02-02 20:50               ` Jeff Garzik
2002-02-03 13:40                 ` Stephen Lord [this message]
2002-02-03 14:09                   ` Chris Wedgwood
2002-02-03 15:05                     ` Stephen Lord
2002-02-03 22:44                       ` Chris Wedgwood
2002-02-04 15:04                         ` Jeff Garzik
2002-02-04 15:21                           ` Chris Mason
2002-02-04 15:15                         ` Steve Lord
2002-02-04 15:46                           ` Alan Cox
2002-02-04 16:02                             ` Steve Lord
2002-02-04 18:22                               ` Daniel Phillips
2002-02-04 19:11                                 ` Steve Lord
2002-02-04 18:29                             ` Joel Becker
2002-02-04 18:49                               ` Jeff Garzik
2002-02-04 18:55                                 ` Joel Becker
2002-02-04 19:16                                   ` Jeff Garzik
2002-02-02 17:14   ` Christoph Hellwig
     [not found] <E16WkQj-0005By-00@antoli.uib.es.suse.lists.linux.kernel>
     [not found] ` <3C5AFE2D.95A3C02E@zip.com.au.suse.lists.linux.kernel>
     [not found]   ` <1012597538.26363.443.camel@jen.americas.sgi.com.suse.lists.linux.kernel>
     [not found]     ` <20020202093554.GA7207@tapu.f00f.org.suse.lists.linux.kernel>
     [not found]       ` <234710000.1012674008@tiny.suse.lists.linux.kernel>
     [not found]         ` <20020202205438.D3807@athlon.random.suse.lists.linux.kernel>
     [not found]           ` <242700000.1012680610@tiny.suse.lists.linux.kernel>
     [not found]             ` <3C5C4929.5080403@sgi.com.suse.lists.linux.kernel>
     [not found]               ` <20020202155028.B26147@havoc.gtf.org.suse.lists.linux.kernel>
2002-02-03  7:26                 ` Andi Kleen
2002-02-04 15:13                   ` Jeff Garzik
2002-02-04 15:31                     ` Chris Mason

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3C5D3DE9.4080503@sgi.com \
    --to=lord@sgi.com \
    --cc=akpm@zip.com.au \
    --cc=andrea@suse.de \
    --cc=cw@f00f.org \
    --cc=gallir@uib.es \
    --cc=garzik@havoc.gtf.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mason@suse.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.