From: Andreas Dilger <adilger@clusterfs.com>
To: Helge Hafting <helgehaf@aitel.hist.no>
Cc: ptb@it.uc3m.es, linux-kernel@vger.kernel.org
Subject: Re: [RFC] mount flag "direct" (fwd)
Date: Wed, 4 Sep 2002 02:39:16 -0600 [thread overview]
Message-ID: <20020904083916.GX32468@clusterfs.com> (raw)
In-Reply-To: <3D75B344.66D4166@aitel.hist.no>
On Sep 04, 2002 09:16 +0200, Helge Hafting wrote:
> Your idea about re-reading stuff over and over isn't going to help
> because that sort of thing consumes much more bandwith. Caches help
> because they _avoid_ data transfers. So shared writeable data
> will happen, and it will use some sort of cache coherency,
> for performance reasons.
You assume too much about the applications. For example, Oracle
does not want _any_ cacheing to be done by the OS, because it
manages the cache itself, and would rather allocate the full amount
of RAM itself instead of the OS duplicating data it is cacheing
internally.
Similarly, there are many "write only" applications that are only
hindered by OS cache, such as any kind of high-speed data recording
(video, particle accelerators, scientific computing, etc) which is
using most of the RAM for internal structures and wants the data it
writes to go directly to disk at the highest possible speed.
> I claim that making a new fs from scratch for the distributed
> case is easier than tweaking ext2 and 10-20 other existing fs'es
> to work in such an environment. Making a new fs from scratch
> isn't such a big deal after all.
The problem isn't making a new fs, the problem is making a _good_
new fs. It takes at least several years of development, testing,
tuning, etc to get just a local fs right, if not longer (i.e.
reiserfs, JFS, XFS, ext3, etc). Add in the complexity of the
network side of things and it just gets that much harder to do
it all well.
We have taken the approach that local filesystems do a good job
with the "one node" assumption, so just use them as-is to
do a job they are good at. All of the network and locking code
for Lustre is outside of the filesystem, and the "local" filesystems
are used for storing either the directory structure + attributes
(for the metadata server), or file data (for the storage targets).
Local filesystems can do both of those jobs very well already, so
no need to re-invent the wheel.
See http://www.lustre.org/docs.html for lots of papers and
documentation on the design of Lustre.
Cheers, Andreas
--
Andreas Dilger
http://www-mddsp.enel.ucalgary.ca/People/adilger/
http://sourceforge.net/projects/ext2resize/
next prev parent reply other threads:[~2002-09-04 8:36 UTC|newest]
Thread overview: 28+ messages / expand[flat|nested] mbox.gz Atom feed top
2002-09-03 15:39 [RFC] mount flag "direct" (fwd) Peter T. Breuer
2002-09-03 15:44 ` Rik van Riel
2002-09-03 15:50 ` Peter T. Breuer
2002-09-03 15:56 ` Chris Wedgwood
2002-09-03 15:59 ` Peter T. Breuer
2002-09-03 16:09 ` Richard B. Johnson
2002-09-03 16:29 ` Peter T. Breuer
2002-09-03 16:33 ` Rik van Riel
2002-09-03 17:32 ` Richard B. Johnson
2002-09-03 18:53 ` Lars Marowsky-Bree
2002-09-03 21:07 ` Peter T. Breuer
2002-09-03 21:15 ` Andreas Dilger
2002-09-03 21:15 ` Rik van Riel
2002-09-03 21:54 ` Anton Altaparmakov
2002-09-03 22:46 ` Andreas Dilger
2002-09-03 23:19 ` Daniel Phillips
2002-09-04 0:18 ` Anton Altaparmakov
2002-09-04 5:23 ` David Lang
2002-09-04 7:16 ` Helge Hafting
2002-09-04 8:39 ` Andreas Dilger [this message]
2002-09-04 12:07 ` Helge Hafting
2002-09-04 13:03 ` Hans Reiser
2002-09-04 8:41 ` Peter T. Breuer
2002-09-04 7:50 ` Joachim Breuer
[not found] ` <3D75F8B0.8C7E974E@aitel.hist.no>
2002-09-04 21:26 ` Joachim Breuer
2002-09-04 9:26 ` Lars Marowsky-Bree
2002-09-03 16:58 ` Anton Altaparmakov
2002-09-03 17:26 ` Peter T. Breuer
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20020904083916.GX32468@clusterfs.com \
--to=adilger@clusterfs.com \
--cc=helgehaf@aitel.hist.no \
--cc=linux-kernel@vger.kernel.org \
--cc=ptb@it.uc3m.es \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.