From: "Xin Zhao" <uszhaoxin@gmail.com>
To: "Al Viro" <viro@ftp.linux.org.uk>
Cc: "Theodore Ts'o" <tytso@mit.edu>,
mingz@ele.uri.edu, mikado4vn@gmail.com,
linux-kernel <linux-kernel@vger.kernel.org>,
linux-fsdevel@vger.kernel.org
Subject: Re: Question regarding to store file system metadata in database
Date: Mon, 20 Mar 2006 17:53:08 -0500 [thread overview]
Message-ID: <4ae3c140603201453p113c8c11h3029d1d55d209e27@mail.gmail.com> (raw)
In-Reply-To: <20060320195858.GD27946@ftp.linux.org.uk>
Apparently this comparison is not 100% fair. In my experiment, I
randomly pick pathname from 1.2 million path names to resolve the
inode number. But in your "cp -rl linux2.6 foo1" experiment, you
essentially did directory entry lookup sequentially, which maximize
the possible performance. If you do the same thing in a random
fashion, you will probably get much worse performance. As I said
before, I totally agree that 2000/sec is slow. But the point here is
whether 2000/sec is enough for most scenarios?
I am not saying existing FS implementation is not efficient. I agreed
that file system has been fully optimized. What I want to say is to
support complex mapping in the system I described before, we might
need some extension on existing file systems. Question is what is the
best extension. Consider how to allow user a, b to share physical copy
f.1, while allowing user c to use private copy f.2? The virtual
pathname to physical pathname should be transparent to end users. That
is, all the users should be able to access right file copies using
virtual path "f". The file system should be able to tell the different
identity and return the data from the right physical copy. That's what
we want to do. But it is hard to achieve without some extension. :)
Xin
On 3/20/06, Al Viro <viro@ftp.linux.org.uk> wrote:
> On Mon, Mar 20, 2006 at 02:36:51PM -0500, Xin Zhao wrote:
> > OK. Now I have more experimental results.
> >
> > After excluding the cost of reading file list and do stat(), the
> > insertion rate becomes 587/sec, instead of 300/sec. The query rate is
> > 2137/sec. I am runing mysql 4.1.11. FC4, 2.8G CPU and 1G mem.
> >
> > 2137/sec seems to be good enough to handle pathname to inode
> > resolving. Anyone has some statistics how many file open in a busy
> > file system?
>
> This is still ridiculously slow. From cold cache (i.e. with a lot of IO)
> cp -rl linux-2.6 foo1 gives 1.2s here. That's at least about 50000
> operations. On slower CPU, BTW, with half of the RAM you have.
>
> Moreover,
> al@duke:~/linux$ time mv foo1 foo2
>
> real 0m0.002s
> user 0m0.000s
> sys 0m0.001s
>
> Now, try _that_ on your setup. If you are using entire pathname as key,
> you are FUBAR - updating key in 20-odd thousand of records is going to
> _hurt_. If you are splitting the pathname into components, you've just
> reproduced fs directory structure and had shown that your fs layout
> is too fscking slow. Not to mention the fun with symlink implementation,
> or handling mountpoints.
>
> You are at least an order of magnitude off by performance (more, actually)
> and I still don't see any reason for what you are trying to do.
>
next prev parent reply other threads:[~2006-03-20 22:53 UTC|newest]
Thread overview: 25+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-03-19 4:48 Question regarding to store file system metadata in database Xin Zhao
2006-03-19 5:07 ` Mikado
2006-03-19 17:48 ` Xin Zhao
2006-03-19 17:58 ` Ming Zhang
2006-03-19 18:11 ` Xin Zhao
2006-03-19 18:26 ` Ming Zhang
2006-03-19 18:50 ` Xin Zhao
2006-03-19 19:47 ` Al Viro
[not found] ` <441DC2D6.4060001@societasilluminati.org>
2006-03-19 21:24 ` Ming Zhang
2006-03-20 13:09 ` Theodore Ts'o
2006-03-20 15:13 ` Xin Zhao
2006-03-20 19:36 ` Xin Zhao
2006-03-20 19:58 ` Al Viro
2006-03-20 22:53 ` Xin Zhao [this message]
2006-03-20 23:32 ` Al Viro
2006-03-20 21:08 ` Matti Aarnio
2006-03-20 22:28 ` Erez Zadok
2006-03-20 22:19 ` Theodore Ts'o
2006-03-21 6:51 ` Miklos Szeredi
2006-03-21 20:05 ` Pavel Machek
2006-03-22 15:21 ` Xin Zhao
2006-03-19 21:34 ` Ming Zhang
2006-03-20 8:30 ` Matti Aarnio
2006-03-19 23:06 ` Alan Cox
2006-03-19 23:44 ` Matthew Wilcox
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4ae3c140603201453p113c8c11h3029d1d55d209e27@mail.gmail.com \
--to=uszhaoxin@gmail.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=mikado4vn@gmail.com \
--cc=mingz@ele.uri.edu \
--cc=tytso@mit.edu \
--cc=viro@ftp.linux.org.uk \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).