From: Avi Kivity <avi@qumranet.com>
To: Theodore Tso <tytso@mit.edu>, Ulrich Drepper <drepper@gmail.com>,
Soeren Sandmann <sandmann@daimi.au.dk>,
linux-kernel@vger.kernel.org
Subject: Re: stat benchmark
Date: Mon, 28 Apr 2008 14:59:26 +0300 [thread overview]
Message-ID: <4815BC1E.6020805@qumranet.com> (raw)
In-Reply-To: <20080428115321.GD30840@mit.edu>
Theodore Tso wrote:
>> Aside from what has already been proposed there is also the
>> readdirplus() route. Unfortunately the people behind this and related
>> proposals vanished after the last discussions. I was hoping they
>> come back with a revised proposal but perhaps not. Maybe it's time to
>> pick up the ball myself.
>>
>> As a reminder, readdirplus() is an extended readdir() which also
>> returns (a subset of) the stat information for the file at the same
>> time. The subset part is needed to account for the different
>> information contained in the inodes. For most applications the subset
>> should be sufficient and therefore all that's needed is a single
>> iteration over the directory.
>>
>
> I'm not sure this would help in the cold cache case, which is what
> Soeren originally complained about.[1] The problem is whaever
> information the user might need won't be store in the directory, so
> the filesystem would end having to stat the file anyway, incurring a
> disk seek, which was what the user was complaining about. A
> readdirplus() would save a whole bunch of system calls if the inode
> was already cached, yes, but I'm not sure that's it would be worth the
> effort given how small Linux's system call overhead would be. But in
> the cold cache case, you end up seeking all over the disk, and the
> only thing you can do is to try to keep the inodes close to each
> other, and to have either readdir() or the caller of readdir() sort
> all of the returned directory entries by inode number to avoid seeking
> all over the disk.
>
A readdirplus() could sort the inodes according to the filesystem's
layout, and additionally issue the stats in parallel, so if you have
multiple spindles you get significant additional speedup.
--
error compiling committee.c: too many arguments to function
next prev parent reply other threads:[~2008-04-28 11:59 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-04-24 20:59 stat benchmark Soeren Sandmann
2008-04-24 21:42 ` Carl Henrik Lunde
2008-04-24 21:44 ` Jan Engelhardt
2008-04-25 2:27 ` Justin Banks
2008-04-25 7:01 ` Christoph Hellwig
2008-04-25 19:48 ` Theodore Tso
2008-04-27 23:29 ` Soeren Sandmann
2008-04-28 0:13 ` Carl Henrik Lunde
2008-04-28 19:41 ` Alexander Larsson
2008-04-28 2:10 ` Theodore Tso
2008-04-27 22:40 ` Carl Henrik Lunde
2008-04-28 17:46 ` Zach Brown
2008-04-28 4:43 ` Ulrich Drepper
2008-04-28 11:53 ` Theodore Tso
2008-04-28 11:59 ` Avi Kivity [this message]
2008-04-28 13:31 ` Theodore Tso
2008-04-28 16:18 ` J. Bruce Fields
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4815BC1E.6020805@qumranet.com \
--to=avi@qumranet.com \
--cc=drepper@gmail.com \
--cc=linux-kernel@vger.kernel.org \
--cc=sandmann@daimi.au.dk \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox