All of lore.kernel.org
 help / color / mirror / Atom feed
From: Hans Reiser <reiser@namesys.com>
To: Ross Skaliotis <mross@rs-net.org>
Cc: reiserfs-list@namesys.com
Subject: Re: Calling stat with millions of files
Date: Tue, 08 Jun 2004 15:10:27 -0700	[thread overview]
Message-ID: <40C63953.2040603@namesys.com> (raw)
In-Reply-To: <Pine.LNX.4.58.0406081339010.3391@localhost.localdomain>

Ross Skaliotis wrote:

>>On Tue, 2004-06-08 at 12:47, Hans Reiser wrote:
>>    
>>
>>>really afford to do it just for you, sorry.
>>>      
>>>
>>>>You'll get better results with the new block allocator in 2.6.7-rcX-mm,
>>>>but in the end the stat information for the file isn't horribly close to
>>>>the directory entries, and performance won't be perfect.
>>>>
>>>>Hans, I thought reiser4 was going to be good at this kind of thing?
>>>>
>>>>
>>>>        
>>>>
>>>what in reiser4 optimizes accesses to hard links to files whose stat
>>>data is stored in other directories?  Maybe the stat data being stored
>>>near other stat data instead of near file bodies will help,.  Hmmm.
>>>Could be, have to try it to see.
>>>      
>>>
>>He said above that it creates hard links "when it can", not sure what
>>percentage of the time this actually happens.
>>    
>>
>
>Most files have several hard links. Common files shared between many
>backed up computers (windows system files, etc.) can have up to 500 hard
>links.
>
>  
>
>>hardlinks destroy locality of reference for stat data, this is probably
>>your problem.
>>    
>>
>
>So this would be a problem with any filesystem, right? I don't understand
>filesystems nearly as well as others on this list, but am I correct in
>assuming the stat data would stay "near" the original file? If I can make
>my backup system only need to stat original files (which may have hard
>links linked to _them_) would my performance issues go away?
>  
>
In reiser4 it would probably cure it, probably not in reiser3, though it 
might help, and I could be wrong....

>Thanks and sorry for my ignorance,
>
>-Ross
>
>
>  
>


  reply	other threads:[~2004-06-08 22:10 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-06-07 21:54 Calling stat with millions of files Ross Skaliotis
2004-06-08 10:03 ` Vladimir Saveliev
2004-06-08 12:00   ` Heinz-Josef Claes
2004-06-08 16:28 ` Hans Reiser
2004-06-08 16:38   ` Chris Mason
2004-06-08 16:47     ` Hans Reiser
2004-06-08 16:51       ` Chris Mason
2004-06-08 16:58         ` Hans Reiser
2004-06-08 17:57         ` Ross Skaliotis
2004-06-08 22:10           ` Hans Reiser [this message]
2004-06-08 22:05         ` Carl-Daniel Hailfinger
2004-06-08 18:58 ` Mike Benoit
2004-06-08 20:29   ` Henning Westerholt

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=40C63953.2040603@namesys.com \
    --to=reiser@namesys.com \
    --cc=mross@rs-net.org \
    --cc=reiserfs-list@namesys.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.