From: Linus Torvalds <torvalds@linux-foundation.org>
To: Jeff King <peff@peff.net>
Cc: Jakub Narebski <jnareb@gmail.com>,
Martin Langhoff <martin.langhoff@gmail.com>,
Git Mailing List <git@vger.kernel.org>
Subject: Re: Why Git is so fast (was: Re: Eric Sink's blog - notes on git, dscms and a "whole product" approach)
Date: Fri, 1 May 2009 14:43:49 -0400 (EDT) [thread overview]
Message-ID: <alpine.LFD.2.00.0905011431460.5379@localhost.localdomain> (raw)
In-Reply-To: <20090430142244.GA23550@coredump.intra.peff.net>
On Thu, 30 Apr 2009, Jeff King wrote:
>
> Like all generalizations, this is only mostly true. Fast network servers
> with big caches can outperform disks for some loads.
That's _very_ few loads.
It doesn't matter how good a server you have, network filesystems
invariably suck.
Why? It's not that the network or the server sucks - you can easily find
beefy NAS setups that have big raids etc and are much faster than most
local disks.
And they _still_ suck.
Simple reason: caching. It's a lot easier to cache local filesystems. Even
modern networked filesystems (ie NFSv4), that do a pretty good job on a
file-per-file basis with delegations etc, and they still tend to suck
horribly at metadata.
In contrast, a workstation with local filesystems and enough memory to
cache it well will just be a lot nicer.
> So I wouldn't rule out the possibility of a pleasant VCS experience on a
> network-optimized system backed by beefy servers on a local network.
Hey, you can always throw resources at it.
But no:
> I have never used perforce, but I get the impression that it is more
> optimized for such a situation.
I doubt it. I suspect git will outperform pretty much anything else in
that kind of situation too.
One thing that git does - and some other VCS's avoid - is to actually
stat() the whole working tree in order to not need special per-file "I use
this file" locking semantics. That can in theory make git slower over a
network filesystem than such (very broken) alternatives.
If your VCS requires that you mark all files for editing somehow (ie you
can't just use your favourite editor or scripting to modify files, but
have to use "p4 edit" to say that you're going to write to the file, and
the file is otherwise read-only), then such a VCS can - by being annoying
and in your way - do some things faster than git can.
And yes, perforce does that (the "p4 edit" command is real, and exists).
And yes, in theory that can probably mean that perforce doesn't care so
much about the metadata caching problem on network filesystems - because
p4 will maintain some file of its own that contains the metadata.
But I suspect that the git "async stat" ("core.preloadindex") thing means
that git will kick p4 *ss even on that benchmark, and be a whole lot more
pleasant to use. Even on networked filesystems.
Linus
next prev parent reply other threads:[~2009-05-01 18:46 UTC|newest]
Thread overview: 39+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-04-27 8:55 Eric Sink's blog - notes on git, dscms and a "whole product" approach Martin Langhoff
2009-04-28 11:24 ` Cross-Platform Version Control (was: Eric Sink's blog - notes on git, dscms and a "whole product" approach) Jakub Narebski
2009-04-28 21:00 ` Robin Rosenberg
2009-04-29 6:55 ` Martin Langhoff
2009-04-29 7:21 ` Jeff King
2009-04-29 20:05 ` Markus Heidelberg
2009-04-29 7:52 ` Cross-Platform Version Control Jakub Narebski
2009-04-29 8:25 ` Martin Langhoff
2009-04-28 18:16 ` Eric Sink's blog - notes on git, dscms and a "whole product" approach Jakub Narebski
2009-04-29 7:54 ` Sitaram Chamarty
2009-04-30 12:17 ` Why Git is so fast (was: Re: Eric Sink's blog - notes on git, dscms and a "whole product" approach) Jakub Narebski
2009-04-30 12:56 ` Michael Witten
2009-04-30 15:28 ` Why Git is so fast Jakub Narebski
2009-04-30 18:52 ` Shawn O. Pearce
2009-04-30 20:36 ` Kjetil Barvik
2009-04-30 20:40 ` Shawn O. Pearce
2009-04-30 21:36 ` Kjetil Barvik
2009-05-01 0:23 ` Steven Noonan
2009-05-01 1:25 ` James Pickens
2009-05-01 9:19 ` Kjetil Barvik
2009-05-01 9:34 ` Mike Hommey
2009-05-01 9:42 ` Kjetil Barvik
2009-05-01 17:42 ` Tony Finch
2009-05-01 5:24 ` Dmitry Potapov
2009-05-01 9:42 ` Mike Hommey
2009-05-01 10:46 ` Dmitry Potapov
2009-04-30 18:43 ` Why Git is so fast (was: Re: Eric Sink's blog - notes on git, dscms and a "whole product" approach) Shawn O. Pearce
2009-04-30 14:22 ` Jeff King
2009-05-01 18:43 ` Linus Torvalds [this message]
2009-05-01 19:08 ` Jeff King
2009-05-01 19:13 ` david
2009-05-01 19:32 ` Nicolas Pitre
2009-05-01 21:17 ` Daniel Barkalow
2009-05-01 21:37 ` Linus Torvalds
2009-05-01 22:11 ` david
2009-04-30 18:56 ` Nicolas Pitre
2009-04-30 19:16 ` Alex Riesen
2009-05-04 8:01 ` Why Git is so fast Andreas Ericsson
2009-04-30 19:33 ` Jakub Narebski
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=alpine.LFD.2.00.0905011431460.5379@localhost.localdomain \
--to=torvalds@linux-foundation.org \
--cc=git@vger.kernel.org \
--cc=jnareb@gmail.com \
--cc=martin.langhoff@gmail.com \
--cc=peff@peff.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).