From: Jean-Luc Herren <jlh@gmx.ch>
To: Michael J Gruber <git@drmicha.warpmail.net>
Cc: Glenn Griffin <ggriffin.kernel@gmail.com>,
Git Mailing List <git@vger.kernel.org>,
Junio C Hamano <gitster@pobox.com>
Subject: Re: Bad git status performance
Date: Fri, 21 Nov 2008 21:07:16 +0100 [thread overview]
Message-ID: <492714F4.1090807@gmx.ch> (raw)
In-Reply-To: <4926D196.3000301@drmicha.warpmail.net>
Michael J Gruber wrote:
> Experimenting further: Using 10 files with 10MB each (rather than 100
> times 1MB) brings down the time by a factor 10 roughly - and so does
> using 100 files with 100k each. Huh? Latter may be expected (10MB
> total), but former (100MB total)?
100 files at each 100k gives me 1.73s, so about 10x speed up. So
it seems git indeed looks at the content of the files and having a
tenth of the content means it's ten times as fast.
Interestingly, using only a single file of 100MB gives me 0.6s.
Which is still very slow for the job of telling that a 100MB file
is not equal to a 1 byte file. And certainly there's no renaming
going on with a single file.
> Now it's getting funny: Changing your "echo >" to "echo ">>" (in your
> 100 files 1MB case) makes things "almost fast" again (1.3s).
Same here and that's pretty interesting, because in this situation
I can understand the slow down: Comparing two 1MB files that
differ only at their ends is expected to take some time, as you
have to go through the entire file until you notice they're not
the same.
jlh
prev parent reply other threads:[~2008-11-21 20:08 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-11-21 0:28 Bad git status performance Jean-Luc Herren
2008-11-21 0:42 ` David Bryson
[not found] ` <c9e534200811201711y887ddd2t33013ec4a7db3c9a@mail.gmail.com>
2008-11-21 12:46 ` Jean-Luc Herren
2008-11-21 15:19 ` Michael J Gruber
2008-11-21 20:07 ` Jean-Luc Herren [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=492714F4.1090807@gmx.ch \
--to=jlh@gmx.ch \
--cc=ggriffin.kernel@gmail.com \
--cc=git@drmicha.warpmail.net \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).