From: Nicolas Pitre <nico@cam.org>
To: Johannes Schindelin <Johannes.Schindelin@gmx.de>
Cc: Sam Vilain <sam@vilain.net>, Nick Edelen <sirnot@gmail.com>,
Michael J Gruber <git@drmicha.warpmail.net>,
Junio C Hamano <gitster@pobox.com>, Jeff King <peff@peff.net>,
"Shawn O. Pearce" <spearce@spearce.org>,
Andreas Ericsson <exon@op5.se>,
Christian Couder <christian@couder.net>,
"git@vger.kernel.org" <git@vger.kernel.org>
Subject: Re: [PATCH 0/5] Suggested for PU: revision caching system to significantly speed up packing/walking
Date: Fri, 07 Aug 2009 11:00:42 -0400 (EDT) [thread overview]
Message-ID: <alpine.LFD.2.00.0908071029580.16073@xanadu.home> (raw)
In-Reply-To: <alpine.DEB.1.00.0908070808340.8306@pacific.mpi-cbg.de>
On Fri, 7 Aug 2009, Johannes Schindelin wrote:
> On Fri, 7 Aug 2009, Sam Vilain wrote:
>
> > Especially as in this case as Nick mentions, the domain is subtly
> > different (ie pack vs dag). Unfortunately you just can't try to pretend
> > that they will always be the same; you can't force a full repack on
> > every ref change!
>
> No, but you do not need that, either. In the setting that is most likely
> the most thankful one, i.e. a git:// server, you _want_ to keep the
> repository "as packed as possible", otherwise the rev cache improvements
> will be lost in the bad packing performance anyway.
Yes and no.
Currently, the number #1 latency in any initial git clone is the famous
"counting objects" phase, even if the repo is perfectly packed. And
that's all this rev cache can and will improve. The packing does play
its performance role of course, but for a totally different reason.
Hence the repository needs no be perfectly packed for a rev cache to
speed up its own part of the game.
> > > Still, there is some redundancy between the pack index and your cache,
> > > as you have to write out the whole list of SHA-1s all over again. I
> > > guess it is time to look at the code instead of asking stupid
> > > questions.
> > >
> >
> > "Disk is cheap" :-)
>
> Disk I/O ain't.
>
> (Size of the I/O caches, yaddayadda, I'm sure you get my point).
I don't know about the size of the rev cache on disk yet (I asked Nick
about that) nor do I really know how this cache is implemented. But I
know damn well about git packs and associated index and I for sure don't
want to see a revision cache coupled with it.
And for a clone the disk IO will certainly be a magnitude larger than
for the cache (or so I hope). Maybe the IO for the rev cache might be a
significant overhead for operations performed on a client (aka
developer) repository, in which case it would be a good idea to have a
config variable to control the cache size, or even to turn it off
entirely. We do it for delta depth and many other things already.
Nicolas
next prev parent reply other threads:[~2009-08-07 15:01 UTC|newest]
Thread overview: 34+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-08-06 9:55 [PATCH 0/5] Suggested for PU: revision caching system to significantly speed up packing/walking Nick Edelen
2009-08-06 14:48 ` Johannes Schindelin
2009-08-06 14:58 ` Michael J Gruber
2009-08-06 17:39 ` Nick Edelen
2009-08-06 19:06 ` Johannes Schindelin
2009-08-06 20:01 ` Nick Edelen
2009-08-06 20:30 ` Nick Edelen
2009-08-06 20:32 ` Shawn O. Pearce
2009-08-06 23:35 ` A Large Angry SCM
2009-08-06 23:37 ` Shawn O. Pearce
2009-08-06 23:43 ` A Large Angry SCM
2009-08-07 0:15 ` Nick Edelen
2009-08-07 6:05 ` Johannes Schindelin
2009-08-07 4:42 ` Nicolas Pitre
2009-08-07 2:47 ` Sam Vilain
2009-08-07 4:35 ` Nicolas Pitre
2009-08-07 6:08 ` Johannes Schindelin
2009-08-07 14:18 ` Nicolas Pitre
2009-08-08 15:18 ` Johannes Schindelin
2009-08-08 16:07 ` Junio C Hamano
2009-08-08 23:54 ` Sam Vilain
2009-08-09 2:37 ` Nicolas Pitre
2009-08-09 13:42 ` Nick Edelen
2009-08-07 6:12 ` Johannes Schindelin
2009-08-07 15:00 ` Nicolas Pitre [this message]
2009-08-07 22:02 ` Nick Edelen
2009-08-07 22:48 ` Junio C Hamano
2009-08-07 22:53 ` Nick Edelen
2009-08-08 3:11 ` Junio C Hamano
2009-08-08 7:27 ` Nick Edelen
2009-08-08 7:30 ` Jeff King
2009-08-08 7:40 ` Nick Edelen
2009-08-08 2:50 ` Jeff King
2009-08-08 18:57 ` Junio C Hamano
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=alpine.LFD.2.00.0908071029580.16073@xanadu.home \
--to=nico@cam.org \
--cc=Johannes.Schindelin@gmx.de \
--cc=christian@couder.net \
--cc=exon@op5.se \
--cc=git@drmicha.warpmail.net \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=peff@peff.net \
--cc=sam@vilain.net \
--cc=sirnot@gmail.com \
--cc=spearce@spearce.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).