From: Nicolas Pitre <nico@cam.org>
To: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Geert Bosch <bosch@adacore.com>,
"Shawn O. Pearce" <spearce@spearce.org>,
Dana How <danahow@gmail.com>,
git@vger.kernel.org
Subject: Re: [RFC] Packing large repositories
Date: Sat, 31 Mar 2007 15:02:56 -0400 (EDT) [thread overview]
Message-ID: <alpine.LFD.0.98.0703311459000.28181@xanadu.home> (raw)
In-Reply-To: <Pine.LNX.4.64.0703311033290.6730@woody.linux-foundation.org>
On Sat, 31 Mar 2007, Linus Torvalds wrote:
> The thing is, I think the index-pack could be improved, but I think we can
> get a bigger improvement from just being more intelligent about searching
> than from actually needing to re-organize the pack.
>
> Here's a few clues:
> - one of the fundamental rules about cryptographic hashes is that they
> are *evenly*distributed*
I was thinking about that too.
> So here's a suggestion:
>
> - start finding a range using the 256-entry fan-out, exactly the way we
> did for the binary search. It's cheap. We could probably avoid EVEN
> THIS, and just do one more newton-raphson iteration more. But since we
> have the data, we migth as well use it. After all, it really *is* just
> a first approximation of newton-raphson, and while it uses only 8 bits
> (and we could do better), at least it's an *exact* one.
>
> - use newton-raphson to iterate closer. It should be a much faster way to
> find the rough area for the entry we're searching for than binary
> search. Two or three iterations should get us there, easily.
>
> - do a linear search once you're close enough.
I like that.
Nicolas
next prev parent reply other threads:[~2007-03-31 19:03 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-03-28 7:05 [RFC] Packing large repositories Dana How
2007-03-28 16:53 ` Linus Torvalds
2007-03-30 6:23 ` Shawn O. Pearce
2007-03-30 13:01 ` Nicolas Pitre
2007-03-31 11:04 ` Geert Bosch
2007-03-31 18:36 ` Linus Torvalds
2007-03-31 19:02 ` Nicolas Pitre [this message]
2007-03-31 20:54 ` Junio C Hamano
2007-03-31 21:20 ` Linus Torvalds
2007-03-31 21:56 ` Linus Torvalds
2007-04-02 6:22 ` Geert Bosch
2007-04-03 5:39 ` Shawn O. Pearce
2007-03-31 18:51 ` Nicolas Pitre
2007-04-02 21:19 ` Dana How
2007-04-02 1:39 ` Sam Vilain
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=alpine.LFD.0.98.0703311459000.28181@xanadu.home \
--to=nico@cam.org \
--cc=bosch@adacore.com \
--cc=danahow@gmail.com \
--cc=git@vger.kernel.org \
--cc=spearce@spearce.org \
--cc=torvalds@linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).