From: Thomas Gummerer <t.gummerer@gmail.com>
To: Junio C Hamano <gitster@pobox.com>
Cc: git@vger.kernel.org, mhagger@alum.mit.edu, pclouds@gmail.com,
trast@student.ethz.ch
Subject: Re: [GSoC] Designing a faster index format - Progress report week 13
Date: Tue, 17 Jul 2012 10:21:30 +0200 [thread overview]
Message-ID: <20120717082130.GB1849@tgummerer.surfnet.iacbox> (raw)
In-Reply-To: <7vwr23zb65.fsf@alter.siamese.dyndns.org>
On 07/16, Junio C Hamano wrote:
> Thomas Gummerer <t.gummerer@gmail.com> writes:
>
> > == Work done in the previous 12 weeks ==
> >
> > - Definition of a tentative index file v5 format [1]. This differs
> > from the proposal in making it possible to bisect the directory
> > entries and file entries, to do a binary search. The exact bits
> > for each section were also defined. To further compress the index,
> > along with prefix compression, the stat data is hashed, since
> > it's only used for comparison, but the plain data is never used.
>
> s/comparison/equality comparison/ perhaps?
>
Exactly, thanks.
> > Thanks to Michael Haggerty, Nguyen Thai Ngoc Duy, Thomas Rast
> > and Robin Rosenberg for feedback.
>
> > - Read the index format format and translate it to the current in
>
> s/format format/on-disk file format/ or something?
>
Yes, thanks.
> > memory format. This doesn't include reading any of the current
> > extensions, which are now part of the main index. The code again
> > is on github. [4] Thanks for reviewing the first steps to Thomas
> > Rast.
>
> > - Started implementing the writer, which extracts the directories from
> > the in-memory format, and writes the header and the directories to
> > disk.
> > - I found a few bugs in the algorithm for extracting the directories
> > and decided to completely rewrite it, using a hash table instead of
> > simple lists, since the old one would have to many corner cases to
> > handle.
>
> What does "the algorithm" refer to? Is it the one described in the
> previous bullet point, or is it the code in production? If latter,
> it would help to separate out the task to fix the breakage, as
> people with the current or previous versions of Git will be
> negatively affected until that bug is fixed. If former, I am not
> sure if this task needs to be described in two bullet points ("I did
> X, X had bug so I redid X in a different way" is still a single task
> to do X).
It refers to the algorithm in the previous bullet point, which
extracts the directories, and can be included in the above bullet
point. Sorry for the confusion.
> > == Work done int the last week ==
> >
> > - Polished the patch for the ce_namelen field. The thread for the
> > patch can be found at [5].
>
> Thanks for this one; I think it is ready for 'next', but if you are
> still not satisfied I do not mind waiting for further perfection.
Thanks, I'm satisfied with it, for me it can be merged to 'next'.
next prev parent reply other threads:[~2012-07-17 8:21 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-07-16 20:33 [GSoC] Designing a faster index format - Progress report week 13 Thomas Gummerer
2012-07-16 20:46 ` Junio C Hamano
2012-07-17 8:21 ` Thomas Gummerer [this message]
2012-07-17 8:24 ` Thomas Gummerer
2012-07-22 15:22 ` Robin Rosenberg
2012-07-22 18:52 ` Junio C Hamano
2012-07-22 19:43 ` Thomas Rast
2012-07-22 21:08 ` Junio C Hamano
2012-07-23 22:29 ` Robin Rosenberg
2012-07-24 11:54 ` Thomas Rast
2012-07-24 16:58 ` Junio C Hamano
2012-07-25 6:44 ` Thomas Rast
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20120717082130.GB1849@tgummerer.surfnet.iacbox \
--to=t.gummerer@gmail.com \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=mhagger@alum.mit.edu \
--cc=pclouds@gmail.com \
--cc=trast@student.ethz.ch \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).