From: Jakub Narebski <jnareb@gmail.com>
To: Jeff King <peff@peff.net>
Cc: "Jonathan Nieder" <jrnieder@gmail.com>,
"Ævar Arnfjörð Bjarmason" <avarab@gmail.com>,
"Clemens Buchacher" <drizzd@aon.at>,
git@vger.kernel.org, "Junio C Hamano" <gitster@pobox.com>,
"Jakub Narebski" <jnareb@gmail.com>
Subject: Re: generation numbers (was: [PATCH 0/4] Speed up git tag --contains)
Date: Wed, 06 Jul 2011 07:26:53 -0700 (PDT) [thread overview]
Message-ID: <m3mxgr4has.fsf_-_@localhost.localdomain> (raw)
In-Reply-To: <20110706070311.GA3790@sigill.intra.peff.net>
Jeff King <peff@peff.net> writes:
> On Wed, Jul 06, 2011 at 01:56:23AM -0500, Jonathan Nieder wrote:
> > Jeff King wrote:
> >
> > > The problem is that existing objects don't have this generation number.
> > > It's easy to calculate, though, and we could in theory use a notes-cache
> > > to store it externally. Obviously the complexity and performance aren't
> > > going to be as good as if it were just in the commit object, but we're
> > > sadly 6 years too late to make that decision.
> >
> > I am still digesting the rest of what you wrote, but wouldn't this be
> > easy to do today? One could just use a notes-cache while prototyping
> > and if it seems to work well, introduce new loose and packed object
> > formats that include a field for the cached generation number.
>
> Yes, that's exactly how to do it. I'm just not sure "introduce new loose
> and packed object formats" is "easy to do". Though I'm not sure we need
> new formats. It is really just a new header in the commit object. And if
> we write the code carefully, we should be able to transparently use
> newly-generated objects with the field, and fall back to a notes-cache
> (with autogeneration) when it isn't there.
I understand that you would do autogeneration at least when you create
a commit, and at least one of parents does not have generation number.
You can also autogenerate notes-cache when following commits, and
encountering commit object without generation number.
Or make "git gc" autogenerate cache-notes for generation number,
perhaps with an option (i.e. probably not for "git gc --auto").
> Existing git will ignore the new generation field. It does mean that old
> and new git will generate different sha1s for the exact same commit. I
> don't know how big a deal this is in practice. It matters a lot more for
> blobs and trees. But for commits, even if you are replaying a commit,
> you should be updating the commit timestamp, which is going to give a
> new sha1.
>
> The other thing I worry about is performance. You are building a full
> notes tree and looking up every commit in the traversal. I don't know
> how bad that will be (though from my other back-of-the-envelope tests,
> it may not actually be that bad; notes were designed to be fast for
> exactly this case).
Well, one thing that it would test our notes infrastructure...
--
Jakub Narebski
Poland
ShadeHawk on #git
next prev parent reply other threads:[~2011-07-06 14:27 UTC|newest]
Thread overview: 28+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-06-11 19:04 [PATCH 0/4] Speed up git tag --contains Ævar Arnfjörð Bjarmason
2011-06-11 19:04 ` [PATCH 1/4] tag: speed up --contains calculation Ævar Arnfjörð Bjarmason
2011-06-11 19:04 ` [PATCH 2/4] limit "contains" traversals based on commit timestamp Ævar Arnfjörð Bjarmason
2011-06-11 19:04 ` [PATCH 3/4] default core.clockskew variable to one day Ævar Arnfjörð Bjarmason
2011-06-11 19:04 ` [PATCH 4/4] Why is "git tag --contains" so slow? Ævar Arnfjörð Bjarmason
2011-07-06 6:40 ` [PATCH 0/4] Speed up git tag --contains Jeff King
2011-07-06 6:54 ` Jeff King
2011-07-06 19:06 ` Clemens Buchacher
2011-07-06 6:56 ` Jonathan Nieder
2011-07-06 7:03 ` Jeff King
2011-07-06 14:26 ` Jakub Narebski [this message]
2011-07-06 15:01 ` generation numbers (was: [PATCH 0/4] Speed up git tag --contains) Ted Ts'o
2011-07-06 18:12 ` Jeff King
2011-07-06 18:46 ` Jakub Narebski
2011-07-07 18:59 ` Jeff King
2011-07-07 19:34 ` generation numbers Junio C Hamano
2011-07-07 20:31 ` Jakub Narebski
2011-07-07 20:52 ` A Large Angry SCM
2011-07-08 0:29 ` Junio C Hamano
2011-07-08 22:57 ` Jeff King
2011-07-06 23:22 ` Junio C Hamano
2011-07-07 19:08 ` Jeff King
2011-07-07 20:10 ` Jakub Narebski
2018-01-12 18:56 ` [PATCH 0/4] Speed up git tag --contains csilvers
2018-03-03 5:15 ` Jeff King
2018-03-08 23:05 ` csilvers
2018-03-12 13:45 ` Derrick Stolee
2018-03-12 23:59 ` Jeff King
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=m3mxgr4has.fsf_-_@localhost.localdomain \
--to=jnareb@gmail.com \
--cc=avarab@gmail.com \
--cc=drizzd@aon.at \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=jrnieder@gmail.com \
--cc=peff@peff.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.