From: Jeff King <peff@peff.net>
To: Junio C Hamano <gitster@pobox.com>
Cc: Michael J Gruber <git@drmicha.warpmail.net>,
Michal Hocko <mhocko@suse.cz>,
git@vger.kernel.org
Subject: Re: git describe --contains doesn't work properly for a commit
Date: Thu, 5 Mar 2015 00:12:11 -0500
Message-ID: <20150305051211.GA3344@peff.net>
In-Reply-To: <xmqq7fuw8pgq.fsf@gitster.dls.corp.google.com>

On Wed, Mar 04, 2015 at 12:41:57PM -0800, Junio C Hamano wrote:
> > Calculating them is simple. Caching and storage is the bigger question.
>
> Yes, also having to handle the ones whose generation numbers haven't
> been computed yet adds to the complexity.

I'm not sure it's that bad. If you cache generation numbers for all
known commits when you repack, then worst case you have to traverse all
commits not in the pack.
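
To make the "worst case" concrete, here is a minimal sketch in Python (not git's actual code). It assumes a hypothetical `parents` map from commit id to parent ids and a `cache` dict holding the generation numbers stored at the last repack; numbering roots as generation 1 is just a convention picked for the sketch. Only commits missing from the cache get visited.

```python
from typing import Dict, List


def fill_generations(parents: Dict[str, List[str]], cache: Dict[str, int]) -> int:
    """Fill in generation numbers for commits missing from `cache`.

    `parents` maps each commit id to its parent ids (hypothetical layout).
    `cache` holds numbers already stored, e.g. at the last repack.
    Returns how many commits had to be computed.
    """
    computed = 0
    for tip in parents:
        stack = [tip]  # explicit stack, so deep histories do not hit recursion limits
        while stack:
            commit = stack[-1]
            if commit in cache:
                stack.pop()
                continue
            missing = [p for p in parents[commit] if p not in cache]
            if missing:
                # Parents must be numbered before the child.
                stack.extend(missing)
                continue
            # Roots get 1; everything else is one more than its highest parent.
            cache[commit] = 1 + max((cache[p] for p in parents[commit]), default=0)
            computed += 1
            stack.pop()
    return computed
```

With `a`, `b` and `c` already cached and `d`, `e` created since the repack, only the two new commits are computed.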
> This one, and $gmane/264101, are a few instances of this known issue
> raised here recently.

If $gmane/264101 is caused by clock skew, I'd find that disturbing.
Those algorithms are supposed to be "correct, but slower" in the face of
skew, not ever incorrect.

> I have been wondering if we can do something
> along the following (these are not alternatives) as a cheaper
> workaround:
>
> (1) Introduce '--skewed-timestamps[=(allow|warn|reject)]' to all
> commands that create new commit objects. If the committer
> timestamp being used is older than any of the parent commits,
> "warn" or "reject" depending on the setting.

I think this idea has come up before. If it's _your_ timestamp that is
screwed up, this detects it, which is good. But if it's somebody else's
timestamp that is screwed up, there's often not much you can do. It's
already baked into the history.
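
As a rough illustration of the check proposed in (1), here is a Python sketch. The function name, the exception, and the wording of the messages are all hypothetical; the option exists only as a proposal in this thread, and this is not how git implements anything. Note that it only ever looks at the commit being created, which is the limitation described above.

```python
import sys
from typing import List


class SkewedTimestampError(Exception):
    """Raised in "reject" mode when the new commit predates a parent."""


def check_committer_time(commit_time: int, parent_times: List[int],
                         mode: str = "warn") -> bool:
    """Return True if the new commit is older than any of its parents.

    `mode` mirrors the proposed allow|warn|reject values (hypothetical).
    Timestamps are seconds since the epoch.
    """
    if mode not in ("allow", "warn", "reject"):
        raise ValueError(f"unknown mode: {mode}")
    skewed = any(commit_time < parent for parent in parent_times)
    if skewed and mode == "reject":
        raise SkewedTimestampError("committer timestamp is older than a parent")
    if skewed and mode == "warn":
        print("warning: committer timestamp is older than a parent", file=sys.stderr)
    return skewed
```

A skewed parent that is already in the history would trip this check on every new child, even though the new commit's clock is fine.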
I don't mind it as an extra layer of protection, I guess. But my
recollection of the great skew survey[1] is that most of these problems
don't come from actual clock skew, but from software bugs or bogus data
in imported commits. True skew is generally less than a day, and can be
handled with a fixed slop time.
[1] http://article.gmane.org/gmane.comp.version-control.git/159065
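
To show what "handled with a fixed slop time" means, here is a simplified Python sketch of a timestamp-limited walk. It is an illustration under assumptions, not git's real traversal: `parents` and `times` are hypothetical dicts, and the one-day default is simply the bound mentioned above.

```python
from typing import Dict, List

ONE_DAY = 86400  # seconds


def contains_by_date(tip: str, target: str,
                     parents: Dict[str, List[str]],
                     times: Dict[str, int],
                     slop: int = ONE_DAY) -> bool:
    """Is `target` reachable from `tip`?

    The walk stops descending at commits dated before the target's
    timestamp minus `slop`, on the assumption that their ancestors
    are older still.
    """
    cutoff = times[target] - slop
    seen = set()
    stack = [tip]
    while stack:
        commit = stack.pop()
        if commit == target:
            return True
        if commit in seen:
            continue
        seen.add(commit)
        if times[commit] < cutoff:
            continue  # too old; skew larger than `slop` would fool this test
        stack.extend(parents[commit])
    return False
```

In a linear history `a <- b <- c` where `b` is dated 100 seconds before its parent `a`, a walk with `slop=0` gives up at `b` and misses `a`, while the one-day slop walks through it.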
> (2) Compute a bitmap whose timestamps are suspect when we pack to
> mark commits. When revision.c:limit_list() tries to see if
> there still are interesting commits, an UNINTERESTING commit
> marked as such shouldn't be counted as "not interesting because
> it is old enough". Use the same hint in the walker used in
> "describe --contains".

If you see mismatched timestamps between a parent and child commit, it's
often not clear which one is suspicious. Is the parent skewed to the
future, or is the child skewed to the past? Which one do you mark as
suspect?

IMHO, if you are going to go to the trouble to detect and store skew,
you should just go to the trouble to calculate and store reliable
generation numbers.
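
The appeal of generation numbers is that a cutoff based on them cannot be fooled by bad dates. A minimal Python sketch, assuming a hypothetical `gen` dict in which every commit's number is strictly greater than each of its parents' numbers (again, not git's actual code):

```python
from typing import Dict, List


def contains_by_generation(tip: str, target: str,
                           parents: Dict[str, List[str]],
                           gen: Dict[str, int]) -> bool:
    """Is `target` reachable from `tip`?

    Every ancestor of a commit has a strictly smaller generation number,
    so a commit whose number is not above the target's cannot have the
    target among its ancestors. No timestamps are consulted.
    """
    seen = set()
    stack = [tip]
    while stack:
        commit = stack.pop()
        if commit == target:
            return True
        if commit in seen or gen[commit] <= gen[target]:
            continue  # safe to prune regardless of what the dates say
        seen.add(commit)
        stack.extend(parents[commit])
    return False
```

The pruning rule depends only on the shape of the history, so there is no question of which commit in a mismatched pair to treat as suspect.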
-Peff