From: Jeff King <peff@peff.net>
To: Junio C Hamano <gitster@pobox.com>
Cc: Duy Nguyen <pclouds@gmail.com>,
	Jason Paller-Rzepka <jasonpr@google.com>,
	Stefan Beller <sbeller@google.com>,
	"git@vger.kernel.org" <git@vger.kernel.org>,
	Dennis Kaarsemaker <dennis@kaarsemaker.net>
Subject: Re: Multiple fetches when unshallowing a shallow clone
Date: Mon, 7 Dec 2015 16:42:58 -0500
Message-ID: <20151207214257.GA32082@sigill.intra.peff.net>
In-Reply-To: <xmqqa8pmlzjc.fsf@gitster.mtv.corp.google.com>

On Mon, Dec 07, 2015 at 01:27:51PM -0800, Junio C Hamano wrote:

> Jeff King <peff@peff.net> writes:
>
> > I think one thing I was missing is that we need not just to grab the
> > _object_, but also to realize that the ref needs updating[1]. So we
> > cannot skip backfill of any tag that we do not already have, even if we
> > already have the tag object.
> > ...
> > [1] I'm still puzzled why find_non_local_tags uses has_sha1_file() on
> > the tag object at all, then.
>
> The designed semantics of auto-following tags (not necessarily as
> implemented or documented, i.e. there may be implementation or
> documentation bugs), I think, is to arrive at the same state as
> doing a fetch (or a push) without the auto-following and then doing
> a separate fetch (or a push) of tags that point at the objects that
> are reachable from the tips of refs after finishing the first
> (i.e. without auto-follow) fetch (or a push). In a scenario where
> we already have a commit reachable from existing remote-tracking
> branch and the current transfer (be it a fetch or a push, with or
> without auto-follow) does not update any remote-tracking branch
> (because the source side did not have any changes), if the source
> side added a tag that refers to that commit that the receiving end
> lacks, that tag needs to be transferred and then stored.
>
> So has_sha1_file() is not the right test---if anything, it needs to
> be checking if the object being checked is reachable from a tip of
> some ref.
>
> But of course, that test is rather expensive, so perhaps the
> implementation cheated and uses has_sha1_file() instead? The only
> case it would misidentify would be after an aborted fetch (or push)
> left unconnected island of objects and some of these objects that
> are not reachable are pointed at by tags the receiving end does not
> have.

I may have confused myself. There are actually two has_sha1_file() calls
in find_non_local_tags.

For the first one, I agree it is the only sensible test for "do we have
the commit this tag peels to, and if so, we want to grab the tag".
Reachability is too expensive to compute.
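
As a rough illustration of the gap between "we have the object" and "the
object is reachable from some ref", here is a minimal sketch. It builds
the kind of unconnected island mentioned in the quoted text by hand. The
directory layout, the demo identity, and the variable names are
assumptions made for the example; nothing here is taken from git's own
fetch code.

```shell
#!/bin/sh
# Sketch only: contrasts an existence check with a reachability check.
set -e
work=$(mktemp -d)
cd "$work"

# Throwaway identity so the commit does not depend on global config.
export GIT_AUTHOR_NAME=demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=demo GIT_COMMITTER_EMAIL=demo@example.com

git init -q repo
git -C repo commit -q --allow-empty -m base

# Create a commit object that no ref points at (an unconnected island).
tree=$(git -C repo rev-parse 'HEAD^{tree}')
island=$(git -C repo commit-tree -m island "$tree")

# Existence check: a single object lookup, the cheap test.
if git -C repo cat-file -e "$island"; then exists=yes; else exists=no; fi

# Reachability check: list the refs whose history contains the commit.
# This has to walk history, which is what makes it expensive in general.
reachable_from=$(git -C repo for-each-ref --contains "$island")

echo "object exists: $exists"
echo "refs containing it: '${reachable_from}'"
```

The two checks disagree only for objects like $island, which is the
narrow case the cheaper test would misidentify.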

But for the other one ("do we have the tag object itself"), I initially
claimed "if we have the tag object already, we do not have to do the
backfill fetch". Which is not quite true. We have to update the ref even
if we have the tag object. But then, what if we have the tag object for
other reasons (e.g., because another tag points at it)?

E.g., in this sequence:

  git init parent
  git -C parent commit --allow-empty -m base
  git -C parent tag -m mytag foo
  git clone parent child
  git -C parent update-ref refs/tags/bar foo
  git -C child fetch

we must backfill refs/tags/bar during the fetch, even though we already
have the object. I don't see any point in checking has_sha1_file() for
the tag object at all. If we don't have it, we obviously must fetch.
And if we do have it, we must fetch the ref, even if that results in no
objects transferred.
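
For anyone who wants to check this locally, the sequence can be wrapped
in a small self-contained script. This is a sketch under stated
assumptions: the temporary directory, the demo identity, and the
variable names (work, tag_object, before, after) are illustrative and
not part of the original sequence. It records whether the child has
refs/tags/bar before and after the fetch.

```shell
#!/bin/sh
# Sketch only: reproduces the parent/child tag sequence in a scratch dir.
set -e
work=$(mktemp -d)
cd "$work"

# Throwaway identity so the commit and tag do not depend on global config.
export GIT_AUTHOR_NAME=demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=demo GIT_COMMITTER_EMAIL=demo@example.com

git init -q parent
git -C parent commit -q --allow-empty -m base
git -C parent tag -m mytag foo
git clone -q parent child

# New ref on the parent side, pointing at the annotated tag object that
# the child already received through refs/tags/foo.
git -C parent update-ref refs/tags/bar foo
tag_object=$(git -C parent rev-parse --verify refs/tags/bar)

# The child already has the object (cat-file -e fails otherwise) ...
git -C child cat-file -e "$tag_object"
# ... but not the ref; for-each-ref prints nothing for a missing ref.
before=$(git -C child for-each-ref refs/tags/bar)

git -C child fetch -q

# rev-parse --verify fails, and set -e aborts, if the ref was not created.
after=$(git -C child rev-parse --verify refs/tags/bar)

echo "refs/tags/bar before fetch: '${before}'"
echo "refs/tags/bar after fetch:  ${after}"
echo "tag object on parent:       ${tag_object}"
```

If the backfill works as described, the ref is created in the child even
though no new objects need to be transferred.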

It's entirely possible I'm just confused, and AFAICT nobody has noticed
any breakage here, so please don't feel you need to spend a lot of time
humoring me. I'm just writing up my confusion for posterity. :)

-Peff