From: Felipe Contreras <felipe.contreras@gmail.com>
To: Michael J Gruber <git@drmicha.warpmail.net>
Cc: Jeff King <peff@peff.net>,
Johannes Schindelin <Johannes.Schindelin@gmx.de>,
git@vger.kernel.org, Junio C Hamano <gitster@pobox.com>,
Sverre Rabbelier <srabbelier@gmail.com>,
Ilari Liusvaara <ilari.liusvaara@elisanet.fi>,
Daniel Barkalow <barkalow@iabervon.org>
Subject: Re: [PATCH v4 00/13] New remote-hg helper
Date: Mon, 5 Nov 2012 16:36:56 +0100 [thread overview]
Message-ID: <CAMP44s1AELcssUZsa+2kCdbA7DLDDNFtWzGbOvMUP5AKnHByow@mail.gmail.com> (raw)
In-Reply-To: <5097C970.9010901@drmicha.warpmail.net>
On Mon, Nov 5, 2012 at 3:13 PM, Michael J Gruber
<git@drmicha.warpmail.net> wrote:
> Felipe Contreras venit, vidit, dixit 02.11.2012 19:01:
>> I talked with some people in #mercurial, and apparently there is a
>> concept of a 'changelog' that is supposed to store these changes, but
>> since the format has changed, the content of it is unreliable. That's
>> not a big problem because it's used mostly for reporting purposes
>> (log, query), not for doing anything reliable.
>
> Is the changelog stored in the repo (i.e. generated by the hg version at
> commit time) or generated on the fly (i.e. generated by the hg version
> at hand)? See also below.
I don't know. I would expect it to be the former, and then when the
format changes, generated by the tool that did the conversion.
>> To reliably see the changes, one has to compare the 'manifest' of the
>> revisions involved, which contain *all* the files in them.
>
> 'manifest' == '(exploded) tree', right? Just making sure my hg fu is not
> subzero.
Yeah, the tree. As I said, it contains all the files.
>> That's what I was doing already, but I found a more efficient way to
>> do it. msysGit is using the changelog, which is quite fast, but not
>> reliable.
>>
>> Unfortunately while going trough mercurial's code, I found an issue,
>> and it turns out that 1) is not correct.
>>
>> In mercurial, a file hash contains also the parent file nodes, which
>> means that even if two files have the same content, they would not
>> have the same hash, so there's no point in keeping track of them to
>> avoid extracting the data unnecessarily, because in order to make sure
>> they are different, you need to extract the data anyway, defeating the
>> purpose.
>
> Do I understand correctly that neither the msysgit version nor yours can
> detect duplicate blobs (without requesting them) because of that sha1 issue?
That's correct.
> I'm really wondering why a file blob hash carries its history along in
> the sha1. This appears completely strange to gitters (being brain washed
> about "content tracking"), but may be due to hg's extensive use of
> delta, or really: delta chains (which do have their merit on the server
> side).
It is a surprise to me too. I see absolutely no reason why that would be useful.
It seems like bazaar does store the file hashes without the parent
info, like git.
>> Which means mercurial doesn't really behave as one would expect:
>>
>> # add files with the same content
>>
>> $ echo a > a
>> $ hg ci -Am adda
>> adding a
>> $ echo a >> a
>> $ hg ci -m changea
>> $ echo a > a
>> $ hg st --rev 0
>> $ hg ci -m reverta
>> $ hg log -G --template '{rev} {desc}\n'
>> @ 2 reverta
>> |
>> o 1 changea
>> |
>> o 0 adda
>>
>> # check the difference between the first and the last revision
>>
>> $ hg st --rev 0:2
>> M a
>> $ hg cat -r 0 a
>> a
>> $ hg cat -r 2 a
>> a
>
> That is really scary. What use is "hg stat --rev" then? Not blaming you
> for hg, of course.
>
> On that tangent, I just noticed recently that hg has no python api.
> Seriously [1]. They even tell us not to use the internal python api.
> msysgit has been lacking support for newer hg, and you've had to add
> support for older versions (hg 1.9 will be around on quite some
> stable/LTS/EL distro releases) after developing on newer/current ones.
> I'm wondering how well that scales in the long term (telling from
> git-svn experience: it does not scale well), or whether using some
> stable api like 'hgapi' would be a huge bottleneck.
I don't know. I have never really used mercurial until recently. I
don't know how often they change their APIs and/or repository formats.
I would say the burden of updating to newer APIs is probably much less
than the burden of implementing code that accesses their repositories
directly, and eventually possibly rewriting the code when they change
the format.
If we were to access the repository directly, I would choose to use
Ruby for that, but given that 'we' is increasingly looking like 'I'. I
probably wouldn't.
Cheers.
--
Felipe Contreras
next prev parent reply other threads:[~2012-11-05 15:37 UTC|newest]
Thread overview: 75+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-10-28 3:54 [PATCH v4 00/13] New remote-hg helper Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 01/13] Add new remote-hg transport helper Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 02/13] remote-hg: add support for bookmarks Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 03/13] remote-hg: add support for pushing Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 04/13] remote-hg: add support for remote pushing Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 05/13] remote-hg: add support to push URLs Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 06/13] remote-hg: make sure the encoding is correct Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 07/13] remote-hg: match hg merge behavior Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 08/13] remote-hg: add support for hg-git compat mode Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 09/13] remote-hg: add compat for hg-git author fixes Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 10/13] remote-hg: fake bookmark when there's none Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 11/13] remote-hg: add support for fake remote Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 12/13] remote-hg: add tests to compare with hg-git Felipe Contreras
2012-10-28 3:54 ` [PATCH v4 13/13] remote-hg: add extra author test Felipe Contreras
2012-10-29 8:50 ` [PATCH v4 00/13] New remote-hg helper Jeff King
2012-10-29 14:56 ` Felipe Contreras
2012-10-29 21:26 ` Jeff King
2012-10-29 21:47 ` Felipe Contreras
2012-10-29 21:56 ` Jeff King
2012-10-29 22:02 ` Felipe Contreras
2012-10-29 22:06 ` Jeff King
2012-10-30 17:18 ` Felipe Contreras
2012-10-30 17:20 ` Johannes Schindelin
2012-10-30 18:10 ` Felipe Contreras
2012-10-30 19:33 ` Johannes Schindelin
2012-10-30 20:15 ` Felipe Contreras
2012-10-31 9:30 ` Michael J Gruber
2012-10-31 10:27 ` Jeff King
2012-10-31 15:58 ` Felipe Contreras
2012-10-31 18:20 ` Johannes Schindelin
2012-10-31 18:41 ` Felipe Contreras
2012-10-31 18:59 ` Jonathan Nieder
2012-10-31 19:24 ` Felipe Contreras
2012-10-31 20:28 ` Lack of netiquette, was " Johannes Schindelin
2012-10-31 20:37 ` Felipe Contreras
2012-11-01 1:32 ` Junio C Hamano
2012-11-01 2:58 ` Felipe Contreras
2012-11-01 13:46 ` René Scharfe
2012-11-01 14:18 ` Tomas Carnecky
2012-11-01 14:18 ` Martin Langhoff
2012-11-01 14:34 ` Felipe Contreras
2012-11-01 14:47 ` Martin Langhoff
2012-11-01 17:13 ` Felipe Contreras
2012-11-02 9:38 ` Andreas Ericsson
2012-11-02 11:03 ` Michael J Gruber
2012-11-02 16:09 ` Felipe Contreras
2012-11-05 9:25 ` Michael J Gruber
2012-11-05 15:22 ` Felipe Contreras
2012-11-05 15:58 ` Felipe Contreras
2012-11-05 16:00 ` Michael J Gruber
2012-11-05 16:15 ` Felipe Contreras
2012-11-01 20:46 ` Jonathan Nieder
2012-10-31 23:14 ` Daniel Barkalow
2012-11-01 2:46 ` Felipe Contreras
2012-11-01 1:41 ` Junio C Hamano
2012-11-01 2:54 ` Felipe Contreras
2012-10-31 15:39 ` Felipe Contreras
2012-10-31 15:55 ` Michael J Gruber
2012-10-31 16:11 ` Felipe Contreras
2012-11-02 14:46 ` Jeff King
2012-11-02 18:39 ` Felipe Contreras
2012-11-02 19:20 ` Felipe Contreras
2012-11-04 2:28 ` Felipe Contreras
2012-11-02 23:18 ` Thomas Adam
2012-11-02 23:52 ` Felipe Contreras
2012-10-31 18:04 ` Felipe Contreras
2012-10-31 19:47 ` Felipe Contreras
2012-11-01 4:08 ` Felipe Contreras
2012-11-02 14:48 ` Jeff King
2012-11-02 16:41 ` Felipe Contreras
2012-11-02 18:01 ` Felipe Contreras
2012-11-05 14:13 ` Michael J Gruber
2012-11-05 15:36 ` Felipe Contreras [this message]
2012-11-01 1:22 ` Junio C Hamano
2012-11-01 2:50 ` Felipe Contreras
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAMP44s1AELcssUZsa+2kCdbA7DLDDNFtWzGbOvMUP5AKnHByow@mail.gmail.com \
--to=felipe.contreras@gmail.com \
--cc=Johannes.Schindelin@gmx.de \
--cc=barkalow@iabervon.org \
--cc=git@drmicha.warpmail.net \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=ilari.liusvaara@elisanet.fi \
--cc=peff@peff.net \
--cc=srabbelier@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).