Re: Git at Better SCM Initiative comparison of VCS (long)

git.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

From: "Alexey Mahotkin" <squadette@gmail.com>
To: "Dmitry Potapov" <dpotapov@gmail.com>
Cc: "Jakub Narebski" <jnareb@gmail.com>, git@vger.kernel.org
Subject: Re: Git at Better SCM Initiative comparison of VCS (long)
Date: Sun, 14 Sep 2008 19:09:36 +0400	[thread overview]
Message-ID: <bb5b640b0809140809g1aff9047qd6baf6cd66d23ec6@mail.gmail.com> (raw)
In-Reply-To: <20080914144306.GF28210@dpotapov.dyndns.org>

Hi,

I've written the version which is on http://versioncontrolblog.com and
sent it to Mr. Shlomi Fish several months ago.     He has extensively
re-written my text, making it more "neutral", and published it on
better-scm.  I do not agree with some of the changes he made, but I
did not insist.  :)

Occasionally I update my text with the current version at better-scm,
but this has not happened for some time, and it still contains my
original version.

Is there anything I can do to improve the state of things in any way? :)

On Sun, Sep 14, 2008 at 6:43 PM, Dmitry Potapov <dpotapov@gmail.com> wrote:
> Hello Jakub,
>
> I have added Alexey Mahotkin in CC, who is allegedly the author of that
> information about Git that you can read on the better-scm site.
>
> On Sat, Sep 13, 2008 at 07:06:16PM +0200, Jakub Narebski wrote:
>>
>> I have thought about trying yet another time... but Git was already
>> added; see http://better-scm.berlios.de/news/changes-2008-08-07/
>
> Interesting, the site still mentions Git as missing in a few places.
> For instance, when you click on Git in the list of alternatives, you
> get this: http://better-scm.berlios.de/alternatives/git/
> and then when you got to FAQ, you can read this:
>
> | The reason it's not there is that while many people have complained
> | about its absense, no one suitable has volunteered to become its
> | champion and supplied a good enough patch. If you have a substantial
> | amount of git expertise, have good English writing skills, and wish to
> | volunteer, then we'll be happy to hear from you. If not - at least don't
> | complain about it.
> |
> | In addition to everything that was said here, it seems that the
> | originator and maintainer of the site and comparison is now banned
> | from sending messages to vger.kernel.org, which hosts several
> | Linux-kernel-related mailing-lists, including the git one. This has
> | interfered with some of his Linux-related open-source work, including
> | trying to find a "Better SCM" maintainer for git. This is unfortunate,
> | but changing this situation, is currently beyond his control.
>
> Source: http://better-scm.berlios.de/faq/#git-missing
>
> I am surprised to hear that Shlomi Fish is banned...
>
>> scm>     <section id="repos_operations">
>> scm>         <title>Repository Operations</title>
>> scm>         <section id="atomic_commits">
>> scm>             <title>Atomic Commits</title>
>> scm>             <expl>
>> scm>                 Support for atomic commits means that if an
>> scm>                 operation on the repository is interrupted
>> scm>                 in the middle, the repository will not be
>> scm>                 left in an inconsistent state. Are the
>> scm>                 check-in operations atomic, or can
>> scm>                 interrupting an operation leave the
>> scm>                 repository in an intermediate state?
>> scm>             </expl>
>>
>> Here I think the explanation of a criterion (feature) is clear enough.
>> I might have added that "interruption" include killing of a process
>> during for example commit, lack of disk space for a full commit, or
>> a network fail during network operation (fetch or push, or equivalent).
>
> My initial reaction was to say that killing a process with -9 is not
> what you expect to see in practice, but a second later, I realized how
> wrong I was. Lack of memory may cause that the process gets killed with
> -9, and it has been observed in practice (at least, in case of Mercury
> repo): http://norman.walsh.name/2007/08/09/mercurial
>
> Another thing that is not clear in the above criterion is what exactly
> "inconsistent state" (or "intermediate state") means. For instance, if
> Git gets killed during commit, you may have to remove .git/index.lock
> manually. AFAIK, Mercury leaves the 'journal' file and you have to
> run "hg recovery". Does it mean that the commit is not atomic?
>
> Another thing here is that "git commit" is local, so I am not sure
> if this question includes network operations...
>
>> scm>         <section id="move">
>> scm>             <title>Files and Directories Moves or Renames</title>
>> scm>             <expl>
>> scm>                 Does the system support moving a file or directory to
>> scm>                 a different location while still retaining the history
>> scm>                 of the file? <b>Note:</b> also see the next section
>> scm>                 about intelligent merging of renamed paths.
>> scm>             </expl>
>>
>> In my opinion this criterion is next to worthess without more in depth
>> clarification of what does it mean to "support" moves or renames; as
>> entries for different systems are written by different people, if it
>> is not clear how to check if some feature is supported, some might
>> write 'no' for some system A, and some other person can write 'yes'
>> for other system B, even if the support is better in system A than in
>> system B (and would be considered enough, i.e. 'yes' answer, by the
>> creator of this criterion).
>>
>> For me the support for renames/moves and copying (see next section)
>> means that:
>>
>>  0.) When examining or going to some point in the history (some old
>>      revision/version of a project) the state you get is _exactly_
>>      the same as it was at that time, exactly the same as it was
>>      recorded (comitted) then.
>>
>>      For example tricks with moving *,v files in the CVS repository
>>      break this assertion.
>
> IMHO, the above assertion is assumed when we talk about renaming, as
> the system that is not capable of that will not be qualified as an
> SCM. Yet, there is still plenty way to interpret the above criterion.
> Even in CVS, the history of the file does not disappear when you move
> a file. You can just write, this file move was renamed from old-name,
> so anyone can get old history without any problem. Of course, it will
> require some an additional step taken manually. But if the requirement
> is to see all log history with one $scm log command, you can just copy
> old log into log of a newly added file. Of course, you cannot run $scm
> annotate on that file and see who changed what line, but there is no
> such a requirement above.
>
> So, I agree, it should be better defined.
>
>>
>>  1.) When examining history of a project as a whole version control
>>      system tells you that file was renamed (moved). I would consider
>>      having there renaming represented as copy + delete to be only
>>      a partial support of this feature.
>
> If files moving is interpreted in the sense of preserving the old history
> then copy + delete fully satisfies that criterion.
>
> However, if you defined support of file movement as ability to see that
> some file when you look at the history of the whole project then
> certainly copy + delete representation would not satisfy it.
>
> So, perhaps, it should be two separate points:
> - ability to preserve history of rename (with detail clarification
>  of what it means)
> - ability to show renames in the project history
>
>>
>> scm>                 <s id="git">
>> scm>                     Renames are supported for most practical
>> scm>                     purposes.  Git even detects renames when a file has been
>> scm>                     changed afterward the rename.  However, due to a peculiar
>> scm>                     repository structure, renames are not recorded
>> scm>                     explicitly, and Git has to deduce them (which works well
>> scm>                     in practice).
>> scm>                 </s>
>>
>> First, a correction to above statement.  It is not due to "a peculiar
>> repository structure", but due to "a design decision" (perhaps with
>> link to some explanation why it was implemented this way; I planned
>> to make a wiki page about 'rename tracking' vs. 'rename detection'
>> with references to various mailing list messages etc., but to this
>> day it was not created).
>
> Agreed.
>
>>
>>
>> Second, we can think about how the above statement could be improved.
>>
>
> <long and detail explanation of how git works>
>
>>
>> ...Now only put the above in a few short sentences to be used in
>> "Better SCM Initiative" comparison table...
>
> Git tracks content rather than file-ids, and therefore it uses heuristics
> for rename detection.  This approach has an advantage of being able to
> preserve history for code lines between files, which usually happens much
> more often than file renaming.
>
>> scm>                 <s id="git">
>> scm>                     No. As detailed in the <a
>> scm>                         href="http://git.or.cz/gitwiki/GitFaq#rename-tracking">Git
>> scm>                         FAQ</a>:
>> scm>                     "Git has a rename command git mv, but that is just a
>> scm>                     convenience. The effect is indistinguishable from removing
>> scm>                     the file and adding another with different name and the
>> scm>                     same content."
>> scm>                 </s>
>>
>> This is of course NOT TRUE.  If the author bother checking (which
>> would be helped if there was available simple shell script, or simple
>> Perl script, testing 'intelligent_renames' criterion) he/she would
>> notice that git does apply change to renamed file, both if file
>> itself is renamed, and if directory it is in gets renamed.
>
> Sure. But it just demonstrates that the line of reasoning, which was
> clearly based on unstated assumption of how file-id tracking performs
> merge in this situation leads to the wrong conclusion for Git as it is
> the content tracking system, so Git does that differently.
>
> Perhaps, it would make sense to extend GitFaq to better cover that
> point, because people with other SCM background could easily conclude
> that Git cannot do "intelligent merge" after reading about git-mv.
>
>> scm>         <section id="changesets">
>> scm>             <title>Changesets' Support</title>
>> scm>             <expl>
>> scm>                 Does the repository support changesets? Changesets are a way
>> scm>                 to group a number of modifications that are relevant to each
>> scm>                 other in one atomic package, that can be cancelled or
>> scm>                 propagated as needed.
>> scm>             </expl>
>>
>> Here it is not entirely clean what creator of "Better SCM Initiative"
>> comparison table had on mind, what he meant by this.  Not all version
>> control systems are changeset based; some are snapshot based.  I guess
>> that for snapshot based SCM the above requirement is equivalent to
>> "Whole tree commits".
>
> Yes, it is irrelevant to being changeset or snapshot based. It is
> whether modification to more than one file can be commited (and
> propogated) atomically. I also suppose that those changes should be
> shown in history as a single change (not many changes too different
> files that took place in the same time and the same commit comment).
>
> However, the whole tree commit is a more strict requirement than
> just being able to commit a group of changes atomically. For example,
> "svn ci" creates a changeset and atomically store all its modification
> on the server. Yet, it is not the whole tree commit, because the result
> tree may differ from the tree that you commiting (files that are not
> modified by changeset may differ).
>
>> scm>                 <s id="git">
>> scm>                     Yes, Changesets are supported,
>> scm>                     and there's some flexibility in creating them.
>> scm>                 </s>
>> scm>            </compare>
>> scm>         </section>
>>
>> [Again, Git part was re-wrapped for better readibility]
>>
>> In my opition, such an _empty_ addition ("there's some flexibility in
>> creating them") is totally unnecessary; it adds no solid information
>> (what does it mean "some flexibility") and should be removed.
>
> Agreed. I suspect the author implied by that Git allows to stage
> and commit separately chunk without commiting the whole file.
> Yet, as it is worded above, it is useless.
>
>> scm>         <section id="tracking_uncommited_changes">
>> scm>             <title>Tracking Uncommited Changes</title>
>> scm>             <expl>
>> scm>                 Does the software have an ability to track the changes in the
>> scm>                 working copy that were not yet committed to the repository?
>> scm>             </expl>
>>
>> This also should be made more clean.  Does it mean for example ability
>> to tell which files have changed, or ability to diff working copy to
>> either last comitted changes, or to any revision available in repository?
>
> Also, ability to diff one or more specified files in the working copy to
> some specified revision.
>
>> scm>     <section id="technical_status">
>> scm>         <title>Technical Status</title>
>> scm>         <section id="documentation">
>> scm>             <title>Documentation</title>
>> scm>             <expl>
>> scm>                 How well is the system documented? How easy is it to
>> scm>                 get started using it?
>> scm>             </expl>
>> scm>             <compare>
>> scm>                 <s id="git">
>> scm>                     Medium. The short help is too terse and obscure.
>> scm>                     The man pages are extensive, but tend to be confusing.
>> scm>                     The are many tutorials.
>> scm>                 </s>
>> scm>             </compare>
>> scm>         </section>
>>
>> That of course depends on your opinion.  I would say "Good", now that
>> there is "Git User's Manual" distributed with Git, and now that there
>> started semi-official "Git Community Book" (http://book.git-scm.com).
>
> Interesting that versioncontrolblog, which, if I am not mistaken, is
> Alexey's site, states for Git Documentation:
>
> | Good. There is extensive documentation for every command, and many
> | tutorials.
>
> http://www.versioncontrolblog.com/comparison/Git/index.html
>
> So, I am not sure were the word "Medium" came from.
>
>
> Dmitry
>



-- 
Алексей Махоткин
http://squadette.ru/

next prev parent reply	other threads:[~2008-09-14 15:10 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-09-13 17:06 Git at Better SCM Initiative comparison of VCS (long) Jakub Narebski
2008-09-14 14:43 ` Dmitry Potapov
2008-09-14 15:09   ` Alexey Mahotkin [this message]
2008-09-14 17:48   ` Jakub Narebski
2008-09-14 19:48     ` Dmitry Potapov
2008-09-14 21:06       ` Shawn O. Pearce
2008-09-14 21:29         ` Jakub Narebski
2008-09-15  0:37       ` Jakub Narebski
2008-10-01 18:45 ` Jakub Narebski

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bb5b640b0809140809g1aff9047qd6baf6cd66d23ec6@mail.gmail.com \
    --to=squadette@gmail.com \
    --cc=dpotapov@gmail.com \
    --cc=git@vger.kernel.org \
    --cc=jnareb@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).