From: Brian Gerst <bgerst@didntduck.org>
To: Petr Baudis <pasky@suse.cz>
Cc: Martin Coxall <quasi@cream.org>,
Zack Brown <zbrown@tumblerings.org>,
git@vger.kernel.org
Subject: Re: rsync deprecated but promoted?
Date: Mon, 26 Sep 2005 10:41:54 -0400 [thread overview]
Message-ID: <433808B2.3070508@didntduck.org> (raw)
In-Reply-To: <20050926133204.GB21019@pasky.or.cz>
Petr Baudis wrote:
> Dear diary, on Sun, Sep 25, 2005 at 09:06:37PM CEST, I got a letter
> where Martin Coxall <quasi@cream.org> told me that...
>
>>On 25 Sep 2005, at 17:32, Zack Brown wrote:
>>
>>>Hi folks,
>>>
>>>When I use cogito, it gives a warning saying the rsync method is
>>>deprecated and
>>>will be removed in the future. But when I visit kernel.org/git, the
>>>page says to
>>>use an rsync URL with cg-clone.
>>>
>>>Maybe kernel.org should be updated?
>>>
>>
>>It does seem to be sending out a confusing message to us users too,
>>since an initial clone of Linus's tree with rsync is on my machine 10x
>>faster than an http clone, so it seems to be sending out something of a
>>confused/confusing message re: rsync.
>>
>>Am I right in thinking it's because rsync didn't originally have pack
>>support, but now it does, Petr has simply forgotten to deprecate the
>>deprecation message?
>
>
> Nope. rsync always did packs, I actually un-deprecated it for the time
> period when HTTP didn't. The thing is, rsync is bad - it will happily
> put duplicate, redundant, and especially unwanted data to your
> repository, especially when the shared GIT repositories happen. HTTP and
> git-daemon are much better access methods in this regard - actually, I
> still like HTTP the most:
>
> + Works everywhere - no special setup, no dedicated service, firewalls
> and proxies won't stop it
> + Works properly, i.e. only getting stuff you want, unlike rsync
> + Replicates packs setup - would be even better if it would kill objects
> and packs which the new pack makes redundant
>
> It would be best to have some smarter git-prune-packed, which
> would process just a single pack. The other alternative would be
> that it would prune packs being subsets of other packs as well,
> but that scaled bad. I will write another mail about that.
>
> - It is slow. Actually, I think it should be much faster for incremental
> fetches, and the initial fetch should take about the same time if you
> use packs. But the question is, did we already hit the limit? Are we
> using HTTP keepalive connections, do we parallelize the requests?
>
The current HTTP fetch doesn't do asynchronous requests (using
curl_multi_*). This means that no transfers occur while processing
received objects.
The other problem with HTTP vs. rsync is that the HTTP fetch will walk
the entire tree down to the root to verify it has every object. While
this isn't a bad thing it's usually unnecessary when it's all in one big
pack file.
--
Brian Gerst
next prev parent reply other threads:[~2005-09-26 14:40 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-09-25 16:32 rsync deprecated but promoted? Zack Brown
2005-09-25 17:07 ` H. Peter Anvin
2005-09-25 19:06 ` Martin Coxall
2005-09-26 13:32 ` Petr Baudis
2005-09-26 14:41 ` Brian Gerst [this message]
2005-09-26 16:36 ` Petr Baudis
2005-09-26 16:47 ` Brian Gerst
2005-09-26 15:04 ` Linus Torvalds
2005-09-26 16:38 ` Petr Baudis
2005-09-26 16:43 ` Linus Torvalds
2005-11-10 23:17 ` Petr Baudis
2005-09-26 16:44 ` walt
2005-09-26 17:55 ` Linus Torvalds
2005-09-26 19:23 ` walt
2005-09-26 20:12 ` Johannes Schindelin
2005-09-26 20:19 ` Junio C Hamano
2005-09-26 22:13 ` Daniel Barkalow
2005-09-26 22:38 ` Junio C Hamano
2005-09-26 20:43 ` Petr Baudis
2005-09-27 6:35 ` hared GIT repos (was Re: rsync deprecated but promoted?) Matthias Urlichs
2005-09-27 7:13 ` shared GIT repos Junio C Hamano
2005-09-27 8:45 ` Matthias Urlichs
2005-09-27 9:59 ` Sergey Vlasov
2005-09-27 10:29 ` Matthias Urlichs
2005-09-27 15:21 ` Linus Torvalds
2005-09-27 18:36 ` hared GIT repos (was Re: rsync deprecated but promoted?) A Large Angry SCM
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=433808B2.3070508@didntduck.org \
--to=bgerst@didntduck.org \
--cc=git@vger.kernel.org \
--cc=pasky@suse.cz \
--cc=quasi@cream.org \
--cc=zbrown@tumblerings.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).