From: Dhruva Krishnamurthy <dhruvakm@gmail.com>
To: Karthik Nayak <karthik.188@gmail.com>, Jeff King <peff@peff.net>,
matthew sporleder <msporleder@gmail.com>,
Toon Claes <toon@iotcl.com>
Cc: Git Mailing List <git@vger.kernel.org>
Subject: Re: bundles discovery and clones
Date: Fri, 9 Aug 2024 05:34:05 -0700 [thread overview]
Message-ID: <b76b3a9f-fa4a-41ab-893a-7fafe865ca09@gmail.com> (raw)
In-Reply-To: <CAOLa=ZRkZb65b1NawPBNOnnxi_gjCU9=85cJuxj0mQxyrPJe0g@mail.gmail.com>
On 6/15/24 6:01 AM, Karthik Nayak wrote:
> Jeff King <peff@peff.net> writes:
>
>> On Mon, Jun 10, 2024 at 02:25:19PM -0400, matthew sporleder wrote:
>>
>>> I have recently been playing with git clone --bundle-uri and loving it
>>> because I can clone with almost-*zero* resources being used on the
>>> server!
>>>
>>> I am a little confused by https://git-scm.com/docs/bundle-uri
>>> mentioning "discovery" and things. Is this something being added to
>>> the git cli, a special feature for other clients, or is it still too
>>> early-days to talk about much?
>>>
>>> I would love to produce bundles of common use cases and have them
>>> auto-discovered by git clone *without* the --bundle-uri parameter, and
>>> then let our CDN do the heavy lifting of satisfying things like:
>>> git clone
>>> git clone --depth=0
>>> git clone --single-branch --branch main
>>>
>>> I'm not sure I hold out as much hope for pre-bundling pulls/updates
>>> but any movement towards offloading our big-ish repos to CDNs is a win
>>> for us.
>>
>> I don't think the server side is well documented, but peeking at the
>> code, I think you want this on the server:
>>
>> git config uploadpack.advertiseBundleURIs true
>> git config bundle.version 1
>> git config bundle.mode any
>> git config bundle.foo.uri https://example.com/your.bundle
>>
>> And then the clients need to tell Git that they allow bundle transfers:
>>
>> git config --global transfer.bundleURI true
>>
>> I'm not sure if we'd eventually flip the client-side switch to "true" by
>> default (which is what you'd need for this to happen without any user
>> participation at all).
>>
>
> This would indeed be nice. We at GitLab have been experimenting with
> bundle-uri. While it is easy to flip the switch for clients under our
> control (CI pipelines). End users loose out on these benefits, especially
> for large monorepos where the servers spend a lot of time computing the
> packfile.
>
>> One gotcha there is that clients are now accessing an arbitrary URL
>> provided by the server, so there are cross-site security implications.
>> It might make more sense to allow only relative URLs without ".." (so if
>> I fetched from https://example.com/foo.git, the server could use only
>> the relative "bundles/bar.bundle", which would then be found at
>> https://example.com/foo.git/bundles/bar.bundle").
>>
>> -Peff
>
> True. But I suspect servers using bundle uri might not always serve them
> from the same domain. I know we were experimenting using cloud storage
> and providing the client with a one-time signed URL.
>
> https://cloud.google.com/storage/docs/access-control/signed-urls
We (at Bitbucket) have implemented bundle server for serving bundles
with expiring URL from cloud storage. It will be nice to have bundle
server discovery based on git v2 protocol based capability exchange.
example pkt format:
007bbundle-server=https://cdn-1.bitbucket.org/workspace/repository/bundle,https://bitbucket.org/workspace/repository/bundle
-Dhruva
next prev parent reply other threads:[~2024-08-09 12:34 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-06-10 18:25 bundles discovery and clones matthew sporleder
2024-06-11 7:21 ` Jeff King
2024-06-11 11:14 ` matthew sporleder
2024-06-13 10:20 ` Jeff King
2024-06-15 13:01 ` Karthik Nayak
2024-08-09 12:34 ` Dhruva Krishnamurthy [this message]
2024-06-29 2:02 ` Sitaram Chamarty
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=b76b3a9f-fa4a-41ab-893a-7fafe865ca09@gmail.com \
--to=dhruvakm@gmail.com \
--cc=git@vger.kernel.org \
--cc=karthik.188@gmail.com \
--cc=msporleder@gmail.com \
--cc=peff@peff.net \
--cc=toon@iotcl.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).