From: Andreas Ericsson <ae@op5.se>
To: Brian Herman <brianherman@gmail.com>
Cc: git <git@vger.kernel.org>
Subject: Re: Git and Very Large Codebases
Date: Tue, 20 Apr 2010 11:43:19 +0200 [thread overview]
Message-ID: <4BCD7737.1010709@op5.se> (raw)
In-Reply-To: <h2w4ef15ddc1004200200ocd9df12azf62da7419355ae7c@mail.gmail.com>
On 04/20/2010 11:00 AM, Brian Herman wrote:
> Can git support codebases on the order of magnitude of Gigabytes?
> For example, a game developer might want to use git as their SCM
> software but their codebases can be in gigabytes in scale.
> If git cannot support this how would one begin to create this support?
>
It depends more on the nature of the files than the size of the total
code. With large binary files and such, git will not perform very well.
On a code-tree of 1-30KiB source-code files it will perform
fantastically (compared to alternatives, that is; Nothing in the entire
world will handle multiple gigabytes faster than it can read it from
the disk).
I'd suggest splitting the repository in several different ones to
make git handle it well. Movies go in one repository, audio go in
another, pictures and whatnot in a third, and then you tie it all
together with a super-repo that holds the code and the build-system.
In repositories storing mainly binary files you go through the
gitattributes(5) man-page and set the necessary attributes for
the files in question. In your case, I'd suspect the .gitattributes
file would look something like this:
*.avi -diff -delta -crlf -merge
*.jpg -diff -delta -crlf -merge
*.mp3 -diff -delta -crlf -merge
This will prevent git from attempting delta-compression on the
files. It will also not try to create diffs from them, or perform
crlf-conversion or merge them. The gitattributes(5) man page has
more information regarding this.
Some trial and error with a small test-repository will help you
sort this out and understand what it means.
--
Andreas Ericsson andreas.ericsson@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.
next prev parent reply other threads:[~2010-04-20 9:43 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <s2x4ef15ddc1004200158s2fc6391byfab91784036ac018@mail.gmail.com>
2010-04-20 9:00 ` Git and Very Large Codebases Brian Herman
2010-04-20 9:43 ` Andreas Ericsson [this message]
2010-04-20 9:46 ` Jakub Narebski
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4BCD7737.1010709@op5.se \
--to=ae@op5.se \
--cc=brianherman@gmail.com \
--cc=git@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).