From: Linus Torvalds <torvalds@linux-foundation.org>
To: Dana How <danahow@gmail.com>
Cc: Nicolas Pitre <nico@cam.org>, Junio C Hamano <junkio@cox.net>,
git@vger.kernel.org
Subject: Re: [PATCH 09/13] drop objects larger than --blob-limit if specified
Date: Fri, 6 Apr 2007 12:24:53 -0700 (PDT) [thread overview]
Message-ID: <Pine.LNX.4.64.0704061214230.6730@woody.linux-foundation.org> (raw)
In-Reply-To: <56b7f5510704061109n2878a221p391b7c3edba89c63@mail.gmail.com>
On Fri, 6 Apr 2007, Dana How wrote:
>
> I agree with your arguments, but packs have different uses
> and to me there are several differing (non-)reasons for --pack-limit.
> * To split packs into smaller chunks when the .pack format is used as a
> communications protocol. But the discussion has converged to
> "we're not going to split packs at the transmitter". I agree with this
> conclusion, since it would make the total transmission larger (loss
> of deltas).
I don't agree with it, since I don't think the delta advantage is
noticeable once packs are "big enough".
Splitting on the sending side means that it needs to split just in one
place, rather than have the same logic in two different places.
I don't think it really "convered" on anything, I think the discussion
just didn't continue along that thing.
> Since no .idx file is sent there is no requirement for --pack-limit here.
I really don't think the idx file is the only - or even primary - reason.
We need *some* size limiter for a lot of reasons. If the idx file was the
only reason, we should just switch to a new index format and forget about
it.
But since there are *other* reasons that cannot go away, that doesn't
obviate the need for size limits, so while I think the idx one is the
*first* reason to do this, I don't think it is really the most important
one, exactly because the idx file we *could* solve other ways.
> * To avoid offsets larger than 31 bits in .idx files. Your proposal,
> and what I was doing for --pack-limit && --stdout, is sufficient to
> address this.
Right.
> * Avoiding (e.g.) 2GB+ files when none already exist in the repository --
> either the filesystem doesn't support anything beyond the limit,
> or we don't want to use a >31b off_t with mmap. (Perhaps
> the latter case is completely avoided by some glibc 64b trickery,
> but is that always true?) Only the write rollback approach can address this.
I disagree violently.
IN THEORY only write rollback can address that. But "theory" is not
practice, and anybody who thinks that theory is even *relevant* here is
missing the big picture.
If you use FATFS as the place to store your objects, it's really easy to
just say: you can't have objects bigger than 2GB. That's the real life
solution. The "theory" that you could have pack-files larger than 32 bits
is just about as relevant as quantum mechanics is to designing the tensile
strength of a sky-scraper. Sure, *in*theory* quantum mechanics matters,
but in practice, you'd be crazy if you tried to convince anybody that it
should be done using QM rather than traditional means.
Quite frankly, if you have objects that compress to >2GB, you're going to
be in so much pain even on *other* filesystems, that I seriously doubt
you'd want to track them using git anyway. And even if you want to, just
tell people: don't use FATFS, because this repository is insane.
> * Creating .pack files that fit on e.g. CDROM etc.
> The 2nd and 3rd cases are what I'm thinking about,
> which is why the first version of my patch did both.
Again, *in*practice*, for any sane situation, if you want to fit things on
a CD-ROM, just give a limit of 600MB, and I can pretty much guarantee that
you'll see a slop of just a percent or two for any realistic setup. And if
it goes up to 660MB, you'll still fit on any CD.
So please ignore theory, when theory isn't relevant. Designing for
something that practically cannot happen sounds pretty wrong.
(Of course, since you have a 55GB archive, you probably have an insane
thing in the first place. My condoleances if so ;)
Linus
next prev parent reply other threads:[~2007-04-06 19:25 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-04-05 22:36 [PATCH 09/13] drop objects larger than --blob-limit if specified Dana How
[not found] ` <al pine.LFD.0.98.0704052103410.28181@xanadu.home>
[not found] ` <56b7f5510704051919v7daac590m 6ac52c4fcabd5321@mail.gmail.com>
2007-04-06 1:04 ` Nicolas Pitre
2007-04-06 2:19 ` Dana How
2007-04-06 3:20 ` Nicolas Pitre
2007-04-06 15:49 ` Linus Torvalds
[not found] ` <56b7f55 10704061109n2878a221p391b7c3edba89c63@mail.gmail.com>
2007-04-06 18:09 ` Dana How
2007-04-06 19:21 ` Nicolas Pitre
2007-04-06 19:24 ` Linus Torvalds [this message]
2007-04-06 22:33 ` Junio C Hamano
2007-04-08 1:53 ` David Lang
2007-04-06 20:12 ` Nicolas Pitre
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Pine.LNX.4.64.0704061214230.6730@woody.linux-foundation.org \
--to=torvalds@linux-foundation.org \
--cc=danahow@gmail.com \
--cc=git@vger.kernel.org \
--cc=junkio@cox.net \
--cc=nico@cam.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).