From: Linus Torvalds <torvalds@linux-foundation.org>
To: Jakub Narebski <jnareb@gmail.com>
Cc: Christopher Jefferson <caj@cs.st-andrews.ac.uk>, git@vger.kernel.org
Subject: Re: Problem with large files on different OSes
Date: Wed, 27 May 2009 09:30:18 -0700 (PDT) [thread overview]
Message-ID: <alpine.LFD.2.01.0905270922250.3435@localhost.localdomain> (raw)
In-Reply-To: <m3y6siboij.fsf@localhost.localdomain>
On Wed, 27 May 2009, Jakub Narebski wrote:
>
> Do seting `pack.packSizeLimit`, or adjusting values of
> `core.packedGitWindowSize` and/or `core.packedGitLimit`
> (see git-config(1)) help in your situation?
No, that will help just the packfile mmap (and even there, it won't help
with things like index file size - we'll always mmap the whole index
file). It's definitely worth doing, though - but I think we already
default to 32MB pack-file windows on 32-bit architectures.
Individual files we always handle in one go. It's what git was designed
for, after all - fairly small files. And so git is limited to files
smaller than the virtual address space.
On a 32-bit setup, that often limits you to roughly a gigabyte. You have
4GB of virtual address space, of which one or two is used for the OS
kernel. So say you have 2GB for user mode - you then have the executable
mapping and libraries and stack, all spread out in that 2GB virtual
address space.
In fact, even if it's 3GB for user (I don't know what OS X does), getting
one contiguous area may well be limited to ~1GB depending on layout of
shared library mappings etc VM fragmentation. Older Linux systems tended
to map things in ways that made it hard to get more than 1GB of contiguous
data mapping if you compiled with dynamic libraries.
64-bit mostly makes this a non-issue.
In fact, if you do "diff", you're going to be even _more_ limited, since
for simplicity, our version of xdiff really wants both sources in memory
at a time. So you can't really diff >500MB files. Not that you generally
want to, of course.
I'll see if I can make us handle the "big file without diff" case better
by chunking.
Linus
next prev parent reply other threads:[~2009-05-27 16:31 UTC|newest]
Thread overview: 31+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-05-27 10:52 Problem with large files on different OSes Christopher Jefferson
2009-05-27 11:37 ` Andreas Ericsson
2009-05-27 13:02 ` Christopher Jefferson
2009-05-27 13:28 ` John Tapsell
2009-05-27 13:30 ` Christopher Jefferson
2009-05-27 13:32 ` John Tapsell
2009-05-27 14:01 ` Tomas Carnecky
2009-05-27 14:09 ` Christopher Jefferson
2009-05-27 14:22 ` Andreas Ericsson
2009-05-27 14:37 ` Jakub Narebski
2009-05-27 16:30 ` Linus Torvalds [this message]
2009-05-27 16:59 ` Linus Torvalds
2009-05-27 17:22 ` Christopher Jefferson
2009-05-27 17:30 ` Jakub Narebski
2009-05-27 17:37 ` Nicolas Pitre
2009-05-27 21:53 ` Jeff King
2009-05-27 22:07 ` Linus Torvalds
2009-05-27 23:09 ` Alan Manuel Gloria
2009-05-28 1:56 ` Linus Torvalds
2009-05-28 3:26 ` Nicolas Pitre
2009-05-28 4:21 ` Eric Raible
2009-05-28 4:30 ` Shawn O. Pearce
2009-05-28 5:52 ` Eric Raible
2009-05-28 8:52 ` Andreas Ericsson
2009-05-28 17:41 ` Nicolas Pitre
2009-05-28 19:43 ` Jeff King
2009-05-28 19:49 ` Linus Torvalds
2009-05-27 23:29 ` Nicolas Pitre
2009-05-28 20:00 ` Jeff King
2009-05-28 20:54 ` Nicolas Pitre
2009-05-28 21:21 ` Jeff King
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=alpine.LFD.2.01.0905270922250.3435@localhost.localdomain \
--to=torvalds@linux-foundation.org \
--cc=caj@cs.st-andrews.ac.uk \
--cc=git@vger.kernel.org \
--cc=jnareb@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).