git.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* git-local-pull
@ 2005-06-27  3:05 David S. Miller
  2005-06-27  5:20 ` git-local-pull Linus Torvalds
  0 siblings, 1 reply; 4+ messages in thread
From: David S. Miller @ 2005-06-27  3:05 UTC (permalink / raw)
  To: git


I tried to start using git-clone-script to clone repositories
locally.  It crunches on the disk for a couple of seconds,
that's fine, but then I notice the disk activity stop and
git-local-pull becomes cpu bound and grows to 80MB in size
over the course of 5 minutes.

Is this a side effect of the new pack/unpack stuff?

Compared to what this thing is doing, manually symlinking
the object database, copying over the HEAD, and building
the index file is significantly faster.

Actually, git-clone-script didn't build an index file.
So the compute time definitely came from something else
entirely.

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: git-local-pull
  2005-06-27  3:05 git-local-pull David S. Miller
@ 2005-06-27  5:20 ` Linus Torvalds
  2005-06-27  5:31   ` git-local-pull Jeff Garzik
  0 siblings, 1 reply; 4+ messages in thread
From: Linus Torvalds @ 2005-06-27  5:20 UTC (permalink / raw)
  To: David S. Miller; +Cc: git



On Sun, 26 Jun 2005, David S. Miller wrote:
> 
> I tried to start using git-clone-script to clone repositories
> locally.  It crunches on the disk for a couple of seconds,
> that's fine, but then I notice the disk activity stop and
> git-local-pull becomes cpu bound and grows to 80MB in size
> over the course of 5 minutes.

Try adding "-l" to the git-local-pull command line, to make it do 
hardlinks by default (and fall back to a copy if that fails).

> Is this a side effect of the new pack/unpack stuff?

Nope, nothing uses that yet. 

I think the 80MB is just because it built up the object relationship for 
the whole kernel db there. I don't know what the 5 minutes of CPU are, 
though, it should just be copying stuff (and at 250MB in and 250MB out, 
that's a lot of stuff to copy, but it shouldn't be five minutes worth, 
especially if it's all cached and you see a pegged CPU).

That copy is stupid, btw. Doing a "mmap + write" is a lot slower than
doing a "while (data) { read(8kB) + write(8kB) }" loop. The read() will
generally do better read-ahead and the small read/write will be better for 
the caches. But that still doesn't explain 5 minutes worth.

> Compared to what this thing is doing, manually symlinking
> the object database, copying over the HEAD, and building
> the index file is significantly faster.

Well, with "-l" that is what git-local-pull will do too. And if you want 
symlinks, use "-s". Or use both, in which case it will try a hardlink 
first, and then a symlink if it gets an EXDEV.

		Linus

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: git-local-pull
  2005-06-27  5:20 ` git-local-pull Linus Torvalds
@ 2005-06-27  5:31   ` Jeff Garzik
  2005-06-27 18:46     ` git-local-pull Linus Torvalds
  0 siblings, 1 reply; 4+ messages in thread
From: Jeff Garzik @ 2005-06-27  5:31 UTC (permalink / raw)
  To: Linus Torvalds; +Cc: David S. Miller, git

Linus Torvalds wrote:
> 
> On Sun, 26 Jun 2005, David S. Miller wrote:
> 
>>I tried to start using git-clone-script to clone repositories
>>locally.  It crunches on the disk for a couple of seconds,
>>that's fine, but then I notice the disk activity stop and
>>git-local-pull becomes cpu bound and grows to 80MB in size
>>over the course of 5 minutes.
> 
> 
> Try adding "-l" to the git-local-pull command line, to make it do 
> hardlinks by default (and fall back to a copy if that fails).
> 
> 
>>Is this a side effect of the new pack/unpack stuff?
> 
> 
> Nope, nothing uses that yet. 
> 
> I think the 80MB is just because it built up the object relationship for 
> the whole kernel db there. I don't know what the 5 minutes of CPU are, 
> though, it should just be copying stuff (and at 250MB in and 250MB out, 
> that's a lot of stuff to copy, but it shouldn't be five minutes worth, 
> especially if it's all cached and you see a pegged CPU).

FWIW, I notice that git-diff-tree uses >100MB of RAM, on a large 
git-pull-script run.  It's generating the diffstat output, AFAICS.

	Jeff

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: git-local-pull
  2005-06-27  5:31   ` git-local-pull Jeff Garzik
@ 2005-06-27 18:46     ` Linus Torvalds
  0 siblings, 0 replies; 4+ messages in thread
From: Linus Torvalds @ 2005-06-27 18:46 UTC (permalink / raw)
  To: Jeff Garzik; +Cc: David S. Miller, git



On Mon, 27 Jun 2005, Jeff Garzik wrote:
> 
> FWIW, I notice that git-diff-tree uses >100MB of RAM, on a large 
> git-pull-script run.  It's generating the diffstat output, AFAICS.

Ahh, yes, I should make the diffstat be conditional, not everybody wants 
it, and it's often the most expensive part of the pull..

		Linus

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2005-06-27 18:38 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-06-27  3:05 git-local-pull David S. Miller
2005-06-27  5:20 ` git-local-pull Linus Torvalds
2005-06-27  5:31   ` git-local-pull Jeff Garzik
2005-06-27 18:46     ` git-local-pull Linus Torvalds

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).