git.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* How to efficiently blame an entire repo?
@ 2010-04-29 23:12 Jay Soffian
  2010-04-30 19:45 ` Avery Pennarun
  2010-04-30 21:21 ` Jeff King
  0 siblings, 2 replies; 4+ messages in thread
From: Jay Soffian @ 2010-04-29 23:12 UTC (permalink / raw)
  To: git

Let's say you've got a repo with ~ 40K files and 35K commits.
Well-packed .git is about 800MB.

You want to find out how many lines of code a particular group of
individuals has contributed to HEAD.

The naive solution is to run git blame on all 40K files grep'ing for
the just the authors you want.

Possibly a step up from that is first using log --name-status
--author=... to find just the files which have been touched by those
authors and then blaming only those files.

I guess the next step up would be parsing the diff hunks output by log
-p, but then you're basically re-implementing blame I think.

Am I missing a clever solution?

j.

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2010-04-30 21:21 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-04-29 23:12 How to efficiently blame an entire repo? Jay Soffian
2010-04-30 19:45 ` Avery Pennarun
2010-04-30 20:16   ` Jay Soffian
2010-04-30 21:21 ` Jeff King

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).