From: Jakub Narebski <jnareb@gmail.com>
To: "Sverre Rabbelier" <srabbelier@gmail.com>
Cc: git@vger.kernel.org
Subject: Re: [RFC] Use cases for 'git statistics'
Date: Tue, 13 May 2008 15:07:04 +0200
Message-ID: <200805131507.04912.jnareb@gmail.com>
In-Reply-To: <bd6139dc0805120604m349b1fbbr39c6dcb8d893e771@mail.gmail.com>
On Mon, 12 May 2008, Sverre Rabbelier wrote:
> [Sorry, I hit 'send' instead of 'save']
And now you apparently forgot to add the git mailing list to the
recipients...
> On Mon, May 12, 2008 at 2:40 PM, Jakub Narebski <jnareb@gmail.com> wrote:
>> This is, IMHO, the most complex example (at least to do properly).
>> It begins with: does given author have code touching given subsystem
>> (i.e. is it for him/her new contribution wrt. subsystem)? How many
>> commits he/she has affecting given subsystem? How often he/she rewrites
>> code? How many bugs were introduced?
>
> Ah, there is a lot more to this example than I thought. Perhaps this
> data could all be shown and then, using some "importance" metric per
> item a "grade" can be calculated?
Weighting different statistics, Bayesian hypothesis/filtering, an expert
system, machine learning... I guess it would be quite a lot of work to
do it well. It would probably require calculating and adjusting scores
for code (difficulty) and authors (skill), and matching them...
This is certainly in the "wishlist" scope.
>> Details I think need to be provided by maintainer...
>
> Do you mean Junio, or the user of the program?
I mean that all I can provide is speculation. I am not, and never was,
a maintainer of an OSS project, and I don't know what criteria one uses
(perhaps unvoiced criteria) to decide whether a given patch needs to be
examined more closely, or whether cursory browsing should be enough.
>>>> * Contributor: what happened with my code?
>>>
>>> Do you mean a "track my code" like feature? Showing the movement of a
>>> particular piece of code through the code? (Displaying information
>>> like "moved from foo.c to bar.c in commit 0123456789abcd"?)
>>
>> I was thinking there about "git blame --reverse".
>
> Do you mean, filter it's output for a specific user?
I mean, given the code at a given version, what happened to this code?
Filtering "git blame --reverse" by user might be one way of solving it.
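A rough sketch of the idea, assuming the output of
"git blame --reverse --line-porcelain START..END -- FILE" is available.
In reverse mode the commit reported for a line is the last one in which
that line still existed, so lines whose commit is not END were changed
or removed later. The function names below are made up for illustration.

```python
import subprocess

HEX = set("0123456789abcdef")

def parse_line_porcelain(text):
    """Return (commit, author, line) for every line in --line-porcelain output."""
    entries = []
    commit = author = None
    for raw in text.split("\n"):
        if raw.startswith("\t"):
            # A TAB-prefixed line carries the file content and ends one entry.
            entries.append((commit, author, raw[1:]))
            continue
        key, _, value = raw.partition(" ")
        if len(key) in (40, 64) and set(key) <= HEX:
            commit = key          # entry header: "<hash> <orig> <final> [<n>]"
        elif key == "author":
            author = value
    return entries

def changed_later(entries, tip):
    """Lines whose last surviving commit is not `tip` (a full hash)."""
    return [entry for entry in entries if entry[0] != tip]

def reverse_blame(path, start, end="HEAD"):
    """Run git in the current directory; not exercised in the example below."""
    out = subprocess.run(
        ["git", "blame", "--reverse", "--line-porcelain",
         f"{start}..{end}", "--", path],
        check=True, capture_output=True, text=True,
    ).stdout
    return parse_line_porcelain(out)
```

Here `tip` has to be the full hash of END (e.g. from "git rev-parse"),
and "by user" would mean filtering the entries on the author field.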
>>>> * Searching where to contribute: what are oldest part of code dealing
>>>> with error messages (find ancient code)?
>>>>
>> Or find the lines with oldest modification stamp with "die" or "warn",
>> or find which messages are oldest, even if wrapper have changed.
>
> In that case, perhaps a regexp would be more suitable, to allow the
> user to search for any specific line, not just "die" or "warn"?
What I had in mind here, but didn't explain clearly enough, was an
extension to pickaxe search. You want to find when the current error
message was created, even if the way of handling it (fprintf vs. die)
changed, or if the code was indented or moved.
Or find all error messages, in the order they were created, for example
in git's case to find ancient error messages and replace them with
something more user-friendly (or less selective about choosing
friends ;-).
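As an approximation with existing tools (not the pickaxe extension
itself), one could feed "git blame -w -M -C --line-porcelain FILE" to
something like the sketch below and sort the message strings by author
time. The list of wrapper names is my assumption, and a change of
wrapper on the same line still makes the message look new, which is
exactly the limitation mentioned above.

```python
import re

# Assumed set of reporting wrappers; adjust to the project at hand.
CALL_RE = re.compile(r'\b(?:die|warning|error|fprintf)\s*\(')
# First double-quoted C string literal on the line, allowing escapes.
STRING_RE = re.compile(r'"((?:[^"\\]|\\.)*)"')

def oldest_messages(porcelain_text):
    """Return (author_time, message) pairs, oldest first."""
    found = []
    author_time = None
    for raw in porcelain_text.split("\n"):
        if raw.startswith("author-time "):
            author_time = int(raw.split(" ", 1)[1])
        elif raw.startswith("\t"):
            line = raw[1:]
            literal = STRING_RE.search(line)
            if author_time is not None and literal and CALL_RE.search(line):
                found.append((author_time, literal.group(1)))
    return sorted(found)
```

The first entries of the result are the candidates for "ancient" error
messages.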
>> P.S. I wonder how hard to be to plug-in such SCM statistic system
>> into something like project management, see
>> "Joel On Software: Evidence based scheduling" (of programming tasks)
>> http://www.joelonsoftware.com/items/2007/10/26.html
>
> Interesting article, I think integrating statistics
> (http://www.statsvn.org/ for example) can be a very powerful tool for
> project management.
You meant http://git.koha.org/gitstat/, didn't you? ;-P
Seriously, what I had in mind was to integrate author dates and commit
dates into the scheduling of a project management system.
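For example, the raw data could come from
"git log --format='%H %at %ct'" (hash, author timestamp, committer
timestamp). A minimal sketch, with made-up function names, of turning
that into the delay between writing a change and it being committed;
how a scheduling system would actually use it is left open.

```python
from datetime import timedelta
from statistics import median

def integration_lags(log_text):
    """Parse '<hash> <author-ts> <commit-ts>' lines into (hash, lag) pairs."""
    lags = []
    for line in log_text.splitlines():
        if not line.strip():
            continue
        sha, author_ts, commit_ts = line.split()
        lag = timedelta(seconds=int(commit_ts) - int(author_ts))
        lags.append((sha, lag))
    return lags

def median_lag(lags):
    """Median delay between author date and commit date."""
    seconds = [lag.total_seconds() for _, lag in lags]
    return timedelta(seconds=median(seconds))
```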
--
Jakub Narebski
Poland