From: Derrick Stolee <stolee@gmail.com>
To: Jeff King <peff@peff.net>, Stefan Beller <sbeller@google.com>
Cc: "Junio C Hamano" <gitster@pobox.com>,
"Ævar Arnfjörð Bjarmason" <avarab@gmail.com>,
git <git@vger.kernel.org>, "Duy Nguyen" <pclouds@gmail.com>,
"Derrick Stolee" <dstolee@microsoft.com>
Subject: Measuring Community Involvement (was Re: Contributor Summit planning)
Date: Tue, 14 Aug 2018 13:43:38 -0400 [thread overview]
Message-ID: <3f0da3cd-0929-4aad-2c50-92c9d6c660e5@gmail.com> (raw)
In-Reply-To: <20180813215431.GB16006@sigill.intra.peff.net>
On 8/13/2018 5:54 PM, Jeff King wrote:
> So I try not to think too hard on metrics, and just use them to get a
> rough view on who is active.
I've been very interested in measuring community involvement, with the
knowledge that any metric is flawed and we should not ever say "this
metric is how we measure the quality of a contributor". It can be
helpful, though, to track some metrics and their change over time.
Here are a few measurements we can make:
1. Number of (non-merge) commit author tag-lines.
using git repo:
> git shortlog --no-merges --since 2017 -sne junio/next | head -n 20
284 Nguyễn Thái Ngọc Duy <pclouds@gmail.com>
257 Jeff King <peff@peff.net>
206 Stefan Beller <stefanbeller@gmail.com>
192 brian m. carlson <sandals@crustytoothpaste.net>
159 Brandon Williams <bmwill@google.com>
149 Junio C Hamano <gitster@pobox.com>
137 Elijah Newren <newren@gmail.com>
116 René Scharfe <l.s.r@web.de>
112 Johannes Schindelin <Johannes.Schindelin@gmx.de>
105 Ævar Arnfjörð Bjarmason <avarab@gmail.com>
96 Jonathan Tan <jonathantanmy@google.com>
93 SZEDER Gábor <szeder.dev@gmail.com>
78 Derrick Stolee <dstolee@microsoft.com>
76 Martin Ågren <martin.agren@gmail.com>
66 Michael Haggerty <mhagger@alum.mit.edu>
61 Eric Sunshine <sunshine@sunshineco.com>
46 Christian Couder <chriscool@tuxfamily.org>
36 Phillip Wood <phillip.wood@dunelm.org.uk>
35 Jonathan Nieder <jrnieder@gmail.com>
33 Thomas Gummerer <t.gummerer@gmail.com>
2. Number of other commit tag-lines (Reviewed-By, Helped-By,
Reported-By, etc.).
Using git repo:
$ git log --since=2018-01-01 junio/next|grep by:|grep -v
Signed-off-by:|sort|uniq -c|sort -nr|head -n 20
66 Reviewed-by: Stefan Beller <sbeller@google.com>
22 Reviewed-by: Jeff King <peff@peff.net>
19 Reviewed-by: Jonathan Tan <jonathantanmy@google.com>
12 Helped-by: Eric Sunshine <sunshine@sunshineco.com>
11 Helped-by: Junio C Hamano <gitster@pobox.com>
9 Helped-by: Jeff King <peff@peff.net>
8 Reviewed-by: Elijah Newren <newren@gmail.com>
7 Reported-by: Ramsay Jones <ramsay@ramsayjones.plus.com>
7 Acked-by: Johannes Schindelin <johannes.schindelin@gmx.de>
7 Acked-by: Brandon Williams <bmwill@google.com>
6 Reviewed-by: Eric Sunshine <sunshine@sunshineco.com>
6 Helped-by: Johannes Schindelin <Johannes.Schindelin@gmx.de>
5 Mentored-by: Christian Couder <christian.couder@gmail.com>
5 Acked-by: Johannes Schindelin <Johannes.Schindelin@gmx.de>
4 Reviewed-by: Jonathan Nieder <jrnieder@gmail.com>
4 Reviewed-by: Johannes Schindelin <johannes.schindelin@gmx.de>
4 Helped-by: Stefan Beller <sbeller@google.com>
4 Helped-by: René Scharfe <l.s.r@web.de>
3 Reviewed-by: Martin Ågren <martin.agren@gmail.com>
3 Reviewed-by: Lars Schneider <larsxschneider@gmail.com>
(There does not appear to be enough density here to make a useful
metric.)
3. Number of email messages sent.
Using mailing list repo:
$ git shortlog --since 2017 -sne | head -n 20
3749 Junio C Hamano <gitster@pobox.com>
2213 Stefan Beller <sbeller@google.com>
2112 Jeff King <peff@peff.net>
1106 Nguyễn Thái Ngọc Duy <pclouds@gmail.com>
1028 Johannes Schindelin <Johannes.Schindelin@gmx.de>
965 Ævar Arnfjörð Bjarmason <avarab@gmail.com>
956 Brandon Williams <bmwill@google.com>
947 Eric Sunshine <sunshine@sunshineco.com>
890 Elijah Newren <newren@gmail.com>
753 brian m. carlson <sandals@crustytoothpaste.net>
677 Duy Nguyen <pclouds@gmail.com>
646 Jonathan Nieder <jrnieder@gmail.com>
629 Derrick Stolee <stolee@gmail.com>
545 Christian Couder <christian.couder@gmail.com>
515 Jonathan Tan <jonathantanmy@google.com>
425 Johannes Schindelin <johannes.schindelin@gmx.de>
425 Martin Ågren <martin.agren@gmail.com>
420 Jeff Hostetler <git@jeffhostetler.com>
420 SZEDER Gábor <szeder.dev@gmail.com>
363 Phillip Wood <phillip.wood@talktalk.net>
3. Number of threads started by user.
(For this and the measurements below, I imported emails into a SQL
table with columns [commit, author, date, message-id, in-reply-to,
subject] and ran queries)
SELECT TOP 20
COUNT(*) as NumSent
,[Author]
FROM [git].[dbo].[mailing-list]
WHERE [In-Reply-To] = ''
AND CONVERT(DATETIME,[Date]) > CONVERT(DATETIME, '01-01-2018
00:00')
GROUP BY [Author]
ORDER BY NumSent DESC
| NumSent | Author |
|---------|----------------------------|
| 76 | Junio C Hamano |
| 64 | Stefan Beller |
| 54 | Philip Oakley |
| 50 | Nguyá»…n Thái Ngá»c Duy |
| 49 | Robert P. J. Day |
| 47 | Christian Couder |
| 36 | Ramsay Jones |
| 34 | Elijah Newren |
| 34 | SZEDER Gábor |
| 33 | Johannes Schindelin |
| 31 | Jeff King |
| 30 | Ævar Arnfjörð Bjarmason |
| 24 | Jonathan Tan |
| 22 | Alban Gruin |
| 22 | brian m. carlson |
| 18 | Randall S. Becker |
| 15 | Paul-Sebastian Ungureanu |
| 15 | Jeff Hostetler |
| 15 | Brandon Williams |
| 15 | Luke Diamand |
4. Number of threads where the user participated
(This is measured by completing the transitive closure of In-Reply-To
edges into a new 'BaseMessage' column.)
SELECT TOP 20
COUNT(BaseMessage) as NumResponded
,Author
FROM [git].[dbo].[mailing-list]
WHERE [In-Reply-To] <> ''
AND CONVERT(DATETIME,[Date]) > CONVERT(DATETIME, '01-01-2018
00:00')
GROUP BY Author
ORDER BY NumResponded DESC
| NumResponded | Author |
|--------------|----------------------------|
| 2084 | Junio C Hamano |
| 1596 | Stefan Beller |
| 1211 | Jeff King |
| 1120 | Johannes Schindelin |
| 1021 | Nguyá»…n Thái Ngá»c Duy |
| 799 | Eric Sunshine |
| 797 | Ævar Arnfjörð Bjarmason |
| 693 | Brandon Williams |
| 654 | Duy Nguyen |
| 600 | Elijah Newren |
| 593 | brian m. carlson |
| 591 | Derrick Stolee |
| 318 | SZEDER Gábor |
| 299 | Jonathan Tan |
| 286 | Christian Couder |
| 263 | Jonathan Nieder |
| 257 | Phillip Wood |
| 256 | Derrick Stolee |
| 238 | Taylor Blau |
| 216 | Martin Ã…gren |
(Note, some names have not been de-duplicated across multiple email
addresses, but the email addresses are removed from these tables since
I'm using a markdown generator that strips the emails in < >.)
If you have other ideas for fun measurements, then please let me know.
Thanks,
-Stolee
next prev parent reply other threads:[~2018-08-14 17:43 UTC|newest]
Thread overview: 47+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-08-13 16:31 Contributor Summit planning Jeff King
2018-08-13 16:58 ` Derrick Stolee
2018-08-13 17:15 ` Jeff King
2018-08-27 13:22 ` Johannes Schindelin
2018-08-27 13:30 ` Derrick Stolee
2018-08-28 12:22 ` Johannes Schindelin
2018-08-28 19:06 ` Jonathan Nieder
2018-08-28 19:11 ` Jonathan Nieder
2018-08-29 14:38 ` Johannes Schindelin
2018-08-29 4:52 ` Jeff King
2018-08-29 14:44 ` Johannes Schindelin
2018-08-13 17:46 ` Stefan Beller
2018-08-14 4:31 ` Christian Couder
2018-08-14 14:35 ` Jeff King
2018-08-13 18:49 ` Ævar Arnfjörð Bjarmason
2018-08-13 19:44 ` Jeff King
2018-08-13 20:36 ` Junio C Hamano
2018-08-13 20:41 ` Stefan Beller
2018-08-13 21:06 ` Jeff King
2018-08-13 21:19 ` Stefan Beller
2018-08-13 21:54 ` Jeff King
2018-08-14 17:43 ` Derrick Stolee [this message]
2018-08-14 19:36 ` Measuring Community Involvement (was Re: Contributor Summit planning) Jeff King
2018-08-14 19:47 ` Stefan Beller
2018-08-14 20:06 ` Jeff King
2018-08-15 7:12 ` Eric Wong
2018-08-14 20:42 ` Junio C Hamano
2018-08-27 15:54 ` Johannes Schindelin
2018-08-15 16:28 ` Duy Nguyen
2018-08-27 15:55 ` Johannes Schindelin
2018-08-14 14:30 ` Contributor Summit planning Duy Nguyen
2018-08-14 14:47 ` Jeff King
2018-08-14 16:57 ` Stefan Beller
2018-08-14 20:59 ` Junio C Hamano
2018-08-17 15:18 ` Duy Nguyen
2018-08-27 22:49 ` Johannes Schindelin
2018-08-29 5:02 ` Jeff King
2018-08-14 6:52 ` Elijah Newren
2018-08-14 13:25 ` Randall S. Becker
2018-08-14 14:06 ` Ævar Arnfjörð Bjarmason
2018-08-14 14:30 ` Jeff King
2018-08-14 14:28 ` Jeff King
2018-08-27 13:34 ` Johannes Schindelin
2018-08-29 4:55 ` Jeff King
2018-08-29 14:46 ` Johannes Schindelin
2018-08-30 3:20 ` Jeff King
2018-08-30 11:36 ` Johannes Schindelin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=3f0da3cd-0929-4aad-2c50-92c9d6c660e5@gmail.com \
--to=stolee@gmail.com \
--cc=avarab@gmail.com \
--cc=dstolee@microsoft.com \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=pclouds@gmail.com \
--cc=peff@peff.net \
--cc=sbeller@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).