From: "Jesus M. Gonzalez-Barahona" <jgb@bitergia.com>
To: Lars Kurth <lars.kurth.xen@gmail.com>,
Ian Campbell <Ian.Campbell@citrix.com>
Cc: Daniel Izquierdo <dizquierdo@bitergia.com>,
Xen-devel <xen-devel@lists.xen.org>
Subject: Re: [RFC] Results of Phase 1 of the Review Process study
Date: Fri, 16 Oct 2015 00:25:29 +0200
Message-ID: <1444947929.11624.193.camel@bitergia.com>
In-Reply-To: <9AC1E823-0D62-467D-90E2-F9966BAFE802@gmail.com>
On Thu, 2015-10-15 at 22:36 +0100, Lars Kurth wrote:
> > On 15 Oct 2015, at 10:26, Ian Campbell <Ian.Campbell@citrix.com>
> > wrote:
> >
> > On Thu, 2015-10-15 at 10:06 +0100, Ian Campbell wrote:
> > > On Wed, 2015-10-14 at 18:32 +0100, Lars Kurth wrote:
> > > > C1) Only 60% of the reviews on the mailing list could be
> > > > matched to commits. This can be improved going forward, but we
> > > > felt that the dataset is big enough for statistical analysis
> > > > and didn't want to spend too much time to get the matching
> > > > perfect at this stage. See "Coverage analysis" for more details
> > >
> > > How strict or fuzzy is the matching?
> > >
> > > Does it account for e.g. spelling, grammar and clarity changes
> > > and things like adding a subsystem ("tools: libxc:") prefix,
> > > either upon commit or by the author in vN+1 based on feedback?
> > >
> > > I often both comment on such things during review and (with the
> > > author's permission) tweak things upon commit.
> > >
> > > If those changes are not being correlated then I expect that
> > > would skew the figures of those for whom English is not their
> > > first language (and not a small portion of native speakers
> > > even!) and newcomers who e.g. might not be aware of the need to
> > > prefix things with the subsystem.
> > >
> > > In a (smaller) number of cases a patch is abandoned in favour of
> > > a very different approach, which I think would be essentially
> > > untrackable, at least automatically.
> >
> > Looking at the stuff in [47] marked as last reviewed in 2014 it
> > seems the majority of them (at least the ones for which I am
> > involved as a maintainer etc) can be explained by one of these
> > factors, just going from my memory of things having been fixed in
> > one way or another.
>
> I think you are right: we hardly spent any time on more intelligent
> matching.
Yes. We tried to get a meaningful sample, assuming the skew was small
enough to draw conclusions about the duration of the review process,
which was the main target at this stage. As Lars mentions in another
message, the nice thing is that once we improve the matching
heuristics, the rest of the analysis can be rerun automatically, so we
would get more accurate results.

Without more careful validation, we were wary of introducing false
positives: relaxing the matching rules to the point where they start
matching messages and commits that are not really the same.
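
To illustrate the trade-off, here is a minimal sketch of what a more
tolerant matcher could look like. It is not the code used in the study:
the regular expressions, the function names and the 0.85 threshold are
all assumptions chosen for the example. It strips "[PATCH vN n/m]" tags
and subsystem prefixes, ignores case and whitespace, and then accepts
the closest commit title only if it is similar enough. Lowering the
threshold recovers more reviews but also lets in the false positives
mentioned above.

```python
import re
from difflib import SequenceMatcher

# Hypothetical patterns, for illustration only.
TAG_RE = re.compile(r"^\s*(?:\[[^\]]*\]\s*)+")          # "[PATCH v3 2/5] "
PREFIX_RE = re.compile(r"^\s*(?:[\w/.\-]+\s*:\s*)+")    # "tools: libxc: "


def normalize(subject, strip_prefixes=False):
    """Drop bracketed tags, optionally drop subsystem prefixes,
    lowercase, and collapse runs of whitespace."""
    s = TAG_RE.sub("", subject)
    if strip_prefixes:
        s = PREFIX_RE.sub("", s)
    return " ".join(s.lower().split())


def similarity(a, b):
    """Similarity ratio in [0, 1] between two normalized titles."""
    return SequenceMatcher(None, a, b).ratio()


def match(subject, commit_titles, threshold=0.85):
    """Return (commit_title, score) for the closest commit title,
    or None if nothing reaches the (assumed) threshold."""
    target = normalize(subject, strip_prefixes=True)
    best = None
    for title in commit_titles:
        score = similarity(target, normalize(title, strip_prefixes=True))
        if best is None or score > best[1]:
            best = (title, score)
    if best is not None and best[1] >= threshold:
        return best
    return None


if __name__ == "__main__":
    commits = [
        "tools: libxc: Fix memory leak in domain builder",
        "x86: Full support of PAT",
    ]
    print(match("[PATCH v2 1/3] libxc: fix memory leak in domain builder",
                commits))
    print(match("[PATCH] Add new scheduler", commits))
```

Any threshold like this would need validating against a hand-checked
sample before trusting the numbers it produces.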
> > There also look to be identical titles (e.g. "x86: Full support of
> > PAT") being listed there more than once.
>
> Will have to look at this one
Yes. Maybe some whitespace difference or something...
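
If whitespace is indeed the cause, a quick check along these lines
would show it. This is only a sketch under that assumption; the
function name and the sample titles are made up for the example. It
groups titles by their whitespace-collapsed form and reports the groups
with more than one raw variant.

```python
from collections import defaultdict


def group_whitespace_duplicates(titles):
    """Map each whitespace-normalized title to the raw titles that
    collapse to it, keeping only groups with more than one entry."""
    groups = defaultdict(list)
    for title in titles:
        key = " ".join(title.split())   # trim and collapse whitespace
        groups[key].append(title)
    return {key: raw for key, raw in groups.items() if len(raw) > 1}


if __name__ == "__main__":
    # Made-up sample data, for illustration only.
    sample = [
        "x86: Full support of PAT",
        "x86: Full support of PAT ",
        "x86:  Full support of PAT",
        "tools: fix build",
    ]
    for key, raw in group_whitespace_duplicates(sample).items():
        print(key, [repr(t) for t in raw])
```

Titles that come out as distinct here would point to some other cause,
such as the same subject being posted in several series.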
> [...]
Jesus.
--
Bitergia: http://bitergia.com
/me at Twitter: https://twitter.com/jgbarah