From: Dennis Kaarsemaker <dennis@kaarsemaker.net>
To: Junio C Hamano <gitster@pobox.com>
Cc: git@vger.kernel.org, pclouds@gmail.com
Subject: Re: [PATCH] reflog-walk: don't segfault on non-commit sha1's in the reflog
Date: Wed, 30 Dec 2015 22:33:28 +0100 [thread overview]
Message-ID: <1451511208.9251.21.camel@kaarsemaker.net> (raw)
In-Reply-To: <xmqqege3eiqb.fsf@gitster.mtv.corp.google.com>
On wo, 2015-12-30 at 13:20 -0800, Junio C Hamano wrote:
> Dennis Kaarsemaker <dennis@kaarsemaker.net> writes:
>
> > diff --git a/reflog-walk.c b/reflog-walk.c
> > index 85b8a54..b85c8e8 100644
> > --- a/reflog-walk.c
> > +++ b/reflog-walk.c
> > @@ -236,8 +236,8 @@ void fake_reflog_parent(struct reflog_walk_info
> > *info, struct commit *commit)
> > reflog = &commit_reflog->reflogs->items[commit_reflog
> > ->recno];
> > info->last_commit_reflog = commit_reflog;
> > commit_reflog->recno--;
> > - commit_info->commit = (struct commit *)parse_object(reflog
> > ->osha1);
> > - if (!commit_info->commit) {
> > + commit_info->commit = lookup_commit(reflog->osha1);
> > + if (!commit_info->commit || parse_commit(commit_info
> > ->commit)) {
> > commit->parents = NULL;
> > return;
>
> This looks somewhat roundabout and illogical. The original was bad
> because it blindly assumed reflgo->osha1 refers to a commit without
> making sure that assumption holds. Calling lookup_commit() blindly
> is not much better, even though you are helped that the function
> happens not to barf if the given object is not a commit.
>
> Also this changes semantics, no? Trace the original flow and think
> what happens, when we see a commit object that cannot be parsed in
> parse_commit_buffer(). parse_object() calls parse_object_buffer()
> which in turn calls parse_commit_buffer() and the entire callchain
> returns NULL. commit_info->commit will become NULL in such a case.
>
> With your code, lookup_commit() will store a non NULL in
> commit_info->commit, and parse_commit() calls parse_commit_buffer()
> and that would fail, so you clear commit->parents to NULL but fail
> to set commit_info->commit to NULL.
>
> Why not keep the parse_object() as-is and make sure we error out
> unless the result is a commit with a more explicit check, perhaps
> like this, instead?
lookup_commit actually returns NULL (via object_as_type) for objects
that are not commits, so I don't think the above is true. The code
below also loses the diagnostic message about the object not being a
commit.
> reflog-walk.c | 7 +++++--
> 1 file changed, 5 insertions(+), 2 deletions(-)
>
> diff --git a/reflog-walk.c b/reflog-walk.c
> index 85b8a54..861d7c4 100644
> --- a/reflog-walk.c
> +++ b/reflog-walk.c
> @@ -221,6 +221,7 @@ void fake_reflog_parent(struct reflog_walk_info
> *info, struct commit *commit)
> struct commit_info *commit_info =
> get_commit_info(commit, &info->reflogs, 0);
> struct commit_reflog *commit_reflog;
> + struct object *logobj;
> struct reflog_info *reflog;
>
> info->last_commit_reflog = NULL;
> @@ -236,11 +237,13 @@ void fake_reflog_parent(struct reflog_walk_info
> *info, struct commit *commit)
> reflog = &commit_reflog->reflogs->items[commit_reflog
> ->recno];
> info->last_commit_reflog = commit_reflog;
> commit_reflog->recno--;
> - commit_info->commit = (struct commit *)parse_object(reflog
> ->osha1);
> - if (!commit_info->commit) {
> + logobj = parse_object(reflog->osha1);
> + if (!logobj || logobj->type != OBJ_COMMIT) {
> + commit_info->commit = NULL;
> commit->parents = NULL;
> return;
> }
> + commit_info->commit = (struct commit *)logobj;
>
> commit->parents = xcalloc(1, sizeof(struct commit_list));
> commit->parents->item = commit_info->commit;
>
>
> > +test_expect_success 'reflog containing non-commit sha1s' '
> > + git checkout -b broken-reflog &&
> > + echo "$(git rev-parse HEAD^{tree}) $(git rev-parse HEAD)
> > abc <xyz> 0000000001 +0000" >> .git/logs/refs/heads/broken-reflog
> > &&
> > + git reflog broken-reflog
> > +'
> > +
>
> This will negatively affect the ongoing effort to abstract out the
> on-disk implementation of the reflog. In some future installation
> of Git, the reflog may not even be in .git/logs/refs/whatever file.
I was following the style of the test above it, will fix.
> Use a non-branch ref, so that you can store any valid object not
> just commits, and use a Git command (e.g. "git update-ref" or "git
> tag") instead of the raw filesystem access to update it, perhaps
> like this?
>
> git tag --create-reflog test-logs HEAD^ &&
> git tag -f test-logs HEAD^{tree} &&
> git tag -f test-logs HEAD &&
> git reflog test-logs
--
Dennis Kaarsemaker
www.kaarsemaker.net
next prev parent reply other threads:[~2015-12-30 21:33 UTC|newest]
Thread overview: 25+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-12-30 9:24 Segfault in git reflog Dennis Kaarsemaker
2015-12-30 10:31 ` Duy Nguyen
2015-12-30 11:17 ` Dennis Kaarsemaker
2015-12-30 11:26 ` Duy Nguyen
2015-12-30 11:28 ` Duy Nguyen
2015-12-30 12:28 ` Dennis Kaarsemaker
2015-12-30 13:19 ` Duy Nguyen
2015-12-30 15:22 ` [PATCH] reflog-walk: don't segfault on non-commit sha1's in the reflog Dennis Kaarsemaker
2015-12-30 21:20 ` Junio C Hamano
2015-12-30 21:33 ` Dennis Kaarsemaker [this message]
2015-12-30 21:41 ` Junio C Hamano
2015-12-30 21:49 ` Dennis Kaarsemaker
2015-12-30 22:17 ` [PATCH v2] " Dennis Kaarsemaker
2015-12-30 22:42 ` Junio C Hamano
2015-12-30 23:33 ` [PATCH v3] " Dennis Kaarsemaker
2015-12-31 0:02 ` Junio C Hamano
2015-12-31 8:57 ` Dennis Kaarsemaker
2015-12-31 15:43 ` Dennis Kaarsemaker
2016-01-05 21:12 ` [PATCH v4] " Dennis Kaarsemaker
2016-01-06 1:05 ` Eric Sunshine
2016-01-06 1:20 ` Dennis Kaarsemaker
2016-01-06 1:28 ` Eric Sunshine
2016-01-06 1:52 ` Eric Sunshine
2016-01-06 9:13 ` Dennis Kaarsemaker
2016-01-06 9:30 ` Duy Nguyen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1451511208.9251.21.camel@kaarsemaker.net \
--to=dennis@kaarsemaker.net \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=pclouds@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.