From: Markus Armbruster <armbru@redhat.com>
To: Paolo Bonzini <pbonzini@redhat.com>
Cc: Peter Maydell <peter.maydell@linaro.org>,
QEMU Developers <qemu-devel@nongnu.org>
Subject: Re: use of uninitialized variable involving visit_type_uint32() and friends
Date: Fri, 01 Apr 2022 15:11:03 +0200
Message-ID: <875ynt54pk.fsf@pond.sub.org>
In-Reply-To: <e17c3f3b-000f-4bab-1e3a-2adbafbcdcbb@redhat.com> (Paolo Bonzini's message of "Fri, 1 Apr 2022 13:16:19 +0200")
Paolo Bonzini <pbonzini@redhat.com> writes:
> On 4/1/22 11:15, Markus Armbruster wrote:
>> + assert (v->type == expected_type);
>> + if (expected_type & (VISITOR_INPUT | VISITOR_DEALLOC)) {
>>
>> Backwards.
>
> Yes, I always get input vs output wrong.
Output becomes input becomes output. How not to be confused!
>> With an input visitor @v,
>>
>> visit_type_uint32(v, "name", &val, errp)
>>
>> stores to @val without looking at it first. In other words,
>> uninitialized @val is fine, just like for val = ...
>>
>> Note: you don't actually need VISITOR_DEALLOC here, because a
>> deallocation visitor isn't going to do anything for non-pointer values.
>
> There's a philosophical question on whether other deallocation visitors
> can exist than "the" deallocation visitor, but it's not particularly
> germane to the topic.
Agreed. Same for "the" clone visitor.
>> Two changes:
>>
>> * Skip copying to and from full-width buffer @value:
>>
>> - Skip @value = *obj when we're going to overwrite @value without
>> reading it first.
>>
>> This leaves @value uninitialized instead of initializing it from a
>> (commonly) uninitialized variable.
>>
>> I'm not sure how this helps static analysis, but if it does...
>
> If it can do really serious interprocedural analysis, it _might_ be able
> to see through the visitor constructor and know that the "value = *obj"
> is not initialized (e.g. "all callers of object_property_set use an
> input visitor"). I doubt that honestly, but a man can dream.
I'm wary of arguments based on "a sufficiently smart compiler can"...
> If the conditionals are enough to shut it up, then we won the battle
> (for now).
If they get us more mileage per unit of work out of Coverity, I'm in
favor. I'll want a comment explaining the conditionals, though.
> If the conditionals are not enough to shut it up, then you have a bit
> more confidence when marking the false positives.
>
>> - Skip *obj = @value when value must be *obj anyway.
>>
>> Should have no observable effect.
>>
>> Again, I'm not sure how this helps static analysis.
>
> Mostly consistency, could also be changed to an assert(*obj == value);
> /* output visitors don't really need obj to be passed by reference */
I guess whatever is easier to explain in a comment.
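
For reference, a minimal sketch of the two conditionals under discussion.
It uses toy stand-ins (SkVisitor, sk_type_uint64, sk_type_uint32 are
hypothetical names, not QEMU's real definitions) and models only input
and output visitors, not dealloc or clone:

```c
#include <stdbool.h>
#include <stdint.h>

/* Toy stand-ins for illustration; these are NOT QEMU's real definitions. */
typedef enum { SK_INPUT, SK_OUTPUT } SkVisitorType;

typedef struct SkVisitor {
    SkVisitorType type;
    uint64_t wire;  /* input: value to deliver; output: value recorded */
} SkVisitor;

/* Full-width core method: input stores to *value, output reads it. */
static bool sk_type_uint64(SkVisitor *v, uint64_t *value)
{
    if (v->type == SK_INPUT) {
        *value = v->wire;
    } else {
        v->wire = *value;
    }
    return true;
}

/* Narrow wrapper with the two conditionals around the full-width buffer. */
static bool sk_type_uint32(SkVisitor *v, uint32_t *obj)
{
    uint64_t value;
    bool is_input = v->type == SK_INPUT;

    if (!is_input) {
        value = *obj;            /* only an output visitor reads *obj */
    }
    if (!sk_type_uint64(v, &value)) {
        return false;
    }
    if (value > UINT32_MAX) {
        return false;            /* out of range; *obj is left untouched */
    }
    if (is_input) {
        *obj = (uint32_t)value;  /* only an input visitor stores to *obj */
    }
    return true;
}
```

With an input visitor, @value is never initialized from *obj, so an
uninitialized *obj is never read. With an output visitor, *obj is never
written back.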
>> * Pass visitor type in addition to the visitor. Can you explain why
>> that's useful?
>
> Because it communicates what the caller expects: "I have left this
> uninitialized because I expect my "v" argument to be the kind of visitor
> that fills it in". It's this argument that gives me the confidence
> needed to shut up Coverity's false positives.
>
> Embedding the visitor type in the signature makes it impossible not to
> pass it, unlike e.g. an assertion in every getter or setter.
I think we have two kinds of code calling visitor methods:

1. Code for use with one kind of visitor only

   We get to pass a literal argument to the additional parameter you
   propose.

2. Code for use with arbitrary visitors (such as qapi-visit*.c)

   We need to pass v->type, where @v is the existing visitor argument.
   Except we can't: struct Visitor and VisitorType are private, defined
   in <visitor-impl.h>. Easy enough to work around, but has a distinct
   "this design is falling apart" smell, at least to me.

Note that "intent explicit in every method call" is sufficient, but not
necessary for "intent is locally explicit, which lets us dismiss false
positives with confidence". We could do "every function that calls
methods". Like checking a precondition. We already have
visit_is_input(). We could have visit_is_output().
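
A minimal sketch of the "check a precondition once per function" idea.
Everything here is a toy stand-in: ToyVisitor and the toy_* functions are
hypothetical, and toy_visit_is_output() mirrors the visit_is_output()
that does not exist yet:

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Toy stand-ins for illustration; these are NOT QEMU's real definitions. */
typedef enum { TOY_VISITOR_INPUT, TOY_VISITOR_OUTPUT } ToyVisitorType;

typedef struct ToyVisitor {
    ToyVisitorType type;
    uint32_t wire;  /* input: value to deliver; output: value recorded */
} ToyVisitor;

static bool toy_visit_is_input(const ToyVisitor *v)
{
    return v->type == TOY_VISITOR_INPUT;
}

/* Hypothetical counterpart of the proposed visit_is_output(). */
static bool toy_visit_is_output(const ToyVisitor *v)
{
    return v->type == TOY_VISITOR_OUTPUT;
}

static void toy_visit_type_uint32(ToyVisitor *v, uint32_t *obj)
{
    if (toy_visit_is_input(v)) {
        *obj = v->wire;   /* stores without looking at *obj first */
    } else {
        v->wire = *obj;   /* reads *obj, never stores to it */
    }
}

/* Setter-style caller: states its intent once, up front. */
static uint32_t toy_property_set(ToyVisitor *v)
{
    uint32_t val;  /* deliberately uninitialized: the input visitor fills it */

    assert(toy_visit_is_input(v));
    toy_visit_type_uint32(v, &val);
    return val;
}

/* Getter-style caller: the value is only ever read by the visitor. */
static uint32_t toy_property_get(ToyVisitor *v, uint32_t current)
{
    assert(toy_visit_is_output(v));
    toy_visit_type_uint32(v, &current);
    return v->wire;
}
```

The assertion sits next to the uninitialized variable, so the intent is
locally explicit without changing any method signature.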
The sane way to make output intent explicit is of course passing the
thing by value rather than by reference. To get that, we could generate
even more code. So, if the amount of code we currently generate isn't
disgusting enough, ...
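
To illustrate the by-value point, a minimal sketch with hypothetical
names (ByValOutput, byref_emit_uint32, byval_emit_uint32); it is not
generated QAPI code, only a contrast of the two signatures:

```c
#include <stdint.h>

/* Toy output sink for illustration; NOT a QEMU type. */
typedef struct ByValOutput {
    uint32_t last;   /* most recent value emitted */
    unsigned count;  /* number of values emitted */
} ByValOutput;

/*
 * By reference: the signature alone cannot tell a reader (or an
 * analyzer) whether *obj is read, written, or both.
 */
static void byref_emit_uint32(ByValOutput *out, uint32_t *obj)
{
    out->last = *obj;
    out->count++;
}

/*
 * By value: the callee cannot store to the caller's variable, so the
 * argument must be initialized and output intent is explicit.
 */
static void byval_emit_uint32(ByValOutput *out, uint32_t value)
{
    out->last = value;
    out->count++;
}
```

The cost is a second, output-only entry point per type, which is the
extra generated code mentioned above.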