qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Thomas Huth <thuth@redhat.com>
To: "Philippe Mathieu-Daudé" <philmd@redhat.com>,
	"Greg Kurz" <groug@kaod.org>,
	"Paolo Bonzini" <pbonzini@redhat.com>
Cc: "Markus Armbruster" <armbru@redhat.com>,
	"Daniel P. Berrangé" <berrange@redhat.com>,
	"Eduardo Habkost" <ehabkost@redhat.com>,
	qemu-devel@nongnu.org
Subject: Re: [PATCH 1/2] qdev: Separate implementations of qdev_get_machine() for user and system
Date: Mon, 19 Apr 2021 17:45:30 +0200	[thread overview]
Message-ID: <bccca420-77d6-c3c6-fd7a-aaa97dd8399c@redhat.com> (raw)
In-Reply-To: <1a68b426-47da-9fba-a34a-d024ac9ed148@redhat.com>

On 15/04/2021 21.07, Philippe Mathieu-Daudé wrote:
> On 4/15/21 6:56 PM, Greg Kurz wrote:
>> On Thu, 15 Apr 2021 18:45:45 +0200
>> Philippe Mathieu-Daudé <philmd@redhat.com> wrote:
>>
>>> On 4/15/21 3:30 PM, Greg Kurz wrote:
>>>> On Thu, 15 Apr 2021 14:39:55 +0200
>>>> Philippe Mathieu-Daudé <philmd@redhat.com> wrote:
>>>>
>>>>> On 4/9/21 6:03 PM, Greg Kurz wrote:
>>>>>> Despite its simple name and common usage of "getting a pointer to
>>>>>> the machine" in system-mode emulation, qdev_get_machine() has some
>>>>>> subtilities.
>>>>>>
>>>>>> First, it can be called when running user-mode emulation : this is
>>>>>> because user-mode partly relies on qdev to instantiate its CPU
>>>>>> model.
>>>>>>
>>>>>> Second, but not least, it has a side-effect : if it cannot find an
>>>>>> object at "/machine" in the QOM tree, it creates a dummy "container"
>>>>>> object and put it there. A simple check on the type returned by
>>>>>> qdev_get_machine() allows user-mode to run the common qdev code,
>>>>>> skipping the parts that only make sense for system-mode.
>>>>>>
>>>>>> This side-effect turns out to complicate the use of qdev_get_machine()
>>>>>> for the system-mode case though. Most notably, qdev_get_machine() must
>>>>>> not be called before the machine object is added to the QOM tree by
>>>>>> qemu_create_machine(), otherwise the existing dummy "container" object
>>>>>> would cause qemu_create_machine() to fail with something like :
>>>>>>
>>>>>> Unexpected error in object_property_try_add() at ../../qom/object.c:1223:
>>>>>> qemu-system-ppc64: attempt to add duplicate property 'machine' to
>>>>>>   object (type 'container')
>>>>>> Aborted (core dumped)
>>>>>>
>>>>>> This situation doesn't exist in the current code base, mostly because
>>>>>> of preventive fixing of some "latent bugs" in QEMU 4.0 (see 1a3ec8c1564
>>>>>> and e2fb3fbbf9c for details).
>>>>>>
>>>>>> A new kind of breakage was spotted very recently though :
>>>>>>
>>>>>> $ ./qemu-system-ppc64 -device power8_v2.0-spapr-cpu-core,help
>>>>>> /home/thuth/devel/qemu/include/hw/boards.h:24:
>>>>>>   MACHINE: Object 0x5635bd53af10 is not an instance of type machine
>>>>>> Aborted (core dumped)
>>>>>>
>>>>>> This comes from the change 3df261b6676b in QEMU 5.0. It unwillingly
>>>>>> added a new condition for qdev_get_machine() to be called too early,
>>>>>> breaking MACHINE(qdev_get_machine()) in generic cpu-core code this
>>>>>> time.
>>>>>>
>>>>>> In order to avoid further subtle breakages like this, change the
>>>>>> implentation of qdev_get_machine() to:
>>>>>> - keep the existing behaviour of creating the dummy "container"
>>>>>>    object for the user-mode case only ;
>>>>>> - abort() if the machine doesn't exist yet in the QOM tree for
>>>>>>    the system-mode case. This gives a precise hint to developpers
>>>>>>    that calling qdev_get_machine() too early is a programming bug.
>>>>>>
>>>>>> This is achieved with a new do_qdev_get_machine() function called
>>>>>> from qdev_get_machine(), with different implementations for system
>>>>>> and user mode.
>>>>>>
>>>>>> $ ./qemu-system-ppc64 -device power8_v2.0-spapr-cpu-core,help
>>>>>> qemu-system-ppc64: ../../hw/core/machine.c:1290:
>>>>>>   qdev_get_machine: Assertion `machine != NULL' failed.
>>>>>> Aborted (core dumped)
>>>>>>
>>>>>> Reported-by: Thomas Huth <thuth@redhat.com>
>>>>>> Signed-off-by: Greg Kurz <groug@kaod.org>
>>>>>> ---
>>>>>>   hw/core/machine.c        | 14 ++++++++++++++
>>>>>>   hw/core/qdev.c           |  2 +-
>>>>>>   include/hw/qdev-core.h   |  1 +
>>>>>>   stubs/meson.build        |  1 +
>>>>>>   stubs/qdev-get-machine.c | 11 +++++++++++
>>>>>>   5 files changed, 28 insertions(+), 1 deletion(-)
>>>>>>   create mode 100644 stubs/qdev-get-machine.c
>>>>> ...
>>>>>
>>>>>> diff --git a/stubs/meson.build b/stubs/meson.build
>>>>>> index be6f6d609e58..b99ee2b33e94 100644
>>>>>> --- a/stubs/meson.build
>>>>>> +++ b/stubs/meson.build
>>>>>> @@ -54,3 +54,4 @@ if have_system
>>>>>>   else
>>>>>>     stub_ss.add(files('qdev.c'))
>>>>>>   endif
>>>>>> +stub_ss.add(files('qdev-get-machine.c'))
>>>>>
>>>>> Adding this as a stub looks suspicious...
>>>>> Why not add it in to user_ss in hw/core/meson.build?
>>>>> Maybe name the new file hw/core/qdev-user.c?
>>>>>
>>>>
>>>> It turns out that this isn't specific to user-mode but rather
>>>> to any non-qemu-system-FOO binary built with qdev, e.g.
>>>> test-qdev-global-props :
>>>>
>>>> #0  do_qdev_get_machine () at ../../stubs/qdev-get-machine.c:10
>>>> #1  0x0000000100017938 in qdev_get_machine () at ../../hw/core/qdev.c:1134
>>>> #2  0x000000010001855c in device_set_realized (obj=0x100128b60, value=<optimized out>, errp=0x7fffffffd4e0) at ../../hw/core/qdev.c:745
>>>> #3  0x000000010001cc5c in property_set_bool (obj=0x100128b60, v=<optimized out>, name=<optimized out>, opaque=0x1000f33f0, errp=0x7fffffffd4e0) at ../../qom/object.c:2257
>>>> #4  0x0000000100020a9c in object_property_set (obj=0x100128b60, name=0x100093f78 "realized", v=0x100136d30, errp=0x1000e3af8 <error_fatal>) at ../../qom/object.c:1402
>>>> #5  0x000000010001c38c in object_property_set_qobject (obj=0x100128b60, name=0x100093f78 "realized", value=<optimized out>, errp=0x1000e3af8 <error_fatal>) at ../../qom/qom-qobject.c:28
>>>> #6  0x0000000100020e20 in object_property_set_bool (obj=0x100128b60, name=0x100093f78 "realized", value=<optimized out>, errp=0x1000e3af8 <error_fatal>) at ../../qom/object.c:1472
>>>> #7  0x000000010001612c in qdev_realize (dev=0x100128b60, bus=<optimized out>, errp=0x1000e3af8 <error_fatal>) at ../../hw/core/qdev.c:389
>>>> #8  0x000000010000fb10 in test_static_prop_subprocess () at /home/greg/Work/qemu/qemu-master/include/hw/qdev-core.h:17
>>>> #9  0x00007ffff7e95654 in g_test_run_suite_internal () from /lib64/libglib-2.0.so.0
>>>> #10 0x00007ffff7e954b8 in g_test_run_suite_internal () from /lib64/libglib-2.0.so.0
>>>> #11 0x00007ffff7e954b8 in g_test_run_suite_internal () from /lib64/libglib-2.0.so.0
>>>> #12 0x00007ffff7e954b8 in g_test_run_suite_internal () from /lib64/libglib-2.0.so.0
>>>> #13 0x00007ffff7e954b8 in g_test_run_suite_internal () from /lib64/libglib-2.0.so.0
>>>> #14 0x00007ffff7e959cc in g_test_run_suite () from /lib64/libglib-2.0.so.0
>>>> #15 0x00007ffff7e95a80 in g_test_run () from /lib64/libglib-2.0.so.0
>>>> #16 0x000000010000ecec in main (argc=<optimized out>, argv=<optimized out>) at ../../tests/unit/test-qdev-global-props.c:316
>>>>
>>>> Is there a meson thingy to handle this dependency ?
>>>
>>>    if not have_system
>>>      common_ss.add(files('qdev-machine-stubs.c'))
>>>    endif
>>>
>>> This is not pretty, but better than a generic stubs/qdev-get-machine.c
>>> IMO...
>>>
>>
>> Yeah it isn't that much different,
> 
> I'd expect symbols in stubs/ to be declared weak, while not with this
> approach, so we'd catch clash for incorrect configs. Maybe I'm wrong...

See https://gitlab.com/qemu-project/qemu/-/commit/3bc2f570ec9fc93 for the 
reason why symbols in stubs are *not* declared as weak anymore.

  Thomas



  parent reply	other threads:[~2021-04-19 15:47 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-04-09 16:03 [PATCH 0/2] cpu/core: Fix "help" of CPU core device types Greg Kurz
2021-04-09 16:03 ` [PATCH 1/2] qdev: Separate implementations of qdev_get_machine() for user and system Greg Kurz
2021-04-09 20:14   ` Eduardo Habkost
2021-04-10  6:33     ` Greg Kurz
2021-04-10  4:56   ` Thomas Huth
2021-04-10  8:59   ` Markus Armbruster
2021-04-15  8:26     ` Greg Kurz
2021-04-13 22:25   ` Eduardo Habkost
2021-04-15 10:53     ` Greg Kurz
2021-04-15 12:39   ` Philippe Mathieu-Daudé
2021-04-15 13:30     ` Greg Kurz
2021-04-15 16:45       ` Philippe Mathieu-Daudé
2021-04-15 16:56         ` Greg Kurz
2021-04-15 19:07           ` Philippe Mathieu-Daudé
2021-04-16  6:42             ` Greg Kurz
2021-04-19 15:45             ` Thomas Huth [this message]
2021-04-09 16:03 ` [PATCH 2/2] cpu/core: Fix "help" of CPU core device types Greg Kurz
2021-04-09 20:04   ` Eduardo Habkost
2021-04-10  4:53   ` Thomas Huth

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bccca420-77d6-c3c6-fd7a-aaa97dd8399c@redhat.com \
    --to=thuth@redhat.com \
    --cc=armbru@redhat.com \
    --cc=berrange@redhat.com \
    --cc=ehabkost@redhat.com \
    --cc=groug@kaod.org \
    --cc=pbonzini@redhat.com \
    --cc=philmd@redhat.com \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).